Towards Exascale Distributed Data Management

EURO-MEDITERRANEAN CENTRE FOR CLIMATE CHANGE (CMCC), AND UNIVERSITY OF SALENTO, ITALY, GIOVANNI.ALOISIO{at}UNISALENTO.IT
EURO-MEDITERRANEAN CENTRE FOR CLIMATE CHANGE (CMCC), AND UNIVERSITY OF SALENTO, ITALY, SANDRO.FIORE{at}UNISALENTO.IT

"Exascale eScience infrastructures" will face critical challenges from both computational and data perspectives. Increasingly complex and parallel scientific codes will produce huge amounts of data. The large volume of data, and the time needed to locate, access, analyze and visualize it, will greatly affect the productivity of scientists and researchers in several domains. Significant improvements in data management will increase research productivity in solving complex scientific problems. Next-generation eScience infrastructures will start from the assumption that exascale high-performance computing (HPC) applications (running on millions of cores) will generate data at a very high rate (terabytes/s). Hundreds of exabytes of data, distributed across several centers, are expected to be available by 2020 through heterogeneous storage resources for access, analysis, post-processing and other scientific activities.
Key Words: distributed data management, data replication, metadata management, data analysis, parallel I/O
This version was published on November 1, 2009.
International Journal of High Performance Computing Applications, Vol. 23, No. 4, 398-400 (2009)