![Hier klicken, um den Treffer aus der Auswahl zu entfernen](images/unchecked.gif) |
Titel |
Accelerating climate simulation analytics via multilevel aggregation and synthesis |
VerfasserIn |
Valentine Anantharaj, Krishnaraj Ravindran, Raghul Gunasekaran, Sudharshan Vazhkudai, Ali Butt |
Konferenz |
EGU General Assembly 2015
|
Medientyp |
Artikel
|
Sprache |
Englisch
|
Digitales Dokument |
PDF |
Erschienen |
In: GRA - Volume 17 (2015) |
Datensatznummer |
250110605
|
Publikation (Nr.) |
EGU/EGU2015-10622.pdf |
|
|
|
Zusammenfassung |
A typical set of ultra high resolution (0.25 deg) climate simulation experiments produce over
50,000 files, ranging in sizes from 101 MB to 102 GB each – for a total volume of nearly 1
PB of data. The execution of the experiments will require over 100 Million CPU
hours on the Titan supercomputer at the Oak Ridge Leadership Computing Facility
(OLCF). The output from the simulations must then be archived, analyzed, distributed
to the project partners in a timely manner. Meeting this challenge would require
efficient movement of the data, staging the simulation output to a large and fast
file system that provides high volume access to other computational systems used
to analyze the data and synthesize results. But data movement is one of the most
expensive and time consuming steps in the scientific workflow. It is be expedient to
complete the diagnostics and analytics before the files are archived for long term
storage. Nevertheless, it is often necessary to fetch the files from archive for further
analysis.
We are implementing a solution to query, extract, index and summarize key statistical
information from the individual CF-compliant netCDF files that are then stored for
ready-access in a database. The contents of the database can be related back to the archived
files from which they were extracted. The statistical information can be quickly aggregated to
provide meaningful statistical summaries that could then be related to observations and/or
other simulation results for synthesis and further inference. The scientific workflow at OLCF,
augmented by expedited analytics capabilities, will allow the users of our systems to
shorten the time required to derive meaningful and relevant science results. We
will illustrate some of the timesaving benefits via a few typical use cases, based
on recent large-scale simulation experiments using the Community Earth System
Model (CESM) and the DOE Accelerated Climate Model for Energy (ACME). |
|
|
|
|
|