dot
Detailansicht
Katalogkarte GBA
Katalogkarte ISBD
Suche präzisieren
Drucken
Download RIS
Hier klicken, um den Treffer aus der Auswahl zu entfernen
Titel A Virtual Remote Storage and Data Visualization on the Science Cloud
VerfasserIn Hidenobu Watanabe, Ken T. Murata
Konferenz EGU General Assembly 2013
Medientyp Artikel
Sprache Englisch
Digitales Dokument PDF
Erschienen In: GRA - Volume 15 (2013)
Datensatznummer 250084257
 
Zusammenfassung
Super-computer is now one of the most important tools for scientific research works. Computational power of super-computers is getting higher and higher. Along with the development of its computational power, data size yielded by super-computers is also getting larger. Typical data size by one job is already in TB order. For near future works, we need to prepare PB scale data analysis. Recently, big-data science is paid attention because of such large-scale data is popular. However, since data analysis environment to process such large scale data is not established in any science cloud or academic cloud, big-data science is not successful yet. One of the reasons of this difficulty is techniques to support researchers working on cloud systems, including middle-wares. For example, we save many data files with large-scale on cloud storages. However, reading these files take too long time. Assume that we have 10TB data files on a normal NAS storage. It takes more than 10 days to read all of the data files. This is because of the low speed disk I/O between hard-disk drive and controlling server. We have been developing a technique of a virtual storage system with high speed I/O using parallel storage system named Gfarm (Grid farmware). With combining parallel disk servers and making a virtual storage, we achieved as high as 7Gbps storage. What should be noted is that the 7Gbps is not network speed, but disk I/O speed. Since disk I/O speed of a local SSD storage is about 4Gbps, this suggests that our virtual storage provide us with higher I/O speed than local disks. After saving or reading data files, we next analyze data files. The virtual disk system should be synchronized these data analysis. In the present talk, we are going to present the sequential data processing (visualization) using the virtual high speed storage system.