SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers

03/22/2018
by   Awais Khan, et al.
0

Future terabit networks are committed to dramatically improving big data motion between geographically dispersed HPC data centers.The scientific community takes advantage of the terabit networks such as DOE's ESnet and accelerates the trend to build a small world of collaboration between geospatial HPC data centers. It improves information and resource sharing for joint simulation and analysis between the HPC data centers. In this paper, we propose to build SCISPACE (Scientific Collaboration Workspace) for collaborative data centers. It provides a global view of information shared from multiple geo-distributed HPC data centers under a single workspace. SCISPACE supports native data-access to gain high-performance when data read or write is required in native data center namespace. It is accomplished by integrating a metadata export protocol. To optimize scientific collaborations across HPC data centers, SCISPACE implements search and discovery service. To evaluate, we configured two geo-distributed small-scale HPC data centers connected via high-speed Infiniband network, equipped with LustreFS. We show the feasibility of SCISPACE using real scientific datasets and applications. The evaluation results show average 36% performance boost when the proposed native-data access is employed in collaborations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2018

Experimental Verification and Analysis of Dynamic Loop Scheduling in Scientific Applications

Scientific applications are often irregular and characterized by large c...
research
05/26/2021

Towards Million-Server Network Simulations on Just a Laptop

The growing size of data center and HPC networks pose unprecedented requ...
research
12/30/2020

SDN helps Big Data to optimize access to data

This chapter introduces the state-of-the-art in the emerging area of com...
research
04/11/2022

Linking Scientific Instruments and HPC: Patterns, Technologies, Experiences

Powerful detectors at modern experimental facilities routinely collect d...
research
07/20/2020

Modernizing the HPC System Software Stack

Through the 1990s, HPC centers at national laboratories, universities, a...
research
08/02/2023

PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems

Data provenance, or data lineage, describes the life cycle of data. In s...
research
09/19/2022

Snowmass 2021 Computational Frontier CompF4 Topical Group Report: Storage and Processing Resource Access

Computing plays a significant role in all areas of high energy physics. ...

Please sign up or login with your details

Forgot password? Click here to reset