Publication

DataFed: Towards Reproducible Research via Federated Data Management...

by Dale V Stansberry, Suhas Somnath, Jessica U Breet, Gregory L Shutt, Mallikarjun Shankar
Publication Type
Conference Paper
Book Title
2019 International Conference on Computational Science and Computational Intelligence (CSCI)
Publication Date
Page Numbers
1312 to 1317
Conference Name
THE 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI)
Conference Location
Las Vegas, Nevada, United States of America
Conference Sponsor
IEEE
Conference Date
-

The increasingly collaborative, globalized nature of scientific research, combined with the need to share data and the explosion in data volumes, creates an urgent need for a scientific data management system (SDMS). An SDMS presents a logical and holistic view of data that greatly simplifies and empowers data organization, curation, searching, sharing, and dissemination. We present DataFed, a lightweight, distributed SDMS that spans a federation of storage systems within a loosely coupled network of scientific facilities. Unlike existing SDMS offerings, DataFed uses high-performance, scalable user management and data transfer technologies that simplify its deployment, maintenance, and expansion. DataFed provides web-based and command-line interfaces for managing data and integrating with complex scientific workflows. DataFed represents a step towards reproducible scientific research by enabling reliable staging of the correct data in the desired environment.