Data Management

Data is an essential part of modern computational science that allows researchers to discover and answer new scientific questions. The successful collection, management, security, and storage of petabytes of data — generated by ongoing research and the work of partner programs such as Atmospheric Radiation Measurement, Air Force Weather, and NOAA — is enabled by division-wide efforts to improve the data lifecycle for users and collaborators. These efforts include:
The explosion of data calls for the capability to train and deploy AI and machine learning models at scale to support advanced data analysis and accelerate scientific discoveries. By partnering with both computational scientists and experimentalists, NCCS offers expertise in extreme-scale distributed training of AI models; deploying, evaluating, and improving AI frameworks and methods; and bridging high-performance computing with experimental facilities at the edge.
At the heart of NCCS data projects is the understanding that collaboration, data sharing, and accessibility are critical to scientific progress.