Evaluating Burst Buffer Placement in HPC Systems

Show authors

Publication Type

Conference Paper

Book Title

Proceedings of the 2019 IEEE International Conference on Cluster Computing

Publication Date

September, 2019

Page Numbers

1 to 11

Conference Name

IEEE International Conference on Cluster Computing (IEEE Cluster 2019)

Conference Location

Albuquerque, New Mexico, United States of America

Conference Sponsor

IEEE

Conference Date

Sep 23, 2019 - Sep 26, 2019

Abstract

Burst buffers (BBs) are increasingly exploited in contemporary supercomputers to bridge the performance gap between compute and storage systems. The design of BBs, particularly the placement of these devices and the underlying network topology, impacts both performance and cost. As the cost of other components such as memory and accelerators is increasing, it is becoming more important that HPC centers provision BBs tailored to their workloads.

This work contributes a provisioning system to provide accurate, multi-tenant simulations that model realistic application and storage workloads from HPC systems. The framework aids HPC centers in modeling their workloads against multiple network and BB configurations rapidly. In experiments with our framework, we provide a comparison of representative Oak Ridge Leadership Computing Facility (OLCF) I/O workloads against multiple BB designs. We analyze the impact of these designs on latency, I/O phase lengths, contention for network and storage devices, and choice of network topology.

Evaluating Burst Buffer Placement in HPC Systems

Abstract

Researchers

Organizations