Skip to main content
SHARE
Publication

Did the GPU obfuscate the load imbalance in my MPI simulation?...

by David W Eberius, David Boehme, Olga Pearce
Publication Type
Conference Paper
Journal Name
Supercomputing
Book Title
2021 IEEE/ACM International Workshop on Hierarchical Parallelism for Exascale Computing (HiPar)
Publication Date
Page Numbers
20 to 29
Publisher Location
New Jersey, United States of America
Conference Name
HiPar21: 2nd International Workshop on Hierarchical Parallelism for Exascale Computing
Conference Location
St. Louis, Missouri, United States of America
Conference Sponsor
IEEE
Conference Date
-

The current proliferation of GPU-based HPC systems necessitates a method for assessing the performance of simulations on heterogeneous machines. The addition of GPUs to a system adds multiple hierarchical levels of parallelism to the node architecture. In this paper, we demonstrate that the traditional load imbalance metric is insufficient for capturing the load imbalance on GPU-based machines, since it treats the GPU as a monolithic entity and ignores the internal parallelism. We propose a new hierarchical metric that improves the correlation of measured performance and application workload by up to 20.61%. Using our metric for determining application load instead of the traditional metric as the input for the load balancing algorithm reduces the residual load imbalance by up to 4× in our application.