Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale

Show authors

Publication Type

Conference Paper

Book Title

SC22: International Conference for High Performance Computing, Networking, Storage and Analysis

Publication Date

November, 2022

Page Numbers

1 to 15

Publisher Location

New Jersey, United States of America

Conference Name

International Conference for High Performance Computing, Networking, Storage and Analysis (SC22)

Conference Location

Dallas TX, Texas, United States of America

Conference Sponsor

OLCF

Conference Date

Nov 13, 2022 - Nov 18, 2022

View DOI Listing

Abstract

The rise of machine learning (ML) applications and their use of mixed precision to perform interesting science are driving forces behind AI for science on HPC. The convergence of ML and HPC with mixed precision offers the possibility of transformational changes in computational science. The HPL-AI benchmark is designed to measure the performance of mixed precision arithmetic as opposed to the HPL benchmark which measures double precision performance. Pushing the limits of systems at extreme scale is nontrivial -little public literature explores optimization of mixed precision computations at this scale. In this work, we demonstrate how to scale up the HPL-AI benchmark on the pre-exascale Summit and exascale Frontier systems at the Oak Ridge Leadership Computing Facility (OLCF) with a cross-platform design. We present the implementation, performance results, and a guideline of optimization strategies employed for delivering portable performance on both AMD and NVIDIA GPUs at extreme scale.

Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale

Abstract

Researchers

Organizations