Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications Conference Paper November, 2024
Performance Characterization of a Hierarchical MPI Implementations on Large-scale Distributed-memory Platforms Conference Paper September, 2009
Performance Analysis and Projections for Petascale Applications on Cray XT Series Systems Conference Paper May, 2009