A Framework for Batched and GPU-Resident Factorization Algorithms Applied to Block Householder Transformations (Book Chapter, June 2015)
Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments (Book Chapter, June 2015)
COMPASS: A Framework for Automated Performance Modeling and Prediction... (Conference Paper, June 2015)
Automated Characterization of Parallel Application Communication Patterns... (Conference Paper, June 2015)
A Survey of Software Techniques for Using Non-Volatile Memories for Storage and Main Memory Systems... (Journal, May 2015)
Opportunities for Nonvolatile Memory Systems in Extreme-Scale High Performance Computing... (Journal, March 2015)
An OpenACC-Based Unified Programming Model for Multi-accelerator Systems (Conference Paper, February 2015)