Skip to main content
SHARE
Publication

WIREFRAME: Supporting Data-dependent Parallelism through Dependency Graph Execution in GPUs...

by Amirali Abdolrashidi, Devashree Tripathy, Mehmet E Belviranli, Laxmi Bhuyan, Daniel Wong
Publication Type
Conference Paper
Book Title
2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)
Publication Date
Conference Name
The 50th Annual IEEE/ACM International Symposium on Microarchitecture
Conference Location
Boston, Massachusetts, United States of America
Conference Sponsor
IEEE/ACM
Conference Date
-

GPUs lack fundamental support for data-dependent parallelism and synchronization. While CUDA Dynamic Parallelism signals progress in this direction, many limitations and challenges still re-main. This paper introducesWireframe, a hardware-software solution that enables generalized support for data-dependent parallelism and synchronization. Wireframe enables applications to naturally express execution dependencies across different thread blocks through a dependency graph abstraction at run-time, which is sent to the GPU hardware at kernel launch. At run-time, the hardware enforces the dependencies specified in the dependency graph through a dependency-aware thread block scheduler. Overall, Wireframe is able to improve total execution time up to 65.20% with an average of 45.07%.