Skip to main content
SHARE
Publication

Job Management and Task Bundling...

by Evan Berkowitz, Gustav R Jansen, Kenneth Mcelvain, Andre Walker-loud
Publication Type
Conference Paper
Journal Name
EPJ Web of Conferences
Publication Date
Page Number
09007
Volume
175
Conference Name
International Symposium on Lattice Field Theory (Lattice 2017)
Conference Location
Granada, Spain
Conference Sponsor
Universidad de Granada
Conference Date
-

High Performance Computing is often performed on scarce and shared computing resources. To ensure computers are used to their full capacity, administrators often incentivize large workloads that are not possible on smaller systems. Measurements in Lattice QCD frequently do not scale to machine-size workloads. By bundling tasks together we can create large jobs suitable for gigantic partitions. We discuss METAQ and mpi_jm, software developed to dynamically group computational tasks together, that can intelligently backfill to consume idle time without substantial changes to users’ current workflows or executables.