Skip to main content
SHARE
Publication

Towards a Standard Process Management Infrastructure for Workflows Using Python

by Wael R Elwasif, Thomas J Naughton Iii, Matthew B Baker
Publication Type
Conference Paper
Book Title
Proc. Parallel and Distributed Computing, Applications and Technologies. PDCAT 2022, Lecture Notes in Computer Science.
Publication Date
Page Numbers
523 to 534
Publisher Location
Cham, Switzerland
Conference Name
The 23rd International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT’22)
Conference Location
Sendai, Japan
Conference Date
-

Orchestrating the execution of ensembles of processes lies at the core of scientific workflow engines on large scale parallel platforms. This is usually handled using platform-specific command line tools, with limited process management control and potential strain on system resources. The PMIx standard provides a uniform interface to system resources. The low level C implementation of PMIx has hampered its use in workflow engines, leading to the development of Python binding that has yet to gain traction. In this paper, we present our work to harden the PMIx Python client, demonstrating its usability using a prototype Python driver to orchestrate the execution of an ensemble of processes. We present experimental results using the prototype on the Summit supercomputer at Oak Ridge National Laboratory. This work lays the foundation for wider adoption of PMIx for workflow engines, and encourages wider support of more PMIx functionality in vendor provided system software stacks.