WfBench: Automated Generation of Scientific Workflow Benchmarks

Publication Type

Conference Paper

Book Title

2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)

Publication Date

November, 2022

Page Numbers

100 to 111

Publisher Location

New Jersey, United States of America

Conference Name

13th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)

Conference Location

Dallas, Texas, United States of America

Conference Sponsor

IEEE

Conference Date

Nov 14, 2022 - Nov 14, 2022

View DOI Listing

Abstract

The prevalence of scientific workflows with high computational demands calls for their execution on various distributed computing platforms, including large-scale leadership-class high-performance computing (HPC) clusters. To handle the deployment, monitoring, and optimization of workflow executions, many workflow systems have been developed over the past decade. There is a need for workflow benchmarks that can be used to evaluate the performance of workflow systems on current and future software stacks and hardware platforms.

We present a generator of realistic workflow benchmark specifications that can be translated into benchmark code to be executed with current workflow systems. Our approach generates workflow tasks with arbitrary performance characteristics (CPU, memory, and I/O usage) and with realistic task dependency structures based on those seen in production workflows. We present experimental results that show that our approach generates benchmarks that are representative of production workflows, and conduct a case study to demonstrate the use and usefulness of our generated benchmarks to evaluate the performance of workflow systems under different configuration scenarios.

WfBench: Automated Generation of Scientific Workflow Benchmarks

Abstract

Researchers

Organizations