Skip to main content

Scaling SQL to the Supercomputer for Interactive Analysis of Simulation Data...

Publication Type
Conference Paper
Book Title
Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation
Publication Date
Page Numbers
327 to 339
Publisher Location
Cham, Switzerland
Conference Name
Smoky Mountains Computational Sciences and Engineering Conference (SMC)
Conference Location
Kingsport, Tennessee, United States of America
Conference Sponsor
Conference Date

AI and simulation workloads consume and generate large amounts of data that need to be searched, transformed and merged with other data. With the goal of treating data as a first-class citizen inside a traditionally compute-centric HPC environment, we explore how the use of accelerators and high-speed interconnects can speed up tasks which otherwise constitute bottlenecks in computational discovery workflows. BlazingSQL is SQL engine that runs natively on NVIDIA GPUs and supports internode communication for fast analytics on terabyte-scale tabular data sets. We show how a fast interconnect improves query performance if leveraged through the Unified Communication X (UCX) middleware. We envision that future computing platforms will integrate accelerated database query capabilities for immediate and interactive analysis of large simulation data.