Jeffrey S Vetter
Section Head - Advanced Computing Systems Research
Contact: vetter@ornl.gov | 865.576.7115

All Publications

IRIS: A Performance-Portable Framework for Cross-Platform Heterogeneous Computing...
Clacc: OpenACC for C/C++ in Clang
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming
IRIS: Exploring Performance Scaling of the Intelligent Runtime System and its Dynamic Scheduling Policies
eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores
Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic Simulations
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code
Julia as a unifying end-to-end workflow language on the Frontier exascale system...
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types...
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation
Experience Migrating OpenCL to SYCL: A Case Study on Searches for Potential Off-Target Sites of Cas9 RNA-Guided Endonucleases...
Experience Deploying Graph Applications on GPUs with SYCL
On-Sensor Data Filtering using Neuromorphic Computing for High Energy Physics Experiments
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation
Encoding integers and rationals on neuromorphic computers using virtual neuron
A survey on processing-in-memory techniques: Advances and challenges...
A MultiGPU Performance-Portable Solution for Array Programming Based on Kokkos
Abisko: Deep codesign of an architecture for spiking neural networks using novel neuromorphic materials
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes
Understanding SYCL Portability for Pseudorandom Number Generation: a Case Study with Gene-Expression Connectivity Mapping
A 3D Implementation of Convolutional Neural Network for Fast Inference...

Key Links: ORCID | LinkedIn | GitHub | Personal Home Page | Google Scholar | Publications