Pedro Valero Lara Senior Computer Scientist Contact VALEROLARAP@ORNL.GOV All Publications JACC.shared: Leveraging HPC Metaprogramming and Performance Portability for Computations That Use Shared Memory GPUs Large language model evaluation for high‐performance computing software development ChatBLAS: The First AI-Generated and Portable BLAS Library Integrating ORNL’s HPC and Neutron Facilities with a Performance-Portable CPU/GPU Ecosystem JACC: Leveraging HPC Meta-Programming and Performance Portability with the Just-in-Time and LLVM-based Julia Language Clacc: OpenACC for C/C++ in Clang eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky) IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime... Julia as a unifying end-to-end workflow language on the Frontier exascale system Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation S4PST: Sustainability for Programming Systems and Tools Workshop Report A MultiGPU Performance-Portable Solution for Array Programming Based on Kokkos Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes Tiling Framework for Heterogeneous Computing of Matrix based Tiled Algorithms IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library... SparseLU, A Novel Algorithm and Math Library for Sparse LU Factorization LaRIS: Targeting Portability and Productivity for LAPACK Codes on Extreme Heterogeneous Systems by Using IRIS KokkACC: Enhancing Kokkos with OpenACC Pagination Current page 1 Page 2 Next page ›› Last page Last » Key Links Google Scholar ORCID LinkedIn GitHub DBLP