| 2024 | A Tree-Approach Pauli Decomposition Algorithm with Application to Quantum Computing. Océane Koska, Marc Baboulin, Arnaud Gazda |
| 2024 | Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters. Qinghua Zhou, Bharath Ramesh, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda |
| 2024 | Asynchronous Distributed Actor-Based Approach to Jaccard Similarity for Genome Comparisons. Youssef Elmougy, Akihiro Hayashi, Vivek Sarkar |
| 2024 | Calibration and Performance Evaluation of a Superconducting Quantum Processor in an HPC Center. Xiaolong Deng, Stefan Pogorzalek, Florian Vigneau, Ping Yang, Martin Schulz, Laura Brandon Schulz |
| 2024 | Configurable Algorithms for All-to-All Collectives. Ke Fan, Steve Petruzza, Thomas Gilray, Sidharth Kumar |
| 2024 | EcoFreq: Compute with Cheaper, Cleaner Energy via Carbon-Aware Power Scaling. Oleksiy M. Kozlov, Alexandros Stamatakis |
| 2024 | Evaluation of the Classical Hardware Requirements for Large-Scale Quantum Computations. Daan Camps, Ermal Rrapaj, Katherine Klymko, Brian Austin, Nicholas J. Wright |
| 2024 | GPU-Accelerated Vecchia Approximations of Gaussian Processes for Geospatial Data using Batched Matrix Computations. Qilong Pan, Sameh Abdulah, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun |
| 2024 | HPC-Coder: Modeling Parallel Programs using Large Language Models. Daniel Nichols, Aniruddha Marathe, Harshitha Menon, Todd Gamblin, Abhinav Bhatele |
| 2024 | Hierarchical Multigrid Ansatz for Variational Quantum Algorithms. Christo Meriwether Keller, Stephan J. Eidenbenz, Andreas Bärtschi, Daniel O'Malley, John Golden, Satyajayant Misra |
| 2024 | ISC High Performance 2024 Research Paper Proceedings (39th International Conference), Hamburg, Germany, May 12-16, 2024. |
| 2024 | Multi-GPU Processing of Unstructured Data for Machine Learning. Joel Ratsaby, Alexander Timashkov |
| 2024 | Multithreaded Parallelism for Heterogeneous Clusters of QPUs. Philipp Seitz, Manuel Geiger, Christian B. Mendl |
| 2024 | Optimizing Application Performance with BlueField: Accelerating Large-Message Blocking and Nonblocking Collective Operations. Richard L. Graham, George Bosilca, Yong Qin, Bradley W. Settlemyer, Gilad Shainer, Craig B. Stunkel, Geoffroy Vallée, Brody Williams, Gerardo Cisneros-Stoianowski, Sebastian T. Ohlmann, Markus Rampp |
| 2024 | Optimizing Distributed Training on Frontier for Large Language Models. Sajal Dash, Isaac Lyngaas, Junqi Yin, Xiao Wang, Romain Egele, J. Austin Ellis, Matthias Maiterth, Guojing Cong, Feiyi Wang, Prasanna Balaprakash |
| 2024 | Optimizing Metadata Exchange: Leveraging DAOS for ADIOS Metadata I/O. Ranjan Sarpangala Venkatesh, Greg Eisenhauer, Norbert Podhorszki, Dmitry Ganyushin, Scott Klasky, Ada Gavrilovska |
| 2024 | Porting HPC Applications to AMD Instinct™ MI300A using Unified Memory and OpenMP®. Suyash Tandon, Leopold Grinberg, Gheorghe-Teodor Bercea, Carlo Bertolli, Mark Olesen, Simone Bnà, Nicholas Malaya |
| 2024 | Power Consumption Trends in Supercomputers: A Study of NERSC's Cori and Perlmutter Machines. Ermal Rrapaj, Sridutt Bhalachandra, Zhengji Zhao, Brian Austin, Hai Ah Nam, Nicholas J. Wright |
| 2024 | Programming Model Extensions for General-Purpose Processing-In-Memory. Hyesun Hong, Lukas Sommer, Bongjun Kim, Mikhail Kashkarov, Kumudha Narasimhan, Ilya Veselov, Mehdi Goli, Jaeyeon Kim, Ruymán Reyes Castro, Hanwoong Jung |
| 2024 | ROCm-Aware Leader-based Designs for MPI Neighbourhood Collectives. Yiltan Hassan Temuçin, Mahdieh Gazimirsaeed, Ryan E. Grant, Ahmad Afsahi |
| 2024 | Solving Millions of Eigenvectors in Large-Scale Quantum-Many-Body-Theory Computations. Alexey Tal, Martijn Marsman, Georg Kresse, Anton Anders, Samuel Rodríguez, Kyungjoo Kim, Alexander Kalinkin, Alexey Romanenko, Matthias Noack, Patrick Atkinson, Stefan Maintz |
| 2024 | TinyProf: Towards Continuous Performance Introspection through Scalable Parallel I/O. Ke Fan, Suraj P. Kesavan, Steve Petruzza, Sidharth Kumar |
| 2024 | What is Quantum Parallelism, Anyhow? Stefano Markidis |
| 2024 | Workload Scheduling on Heterogeneous Devices. Harsh Khetawat, Frank Mueller |
| 2024 | iPuma: High-Performance Sequence Alignment on the Graphcore IPU. Max Xiaohang Zhao, Luk Burchard, Daniel Thilo Schroeder, Johannes Langguth, Xing Cai |