| 2020 | Communication-Aware Hardware-Assisted MPI Overlap Engine. Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Kaushik Kandadi Suresh, Seyedeh Mahdieh Ghazimirsaeed, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda |
| 2020 | DGEMM Using Tensor Cores, and Its Accurate and Reproducible Versions. Daichi Mukunoki, Katsuhisa Ozaki, Takeshi Ogita, Toshiyuki Imamura |
| 2020 | Desynchronization and Wave Pattern Formation in MPI-Parallel and Hybrid Memory-Bound Programs. Ayesha Afzal, Georg Hager, Gerhard Wellein |
| 2020 | Embedding Algorithms for Quantum Annealers with Chimera and Pegasus Connection Topologies. Stefanie Zbinden, Andreas Bärtschi, Hristo N. Djidjev, Stephan J. Eidenbenz |
| 2020 | Enabling Execution of a Legacy CFD Mini Application on Accelerators Using OpenMP. Ioannis Nompelis, Gabriele Jost, Alice Koniges, Christopher S. Daley, David Eder, Christopher Stone |
| 2020 | FASTHash: FPGA-Based High Throughput Parallel Hash Table. Yang Yang, Sanmukh R. Kuppannagari, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna |
| 2020 | Footprint-Aware Power Capping for Hybrid Memory Based Systems. Eishi Arima, Toshihiro Hanawa, Carsten Trinitis, Martin Schulz |
| 2020 | High Performance Computing - 35th International Conference, ISC High Performance 2020, Frankfurt/Main, Germany, June 22-25, 2020, Proceedings Ponnuswamy Sadayappan, Bradford L. Chamberlain, Guido Juckeland, Hatem Ltaief |
| 2020 | HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow. Ammar Ahmad Awan, Arpan Jain, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda |
| 2020 | Load-Balancing Parallel Relational Algebra. Sidharth Kumar, Thomas Gilray |
| 2020 | Offsite Autotuning Approach - Performance Model Driven Autotuning Applied to Parallel Explicit ODE Methods. Johannes Seiferth, Matthias Korch, Thomas Rauber |
| 2020 | Opportunities for Cost Savings with In-Transit Visualization. James Kress, Matthew Larsen, Jong Choi, Mark Kim, Matthew Wolf, Norbert Podhorszki, Scott Klasky, Hank Childs, David Pugmire |
| 2020 | Pattern-Aware Staging for Hybrid Memory Systems. Eishi Arima, Martin Schulz |
| 2020 | Predicting Job Power Consumption Based on RJMS Submission Data in HPC Systems. Théo Saillant, Jean-Christophe Weill, Mathilde Mougeot |
| 2020 | Reinit Giorgis Georgakoudis, Luanzheng Guo, Ignacio Laguna |
| 2020 | Running a Pre-exascale, Geographically Distributed, Multi-cloud Scientific Simulation. Igor Sfiligoi, Frank Würthwein, Benedikt Riedel, David Schultz |
| 2020 | Scalable Hierarchical Aggregation and Reduction Protocol (SHARP) Richard L. Graham, Lion Levi, Devendar Bureddy, Gil Bloch, Gilad Shainer, David Cho, George Elias, Daniel Klein, Joshua Ladd, Ophir Maor, Ami Marelli, Valentin Petrov, Evyatar Romlet, Yong Qin, Ido Zemah |
| 2020 | Scaling Genomics Data Processing with Memory-Driven Computing to Accelerate Computational Biology. Matthias Becker, Umesh Worlikar, Shobhit Agrawal, Hartmut Schultze, Thomas Ulas, Sharad Singhal, Joachim L. Schultze |
| 2020 | Semi-automatic Assessment of I/O Behavior by Inspecting the Individual Client-Node Timelines - An Explorative Study on 10 Eugen Betke, Julian M. Kunkel |
| 2020 | Shared-Memory Parallel Probabilistic Graphical Modeling Optimization: Comparison of Threads, OpenMP, and Data-Parallel Primitives. Talita Perciano, Colleen Heinemann, David Camp, Brenton Lessley, E. Wes Bethel |
| 2020 | Simplifying Communication Overlap in OpenSHMEM Through Integrated User-Level Thread Scheduling. Md. Wasi-ur-Rahman, David Ozog, James Dinan |
| 2020 | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization. Noha Al-Harthi, Rabab Alomairy, Kadir Akbudak, Rui Chen, Hatem Ltaief, Hakan Bagci, David E. Keyes |
| 2020 | Sparse Linear Algebra on AMD and NVIDIA GPUs - The Race Is On. Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt |
| 2020 | TeaMPI - Replication-Based Resilience Without the (Performance) Pain. Philipp Samfass, Tobias Weinzierl, Benjamin Hazelwood, Michael Bader |
| 2020 | Time Series Mining at Petascale Performance. Amir Raoofy, Roman Karlstetter, Dai Yang, Carsten Trinitis, Martin Schulz |
| 2020 | Timemory: Modular Performance Analysis for HPC. Jonathan R. Madsen, Muaaz G. Awan, Hugo Brunie, Jack Deslippe, Rahulkumar Gayatri, Leonid Oliker, Yunsong Wang, Charlene Yang, Samuel Williams |
| 2020 | Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors. Christie L. Alappat, Johannes Hofmann, Georg Hager, Holger Fehske, Alan R. Bishop, Gerhard Wellein |
| 2020 | Using High-Level Synthesis to Implement the Matrix-Vector Multiplication on FPGA. Alessandro Marongiu, Paolo Palazzari |