ISC C

19 papers

YearTitle / Authors
2022"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools.
Pouya Kousha, Arpan Jain, Ayyappa Kolli, Prasanna Sainath, Hari Subramoni, Aamir Shafi, Dhabaleswar K. Panda
2022A Motivating Case Study on Code Variant Selection by Reinforcement Learning.
Oliver Hacker, Matthias Korch, Johannes Seiferth
2022A Subset of the CERN Virtual Machine File System: Fast Delivering of Complex Software Stacks for Supercomputing Resources.
Alexandre F. Boyer, Christophe Haen, Federico Stagni, David R. C. Hill
2022Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters.
Qinghua Zhou, Pouya Kousha, Quentin Anthony, Kawthar Shafie Khorassani, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
2022Accelerating Simulated Quantum Annealing with GPU and Tensor Cores.
Yi-Hua Chung, Cheng-Jhih Shih, Shih-Hao Hung
2022Comparative Evaluation of Call Graph Generation by Profiling Tools.
Onur Cankur, Abhinav Bhatele
2022Dynamic Task Fusion for a Block-Structured Finite Volume Solver over a Dynamically Adaptive Mesh with Local Time Stepping.
Baojiu Li, Holger Schulz, Tobias Weinzierl, Han Zhang
2022Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU.
Peter Munch, Karl Ljungkvist, Martin Kronbichler
2022High Performance Computing - 37th International Conference, ISC High Performance 2022, Hamburg, Germany, May 29 - June 2, 2022, Proceedings
Ana Lucia Varbanescu, Abhinav Bhatele, Piotr Luszczek, Marc Baboulin
2022Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters.
Arpan Jain, Aamir Shafi, Quentin Anthony, Pouya Kousha, Hari Subramoni, Dhabaleswar K. Panda
2022Hybrid Parallel ILU Preconditioner in Linear Solver Library GaspiLS.
Raju Ram, Daniel Grünewald, Nicolas R. Gauger
2022LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads.
Marjan Fariborz, Mahyar Samani, Pouya Fotouhi, Roberto Proietti, Il-Min Yi, Venkatesh Akella, Jason Lowe-Power, Samuel Palermo, S. J. Ben Yoo
2022MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs.
Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony
2022NVIDIA's Quantum InfiniBand Network Congestion Control Technology and Its Impact on Application Performance.
Yuval Shpigelman, Gilad Shainer, Richard L. Graham, Yong Qin, Gerardo Cisneros-Stoianowski, Craig B. Stunkel
2022Rapid Execution Time Estimation for Heterogeneous Memory Systems Through Differential Tracing.
Nicolas Denoyelle, Swann Perarnau, Kamil Iskra, Balazs Gerofi
2022Remote OpenMP Offloading.
Atmn Patel, Johannes Doerfert
2022SU3_Bench on a Programmable Integrated Unified Memory Architecture (PIUMA) and How that Differs from Standard NUMA CPUs.
Jesmin Jahan Tithi, Fabio Checconi, Douglas Doerfler, Fabrizio Petrini
2022Understanding Distributed Deep Learning Performance by Correlating HPC and Machine Learning Measurements.
Ana Luisa Veroneze Solórzano, Lucas Mello Schnorr
2022m-Cubes: An Efficient and Portable Implementation of Multi-dimensional Integration for GPUs.
Ioannis Sakiotis, Kamesh Arumugam, Marc F. Paterno, Desh Ranjan, Balsa Terzic, Mohammad Zubair