| 2021 | A Compressed, Divide and Conquer Algorithm for Scalable Distributed Matrix-Matrix Multiplication. Majid Rasouli, Robert M. Kirby, Hari Sundar |
| 2021 | A Deep Reinforcement Learning Method for Solving Task Mapping Problems with Dynamic Traffic on Parallel Systems. Yucheng Wang, Jerry Chou, I-Hsin Chung |
| 2021 | An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers. Awais Khan, Hyogi Sim, Sudharshan S. Vazhkudai, Ali Raza Butt, Youngjae Kim |
| 2021 | CSPACER: A Reduced API Set Runtime for the Space Consistency Model. Khaled Z. Ibrahim |
| 2021 | Conjugate Gradient Solvers with High Accuracy and Bit-wise Reproducibility between CPU and GPU using Ozaki scheme. Daichi Mukunoki, Katsuhisa Ozaki, Takeshi Ogita, Roman Iakymchuk |
| 2021 | Efficient Contour Integral-based Eigenvalue Computation Using an Iterative Linear Solver with Shift-Invert Preconditioning. Yasunori Futamura, Tetsuya Sakurai |
| 2021 | Efficient Implementation of a Dimensionality Reduction Method Using a Complex Moment-Based Subspace. Takahiro Yano, Yasunori Futamura, Akira Imakura, Tetsuya Sakurai |
| 2021 | GPU Acceleration of Multigrid Preconditioned Conjugate Gradient Solver on Block-Structured Cartesian Grid. Naoyuki Onodera, Yasuhiro Idomura, Yuta Hasegawa, Susumu Yamashita, Takashi Shimokawabe, Takayuki Aoki |
| 2021 | GPU Optimizations for Atmospheric Chemical Kinetics. Theodoros Christoudias, Timo Kirfel, Astrid Kerkweg, Domenico Taraborrelli, Georges-Emmanuel Moulard, Erwan Raffin, Victor Azizi, Gijs van den Oord, Ben van Werkhoven |
| 2021 | HPC Asia 2021: The International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Republic of Korea, January 20-21, 2021. Soonwook Hwang, Heon Young Yeom |
| 2021 | HPC LINPACK Parameter Optimization on Homo-/Heterogeneous System of ARM Neoverse N1SDP. Je-Seok Ham, Yong Cheol Peter Cho, Ju-Yeob Kim, Chun-Gi Lyuh, Jin-Kyu Kim, Jinho Han, Youngsu Kwon |
| 2021 | HybridHadoop: CPU-GPU Hybrid Scheduling in Hadoop. Chanyoung Oh, Hyeonjin Jung, Saehanseul Yi, Illo Yoon, Youngmin Yi |
| 2021 | Performance Evaluation of OpenCL-Enabled Inter-FPGA Optical Link Communication Framework CIRCUS and SMI. Ryuta Kashino, Ryohei Kobayashi, Norihisa Fujita, Taisuke Boku |
| 2021 | Performance Modeling of HPC Applications on Overcommitted Systems. Shohei Minami, Toshio Endo, Akihiro Nomura |
| 2021 | SeisSol on Distributed Multi-GPU Systems: CUDA Code Generation for the Modal Discontinuous Galerkin Method. Ravil Dorozhinskii, Michael Bader |
| 2021 | Spectral Element Simulations on the NEC SX-Aurora TSUBASA. Niclas Jansson |
| 2021 | Toward Data-Adaptable TinyML using Model Partial Replacement for Resource Frugal Edge Device. Jisu Kwon, Daejin Park |
| 2021 | neoSYCL: a SYCL implementation for SX-Aurora TSUBASA. Yinan Ke, Mulya Agung, Hiroyuki Takizawa |