| 2022 | 2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT. Wei Hu, Hong An, Zhuoqiang Guo, Qingcai Jiang, Xinming Qin, Junshi Chen, Weile Jia, Chao Yang, Zhaolong Luo, Jielan Li, Wentiao Wu, Guangming Tan, Dongning Jia, Qinglin Lu, Fangfang Liu, Min Tian, Fang Li, Yeqi Huang, Liyi Wang, Sha Liu, Jinlong Yang |
| 2022 | A GPU-Accelerated AMR Solver for Gravitational Wave Propagation. Milinda Fernando, David Neilsen, Eric W. Hirschmann, Yosef Zlochower, Hari Sundar, Omar Ghattas, George Biros |
| 2022 | A Taxonomy of Error Sources in HPC I/O Machine Learning Models. Mihailo Isakov, Mikaela Currier, Eliakin Del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy |
| 2022 | AD for an Array Language with Nested Parallelism. Robert Schenck, Ola Rønning, Troels Henriksen, Cosmin E. Oancea |
| 2022 | AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning. Xuncheng Zhao, Mingfan Li, Qian Xiao, Junshi Chen, Fei Wang, Li Shen, Meijia Zhao, Wenhao Wu, Hong An, Lixin He, Xiao Liang |
| 2022 | Accelerating Elliptic Curve Digital Signature Algorithms on GPUs. Zonghao Feng, Qipeng Xie, Qiong Luo, Yujie Chen, Haoxuan Li, Huizhong Li, Qiang Yan |
| 2022 | Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5. Sian Jin, Dingwen Tao, Houjun Tang, Sheng Di, Suren Byna, Zarija Lukic, Franck Cappello |
| 2022 | Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers. Ahmad Abdelfattah, Pieter Ghysels, Wajih Boukaram, Stanimire Tomov, Xiaoye Sherry Li, Jack J. Dongarra |
| 2022 | AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices. Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun |
| 2022 | Approximate Computing Through the Lens of Uncertainty Quantification. Konstantinos Parasyris, James Diffenderfer, Harshitha Menon, Ignacio Laguna, Jackson Vanover, Ryan Vogt, Daniel Osei-Kuffuor |
| 2022 | Blaze: Fast Graph Processing on Fast SSDs. Juno Kim, Steven Swanson |
| 2022 | Boosting Performance Optimization with Interactive Data Movement Visualization. Philipp Schaad, Tal Ben-Nun, Torsten Hoefler |
| 2022 | Building Blocks for Network-Accelerated Distributed File Systems. Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler |
| 2022 | CA3DMM: A New Algorithm Based on a Unified View of Parallel Matrix Multiplication. Hua Huang, Edmond Chow |
| 2022 | Canary: Fault-Tolerant FaaS for Stateful Time-Sensitive Applications. Moiz Arif, Kevin Assogba, M. Mustafa Rafique |
| 2022 | Charter: Identifying the Most-Critical Gate Operations in Quantum Circuits via Amplified Gate Reversibility. Tirthak Patel, Daniel Silver, Devesh Tiwari |
| 2022 | Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale. Hao Lu, Michael A. Matheson, Vladyslav Oles, J. Austin Ellis, Wayne Joubert, Feiyi Wang |
| 2022 | CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian |
| 2022 | Combining Hard and Soft Constraints in Quantum Constraint-Satisfaction Systems. Ellis Wilson, Frank Mueller, Scott Pakin |
| 2022 | DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts. Rohan Basu Roy, Tirthak Patel, Devesh Tiwari |
| 2022 | DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale. Reza Yazdani Aminabadi, Samyam Rajbhandari, Ammar Ahmad Awan, Cheng Li, Du Li, Elton Zheng, Olatunji Ruwase, Shaden Smith, Minjia Zhang, Jeff Rasley, Yuxiong He |
| 2022 | Deinsum: Practically I/O Optimal Multi-Linear Algebra. Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler |
| 2022 | Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets. Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello |
| 2022 | EL-Rec: Efficient Large-Scale Recommendation Model Training via Tensor-Train Embedding Table. Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding |
| 2022 | Efficient Quantized Sparse Matrix Operations on Tensor Cores. Shigang Li, Kazuki Osawa, Torsten Hoefler |
| 2022 | Exaflops Biomedical Knowledge Graph Analytics. Ramakrishnan Kannan, Piyush Sao, Hao Lu, Jakub Kurzak, Gundolf Schenk, Yongmei Shi, Seung-Hwan Lim, Sharat Israni, Vijay Thakkar, Guojing Cong, Robert M. Patton, Sergio E. Baranzini, Richard W. Vuduc, Thomas E. Potok |
| 2022 | Extreme Scale Earthquake Simulation with Uncertainty Quantification. Tsuyoshi Ichimura, Kohei Fujita, Ryota Kusakabe, Kentaro Koyama, Sota Murakami, Yuma Kikuchi, Takane Hori, Muneo Hori, Hikaru Inoue, Takafumi Nose, Takahiro Kawashima, Maddegedara Lalith |
| 2022 | Extreme-Scale Many-against-Many Protein Similarity Search. Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Muaaz G. Awan, Georgios A. Pavlopoulos, Ariful Azad, Nikos Kyrpides, Leonid Oliker, Katherine A. Yelick, Aydin Buluç |
| 2022 | Finding Inputs that Trigger Floating-Point Exceptions in GPUs via Bayesian Optimization. Ignacio Laguna, Ganesh Gopalakrishnan |
| 2022 | From Correctable Memory Errors to Uncorrectable Memory Errors: What Error Bits Tell. Cong Li, Yu Zhang, Jialei Wang, Hang Chen, Xian Liu, Tai Huang, Liang Peng, Shen Zhou, Lixin Wang, Shijian Ge |
| 2022 | GUFI: Fast, Secure File System Metadata Search for Both Privileged and Unprivileged Users. Dominic Manno, Jason Lee, Prajwal Challa, Qing Zheng, David Bonnie, Gary Grider, Bradley W. Settlemyer |
| 2022 | Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling. Pengcheng Li, Yixin Guo, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Xu Liu |
| 2022 | GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow. Dan Chen, Chuangyi Gui, Yi Zhang, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao |
| 2022 | HGL: Accelerating Heterogeneous GNN Training with Holistic Representation and Optimization. Yuntao Gui, Yidi Wu, Han Yang, Tatiana Jin, Boyang Li, Qihui Zhou, James Cheng, Fan Yu |
| 2022 | HammingMesh: A Network Topology for Large-Scale Deep Learning. Torsten Hoefler, Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott |
| 2022 | HyLo: A Hybrid Low-Rank Natural Gradient Descent Method. Baorun Mu, Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi |
| 2022 | Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction. Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle |
| 2022 | LabStor: A Modular and Extensible Platform for Developing High-Performance, Customized I/O Stacks in Userspace. Luke Logan, Jaime Cernuda Garcia, Jay F. Lofstead, Xian-He Sun, Anthony Kougkas |
| 2022 | Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer. Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li |
| 2022 | Lessons Learned on MPI+Threads Communication. Rohit Zambre, Aparna Chandramowlishwaran |
| 2022 | LightSeq2: Accelerated Training for Transformer-Based Models on GPUs. Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei Li |
| 2022 | Mapping Out the HPC Dependency Chaos. Farid Zakaria, Thomas R. W. Scogland, Todd Gamblin, Carlos Maltzahn |
| 2022 | Memory Optimizations in an Array Language. Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea |
| 2022 | MetaWBC: POSIX-Compliant Metadata Write-Back Caching for Distributed File Systems. Yingjin Qian, Wen Cheng, Lingfang Zeng, Marc-André Vef, Oleg Drokin, Andreas Dilger, Shuichi Ihara, Wusheng Zhang, Yang Wang, André Brinkmann |
| 2022 | Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs. Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello |
| 2022 | Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems. Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, Shivaram Venkataraman |
| 2022 | Optimization of Full-Core Reactor Simulations on Summit. Misun Min, Yu-Hsiang Lan, Paul F. Fischer, Elia Merzari, Stefan Kerkemeier, Malachi Phillips, Thilina Rathnayake, April Novak, Derek Gaston, Noel Chalmers, Tim Warburton |
| 2022 | Optimizing Random Access to Hierarchically-Compressed Data on GPU. Feng Zhang, Yihua Hu, Haipeng Ding, Zhiming Yao, Zhewei Wei, Xiao Zhang, Xiaoyong Du |
| 2022 | Out of Hypervisor (OoH): Efficient Dirty Page Tracking in Userspace Using Hardware Virtualization Features. Stella Bitchebe, Alain Tchana |
| 2022 | P-Massive: A Real-Time Search Engine for a Multi-Terabyte Mass Spectrometry Database. Narangerelt Batsoyol, Benjamin S. Pullman, Mingxun Wang, Nuno Bandeira, Steven Swanson |
| 2022 | Parla: A Python Orchestration System for Heterogeneous Architectures. Hochan Lee, William Ruys, Ian Henriksen, Arthur Michener Peters, Yineng Yan, Sean Stephens, Bozhi You, Henrique Fingler, Martin Burtscher, Milos Gligoric, Karl W. Schulz, Keshav Pingali, Christopher J. Rossbach, Mattan Erez, George Biros |
| 2022 | PolarFly: A Cost-Effective and Flexible Low-Diameter Topology. Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini |
| 2022 | Positive-Phase Temperature Scaling for Quantum-Assisted Boltzmann Machine Training. Jose P. Pinilla, Steven J. E. Wilton |
| 2022 | Predicting Reuse Interval for Optimized Web Caching: An LSTM-Based Machine Learning Approach. Pengcheng Li, Yixin Guo, Yongbin Gu |
| 2022 | ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations. Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler |
| 2022 | Productive Performance Engineering for Weather and Climate Modeling with Python. Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler |
| 2022 | Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-In-Cell Simulations on Exascale-Class Supercomputers. Luca Fedeli, Axel Huebl, France Boillod-Cerneux, Thomas Clark, Kevin Gott, Conrad Hillairet, Stephan Jaure, Adrien Leblanc, Rémi Lehe, Andrew Myers, Christelle Piechurski, Mitsuhisa Sato, Neïl Zaïm, Weiqun Zhang, Jean-Luc Vay, Henri Vincenti |
| 2022 | QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services. Kaihua Fu, Jiuchen Shi, Quan Chen, Ningxin Zheng, Wei Zhang, Deze Zeng, Minyi Guo |
| 2022 | ReSemble: Reinforced Ensemble Framework for Data Prefetching. Pengmiao Zhang, Rajgopal Kannan, Ajitesh Srivastava, Anant V. Nori, Viktor K. Prasanna |
| 2022 | Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications. Qinglei Cao, Sameh Abdulah, Rabab Alomairy, Yu Pei, Pratik Nag, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun |
| 2022 | SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, November 13-18, 2022 Felix Wolf, Sameer Shende, Candace Culhane, Sadaf R. Alam, Heike Jagode |
| 2022 | SERVIZ: A Shared In Situ Visualization Service. Srinivasan Ramesh, Hank Childs, Allen D. Malony |
| 2022 | SFS: Smart OS Scheduling for Serverless Functions. YuQi Fu, Li Liu, Haoliang Wang, Yue Cheng, Songqing Chen |
| 2022 | SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Federated Learning. Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali Jannesari |
| 2022 | STMatch: Accelerating Graph Pattern Matching on GPU with Stack-Based Loop Optimizations. Yihua Wei, Peng Jiang |
| 2022 | STRONGHOLD: Fast and Affordable Billion-Scale Deep Learning Model Training. Xiaoyang Sun, Wei Wang, Shenghao Qiu, Renyu Yang, Songfang Huang, Jie Xu, Zheng Wang |
| 2022 | Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation. William S. Moses, Sri Hari Krishna Narayanan, Ludger Paehler, Valentin Churavy, Michel Schanen, Jan Hückelheim, Johannes Doerfert, Paul D. Hovland |
| 2022 | Scalable Deep Learning-Based Microarchitecture Simulation on GPUs. Santosh Pandey, Lingda Li, Thomas Flynn, Adolfy Hoisie, Hang Liu |
| 2022 | Scalable Distributed High-Order Stencil Computations. Mathias Jacquelin, Mauricio Araya-Polo, Jie Meng |
| 2022 | Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way. Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens |
| 2022 | Scalable Linear Time Dense Direct Solver for 3-D Problems without Trailing Sub-Matrix Dependencies. Qianxiang Ma, Sameer Deshmukh, Rio Yokota |
| 2022 | Scaling Correlated Fragment Molecular Orbital Calculations on Summit. Giuseppe M. J. Barca, Calum Snowdon, Jorge L. Galvez Vallejo, Fazeleh S. Kazemian, Alistair P. Rendell, Mark S. Gordon |
| 2022 | Scaling Graph 500 SSSP to 140 Trillion Edges with over 40 Million Cores. Yuanwei Wang, Huanqi Cao, Zixuan Ma, Wanwang Yin, Wenguang Chen |
| 2022 | SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System. Qi Chen, Shaonan Ma, Kang Chen, Teng Ma, Xin Liu, Dexun Chen, Yongwei Wu, Zuoning Chen |
| 2022 | Solving Linear Systems on a GPU with Hierarchically Off-Diagonal Low-Rank Approximations. Chao Chen, Per-Gunnar Martinsson |
| 2022 | SpDISTAL: Compiling Distributed Sparse Tensor Computations. Rohan Yadav, Alex Aiken, Fredrik Kjolstad |
| 2022 | Study of Workload Interference with Intelligent Routing on Dragonfly. Yao Kang, Xin Wang, Zhiling Lan |
| 2022 | Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization. Olivier Beaumont, Philippe Duchon, Lionel Eyraud-Dubois, Julien Langou, Mathieu Vérité |
| 2022 | TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models. Paul Caheny, Lluc Alvarez, Marc Casas, Miquel Moretó |
| 2022 | Towards Scalable Resource Management for Supercomputers. Yiqin Dai, Yong Dong, Kai Lu, Ruibo Wang, Wei Zhang, Juan Chen, Mingtian Shao, Zheng Wang |
| 2022 | UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai |
| 2022 | Using Answer Set Programming for HPC Dependency Solving. Todd Gamblin, Massimiliano Culpo, Gregory Becker, Sergei Shudler |
| 2022 | Using Unused: Non-Invasive Dynamic FaaS Infrastructure with HPC-Whisk. Bartlomiej Przybylski, Maciej Pawlik, Pawel Zuk, Bartlomiej Lagosz, Maciej Malawski, Krzysztof Rzadca |
| 2022 | VSGM: View-Based GPU-Accelerated Subgraph Matching on Large Graphs. Guanxian Jiang, Qihui Zhou, Tatiana Jin, Boyang Li, Yunjian Zhao, Yichao Li, James Cheng |
| 2022 | Vectorizing Sparse Matrix Computations with Partially-Strided Codelets. Kazem Cheshmi, Zachary Cetinic, Maryam Mehri Dehnavi |
| 2022 | W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs. Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, Guangming Tan |
| 2022 | WholeGraph: A Fast Graph Neural Network Training Framework with Multi-GPU Distributed Shared Memory Architecture. Dongxu Yang, Junhong Liu, Jiaxing Qi, Junjie Lai |
| 2022 | vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms. Menghan Jia, Yiming Zhang, Xinbiao Gan, Dongsheng Li, Erci Xu, Ruibo Wang, Kai Lu |