SC A

88 papers

YearTitle / Authors
20222.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT.
Wei Hu, Hong An, Zhuoqiang Guo, Qingcai Jiang, Xinming Qin, Junshi Chen, Weile Jia, Chao Yang, Zhaolong Luo, Jielan Li, Wentiao Wu, Guangming Tan, Dongning Jia, Qinglin Lu, Fangfang Liu, Min Tian, Fang Li, Yeqi Huang, Liyi Wang, Sha Liu, Jinlong Yang
2022A GPU-Accelerated AMR Solver for Gravitational Wave Propagation.
Milinda Fernando, David Neilsen, Eric W. Hirschmann, Yosef Zlochower, Hari Sundar, Omar Ghattas, George Biros
2022A Taxonomy of Error Sources in HPC I/O Machine Learning Models.
Mihailo Isakov, Mikaela Currier, Eliakin Del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy
2022AD for an Array Language with Nested Parallelism.
Robert Schenck, Ola Rønning, Troels Henriksen, Cosmin E. Oancea
2022AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning.
Xuncheng Zhao, Mingfan Li, Qian Xiao, Junshi Chen, Fei Wang, Li Shen, Meijia Zhao, Wenhao Wu, Hong An, Lixin He, Xiao Liang
2022Accelerating Elliptic Curve Digital Signature Algorithms on GPUs.
Zonghao Feng, Qipeng Xie, Qiong Luo, Yujie Chen, Haoxuan Li, Huizhong Li, Qiang Yan
2022Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5.
Sian Jin, Dingwen Tao, Houjun Tang, Sheng Di, Suren Byna, Zarija Lukic, Franck Cappello
2022Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers.
Ahmad Abdelfattah, Pieter Ghysels, Wajih Boukaram, Stanimire Tomov, Xiaoye Sherry Li, Jack J. Dongarra
2022AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices.
Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun
2022Approximate Computing Through the Lens of Uncertainty Quantification.
Konstantinos Parasyris, James Diffenderfer, Harshitha Menon, Ignacio Laguna, Jackson Vanover, Ryan Vogt, Daniel Osei-Kuffuor
2022Blaze: Fast Graph Processing on Fast SSDs.
Juno Kim, Steven Swanson
2022Boosting Performance Optimization with Interactive Data Movement Visualization.
Philipp Schaad, Tal Ben-Nun, Torsten Hoefler
2022Building Blocks for Network-Accelerated Distributed File Systems.
Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler
2022CA3DMM: A New Algorithm Based on a Unified View of Parallel Matrix Multiplication.
Hua Huang, Edmond Chow
2022Canary: Fault-Tolerant FaaS for Stateful Time-Sensitive Applications.
Moiz Arif, Kevin Assogba, M. Mustafa Rafique
2022Charter: Identifying the Most-Critical Gate Operations in Quantum Circuits via Amplified Gate Reversibility.
Tirthak Patel, Daniel Silver, Devesh Tiwari
2022Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale.
Hao Lu, Michael A. Matheson, Vladyslav Oles, J. Austin Ellis, Wayne Joubert, Feiyi Wang
2022CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs.
Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian
2022Combining Hard and Soft Constraints in Quantum Constraint-Satisfaction Systems.
Ellis Wilson, Frank Mueller, Scott Pakin
2022DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts.
Rohan Basu Roy, Tirthak Patel, Devesh Tiwari
2022DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale.
Reza Yazdani Aminabadi, Samyam Rajbhandari, Ammar Ahmad Awan, Cheng Li, Du Li, Elton Zheng, Olatunji Ruwase, Shaden Smith, Minjia Zhang, Jeff Rasley, Yuxiong He
2022Deinsum: Practically I/O Optimal Multi-Linear Algebra.
Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler
2022Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets.
Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello
2022EL-Rec: Efficient Large-Scale Recommendation Model Training via Tensor-Train Embedding Table.
Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding
2022Efficient Quantized Sparse Matrix Operations on Tensor Cores.
Shigang Li, Kazuki Osawa, Torsten Hoefler
2022Exaflops Biomedical Knowledge Graph Analytics.
Ramakrishnan Kannan, Piyush Sao, Hao Lu, Jakub Kurzak, Gundolf Schenk, Yongmei Shi, Seung-Hwan Lim, Sharat Israni, Vijay Thakkar, Guojing Cong, Robert M. Patton, Sergio E. Baranzini, Richard W. Vuduc, Thomas E. Potok
2022Extreme Scale Earthquake Simulation with Uncertainty Quantification.
Tsuyoshi Ichimura, Kohei Fujita, Ryota Kusakabe, Kentaro Koyama, Sota Murakami, Yuma Kikuchi, Takane Hori, Muneo Hori, Hikaru Inoue, Takafumi Nose, Takahiro Kawashima, Maddegedara Lalith
2022Extreme-Scale Many-against-Many Protein Similarity Search.
Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Muaaz G. Awan, Georgios A. Pavlopoulos, Ariful Azad, Nikos Kyrpides, Leonid Oliker, Katherine A. Yelick, Aydin Buluç
2022Finding Inputs that Trigger Floating-Point Exceptions in GPUs via Bayesian Optimization.
Ignacio Laguna, Ganesh Gopalakrishnan
2022From Correctable Memory Errors to Uncorrectable Memory Errors: What Error Bits Tell.
Cong Li, Yu Zhang, Jialei Wang, Hang Chen, Xian Liu, Tai Huang, Liang Peng, Shen Zhou, Lixin Wang, Shijian Ge
2022GUFI: Fast, Secure File System Metadata Search for Both Privileged and Unprivileged Users.
Dominic Manno, Jason Lee, Prajwal Challa, Qing Zheng, David Bonnie, Gary Grider, Bradley W. Settlemyer
2022Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling.
Pengcheng Li, Yixin Guo, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Xu Liu
2022GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow.
Dan Chen, Chuangyi Gui, Yi Zhang, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao
2022HGL: Accelerating Heterogeneous GNN Training with Holistic Representation and Optimization.
Yuntao Gui, Yidi Wu, Han Yang, Tatiana Jin, Boyang Li, Qihui Zhou, James Cheng, Fan Yu
2022HammingMesh: A Network Topology for Large-Scale Deep Learning.
Torsten Hoefler, Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott
2022HyLo: A Hybrid Low-Rank Natural Gradient Descent Method.
Baorun Mu, Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi
2022Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction.
Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle
2022LabStor: A Modular and Extensible Platform for Developing High-Performance, Customized I/O Stacks in Userspace.
Luke Logan, Jaime Cernuda Garcia, Jay F. Lofstead, Xian-He Sun, Anthony Kougkas
2022Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer.
Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li
2022Lessons Learned on MPI+Threads Communication.
Rohit Zambre, Aparna Chandramowlishwaran
2022LightSeq2: Accelerated Training for Transformer-Based Models on GPUs.
Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei Li
2022Mapping Out the HPC Dependency Chaos.
Farid Zakaria, Thomas R. W. Scogland, Todd Gamblin, Carlos Maltzahn
2022Memory Optimizations in an Array Language.
Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea
2022MetaWBC: POSIX-Compliant Metadata Write-Back Caching for Distributed File Systems.
Yingjin Qian, Wen Cheng, Lingfang Zeng, Marc-André Vef, Oleg Drokin, Andreas Dilger, Shuichi Ihara, Wusheng Zhang, Yang Wang, André Brinkmann
2022Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs.
Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello
2022Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems.
Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, Shivaram Venkataraman
2022Optimization of Full-Core Reactor Simulations on Summit.
Misun Min, Yu-Hsiang Lan, Paul F. Fischer, Elia Merzari, Stefan Kerkemeier, Malachi Phillips, Thilina Rathnayake, April Novak, Derek Gaston, Noel Chalmers, Tim Warburton
2022Optimizing Random Access to Hierarchically-Compressed Data on GPU.
Feng Zhang, Yihua Hu, Haipeng Ding, Zhiming Yao, Zhewei Wei, Xiao Zhang, Xiaoyong Du
2022Out of Hypervisor (OoH): Efficient Dirty Page Tracking in Userspace Using Hardware Virtualization Features.
Stella Bitchebe, Alain Tchana
2022P-Massive: A Real-Time Search Engine for a Multi-Terabyte Mass Spectrometry Database.
Narangerelt Batsoyol, Benjamin S. Pullman, Mingxun Wang, Nuno Bandeira, Steven Swanson
2022Parla: A Python Orchestration System for Heterogeneous Architectures.
Hochan Lee, William Ruys, Ian Henriksen, Arthur Michener Peters, Yineng Yan, Sean Stephens, Bozhi You, Henrique Fingler, Martin Burtscher, Milos Gligoric, Karl W. Schulz, Keshav Pingali, Christopher J. Rossbach, Mattan Erez, George Biros
2022PolarFly: A Cost-Effective and Flexible Low-Diameter Topology.
Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini
2022Positive-Phase Temperature Scaling for Quantum-Assisted Boltzmann Machine Training.
Jose P. Pinilla, Steven J. E. Wilton
2022Predicting Reuse Interval for Optimized Web Caching: An LSTM-Based Machine Learning Approach.
Pengcheng Li, Yixin Guo, Yongbin Gu
2022ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations.
Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler
2022Productive Performance Engineering for Weather and Climate Modeling with Python.
Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler
2022Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-In-Cell Simulations on Exascale-Class Supercomputers.
Luca Fedeli, Axel Huebl, France Boillod-Cerneux, Thomas Clark, Kevin Gott, Conrad Hillairet, Stephan Jaure, Adrien Leblanc, Rémi Lehe, Andrew Myers, Christelle Piechurski, Mitsuhisa Sato, Neïl Zaïm, Weiqun Zhang, Jean-Luc Vay, Henri Vincenti
2022QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services.
Kaihua Fu, Jiuchen Shi, Quan Chen, Ningxin Zheng, Wei Zhang, Deze Zeng, Minyi Guo
2022ReSemble: Reinforced Ensemble Framework for Data Prefetching.
Pengmiao Zhang, Rajgopal Kannan, Ajitesh Srivastava, Anant V. Nori, Viktor K. Prasanna
2022Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications.
Qinglei Cao, Sameh Abdulah, Rabab Alomairy, Yu Pei, Pratik Nag, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun
2022SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, November 13-18, 2022
Felix Wolf, Sameer Shende, Candace Culhane, Sadaf R. Alam, Heike Jagode
2022SERVIZ: A Shared In Situ Visualization Service.
Srinivasan Ramesh, Hank Childs, Allen D. Malony
2022SFS: Smart OS Scheduling for Serverless Functions.
YuQi Fu, Li Liu, Haoliang Wang, Yue Cheng, Songqing Chen
2022SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Federated Learning.
Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali Jannesari
2022STMatch: Accelerating Graph Pattern Matching on GPU with Stack-Based Loop Optimizations.
Yihua Wei, Peng Jiang
2022STRONGHOLD: Fast and Affordable Billion-Scale Deep Learning Model Training.
Xiaoyang Sun, Wei Wang, Shenghao Qiu, Renyu Yang, Songfang Huang, Jie Xu, Zheng Wang
2022Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation.
William S. Moses, Sri Hari Krishna Narayanan, Ludger Paehler, Valentin Churavy, Michel Schanen, Jan Hückelheim, Johannes Doerfert, Paul D. Hovland
2022Scalable Deep Learning-Based Microarchitecture Simulation on GPUs.
Santosh Pandey, Lingda Li, Thomas Flynn, Adolfy Hoisie, Hang Liu
2022Scalable Distributed High-Order Stencil Computations.
Mathias Jacquelin, Mauricio Araya-Polo, Jie Meng
2022Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way.
Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens
2022Scalable Linear Time Dense Direct Solver for 3-D Problems without Trailing Sub-Matrix Dependencies.
Qianxiang Ma, Sameer Deshmukh, Rio Yokota
2022Scaling Correlated Fragment Molecular Orbital Calculations on Summit.
Giuseppe M. J. Barca, Calum Snowdon, Jorge L. Galvez Vallejo, Fazeleh S. Kazemian, Alistair P. Rendell, Mark S. Gordon
2022Scaling Graph 500 SSSP to 140 Trillion Edges with over 40 Million Cores.
Yuanwei Wang, Huanqi Cao, Zixuan Ma, Wanwang Yin, Wenguang Chen
2022SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System.
Qi Chen, Shaonan Ma, Kang Chen, Teng Ma, Xin Liu, Dexun Chen, Yongwei Wu, Zuoning Chen
2022Solving Linear Systems on a GPU with Hierarchically Off-Diagonal Low-Rank Approximations.
Chao Chen, Per-Gunnar Martinsson
2022SpDISTAL: Compiling Distributed Sparse Tensor Computations.
Rohan Yadav, Alex Aiken, Fredrik Kjolstad
2022Study of Workload Interference with Intelligent Routing on Dragonfly.
Yao Kang, Xin Wang, Zhiling Lan
2022Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization.
Olivier Beaumont, Philippe Duchon, Lionel Eyraud-Dubois, Julien Langou, Mathieu Vérité
2022TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models.
Paul Caheny, Lluc Alvarez, Marc Casas, Miquel Moretó
2022Towards Scalable Resource Management for Supercomputers.
Yiqin Dai, Yong Dong, Kai Lu, Ruibo Wang, Wei Zhang, Juan Chen, Mingtian Shao, Zheng Wang
2022UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation.
Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai
2022Using Answer Set Programming for HPC Dependency Solving.
Todd Gamblin, Massimiliano Culpo, Gregory Becker, Sergei Shudler
2022Using Unused: Non-Invasive Dynamic FaaS Infrastructure with HPC-Whisk.
Bartlomiej Przybylski, Maciej Pawlik, Pawel Zuk, Bartlomiej Lagosz, Maciej Malawski, Krzysztof Rzadca
2022VSGM: View-Based GPU-Accelerated Subgraph Matching on Large Graphs.
Guanxian Jiang, Qihui Zhou, Tatiana Jin, Boyang Li, Yunjian Zhao, Yichao Li, James Cheng
2022Vectorizing Sparse Matrix Computations with Partially-Strided Codelets.
Kazem Cheshmi, Zachary Cetinic, Maryam Mehri Dehnavi
2022W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs.
Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, Guangming Tan
2022WholeGraph: A Fast Graph Neural Network Training Framework with Multi-GPU Distributed Shared Memory Architecture.
Dongxu Yang, Junhong Liu, Jiaxing Qi, Junjie Lai
2022vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms.
Menghan Jia, Yiming Zhang, Xinbiao Gan, Dongsheng Li, Erci Xu, Ruibo Wang, Kai Lu