| 2023 | A Guaranteed Approximation Algorithm for Scheduling Fork-Joins with Communication Delay. Pierre-François Dutot, Yeu-Shin Fu, Nikhil Prasad, Oliver Sinnen |
| 2023 | A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication. Yufan Xia, Marco De La Pierre, Amanda S. Barnard, Giuseppe M. J. Barca |
| 2023 | A Novel Framework for Efficient Offloading of Communication Operations to Bluefield SmartNICs. Kaushik Kandadi Suresh, Benjamin Michalowicz, Bharath Ramesh, Nicholas Contini, Jinghan Yao, Shulei Xu, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda |
| 2023 | A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices. João Nuno Ferreira Alves, Luís M. S. Russo, Alexandre P. Francisco, Siegfried Benkner |
| 2023 | Accelerating CNN inference on long vector architectures via co-design. Sonia Rani Gupta, Nikela Papadopoulou, Miquel Pericàs |
| 2023 | Accelerating Distributed Deep Learning Training with Compression Assisted Allgather and Reduce-Scatter Communication. Qinghua Zhou, Quentin Anthony, Lang Xu, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda |
| 2023 | Accelerating Packet Processing in Container Overlay Networks via Packet-level Parallelism. Jiaxin Lei, Manish Munikar, Hui Lu, Jia Rao |
| 2023 | Accurate and Efficient Distributed COVID-19 Spread Prediction based on a Large-Scale Time-Varying People Mobility Graph. Sudipta Saha Shubha, Shohaib Mahmud, Haiying Shen, Geoffrey C. Fox, Madhav V. Marathe |
| 2023 | Alioth: A Machine Learning Based Interference-Aware Performance Monitor for Multi-Tenancy Applications in Public Cloud. Tianyao Shi, Yingxuan Yang, Yunlong Cheng, Xiaofeng Gao, Zhen Fang, Yongqiang Yang |
| 2023 | An Adaptive Hybrid Quantum Algorithm for the Metric Traveling Salesman Problem. Fei Li, Arul Rhik Mazumder |
| 2023 | An Efficient 2D Method for Training Super-Large Deep Learning Models. Qifan Xu, Yang You |
| 2023 | An Experimental Study of Two-level Schwarz Domain-Decomposition Preconditioners on GPUs. Ichitaro Yamazaki, Alexander Heinlein, Sivasankaran Rajamanickam |
| 2023 | AnyQ: An Evaluation Framework for Massively-Parallel Queue Algorithms. Michael Kenzel, Stefan Lemme, Richard Membarth, Matthias Kurtenacker, Hugo Devillers, Markus Steinberger, Philipp Slusallek |
| 2023 | ArkFS: A Distributed File System on Object Storage for Archiving Data in HPC Environment. Kyu-Jin Cho, Injae Kang, Jin-soo Kim |
| 2023 | Asynch-SGBDT: Train Stochastic Gradient Boosting Decision Trees in an Asynchronous Parallel Manner. Daning Cheng, Shigang Li, Yunquan Zhang |
| 2023 | Boosting Multi-Block Repair in Cloud Storage Systems with Wide-Stripe Erasure Coding. Qi Yu, Lin Wang, Yuchong Hu, Yumeng Xu, Dan Feng, Jie Fu, Xia Zhu, Zhen Yao, Wenjia Wei |
| 2023 | ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs. Yujia Zhai, Chengquan Jiang, Leyuan Wang, Xiaoying Jia, Shang Zhang, Zizhong Chen, Xin Liu, Yibo Zhu |
| 2023 | Chic-sched: a HPC Placement-Group Scheduler on Hierarchical Topologies with Constraints. Laurent Schares, Asser N. Tantawi, Pavlos Maniotis, Ming-Hung Chen, Claudia Misale, Seetharami Seelam, Hao Yu |
| 2023 | Communication Optimization for Distributed Execution of Graph Neural Networks. Süreyya Emre Kurt, Jinghua Yan, Aravind Sukumaran-Rajam, Prashant Pandey, P. Sadayappan |
| 2023 | DAOS as HPC Storage: a View From Numerical Weather Prediction. Nicolau Manubens, Tiago Quintino, Simon D. Smart, Emanuele Danovaro, Adrian Jackson |
| 2023 | Data Distribution Schemes for Dense Linear Algebra Factorizations on Any Number of Nodes. Olivier Beaumont, Jean-Alexandre Collin, Lionel Eyraud-Dubois, Mathieu Vérité |
| 2023 | DeepThermo: Deep Learning Accelerated Parallel Monte Carlo Sampling for Thermodynamics Evaluation of High Entropy Alloys. Junqi Yin, Feiyi Wang, Mallikarjun Arjun Shankar |
| 2023 | Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc. Kawthar Shafie Khorassani, Chen-Chun Chen, Hari Subramoni, Dhabaleswar K. Panda |
| 2023 | Distributed Sparse Random Projection Trees for Constructing K-Nearest Neighbor Graphs. Isuru Ranawaka, Md. Khaledur Rahman, Ariful Azad |
| 2023 | Distributing Simplex-Shaped Nested for-Loops to Identify Carcinogenic Gene Combinations. Sajal Dash, Mohammad Alaul Haque Monil, Junqi Yin, Ramu Anandakrishnan, Feiyi Wang |
| 2023 | Drill: Log-based Anomaly Detection for Large-scale Storage Systems Using Source Code Analysis. Di Zhang, Chris Egersdoerfer, Tabassum Mahmud, Mai Zheng, Dong Dai |
| 2023 | Duo: Improving Data Sharing of Stateful Serverless Applications by Efficiently Caching Multi-Read Data. Zhuo Huang, Hao Fan, Chaoyi Cheng, Song Wu, Hai Jin |
| 2023 | Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams. Yongseok Soh, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Teresa M. Ranadive, Fabrizio Petrini, Jee W. Choi |
| 2023 | Dynasparse: Accelerating GNN Inference through Dynamic Sparsity Exploitation. Bingyi Zhang, Viktor K. Prasanna |
| 2023 | Efficient Hardware Primitives for Immediate Memory Reclamation in Optimistic Data Structures. Ajay Singh, Trevor Brown, Michael Spear |
| 2023 | Engineering Massively Parallel MST Algorithms. Peter Sanders, Matthias Schimek |
| 2023 | Engineering a Distributed-Memory Triangle Counting Algorithm. Peter Sanders, Tim Niklas Uhl |
| 2023 | Evaluating Asynchronous Parallel I/O on HPC Systems. John Ravi, Suren Byna, Quincey Koziol, Houjun Tang, Michela Becchi |
| 2023 | Exact Fault-Tolerant Consensus with Voting Validity. Zhangchen Xu, Yuetai Li, Chenglin Feng, Lei Zhang |
| 2023 | Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU. Jianjin Liao, Mingzhen Li, Hailong Yang, Qingxiao Sun, Biao Sun, Jiwei Hao, Tianyu Feng, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Zhongzhi Luan, Depei Qian |
| 2023 | Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training. Siddharth Singh, Abhinav Bhatele |
| 2023 | FIRST: Exploiting the Multi-Dimensional Attributes of Functions for Power-Aware Serverless Computing. Lu Zhang, Chao Li, Xinkai Wang, Weiqi Feng, Zheng Yu, Quan Chen, Jingwen Leng, Minyi Guo, Pu Yang, Shang Yue |
| 2023 | Fast And Automatic Floating Point Error Analysis With CHEF-FP. Garima Singh, Baidyanath Kundu, Harshitha Menon, Alexander Penev, David J. Lange, Vassil Vassilev |
| 2023 | Fast Deterministic Gathering with Detection on Arbitrary Graphs: The Power of Many Robots. Anisur Rahaman Molla, Kaushik Mondal, William K. Moses Jr. |
| 2023 | Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks. Ruibo Fan, Wei Wang, Xiaowen Chu |
| 2023 | FaultyRank: A Graph-based Parallel File System Checker. Saisha Kamat, Abdullah Al Raqibul Islam, Mai Zheng, Dong Dai |
| 2023 | Feature-based SpMV Performance Analysis on Contemporary Devices. Panagiotis Mpakos, Dimitrios Galanopoulos, Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas |
| 2023 | FedBIAD: Communication-Efficient and Accuracy-Guaranteed Federated Learning with Bayesian Inference-Based Adaptive Dropout. Jingjing Xue, Min Liu, Sheng Sun, Yuwei Wang, Hui Jiang, Xuefeng Jiang |
| 2023 | FedTrip: A Resource-Efficient Federated Learning Method with Triplet Regularization. Xujing Li, Min Liu, Sheng Sun, Yuwei Wang, Hui Jiang, Xuefeng Jiang |
| 2023 | GPU-Accelerated Error-Bounded Compression Framework for Quantum Circuit Simulations. Milan Shah, Xiaodong Yu, Sheng Di, Danylo Lykov, Yuri Alexeev, Michela Becchi, Franck Cappello |
| 2023 | GPU-enabled Function-as-a-Service for Machine Learning Inference. Ming Zhao, Kritshekhar Jha, Sungho Hong |
| 2023 | Generalizable Reinforcement Learning-Based Coarsening Model for Resource Allocation over Large and Diverse Stream Processing Graphs. Lanshun Nie, Yuqi Qiu, Fei Meng, Mo Yu, Jing Li |
| 2023 | GraphMetaP: Efficient MetaPath Generation for Dynamic Heterogeneous Graph Models. Haiheng He, Dan Chen, Long Zheng, Yu Huang, Haifeng Liu, Chaoqiang Liu, Xiaofei Liao, Hai Jin |
| 2023 | GraphTensor: Comprehensive GNN-Acceleration Framework for Efficient Parallel Processing of Massive Datasets. Junhyeok Jang, Miryeong Kwon, Donghyun Gouk, Hanyeoreum Bae, Myoungsoo Jung |
| 2023 | H-Cache: Traffic-Aware Hybrid Rule-Caching in Software-Defined Networks. Zeyu Luan, Qing Li, Yi Wang, Yong Jiang |
| 2023 | Harnessing the Crowd for Autotuning High-Performance Computing Applications. Younghyun Cho, James Weldon Demmel, Jacob King, Xiaoye S. Li, Yang Liu, Hengrui Luo |
| 2023 | HyScale-GNN: A Scalable Hybrid GNN Training System on Single-Node Heterogeneous Architecture. Yi-Chien Lin, Viktor K. Prasanna |
| 2023 | IEEE International Parallel and Distributed Processing Symposium, IPDPS 2023, St. Petersburg, FL, USA, May 15-19, 2023 |
| 2023 | Lossy Scientific Data Compression With SPERR. Shaomeng Li, Peter Lindstrom, John P. Clyne |
| 2023 | LowFive: In Situ Data Transport for High-Performance Workflows. Tom Peterka, Dmitriy Morozov, Arnur Nigmetov, Orcun Yildiz, Bogdan Nicolae, Philip E. Davis |
| 2023 | Lyra: Fast and Scalable Resilience to Reordering Attacks in Blockchains. Pouriya Zarbafian, Vincent Gramoli |
| 2023 | MCR-DL: Mix-and-Match Communication Runtime for Deep Learning. Quentin Anthony, Ammar Ahmad Awan, Jeff Rasley, Yuxiong He, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda |
| 2023 | MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism. Zheng Zhang, Donglin Yang, Yaqi Xia, Liang Ding, Dacheng Tao, Xiaobo Zhou, Dazhao Cheng |
| 2023 | Memory-aware Optimization for Sequences of Sparse Matrix-Vector Multiplications. Yichen Zhang, Shengguo Li, Fan Yuan, Dezun Dong, Xiaojian Yang, Tiejun Li, Zheng Wang |
| 2023 | Mimir: Extending I/O Interfaces to Express User Intent for Complex Workloads in HPC. Hariharan Devarajan, Kathryn M. Mohror |
| 2023 | Neural Network Compiler for Parallel High-Throughput Simulation of Digital Circuits. Ignacio Gavier, Joshua Russell, Devdhar Patel, Edward A. Rietman, Hava T. Siegelmann |
| 2023 | On Doorway Egress by Autonomous Robots. Rory Hector, Ramachandran Vaidyanathan, Gokarna Sharma, Jerry L. Trahan |
| 2023 | On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM). Emmanuel Agullo, Alfredo Buttari, Olivier Coulaud, Lionel Eyraud-Dubois, Mathieu Faverge, Alain Franc, Abdou Guermouche, Antoine Jego, Romain Peressoni, Florent Pruvost |
| 2023 | Opportunities and Limitations of Hardware Timestamps in Concurrent Data Structures. Olivia Grimes, Jacob Nelson-Slivon, Ahmed Hassan, Roberto Palmieri |
| 2023 | Optimizing Cloud Computing Resource Usage for Hemodynamic Simulation. William Ladd, Christopher Jensen, Madhurima Vardhan, Jeff Ames, Jeff R. Hammond, Erik W. Draeger, Amanda Randles |
| 2023 | PAQR: Pivoting Avoiding QR factorization. Wissam M. Sid-Lakhdar, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Piotr Luszczek, Mark Gates, Stanimire Tomov, Hans Johansen, David B. Williams-Young, Timothy A. Davis, Jack J. Dongarra, Hartwig Anzt |
| 2023 | PFedSA: Personalized Federated Multi-Task Learning via Similarity Awareness. Chuyao Ye, Hao Zheng, Zhigang Hu, Meiguang Zheng |
| 2023 | PRF: A Fast Parallel Relaxed Flooding Algorithm for Voronoi Diagram Generation on GPU. Jue Wang, Fumihiko Ino, Jing Ke |
| 2023 | Porting a Computational Fluid Dynamics Code with AMR to Large-scale GPU Platforms. Joshua Hoke Davis, Justin Shafner, Daniel Nichols, Nathan Grube, Pino Martin, Abhinav Bhatele |
| 2023 | Power Constrained Autotuning using Graph Neural Networks. Akash Dutta, Jee Choi, Ali Jannesari |
| 2023 | Predictive Analysis of Code Optimisations on Large-Scale Coupled CFD-Combustion Simulations using the CPX Mini-App. Archie Powell, Gihan R. Mudalige |
| 2023 | Proactive SLA-aware Application Placement in the Computing Continuum. Zahra Najafabadi Samani, Narges Mehran, Dragi Kimovski, Radu Prodan |
| 2023 | QoS-Aware and Cost-Efficient Dynamic Resource Allocation for Serverless ML Workflows. Hao Wu, Junxiao Deng, Hao Fan, Shadi Ibrahim, Song Wu, Hai Jin |
| 2023 | RLP: Power Management Based on a Latency-Aware Roofline Model. Bo Wang, Anara Kozhokanova, Christian Terboven, Matthias S. Müller |
| 2023 | RT-DBSCAN: Accelerating DBSCAN using Ray Tracing Hardware. Vani Nagarajan, Milind Kulkarni |
| 2023 | SBGT: Scaling Bayesian-based Group Testing for Disease Surveillance. Weicong Chen, Hao Qi, Xiaoyi Lu, Curtis Tatsuoka |
| 2023 | SCONNA: A Stochastic Computing Based Optical Accelerator for Ultra-Fast, Energy-Efficient Inference of Integer-Quantized CNNs. Sairam Sri Vatsavai, Venkata Sai Praneeth Karempudi, Ishan G. Thakkar, Sayed Ahmad Salehi, Jeffrey Todd Hastings |
| 2023 | SLAP: An Adaptive, Learned Admission Policy for Content Delivery Network Caching. Ke Liu, Kan Wu, Hua Wang, Ke Zhou, Ji Zhang, Cong Li |
| 2023 | SRC: Mitigate I/O Throughput Degradation in Network Congestion Control of Disaggregated Storage Systems. Danlin Jia, Yiming Xie, Li Wang, Xiaoqian Zhang, Allen Yang, Xuebin Yao, Mahsa Bayati, Pradeep Subedi, Bo Sheng, Ningfang Mi |
| 2023 | SW-LCM: A Scalable and Weakly-supervised Land Cover Mapping Method on a New Sunway Supercomputer. Yi Zhao, Juepeng Zheng, Haohuan Fu, Wenzhao Wu, Jie Gao, Mengxuan Chen, Jinxiao Zhang, Lixian Zhang, Runmin Dong, Zhenrong Du, Sha Liu, Xin Liu, Shaoqing Zhang, Le Yu |
| 2023 | Satellite Collision Detection using Spatial Data Structures. Christian Hellwig, Fabian Czappa, Martin Michel, Reinhold Bertrand, Felix Wolf |
| 2023 | Scalable adaptive algorithms for next-generation multiphase flow simulations. Kumar Saurabh, Masado Ishii, Makrand A. Khanwale, Hari Sundar, Baskar Ganapathysubramanian |
| 2023 | Scheduling with Many Shared Resources. Max A. Deppert, Klaus Jansen, Marten Maack, Simon Pukrop, Malin Rau |
| 2023 | SelB-k-NN: A Mini-Batch K-Nearest Neighbors Algorithm on AI Processors. Yifeng Tang, Cho-Li Wang |
| 2023 | Signal Detection for Large MIMO Systems Using Sphere Decoding on FPGAs. Mohamed W. Hassan, Adel Dabah, Hatem Ltaief, Suhaib A. Fahmy |
| 2023 | Smart Redbelly Blockchain: Reducing Congestion for Web3. Deepal Tennakoon, Yiding Hua, Vincent Gramoli |
| 2023 | Software-Defined, Fast and Strongly-Consistent Data Replication for RDMA-Based PM Datastores. Haodi Lu, Haikun Liu, Chencheng Ye, Xiaofei Liao, Fubing Mao, Yu Zhang, Hai Jin |
| 2023 | Stochastic Neuromorphic Circuits for Solving MAXCUT. Bradley H. Theilman, Yipu Wang, Ojas Parekh, William Severa, J. Darby Smith, James B. Aimone |
| 2023 | Towards Faster Fully Homomorphic Encryption Implementation with Integer and Floating-point Computing Power of GPUs. Guang Fan, Fangyu Zheng, Lipeng Wan, Lili Gao, Yuan Zhao, Jiankuo Dong, Yixuan Song, Yuewu Wang, Jingqiang Lin |
| 2023 | Traversing Large Compressed Graphs on GPUs. Prasun Gera, Hyesoon Kim |
| 2023 | TurboHE: Accelerating Fully Homomorphic Encryption Using FPGA Clusters. Haohao Liao, Mahmoud A. Elmohr, Xuan Dong, Yanjun Qian, Wenzhe Yang, Zhiwei Shang, Yin Tan |
| 2023 | UnifyFS: A User-level Shared File System for Unified Access to Distributed Local Storage. Michael J. Brim, Adam T. Moody, Seung-Hwan Lim, Ross G. Miller, Swen Boehm, Cameron Stanavige, Kathryn M. Mohror, Sarp Oral |
| 2023 | ZFP-X: Efficient Embedded Coding for Accelerating Lossy Floating Point Compression. Bing Lu, Yida Li, Junqi Wang, Huizhang Luo, Kenli Li |
| 2023 | k-Center Clustering with Outliers in the MPC and Streaming Model. Mark de Berg, Leyla Biabani, Morteza Monemizadeh |
| 2023 | qTask: Task-parallel Quantum Circuit Simulation with Incrementality. Tsung-Wei Huang |
| 2023 | rFaaS: Enabling High Performance Serverless with RDMA and Leases. Marcin Copik, Konstantin Taranov, Alexandru Calotoiu, Torsten Hoefler |