| 2022 | A Data-aware Learned Index Scheme for Efficient Writes. Li Liu, Chunhua Li, Zhou Zhang, Yuhan Liu, Ke Zhou, Ji Zhang |
| 2022 | A Dynamic and Recoverable BMT Scheme for Secure Non-Volatile Memory. Mengya Lei, Fang Wang, Dan Feng, Xiaoyu Shuai, Yuchao Cao |
| 2022 | A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs. Andrey Prokopenko, Piyush Sao, Damien Lebrun-Grandié |
| 2022 | ADSTS: Automatic Distributed Storage Tuning System Using Deep Reinforcement Learning. Kai Lu, Guokuan Li, Jiguang Wan, Ruixiang Ma, Wei Zhao |
| 2022 | Accelerating Parallel First-Principles Excited-State Calculation by Low-Rank Approximation with K-Means Clustering. Qingcai Jiang, Jielan Li, Junshi Chen, Xinming Qin, Lingyun Wan, Jinlong Yang, Jie Liu, Wei Hu, Hong An |
| 2022 | Accelerating Random Forest Classification on GPU and FPGA. Milan Shah, Reece Neff, Hancheng Wu, Marco Minutoli, Antonino Tumeo, Michela Becchi |
| 2022 | Acuerdo: Fast Atomic Broadcast over RDMA. Joseph Izraelevitz, Gaukas Wang, Rhett Hanscom, Kayli Silvers, Tamara Silbergleit Lehman, Gregory V. Chockler, Alexey Gotsman |
| 2022 | Adaptive and Efficient GPU Time Sharing for Hyperparameter Tuning in Cloud. Liu Liu, Jian Yu, Zhijun Ding |
| 2022 | An Online Learning Approach for Client Selection in Federated Edge Learning under Budget Constraint. Lina Su, Ruiting Zhou, Ne Wang, Guang Fang, Zongpeng Li |
| 2022 | Analyzing Performance and Power-Efficiency Variations among NVIDIA GPUs. Kohei Yoshida, Rio Sageyama, Shinobu Miwa, Hayato Yamaki, Hiroki Honda |
| 2022 | Aperiodic Local SGD: Beyond Local SGD. Hao Zhang, Tingting Wu, Siyao Cheng, Jie Liu |
| 2022 | Atos: A Task-Parallel GPU Scheduler for Graph Analytics. Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens |
| 2022 | Automatic Differentiation of Parallel Loops with Formal Methods. Jan Hückelheim, Laurent Hascoët |
| 2022 | Automatically Generating High-performance Matrix Multiplication Kernels on the Latest Sunway Processor. Xiaohan Tao, Yu Zhu, Boyang Wang, Jinlong Xu, Jianmin Pang, Jie Zhao |
| 2022 | BSCache: A Brisk Semantic Caching Scheme for Cloud-based Performance Monitoring Timeseries Systems. Kai Zhang, Zhiqi Wang, Zili Shao |
| 2022 | BULB: Lightweight and Automated Load Balancing for Fast Datacenter Networks. Yuan Liu, Wenxin Li, Wenyu Qu, Heng Qi |
| 2022 | BWA-MEM-SCALE: Accelerating Genome Sequence Mapping on Commodity Servers. Changdae Kim, Kwangwon Koh, Taehoon Kim, Daegyu Han, Jiwon Seo |
| 2022 | Boosting Cross-rack Multi-stripe Repair in Heterogeneous Erasure-coded Clusters. Hai Zhou, Dan Feng |
| 2022 | Cache-Poll: Containing Pollution in Non-Inclusive Caches Through Cache Partitioning. Lucia Pons, Julio Sahuquillo, Salvador Petit, Julio Pons |
| 2022 | Characterizing Job Microarchitectural Profiles at Scale: Dataset and Analysis. Kangjin Wang, Ying Li, Cheng Wang, Tong Jia, Kingsum Chow, Yang Wen, Yaoyong Dou, Guoyao Xu, Chuanjia Hou, Jie Yao, Liping Zhang |
| 2022 | Characterizing and Optimizing Transformer Inference on ARM Many-core Processor. Jiazhi Jiang, Jiangsu Du, Dan Huang, Dongsheng Li, Jiang Zheng, Yutong Lu |
| 2022 | Counting Induced 6-Cycles in Bipartite Graphs. Jason Niu, Jaroslaw Zola, Ahmet Erdem Sariyüce |
| 2022 | DC4: Reconstructing Data-Credit-Coupled Congestion Control for Data Centers. Shan Huang, Dezun Dong, Lingbin Zeng, Zejia Zhou, Yukun Zhou, Xiangke Liao |
| 2022 | DRAM Cache Management with Request Granularity for NAND-based SSDs. Haodong Lin, Zhibing Sha, Jun Li, Zhigang Cai, Balazs Gerofi, Yuanquan Shi, Jianwei Liao |
| 2022 | DSSA: Dual-Side Sparse Systolic Array Architecture for Accelerating Convolutional Neural Network Training. Zhengbo Chen, Qi Yu, Fang Zheng, Feng Guo, Zuoning Chen |
| 2022 | DeepCAT: A Cost-Efficient Online Configuration Auto-Tuning Approach for Big Data Frameworks. Hui Dou, Yilun Wang, Yiwen Zhang, Pengfei Chen |
| 2022 | Distributed-Memory Parallel Contig Generation for De Novo Long-Read Genome Assembly. Giulia Guidi, Gabriel Raulet, Daniel Rokhsar, Leonid Oliker, Katherine A. Yelick, Aydin Buluç |
| 2022 | Dynamic Strategies for High Performance Training of Knowledge Graph Embeddings. Anwesh Panda, Sathish Vadhiyar |
| 2022 | EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers. Lijuan Jiang, Ping Xu, Qianchao Zhu, Xiuhong Li, Shengen Yan, Xingcheng Zhang, Dahua Lin, Wenjing Ma, Zhouyang Li, Jun Liu, Jinming Ma, Minxi Jin, Chao Yang |
| 2022 | Eco-FL: Adaptive Federated Learning with Efficient Edge Collaborative Pipeline Training. Shengyuan Ye, Liekang Zeng, Qiong Wu, Ke Luo, Qingze Fang, Xu Chen |
| 2022 | Efficient Phase-Functioned Real-time Character Control in Mobile Games: A TVM Enabled Approach. Haidong Lan, Wenxi Zhu, Du Wu, Qian Qiu, Honglin Zhu, Jingjing Zhao, Xinghui Fu, Liu Wei, Jintao Meng, Minwen Deng |
| 2022 | ElastiSim: A Batch-System Simulator for Malleable Workloads. Taylan Özden, Tim Beringer, Arya Mazaheri, Hamid Mohammadi Fard, Felix Wolf |
| 2022 | EmbRace: Accelerating Sparse Communication for Distributed Training of Deep Neural Networks. Shengwei Li, Zhiquan Lai, Dongsheng Li, Yiming Zhang, Xiangyu Ye, Yabo Duan |
| 2022 | Enabling Latency-Sensitive DNN Inference via Joint Optimization of Model Surgery and Resource Allocation in Heterogeneous Edge. Zhaowu Huang, Fang Dong, Dian Shen, Huitian Wang, Xiaolin Guo, Shucun Fu |
| 2022 | Energy-efficient Edge Server Management for Edge Computing: A Game-theoretical Approach. Guangming Cui, Qiang He, Xiaoyu Xia, Feifei Chen, Yun Yang |
| 2022 | Exploiting CXL-based Memory for Distributed Deep Learning. Moiz Arif, Kevin Assogba, M. Mustafa Rafique, Sudharshan Vazhkudai |
| 2022 | Exploiting Parallelism of Disk Failure Recovery via Partial Stripe Repair for an Erasure-Coded High-Density Storage Server. Lin Wang, Yuchong Hu, Qian Du, Dan Feng, Ray Wu, Ingo He, Kevin Zhang |
| 2022 | FAIR-BFL: Flexible and Incentive Redesign for Blockchain-based Federated Learning. Rongxin Xu, Shiva Raj Pokhrel, Qiujun Lan, Gang Li |
| 2022 | FLOPs as a Discriminant for Dense Linear Algebra Algorithms. Francisco López, Lars Karlsson, Paolo Bientinesi |
| 2022 | FedClassAvg: Local Representation Learning for Personalized Federated Learning on Heterogeneous Neural Networks. Jaehee Jang, Heonseok Ha, Dahuin Jung, Sungroh Yoon |
| 2022 | FedDRL: Deep Reinforcement Learning-based Adaptive Aggregation for Non-IID Data in Federated Learning. Nang Hung Nguyen, Phi Le Nguyen, Thuy Dung Nguyen, Trung Thanh Nguyen, Duc Long Nguyen, Thanh Hung Nguyen, Huy Hieu Pham, Truong Thao Nguyen |
| 2022 | FedHiSyn: A Hierarchical Synchronous Federated Learning Framework for Resource and Data Heterogeneity. Guanghao Li, Yue Hu, Miao Zhang, Ji Liu, Quanjun Yin, Yong Peng, Dejing Dou |
| 2022 | Formulating Interference-aware Data Delivery Strategies in Edge Storage Systems. Xiaoyu Xia, Feifei Chen, Qiang He, Guangming Cui, John C. Grundy, Mohamed Almorsy Abdelrazek, Fang Dong |
| 2022 | From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus. Dian-Lun Lin, Haoxing Ren, Yanqing Zhang, Brucek Khailany, Tsung-Wei Huang |
| 2022 | GraphSD: A State and Dependency aware Out-of-Core Graph Processing System. Xianghao Xu, Hong Jiang, Fang Wang, Yongli Cheng, Peng Fang |
| 2022 | HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks. Zining Zhang, Bingsheng He, Zhenjie Zhang |
| 2022 | HSP: Hybrid Synchronous Parallelism for Fast Distributed Deep Learning. Yijun Li, Jiawei Huang, Zhaoyi Li, Shengwen Zhou, Wanchun Jiang, Jianxin Wang |
| 2022 | Highly Parallel Linear Forest Extraction from a Weighted Graph on GPUs. Christoph Klein, Robert Strzodka |
| 2022 | IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs. Cunyang Wei, Haipeng Jia, Yunquan Zhang, Liusha Xu, Ji Qi |
| 2022 | LDPP: A Learned Directory Placement Policy in Distributed File Systems. Yuanzhang Wang, Fengkui Yang, Ji Zhang, Chunhua Li, Ke Zhou, Chong Liu, Zhuo Cheng, Wei Fang, Jinhu Liu |
| 2022 | Learning Mean-Field Control for Delayed Information Load Balancing in Large Queuing Systems. Anam Tahir, Kai Cui, Heinz Koeppl |
| 2022 | Lobster: Load Balance-Aware I/O for Distributed DNN Training. Jie Liu, Bogdan Nicolae, Dong Li |
| 2022 | MG-GCN: A Scalable multi-GPU GCN Training Framework. Muhammed Fatih Balin, Kaan Sancak, Ümit V. Çatalyürek |
| 2022 | Mentha: Enabling Sparse-Packing Computation on Systolic Arrays. Minjin Tang, Mei Wen, Yasong Cao, Junzhong Shen, Jianchao Yang, Jiawei Fei, Yang Guo, Sheng Liu |
| 2022 | Micro-Benchmarking MPI Partitioned Point-to-Point Communication. Yiltan Hassan Temuçin, Ryan E. Grant, Ahmad Afsahi |
| 2022 | Mlog: Multi-log Write Buffer upon Ultra-fast SSD RAID. Shucheng Wang, Qiang Cao, Ziyi Lu, Jie Yao |
| 2022 | Multi Resource Scheduling with Task Cloning in Heterogeneous Clusters. Huanle Xu, Yang Liu, Wing Cheong Lau |
| 2022 | NCC: Neighbor-aware Congestion Control based on Reinforcement Learning for Datacenter Networks. Haoyu Wang, Kevin Zheng, Charles Reiss, Haiying Shen |
| 2022 | NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database. Liang Liu, Mingzhu Shen, Ruihao Gong, Fengwei Yu, Hailong Yang |
| 2022 | On the Parallelization of MCMC for Community Detection. Frank Wanye, Vitaliy Gleyzer, Edward K. Kao, Wu-chun Feng |
| 2022 | Online Resource Optimization for Elastic Stream Processing with Regret Guarantee. Yang Liu, Huanle Xu, Wing Cheong Lau |
| 2022 | Online Scheduling of Moldable Task Graphs under Common Speedup Models. Anne Benoit, Lucas Perotin, Yves Robert, Hongyang Sun |
| 2022 | ParaGraph: An application-simulator interface and toolkit for hardware-software co-design. Mikhail Isaev, Nic McDonald, Jeffrey Young, Richard W. Vuduc |
| 2022 | Parallel Algorithms for Masked Sparse Matrix-Matrix Products. Srdan Milakovic, Oguz Selvitopi, Israt Nisa, Zoran Budimlic, Aydin Buluç |
| 2022 | Parallel Network Slicing for Multi-SP Services. Rongxin Han, Dezhi Chen, Song Guo, Xiaoyuan Fu, Jingyu Wang, Qi Qi, Jianxin Liao |
| 2022 | ParallelDualSPHysics: supporting efficient parallel fluid simulations through MPI-enabled SPH method. Sifan Long, Xiaowei Guo, Xiaokang Fan, Chao Li, Kelvin K. L. Wong, Ran Zhao, Yi Liu, Sen Zhang, Canqun Yang |
| 2022 | Penelope: Peer-to-peer Power Management. Tapan Srivastava, Huazhe Zhang, Henry Hoffmann |
| 2022 | Performance Modeling for Short-Term Cache Allocation. Christopher Stewart, Nathaniel Morris, Lydia Y. Chen, Robert Birke |
| 2022 | Postmortem Computation of Pagerank on Temporal Graphs. Md. Maruf Hossain, Erik Saule |
| 2022 | Proceedings of the 51st International Conference on Parallel Processing, ICPP 2022, Bordeaux, France, 29 August 2022 - 1 September 2022 |
| 2022 | ROWE-tree: A Read-Optimized and Write-Efficient B+-tree for Persistent Memory. Xiaomin Zou, Fang Wang, Dan Feng, Tianjin Guan, Nan Su |
| 2022 | Regularizing Sparse and Imbalanced Communications for Voxel-based Brain Simulations on Supercomputers. Yuhao Liu, Xin Du, Zhihui Lu, Qiang Duan, Jianfeng Feng, Minglong Wang, Jie Wu |
| 2022 | Repair-Optimal Data Placement for Locally Repairable Codes with Optimal Minimum Hamming Distance. Shuang Ma, Si Wu, Cheng Li, Yinlong Xu |
| 2022 | SHE: A Generic Framework for Data Stream Mining over Sliding Windows. Yuhan Wu, Zhuochen Fan, Qilong Shi, Yixin Zhang, Tong Yang, Cheng Chen, Zheng Zhong, Junnan Li, Ariel Shtul, Yaofeng Tu |
| 2022 | SMEGA2: Distributed Asynchronous Deep Neural Network Training With a Single Momentum Buffer. Refael Cohen, Ido Hakimi, Assaf Schuster |
| 2022 | SPAMeR: Speculative Push for Anticipated Message Requests in Multi-Core Systems. Qinzhe Wu, Ashen Ekanayake, Ruihao Li, Jonathan C. Beard, Lizy Kurian John |
| 2022 | Scheduling Fork-Join Task Graphs with Communication Delays and Equal Processing Times. Huijun Wang, Oliver Sinnen |
| 2022 | Semi-Online Multi-Machine with Restart Scheduling for Integrated Edge and Cloud Computing Systems. Liming Ge, Zizhao Wang, Wei Bao, Dong Yuan, Nguyen Hoang Tran, Bing Bing Zhou, Albert Y. Zomaya |
| 2022 | Simmer: Rate proportional scheduling to reduce packet drops in vGPU based NF chains. Avinash Kumar Chaurasia, Anshuj Garg, Bhaskaran Raman, Uday Kurkure, Hari Sivaraman, Lan Vu, Sairam Veeraswamy |
| 2022 | Spread: Decentralized Model Aggregation for Scalable Federated Learning. Chuang Hu, Huanghuang Liang, Xiao Ming Han, Boan Liu, Dazhao Cheng, Dan Wang |
| 2022 | TCB: Accelerating Transformer Inference Services with Request Concatenation. Boqian Fu, Fahao Chen, Peng Li, Deze Zeng |
| 2022 | Tensor-Accelerated Fourth-Order Epistasis Detection on GPUs. Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, Leonel Sousa |
| 2022 | Tesseract: Parallelize the Tensor Parallelism Efficiently. Boxiang Wang, Qifan Xu, Zhengda Bian, Yang You |
| 2022 | Themis: Fair Memory Subsystem Resource Sharing with Differentiated QoS in Public Clouds. Wenda Tang, Senbo Fu, Yutao Ke, Qian Peng, Feng Gao |
| 2022 | TileSpMSpV: A Tiled Algorithm for Sparse Matrix-Sparse Vector Multiplication on GPUs. Haonan Ji, Huimin Song, Shibo Lu, Zhou Jin, Guangming Tan, Weifeng Liu |
| 2022 | Towards Fast Large-scale Graph Analysis via Two-dimensional Balanced Partitioning. Shuai Lin, Rui Wang, Yongkun Li, Yinlong Xu, John C. S. Lui, Fei Chen, Pengcheng Wang, Lei Han |
| 2022 | Transparent load balancing of MPI programs using OmpSs-2@Cluster and DLB. Jimmy Aguilar Mena, Omar Shaaban, Victor Lopez, Marta Garcia, Paul M. Carpenter, Eduard Ayguadé, Jesús Labarta |
| 2022 | UA-Sketch: An Accurate Approach to Detect Heavy Flow based on Uninterrupted Arrival. Jin Ye, Lin Li, Wenlu Zhang, Guihao Chen, Yuanchao Shan, Yijun Li, Weihe Li, Jiawei Huang |
| 2022 | Vectorizing SpMV by Exploiting Dynamic Regular Patterns. Xin You, Changxi Liu, Hailong Yang, Pengbo Wang, Zhongzhi Luan, Depei Qian |