ICPP B

89 papers

YearTitle / Authors
2022A Data-aware Learned Index Scheme for Efficient Writes.
Li Liu, Chunhua Li, Zhou Zhang, Yuhan Liu, Ke Zhou, Ji Zhang
2022A Dynamic and Recoverable BMT Scheme for Secure Non-Volatile Memory.
Mengya Lei, Fang Wang, Dan Feng, Xiaoyu Shuai, Yuchao Cao
2022A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs.
Andrey Prokopenko, Piyush Sao, Damien Lebrun-Grandié
2022ADSTS: Automatic Distributed Storage Tuning System Using Deep Reinforcement Learning.
Kai Lu, Guokuan Li, Jiguang Wan, Ruixiang Ma, Wei Zhao
2022Accelerating Parallel First-Principles Excited-State Calculation by Low-Rank Approximation with K-Means Clustering.
Qingcai Jiang, Jielan Li, Junshi Chen, Xinming Qin, Lingyun Wan, Jinlong Yang, Jie Liu, Wei Hu, Hong An
2022Accelerating Random Forest Classification on GPU and FPGA.
Milan Shah, Reece Neff, Hancheng Wu, Marco Minutoli, Antonino Tumeo, Michela Becchi
2022Acuerdo: Fast Atomic Broadcast over RDMA.
Joseph Izraelevitz, Gaukas Wang, Rhett Hanscom, Kayli Silvers, Tamara Silbergleit Lehman, Gregory V. Chockler, Alexey Gotsman
2022Adaptive and Efficient GPU Time Sharing for Hyperparameter Tuning in Cloud.
Liu Liu, Jian Yu, Zhijun Ding
2022An Online Learning Approach for Client Selection in Federated Edge Learning under Budget Constraint.
Lina Su, Ruiting Zhou, Ne Wang, Guang Fang, Zongpeng Li
2022Analyzing Performance and Power-Efficiency Variations among NVIDIA GPUs.
Kohei Yoshida, Rio Sageyama, Shinobu Miwa, Hayato Yamaki, Hiroki Honda
2022Aperiodic Local SGD: Beyond Local SGD.
Hao Zhang, Tingting Wu, Siyao Cheng, Jie Liu
2022Atos: A Task-Parallel GPU Scheduler for Graph Analytics.
Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens
2022Automatic Differentiation of Parallel Loops with Formal Methods.
Jan Hückelheim, Laurent Hascoët
2022Automatically Generating High-performance Matrix Multiplication Kernels on the Latest Sunway Processor.
Xiaohan Tao, Yu Zhu, Boyang Wang, Jinlong Xu, Jianmin Pang, Jie Zhao
2022BSCache: A Brisk Semantic Caching Scheme for Cloud-based Performance Monitoring Timeseries Systems.
Kai Zhang, Zhiqi Wang, Zili Shao
2022BULB: Lightweight and Automated Load Balancing for Fast Datacenter Networks.
Yuan Liu, Wenxin Li, Wenyu Qu, Heng Qi
2022BWA-MEM-SCALE: Accelerating Genome Sequence Mapping on Commodity Servers.
Changdae Kim, Kwangwon Koh, Taehoon Kim, Daegyu Han, Jiwon Seo
2022Boosting Cross-rack Multi-stripe Repair in Heterogeneous Erasure-coded Clusters.
Hai Zhou, Dan Feng
2022Cache-Poll: Containing Pollution in Non-Inclusive Caches Through Cache Partitioning.
Lucia Pons, Julio Sahuquillo, Salvador Petit, Julio Pons
2022Characterizing Job Microarchitectural Profiles at Scale: Dataset and Analysis.
Kangjin Wang, Ying Li, Cheng Wang, Tong Jia, Kingsum Chow, Yang Wen, Yaoyong Dou, Guoyao Xu, Chuanjia Hou, Jie Yao, Liping Zhang
2022Characterizing and Optimizing Transformer Inference on ARM Many-core Processor.
Jiazhi Jiang, Jiangsu Du, Dan Huang, Dongsheng Li, Jiang Zheng, Yutong Lu
2022Counting Induced 6-Cycles in Bipartite Graphs.
Jason Niu, Jaroslaw Zola, Ahmet Erdem Sariyüce
2022DC4: Reconstructing Data-Credit-Coupled Congestion Control for Data Centers.
Shan Huang, Dezun Dong, Lingbin Zeng, Zejia Zhou, Yukun Zhou, Xiangke Liao
2022DRAM Cache Management with Request Granularity for NAND-based SSDs.
Haodong Lin, Zhibing Sha, Jun Li, Zhigang Cai, Balazs Gerofi, Yuanquan Shi, Jianwei Liao
2022DSSA: Dual-Side Sparse Systolic Array Architecture for Accelerating Convolutional Neural Network Training.
Zhengbo Chen, Qi Yu, Fang Zheng, Feng Guo, Zuoning Chen
2022DeepCAT: A Cost-Efficient Online Configuration Auto-Tuning Approach for Big Data Frameworks.
Hui Dou, Yilun Wang, Yiwen Zhang, Pengfei Chen
2022Distributed-Memory Parallel Contig Generation for De Novo Long-Read Genome Assembly.
Giulia Guidi, Gabriel Raulet, Daniel Rokhsar, Leonid Oliker, Katherine A. Yelick, Aydin Buluç
2022Dynamic Strategies for High Performance Training of Knowledge Graph Embeddings.
Anwesh Panda, Sathish Vadhiyar
2022EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers.
Lijuan Jiang, Ping Xu, Qianchao Zhu, Xiuhong Li, Shengen Yan, Xingcheng Zhang, Dahua Lin, Wenjing Ma, Zhouyang Li, Jun Liu, Jinming Ma, Minxi Jin, Chao Yang
2022Eco-FL: Adaptive Federated Learning with Efficient Edge Collaborative Pipeline Training.
Shengyuan Ye, Liekang Zeng, Qiong Wu, Ke Luo, Qingze Fang, Xu Chen
2022Efficient Phase-Functioned Real-time Character Control in Mobile Games: A TVM Enabled Approach.
Haidong Lan, Wenxi Zhu, Du Wu, Qian Qiu, Honglin Zhu, Jingjing Zhao, Xinghui Fu, Liu Wei, Jintao Meng, Minwen Deng
2022ElastiSim: A Batch-System Simulator for Malleable Workloads.
Taylan Özden, Tim Beringer, Arya Mazaheri, Hamid Mohammadi Fard, Felix Wolf
2022EmbRace: Accelerating Sparse Communication for Distributed Training of Deep Neural Networks.
Shengwei Li, Zhiquan Lai, Dongsheng Li, Yiming Zhang, Xiangyu Ye, Yabo Duan
2022Enabling Latency-Sensitive DNN Inference via Joint Optimization of Model Surgery and Resource Allocation in Heterogeneous Edge.
Zhaowu Huang, Fang Dong, Dian Shen, Huitian Wang, Xiaolin Guo, Shucun Fu
2022Energy-efficient Edge Server Management for Edge Computing: A Game-theoretical Approach.
Guangming Cui, Qiang He, Xiaoyu Xia, Feifei Chen, Yun Yang
2022Exploiting CXL-based Memory for Distributed Deep Learning.
Moiz Arif, Kevin Assogba, M. Mustafa Rafique, Sudharshan Vazhkudai
2022Exploiting Parallelism of Disk Failure Recovery via Partial Stripe Repair for an Erasure-Coded High-Density Storage Server.
Lin Wang, Yuchong Hu, Qian Du, Dan Feng, Ray Wu, Ingo He, Kevin Zhang
2022FAIR-BFL: Flexible and Incentive Redesign for Blockchain-based Federated Learning.
Rongxin Xu, Shiva Raj Pokhrel, Qiujun Lan, Gang Li
2022FLOPs as a Discriminant for Dense Linear Algebra Algorithms.
Francisco López, Lars Karlsson, Paolo Bientinesi
2022FedClassAvg: Local Representation Learning for Personalized Federated Learning on Heterogeneous Neural Networks.
Jaehee Jang, Heonseok Ha, Dahuin Jung, Sungroh Yoon
2022FedDRL: Deep Reinforcement Learning-based Adaptive Aggregation for Non-IID Data in Federated Learning.
Nang Hung Nguyen, Phi Le Nguyen, Thuy Dung Nguyen, Trung Thanh Nguyen, Duc Long Nguyen, Thanh Hung Nguyen, Huy Hieu Pham, Truong Thao Nguyen
2022FedHiSyn: A Hierarchical Synchronous Federated Learning Framework for Resource and Data Heterogeneity.
Guanghao Li, Yue Hu, Miao Zhang, Ji Liu, Quanjun Yin, Yong Peng, Dejing Dou
2022Formulating Interference-aware Data Delivery Strategies in Edge Storage Systems.
Xiaoyu Xia, Feifei Chen, Qiang He, Guangming Cui, John C. Grundy, Mohamed Almorsy Abdelrazek, Fang Dong
2022From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus.
Dian-Lun Lin, Haoxing Ren, Yanqing Zhang, Brucek Khailany, Tsung-Wei Huang
2022GraphSD: A State and Dependency aware Out-of-Core Graph Processing System.
Xianghao Xu, Hong Jiang, Fang Wang, Yongli Cheng, Peng Fang
2022HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks.
Zining Zhang, Bingsheng He, Zhenjie Zhang
2022HSP: Hybrid Synchronous Parallelism for Fast Distributed Deep Learning.
Yijun Li, Jiawei Huang, Zhaoyi Li, Shengwen Zhou, Wanchun Jiang, Jianxin Wang
2022Highly Parallel Linear Forest Extraction from a Weighted Graph on GPUs.
Christoph Klein, Robert Strzodka
2022IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs.
Cunyang Wei, Haipeng Jia, Yunquan Zhang, Liusha Xu, Ji Qi
2022LDPP: A Learned Directory Placement Policy in Distributed File Systems.
Yuanzhang Wang, Fengkui Yang, Ji Zhang, Chunhua Li, Ke Zhou, Chong Liu, Zhuo Cheng, Wei Fang, Jinhu Liu
2022Learning Mean-Field Control for Delayed Information Load Balancing in Large Queuing Systems.
Anam Tahir, Kai Cui, Heinz Koeppl
2022Lobster: Load Balance-Aware I/O for Distributed DNN Training.
Jie Liu, Bogdan Nicolae, Dong Li
2022MG-GCN: A Scalable multi-GPU GCN Training Framework.
Muhammed Fatih Balin, Kaan Sancak, Ümit V. Çatalyürek
2022Mentha: Enabling Sparse-Packing Computation on Systolic Arrays.
Minjin Tang, Mei Wen, Yasong Cao, Junzhong Shen, Jianchao Yang, Jiawei Fei, Yang Guo, Sheng Liu
2022Micro-Benchmarking MPI Partitioned Point-to-Point Communication.
Yiltan Hassan Temuçin, Ryan E. Grant, Ahmad Afsahi
2022Mlog: Multi-log Write Buffer upon Ultra-fast SSD RAID.
Shucheng Wang, Qiang Cao, Ziyi Lu, Jie Yao
2022Multi Resource Scheduling with Task Cloning in Heterogeneous Clusters.
Huanle Xu, Yang Liu, Wing Cheong Lau
2022NCC: Neighbor-aware Congestion Control based on Reinforcement Learning for Datacenter Networks.
Haoyu Wang, Kevin Zheng, Charles Reiss, Haiying Shen
2022NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database.
Liang Liu, Mingzhu Shen, Ruihao Gong, Fengwei Yu, Hailong Yang
2022On the Parallelization of MCMC for Community Detection.
Frank Wanye, Vitaliy Gleyzer, Edward K. Kao, Wu-chun Feng
2022Online Resource Optimization for Elastic Stream Processing with Regret Guarantee.
Yang Liu, Huanle Xu, Wing Cheong Lau
2022Online Scheduling of Moldable Task Graphs under Common Speedup Models.
Anne Benoit, Lucas Perotin, Yves Robert, Hongyang Sun
2022ParaGraph: An application-simulator interface and toolkit for hardware-software co-design.
Mikhail Isaev, Nic McDonald, Jeffrey Young, Richard W. Vuduc
2022Parallel Algorithms for Masked Sparse Matrix-Matrix Products.
Srdan Milakovic, Oguz Selvitopi, Israt Nisa, Zoran Budimlic, Aydin Buluç
2022Parallel Network Slicing for Multi-SP Services.
Rongxin Han, Dezhi Chen, Song Guo, Xiaoyuan Fu, Jingyu Wang, Qi Qi, Jianxin Liao
2022ParallelDualSPHysics: supporting efficient parallel fluid simulations through MPI-enabled SPH method.
Sifan Long, Xiaowei Guo, Xiaokang Fan, Chao Li, Kelvin K. L. Wong, Ran Zhao, Yi Liu, Sen Zhang, Canqun Yang
2022Penelope: Peer-to-peer Power Management.
Tapan Srivastava, Huazhe Zhang, Henry Hoffmann
2022Performance Modeling for Short-Term Cache Allocation.
Christopher Stewart, Nathaniel Morris, Lydia Y. Chen, Robert Birke
2022Postmortem Computation of Pagerank on Temporal Graphs.
Md. Maruf Hossain, Erik Saule
2022Proceedings of the 51st International Conference on Parallel Processing, ICPP 2022, Bordeaux, France, 29 August 2022 - 1 September 2022
2022ROWE-tree: A Read-Optimized and Write-Efficient B+-tree for Persistent Memory.
Xiaomin Zou, Fang Wang, Dan Feng, Tianjin Guan, Nan Su
2022Regularizing Sparse and Imbalanced Communications for Voxel-based Brain Simulations on Supercomputers.
Yuhao Liu, Xin Du, Zhihui Lu, Qiang Duan, Jianfeng Feng, Minglong Wang, Jie Wu
2022Repair-Optimal Data Placement for Locally Repairable Codes with Optimal Minimum Hamming Distance.
Shuang Ma, Si Wu, Cheng Li, Yinlong Xu
2022SHE: A Generic Framework for Data Stream Mining over Sliding Windows.
Yuhan Wu, Zhuochen Fan, Qilong Shi, Yixin Zhang, Tong Yang, Cheng Chen, Zheng Zhong, Junnan Li, Ariel Shtul, Yaofeng Tu
2022SMEGA2: Distributed Asynchronous Deep Neural Network Training With a Single Momentum Buffer.
Refael Cohen, Ido Hakimi, Assaf Schuster
2022SPAMeR: Speculative Push for Anticipated Message Requests in Multi-Core Systems.
Qinzhe Wu, Ashen Ekanayake, Ruihao Li, Jonathan C. Beard, Lizy Kurian John
2022Scheduling Fork-Join Task Graphs with Communication Delays and Equal Processing Times.
Huijun Wang, Oliver Sinnen
2022Semi-Online Multi-Machine with Restart Scheduling for Integrated Edge and Cloud Computing Systems.
Liming Ge, Zizhao Wang, Wei Bao, Dong Yuan, Nguyen Hoang Tran, Bing Bing Zhou, Albert Y. Zomaya
2022Simmer: Rate proportional scheduling to reduce packet drops in vGPU based NF chains.
Avinash Kumar Chaurasia, Anshuj Garg, Bhaskaran Raman, Uday Kurkure, Hari Sivaraman, Lan Vu, Sairam Veeraswamy
2022Spread: Decentralized Model Aggregation for Scalable Federated Learning.
Chuang Hu, Huanghuang Liang, Xiao Ming Han, Boan Liu, Dazhao Cheng, Dan Wang
2022TCB: Accelerating Transformer Inference Services with Request Concatenation.
Boqian Fu, Fahao Chen, Peng Li, Deze Zeng
2022Tensor-Accelerated Fourth-Order Epistasis Detection on GPUs.
Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, Leonel Sousa
2022Tesseract: Parallelize the Tensor Parallelism Efficiently.
Boxiang Wang, Qifan Xu, Zhengda Bian, Yang You
2022Themis: Fair Memory Subsystem Resource Sharing with Differentiated QoS in Public Clouds.
Wenda Tang, Senbo Fu, Yutao Ke, Qian Peng, Feng Gao
2022TileSpMSpV: A Tiled Algorithm for Sparse Matrix-Sparse Vector Multiplication on GPUs.
Haonan Ji, Huimin Song, Shibo Lu, Zhou Jin, Guangming Tan, Weifeng Liu
2022Towards Fast Large-scale Graph Analysis via Two-dimensional Balanced Partitioning.
Shuai Lin, Rui Wang, Yongkun Li, Yinlong Xu, John C. S. Lui, Fei Chen, Pengcheng Wang, Lei Han
2022Transparent load balancing of MPI programs using [email protected] and DLB.
Jimmy Aguilar Mena, Omar Shaaban, Victor Lopez, Marta Garcia, Paul M. Carpenter, Eduard Ayguadé, Jesús Labarta
2022UA-Sketch: An Accurate Approach to Detect Heavy Flow based on Uninterrupted Arrival.
Jin Ye, Lin Li, Wenlu Zhang, Guihao Chen, Yuanchao Shan, Yijun Li, Weihe Li, Jiawei Huang
2022Vectorizing SpMV by Exploiting Dynamic Regular Patterns.
Xin You, Changxi Liu, Hailong Yang, Pengbo Wang, Zhongzhi Luan, Depei Qian