| 2019 | A 2D Parallel Triangle Counting Algorithm for Distributed-Memory Architectures. Ancy Sarah Tom, George Karypis |
| 2019 | A Network-aware and Partition-based Resource Management Scheme for Data Stream Processing. Yidan Wang, Zahir Tari, Xiaoran Huang, Albert Y. Zomaya |
| 2019 | A Parallel Graph Algorithm for Detecting Mesh Singularities in Distributed Memory Ice Sheet Simulations. Ian Bogle, Karen D. Devine, Mauro Perego, Sivasankaran Rajamanickam, George M. Slota |
| 2019 | A Plugin Architecture for the TAU Performance System. Allen D. Malony, Srinivasan Ramesh, Kevin A. Huck, Nicholas Chaimov, Sameer Shende |
| 2019 | A Practical, Scalable, Relaxed Priority Queue. Tingzhe Zhou, Maged M. Michael, Michael F. Spear |
| 2019 | A Read-leveling Data Distribution Scheme for Promoting Read Performance in SSDs with Deduplication. Mengting Lu, Fang Wang, Dan Feng, Yuchong Hu |
| 2019 | A Specialized Concurrent Queue for Scheduling Irregular Workloads on GPUs. David Troendle, Tuan Ta, Byunghyun Jang |
| 2019 | A Tale of Two (Flow) Tables: Demystifying Rule Caching in OpenFlow Switches. Rui Li, Yu Pang, Jin Zhao, Xin Wang |
| 2019 | A Unified Optimization Approach for CNN Model Inference on Integrated GPUs. Leyuan Wang, Zhi Chen, Yizhi Liu, Yao Wang, Lianmin Zheng, Mu Li, Yida Wang |
| 2019 | AVR: Reducing Memory Traffic with Approximate Value Reconstruction. Albin Eldstål-Damlin, Pedro Trancoso, Ioannis Sourdis |
| 2019 | Accelerated Work Stealing. D. Brian Larkins, John Snyder, James Dinan |
| 2019 | Accelerating All-Edge Common Neighbor Counting on Three Processors. Yulin Che, Zhuohang Lai, Shixuan Sun, Qiong Luo, Yue Wang |
| 2019 | Accelerating Long Read Alignment on Three Processors. Zonghao Feng, Shuang Qiu, Lipeng Wang, Qiong Luo |
| 2019 | AdaM: An Adaptive Fine-Grained Scheme for Distributed Metadata Management. Shiyi Cao, Yuanning Gao, Xiaofeng Gao, Guihai Chen |
| 2019 | Adaptive Learning for Concept Drift in Application Performance Modeling. Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert Latham, Glenn K. Lockwood, Robert B. Ross, Shane Snyder, Stefan M. Wild |
| 2019 | Adaptive Routing Reconfigurations to Minimize Flow Cost in SDN-Based Data Center Networks. Akbar Majidi, Xiaofeng Gao, Shunjia Zhu, Nazila Jahanbakhsh, Guihai Chen |
| 2019 | An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform. Deguang Wang, Junzhong Shen, Mei Wen, Chunyuan Zhang |
| 2019 | Approximate Code: A Cost-Effective Erasure Coding Framework for Tiered Video Storage in Cloud Systems. Huayi Jin, Chentao Wu, Xin Xie, Jie Li, Minyi Guo, Hao Lin, Jianfeng Zhang |
| 2019 | Artemis: A Practical Low-latency Naming and Routing System. Xuebing Li, Bingyang Liu, Yang Chen, Yu Xiao, Jiaxin Tang, Xin Wang |
| 2019 | Automatic Differentiation for Adjoint Stencil Loops. Jan Hückelheim, Navjot Kukreja, Sri Hari Krishna Narayanan, Fabio Luporini, Gerard Gorman, Paul D. Hovland |
| 2019 | BCL: A Cross-Platform Distributed Data Structures Library. Benjamin Brock, Aydin Buluç, Katherine A. Yelick |
| 2019 | BPP: A Realtime Block Access Pattern Mining Scheme for I/O Prediction. Chunjie Zhu, Fang Wang, Binbing Hou |
| 2019 | Breaking Band: A Breakdown of High-performance Communication. Rohit Zambre, Megan Grodowitz, Aparna Chandramowlishwaran, Pavel Shamis |
| 2019 | Building Scalable NVM-based B+tree with HTM. Mengxing Liu, Jiankai Xing, Kang Chen, Yongwei Wu |
| 2019 | COMBFT: Conflicting-Order-Match based Byzantine Fault Tolerance Protocol with High Efficiency and Robustness. Yingyao Rong, Weigang Wu, Zhiguang Chen |
| 2019 | CPpf: a prefetch aware LLC partitioning approach. Jun Xiao, Andy D. Pimentel, Xu Liu |
| 2019 | Cartesian Collective Communication. Jesper Larsson Träff, Sascha Hunold |
| 2019 | Compiler-Assisted GPU Thread Throttling for Reduced Cache Contention. Hyunjun Kim, Sungin Hong, Hyeonsu Lee, Euiseong Seo, Hwansoo Han |
| 2019 | Controlled Asynchronous GVT: Accelerating Parallel Discrete Event Simulation on Many-Core Clusters. Ali Eker, Barry Williams, Kenneth Chiu, Dmitry Ponomarev |
| 2019 | Cooperative Job Scheduling and Data Allocation for Busy Data-Intensive Parallel Computing Clusters. Guoxin Liu, Haiying Shen, Haoyu Wang |
| 2019 | Cosin: Controllable Social Influence Maximization and Its Distributed Implementation in Large-scale Social Networks. Jingya Zhou, Jianxi Fan, Jin Wang |
| 2019 | CostPI: Cost-Effective Performance Isolation for Shared NVMe SSDs. Jiahao Liu, Fang Wang, Dan Feng |
| 2019 | Cynthia: Cost-Efficient Cloud Resource Provisioning for Predictable Distributed Deep Neural Network Training. Haoyue Zheng, Fei Xu, Li Chen, Zhi Zhou, Fangming Liu |
| 2019 | DICER: Diligent Cache Partitioning for Efficient Workload Consolidation. Konstantinos Nikas, Nikela Papadopoulou, Dimitra Giantsidi, Vasileios Karakostas, Georgios I. Goumas, Nectarios Koziris |
| 2019 | DLBooster: Boosting End-to-End Deep Learning Workflows with Offloading Data Preprocessing Pipelines. Yang Cheng, Dan Li, Zhiyuan Guo, Binyao Jiang, Jiaxin Lin, Xi Fan, Jinkun Geng, Xinyi Yu, Wei Bai, Lei Qu, Ran Shu, Peng Cheng, Yongqiang Xiong, Jianping Wu |
| 2019 | Data and Thread Placement in NUMA Architectures: A Statistical Learning Approach. Nicolas Denoyelle, Brice Goglin, Emmanuel Jeannot, Thomas Ropars |
| 2019 | DeepHash: An End-to-End Learning Approach for Metadata Management in Distributed File Systems. Yuanning Gao, Xiaofeng Gao, Guihai Chen |
| 2019 | Design Exploration of Multi-tier Interconnection Networks for Exascale Systems. Javier Navaridas, Joshua Lant, Jose Antonio Pascual, Mikel Luján, John Goodacre |
| 2019 | Distributed Join Algorithms on Multi-CPU Clusters with GPUDirect RDMA. Chengxin Guo, Hong Chen, Feng Zhang, Cuiping Li |
| 2019 | Dynamic Load Balancing in Hybrid Switching Data Center Networks with Converters. Jiaqi Zheng, Qiming Zheng, Xiaofeng Gao, Guihai Chen |
| 2019 | ECoST: Energy-Efficient Co-Locating and Self-Tuning MapReduce Applications. Maria Malik, Hassan Ghasemzadeh, Tinoosh Mohsenin, Rosario Cammarota, Liang Zhao, Avesta Sasan, Houman Homayoun, Setareh Rafatirad |
| 2019 | EMBA: Efficient Memory Bandwidth Allocation to Improve Performance on Intel Commodity Processor. Yaocheng Xiang, Chencheng Ye, Xiaolin Wang, Yingwei Luo, Zhenlin Wang |
| 2019 | Efficient Data-Parallel Primitives on Heterogeneous Systems. Zhuohang Lai, Qiong Luo, Xiaolong Xie |
| 2019 | Exploiting Vector Processing in Dynamic Binary Translation. Chih-Min Lin, Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu |
| 2019 | Express Link Placement for NoC-Based Many-Core Platforms. Yunfan Li, Di Zhu, Lizhong Chen |
| 2019 | Fast Recovery Techniques for Erasure-coded Clusters in Non-uniform Traffic Network. Yunren Bai, Zihan Xu, Haixia Wang, Dongsheng Wang |
| 2019 | Faster parallel collision detection at high resolution for CNC milling applications. Xin Chen, Dmytro Konobrytskyi, Thomas M. Tucker, Thomas R. Kurfess, Richard W. Vuduc |
| 2019 | FlowCon: Elastic Flow Configuration for Containerized Deep Learning Applications. Wenjia Zheng, Michael Tynes, Henry Gorelick, Ying Mao, Long Cheng, Yantian Hou |
| 2019 | FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation. Tao Wang, Nikhil Jain, David Beckingsale, David Böhme, Frank Mueller, Todd Gamblin |
| 2019 | Gossip: Efficient Communication Primitives for Multi-GPU Systems. Robin Kobus, Daniel Jünger, Christian Hundt, Bertil Schmidt |
| 2019 | Gravitational Octree Code Performance Evaluation on Volta GPU. Yohei Miki |
| 2019 | HOPE: A Parallel Execution Model Based on Hierarchical Omission. Masahiro Yasugi, Daisuke Muraoka, Tasuku Hiraishi, Seiji Umatani, Kento Emoto |
| 2019 | HPAS: An HPC Performance Anomaly Suite for Reproducing Performance Variations. Emre Ates, Yijia Zhang, Burak Aksar, Jim M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun |
| 2019 | Holistic Slowdown Driven Scheduling and Resource Management for Malleable Jobs. Marco D'Amico, Ana Jokanovic, Julita Corbalán |
| 2019 | How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures. Carlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer, Jesper Larsson Träff |
| 2019 | HyperPRAW: Architecture-Aware Hypergraph Restreaming Partition to Improve Performance of Parallel Applications Running on High Performance Computing Systems. Carlos Fernandez Musoles, Daniel Coca, Paul Richmond |
| 2019 | I/O Characterization and Performance Evaluation of BeeGFS for Deep Learning. Fahim Chowdhury, Yue Zhu, Todd Heer, Saul Paredes, Adam Moody, Robin Goldstone, Kathryn M. Mohror, Weikuan Yu |
| 2019 | Improved Unconstrained Energy Functional Method for Eigensolvers in Electronic Structure Calculations. Mauro Del Ben, Osni Marques, Andrew Canning |
| 2019 | Improving Short Job Latency Performance in Hybrid Job Schedulers with Dice. Wei Zhou, K. Preston White, Hongfeng Yu |
| 2019 | Incorporating Probabilistic Optimizations for Resource Provisioning of Data Processing Workflows. Amelie Chi Zhou, Yao Xiao, Bingsheng He, Shadi Ibrahim, Reynold Cheng |
| 2019 | JobPacker: Job Scheduling for Data-Parallel Frameworks with Hybrid Electrical/Optical Datacenter Networks. Zhuozhao Li, Haiying Shen |
| 2019 | LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores. Adrian Garcia-Garcia, Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matías |
| 2019 | Lightweight Fault Tolerance in Pregel-Like Systems. Da Yan, James Cheng, Hongzhi Chen, Cheng Long, Purushotham V. Bangalore |
| 2019 | MAC: Memory Access Coalescer for 3D-Stacked Memory. Xi Wang, Antonino Tumeo, John D. Leidel, Jie Li, Yong Chen |
| 2019 | Machine Learning for Fine-Grained Hardware Prefetcher Control. Jason Hiebel, Laura E. Brown, Zhenlin Wang |
| 2019 | Massively Parallel ANS Decoding on GPUs. André Weißenberger, Bertil Schmidt |
| 2019 | Massively Parallel Automated Software Tuning. Jakub Kurzak, Yaohung M. Tsai, Mark Gates, Ahmad Abdelfattah, Jack J. Dongarra |
| 2019 | Modeling the Performance of Atomic Primitives on Modern Architectures. Fazeleh Sadat Hoseini, Aras Atalar, Philippas Tsigas |
| 2019 | Multi-Objective Reinforcement Learning for Reconfiguring Data Stream Analytics on Edge Computing. Alexandre da Silva Veith, Felipe Rodrigo de Souza, Marcos Dias de Assunção, Laurent Lefèvre, Julio Cesar Santos dos Anjos |
| 2019 | N-Code: An Optimal RAID-6 MDS Array Code for Load Balancing and High I/O Performance. Ping Xie, Zhu Yuan, Jianzhong Huang, Xiao Qin |
| 2019 | NFV-Enabled Multicasting in Mobile Edge Clouds with Resource Sharing. Zichuan Xu, Yutong Zhang, Weifa Liang, Qiufen Xia, Omer F. Rana, Alex Galis, Guowei Wu, Pan Zhou |
| 2019 | Near-Data Processing-Enabled and Time-Aware Compaction Optimization for LSM-tree-based Key-Value Stores. Hui Sun, Wei Liu, Jianzhong Huang, Song Fu, Zhi Qiao, Weisong Shi |
| 2019 | Nested Virtualization Without the Nest. Mathieu Bacou, Grégoire Todeschi, Alain Tchana, Daniel Hagimont |
| 2019 | Network Congestion Avoidance through Packet-chaining Reservation. Ke Wu, Dezun Dong, Cunlu Li, Shan Huang, Yi Dai |
| 2019 | Network Congestion-aware Online Service Function Chain Placement and Load Balancing. Xiaojun Shang, Zhenhua Liu, Yuanyuan Yang |
| 2019 | OSP: Overlapping Computation and Communication in Parameter Server for Fast Machine Learning. Haozhao Wang, Song Guo, Ruixuan Li |
| 2019 | On Integration of Appends and Merges in Log-Structured Merge Trees. Caixin Gong, Shuibing He, Yili Gong, Yingchun Lei |
| 2019 | On Max-min Fair Resource Allocation for Distributed Job Execution. Yitong Guan, Chuanyou Li, Xueyan Tang |
| 2019 | Optimized Execution of Parallel Loops via User-Defined Scheduling Policies. Seonmyeong Bak, Yanfei Guo, Pavan Balaji, Vivek Sarkar |
| 2019 | Parallel Algorithms for Evaluating Matrix Polynomials. Sivan Toledo, Amit Waisel |
| 2019 | Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels. Suraj Kumar, Lionel Eyraud-Dubois, Sriram Krishnamoorthy |
| 2019 | Performance, Energy, and Scalability Analysis and Improvement of Parallel Cancer Deep Learning CANDLE Benchmarks. Xingfu Wu, Valerie E. Taylor, Justin M. Wozniak, Rick Stevens, Thomas S. Brettin, Fangfang Xia |
| 2019 | PhSIH: A Lightweight Parallelization of Event Matching in Content-based Pub/Sub Systems. Zhengyu Liao, Shiyou Qian, Jian Cao, Yanhua Cao, Guangtao Xue, Jiadi Yu, Yanmin Zhu, Minglu Li |
| 2019 | Predictable GPUs Frequency Scaling for Energy and Performance. Kaijie Fan, Biagio Cosenza, Ben H. H. Juurlink |
| 2019 | Proceedings of the 48th International Conference on Parallel Processing, ICPP 2019, Kyoto, Japan, August 05-08, 2019 |
| 2019 | QLEC: A Machine-Learning-Based Energy-Efficient Clustering Algorithm to Prolong Network Lifespan for IoT in High-Dimensional Space. Ke Li, Haowei Huang, Xiaofeng Gao, Fan Wu, Guihai Chen |
| 2019 | RFPL: A Recovery Friendly Parity Logging Scheme for Reducing Small Write Penalty of SSD RAID. Gaoxiang Xu, Dan Feng, Zhipeng Tan, Xinyan Zhang, Jie Xu, Xi Shu, Yifeng Zhu |
| 2019 | Reducing Kernel Surface Areas for Isolation and Scalability. Daniel Zahka, Brian Kocoloski, Kate Keahey |
| 2019 | Refactoring and Optimizing WRF Model on Sunway TaihuLight. Kai Xu, Zhenya Song, Yuandong Chan, Shida Wang, Xiangxu Meng, Weiguo Liu, Wei Xue |
| 2019 | Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime Systems. Bibek Wagle, Mohammad Alaul Haque Monil, Kevin A. Huck, Allen D. Malony, Adrian Serio, Hartmut Kaiser |
| 2019 | SAFE: Service Availability via Failure Elimination Through VNF Scaling. Rui Xia, Haipeng Dai, Jiaqi Zheng, Rong Gu, Xiaoyu Wang, Guihai Chen |
| 2019 | SaC: Exploiting Execution-Time Slack to Save Energy in Heterogeneous Multicore Systems. Muhammad Waqar Azhar, Miquel Pericàs, Per Stenström |
| 2019 | Solving All-Pairs Shortest-Paths Problem in Large Graphs Using Apache Spark. Frank Schoeneman, Jaroslaw Zola |
| 2019 | Spatially-aware Parallel I/O for Particle Data. Sidharth Kumar, Steve Petruzza, Will Usher, Valerio Pascucci |
| 2019 | Speculative Scheduling for Stochastic HPC Applications. Ana Gainaru, Guillaume Pallez, Hongyang Sun, Padma Raghavan |
| 2019 | Stage Delay Scheduling: Speeding up DAG-style Data Analytics Jobs with Resource Interleaving. Wujie Shao, Fei Xu, Li Chen, Haoyue Zheng, Fangming Liu |
| 2019 | TEA: A Traffic-efficient Erasure-coded Archival Scheme for In-memory Stores. Bin Xu, Jianzhong Huang, Qiang Cao, Xiao Qin |
| 2019 | TLB: Traffic-aware Load Balancing with Adaptive Granularity in Data Center Networks. Jinbin Hu, Jiawei Huang, Wenjun Lv, Weihe Li, Jianxin Wang, Tian He |
| 2019 | Tessellating Star Stencils. Liang Yuan, Shan Huang, Yunquan Zhang, Hang Cao |
| 2019 | The Case for Water-Immersion Computer Boards. Michihiro Koibuchi, Ikki Fujiwara, Naoya Niwa, Tomohiro Totoki, Shoichi Hirasawa |
| 2019 | The Communication-Overlapped Hybrid Decomposition Parallel Algorithm for Multi-Scale Fluid Simulations. Yi Liu, Xiaowei Guo, Chao Li, Canqun Yang, Xinbiao Gan, Peng Zhang, Yi Wang, Ran Zhao, Sijiang Fan |
| 2019 | Transfer Learning based Failure Prediction for Minority Disks in Large Data Centers of Heterogeneous Disk Systems. Ji Zhang, Ke Zhou, Ping Huang, Xubin He, Zhili Xiao, Bin Cheng, Yongguang Ji, Yinhu Wang |
| 2019 | Unleashing the Scalability Potential of Power-Constrained Data Center in the Microservice Era. Xiaofeng Hou, Jiacheng Liu, Chao Li, Minyi Guo |
| 2019 | VScan: Efficiently Analyzing Surveillance Videos via Model-joint Mechanism. Chen Zhang, Qiang Cao, Jie Yao, Yuanyuan Dong, Puyuan Yang |
| 2019 | When Power Oversubscription Meets Traffic Flood Attack: Re-Thinking Data Center Peak Load Management. Xiaofeng Hou, Mingyu Liang, Chao Li, Wenli Zheng, Quan Chen, Minyi Guo |
| 2019 | diBELLA: Distributed Long Read to Long Read Alignment. Marquita Ellis, Giulia Guidi, Aydin Buluç, Leonid Oliker, Katherine A. Yelick |
| 2019 | swATOP: Automatically Optimizing Deep Learning Operators on SW26010 Many-Core Processor. Wei Gao, Jiarui Fang, Wenlai Zhao, Jinzhe Yang, Long Wang, Lin Gan, Haohuan Fu, Guangwen Yang |