| 2018 | A Communication-Efficient Causal Broadcast Protocol. João Paulo de Araujo, Luciana Arantes, Elias P. Duarte Jr., Luiz A. Rodrigues, Pierre Sens |
| 2018 | A Comprehensive Study on Bugs in Actor Systems. Brandon Hedden, Xinghui Zhao |
| 2018 | A Distributed Infomap Algorithm for Scalable and High-Quality Community Detection. Jianping Zeng, Hongfeng Yu |
| 2018 | A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010. Xinliang Wang, Ping Xu, Wei Xue, Yulong Ao, Chao Yang, Haohuan Fu, Lin Gan, Guangwen Yang, Weimin Zheng |
| 2018 | A Framework for Auto-Parallelization and Code Generation: An Integrative Case Study with Legacy FORTRAN Codes. Konstantinos Krommydas, Paul Sathre, Ruchira Sasanka, Wu-chun Feng |
| 2018 | A Generic Approach to Scheduling and Checkpointing Workflows. Li Han, Valentin Le Fèvre, Louis-Claude Canon, Yves Robert, Frédéric Vivien |
| 2018 | A Multilevel Subtree Method for Single and Batched Sparse Cholesky Factorization. Meng Tang, Mohamed Gadou, Steven C. Rennich, Timothy A. Davis, Sanjay Ranka |
| 2018 | A Performance Model to Execute Workflows on High-Bandwidth-Memory Architectures. Anne Benoit, Swann Perarnau, Loïc Pottier, Yves Robert |
| 2018 | A Write-efficient and Consistent Hashing Scheme for Non-Volatile Memory. Xiaoyi Zhang, Dan Feng, Yu Hua, Jianxi Chen, Mandi Fu |
| 2018 | Accelerating FM-index Search for Genomic Data Processing. Yuanrong Wang, Xueqi Li, Dawei Zang, Guangming Tan, Ninghui Sun |
| 2018 | An Empirical Comparison of k-Shortest Simple Path Algorithms on Multicores. Deepak Ajwani, Erika Duriakova, Neil Hurley, Ulrich Meyer, Alexander Schickedanz |
| 2018 | Balanced k-means for Parallel Geometric Partitioning. Moritz von Looz, Charilaos Tzovas, Henning Meyerhenke |
| 2018 | Bandwidth Reduced Parallel SpMV on the SW26010 Many-Core Platform. Qiao Sun, Changyou Zhang, Changmao Wu, Jiajia Zhang, Leisheng Li |
| 2018 | C-Graph: A Highly Efficient Concurrent Graph Reachability Query Framework. Li Zhou, Ren Chen, Yinglong Xia, Radu Teodorescu |
| 2018 | CAMPS: Conflict-Aware Memory-Side Prefetching Scheme for Hybrid Memory Cube. Muhammad M. Rafique, Zhichun Zhu |
| 2018 | CSTF: Large-Scale Sparse Tensor Factorizations on Distributed Platforms. Zachary Blanco, Bangtian Liu, Maryam Mehri Dehnavi |
| 2018 | Cache Assisted Randomized Sharing Counters in Network Measurement. Qian Liu, Haipeng Dai, Alex X. Liu, Qi Li, Xiaoyu Wang, Jiaqi Zheng |
| 2018 | Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-Ievel Fault Injection. Omer Subasi, Chun-Kai Chang, Mattan Erez, Sriram Krishnamoorthy |
| 2018 | Charging Task Scheduling for Directional Wireless Charger Networks. Haipeng Dai, Ke Sun, Alex X. Liu, Lijun Zhang, Jiaqi Zheng, Guihai Chen |
| 2018 | Click-Based Asynchronous Mesh Network with Bounded Bundled Data. Anping He, Guangbo Feng, Jilin Zhang, Pengfei Li, Yong Hei, Hong Chen |
| 2018 | Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations. Raphaël Prat, Laurent Colombet, Raymond Namyst |
| 2018 | Communication-Avoiding for Dynamical Core of Atmospheric General Circulation Model. Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, Guangming Tan |
| 2018 | Constructing Dynamic Policies for Paging Mode Selection. Jason Hiebel, Laura E. Brown, Zhenlin Wang |
| 2018 | Cross-Rack-Aware Updates in Erasure-Coded Data Centers. Zhirong Shen, Patrick P. C. Lee |
| 2018 | DAG-SFC: Minimize the Embedding Cost of SFC with Parallel VNFs. Xu Lin, Deke Guo, Yulong Shen, Guoming Tang, Bangbang Ren |
| 2018 | Disk Failure Prediction in Data Centers via Online Learning. Jiang Xiao, Zhuang Xiong, Song Wu, Yusheng Yi, Hai Jin, Kan Hu |
| 2018 | Dual-Paradigm Stream Processing. Song Wu, Zhiyi Liu, Shadi Ibrahim, Lin Gu, Hai Jin, Fei Chen |
| 2018 | Duchy: Achieving Both SSD Durability and Controllable SMR Cleaning Overhead in Hybrid Storage Systems. Xuchao Xie, Tianye Yang, Qiong Li, Dengping Wei, Liquan Xiao |
| 2018 | Efficient Runtime Support for a Partitioned Global Logical Address Space. D. Brian Larkins, John Snyder, James Dinan |
| 2018 | Efficient SSD Caching by Avoiding Unnecessary Writes using Machine Learning. Hua Wang, Xinbo Yi, Ping Huang, Bin Cheng, Ke Zhou |
| 2018 | Efficient Search for Free Blocks in the WAFL File System. Ram Kesavan, Matthew Curtis-Maury, Mrinal K. Bhattacharjee |
| 2018 | Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous Clusters. Amelie Chi Zhou, Tien-Dat Phan, Shadi Ibrahim, Bingsheng He |
| 2018 | Energy-efficient Application Resource Scheduling using Machine Learning Classifiers. Connor Imes, Steven A. Hofmeyr, Henry Hoffmann |
| 2018 | FFS-VA: A Fast Filtering System for Large-scale Video Analytics. Chen Zhang, Qiang Cao, Hong Jiang, Wenhui Zhang, Jingjun Li, Jie Yao |
| 2018 | FULT: Fast User-Level Thread Scheduling Using Bit-Vectors. Hoang-Vu Dang, Marc Snir |
| 2018 | GLP4NN: A Convergence-invariant and Network-agnostic Light-weight Parallelization Framework for Deep Neural Networks on Modern GPUs. Hao Fu, Shanjiang Tang, Bingsheng He, Ce Yu, Jizhou Sun |
| 2018 | H2Cloud: Maintaining the Whole Filesystem in an Object Storage Cloud. Minghao Zhao, Zhenhua Li, Ennan Zhai, Gareth Tyson, Chen Qian, Zhenyu Li, Leiyu Zhao |
| 2018 | HUS-Graph: I/O-Efficient Out-of-Core Graph Processing with Hybrid Update Strategy. Xianghao Xu, Fang Wang, Hong Jiang, Yongli Cheng, Dan Feng, Yongxuan Zhang |
| 2018 | Heterogeneous Wireless Charger Placement with Obstacles. Xiaoyu Wang, Haipeng Dai, Weijun Wang, Jiaqi Zheng, Guihai Chen, Wanchun Dou, Xiaobing Wu |
| 2018 | IS-ASGD: Accelerating Asynchronous SGD using Importance Sampling. Fei Wang, Xiaofeng Gao, Jun Ye, Guihai Chen |
| 2018 | ImageNet Training in Minutes. Yang You, Zhao Zhang, Cho-Jui Hsieh, James Demmel, Kurt Keutzer |
| 2018 | Implementing Push-Pull Efficiently in GraphBLAS. Carl Yang, Aydin Buluç, John D. Owens |
| 2018 | Improving First Level Cache Efficiency for GPUs Using Dynamic Line Protection. Xian Zhu, Robert Wernsman, Joseph Zambreno |
| 2018 | Improving MPI Multi-threaded RMA Communication Performance. Nathan T. Hjelm, Matthew G. F. Dosanjh, Ryan E. Grant, Taylor L. Groves, Patrick G. Bridges, Dorian C. Arnold |
| 2018 | Improving Resource Utilization through Demand Aware Process Scheduling. Brandon Nesterenko, Qing Yi, Jia Rao |
| 2018 | Index Shard Replication Strategies for Improving Resource Utilization in Large Scale Search Engines. Yusen Li, Xueyan Tang, Wentong Cai, Jiancong Tong, Xiaoguang Liu, Gang Wang, Chuansong Gao, Xuan Cao, Guanhui Geng, Minghui Li |
| 2018 | Integrating Low-latency Analysis into HPC System Monitoring. Ramin Izadpanah, Nichamon Naksinehaboon, Jim M. Brandt, Ann C. Gentile, Damian Dechev |
| 2018 | Interference between I/O and MPI Traffic on Fat-tree Networks. Kevin A. Brown, Nikhil Jain, Satoshi Matsuoka, Martin Schulz, Abhinav Bhatele |
| 2018 | Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Clouds. Donglin Yang, Wei Rang, Dazhao Cheng |
| 2018 | KeyBin2: Distributed Clustering for Scalable and In-Situ Analysis. Xinyu Chen, Jeremy Benson, Matt Peterson, Michela Taufer, Trilce Estrada |
| 2018 | Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU Cluster. Haitao Zhang, Bingchang Tang, Xin Geng, Huadong Ma |
| 2018 | Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees. Binlei Cai, Rongqi Zhang, Laiping Zhao, Keqiu Li |
| 2018 | Leverage Redundancy in Hardware Transactional Memory to Improve Cache Reliability. Zhichao Yan, Hong Jiang, Witawas Srisa-an, Sharad C. Seth, Yujuan Tan |
| 2018 | Load-Balanced Slim Fly Networks. Md. Shafayat Rahman, Md Atiqul Mollah, Peyman Faizian, Xin Yuan |
| 2018 | MND-MST: A Multi-Node Multi-Device Parallel Boruvka's MST Algorithm. Rintu Panja, Sathish Vadhiyar |
| 2018 | MPI-Vector-IO: Parallel I/O and Partitioning for Geospatial Vector Data. Satish Puri, Anmol Paudel, Sushil K. Prasad |
| 2018 | Massively Parallel Huffman Decoding on GPUs. André Weißenberger, Bertil Schmidt |
| 2018 | Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer. Shigang Li, Baodong Wu, Yunquan Zhang, Xianmeng Wang, Jianjiang Li, Changjun Hu, Jue Wang, Yangde Feng, Ningming Nie |
| 2018 | Matrix Factorization on GPUs with Memory Optimization and Approximate Computing. Wei Tan, Shiyu Chang, Liana Fong, Cheng Li, Zijun Wang, Liangliang Cao |
| 2018 | Memory Coalescing for Hybrid Memory Cube. Xi Wang, John D. Leidel, Yong Chen |
| 2018 | Modeling Application Resilience in Large-scale Parallel Execution. Kai Wu, Wenqian Dong, Qiang Guan, Nathan DeBardeleben, Dong Li |
| 2018 | NFV Middlebox Placement with Balanced Set-up Cost and Bandwidth Consumption. Yang Chen, Jie Wu |
| 2018 | Nemo: NUMA-aware Concurrency Control for Scalable Transactional Memory. Mohamed Mohamedin, Sebastiano Peluso, Masoomeh Javidi Kishi, Ahmed Hassan, Roberto Palmieri |
| 2018 | NumLock: Towards Optimal Multi-Granularity Locking in Hierarchies. Saurabh Kalikar, Rupesh Nasre |
| 2018 | NumaMMA: NUMA MeMory Analyzer. François Trahay, Manuel Selva, Lionel Morel, Kevin Marquet |
| 2018 | Optimization of the Spherical Harmonics Transform based Tree Traversals in the Helmholtz FMM Algorithm. Michael P. Lingg, Stephen M. Hughey, Hasan Metin Aktulga |
| 2018 | Optimizing for KNL Usage Modes When Data Doesn't Fit in MCDRAM. Neil Butcher, Stephen L. Olivier, Jonathan W. Berry, Simon D. Hammond, Peter M. Kogge |
| 2018 | PBCS: An Efficient Parallel Characteristic Set Method for Solving Boolean Polynomial Systems. Juan Zhao, Junqiang Song, Min Zhu, Jincai Li, Zhenyu Huang, Xiaoyong Li, Xiaoli Ren |
| 2018 | PRIONN: Predicting Runtime and IO using Neural Networks. Michael R. Wyatt II, Stephen Herbein, Todd Gamblin, Adam Moody, Dong H. Ahn, Michela Taufer |
| 2018 | ParaPLL: Fast Parallel Shortest-path Distance Query on Large-scale Weighted Graphs. Kun Qiu, Yuanyang Zhu, Jing Yuan, Jin Zhao, Xin Wang, Tilman Wolf |
| 2018 | Parallelizing Pruning-based Graph Structural Clustering. Yulin Che, Shixuan Sun, Qiong Luo |
| 2018 | Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization. Oguz Kaya, Ramakrishnan Kannan, Grey Ballard |
| 2018 | Performance & Energy Tradeoffs for Dependent Distributed Applications Under System-wide Power Caps. Huazhe Zhang, Henry Hoffmann |
| 2018 | Power Efficient High Performance Packet I/O. Xuesong Li, Wenxue Cheng, Tong Zhang, Jing Xie, Fengyuan Ren, Bailong Yang |
| 2018 | Proceedings of the 47th International Conference on Parallel Processing, ICPP 2018, Eugene, OR, USA, August 13-16, 2018 |
| 2018 | Reducing Communication in Proximal Newton Methods for Sparse Least Squares Problems. Saeed Soori, Aditya Devarakonda, Zachary Blanco, James Demmel, Mert Gürbüzbalaban, Maryam Mehri Dehnavi |
| 2018 | Reference-distance Eviction and Prefetching for Cache Management in Spark. Tiago B. G. Perez, Xiaobo Zhou, Dazhao Cheng |
| 2018 | Revisiting Multi-pass Scatter and Gather on GPUs. Zhuohang Lai, Qiong Luo, Xiaoying Jia |
| 2018 | SPECTR: Scalable Parallel Short Read Error Correction on Multi-core and Many-core Architectures. Kai Xu, Robin Kobus, Yuandong Chan, Ping Gao, Xiangxu Meng, Yanjie Wei, Bertil Schmidt, Weiguo Liu |
| 2018 | Scalable Behavioral Emulation of Extreme-Scale Systems Using Structural Simulation Toolkit. Ajay Ramaswamy, Nalini Kumar, Aravind Neelakantan, Herman Lam, Greg Stitt |
| 2018 | Scalable Solutions for Automated Single Pulse Identification and Classification in Radio Astronomy. Thomas R. Devine, Katerina Goseva-Popstojanova, Di Pang |
| 2018 | Task-parallel Analysis of Molecular Dynamics Trajectories. Ioannis Paraskevakos, André Luckow, Mahzad Khoshlessan, George Chantzialexiou, Thomas E. Cheatham, Oliver Beckstein, Geoffrey C. Fox, Shantenu Jha |
| 2018 | The Case for Semi-Permanent Cache Occupancy: Understanding the Impact of Data Locality on Network Processing. Matthew G. F. Dosanjh, S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Whit Schonbein, Michael J. Levenhagen, Patrick G. Bridges, Ahmad Afsahi |
| 2018 | Topology-induced Enhancement of Mappings. Roland Glantz, Maria Predari, Henning Meyerhenke |
| 2018 | Toward Performant and Energy-efficient Queries in Three-tier Wireless Sensor Networks. Jiayao Wang, Abdullah Al-Mamun, Tonglin Li, Linhua Jiang, Dongfang Zhao |
| 2018 | UHCL-Darknet: An OpenCL-based Deep Neural Network Framework for Heterogeneous Multi-/Many-core Clusters. Longlong Liao, Kenli Li, Keqin Li, Canqun Yang, Qi Tian |
| 2018 | Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics. Arya Mazaheri, Felix Wolf, Ali Jannesari |
| 2018 | Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs. Lionel Eyraud-Dubois, Thomas Lambert |
| 2018 | Varbench: an Experimental Framework to Measure and Characterize Performance Variability. Brian Kocoloski, John R. Lange |
| 2018 | Vectorised Computation of Diverging Ensembles. Jan Hückelheim, Paul D. Hovland, Sri Hari Krishna Narayanan, Paulius Velesko |
| 2018 | Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512. Hong Zhang, Richard Tran Mills, Karl Rupp, Barry F. Smith |
| 2018 | ran-GJS: Orchestrating Data Analytics for Heterogeneous Geo-distributed Edges. Yibo Jin, Zhuzhong Qian, Song Guo, Sheng Zhang, Xiaoliang Wang, Sanglu Lu |