ICPP B

92 papers

YearTitle / Authors
2018A Communication-Efficient Causal Broadcast Protocol.
João Paulo de Araujo, Luciana Arantes, Elias P. Duarte Jr., Luiz A. Rodrigues, Pierre Sens
2018A Comprehensive Study on Bugs in Actor Systems.
Brandon Hedden, Xinghui Zhao
2018A Distributed Infomap Algorithm for Scalable and High-Quality Community Detection.
Jianping Zeng, Hongfeng Yu
2018A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010.
Xinliang Wang, Ping Xu, Wei Xue, Yulong Ao, Chao Yang, Haohuan Fu, Lin Gan, Guangwen Yang, Weimin Zheng
2018A Framework for Auto-Parallelization and Code Generation: An Integrative Case Study with Legacy FORTRAN Codes.
Konstantinos Krommydas, Paul Sathre, Ruchira Sasanka, Wu-chun Feng
2018A Generic Approach to Scheduling and Checkpointing Workflows.
Li Han, Valentin Le Fèvre, Louis-Claude Canon, Yves Robert, Frédéric Vivien
2018A Multilevel Subtree Method for Single and Batched Sparse Cholesky Factorization.
Meng Tang, Mohamed Gadou, Steven C. Rennich, Timothy A. Davis, Sanjay Ranka
2018A Performance Model to Execute Workflows on High-Bandwidth-Memory Architectures.
Anne Benoit, Swann Perarnau, Loïc Pottier, Yves Robert
2018A Write-efficient and Consistent Hashing Scheme for Non-Volatile Memory.
Xiaoyi Zhang, Dan Feng, Yu Hua, Jianxi Chen, Mandi Fu
2018Accelerating FM-index Search for Genomic Data Processing.
Yuanrong Wang, Xueqi Li, Dawei Zang, Guangming Tan, Ninghui Sun
2018An Empirical Comparison of k-Shortest Simple Path Algorithms on Multicores.
Deepak Ajwani, Erika Duriakova, Neil Hurley, Ulrich Meyer, Alexander Schickedanz
2018Balanced k-means for Parallel Geometric Partitioning.
Moritz von Looz, Charilaos Tzovas, Henning Meyerhenke
2018Bandwidth Reduced Parallel SpMV on the SW26010 Many-Core Platform.
Qiao Sun, Changyou Zhang, Changmao Wu, Jiajia Zhang, Leisheng Li
2018C-Graph: A Highly Efficient Concurrent Graph Reachability Query Framework.
Li Zhou, Ren Chen, Yinglong Xia, Radu Teodorescu
2018CAMPS: Conflict-Aware Memory-Side Prefetching Scheme for Hybrid Memory Cube.
Muhammad M. Rafique, Zhichun Zhu
2018CSTF: Large-Scale Sparse Tensor Factorizations on Distributed Platforms.
Zachary Blanco, Bangtian Liu, Maryam Mehri Dehnavi
2018Cache Assisted Randomized Sharing Counters in Network Measurement.
Qian Liu, Haipeng Dai, Alex X. Liu, Qi Li, Xiaoyu Wang, Jiaqi Zheng
2018Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-Ievel Fault Injection.
Omer Subasi, Chun-Kai Chang, Mattan Erez, Sriram Krishnamoorthy
2018Charging Task Scheduling for Directional Wireless Charger Networks.
Haipeng Dai, Ke Sun, Alex X. Liu, Lijun Zhang, Jiaqi Zheng, Guihai Chen
2018Click-Based Asynchronous Mesh Network with Bounded Bundled Data.
Anping He, Guangbo Feng, Jilin Zhang, Pengfei Li, Yong Hei, Hong Chen
2018Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations.
Raphaël Prat, Laurent Colombet, Raymond Namyst
2018Communication-Avoiding for Dynamical Core of Atmospheric General Circulation Model.
Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, Guangming Tan
2018Constructing Dynamic Policies for Paging Mode Selection.
Jason Hiebel, Laura E. Brown, Zhenlin Wang
2018Cross-Rack-Aware Updates in Erasure-Coded Data Centers.
Zhirong Shen, Patrick P. C. Lee
2018DAG-SFC: Minimize the Embedding Cost of SFC with Parallel VNFs.
Xu Lin, Deke Guo, Yulong Shen, Guoming Tang, Bangbang Ren
2018Disk Failure Prediction in Data Centers via Online Learning.
Jiang Xiao, Zhuang Xiong, Song Wu, Yusheng Yi, Hai Jin, Kan Hu
2018Dual-Paradigm Stream Processing.
Song Wu, Zhiyi Liu, Shadi Ibrahim, Lin Gu, Hai Jin, Fei Chen
2018Duchy: Achieving Both SSD Durability and Controllable SMR Cleaning Overhead in Hybrid Storage Systems.
Xuchao Xie, Tianye Yang, Qiong Li, Dengping Wei, Liquan Xiao
2018Efficient Runtime Support for a Partitioned Global Logical Address Space.
D. Brian Larkins, John Snyder, James Dinan
2018Efficient SSD Caching by Avoiding Unnecessary Writes using Machine Learning.
Hua Wang, Xinbo Yi, Ping Huang, Bin Cheng, Ke Zhou
2018Efficient Search for Free Blocks in the WAFL File System.
Ram Kesavan, Matthew Curtis-Maury, Mrinal K. Bhattacharjee
2018Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous Clusters.
Amelie Chi Zhou, Tien-Dat Phan, Shadi Ibrahim, Bingsheng He
2018Energy-efficient Application Resource Scheduling using Machine Learning Classifiers.
Connor Imes, Steven A. Hofmeyr, Henry Hoffmann
2018FFS-VA: A Fast Filtering System for Large-scale Video Analytics.
Chen Zhang, Qiang Cao, Hong Jiang, Wenhui Zhang, Jingjun Li, Jie Yao
2018FULT: Fast User-Level Thread Scheduling Using Bit-Vectors.
Hoang-Vu Dang, Marc Snir
2018GLP4NN: A Convergence-invariant and Network-agnostic Light-weight Parallelization Framework for Deep Neural Networks on Modern GPUs.
Hao Fu, Shanjiang Tang, Bingsheng He, Ce Yu, Jizhou Sun
2018H2Cloud: Maintaining the Whole Filesystem in an Object Storage Cloud.
Minghao Zhao, Zhenhua Li, Ennan Zhai, Gareth Tyson, Chen Qian, Zhenyu Li, Leiyu Zhao
2018HUS-Graph: I/O-Efficient Out-of-Core Graph Processing with Hybrid Update Strategy.
Xianghao Xu, Fang Wang, Hong Jiang, Yongli Cheng, Dan Feng, Yongxuan Zhang
2018Heterogeneous Wireless Charger Placement with Obstacles.
Xiaoyu Wang, Haipeng Dai, Weijun Wang, Jiaqi Zheng, Guihai Chen, Wanchun Dou, Xiaobing Wu
2018IS-ASGD: Accelerating Asynchronous SGD using Importance Sampling.
Fei Wang, Xiaofeng Gao, Jun Ye, Guihai Chen
2018ImageNet Training in Minutes.
Yang You, Zhao Zhang, Cho-Jui Hsieh, James Demmel, Kurt Keutzer
2018Implementing Push-Pull Efficiently in GraphBLAS.
Carl Yang, Aydin Buluç, John D. Owens
2018Improving First Level Cache Efficiency for GPUs Using Dynamic Line Protection.
Xian Zhu, Robert Wernsman, Joseph Zambreno
2018Improving MPI Multi-threaded RMA Communication Performance.
Nathan T. Hjelm, Matthew G. F. Dosanjh, Ryan E. Grant, Taylor L. Groves, Patrick G. Bridges, Dorian C. Arnold
2018Improving Resource Utilization through Demand Aware Process Scheduling.
Brandon Nesterenko, Qing Yi, Jia Rao
2018Index Shard Replication Strategies for Improving Resource Utilization in Large Scale Search Engines.
Yusen Li, Xueyan Tang, Wentong Cai, Jiancong Tong, Xiaoguang Liu, Gang Wang, Chuansong Gao, Xuan Cao, Guanhui Geng, Minghui Li
2018Integrating Low-latency Analysis into HPC System Monitoring.
Ramin Izadpanah, Nichamon Naksinehaboon, Jim M. Brandt, Ann C. Gentile, Damian Dechev
2018Interference between I/O and MPI Traffic on Fat-tree Networks.
Kevin A. Brown, Nikhil Jain, Satoshi Matsuoka, Martin Schulz, Abhinav Bhatele
2018Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Clouds.
Donglin Yang, Wei Rang, Dazhao Cheng
2018KeyBin2: Distributed Clustering for Scalable and In-Situ Analysis.
Xinyu Chen, Jeremy Benson, Matt Peterson, Michela Taufer, Trilce Estrada
2018Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU Cluster.
Haitao Zhang, Bingchang Tang, Xin Geng, Huadong Ma
2018Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees.
Binlei Cai, Rongqi Zhang, Laiping Zhao, Keqiu Li
2018Leverage Redundancy in Hardware Transactional Memory to Improve Cache Reliability.
Zhichao Yan, Hong Jiang, Witawas Srisa-an, Sharad C. Seth, Yujuan Tan
2018Load-Balanced Slim Fly Networks.
Md. Shafayat Rahman, Md Atiqul Mollah, Peyman Faizian, Xin Yuan
2018MND-MST: A Multi-Node Multi-Device Parallel Boruvka's MST Algorithm.
Rintu Panja, Sathish Vadhiyar
2018MPI-Vector-IO: Parallel I/O and Partitioning for Geospatial Vector Data.
Satish Puri, Anmol Paudel, Sushil K. Prasad
2018Massively Parallel Huffman Decoding on GPUs.
André Weißenberger, Bertil Schmidt
2018Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer.
Shigang Li, Baodong Wu, Yunquan Zhang, Xianmeng Wang, Jianjiang Li, Changjun Hu, Jue Wang, Yangde Feng, Ningming Nie
2018Matrix Factorization on GPUs with Memory Optimization and Approximate Computing.
Wei Tan, Shiyu Chang, Liana Fong, Cheng Li, Zijun Wang, Liangliang Cao
2018Memory Coalescing for Hybrid Memory Cube.
Xi Wang, John D. Leidel, Yong Chen
2018Modeling Application Resilience in Large-scale Parallel Execution.
Kai Wu, Wenqian Dong, Qiang Guan, Nathan DeBardeleben, Dong Li
2018NFV Middlebox Placement with Balanced Set-up Cost and Bandwidth Consumption.
Yang Chen, Jie Wu
2018Nemo: NUMA-aware Concurrency Control for Scalable Transactional Memory.
Mohamed Mohamedin, Sebastiano Peluso, Masoomeh Javidi Kishi, Ahmed Hassan, Roberto Palmieri
2018NumLock: Towards Optimal Multi-Granularity Locking in Hierarchies.
Saurabh Kalikar, Rupesh Nasre
2018NumaMMA: NUMA MeMory Analyzer.
François Trahay, Manuel Selva, Lionel Morel, Kevin Marquet
2018Optimization of the Spherical Harmonics Transform based Tree Traversals in the Helmholtz FMM Algorithm.
Michael P. Lingg, Stephen M. Hughey, Hasan Metin Aktulga
2018Optimizing for KNL Usage Modes When Data Doesn't Fit in MCDRAM.
Neil Butcher, Stephen L. Olivier, Jonathan W. Berry, Simon D. Hammond, Peter M. Kogge
2018PBCS: An Efficient Parallel Characteristic Set Method for Solving Boolean Polynomial Systems.
Juan Zhao, Junqiang Song, Min Zhu, Jincai Li, Zhenyu Huang, Xiaoyong Li, Xiaoli Ren
2018PRIONN: Predicting Runtime and IO using Neural Networks.
Michael R. Wyatt II, Stephen Herbein, Todd Gamblin, Adam Moody, Dong H. Ahn, Michela Taufer
2018ParaPLL: Fast Parallel Shortest-path Distance Query on Large-scale Weighted Graphs.
Kun Qiu, Yuanyang Zhu, Jing Yuan, Jin Zhao, Xin Wang, Tilman Wolf
2018Parallelizing Pruning-based Graph Structural Clustering.
Yulin Che, Shixuan Sun, Qiong Luo
2018Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization.
Oguz Kaya, Ramakrishnan Kannan, Grey Ballard
2018Performance & Energy Tradeoffs for Dependent Distributed Applications Under System-wide Power Caps.
Huazhe Zhang, Henry Hoffmann
2018Power Efficient High Performance Packet I/O.
Xuesong Li, Wenxue Cheng, Tong Zhang, Jing Xie, Fengyuan Ren, Bailong Yang
2018Proceedings of the 47th International Conference on Parallel Processing, ICPP 2018, Eugene, OR, USA, August 13-16, 2018
2018Reducing Communication in Proximal Newton Methods for Sparse Least Squares Problems.
Saeed Soori, Aditya Devarakonda, Zachary Blanco, James Demmel, Mert Gürbüzbalaban, Maryam Mehri Dehnavi
2018Reference-distance Eviction and Prefetching for Cache Management in Spark.
Tiago B. G. Perez, Xiaobo Zhou, Dazhao Cheng
2018Revisiting Multi-pass Scatter and Gather on GPUs.
Zhuohang Lai, Qiong Luo, Xiaoying Jia
2018SPECTR: Scalable Parallel Short Read Error Correction on Multi-core and Many-core Architectures.
Kai Xu, Robin Kobus, Yuandong Chan, Ping Gao, Xiangxu Meng, Yanjie Wei, Bertil Schmidt, Weiguo Liu
2018Scalable Behavioral Emulation of Extreme-Scale Systems Using Structural Simulation Toolkit.
Ajay Ramaswamy, Nalini Kumar, Aravind Neelakantan, Herman Lam, Greg Stitt
2018Scalable Solutions for Automated Single Pulse Identification and Classification in Radio Astronomy.
Thomas R. Devine, Katerina Goseva-Popstojanova, Di Pang
2018Task-parallel Analysis of Molecular Dynamics Trajectories.
Ioannis Paraskevakos, André Luckow, Mahzad Khoshlessan, George Chantzialexiou, Thomas E. Cheatham, Oliver Beckstein, Geoffrey C. Fox, Shantenu Jha
2018The Case for Semi-Permanent Cache Occupancy: Understanding the Impact of Data Locality on Network Processing.
Matthew G. F. Dosanjh, S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Whit Schonbein, Michael J. Levenhagen, Patrick G. Bridges, Ahmad Afsahi
2018Topology-induced Enhancement of Mappings.
Roland Glantz, Maria Predari, Henning Meyerhenke
2018Toward Performant and Energy-efficient Queries in Three-tier Wireless Sensor Networks.
Jiayao Wang, Abdullah Al-Mamun, Tonglin Li, Linhua Jiang, Dongfang Zhao
2018UHCL-Darknet: An OpenCL-based Deep Neural Network Framework for Heterogeneous Multi-/Many-core Clusters.
Longlong Liao, Kenli Li, Keqin Li, Canqun Yang, Qi Tian
2018Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics.
Arya Mazaheri, Felix Wolf, Ali Jannesari
2018Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs.
Lionel Eyraud-Dubois, Thomas Lambert
2018Varbench: an Experimental Framework to Measure and Characterize Performance Variability.
Brian Kocoloski, John R. Lange
2018Vectorised Computation of Diverging Ensembles.
Jan Hückelheim, Paul D. Hovland, Sri Hari Krishna Narayanan, Paulius Velesko
2018Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512.
Hong Zhang, Richard Tran Mills, Karl Rupp, Barry F. Smith
2018ran-GJS: Orchestrating Data Analytics for Heterogeneous Geo-distributed Edges.
Yibo Jin, Zhuzhong Qian, Song Guo, Sheng Zhang, Xiaoliang Wang, Sanglu Lu