ICPP B

107 papers

YearTitle / Authors
2019A 2D Parallel Triangle Counting Algorithm for Distributed-Memory Architectures.
Ancy Sarah Tom, George Karypis
2019A Network-aware and Partition-based Resource Management Scheme for Data Stream Processing.
Yidan Wang, Zahir Tari, Xiaoran Huang, Albert Y. Zomaya
2019A Parallel Graph Algorithm for Detecting Mesh Singularities in Distributed Memory Ice Sheet Simulations.
Ian Bogle, Karen D. Devine, Mauro Perego, Sivasankaran Rajamanickam, George M. Slota
2019A Plugin Architecture for the TAU Performance System.
Allen D. Malony, Srinivasan Ramesh, Kevin A. Huck, Nicholas Chaimov, Sameer Shende
2019A Practical, Scalable, Relaxed Priority Queue.
Tingzhe Zhou, Maged M. Michael, Michael F. Spear
2019A Read-leveling Data Distribution Scheme for Promoting Read Performance in SSDs with Deduplication.
Mengting Lu, Fang Wang, Dan Feng, Yuchong Hu
2019A Specialized Concurrent Queue for Scheduling Irregular Workloads on GPUs.
David Troendle, Tuan Ta, Byunghyun Jang
2019A Tale of Two (Flow) Tables: Demystifying Rule Caching in OpenFlow Switches.
Rui Li, Yu Pang, Jin Zhao, Xin Wang
2019A Unified Optimization Approach for CNN Model Inference on Integrated GPUs.
Leyuan Wang, Zhi Chen, Yizhi Liu, Yao Wang, Lianmin Zheng, Mu Li, Yida Wang
2019AVR: Reducing Memory Traffic with Approximate Value Reconstruction.
Albin Eldstål-Damlin, Pedro Trancoso, Ioannis Sourdis
2019Accelerated Work Stealing.
D. Brian Larkins, John Snyder, James Dinan
2019Accelerating All-Edge Common Neighbor Counting on Three Processors.
Yulin Che, Zhuohang Lai, Shixuan Sun, Qiong Luo, Yue Wang
2019Accelerating Long Read Alignment on Three Processors.
Zonghao Feng, Shuang Qiu, Lipeng Wang, Qiong Luo
2019AdaM: An Adaptive Fine-Grained Scheme for Distributed Metadata Management.
Shiyi Cao, Yuanning Gao, Xiaofeng Gao, Guihai Chen
2019Adaptive Learning for Concept Drift in Application Performance Modeling.
Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert Latham, Glenn K. Lockwood, Robert B. Ross, Shane Snyder, Stefan M. Wild
2019Adaptive Routing Reconfigurations to Minimize Flow Cost in SDN-Based Data Center Networks.
Akbar Majidi, Xiaofeng Gao, Shunjia Zhu, Nazila Jahanbakhsh, Guihai Chen
2019An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform.
Deguang Wang, Junzhong Shen, Mei Wen, Chunyuan Zhang
2019Approximate Code: A Cost-Effective Erasure Coding Framework for Tiered Video Storage in Cloud Systems.
Huayi Jin, Chentao Wu, Xin Xie, Jie Li, Minyi Guo, Hao Lin, Jianfeng Zhang
2019Artemis: A Practical Low-latency Naming and Routing System.
Xuebing Li, Bingyang Liu, Yang Chen, Yu Xiao, Jiaxin Tang, Xin Wang
2019Automatic Differentiation for Adjoint Stencil Loops.
Jan Hückelheim, Navjot Kukreja, Sri Hari Krishna Narayanan, Fabio Luporini, Gerard Gorman, Paul D. Hovland
2019BCL: A Cross-Platform Distributed Data Structures Library.
Benjamin Brock, Aydin Buluç, Katherine A. Yelick
2019BPP: A Realtime Block Access Pattern Mining Scheme for I/O Prediction.
Chunjie Zhu, Fang Wang, Binbing Hou
2019Breaking Band: A Breakdown of High-performance Communication.
Rohit Zambre, Megan Grodowitz, Aparna Chandramowlishwaran, Pavel Shamis
2019Building Scalable NVM-based B+tree with HTM.
Mengxing Liu, Jiankai Xing, Kang Chen, Yongwei Wu
2019COMBFT: Conflicting-Order-Match based Byzantine Fault Tolerance Protocol with High Efficiency and Robustness.
Yingyao Rong, Weigang Wu, Zhiguang Chen
2019CPpf: a prefetch aware LLC partitioning approach.
Jun Xiao, Andy D. Pimentel, Xu Liu
2019Cartesian Collective Communication.
Jesper Larsson Träff, Sascha Hunold
2019Compiler-Assisted GPU Thread Throttling for Reduced Cache Contention.
Hyunjun Kim, Sungin Hong, Hyeonsu Lee, Euiseong Seo, Hwansoo Han
2019Controlled Asynchronous GVT: Accelerating Parallel Discrete Event Simulation on Many-Core Clusters.
Ali Eker, Barry Williams, Kenneth Chiu, Dmitry Ponomarev
2019Cooperative Job Scheduling and Data Allocation for Busy Data-Intensive Parallel Computing Clusters.
Guoxin Liu, Haiying Shen, Haoyu Wang
2019Cosin: Controllable Social Influence Maximization and Its Distributed Implementation in Large-scale Social Networks.
Jingya Zhou, Jianxi Fan, Jin Wang
2019CostPI: Cost-Effective Performance Isolation for Shared NVMe SSDs.
Jiahao Liu, Fang Wang, Dan Feng
2019Cynthia: Cost-Efficient Cloud Resource Provisioning for Predictable Distributed Deep Neural Network Training.
Haoyue Zheng, Fei Xu, Li Chen, Zhi Zhou, Fangming Liu
2019DICER: Diligent Cache Partitioning for Efficient Workload Consolidation.
Konstantinos Nikas, Nikela Papadopoulou, Dimitra Giantsidi, Vasileios Karakostas, Georgios I. Goumas, Nectarios Koziris
2019DLBooster: Boosting End-to-End Deep Learning Workflows with Offloading Data Preprocessing Pipelines.
Yang Cheng, Dan Li, Zhiyuan Guo, Binyao Jiang, Jiaxin Lin, Xi Fan, Jinkun Geng, Xinyi Yu, Wei Bai, Lei Qu, Ran Shu, Peng Cheng, Yongqiang Xiong, Jianping Wu
2019Data and Thread Placement in NUMA Architectures: A Statistical Learning Approach.
Nicolas Denoyelle, Brice Goglin, Emmanuel Jeannot, Thomas Ropars
2019DeepHash: An End-to-End Learning Approach for Metadata Management in Distributed File Systems.
Yuanning Gao, Xiaofeng Gao, Guihai Chen
2019Design Exploration of Multi-tier Interconnection Networks for Exascale Systems.
Javier Navaridas, Joshua Lant, Jose Antonio Pascual, Mikel Luján, John Goodacre
2019Distributed Join Algorithms on Multi-CPU Clusters with GPUDirect RDMA.
Chengxin Guo, Hong Chen, Feng Zhang, Cuiping Li
2019Dynamic Load Balancing in Hybrid Switching Data Center Networks with Converters.
Jiaqi Zheng, Qiming Zheng, Xiaofeng Gao, Guihai Chen
2019ECoST: Energy-Efficient Co-Locating and Self-Tuning MapReduce Applications.
Maria Malik, Hassan Ghasemzadeh, Tinoosh Mohsenin, Rosario Cammarota, Liang Zhao, Avesta Sasan, Houman Homayoun, Setareh Rafatirad
2019EMBA: Efficient Memory Bandwidth Allocation to Improve Performance on Intel Commodity Processor.
Yaocheng Xiang, Chencheng Ye, Xiaolin Wang, Yingwei Luo, Zhenlin Wang
2019Efficient Data-Parallel Primitives on Heterogeneous Systems.
Zhuohang Lai, Qiong Luo, Xiaolong Xie
2019Exploiting Vector Processing in Dynamic Binary Translation.
Chih-Min Lin, Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu
2019Express Link Placement for NoC-Based Many-Core Platforms.
Yunfan Li, Di Zhu, Lizhong Chen
2019Fast Recovery Techniques for Erasure-coded Clusters in Non-uniform Traffic Network.
Yunren Bai, Zihan Xu, Haixia Wang, Dongsheng Wang
2019Faster parallel collision detection at high resolution for CNC milling applications.
Xin Chen, Dmytro Konobrytskyi, Thomas M. Tucker, Thomas R. Kurfess, Richard W. Vuduc
2019FlowCon: Elastic Flow Configuration for Containerized Deep Learning Applications.
Wenjia Zheng, Michael Tynes, Henry Gorelick, Ying Mao, Long Cheng, Yantian Hou
2019FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation.
Tao Wang, Nikhil Jain, David Beckingsale, David Böhme, Frank Mueller, Todd Gamblin
2019Gossip: Efficient Communication Primitives for Multi-GPU Systems.
Robin Kobus, Daniel Jünger, Christian Hundt, Bertil Schmidt
2019Gravitational Octree Code Performance Evaluation on Volta GPU.
Yohei Miki
2019HOPE: A Parallel Execution Model Based on Hierarchical Omission.
Masahiro Yasugi, Daisuke Muraoka, Tasuku Hiraishi, Seiji Umatani, Kento Emoto
2019HPAS: An HPC Performance Anomaly Suite for Reproducing Performance Variations.
Emre Ates, Yijia Zhang, Burak Aksar, Jim M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun
2019Holistic Slowdown Driven Scheduling and Resource Management for Malleable Jobs.
Marco D'Amico, Ana Jokanovic, Julita Corbalán
2019How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures.
Carlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer, Jesper Larsson Träff
2019HyperPRAW: Architecture-Aware Hypergraph Restreaming Partition to Improve Performance of Parallel Applications Running on High Performance Computing Systems.
Carlos Fernandez Musoles, Daniel Coca, Paul Richmond
2019I/O Characterization and Performance Evaluation of BeeGFS for Deep Learning.
Fahim Chowdhury, Yue Zhu, Todd Heer, Saul Paredes, Adam Moody, Robin Goldstone, Kathryn M. Mohror, Weikuan Yu
2019Improved Unconstrained Energy Functional Method for Eigensolvers in Electronic Structure Calculations.
Mauro Del Ben, Osni Marques, Andrew Canning
2019Improving Short Job Latency Performance in Hybrid Job Schedulers with Dice.
Wei Zhou, K. Preston White, Hongfeng Yu
2019Incorporating Probabilistic Optimizations for Resource Provisioning of Data Processing Workflows.
Amelie Chi Zhou, Yao Xiao, Bingsheng He, Shadi Ibrahim, Reynold Cheng
2019JobPacker: Job Scheduling for Data-Parallel Frameworks with Hybrid Electrical/Optical Datacenter Networks.
Zhuozhao Li, Haiying Shen
2019LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores.
Adrian Garcia-Garcia, Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matías
2019Lightweight Fault Tolerance in Pregel-Like Systems.
Da Yan, James Cheng, Hongzhi Chen, Cheng Long, Purushotham V. Bangalore
2019MAC: Memory Access Coalescer for 3D-Stacked Memory.
Xi Wang, Antonino Tumeo, John D. Leidel, Jie Li, Yong Chen
2019Machine Learning for Fine-Grained Hardware Prefetcher Control.
Jason Hiebel, Laura E. Brown, Zhenlin Wang
2019Massively Parallel ANS Decoding on GPUs.
André Weißenberger, Bertil Schmidt
2019Massively Parallel Automated Software Tuning.
Jakub Kurzak, Yaohung M. Tsai, Mark Gates, Ahmad Abdelfattah, Jack J. Dongarra
2019Modeling the Performance of Atomic Primitives on Modern Architectures.
Fazeleh Sadat Hoseini, Aras Atalar, Philippas Tsigas
2019Multi-Objective Reinforcement Learning for Reconfiguring Data Stream Analytics on Edge Computing.
Alexandre da Silva Veith, Felipe Rodrigo de Souza, Marcos Dias de Assunção, Laurent Lefèvre, Julio Cesar Santos dos Anjos
2019N-Code: An Optimal RAID-6 MDS Array Code for Load Balancing and High I/O Performance.
Ping Xie, Zhu Yuan, Jianzhong Huang, Xiao Qin
2019NFV-Enabled Multicasting in Mobile Edge Clouds with Resource Sharing.
Zichuan Xu, Yutong Zhang, Weifa Liang, Qiufen Xia, Omer F. Rana, Alex Galis, Guowei Wu, Pan Zhou
2019Near-Data Processing-Enabled and Time-Aware Compaction Optimization for LSM-tree-based Key-Value Stores.
Hui Sun, Wei Liu, Jianzhong Huang, Song Fu, Zhi Qiao, Weisong Shi
2019Nested Virtualization Without the Nest.
Mathieu Bacou, Grégoire Todeschi, Alain Tchana, Daniel Hagimont
2019Network Congestion Avoidance through Packet-chaining Reservation.
Ke Wu, Dezun Dong, Cunlu Li, Shan Huang, Yi Dai
2019Network Congestion-aware Online Service Function Chain Placement and Load Balancing.
Xiaojun Shang, Zhenhua Liu, Yuanyuan Yang
2019OSP: Overlapping Computation and Communication in Parameter Server for Fast Machine Learning.
Haozhao Wang, Song Guo, Ruixuan Li
2019On Integration of Appends and Merges in Log-Structured Merge Trees.
Caixin Gong, Shuibing He, Yili Gong, Yingchun Lei
2019On Max-min Fair Resource Allocation for Distributed Job Execution.
Yitong Guan, Chuanyou Li, Xueyan Tang
2019Optimized Execution of Parallel Loops via User-Defined Scheduling Policies.
Seonmyeong Bak, Yanfei Guo, Pavan Balaji, Vivek Sarkar
2019Parallel Algorithms for Evaluating Matrix Polynomials.
Sivan Toledo, Amit Waisel
2019Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels.
Suraj Kumar, Lionel Eyraud-Dubois, Sriram Krishnamoorthy
2019Performance, Energy, and Scalability Analysis and Improvement of Parallel Cancer Deep Learning CANDLE Benchmarks.
Xingfu Wu, Valerie E. Taylor, Justin M. Wozniak, Rick Stevens, Thomas S. Brettin, Fangfang Xia
2019PhSIH: A Lightweight Parallelization of Event Matching in Content-based Pub/Sub Systems.
Zhengyu Liao, Shiyou Qian, Jian Cao, Yanhua Cao, Guangtao Xue, Jiadi Yu, Yanmin Zhu, Minglu Li
2019Predictable GPUs Frequency Scaling for Energy and Performance.
Kaijie Fan, Biagio Cosenza, Ben H. H. Juurlink
2019Proceedings of the 48th International Conference on Parallel Processing, ICPP 2019, Kyoto, Japan, August 05-08, 2019
2019QLEC: A Machine-Learning-Based Energy-Efficient Clustering Algorithm to Prolong Network Lifespan for IoT in High-Dimensional Space.
Ke Li, Haowei Huang, Xiaofeng Gao, Fan Wu, Guihai Chen
2019RFPL: A Recovery Friendly Parity Logging Scheme for Reducing Small Write Penalty of SSD RAID.
Gaoxiang Xu, Dan Feng, Zhipeng Tan, Xinyan Zhang, Jie Xu, Xi Shu, Yifeng Zhu
2019Reducing Kernel Surface Areas for Isolation and Scalability.
Daniel Zahka, Brian Kocoloski, Kate Keahey
2019Refactoring and Optimizing WRF Model on Sunway TaihuLight.
Kai Xu, Zhenya Song, Yuandong Chan, Shida Wang, Xiangxu Meng, Weiguo Liu, Wei Xue
2019Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime Systems.
Bibek Wagle, Mohammad Alaul Haque Monil, Kevin A. Huck, Allen D. Malony, Adrian Serio, Hartmut Kaiser
2019SAFE: Service Availability via Failure Elimination Through VNF Scaling.
Rui Xia, Haipeng Dai, Jiaqi Zheng, Rong Gu, Xiaoyu Wang, Guihai Chen
2019SaC: Exploiting Execution-Time Slack to Save Energy in Heterogeneous Multicore Systems.
Muhammad Waqar Azhar, Miquel Pericàs, Per Stenström
2019Solving All-Pairs Shortest-Paths Problem in Large Graphs Using Apache Spark.
Frank Schoeneman, Jaroslaw Zola
2019Spatially-aware Parallel I/O for Particle Data.
Sidharth Kumar, Steve Petruzza, Will Usher, Valerio Pascucci
2019Speculative Scheduling for Stochastic HPC Applications.
Ana Gainaru, Guillaume Pallez, Hongyang Sun, Padma Raghavan
2019Stage Delay Scheduling: Speeding up DAG-style Data Analytics Jobs with Resource Interleaving.
Wujie Shao, Fei Xu, Li Chen, Haoyue Zheng, Fangming Liu
2019TEA: A Traffic-efficient Erasure-coded Archival Scheme for In-memory Stores.
Bin Xu, Jianzhong Huang, Qiang Cao, Xiao Qin
2019TLB: Traffic-aware Load Balancing with Adaptive Granularity in Data Center Networks.
Jinbin Hu, Jiawei Huang, Wenjun Lv, Weihe Li, Jianxin Wang, Tian He
2019Tessellating Star Stencils.
Liang Yuan, Shan Huang, Yunquan Zhang, Hang Cao
2019The Case for Water-Immersion Computer Boards.
Michihiro Koibuchi, Ikki Fujiwara, Naoya Niwa, Tomohiro Totoki, Shoichi Hirasawa
2019The Communication-Overlapped Hybrid Decomposition Parallel Algorithm for Multi-Scale Fluid Simulations.
Yi Liu, Xiaowei Guo, Chao Li, Canqun Yang, Xinbiao Gan, Peng Zhang, Yi Wang, Ran Zhao, Sijiang Fan
2019Transfer Learning based Failure Prediction for Minority Disks in Large Data Centers of Heterogeneous Disk Systems.
Ji Zhang, Ke Zhou, Ping Huang, Xubin He, Zhili Xiao, Bin Cheng, Yongguang Ji, Yinhu Wang
2019Unleashing the Scalability Potential of Power-Constrained Data Center in the Microservice Era.
Xiaofeng Hou, Jiacheng Liu, Chao Li, Minyi Guo
2019VScan: Efficiently Analyzing Surveillance Videos via Model-joint Mechanism.
Chen Zhang, Qiang Cao, Jie Yao, Yuanyuan Dong, Puyuan Yang
2019When Power Oversubscription Meets Traffic Flood Attack: Re-Thinking Data Center Peak Load Management.
Xiaofeng Hou, Mingyu Liang, Chao Li, Wenli Zheng, Quan Chen, Minyi Guo
2019diBELLA: Distributed Long Read to Long Read Alignment.
Marquita Ellis, Giulia Guidi, Aydin Buluç, Leonid Oliker, Katherine A. Yelick
2019swATOP: Automatically Optimizing Deep Learning Operators on SW26010 Many-Core Processor.
Wei Gao, Jiarui Fang, Wenlai Zhao, Jinzhe Yang, Long Wang, Lin Gan, Haohuan Fu, Guangwen Yang