| 2019 | 2019 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2019, Rio de Janeiro, Brazil, May 20-24, 2019 |
| 2019 | A Bin-Based Bitstream Partitioning Approach for Parallel CABAC Decoding in Next Generation Video Coding. Philipp Habermann, Chi Ching Chi, Mauricio Alvarez-Mesa, Ben H. H. Juurlink |
| 2019 | A Deep Recurrent Neural Network Based Predictive Control Framework for Reliable Distributed Stream Data Processing. Jielong Xu, Jian Tang, Zhiyuan Xu, Chengxiang Yin, Kevin A. Kwiat, Charles A. Kamhoua |
| 2019 | A High-Performance Distributed Relational Database System for Scalable OLAP Processing. Jason Arnold, Boris Glavic, Ioan Raicu |
| 2019 | A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning. Tal Ben-Nun, Maciej Besta, Simon Huber, Alexandros Nikolaos Ziogas, Daniel Peter, Torsten Hoefler |
| 2019 | A Scalable Clustering-Based Task Scheduler for Homogeneous Processors Using DAG Partitioning. M. Yusuf Özkaya, Anne Benoit, Bora Uçar, Julien Herrmann, Ümit V. Çatalyürek |
| 2019 | Accelerating Sequence Alignment to Graphs. Chirag Jain, Sanchit Misra, Haowen Zhang, Alexander T. Dilthey, Srinivas Aluru |
| 2019 | Accurate, Efficient and Scalable Graph Embedding. Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna |
| 2019 | Adapting Batch Scheduling to Workload Characteristics: What Can We Expect From Online Learning? Arnaud Legrand, Denis Trystram, Salah Zrigui |
| 2019 | Aladdin: Optimized Maximum Flow Management for Shared Production Clusters. Heng Wu, Wenbo Zhang, Yuanjia Xu, Hao Xiang, Tao Huang, Haiyang Ding, Zheng Zhang |
| 2019 | Always be Two Steps Ahead of Your Enemy. Thorsten Götte, Vipin Ravindran Vijayalakshmi, Christian Scheideler |
| 2019 | An Approach for Parallel Loading and Pre-Processing of Unstructured Meshes Stored in Spatially Scattered Fashion. Ondrej Meca, Lubomír Ríha, Tomás Brzobohatý |
| 2019 | An Architecture and Stochastic Method for Database Container Placement in the Edge-Fog-Cloud Continuum. Petar Kochovski, Rizos Sakellariou, Marko Bajec, Pavel D. Drobintsev, Vlado Stankovski |
| 2019 | An Efficient Collaborative Communication Mechanism for MPI Neighborhood Collectives. S. Mahdieh Ghazimirsaeed, Seyed Hessam Mirsadeghi, Ahmad Afsahi |
| 2019 | An Error-Reflective Consistency Model for Distributed Data Stores. Philip Dexter, Kenneth Chiu, Bedri Sendir |
| 2019 | Architecting Racetrack Memory Preshift through Pattern-Based Prediction Mechanisms. Adrian Colaso, Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente |
| 2019 | Asynchronous Multigrid Methods. Jordi Wolfson-Pou, Edmond Chow |
| 2019 | BigSpa: An Efficient Interprocedural Static Analysis Engine in the Cloud. Zhiqiang Zuo, Rong Gu, Xi Jiang, Zhaokang Wang, Yihua Huang, Linzhang Wang, Xuandong Li |
| 2019 | C-GDR: High-Performance Container-Aware GPUDirect MPI Communication Schemes on RDMA Networks. Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, Dhabaleswar K. Panda |
| 2019 | Coding the Continuum. Ian T. Foster |
| 2019 | Combining Prefetch Control and Cache Partitioning to Improve Multicore Performance. Gongjin Sun, Junjie Shen, Alexander V. Veidenbaum |
| 2019 | Communication-Avoiding Cholesky-QR2 for Rectangular Matrices. Edward Hutter, Edgar Solomonik |
| 2019 | Composing Optimization Techniques for Vertex-Centric Graph Processing via Communication Channels. Yongzhe Zhang, Zhenjiang Hu |
| 2019 | Computation of Matrix Chain Products on Parallel Machines. Elad Weiss, Oded Schwartz |
| 2019 | Containers in HPC: A Scalability and Portability Study in Production Biological Simulations. Oleksandr Rudyy, Marta Garcia-Gasulla, Filippo Mantovani, Alfonso Santiago, Raül Sirvent, Mariano Vázquez |
| 2019 | Cpp-Taskflow: Fast Task-Based Parallel Programming Using Modern C++. Tsung-Wei Huang, Chun-Xun Lin, Guannan Guo, Martin D. F. Wong |
| 2019 | CuSP: A Customizable Streaming Edge Partitioner for Distributed Graph Analytics. Loc Hoang, Roshan Dathathri, Gurbinder Gill, Keshav Pingali |
| 2019 | D3: Deterministic Data Distribution for Efficient Data Reconstruction in Erasure-Coded Distributed Storage Systems. Zhipeng Li, Min Lv, Yinlong Xu, Yongkun Li, Liangliang Xu |
| 2019 | DLHub: Model and Data Serving for Science. Ryan Chard, Zhuozhao Li, Kyle Chard, Logan T. Ward, Yadu N. Babuji, Anna Woodard, Steven Tuecke, Ben Blaiszik, Michael J. Franklin, Ian T. Foster |
| 2019 | DYRS: Bandwidth-Aware Disk-to-Memory Migration of Cold Data in Big-Data File Systems. Simbarashe Dzinamarira, Florin Dinu, T. S. Eugene Ng |
| 2019 | Data Jockey: Automatic Data Management for HPC Multi-tiered Storage Systems. Woong Shin, Christopher Brumgard, Bing Xie, Sudharshan S. Vazhkudai, Devarshi Ghoshal, Sarp Oral, Lavanya Ramakrishnan |
| 2019 | Design Space Exploration of Next-Generation HPC Machines. Constantino Gómez, Francesc Martínez, Adrià Armejach, Miquel Moretó, Filippo Mantovani, Marc Casas |
| 2019 | Distributed Approximate k-Core Decomposition and Min-Max Edge Orientation: Breaking the Diameter Barrier. T.-H. Hubert Chan, Mauro Sozio, Bintao Sun |
| 2019 | Distributed Dominating Set and Connected Dominating Set Construction Under the Dynamic SINR Model. Dongxiao Yu, Yifei Zou, Yong Zhang, Feng Li, Jiguo Yu, Yu Wu, Xiuzhen Cheng, Francis C. M. Lau |
| 2019 | Distributed Weighted All Pairs Shortest Paths Through Pipelining. Udit Agarwal, Vijaya Ramachandran |
| 2019 | Double-Precision FPUs in High-Performance Computing: An Embarrassment of Riches? Jens Domke, Kazuaki Matsumura, Mohamed Wahib, Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, Artur Podobas, Satoshi Matsuoka |
| 2019 | Drowsy-DC: Data Center Power Management System. Mathieu Bacou, Grégoire Todeschi, Alain Tchana, Daniel Hagimont, Baptiste Lepers, Willy Zwaenepoel |
| 2019 | Dual Pattern Compression Using Data-Preprocessing for Large-Scale GPU Architectures. Kyung Hoon Kim, Priyank Devpura, Abhishek Nayyar, Andrew Doolittle, Ki Hwan Yum, Eun Jung Kim |
| 2019 | Dynamic Memory Management for GPU-Based Training of Deep Neural Networks. Shriram S. B., Anshuj Garg, Purushottam Kulkarni |
| 2019 | Effects and Benefits of Node Sharing Strategies in HPC Batch Systems. Alvaro Frank, Tim Süß, André Brinkmann |
| 2019 | Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. Md. Vasimuddin, Sanchit Misra, Heng Li, Srinivas Aluru |
| 2019 | Excavating the Potential of GPU for Accelerating Graph Traversal. Pengyu Wang, Lu Zhang, Chao Li, Minyi Guo |
| 2019 | Exploiting Adaptive Data Compression to Improve Performance and Energy-Efficiency of Compute Workloads in Multi-GPU Systems. Mohammad Khavari Tavana, Yifan Sun, Nicolas Bohm Agostini, David R. Kaeli |
| 2019 | Exploiting Flow Graph of System of ODEs to Accelerate the Simulation of Biologically-Detailed Neural Networks. Bruno R. C. Magalhães, Thomas Sterling, Felix Schürmann, Michael L. Hines |
| 2019 | Exploring MPI Communication Models for Graph Applications Using Graph Matching as a Case Study. Sayan Ghosh, Mahantesh Halappanavar, Ananth Kalyanaraman, Arif Khan, Assefaw H. Gebremedhin |
| 2019 | FALCON: Efficient Designs for Zero-Copy MPI Datatype Processing on Emerging Architectures. Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda |
| 2019 | Fast Batched Matrix Multiplication for Small Sizes Using Half-Precision Arithmetic on GPUs. Ahmad Abdelfattah, Stanimire Tomov, Jack J. Dongarra |
| 2019 | FastJoin: A Skewness-Aware Distributed Stream Join System. Shunjie Zhou, Fan Zhang, Hanhua Chen, Hai Jin, Bing Bing Zhou |
| 2019 | GraphTinker: A High Performance Data Structure for Dynamic Graph Processing. Wole Jaiyeoba, Kevin Skadron |
| 2019 | HART: A Concurrent Hash-Assisted Radix Tree for DRAM-PM Hybrid Memory Systems. Wen Pan, Tao Xie, Xiaojia Song |
| 2019 | Identifying Latent Reduced Models to Precondition Lossy Compression. Huizhang Luo, Dan Huang, Qing Liu, Zhenbo Qiao, Hong Jiang, Jing Bi, Haitao Yuan, MengChu Zhou, Jinzhen Wang, Zhenlu Qin |
| 2019 | Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism. Nikoli Dryden, Naoya Maruyama, Tom Benson, Tim Moon, Marc Snir, Brian Van Essen |
| 2019 | Incremental Graph Processing for On-line Analytics. Scott Sallinen, Roger Pearce, Matei Ripeanu |
| 2019 | Incrementalization of Vertex-Centric Programs. Timothy A. K. Zakian, Ludovic Anthony Richard Capelli, Zhenjiang Hu |
| 2019 | LACC: A Linear-Algebraic Algorithm for Finding Connected Components in Distributed Memory. Ariful Azad, Aydin Buluç |
| 2019 | LLC-Guided Data Migration in Hybrid Memory Systems. Evangelos Vasilakis, Vassilis Papaefstathiou, Pedro Trancoso, Ioannis Sourdis |
| 2019 | Language Modeling at Scale. Md. Mostofa Ali Patwary, Milind Chabbi, Heewoo Jun, Jiaji Huang, Greg Diamos, Kenneth Church |
| 2019 | Load-Balanced Sparse MTTKRP on GPUs. Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Richard W. Vuduc, P. Sadayappan |
| 2019 | Local Distributed Algorithms in Highly Dynamic Networks. Philipp Bamberger, Fabian Kuhn, Yannic Maus |
| 2019 | MD-GAN: Multi-Discriminator Generative Adversarial Networks for Distributed Datasets. Corentin Hardy, Erwan Le Merrer, Bruno Sericola |
| 2019 | MOARD: Modeling Application Resilience to Transient Faults on Data Objects. Luanzheng Guo, Dong Li |
| 2019 | MULTISKIPGRAPH: A Self-Stabilizing Overlay Network that Maintains Monotonic Searchability. Linghui Luo, Christian Scheideler, Thim Strothmann |
| 2019 | Matrix Powers Kernels for Thick-Restart Lanczos with Explicit External Deflation. Ichitaro Yamazaki, Zhaojun Bai, Ding Lu, Jack J. Dongarra |
| 2019 | Modelling DVFS and UFS for Region-Based Energy Aware Tuning of HPC Applications. Mohak Chadha, Michael Gerndt |
| 2019 | NCQ-Aware I/O Scheduling for Conventional Solid State Drives. Hao Fan, Song Wu, Shadi Ibrahim, Ximing Chen, Hai Jin, Jiang Xiao, Haibing Guan |
| 2019 | Network Size Estimation in Small-World Networks Under Byzantine Faults. Soumyottam Chatterjee, Gopal Pandurangan, Peter Robinson |
| 2019 | Northup: Divide-and-Conquer Programming in Systems with Heterogeneous Memories and Processors. Shuai Che, Jieming Yin |
| 2019 | On Optimizing Complex Stencils on GPUs. Prashant Singh Rawat, Miheer Vaidya, Aravind Sukumaran-Rajam, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan |
| 2019 | Online Live VM Migration Algorithms to Minimize Total Migration Time and Downtime. Nikos Tziritas, Thanasis Loukopoulos, Samee Khan, Cheng-Zhong Xu, Albert Y. Zomaya |
| 2019 | Optimal Placement of In-memory Checkpoints Under Heterogeneous Failure Likelihoods. Zaeem Hussain, Taieb Znati, Rami G. Melhem |
| 2019 | Optimizing the Parity Check Matrix for Efficient Decoding of RS-Based Cloud Storage Systems. Junqing Gu, Chentao Wu, Xin Xie, Han Qiu, Jie Li, Minyi Guo, Xubin He, Yuanyuan Dong, Yafei Zhao |
| 2019 | Overlapping Communications with Other Communications and Its Application to Distributed Dense Matrix Computations. Hua Huang, Edmond Chow |
| 2019 | PaKman: Scalable Assembly of Large Genomes on Distributed Memory Machines. Priyanka Ghosh, Sriram Krishnamoorthy, Ananth Kalyanaraman |
| 2019 | ParILUT - A Parallel Threshold ILU for GPUs. Hartwig Anzt, Tobias Ribizel, Goran Flegar, Edmond Chow, Jack J. Dongarra |
| 2019 | Peace Through Superior Puzzling: An Asymmetric Sybil Defense. Diksha Gupta, Jared Saia, Maxwell Young |
| 2019 | Portal: A High-Performance Language and Compiler for Parallel N-Body Problems. Laleh Aghababaie Beni, Saikiran Ramanan, Aparna Chandramowlishwaran |
| 2019 | Power and Performance Tradeoffs for Visualization Algorithms. Stephanie Labasan, Matthew Larsen, Hank Childs, Barry Rountree |
| 2019 | Practically Efficient Scheduler for Minimizing Average Flow Time of Parallel Jobs. Kunal Agrawal, I-Ting Angelina Lee, Jing Li, Kefu Lu, Benjamin Moseley |
| 2019 | QoS-Driven Coordinated Management of Resources to Save Energy in Multi-core Systems. Mehrzad Nejat, Miquel Pericàs, Per Stenström |
| 2019 | Reservation Strategies for Stochastic Jobs. Guillaume Aupy, Ana Gainaru, Valentin Honoré, Padma Raghavan, Yves Robert, Hongyang Sun |
| 2019 | Rethinking Support for Region Conflict Exceptions. Swarnendu Biswas, Rui Zhang, Michael D. Bond, Brandon Lucia |
| 2019 | Revisiting the I/O-Complexity of Fast Matrix Multiplication with Recomputations. Roy Nissim, Oded Schwartz |
| 2019 | Robust Dynamic Resource Allocation via Probabilistic Task Pruning in Heterogeneous Computing Systems. James Gentry, Chavit Denninnart, Mohsen Amini Salehi |
| 2019 | Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training. Jiawen Liu, Dong Li, Gokcen Kestor, Jeffrey S. Vetter |
| 2019 | SAC Goes Cluster: Fully Implicit Distributed Computing. Thomas Macht, Clemens Grelck |
| 2019 | SAFIRE: Scalable and Accurate Fault Injection for Parallel Multithreaded Applications. Giorgis Georgakoudis, Ignacio Laguna, Hans Vandierendonck, Dimitrios S. Nikolopoulos, Martin Schulz |
| 2019 | Scheduling on (Un-)Related Machines with Setup Times. Klaus Jansen, Marten Maack, Alexander Mäcker |
| 2019 | Semantics-Aware Virtual Machine Image Management in IaaS Clouds. Nishant Saurabh, Julian Remmers, Dragi Kimovski, Radu Prodan, Jorge G. Barbosa |
| 2019 | Shared-Memory Exact Minimum Cuts. Monika Henzinger, Alexander Noe, Christian Schulz |
| 2019 | SimFS: A Simulation Data Virtualizing File System Interface. Salvatore Di Girolamo, Pirmin Schmid, Thomas C. Schulthess, Torsten Hoefler |
| 2019 | Sizing and Partitioning Strategies for Burst-Buffers to Reduce IO Contention. Guillaume Aupy, Olivier Beaumont, Lionel Eyraud-Dubois |
| 2019 | Slate: Enabling Workload-Aware Efficient Multiprocessing for Modern GPGPUs. Tyler N. Allen, Xizhou Feng, Rong Ge |
| 2019 | Software-Based Buffering of Associative Operations on Random Memory Addresses. Matthias Hauck, Marcus Paradies, Holger Fröning |
| 2019 | SprintCon: Controllable and Efficient Computational Sprinting for Data Center Servers. Wenli Zheng, Xiaorui Wang, Yue Ma, Chao Li, Hao Lin, Bin Yao, Jianfeng Zhang, Minyi Guo |
| 2019 | Stochastic Gradient Descent on Modern Hardware: Multi-core CPU or GPU? Synchronous or Asynchronous? Yujing Ma, Florin Rusu, Martin Torres |
| 2019 | SunwayLB: Enabling Extreme-Scale Lattice Boltzmann Method Based Computing Fluid Dynamics Simulations on Sunway TaihuLight. Zhao Liu, Xuesen Chu, Xiaojing Lv, Hongsong Meng, Shupeng Shi, Wenji Han, Jingheng Xu, Haohuan Fu, Guangwen Yang |
| 2019 | The Path to Delivering Programable Exascale Systems. Luiz DeRose |
| 2019 | Themis: Predicting and Reining in Application-Level Slowdown on Spatial Multitasking GPUs. Wenyi Zhao, Quan Chen, Hao Lin, Jianfeng Zhang, Jingwen Leng, Chao Li, Wenli Zheng, Li Li, Minyi Guo |
| 2019 | Tight & Simple Load Balancing. Petra Berenbrink, Tom Friedetzky, Dominik Kaaser, Peter Kling |
| 2019 | Two Elementary Instructions Make Compare-and-Swap. Pankaj Khanchandani, Roger Wattenhofer |
| 2019 | Two Roads to Parallelism: From Serial Code to Programming with STAPL. Lawrence Rauchwerger |
| 2019 | UPC++: A High-Performance Communication Framework for Asynchronous Computation. John Bachan, Scott B. Baden, Steven A. Hofmeyr, Mathias Jacquelin, Amir Kamil, Dan Bonachea, Paul H. Hargrove, Hadia Ahmed |
| 2019 | Understanding the Impact of Dynamic Power Capping on Application Progress. Srinivasan Ramesh, Swann Perarnau, Sridutt Bhalachandra, Allen D. Malony, Peter H. Beckman |
| 2019 | VeloC: Towards High Performance Adaptive Asynchronous Checkpointing at Large Scale. Bogdan Nicolae, Adam Moody, Elsa Gonsiorowski, Kathryn M. Mohror, Franck Cappello |
| 2019 | Z-Dedup: A Case for Deduplicating Compressed Contents in Cloud. Zhichao Yan, Hong Jiang, Yujuan Tan, Stan Skelton, Hao Luo |
| 2019 | iez: Resource Contention Aware Load Balancing for Large-Scale Parallel File Systems. Bharti Wadhwa, Arnab Kumar Paul, Sarah Neuwirth, Feiyi Wang, Sarp Oral, Ali Raza Butt, Jon Bernard, Kirk W. Cameron |
| 2019 | mmWave Wireless Backhaul Scheduling of Stochastic Packet Arrivals. Pawel Garncarek, Tomasz Jurdzinski, Dariusz R. Kowalski, Miguel A. Mosteiro |