HPDC A

29 papers

Year  Title / Authors
2021  A Serverless Framework for Distributed Bulk Metadata Extraction.
Tyler J. Skluzacek, Ryan Wong, Zhuozhao Li, Ryan Chard, Kyle Chard, Ian T. Foster
2021  AITurbo: Unified Compute Allocation for Partial Predictable Training in Commodity Clusters.
Laiping Zhao, Fangshu Li, Wenyu Qu, Kunlin Zhan, Qingman Zhang
2021  ARC: An Automated Approach to Resiliency for Lossy Compressed Data via Error Correcting Codes.
Dakota Fulp, Alexandra Poulos, Robert Underwood, Jon C. Calhoun
2021  Achieving Scalable Consensus by Being Less Writey.
Michael Davis, Hans Vandierendonck
2021  Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling.
Sian Jin, Jesus Pulido, Pascal Grosset, Jiannan Tian, Dingwen Tao, James P. Ahrens
2021  An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks.
Albert Njoroge Kahira, Truong Thao Nguyen, Leonardo Bautista-Gomez, Ryousei Takano, Rosa M. Badia, Mohamed Wahib
2021  Apollo: An ML-assisted Real-Time Storage Resource Observer.
Neeraj Rajesh, Hariharan Devarajan, Jaime Cernuda Garcia, Keith Bateman, Luke Logan, Jie Ye, Anthony Kougkas, Xian-He Sun
2021  Cache-aware Sparse Patterns for the Factorized Sparse Approximate Inverse Preconditioner.
Sergi Laut, Ricard Borrell, Marc Casas
2021  CharminG: A Scalable GPU-resident Runtime System.
Jaemin Choi, David F. Richards, Laxmikant V. Kalé
2021  Computing Challenges for High Energy Physics.
Maria Girone
2021  DLion: Decentralized Distributed Deep Learning in Micro-Clouds.
Rankyung Hong, Abhishek Chandra
2021  DRLPart: A Deep Reinforcement Learning Framework for Optimally Efficient and Robust Resource Partitioning on Commodity Servers.
Ruobing Chen, Jinping Wu, Haosen Shi, Yusen Li, Xiaoguang Liu, Gang Wang
2021  DStore: A Fast, Tailless, and Quiescent-Free Object Store for PMEM.
Shashank Gugnani, Xiaoyi Lu
2021  File System Semantics Requirements of HPC Applications.
Chen Wang, Kathryn M. Mohror, Marc Snir
2021  HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing, Virtual Event, Sweden, June 21-25, 2021.
Erwin Laure, Stefano Markidis, Ana Lucia Varbanescu, Jay F. Lofstead
2021  Hardware Specialization for Distributed Computing.
Gustavo Alonso
2021  Jigsaw: A High-Utilization, Interference-Free Job Scheduler for Fat-Tree Clusters.
Staci A. Smith, David K. Lowenthal
2021  LaSS: Running Latency Sensitive Serverless Computations at the Edge.
Bin Wang, Ahmed Ali-Eldin, Prashant J. Shenoy
2021  MPI-CorrBench: Towards an MPI Correctness Benchmark Suite.
Jan-Patrick Lehr, Tim Jammer, Christian H. Bischof
2021  Machine Learning Augmented Hybrid Memory Management.
Thaleia Dimitra Doudali, Ada Gavrilovska
2021  Parallel Program Scaling Analysis using Hardware Counters.
Shobhit Jagga, Preeti Malakar
2021  Productive Programming of Distributed Systems with the SHAD C++ Library.
Vito Giovanni Castellana, Marco Minutoli
2021  Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly Network.
Yao Kang, Xin Wang, Zhiling Lan
2021  Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters.
Piyush Sao, Hao Lu, Ramakrishnan Kannan, Vijay Thakkar, Richard W. Vuduc, Thomas E. Potok
2021  SnuRHAC: A Runtime for Heterogeneous Accelerator Clusters with CUDA Unified Memory.
Jaehoon Jung, Daeyoung Park, Gangwon Jo, Jungho Park, Jaejin Lee
2021  Superscalar Programming Models: A Perspective from Barcelona.
Rosa M. Badia
2021  TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes.
Carl Pearson, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
2021  Towards Exploiting CPU Elasticity via Efficient Thread Oversubscription.
Hang Huang, Jia Rao, Song Wu, Hai Jin, Hong Jiang, Hao Che, Xiaofeng Wu
2021  Using Pilot Jobs and CernVM File System for Simplified Use of Containers and Software Distribution.
Namratha Urs, Marco Mambelli, Dave Dykstra