IPDPS A

96 papers

Year Title / Authors
2023 A Guaranteed Approximation Algorithm for Scheduling Fork-Joins with Communication Delay.
Pierre-François Dutot, Yeu-Shin Fu, Nikhil Prasad, Oliver Sinnen
2023 A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication.
Yufan Xia, Marco De La Pierre, Amanda S. Barnard, Giuseppe M. J. Barca
2023 A Novel Framework for Efficient Offloading of Communication Operations to Bluefield SmartNICs.
Kaushik Kandadi Suresh, Benjamin Michalowicz, Bharath Ramesh, Nicholas Contini, Jinghan Yao, Shulei Xu, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
2023 A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices.
João Nuno Ferreira Alves, Luís M. S. Russo, Alexandre P. Francisco, Siegfried Benkner
2023 Accelerating CNN inference on long vector architectures via co-design.
Sonia Rani Gupta, Nikela Papadopoulou, Miquel Pericàs
2023 Accelerating Distributed Deep Learning Training with Compression Assisted Allgather and Reduce-Scatter Communication.
Qinghua Zhou, Quentin Anthony, Lang Xu, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda
2023 Accelerating Packet Processing in Container Overlay Networks via Packet-level Parallelism.
Jiaxin Lei, Manish Munikar, Hui Lu, Jia Rao
2023 Accurate and Efficient Distributed COVID-19 Spread Prediction based on a Large-Scale Time-Varying People Mobility Graph.
Sudipta Saha Shubha, Shohaib Mahmud, Haiying Shen, Geoffrey C. Fox, Madhav V. Marathe
2023 Alioth: A Machine Learning Based Interference-Aware Performance Monitor for Multi-Tenancy Applications in Public Cloud.
Tianyao Shi, Yingxuan Yang, Yunlong Cheng, Xiaofeng Gao, Zhen Fang, Yongqiang Yang
2023 An Adaptive Hybrid Quantum Algorithm for the Metric Traveling Salesman Problem.
Fei Li, Arul Rhik Mazumder
2023 An Efficient 2D Method for Training Super-Large Deep Learning Models.
Qifan Xu, Yang You
2023 An Experimental Study of Two-level Schwarz Domain-Decomposition Preconditioners on GPUs.
Ichitaro Yamazaki, Alexander Heinlein, Sivasankaran Rajamanickam
2023 AnyQ: An Evaluation Framework for Massively-Parallel Queue Algorithms.
Michael Kenzel, Stefan Lemme, Richard Membarth, Matthias Kurtenacker, Hugo Devillers, Markus Steinberger, Philipp Slusallek
2023 ArkFS: A Distributed File System on Object Storage for Archiving Data in HPC Environment.
Kyu-Jin Cho, Injae Kang, Jin-soo Kim
2023 Asynch-SGBDT: Train Stochastic Gradient Boosting Decision Trees in an Asynchronous Parallel Manner.
Daning Cheng, Shigang Li, Yunquan Zhang
2023 Boosting Multi-Block Repair in Cloud Storage Systems with Wide-Stripe Erasure Coding.
Qi Yu, Lin Wang, Yuchong Hu, Yumeng Xu, Dan Feng, Jie Fu, Xia Zhu, Zhen Yao, Wenjia Wei
2023 ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs.
Yujia Zhai, Chengquan Jiang, Leyuan Wang, Xiaoying Jia, Shang Zhang, Zizhong Chen, Xin Liu, Yibo Zhu
2023 Chic-sched: a HPC Placement-Group Scheduler on Hierarchical Topologies with Constraints.
Laurent Schares, Asser N. Tantawi, Pavlos Maniotis, Ming-Hung Chen, Claudia Misale, Seetharami Seelam, Hao Yu
2023 Communication Optimization for Distributed Execution of Graph Neural Networks.
Süreyya Emre Kurt, Jinghua Yan, Aravind Sukumaran-Rajam, Prashant Pandey, P. Sadayappan
2023 DAOS as HPC Storage: a View From Numerical Weather Prediction.
Nicolau Manubens, Tiago Quintino, Simon D. Smart, Emanuele Danovaro, Adrian Jackson
2023 Data Distribution Schemes for Dense Linear Algebra Factorizations on Any Number of Nodes.
Olivier Beaumont, Jean-Alexandre Collin, Lionel Eyraud-Dubois, Mathieu Vérité
2023 DeepThermo: Deep Learning Accelerated Parallel Monte Carlo Sampling for Thermodynamics Evaluation of High Entropy Alloys.
Junqi Yin, Feiyi Wang, Mallikarjun Arjun Shankar
2023 Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc.
Kawthar Shafie Khorassani, Chen-Chun Chen, Hari Subramoni, Dhabaleswar K. Panda
2023 Distributed Sparse Random Projection Trees for Constructing K-Nearest Neighbor Graphs.
Isuru Ranawaka, Md. Khaledur Rahman, Ariful Azad
2023 Distributing Simplex-Shaped Nested for-Loops to Identify Carcinogenic Gene Combinations.
Sajal Dash, Mohammad Alaul Haque Monil, Junqi Yin, Ramu Anandakrishnan, Feiyi Wang
2023 Drill: Log-based Anomaly Detection for Large-scale Storage Systems Using Source Code Analysis.
Di Zhang, Chris Egersdoerfer, Tabassum Mahmud, Mai Zheng, Dong Dai
2023 Duo: Improving Data Sharing of Stateful Serverless Applications by Efficiently Caching Multi-Read Data.
Zhuo Huang, Hao Fan, Chaoyi Cheng, Song Wu, Hai Jin
2023 Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams.
Yongseok Soh, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Teresa M. Ranadive, Fabrizio Petrini, Jee W. Choi
2023 Dynasparse: Accelerating GNN Inference through Dynamic Sparsity Exploitation.
Bingyi Zhang, Viktor K. Prasanna
2023 Efficient Hardware Primitives for Immediate Memory Reclamation in Optimistic Data Structures.
Ajay Singh, Trevor Brown, Michael Spear
2023 Engineering Massively Parallel MST Algorithms.
Peter Sanders, Matthias Schimek
2023 Engineering a Distributed-Memory Triangle Counting Algorithm.
Peter Sanders, Tim Niklas Uhl
2023 Evaluating Asynchronous Parallel I/O on HPC Systems.
John Ravi, Suren Byna, Quincey Koziol, Houjun Tang, Michela Becchi
2023 Exact Fault-Tolerant Consensus with Voting Validity.
Zhangchen Xu, Yuetai Li, Chenglin Feng, Lei Zhang
2023 Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU.
Jianjin Liao, Mingzhen Li, Hailong Yang, Qingxiao Sun, Biao Sun, Jiwei Hao, Tianyu Feng, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Zhongzhi Luan, Depei Qian
2023 Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training.
Siddharth Singh, Abhinav Bhatele
2023 FIRST: Exploiting the Multi-Dimensional Attributes of Functions for Power-Aware Serverless Computing.
Lu Zhang, Chao Li, Xinkai Wang, Weiqi Feng, Zheng Yu, Quan Chen, Jingwen Leng, Minyi Guo, Pu Yang, Shang Yue
2023 Fast And Automatic Floating Point Error Analysis With CHEF-FP.
Garima Singh, Baidyanath Kundu, Harshitha Menon, Alexander Penev, David J. Lange, Vassil Vassilev
2023 Fast Deterministic Gathering with Detection on Arbitrary Graphs: The Power of Many Robots.
Anisur Rahaman Molla, Kaushik Mondal, William K. Moses Jr.
2023 Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks.
Ruibo Fan, Wei Wang, Xiaowen Chu
2023 FaultyRank: A Graph-based Parallel File System Checker.
Saisha Kamat, Abdullah Al Raqibul Islam, Mai Zheng, Dong Dai
2023 Feature-based SpMV Performance Analysis on Contemporary Devices.
Panagiotis Mpakos, Dimitrios Galanopoulos, Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas
2023 FedBIAD: Communication-Efficient and Accuracy-Guaranteed Federated Learning with Bayesian Inference-Based Adaptive Dropout.
Jingjing Xue, Min Liu, Sheng Sun, Yuwei Wang, Hui Jiang, Xuefeng Jiang
2023 FedTrip: A Resource-Efficient Federated Learning Method with Triplet Regularization.
Xujing Li, Min Liu, Sheng Sun, Yuwei Wang, Hui Jiang, Xuefeng Jiang
2023 GPU-Accelerated Error-Bounded Compression Framework for Quantum Circuit Simulations.
Milan Shah, Xiaodong Yu, Sheng Di, Danylo Lykov, Yuri Alexeev, Michela Becchi, Franck Cappello
2023 GPU-enabled Function-as-a-Service for Machine Learning Inference.
Ming Zhao, Kritshekhar Jha, Sungho Hong
2023 Generalizable Reinforcement Learning-Based Coarsening Model for Resource Allocation over Large and Diverse Stream Processing Graphs.
Lanshun Nie, Yuqi Qiu, Fei Meng, Mo Yu, Jing Li
2023 GraphMetaP: Efficient MetaPath Generation for Dynamic Heterogeneous Graph Models.
Haiheng He, Dan Chen, Long Zheng, Yu Huang, Haifeng Liu, Chaoqiang Liu, Xiaofei Liao, Hai Jin
2023 GraphTensor: Comprehensive GNN-Acceleration Framework for Efficient Parallel Processing of Massive Datasets.
Junhyeok Jang, Miryeong Kwon, Donghyun Gouk, Hanyeoreum Bae, Myoungsoo Jung
2023 H-Cache: Traffic-Aware Hybrid Rule-Caching in Software-Defined Networks.
Zeyu Luan, Qing Li, Yi Wang, Yong Jiang
2023 Harnessing the Crowd for Autotuning High-Performance Computing Applications.
Younghyun Cho, James Weldon Demmel, Jacob King, Xiaoye S. Li, Yang Liu, Hengrui Luo
2023 HyScale-GNN: A Scalable Hybrid GNN Training System on Single-Node Heterogeneous Architecture.
Yi-Chien Lin, Viktor K. Prasanna
2023 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2023, St. Petersburg, FL, USA, May 15-19, 2023
2023 Lossy Scientific Data Compression With SPERR.
Shaomeng Li, Peter Lindstrom, John P. Clyne
2023 LowFive: In Situ Data Transport for High-Performance Workflows.
Tom Peterka, Dmitriy Morozov, Arnur Nigmetov, Orcun Yildiz, Bogdan Nicolae, Philip E. Davis
2023 Lyra: Fast and Scalable Resilience to Reordering Attacks in Blockchains.
Pouriya Zarbafian, Vincent Gramoli
2023 MCR-DL: Mix-and-Match Communication Runtime for Deep Learning.
Quentin Anthony, Ammar Ahmad Awan, Jeff Rasley, Yuxiong He, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda
2023 MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.
Zheng Zhang, Donglin Yang, Yaqi Xia, Liang Ding, Dacheng Tao, Xiaobo Zhou, Dazhao Cheng
2023 Memory-aware Optimization for Sequences of Sparse Matrix-Vector Multiplications.
Yichen Zhang, Shengguo Li, Fan Yuan, Dezun Dong, Xiaojian Yang, Tiejun Li, Zheng Wang
2023 Mimir: Extending I/O Interfaces to Express User Intent for Complex Workloads in HPC.
Hariharan Devarajan, Kathryn M. Mohror
2023 Neural Network Compiler for Parallel High-Throughput Simulation of Digital Circuits.
Ignacio Gavier, Joshua Russell, Devdhar Patel, Edward A. Rietman, Hava T. Siegelmann
2023 On Doorway Egress by Autonomous Robots.
Rory Hector, Ramachandran Vaidyanathan, Gokarna Sharma, Jerry L. Trahan
2023 On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM).
Emmanuel Agullo, Alfredo Buttari, Olivier Coulaud, Lionel Eyraud-Dubois, Mathieu Faverge, Alain Franc, Abdou Guermouche, Antoine Jego, Romain Peressoni, Florent Pruvost
2023 Opportunities and Limitations of Hardware Timestamps in Concurrent Data Structures.
Olivia Grimes, Jacob Nelson-Slivon, Ahmed Hassan, Roberto Palmieri
2023 Optimizing Cloud Computing Resource Usage for Hemodynamic Simulation.
William Ladd, Christopher Jensen, Madhurima Vardhan, Jeff Ames, Jeff R. Hammond, Erik W. Draeger, Amanda Randles
2023 PAQR: Pivoting Avoiding QR factorization.
Wissam M. Sid-Lakhdar, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Piotr Luszczek, Mark Gates, Stanimire Tomov, Hans Johansen, David B. Williams-Young, Timothy A. Davis, Jack J. Dongarra, Hartwig Anzt
2023 PFedSA: Personalized Federated Multi-Task Learning via Similarity Awareness.
Chuyao Ye, Hao Zheng, Zhigang Hu, Meiguang Zheng
2023 PRF: A Fast Parallel Relaxed Flooding Algorithm for Voronoi Diagram Generation on GPU.
Jue Wang, Fumihiko Ino, Jing Ke
2023 Porting a Computational Fluid Dynamics Code with AMR to Large-scale GPU Platforms.
Joshua Hoke Davis, Justin Shafner, Daniel Nichols, Nathan Grube, Pino Martin, Abhinav Bhatele
2023 Power Constrained Autotuning using Graph Neural Networks.
Akash Dutta, Jee Choi, Ali Jannesari
2023 Predictive Analysis of Code Optimisations on Large-Scale Coupled CFD-Combustion Simulations using the CPX Mini-App.
Archie Powell, Gihan R. Mudalige
2023 Proactive SLA-aware Application Placement in the Computing Continuum.
Zahra Najafabadi Samani, Narges Mehran, Dragi Kimovski, Radu Prodan
2023 QoS-Aware and Cost-Efficient Dynamic Resource Allocation for Serverless ML Workflows.
Hao Wu, Junxiao Deng, Hao Fan, Shadi Ibrahim, Song Wu, Hai Jin
2023 RLP: Power Management Based on a Latency-Aware Roofline Model.
Bo Wang, Anara Kozhokanova, Christian Terboven, Matthias S. Müller
2023 RT-DBSCAN: Accelerating DBSCAN using Ray Tracing Hardware.
Vani Nagarajan, Milind Kulkarni
2023 SBGT: Scaling Bayesian-based Group Testing for Disease Surveillance.
Weicong Chen, Hao Qi, Xiaoyi Lu, Curtis Tatsuoka
2023 SCONNA: A Stochastic Computing Based Optical Accelerator for Ultra-Fast, Energy-Efficient Inference of Integer-Quantized CNNs.
Sairam Sri Vatsavai, Venkata Sai Praneeth Karempudi, Ishan G. Thakkar, Sayed Ahmad Salehi, Jeffrey Todd Hastings
2023 SLAP: An Adaptive, Learned Admission Policy for Content Delivery Network Caching.
Ke Liu, Kan Wu, Hua Wang, Ke Zhou, Ji Zhang, Cong Li
2023 SRC: Mitigate I/O Throughput Degradation in Network Congestion Control of Disaggregated Storage Systems.
Danlin Jia, Yiming Xie, Li Wang, Xiaoqian Zhang, Allen Yang, Xuebin Yao, Mahsa Bayati, Pradeep Subedi, Bo Sheng, Ningfang Mi
2023 SW-LCM: A Scalable and Weakly-supervised Land Cover Mapping Method on a New Sunway Supercomputer.
Yi Zhao, Juepeng Zheng, Haohuan Fu, Wenzhao Wu, Jie Gao, Mengxuan Chen, Jinxiao Zhang, Lixian Zhang, Runmin Dong, Zhenrong Du, Sha Liu, Xin Liu, Shaoqing Zhang, Le Yu
2023 Satellite Collision Detection using Spatial Data Structures.
Christian Hellwig, Fabian Czappa, Martin Michel, Reinhold Bertrand, Felix Wolf
2023 Scalable adaptive algorithms for next-generation multiphase flow simulations.
Kumar Saurabh, Masado Ishii, Makrand A. Khanwale, Hari Sundar, Baskar Ganapathysubramanian
2023 Scheduling with Many Shared Resources.
Max A. Deppert, Klaus Jansen, Marten Maack, Simon Pukrop, Malin Rau
2023 SelB-k-NN: A Mini-Batch K-Nearest Neighbors Algorithm on AI Processors.
Yifeng Tang, Cho-Li Wang
2023 Signal Detection for Large MIMO Systems Using Sphere Decoding on FPGAs.
Mohamed W. Hassan, Adel Dabah, Hatem Ltaief, Suhaib A. Fahmy
2023 Smart Redbelly Blockchain: Reducing Congestion for Web3.
Deepal Tennakoon, Yiding Hua, Vincent Gramoli
2023 Software-Defined, Fast and Strongly-Consistent Data Replication for RDMA-Based PM Datastores.
Haodi Lu, Haikun Liu, Chencheng Ye, Xiaofei Liao, Fubing Mao, Yu Zhang, Hai Jin
2023 Stochastic Neuromorphic Circuits for Solving MAXCUT.
Bradley H. Theilman, Yipu Wang, Ojas Parekh, William Severa, J. Darby Smith, James B. Aimone
2023 Towards Faster Fully Homomorphic Encryption Implementation with Integer and Floating-point Computing Power of GPUs.
Guang Fan, Fangyu Zheng, Lipeng Wan, Lili Gao, Yuan Zhao, Jiankuo Dong, Yixuan Song, Yuewu Wang, Jingqiang Lin
2023 Traversing Large Compressed Graphs on GPUs.
Prasun Gera, Hyesoon Kim
2023 TurboHE: Accelerating Fully Homomorphic Encryption Using FPGA Clusters.
Haohao Liao, Mahmoud A. Elmohr, Xuan Dong, Yanjun Qian, Wenzhe Yang, Zhiwei Shang, Yin Tan
2023 UnifyFS: A User-level Shared File System for Unified Access to Distributed Local Storage.
Michael J. Brim, Adam T. Moody, Seung-Hwan Lim, Ross G. Miller, Swen Boehm, Cameron Stanavige, Kathryn M. Mohror, Sarp Oral
2023 ZFP-X: Efficient Embedded Coding for Accelerating Lossy Floating Point Compression.
Bing Lu, Yida Li, Junqi Wang, Huizhang Luo, Kenli Li
2023 k-Center Clustering with Outliers in the MPC and Streaming Model.
Mark de Berg, Leyla Biabani, Morteza Monemizadeh
2023 qTask: Task-parallel Quantum Circuit Simulation with Incrementality.
Tsung-Wei Huang
2023 rFaaS: Enabling High Performance Serverless with RDMA and Leases.
Marcin Copik, Konstantin Taranov, Alexandru Calotoiu, Torsten Hoefler