IPDPS A

127 papers

Year Title / Authors
2022 "Smarter" NICs for faster molecular dynamics: a case study.
Sara Karamati, Clayton Hughes, K. Scott Hemmert, Ryan E. Grant, Whit Schonbein, Scott Levy, Thomas M. Conte, Jeffrey Young, Richard W. Vuduc
2022 2022 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022, Lyon, France, May 30 - June 3, 2022
2022 A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility.
Jialin Li, Huang Ye, Shaobo Tian, Xinyuan Li, Jian Zhang
2022 A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization.
Qinglei Cao, Rabab Alomairy, Yu Pei, George Bosilca, Hatem Ltaief, David E. Keyes, Jack J. Dongarra
2022 A General Offloading Approach for Near-DRAM Processing-In-Memory Architectures.
Dan Chen, Hai Jin, Long Zheng, Yu Huang, Pengcheng Yao, Chuangyi Gui, Qinggang Wang, Haifeng Liu, Haiheng He, Xiaofei Liao, Ran Zheng
2022 A Quantitative Study of the Spatiotemporal I/O Burstiness of HPC Application.
Wenxiang Yang, Xiangke Liao, Dezun Dong, Jie Yu
2022 A Swap Dominated Tensor Re-Generation Strategy for Training Deep Learning Models.
Lijie Wen, Zan Zong, Li Lin, Leilei Lin
2022 A scalable adaptive-matrix SPMV for heterogeneous architectures.
Han D. Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanian, Robert M. Kirby, Hari Sundar
2022 A self-stabilizing 2-minimal dominating set algorithm based on loop composition in networks of girth at least 7.
Syohei Maruyama, Yuichi Sudo, Sayaka Kamei, Hirotsugu Kakugawa
2022 Accelerating Encrypted Computing on Intel GPUs.
Yujia Zhai, Mohannad Ibrahim, Yiqin Qiu, Fabian Boemer, Zizhong Chen, Alexey Titov, Alexander Lyashevsky
2022 Accuracy vs. Cost in Parallel Fixed-Precision Low-Rank Approximations of Sparse Matrices.
Robert Ernstbrunner, Viktoria Mayer, Wilfried N. Gansterer
2022 Adaptive Verifiable Coded Computing: Towards Fast, Secure and Private Distributed Machine Learning.
Tingting Tang, Ramy E. Ali, Hanieh Hashemi, Tynan Gangwani, Salman Avestimehr, Murali Annavaram
2022 Alias-Chain: Improving Blockchain Scalability via Exploring Content Locality among Transactions.
Jintong Liu, Shenggang Wan, Xubin He
2022 An Efficient Block Validation Mechanism for UTXO-based Blockchains.
Xiaohai Dai, Bin Xiao, Jiang Xiao, Hai Jin
2022 An Efficient Vectorization Scheme for Stencil Computation.
Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao
2022 An End-to-end and Adaptive I/O Optimization Tool for Modern HPC Storage Systems.
Bin Yang, Yanliang Zou, Weiguo Liu, Wei Xue
2022 An Integral-equation-oriented Vectorized SpMV Algorithm and its Application on CT Imaging Reconstruction.
Weicai Ye, Chenghuan Huang, Jiasheng Huang, Jiajun Li, Yao Lu, Ying Jiang
2022 Archpipe: Fast and Flexible Pipelined Erasure-coded Archival Scheme for Heterogeneous Networks.
Bin Xu, Jianzhong Huang, Xiao Qin, Qiang Cao, Yuanyuan Dong, Weikang Kong
2022 As easy as ABC: Optimal (A)ccountable (B)yzantine (C)onsensus is easy!
Pierre Civit, Seth Gilbert, Vincent Gramoli, Rachid Guerraoui, Jovan Komatovic
2022 Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching.
András Strausz, Flavio Vella, Salvatore Di Girolamo, Maciej Besta, Torsten Hoefler
2022 AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning.
Siddharth Singh, Abhinav Bhatele
2022 Batched sparse iterative solvers on GPU for the collision operator for fusion plasma simulations.
Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt
2022 Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU.
Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li
2022 Booster: An Accelerator for Gradient Boosting Decision Trees Training and Inference.
Mingxuan He, Mithuna Thottethodi, T. N. Vijaykumar
2022 Bounding the Flow Time in Online Scheduling with Structured Processing Sets.
Louis-Claude Canon, Anthony Dugois, Loris Marchal
2022 CSC: Collaborative System Configuration for I/O-Intensive Applications in Multi-Tenant Clouds.
Haowei Huang, Pu Pang, Quan Chen, Jieru Zhao, Wenli Zheng, Minyi Guo
2022 CSMV: A Highly Scalable Multi-Versioned Software Transactional Memory for GPUs.
Diogo Nunes, Daniel Castro, Paolo Romano
2022 Challenges and Opportunities in Designing High-Performance and Scalable Middleware for HPC and AI: Past, Present, and Future.
Dhabaleswar K. Panda
2022 Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution.
Johannes Doerfert, Atmn Patel, Joseph Huber, Shilei Tian, Jose Manuel Monsalve Diaz, Barbara M. Chapman, Giorgis Georgakoudis
2022 Coloring the Vertices of 9-pt and 27-pt Stencils with Intervals.
Dante Durrman, Erik Saule
2022 Colza: Enabling Elastic In Situ Visualization for High-performance Computing Simulations.
Matthieu Dorier, Zhe Wang, Utkarsh Ayachit, Shane Snyder, Robert B. Ross, Manish Parashar
2022 Communication-efficient Massively Distributed Connected Components.
Sebastian Lamm, Peter Sanders
2022 Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption.
Zhuoran Ji, Cho-Li Wang
2022 Coupling streaming AI and HPC ensembles to achieve 100-1000× faster biomolecular simulations.
Alexander Brace, Igor Yakushin, Heng Ma, Anda Trifan, Todd S. Munson, Ian T. Foster, Arvind Ramanathan, Hyungro Lee, Matteo Turilli, Shantenu Jha
2022 DEAN: A Lightweight and Resource-efficient Blockchain Protocol for Reliable Edge Computing.
Abdullah Al-Mamun, Haoting Shen, Dongfang Zhao
2022 DENOVA: Deduplication Extended NOVA File System.
Hyungjoon Kwon, Yonghyeon Cho, Awais Khan, Yeohyeon Park, Youngjae Kim
2022 DFMan: A Graph-based Optimization of Dataflow Scheduling on High-Performance Computing Systems.
Fahim Chowdhury, Francesco Di Natale, Adam Moody, Kathryn M. Mohror, Weikuan Yu
2022 DGSF: Disaggregated GPUs for Serverless Functions.
Henrique Fingler, Zhiting Zhu, Esther Yoon, Zhipeng Jia, Emmett Witchel, Christopher J. Rossbach
2022 Degree-Aware Kernels for Computing Jaccard Weights on GPUs.
Amro Alabsi Aljundi, Taha Atahan Akyildiz, Kamer Kaya
2022 Direct solution of larger coupled sparse/dense linear systems using low-rank compression on single-node multi-core machines in an industrial context.
Emmanuel Agullo, Marek Felsöci, Guillaume Sylvand
2022 DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices.
Xueyu Hou, Yongjie Guan, Tao Han, Ning Zhang
2022 Distributed-Memory Sparse Kernels for Machine Learning.
Vivek Bharadwaj, Aydin Buluç, James Demmel
2022 Dynamic Computation Offloading for Green Things-Edge-Cloud Computing with Local Caching.
Xianzhong Tian, Huixiao Meng, Yanjun Li, Pingting Miao, Pengcheng Xu
2022 Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics.
Ben Tovar, Ben Lyons, Kelci Mohrman, Barry Sly-Delgado, Kevin Lannon, Douglas Thain
2022 Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture.
Jing Wang, Chao Li, Taolei Wang, Lu Zhang, Pengyu Wang, Junyi Mei, Minyi Guo
2022 Exploiting Reduced Precision for GPU-based Time Series Mining.
Yi Ju, Amir Raoofy, Dai Yang, Erwin Laure, Martin Schulz
2022 Exploring Efficient Microservice Level Parallelism.
Xinkai Wang, Chao Li, Lu Zhang, Xiaofeng Hou, Quan Chen, Minyi Guo
2022 FAM-Graph: Graph Analytics on Disaggregated Memory.
Daniel Zahka, Ada Gavrilovska
2022 Falcon: A Timestamp-based Protocol to Maximize the Cache Efficiency in the Distributed Shared Memory.
Jin Zhang, Xiangyao Yu, Zhengwei Qi, Haibing Guan
2022 Fast Convergence to Fairness for Reduced Long Flow Tail Latency in Datacenter Networks.
John Snyder, Alvin R. Lebeck
2022 Fast Parallel Bayesian Network Structure Learning.
Jiantong Jiang, Zeyi Wen, Ajmal Mian
2022 Fast and High-Quality Influence Maximization on Multiple GPUs.
Gökhan Göktürk, Kamer Kaya
2022 Fault-tolerant Snapshot Objects in Message Passing Systems.
Vijay K. Garg, Saptaparni Kumar, Lewis Tseng, Xiong Zheng
2022 FlashWalker: An In-Storage Accelerator for Graph Random Walks.
Fuping Niu, Jianhui Yue, Jiangqiu Shen, Xiaofei Liao, Haikun Liu, Hai Jin
2022 Frugal Decentralized Learning.
Anne-Marie Kermarrec
2022 GSpecPal: Speculation-Centric Finite State Machine Parallelization on GPUs.
Yuguang Wang, Robbie Watling, Junqiao Qiu, Zhenlin Wang
2022 Generalized Flow-Graph Programming Using Template Task-Graphs: Initial Implementation and Assessment.
Joseph Schuchart, Poornima Nookala, Mohammad Mahdi Javanmard, Thomas Hérault, Edward F. Valeev, George Bosilca, Robert J. Harrison
2022 HACCS: Heterogeneity-Aware Clustered Client Selection for Accelerated Federated Learning.
Joel Wolfrath, Nikhil Sreekumar, Dhruv Kumar, Yuanli Wang, Abhishek Chandra
2022 HDagg: Hybrid Aggregation of Loop-carried Dependence Iterations in Sparse Matrix Computations.
Behrooz Zarebavani, Kazem Cheshmi, Bangtian Liu, Michelle Mills Strout, Maryam Mehri Dehnavi
2022 HRaft: Adaptive Erasure Coded Data Maintenance for Consensus in Distributed Networks.
Yulei Jia, Guangping Xu, Chi Wan Sung, Salwa Mostafa, Yulei Wu
2022 HTS: A Threaded Multilevel Sparse Hybrid Solver.
Joshua Dennis Booth
2022 High-order Line Graphs of Non-uniform Hypergraphs: Algorithms, Applications, and Experimental Analysis.
Xu T. Liu, Jesun Firoz, Sinan G. Aksoy, Ilya Amburg, Andrew Lumsdaine, Cliff A. Joslyn, Brenda Praggastis, Assefaw H. Gebremedhin
2022 Hybrid Workload Scheduling on HPC Systems.
Yuping Fan, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka
2022 I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication.
Niels Gleinig, Maciej Besta, Torsten Hoefler
2022 In-Memory Indexed Caching for Distributed Data Processing.
Alexandru Uta, Bogdan Ghit, Ankur Dave, Jan S. Rellermeyer, Peter Boncz
2022 Landau collision operator in the CUDA programming model applied to thermal quench plasmas.
Mark F. Adams, Dylan P. Brennan, Matthew G. Knepley, Peng Wang
2022 Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization.
Ali TehraniJamsaz, Mihail Popov, Akash Dutta, Emmanuelle Saillard, Ali Jannesari
2022 Lightning: Scaling the GPU Programming Model Beyond a Single GPU.
Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Rob V. van Nieuwpoort
2022 MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions.
Qihan Wang, Bin Ren, Jie Chen, Robert G. Edwards
2022 MLCNN: Cross-Layer Cooperative Optimization and Accelerator Architecture for Speeding Up Deep Learning Applications.
Beilei Jiang, Xianwei Cheng, Sihai Tang, Xu Ma, Zhaochen Gu, Song Fu, Qing Yang, Mingxiong Liu
2022 Memory Access Granularity Aware Lossless Compression for GPUs.
Sohan Lal, Manuel Renz, Julian Hartmer, Ben H. H. Juurlink
2022 Memory-Aware Scheduling of Tasks Sharing Data on Multiple GPUs with Dynamic Runtime Systems.
Maxime Gonthier, Loris Marchal, Samuel Thibault
2022 Minerva: Rethinking Secure Architectures for the Era of Fabric-Attached Memory Architectures.
Mazen Al-Wadi, Rujia Wang, David Mohaisen, Clayton Hughes, Simon David Hammond, Amro Awad
2022 Mixed Precision $s$-step Conjugate Gradient with Residual Replacement on GPUs.
Ichitaro Yamazaki, Erin Carson, Brian Michael Kelley
2022 Mnemonic: A Parallel Subgraph Matching System for Streaming Graphs.
Bibek Bhattarai, H. Howie Huang
2022 Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA.
Hongkuan Zhou, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart
2022 Modeling Matrix Engines for Portability and Performance.
Nicholai Tukanov, Rajalakshmi Srinivasaraghavan, José E. Moreira, Tze Meng Low
2022 Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast.
Lucas Leandro Nesi, Lucas Mello Schnorr, Arnaud Legrand
2022 Neon: A Multi-GPU Programming Model for Grid-based Computations.
Massimiliano Meneghin, Ahmed H. Mahmoud, Pradeep Kumar Jayaraman, Nigel J. W. Morris
2022 Next-Generation Local Time Stepping for the ADER-DG Finite Element Method.
Alexander Breuer, Alexander Heinecke
2022 OmpSs@cloudFPGA: An FPGA Task-Based Programming Model with Message Passing.
Juan Miguel De Haro Ruiz, Rubén Cano, Carlos Álvarez, Daniel Jiménez-González, Xavier Martorell, Eduard Ayguadé, Jesús Labarta, François Abel, Burkhard Ringlein, Beat Weiss
2022 On the Parallel Reconstruction from Pooled Data.
Oliver Gebhard, Max Hahn-Klimroth, Dominik Kaaser, Philipp Loick
2022 Optimal Arbitrary Pattern Formation on a Grid by Asynchronous Autonomous Robots.
Rory Hector, Gokarna Sharma, Ramachandran Vaidyanathan, Jerry L. Trahan
2022 Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs.
Cody Rivera, Sheng Di, Jiannan Tian, Xiaodong Yu, Dingwen Tao, Franck Cappello
2022 P-ckpt: Coordinated Prioritized Checkpointing.
Subhendu Behera, Lipeng Wan, Frank Mueller, Matthew Wolf, Scott Klasky
2022 PARSEC: PARallel Subgraph Enumeration in CUDA.
Vibhor Dodeja, Mohammad Almasri, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu
2022 PINT: Parallel INTerval-Based Race Detector.
Yifan Xu, Anchengcheng Zhou, Kunal Agrawal, I-Ting Angelina Lee
2022 ParaTreeT: A Fast, General Framework for Spatial Tree Traversal.
Joseph Hutter, Justin Josef Szaday, Jaemin Choi, Simeng Liu, Laxmikant V. Kalé, Spencer Wallace, Thomas R. Quinn
2022 Parallel Approximations of the Tukey g-and-h Likelihoods and Predictions for Non-Gaussian Geostatistics.
Sagnik Mondal, Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, David E. Keyes
2022 Parallel Global Edge Switching for the Uniform Sampling of Simple Graphs with Prescribed Degrees.
Daniel Allendorf, Ulrich Meyer, Manuel Penschuck, Hung Tran
2022 Parallel Tensor Train Rounding using Gram SVD.
Hussam Al Daas, Grey Ballard, Lawton Manning
2022 Parallel Vertex Cover Algorithms on GPUs.
Peter Yamout, Karim Barada, Adnan Jaljuli, Amer E. Mouawad, Izzat El Hajj
2022 Parallel, Portable Algorithms for Distance-2 Maximal Independent Set and Graph Coarsening.
Brian Michael Kelley, Sivasankaran Rajamanickam
2022 Parallelizing and Balancing Coupled DSMC/PIC for Large-scale Particle Simulations.
Haozhong Qiu, Chuanfu Xu, Dali Li, Haoyu Wang, Jie Li, Zheng Wang
2022 PokéMem: Taming Wild Memory Consumers in Apache Spark.
Minhyeok Kweun, Goeun Kim, Byungsoo Oh, Seongho Jung, Taegeon Um, Woo-Yeon Lee
2022 PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling.
Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, Depei Qian
2022 Preprocessing Pipeline Optimization for Scientific Deep Learning Workloads.
Khaled Z. Ibrahim, Leonid Oliker
2022 QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling.
Jiuchen Shi, Jiawen Wang, Kaihua Fu, Quan Chen, Deze Zeng, Minyi Guo
2022 RLRP: High-Efficient Data Placement with Reinforcement Learning for Modern Distributed Storage Systems.
Kai Lu, Nannan Zhao, Jiguang Wan, Changhong Fei, Wei Zhao, Tongliang Deng
2022 Resilience at Extreme Scale and Connections with Other Domains.
Leonardo Bautista-Gomez
2022 Resource Utilization Aware Job Scheduling to Mitigate Performance Variability.
Daniel Nichols, Aniruddha Marathe, Kathleen Shoga, Todd Gamblin, Abhinav Bhatele
2022 SALoBa: Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs.
Seongyeon Park, Hajin Kim, Tanveer Ahmad, Nauman Ahmed, Zaid Al-Ars, H. Peter Hofstee, Youngsok Kim, Jinho Lee
2022 SFP: Service Function Chain Provision on Programmable Switches for Cloud Tenants.
Hongyi Huang, Wenfei Wu, Yongchao He, Zehua Guo
2022 SPIDER: An Effective, Efficient and Robust Load Scheduler for Real-time Split Frame Rendering.
Bingzheng Ma, Ziqiang Zhang, Yusen Li, Wentong Cai, Gang Wang, Xiaoguang Liu
2022 SSB-Tree: Making Persistent Memory B+- Trees Crash-Consistent and Concurrent by Lazy-Box.
Tongliang Li, Haixia Wang, Airan Shao, Dongsheng Wang
2022 Scalable Low-Latency Inter-FPGA Networks.
Kien Trung Pham, Truong Thao Nguyen, Hiroshi Yamaguchi, Yutaka Urino, Michihiro Koibuchi
2022 Scalable Multi-Versioning Ordered Key-Value Stores with Persistent Memory Support.
Bogdan Nicolae
2022 Scaling and Selecting GPU Methods for All Pairs Shortest Paths (APSP) Computations.
Yang Xia, Peng Jiang, Gagan Agrawal, Rajiv Ramnath
2022 Scheduling on Uniform and Unrelated Machines with Bipartite Incompatibility Graphs.
Tytus Pikies, Hanna Furmanczyk
2022 SecFortress: Securing Hypervisor using Cross-layer Isolation.
Qihang Zhou, Xiaoqi Jia, Shengzhi Zhang, Nan Jiang, Jiayun Chen, Weijuan Zhang
2022 Shared-Memory Parallel Algorithms for Fully Dynamic Maintenance of 2-Connected Components.
Chirayu Anant Haryan, G. Ramakrishna, Kishore Kothapalli, Dip Sankar Banerjee
2022 Sparsity-Aware Tensor Decomposition.
Süreyya Emre Kurt, Saurabh Raje, Aravind Sukumaran-Rajam, P. Sadayappan
2022 SpectralFly: Ramanujan Graphs as Flexible and Efficient Interconnection Networks.
Stephen J. Young, Sinan G. Aksoy, Jesun Firoz, Roberto Gioiosa, Tobias Hagge, Mark Kempton, Juan Escobedo, Mark Raugas
2022 StencilMART: Predicting Optimization Selection for Stencil Computations across GPUs.
Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian
2022 TEE-based decentralized recommender systems: The raw data sharing redemption.
Akash Balasaheb Dhasade, Nevena Dresevic, Anne-Marie Kermarrec, Rafael Pires
2022 TagTree: Global Tagging Index with Efficient Querying for Time Series Databases.
Jin Xue, Zhiqi Wang, Tianyu Wang, Zili Shao
2022 Task-based Acceleration of Bidirectional Recurrent Neural Networks on Multi-core Architectures.
Robin Kumar Sharma, Marc Casas
2022 The Fast and Scalable MPI Application Launch of the Tianhe HPC system.
Yiqin Dai, Yong Dong, Min Xie, Kai Lu, Ruibo Wang, Mingtian Shao, Juan Chen
2022 The Universal Gossip Fighter.
Anastasiia Gorbunova, Rachid Guerraoui, Anne-Marie Kermarrec, Anastasiia Kucherenko, Rafaël Pinot
2022 Top-Down Performance Profiling on NVIDIA's GPUs.
Alvaro Saiz, Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente
2022 Topological Modeling and Parallelization of Multidimensional Data on Microelectrode Arrays.
Olamide Timothy Tawose, Bin Li, Lei Yang, Feng Yan, Dongfang Zhao
2022 Towards Distributed 2-Approximation Steiner Minimal Trees in Billion-edge Graphs.
Tahsin Reza, Geoffrey Sanders, Roger Pearce
2022 Traffic-Optimal Virtual Network Function Placement and Migration in Dynamic Cloud Data Centers.
Vincent Tran, Jingsong Sun, Bin Tang, Deng Pan
2022 Understanding the Design-Space of Sparse/Dense Multiphase GNN dataflows on Spatial Accelerators.
Raveesh Garg, Eric Qin, Francisco Muñoz-Martínez, Robert Guirado, Akshay Jain, Sergi Abadal, José L. Abellán, Manuel E. Acacio, Eduard Alarcón, Sivasankaran Rajamanickam, Tushar Krishna
2022 Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study.
Diogo Marques, Rafael Campos, Sergio Santander-Jiménez, Zakhar Matveev, Leonel Sousa, Aleksandar Ilic
2022 Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning.
Truong Thao Nguyen, François Trahay, Jens Domke, Aleksandr Drozd, Emil Vatai, Jianwei Liao, Mohamed Wahib, Balazs Gerofi
2022 pFedGF: Enabling Personalized Federated Learning via Gradient Fusion.
Xinghao Wu, Jianwei Niu, Xuefeng Liu, Tao Ren, Zhangmin Huang, Zhetao Li