IPDPS - RankMe

127 papers

Year	Title / Authors
2022	"Smarter" NICs for faster molecular dynamics: a case study. Sara Karamati, Clayton Hughes, K. Scott Hemmert, Ryan E. Grant, Whit Schonbein, Scott Levy, Thomas M. Conte, Jeffrey Young, Richard W. Vuduc
2022	2022 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022, Lyon, France, May 30 - June 3, 2022
2022	A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility. Jialin Li, Huang Ye, Shaobo Tian, Xinyuan Li, Jian Zhang
2022	A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization. Qinglei Cao, Rabab Alomairy, Yu Pei, George Bosilca, Hatem Ltaief, David E. Keyes, Jack J. Dongarra
2022	A General Offloading Approach for Near-DRAM Processing-In-Memory Architectures. Dan Chen, Hai Jin, Long Zheng, Yu Huang, Pengcheng Yao, Chuangyi Gui, Qinggang Wang, Haifeng Liu, Haiheng He, Xiaofei Liao, Ran Zheng
2022	A Quantitative Study of the Spatiotemporal I/O Burstiness of HPC Application. Wenxiang Yang, Xiangke Liao, Dezun Dong, Jie Yu
2022	A Swap Dominated Tensor Re-Generation Strategy for Training Deep Learning Models. Lijie Wen, Zan Zong, Li Lin, Leilei Lin
2022	A scalable adaptive-matrix SPMV for heterogeneous architectures. Han D. Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanian, Robert M. Kirby, Hari Sundar
2022	A self-stabilizing 2-minimal dominating set algorithm based on loop composition in networks of girth at least 7. Syohei Maruyama, Yuichi Sudo, Sayaka Kamei, Hirotsugu Kakugawa
2022	Accelerating Encrypted Computing on Intel GPUs. Yujia Zhai, Mohannad Ibrahim, Yiqin Qiu, Fabian Boemer, Zizhong Chen, Alexey Titov, Alexander Lyashevsky
2022	Accuracy vs. Cost in Parallel Fixed-Precision Low-Rank Approximations of Sparse Matrices. Robert Ernstbrunner, Viktoria Mayer, Wilfried N. Gansterer
2022	Adaptive Verifiable Coded Computing: Towards Fast, Secure and Private Distributed Machine Learning. Tingting Tang, Ramy E. Ali, Hanieh Hashemi, Tynan Gangwani, Salman Avestimehr, Murali Annavaram
2022	Alias-Chain: Improving Blockchain Scalability via Exploring Content Locality among Transactions. Jintong Liu, Shenggang Wan, Xubin He
2022	An Efficient Block Validation Mechanism for UTXO-based Blockchains. Xiaohai Dai, Bin Xiao, Jiang Xiao, Hai Jin
2022	An Efficient Vectorization Scheme for Stencil Computation. Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao
2022	An End-to-end and Adaptive I/O Optimization Tool for Modern HPC Storage Systems. Bin Yang, Yanliang Zou, Weiguo Liu, Wei Xue
2022	An Integral-equation-oriented Vectorized SpMV Algorithm and its Application on CT Imaging Reconstruction. Weicai Ye, Chenghuan Huang, Jiasheng Huang, Jiajun Li, Yao Lu, Ying Jiang
2022	Archpipe: Fast and Flexible Pipelined Erasure-coded Archival Scheme for Heterogeneous Networks. Bin Xu, Jianzhong Huang, Xiao Qin, Qiang Cao, Yuanyuan Dong, Weikang Kong
2022	As easy as ABC: Optimal (A)ccountable (B)yzantine (C)onsensus is easy! Pierre Civit, Seth Gilbert, Vincent Gramoli, Rachid Guerraoui, Jovan Komatovic
2022	Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching. András Strausz, Flavio Vella, Salvatore Di Girolamo, Maciej Besta, Torsten Hoefler
2022	AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning. Siddharth Singh, Abhinav Bhatele
2022	Batched sparse iterative solvers on GPU for the collision operator for fusion plasma simulations. Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt
2022	Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU. Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li
2022	Booster: An Accelerator for Gradient Boosting Decision Trees Training and Inference. Mingxuan He, Mithuna Thottethodi, T. N. Vijaykumar
2022	Bounding the Flow Time in Online Scheduling with Structured Processing Sets. Louis-Claude Canon, Anthony Dugois, Loris Marchal
2022	CSC: Collaborative System Configuration for I/O-Intensive Applications in Multi-Tenant Clouds. Haowei Huang, Pu Pang, Quan Chen, Jieru Zhao, Wenli Zheng, Minyi Guo
2022	CSMV: A Highly Scalable Multi-Versioned Software Transactional Memory for GPUs. Diogo Nunes, Daniel Castro, Paolo Romano
2022	Challenges and Opportunities in Designing High-Performance and Scalable Middleware for HPC and AI: Past, Present, and Future. Dhabaleswar K. Panda
2022	Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution. Johannes Doerfert, Atmn Patel, Joseph Huber, Shilei Tian, Jose Manuel Monsalve Diaz, Barbara M. Chapman, Giorgis Georgakoudis
2022	Coloring the Vertices of 9-pt and 27-pt Stencils with Intervals. Dante Durrman, Erik Saule
2022	Colza: Enabling Elastic In Situ Visualization for High-performance Computing Simulations. Matthieu Dorier, Zhe Wang, Utkarsh Ayachit, Shane Snyder, Robert B. Ross, Manish Parashar
2022	Communication-efficient Massively Distributed Connected Components. Sebastian Lamm, Peter Sanders
2022	Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption. Zhuoran Ji, Cho-Li Wang
2022	Coupling streaming AI and HPC ensembles to achieve 100-1000× faster biomolecular simulations. Alexander Brace, Igor Yakushin, Heng Ma, Anda Trifan, Todd S. Munson, Ian T. Foster, Arvind Ramanathan, Hyungro Lee, Matteo Turilli, Shantenu Jha
2022	DEAN: A Lightweight and Resource-efficient Blockchain Protocol for Reliable Edge Computing. Abdullah Al-Mamun, Haoting Shen, Dongfang Zhao
2022	DENOVA: Deduplication Extended NOVA File System. Hyungjoon Kwon, Yonghyeon Cho, Awais Khan, Yeohyeon Park, Youngjae Kim
2022	DFMan: A Graph-based Optimization of Dataflow Scheduling on High-Performance Computing Systems. Fahim Chowdhury, Francesco Di Natale, Adam Moody, Kathryn M. Mohror, Weikuan Yu
2022	DGSF: Disaggregated GPUs for Serverless Functions. Henrique Fingler, Zhiting Zhu, Esther Yoon, Zhipeng Jia, Emmett Witchel, Christopher J. Rossbach
2022	Degree-Aware Kernels for Computing Jaccard Weights on GPUs. Amro Alabsi Aljundi, Taha Atahan Akyildiz, Kamer Kaya
2022	Direct solution of larger coupled sparse/dense linear systems using low-rank compression on single-node multi-core machines in an industrial context. Emmanuel Agullo, Marek Felsöci, Guillaume Sylvand
2022	DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices. Xueyu Hou, Yongjie Guan, Tao Han, Ning Zhang
2022	Distributed-Memory Sparse Kernels for Machine Learning. Vivek Bharadwaj, Aydin Buluç, James Demmel
2022	Dynamic Computation Offloading for Green Things-Edge-Cloud Computing with Local Caching. Xianzhong Tian, Huixiao Meng, Yanjun Li, Pingting Miao, Pengcheng Xu
2022	Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics. Ben Tovar, Ben Lyons, Kelci Mohrman, Barry Sly-Delgado, Kevin Lannon, Douglas Thain
2022	Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture. Jing Wang, Chao Li, Taolei Wang, Lu Zhang, Pengyu Wang, Junyi Mei, Minyi Guo
2022	Exploiting Reduced Precision for GPU-based Time Series Mining. Yi Ju, Amir Raoofy, Dai Yang, Erwin Laure, Martin Schulz
2022	Exploring Efficient Microservice Level Parallelism. Xinkai Wang, Chao Li, Lu Zhang, Xiaofeng Hou, Quan Chen, Minyi Guo
2022	FAM-Graph: Graph Analytics on Disaggregated Memory. Daniel Zahka, Ada Gavrilovska
2022	Falcon: A Timestamp-based Protocol to Maximize the Cache Efficiency in the Distributed Shared Memory. Jin Zhang, Xiangyao Yu, Zhengwei Qi, Haibing Guan
2022	Fast Convergence to Fairness for Reduced Long Flow Tail Latency in Datacenter Networks. John Snyder, Alvin R. Lebeck
2022	Fast Parallel Bayesian Network Structure Learning. Jiantong Jiang, Zeyi Wen, Ajmal Mian
2022	Fast and High-Quality Influence Maximization on Multiple GPUs. Gökhan Göktürk, Kamer Kaya
2022	Fault-tolerant Snapshot Objects in Message Passing Systems. Vijay K. Garg, Saptaparni Kumar, Lewis Tseng, Xiong Zheng
2022	FlashWalker: An In-Storage Accelerator for Graph Random Walks. Fuping Niu, Jianhui Yue, Jiangqiu Shen, Xiaofei Liao, Haikun Liu, Hai Jin
2022	Frugal Decentralized Learning. Anne-Marie Kermarrec
2022	GSpecPal: Speculation-Centric Finite State Machine Parallelization on GPUs. Yuguang Wang, Robbie Watling, Junqiao Qiu, Zhenlin Wang
2022	Generalized Flow-Graph Programming Using Template Task-Graphs: Initial Implementation and Assessment. Joseph Schuchart, Poornima Nookala, Mohammad Mahdi Javanmard, Thomas Hérault, Edward F. Valeev, George Bosilca, Robert J. Harrison
2022	HACCS: Heterogeneity-Aware Clustered Client Selection for Accelerated Federated Learning. Joel Wolfrath, Nikhil Sreekumar, Dhruv Kumar, Yuanli Wang, Abhishek Chandra
2022	HDagg: Hybrid Aggregation of Loop-carried Dependence Iterations in Sparse Matrix Computations. Behrooz Zarebavani, Kazem Cheshmi, Bangtian Liu, Michelle Mills Strout, Maryam Mehri Dehnavi
2022	HRaft: Adaptive Erasure Coded Data Maintenance for Consensus in Distributed Networks. Yulei Jia, Guangping Xu, Chi Wan Sung, Salwa Mostafa, Yulei Wu
2022	HTS: A Threaded Multilevel Sparse Hybrid Solver. Joshua Dennis Booth
2022	High-order Line Graphs of Non-uniform Hypergraphs: Algorithms, Applications, and Experimental Analysis. Xu T. Liu, Jesun Firoz, Sinan G. Aksoy, Ilya Amburg, Andrew Lumsdaine, Cliff A. Joslyn, Brenda Praggastis, Assefaw H. Gebremedhin
2022	Hybrid Workload Scheduling on HPC Systems. Yuping Fan, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka
2022	I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication. Niels Gleinig, Maciej Besta, Torsten Hoefler
2022	In-Memory Indexed Caching for Distributed Data Processing. Alexandru Uta, Bogdan Ghit, Ankur Dave, Jan S. Rellermeyer, Peter Boncz
2022	Landau collision operator in the CUDA programming model applied to thermal quench plasmas. Mark F. Adams, Dylan P. Brennan, Matthew G. Knepley, Peng Wang
2022	Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization. Ali TehraniJamsaz, Mihail Popov, Akash Dutta, Emmanuelle Saillard, Ali Jannesari
2022	Lightning: Scaling the GPU Programming Model Beyond a Single GPU. Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Rob V. van Nieuwpoort
2022	MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions. Qihan Wang, Bin Ren, Jie Chen, Robert G. Edwards
2022	MLCNN: Cross-Layer Cooperative Optimization and Accelerator Architecture for Speeding Up Deep Learning Applications. Beilei Jiang, Xianwei Cheng, Sihai Tang, Xu Ma, Zhaochen Gu, Song Fu, Qing Yang, Mingxiong Liu
2022	Memory Access Granularity Aware Lossless Compression for GPUs. Sohan Lal, Manuel Renz, Julian Hartmer, Ben H. H. Juurlink
2022	Memory-Aware Scheduling of Tasks Sharing Data on Multiple GPUs with Dynamic Runtime Systems. Maxime Gonthier, Loris Marchal, Samuel Thibault
2022	Minerva: Rethinking Secure Architectures for the Era of Fabric-Attached Memory Architectures. Mazen Al-Wadi, Rujia Wang, David Mohaisen, Clayton Hughes, Simon David Hammond, Amro Awad
2022	Mixed Precision $s$-step Conjugate Gradient with Residual Replacement on GPUs. Ichitaro Yamazaki, Erin Carson, Brian Michael Kelley
2022	Mnemonic: A Parallel Subgraph Matching System for Streaming Graphs. Bibek Bhattarai, H. Howie Huang
2022	Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA. Hongkuan Zhou, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart
2022	Modeling Matrix Engines for Portability and Performance. Nicholai Tukanov, Rajalakshmi Srinivasaraghavan, José E. Moreira, Tze Meng Low
2022	Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast. Lucas Leandro Nesi, Lucas Mello Schnorr, Arnaud Legrand
2022	Neon: A Multi-GPU Programming Model for Grid-based Computations. Massimiliano Meneghin, Ahmed H. Mahmoud, Pradeep Kumar Jayaraman, Nigel J. W. Morris
2022	Next-Generation Local Time Stepping for the ADER-DG Finite Element Method. Alexander Breuer, Alexander Heinecke
2022	OmpSs@cloudFPGA: An FPGA Task-Based Programming Model with Message Passing. Juan Miguel De Haro Ruiz, Rubén Cano, Carlos Álvarez, Daniel Jiménez-González, Xavier Martorell, Eduard Ayguadé, Jesús Labarta, François Abel, Burkhard Ringlein, Beat Weiss
2022	On the Parallel Reconstruction from Pooled Data. Oliver Gebhard, Max Hahn-Klimroth, Dominik Kaaser, Philipp Loick
2022	Optimal Arbitrary Pattern Formation on a Grid by Asynchronous Autonomous Robots. Rory Hector, Gokarna Sharma, Ramachandran Vaidyanathan, Jerry L. Trahan
2022	Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs. Cody Rivera, Sheng Di, Jiannan Tian, Xiaodong Yu, Dingwen Tao, Franck Cappello
2022	P-ckpt: Coordinated Prioritized Checkpointing. Subhendu Behera, Lipeng Wan, Frank Mueller, Matthew Wolf, Scott Klasky
2022	PARSEC: PARallel Subgraph Enumeration in CUDA. Vibhor Dodeja, Mohammad Almasri, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu
2022	PINT: Parallel INTerval-Based Race Detector. Yifan Xu, Anchengcheng Zhou, Kunal Agrawal, I-Ting Angelina Lee
2022	ParaTreeT: A Fast, General Framework for Spatial Tree Traversal. Joseph Hutter, Justin Josef Szaday, Jaemin Choi, Simeng Liu, Laxmikant V. Kalé, Spencer Wallace, Thomas R. Quinn
2022	Parallel Approximations of the Tukey g-and-h Likelihoods and Predictions for Non-Gaussian Geostatistics. Sagnik Mondal, Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, David E. Keyes
2022	Parallel Global Edge Switching for the Uniform Sampling of Simple Graphs with Prescribed Degrees. Daniel Allendorf, Ulrich Meyer, Manuel Penschuck, Hung Tran
2022	Parallel Tensor Train Rounding using Gram SVD. Hussam Al Daas, Grey Ballard, Lawton Manning
2022	Parallel Vertex Cover Algorithms on GPUs. Peter Yamout, Karim Barada, Adnan Jaljuli, Amer E. Mouawad, Izzat El Hajj
2022	Parallel, Portable Algorithms for Distance-2 Maximal Independent Set and Graph Coarsening. Brian Michael Kelley, Sivasankaran Rajamanickam
2022	Parallelizing and Balancing Coupled DSMC/PIC for Large-scale Particle Simulations. Haozhong Qiu, Chuanfu Xu, Dali Li, Haoyu Wang, Jie Li, Zheng Wang
2022	PokéMem: Taming Wild Memory Consumers in Apache Spark. Minhyeok Kweun, Goeun Kim, Byungsoo Oh, Seongho Jung, Taegeon Um, Woo-Yeon Lee
2022	PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling. Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, Depei Qian
2022	Preprocessing Pipeline Optimization for Scientific Deep Learning Workloads. Khaled Z. Ibrahim, Leonid Oliker
2022	QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling. Jiuchen Shi, Jiawen Wang, Kaihua Fu, Quan Chen, Deze Zeng, Minyi Guo
2022	RLRP: High-Efficient Data Placement with Reinforcement Learning for Modern Distributed Storage Systems. Kai Lu, Nannan Zhao, Jiguang Wan, Changhong Fei, Wei Zhao, Tongliang Deng
2022	Resilience at Extreme Scale and Connections with Other Domains. Leonardo Bautista-Gomez
2022	Resource Utilization Aware Job Scheduling to Mitigate Performance Variability. Daniel Nichols, Aniruddha Marathe, Kathleen Shoga, Todd Gamblin, Abhinav Bhatele
2022	SALoBa: Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs. Seongyeon Park, Hajin Kim, Tanveer Ahmad, Nauman Ahmed, Zaid Al-Ars, H. Peter Hofstee, Youngsok Kim, Jinho Lee
2022	SFP: Service Function Chain Provision on Programmable Switches for Cloud Tenants. Hongyi Huang, Wenfei Wu, Yongchao He, Zehua Guo
2022	SPIDER: An Effective, Efficient and Robust Load Scheduler for Real-time Split Frame Rendering. Bingzheng Ma, Ziqiang Zhang, Yusen Li, Wentong Cai, Gang Wang, Xiaoguang Liu
2022	SSB-Tree: Making Persistent Memory B+- Trees Crash-Consistent and Concurrent by Lazy-Box. Tongliang Li, Haixia Wang, Airan Shao, Dongsheng Wang
2022	Scalable Low-Latency Inter-FPGA Networks. Kien Trung Pham, Truong Thao Nguyen, Hiroshi Yamaguchi, Yutaka Urino, Michihiro Koibuchi
2022	Scalable Multi-Versioning Ordered Key-Value Stores with Persistent Memory Support. Bogdan Nicolae
2022	Scaling and Selecting GPU Methods for All Pairs Shortest Paths (APSP) Computations. Yang Xia, Peng Jiang, Gagan Agrawal, Rajiv Ramnath
2022	Scheduling on Uniform and Unrelated Machines with Bipartite Incompatibility Graphs. Tytus Pikies, Hanna Furmanczyk
2022	SecFortress: Securing Hypervisor using Cross-layer Isolation. Qihang Zhou, Xiaoqi Jia, Shengzhi Zhang, Nan Jiang, Jiayun Chen, Weijuan Zhang
2022	Shared-Memory Parallel Algorithms for Fully Dynamic Maintenance of 2-Connected Components. Chirayu Anant Haryan, G. Ramakrishna, Kishore Kothapalli, Dip Sankar Banerjee
2022	Sparsity-Aware Tensor Decomposition. Süreyya Emre Kurt, Saurabh Raje, Aravind Sukumaran-Rajam, P. Sadayappan
2022	SpectralFly: Ramanujan Graphs as Flexible and Efficient Interconnection Networks. Stephen J. Young, Sinan G. Aksoy, Jesun Firoz, Roberto Gioiosa, Tobias Hagge, Mark Kempton, Juan Escobedo, Mark Raugas
2022	StencilMART: Predicting Optimization Selection for Stencil Computations across GPUs. Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian
2022	TEE-based decentralized recommender systems: The raw data sharing redemption. Akash Balasaheb Dhasade, Nevena Dresevic, Anne-Marie Kermarrec, Rafael Pires
2022	TagTree: Global Tagging Index with Efficient Querying for Time Series Databases. Jin Xue, Zhiqi Wang, Tianyu Wang, Zili Shao
2022	Task-based Acceleration of Bidirectional Recurrent Neural Networks on Multi-core Architectures. Robin Kumar Sharma, Marc Casas
2022	The Fast and Scalable MPI Application Launch of the Tianhe HPC system. Yiqin Dai, Yong Dong, Min Xie, Kai Lu, Ruibo Wang, Mingtian Shao, Juan Chen
2022	The Universal Gossip Fighter. Anastasiia Gorbunova, Rachid Guerraoui, Anne-Marie Kermarrec, Anastasiia Kucherenko, Rafaël Pinot
2022	Top-Down Performance Profiling on NVIDIA's GPUs. Alvaro Saiz, Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente
2022	Topological Modeling and Parallelization of Multidimensional Data on Microelectrode Arrays. Olamide Timothy Tawose, Bin Li, Lei Yang, Feng Yan, Dongfang Zhao
2022	Towards Distributed 2-Approximation Steiner Minimal Trees in Billion-edge Graphs. Tahsin Reza, Geoffrey Sanders, Roger Pearce
2022	Traffic-Optimal Virtual Network Function Placement and Migration in Dynamic Cloud Data Centers. Vincent Tran, Jingsong Sun, Bin Tang, Deng Pan
2022	Understanding the Design-Space of Sparse/Dense Multiphase GNN dataflows on Spatial Accelerators. Raveesh Garg, Eric Qin, Francisco Muñoz-Martínez, Robert Guirado, Akshay Jain, Sergi Abadal, José L. Abellán, Manuel E. Acacio, Eduard Alarcón, Sivasankaran Rajamanickam, Tushar Krishna
2022	Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study. Diogo Marques, Rafael Campos, Sergio Santander-Jiménez, Zakhar Matveev, Leonel Sousa, Aleksandar Ilic
2022	Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning. Truong Thao Nguyen, François Trahay, Jens Domke, Aleksandr Drozd, Emil Vatai, Jianwei Liao, Mohamed Wahib, Balazs Gerofi
2022	pFedGF: Enabling Personalized Federated Learning via Gradient Fusion. Xinghao Wu, Jianwei Niu, Xuefeng Liu, Tao Ren, Zhangmin Huang, Zhetao Li