| 2022 | "Smarter" NICs for faster molecular dynamics: a case study. Sara Karamati, Clayton Hughes, K. Scott Hemmert, Ryan E. Grant, Whit Schonbein, Scott Levy, Thomas M. Conte, Jeffrey Young, Richard W. Vuduc |
| 2022 | 2022 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022, Lyon, France, May 30 - June 3, 2022 |
| 2022 | A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility. Jialin Li, Huang Ye, Shaobo Tian, Xinyuan Li, Jian Zhang |
| 2022 | A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization. Qinglei Cao, Rabab Alomairy, Yu Pei, George Bosilca, Hatem Ltaief, David E. Keyes, Jack J. Dongarra |
| 2022 | A General Offloading Approach for Near-DRAM Processing-In-Memory Architectures. Dan Chen, Hai Jin, Long Zheng, Yu Huang, Pengcheng Yao, Chuangyi Gui, Qinggang Wang, Haifeng Liu, Haiheng He, Xiaofei Liao, Ran Zheng |
| 2022 | A Quantitative Study of the Spatiotemporal I/O Burstiness of HPC Application. Wenxiang Yang, Xiangke Liao, Dezun Dong, Jie Yu |
| 2022 | A Swap Dominated Tensor Re-Generation Strategy for Training Deep Learning Models. Lijie Wen, Zan Zong, Li Lin, Leilei Lin |
| 2022 | A scalable adaptive-matrix SPMV for heterogeneous architectures. Han D. Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanian, Robert M. Kirby, Hari Sundar |
| 2022 | A self-stabilizing 2-minimal dominating set algorithm based on loop composition in networks of girth at least 7. Syohei Maruyama, Yuichi Sudo, Sayaka Kamei, Hirotsugu Kakugawa |
| 2022 | Accelerating Encrypted Computing on Intel GPUs. Yujia Zhai, Mohannad Ibrahim, Yiqin Qiu, Fabian Boemer, Zizhong Chen, Alexey Titov, Alexander Lyashevsky |
| 2022 | Accuracy vs. Cost in Parallel Fixed-Precision Low-Rank Approximations of Sparse Matrices. Robert Ernstbrunner, Viktoria Mayer, Wilfried N. Gansterer |
| 2022 | Adaptive Verifiable Coded Computing: Towards Fast, Secure and Private Distributed Machine Learning. Tingting Tang, Ramy E. Ali, Hanieh Hashemi, Tynan Gangwani, Salman Avestimehr, Murali Annavaram |
| 2022 | Alias-Chain: Improving Blockchain Scalability via Exploring Content Locality among Transactions. Jintong Liu, Shenggang Wan, Xubin He |
| 2022 | An Efficient Block Validation Mechanism for UTXO-based Blockchains. Xiaohai Dai, Bin Xiao, Jiang Xiao, Hai Jin |
| 2022 | An Efficient Vectorization Scheme for Stencil Computation. Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao |
| 2022 | An End-to-end and Adaptive I/O Optimization Tool for Modern HPC Storage Systems. Bin Yang, Yanliang Zou, Weiguo Liu, Wei Xue |
| 2022 | An Integral-equation-oriented Vectorized SpMV Algorithm and its Application on CT Imaging Reconstruction. Weicai Ye, Chenghuan Huang, Jiasheng Huang, Jiajun Li, Yao Lu, Ying Jiang |
| 2022 | Archpipe: Fast and Flexible Pipelined Erasure-coded Archival Scheme for Heterogeneous Networks. Bin Xu, Jianzhong Huang, Xiao Qin, Qiang Cao, Yuanyuan Dong, Weikang Kong |
| 2022 | As easy as ABC: Optimal (A)ccountable (B)yzantine (C)onsensus is easy! Pierre Civit, Seth Gilbert, Vincent Gramoli, Rachid Guerraoui, Jovan Komatovic |
| 2022 | Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching. András Strausz, Flavio Vella, Salvatore Di Girolamo, Maciej Besta, Torsten Hoefler |
| 2022 | AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning. Siddharth Singh, Abhinav Bhatele |
| 2022 | Batched sparse iterative solvers on GPU for the collision operator for fusion plasma simulations. Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt |
| 2022 | Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU. Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li |
| 2022 | Booster: An Accelerator for Gradient Boosting Decision Trees Training and Inference. Mingxuan He, Mithuna Thottethodi, T. N. Vijaykumar |
| 2022 | Bounding the Flow Time in Online Scheduling with Structured Processing Sets. Louis-Claude Canon, Anthony Dugois, Loris Marchal |
| 2022 | CSC: Collaborative System Configuration for I/O-Intensive Applications in Multi-Tenant Clouds. Haowei Huang, Pu Pang, Quan Chen, Jieru Zhao, Wenli Zheng, Minyi Guo |
| 2022 | CSMV: A Highly Scalable Multi-Versioned Software Transactional Memory for GPUs. Diogo Nunes, Daniel Castro, Paolo Romano |
| 2022 | Challenges and Opportunities in Designing High-Performance and Scalable Middleware for HPC and AI: Past, Present, and Future. Dhabaleswar K. Panda |
| 2022 | Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution. Johannes Doerfert, Atmn Patel, Joseph Huber, Shilei Tian, Jose Manuel Monsalve Diaz, Barbara M. Chapman, Giorgis Georgakoudis |
| 2022 | Coloring the Vertices of 9-pt and 27-pt Stencils with Intervals. Dante Durrman, Erik Saule |
| 2022 | Colza: Enabling Elastic In Situ Visualization for High-performance Computing Simulations. Matthieu Dorier, Zhe Wang, Utkarsh Ayachit, Shane Snyder, Robert B. Ross, Manish Parashar |
| 2022 | Communication-efficient Massively Distributed Connected Components. Sebastian Lamm, Peter Sanders |
| 2022 | Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption. Zhuoran Ji, Cho-Li Wang |
| 2022 | Coupling streaming AI and HPC ensembles to achieve 100-1000× faster biomolecular simulations. Alexander Brace, Igor Yakushin, Heng Ma, Anda Trifan, Todd S. Munson, Ian T. Foster, Arvind Ramanathan, Hyungro Lee, Matteo Turilli, Shantenu Jha |
| 2022 | DEAN: A Lightweight and Resource-efficient Blockchain Protocol for Reliable Edge Computing. Abdullah Al-Mamun, Haoting Shen, Dongfang Zhao |
| 2022 | DENOVA: Deduplication Extended NOVA File System. Hyungjoon Kwon, Yonghyeon Cho, Awais Khan, Yeohyeon Park, Youngjae Kim |
| 2022 | DFMan: A Graph-based Optimization of Dataflow Scheduling on High-Performance Computing Systems. Fahim Chowdhury, Francesco Di Natale, Adam Moody, Kathryn M. Mohror, Weikuan Yu |
| 2022 | DGSF: Disaggregated GPUs for Serverless Functions. Henrique Fingler, Zhiting Zhu, Esther Yoon, Zhipeng Jia, Emmett Witchel, Christopher J. Rossbach |
| 2022 | Degree-Aware Kernels for Computing Jaccard Weights on GPUs. Amro Alabsi Aljundi, Taha Atahan Akyildiz, Kamer Kaya |
| 2022 | Direct solution of larger coupled sparse/dense linear systems using low-rank compression on single-node multi-core machines in an industrial context. Emmanuel Agullo, Marek Felsöci, Guillaume Sylvand |
| 2022 | DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices. Xueyu Hou, Yongjie Guan, Tao Han, Ning Zhang |
| 2022 | Distributed-Memory Sparse Kernels for Machine Learning. Vivek Bharadwaj, Aydin Buluç, James Demmel |
| 2022 | Dynamic Computation Offloading for Green Things-Edge-Cloud Computing with Local Caching. Xianzhong Tian, Huixiao Meng, Yanjun Li, Pingting Miao, Pengcheng Xu |
| 2022 | Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics. Ben Tovar, Ben Lyons, Kelci Mohrman, Barry Sly-Delgado, Kevin Lannon, Douglas Thain |
| 2022 | Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture. Jing Wang, Chao Li, Taolei Wang, Lu Zhang, Pengyu Wang, Junyi Mei, Minyi Guo |
| 2022 | Exploiting Reduced Precision for GPU-based Time Series Mining. Yi Ju, Amir Raoofy, Dai Yang, Erwin Laure, Martin Schulz |
| 2022 | Exploring Efficient Microservice Level Parallelism. Xinkai Wang, Chao Li, Lu Zhang, Xiaofeng Hou, Quan Chen, Minyi Guo |
| 2022 | FAM-Graph: Graph Analytics on Disaggregated Memory. Daniel Zahka, Ada Gavrilovska |
| 2022 | Falcon: A Timestamp-based Protocol to Maximize the Cache Efficiency in the Distributed Shared Memory. Jin Zhang, Xiangyao Yu, Zhengwei Qi, Haibing Guan |
| 2022 | Fast Convergence to Fairness for Reduced Long Flow Tail Latency in Datacenter Networks. John Snyder, Alvin R. Lebeck |
| 2022 | Fast Parallel Bayesian Network Structure Learning. Jiantong Jiang, Zeyi Wen, Ajmal Mian |
| 2022 | Fast and High-Quality Influence Maximization on Multiple GPUs. Gökhan Göktürk, Kamer Kaya |
| 2022 | Fault-tolerant Snapshot Objects in Message Passing Systems. Vijay K. Garg, Saptaparni Kumar, Lewis Tseng, Xiong Zheng |
| 2022 | FlashWalker: An In-Storage Accelerator for Graph Random Walks. Fuping Niu, Jianhui Yue, Jiangqiu Shen, Xiaofei Liao, Haikun Liu, Hai Jin |
| 2022 | Frugal Decentralized Learning. Anne-Marie Kermarrec |
| 2022 | GSpecPal: Speculation-Centric Finite State Machine Parallelization on GPUs. Yuguang Wang, Robbie Watling, Junqiao Qiu, Zhenlin Wang |
| 2022 | Generalized Flow-Graph Programming Using Template Task-Graphs: Initial Implementation and Assessment. Joseph Schuchart, Poornima Nookala, Mohammad Mahdi Javanmard, Thomas Hérault, Edward F. Valeev, George Bosilca, Robert J. Harrison |
| 2022 | HACCS: Heterogeneity-Aware Clustered Client Selection for Accelerated Federated Learning. Joel Wolfrath, Nikhil Sreekumar, Dhruv Kumar, Yuanli Wang, Abhishek Chandra |
| 2022 | HDagg: Hybrid Aggregation of Loop-carried Dependence Iterations in Sparse Matrix Computations. Behrooz Zarebavani, Kazem Cheshmi, Bangtian Liu, Michelle Mills Strout, Maryam Mehri Dehnavi |
| 2022 | HRaft: Adaptive Erasure Coded Data Maintenance for Consensus in Distributed Networks. Yulei Jia, Guangping Xu, Chi Wan Sung, Salwa Mostafa, Yulei Wu |
| 2022 | HTS: A Threaded Multilevel Sparse Hybrid Solver. Joshua Dennis Booth |
| 2022 | High-order Line Graphs of Non-uniform Hypergraphs: Algorithms, Applications, and Experimental Analysis. Xu T. Liu, Jesun Firoz, Sinan G. Aksoy, Ilya Amburg, Andrew Lumsdaine, Cliff A. Joslyn, Brenda Praggastis, Assefaw H. Gebremedhin |
| 2022 | Hybrid Workload Scheduling on HPC Systems. Yuping Fan, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka |
| 2022 | I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication. Niels Gleinig, Maciej Besta, Torsten Hoefler |
| 2022 | In-Memory Indexed Caching for Distributed Data Processing. Alexandru Uta, Bogdan Ghit, Ankur Dave, Jan S. Rellermeyer, Peter Boncz |
| 2022 | Landau collision operator in the CUDA programming model applied to thermal quench plasmas. Mark F. Adams, Dylan P. Brennan, Matthew G. Knepley, Peng Wang |
| 2022 | Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization. Ali TehraniJamsaz, Mihail Popov, Akash Dutta, Emmanuelle Saillard, Ali Jannesari |
| 2022 | Lightning: Scaling the GPU Programming Model Beyond a Single GPU. Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Rob V. van Nieuwpoort |
| 2022 | MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions. Qihan Wang, Bin Ren, Jie Chen, Robert G. Edwards |
| 2022 | MLCNN: Cross-Layer Cooperative Optimization and Accelerator Architecture for Speeding Up Deep Learning Applications. Beilei Jiang, Xianwei Cheng, Sihai Tang, Xu Ma, Zhaochen Gu, Song Fu, Qing Yang, Mingxiong Liu |
| 2022 | Memory Access Granularity Aware Lossless Compression for GPUs. Sohan Lal, Manuel Renz, Julian Hartmer, Ben H. H. Juurlink |
| 2022 | Memory-Aware Scheduling of Tasks Sharing Data on Multiple GPUs with Dynamic Runtime Systems. Maxime Gonthier, Loris Marchal, Samuel Thibault |
| 2022 | Minerva: Rethinking Secure Architectures for the Era of Fabric-Attached Memory Architectures. Mazen Al-Wadi, Rujia Wang, David Mohaisen, Clayton Hughes, Simon David Hammond, Amro Awad |
| 2022 | Mixed Precision $s$-step Conjugate Gradient with Residual Replacement on GPUs. Ichitaro Yamazaki, Erin Carson, Brian Michael Kelley |
| 2022 | Mnemonic: A Parallel Subgraph Matching System for Streaming Graphs. Bibek Bhattarai, H. Howie Huang |
| 2022 | Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA. Hongkuan Zhou, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart |
| 2022 | Modeling Matrix Engines for Portability and Performance. Nicholai Tukanov, Rajalakshmi Srinivasaraghavan, José E. Moreira, Tze Meng Low |
| 2022 | Multi-Phase Task-Based HPC Applications: Quickly Learning how to Run Fast. Lucas Leandro Nesi, Lucas Mello Schnorr, Arnaud Legrand |
| 2022 | Neon: A Multi-GPU Programming Model for Grid-based Computations. Massimiliano Meneghin, Ahmed H. Mahmoud, Pradeep Kumar Jayaraman, Nigel J. W. Morris |
| 2022 | Next-Generation Local Time Stepping for the ADER-DG Finite Element Method. Alexander Breuer, Alexander Heinecke |
| 2022 | OmpSs@cloudFPGA: An FPGA Task-Based Programming Model with Message Passing. Juan Miguel De Haro Ruiz, Rubén Cano, Carlos Álvarez, Daniel Jiménez-González, Xavier Martorell, Eduard Ayguadé, Jesús Labarta, François Abel, Burkhard Ringlein, Beat Weiss |
| 2022 | On the Parallel Reconstruction from Pooled Data. Oliver Gebhard, Max Hahn-Klimroth, Dominik Kaaser, Philipp Loick |
| 2022 | Optimal Arbitrary Pattern Formation on a Grid by Asynchronous Autonomous Robots. Rory Hector, Gokarna Sharma, Ramachandran Vaidyanathan, Jerry L. Trahan |
| 2022 | Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs. Cody Rivera, Sheng Di, Jiannan Tian, Xiaodong Yu, Dingwen Tao, Franck Cappello |
| 2022 | P-ckpt: Coordinated Prioritized Checkpointing. Subhendu Behera, Lipeng Wan, Frank Mueller, Matthew Wolf, Scott Klasky |
| 2022 | PARSEC: PARallel Subgraph Enumeration in CUDA. Vibhor Dodeja, Mohammad Almasri, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu |
| 2022 | PINT: Parallel INTerval-Based Race Detector. Yifan Xu, Anchengcheng Zhou, Kunal Agrawal, I-Ting Angelina Lee |
| 2022 | ParaTreeT: A Fast, General Framework for Spatial Tree Traversal. Joseph Hutter, Justin Josef Szaday, Jaemin Choi, Simeng Liu, Laxmikant V. Kalé, Spencer Wallace, Thomas R. Quinn |
| 2022 | Parallel Approximations of the Tukey g-and-h Likelihoods and Predictions for Non-Gaussian Geostatistics. Sagnik Mondal, Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, David E. Keyes |
| 2022 | Parallel Global Edge Switching for the Uniform Sampling of Simple Graphs with Prescribed Degrees. Daniel Allendorf, Ulrich Meyer, Manuel Penschuck, Hung Tran |
| 2022 | Parallel Tensor Train Rounding using Gram SVD. Hussam Al Daas, Grey Ballard, Lawton Manning |
| 2022 | Parallel Vertex Cover Algorithms on GPUs. Peter Yamout, Karim Barada, Adnan Jaljuli, Amer E. Mouawad, Izzat El Hajj |
| 2022 | Parallel, Portable Algorithms for Distance-2 Maximal Independent Set and Graph Coarsening. Brian Michael Kelley, Sivasankaran Rajamanickam |
| 2022 | Parallelizing and Balancing Coupled DSMC/PIC for Large-scale Particle Simulations. Haozhong Qiu, Chuanfu Xu, Dali Li, Haoyu Wang, Jie Li, Zheng Wang |
| 2022 | PokéMem: Taming Wild Memory Consumers in Apache Spark. Minhyeok Kweun, Goeun Kim, Byungsoo Oh, Seongho Jung, Taegeon Um, Woo-Yeon Lee |
| 2022 | PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling. Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, Depei Qian |
| 2022 | Preprocessing Pipeline Optimization for Scientific Deep Learning Workloads. Khaled Z. Ibrahim, Leonid Oliker |
| 2022 | QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling. Jiuchen Shi, Jiawen Wang, Kaihua Fu, Quan Chen, Deze Zeng, Minyi Guo |
| 2022 | RLRP: High-Efficient Data Placement with Reinforcement Learning for Modern Distributed Storage Systems. Kai Lu, Nannan Zhao, Jiguang Wan, Changhong Fei, Wei Zhao, Tongliang Deng |
| 2022 | Resilience at Extreme Scale and Connections with Other Domains. Leonardo Bautista-Gomez |
| 2022 | Resource Utilization Aware Job Scheduling to Mitigate Performance Variability. Daniel Nichols, Aniruddha Marathe, Kathleen Shoga, Todd Gamblin, Abhinav Bhatele |
| 2022 | SALoBa: Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs. Seongyeon Park, Hajin Kim, Tanveer Ahmad, Nauman Ahmed, Zaid Al-Ars, H. Peter Hofstee, Youngsok Kim, Jinho Lee |
| 2022 | SFP: Service Function Chain Provision on Programmable Switches for Cloud Tenants. Hongyi Huang, Wenfei Wu, Yongchao He, Zehua Guo |
| 2022 | SPIDER: An Effective, Efficient and Robust Load Scheduler for Real-time Split Frame Rendering. Bingzheng Ma, Ziqiang Zhang, Yusen Li, Wentong Cai, Gang Wang, Xiaoguang Liu |
| 2022 | SSB-Tree: Making Persistent Memory B+-Trees Crash-Consistent and Concurrent by Lazy-Box. Tongliang Li, Haixia Wang, Airan Shao, Dongsheng Wang |
| 2022 | Scalable Low-Latency Inter-FPGA Networks. Kien Trung Pham, Truong Thao Nguyen, Hiroshi Yamaguchi, Yutaka Urino, Michihiro Koibuchi |
| 2022 | Scalable Multi-Versioning Ordered Key-Value Stores with Persistent Memory Support. Bogdan Nicolae |
| 2022 | Scaling and Selecting GPU Methods for All Pairs Shortest Paths (APSP) Computations. Yang Xia, Peng Jiang, Gagan Agrawal, Rajiv Ramnath |
| 2022 | Scheduling on Uniform and Unrelated Machines with Bipartite Incompatibility Graphs. Tytus Pikies, Hanna Furmanczyk |
| 2022 | SecFortress: Securing Hypervisor using Cross-layer Isolation. Qihang Zhou, Xiaoqi Jia, Shengzhi Zhang, Nan Jiang, Jiayun Chen, Weijuan Zhang |
| 2022 | Shared-Memory Parallel Algorithms for Fully Dynamic Maintenance of 2-Connected Components. Chirayu Anant Haryan, G. Ramakrishna, Kishore Kothapalli, Dip Sankar Banerjee |
| 2022 | Sparsity-Aware Tensor Decomposition. Süreyya Emre Kurt, Saurabh Raje, Aravind Sukumaran-Rajam, P. Sadayappan |
| 2022 | SpectralFly: Ramanujan Graphs as Flexible and Efficient Interconnection Networks. Stephen J. Young, Sinan G. Aksoy, Jesun Firoz, Roberto Gioiosa, Tobias Hagge, Mark Kempton, Juan Escobedo, Mark Raugas |
| 2022 | StencilMART: Predicting Optimization Selection for Stencil Computations across GPUs. Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian |
| 2022 | TEE-based decentralized recommender systems: The raw data sharing redemption. Akash Balasaheb Dhasade, Nevena Dresevic, Anne-Marie Kermarrec, Rafael Pires |
| 2022 | TagTree: Global Tagging Index with Efficient Querying for Time Series Databases. Jin Xue, Zhiqi Wang, Tianyu Wang, Zili Shao |
| 2022 | Task-based Acceleration of Bidirectional Recurrent Neural Networks on Multi-core Architectures. Robin Kumar Sharma, Marc Casas |
| 2022 | The Fast and Scalable MPI Application Launch of the Tianhe HPC system. Yiqin Dai, Yong Dong, Min Xie, Kai Lu, Ruibo Wang, Mingtian Shao, Juan Chen |
| 2022 | The Universal Gossip Fighter. Anastasiia Gorbunova, Rachid Guerraoui, Anne-Marie Kermarrec, Anastasiia Kucherenko, Rafaël Pinot |
| 2022 | Top-Down Performance Profiling on NVIDIA's GPUs. Alvaro Saiz, Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente |
| 2022 | Topological Modeling and Parallelization of Multidimensional Data on Microelectrode Arrays. Olamide Timothy Tawose, Bin Li, Lei Yang, Feng Yan, Dongfang Zhao |
| 2022 | Towards Distributed 2-Approximation Steiner Minimal Trees in Billion-edge Graphs. Tahsin Reza, Geoffrey Sanders, Roger Pearce |
| 2022 | Traffic-Optimal Virtual Network Function Placement and Migration in Dynamic Cloud Data Centers. Vincent Tran, Jingsong Sun, Bin Tang, Deng Pan |
| 2022 | Understanding the Design-Space of Sparse/Dense Multiphase GNN dataflows on Spatial Accelerators. Raveesh Garg, Eric Qin, Francisco Muñoz-Martínez, Robert Guirado, Akshay Jain, Sergi Abadal, José L. Abellán, Manuel E. Acacio, Eduard Alarcón, Sivasankaran Rajamanickam, Tushar Krishna |
| 2022 | Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study. Diogo Marques, Rafael Campos, Sergio Santander-Jiménez, Zakhar Matveev, Leonel Sousa, Aleksandar Ilic |
| 2022 | Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning. Truong Thao Nguyen, François Trahay, Jens Domke, Aleksandr Drozd, Emil Vatai, Jianwei Liao, Mohamed Wahib, Balazs Gerofi |
| 2022 | pFedGF: Enabling Personalized Federated Learning via Gradient Fusion. Xinghao Wu, Jianwei Niu, Xuefeng Liu, Tao Ren, Zhangmin Huang, Zhetao Li |