| 2023 | 32nd International Conference on Parallel Architectures and Compilation Techniques, PACT 2023, Vienna, Austria, October 21-25, 2023 |
| 2023 | A CPU-FPGA Holistic Source-To-Source Compilation Approach for Partitioning and Optimizing C/C++ Applications. Tiago Santos, João Bispo, João M. P. Cardoso |
| 2023 | A Silicon Photonic Multi-DNN Accelerator. Yuan Li, Ahmed Louri, Avinash Karanth |
| 2023 | Accelerating Decision-Tree-Based Inference Through Adaptive Parallelization. Jan van Lunteren |
| 2023 | Architecture-Aware Currying. Mahmut Taylan Kandemir, Gulsum Gudukbay Akbulut, Wonil Choi, Mustafa Karaköy |
| 2023 | Automatic Algorithm-Based Fault Tolerance (AABFT) of Stencil Computations. Louis Narmour, Steven Derrien, Sanjay V. Rajopadhye |
| 2023 | Automatic Code Generation for High-Performance Graph Algorithms. Zhen Peng, Rizwan A. Ashraf, Luanzheng Guo, Ruiqin Tian, Gokcen Kestor |
| 2023 | Barad-dur: Near-Storage Accelerator for Training Large Graph Neural Networks. Jiyoung An, Esmerald Aliaj, Sang-Woo Jun |
| 2023 | Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs. Diya Joseph, Juan L. Aragón, Joan-Manuel Parcerisa, Antonio González |
| 2023 | CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions. Sawan Singh, Josué Feliu, Manuel E. Acacio, Alexandra Jimborean, Alberto Ros |
| 2023 | Drishyam: An Image is Worth a Data Prefetcher. Shubdeep Mohapatra, Biswabandan Panda |
| 2023 | Dynamic Allocation of Processor Cores to Graph Applications on Commodity Servers. Lucia Pons, Julio Sahuquillo, Timothy M. Jones |
| 2023 | G-Sparse: Compiler-Driven Acceleration for Generalized Sparse Computation for Graph Neural Networks on Modern GPUs. Yue Jin, Chengying Huan, Heng Zhang, Yongchao Liu, Shuaiwen Leon Song, Rui Zhao, Yao Zhang, Changhua He, Wenguang Chen |
| 2023 | GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs. Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini |
| 2023 | HugeGPT: Storing Guest Page Tables on Host Huge Pages to Accelerate Address Translation. Weiwei Jia, Jiyuan Zhang, Jianchen Shan, Yiming Du, Xiaoning Ding, Tianyin Xu |
| 2023 | INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core. Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, Won Woo Ro |
| 2023 | MBAPIS: Multi-Level Behavior Analysis Guided Program Interval Selection for Microarchitecture Studies. Hongwei Cui, Yujie Cui, Honglan Zhan, Shuhao Liang, Xianhua Liu, Chun Yang, Xu Cheng |
| 2023 | Parallelizing Maximal Clique Enumeration on GPUs. Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu |
| 2023 | Performance Characterization of Popular DNN Models on Out-of-Order CPUs. Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente |
| 2023 | PreFlush: Lightweight Hardware Prediction Mechanism for Cache Line Flush and Writeback. Hussein Elnawawy, James Tuck, Gregory T. Byrd |
| 2023 | QeiHaN: An Energy-Efficient DNN Accelerator that Leverages Log Quantization in NDP Architectures. Bahareh Khabbazan, Marc Riera, Antonio González |
| 2023 | Quickloop: An Efficient, FPGA-Accelerated Exploration of Parameterized DNN Accelerators. Tayyeb Mahmood, Kashif Inayat, Jaeyong Chung |
| 2023 | Retargeting Applications for Heterogeneous Systems with the Tribble Source-to-Source Framework. Luís Miguel Sousa, João Bispo, Nuno Paulino |
| 2023 | SDM: Sharing-Enabled Disaggregated Memory System with Cache Coherent Compute Express Link. Hyokeun Lee, Kwanseok Choi, Hyuk-Jae Lee, Jaewoong Sim |
| 2023 | SLIDEX: Sliding Window Extension for Image Processing. Raúl Taranco, José-María Arnau, Antonio González |
| 2023 | Separating Mechanism from Policy in STM. Yaodong Sheng, Ahmed Hassan, Michael F. Spear |
| 2023 | SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory. Jinfan Chen, Juan Gómez-Luna, Izzat El Hajj, Yuxin Guo, Onur Mutlu |
| 2023 | SparseFT: Sparsity-aware Fault Tolerance for Reliable CNN Inference on GPUs. Gwangeun Byeon, Seungtae Lee, Seongwook Kim, Yongjun Kim, Prashant J. Nair, Seokin Hong |
| 2023 | SpecCheck: A Tool for Systematic Identification of Vulnerable Transient Execution in gem5. Zack McKevitt, Ashutosh Trivedi, Tamara Silbergleit Lehman |
| 2023 | TSUNAMI: A GPU Implementation of the WFA Algorithm. Giulia Gerometta, Alberto Zeni, Marco D. Santambrogio |
| 2023 | Thread-to-Core Allocation in ARM Processors Building Synergistic Pairs. Marta Navarro, Josué Feliu, Salvador Petit, María Engracia Gómez, Julio Sahuquillo |
| 2023 | UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules. Aditya Agrawal, V. Krishna Nandivada |
| 2023 | Virtual PIM: Resource-Aware Dynamic DPU Allocation and Workload Scheduling Framework for Multi-DPU PIM Architecture. Donghyeon Kim, Taehoon Kim, Inyong Hwang, Taehyeong Park, Hanjun Kim, Youngsok Kim, Yongjun Park |
| 2023 | mlirSynth: Automatic, Retargetable Program Raising in Multi-Level IR Using Program Synthesis. Alexander Brauckmann, Elizabeth Polgreen, Tobias Grosser, Michael F. P. O'Boyle |