PACT B

34 papers

YearTitle / Authors
202332nd International Conference on Parallel Architectures and Compilation Techniques, PACT 2023, Vienna, Austria, October 21-25, 2023
2023A CPU-FPGA Holistic Source-To-Source Compilation Approach for Partitioning and Optimizing C/C++ Applications.
Tiago Santos, João Bispo, João M. P. Cardoso
2023A Silicon Photonic Multi-DNN Accelerator.
Yuan Li, Ahmed Louri, Avinash Karanth
2023Accelerating Decision-Tree-Based Inference Through Adaptive Parallelization.
Jan van Lunteren
2023Architecture-Aware Currying.
Mahmut Taylan Kandemir, Gulsum Gudukbay Akbulut, Wonil Choi, Mustafa Karaköy
2023Automatic Algorithm-Based Fault Tolerance (AABFT) of Stencil Computations.
Louis Narmour, Steven Derrien, Sanjay V. Rajopadhye
2023Automatic Code Generation for High-Performance Graph Algorithms.
Zhen Peng, Rizwan A. Ashraf, Luanzheng Guo, Ruiqin Tian, Gokcen Kestor
2023Barad-dur: Near-Storage Accelerator for Training Large Graph Neural Networks.
Jiyoung An, Esmerald Aliaj, Sang-Woo Jun
2023Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs.
Diya Joseph, Juan L. Aragón, Joan-Manuel Parcerisa, Antonio González
2023CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions.
Sawan Singh, Josué Feliu, Manuel E. Acacio, Alexandra Jimborean, Alberto Ros
2023Drishyam: An Image is Worth a Data Prefetcher.
Shubdeep Mohapatra, Biswabandan Panda
2023Dynamic Allocation of Processor Cores to Graph Applications on Commodity Servers.
Lucia Pons, Julio Sahuquillo, Timothy M. Jones
2023G-Sparse: Compiler-Driven Acceleration for Generalized Sparse Computation for Graph Neural Networks on Modern GPUs.
Yue Jin, Chengying Huan, Heng Zhang, Yongchao Liu, Shuaiwen Leon Song, Rui Zhao, Yao Zhang, Changhua He, Wenguang Chen
2023GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs.
Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini
2023HugeGPT: Storing Guest Page Tables on Host Huge Pages to Accelerate Address Translation.
Weiwei Jia, Jiyuan Zhang, Jianchen Shan, Yiming Du, Xiaoning Ding, Tianyin Xu
2023INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core.
Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, Won Woo Ro
2023MBAPIS: Multi-Level Behavior Analysis Guided Program Interval Selection for Microarchitecture Studies.
Hongwei Cui, Yujie Cui, Honglan Zhan, Shuhao Liang, Xianhua Liu, Chun Yang, Xu Cheng
2023Parallelizing Maximal Clique Enumeration on GPUs.
Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu
2023Performance Characterization of Popular DNN Models on Out-of-Order CPUs.
Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente
2023PreFlush: Lightweight Hardware Prediction Mechanism for Cache Line Flush and Writeback.
Hussein Elnawawy, James Tuck, Gregory T. Byrd
2023QeiHaN: An Energy-Efficient DNN Accelerator that Leverages Log Quantization in NDP Architectures.
Bahareh Khabbazan, Marc Riera, Antonio González
2023Quickloop: An Efficient, FPGA-Accelerated Exploration of Parameterized DNN Accelerators.
Tayyeb Mahmood, Kashif Inayat, Jaeyong Chung
2023Retargeting Applications for Heterogeneous Systems with the Tribble Source-to-Source Framework.
Luís Miguel Sousa, João Bispo, Nuno Paulino
2023SDM: Sharing-Enabled Disaggregated Memory System with Cache Coherent Compute Express Link.
Hyokeun Lee, Kwanseok Choi, Hyuk-Jae Lee, Jaewoong Sim
2023SLIDEX: Sliding Window Extension for Image Processing.
Raúl Taranco, José-María Arnau, Antonio González
2023Separating Mechanism from Policy in STM.
Yaodong Sheng, Ahmed Hassan, Michael F. Spear
2023SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory.
Jinfan Chen, Juan Gómez-Luna, Izzat El Hajj, Yuxin Guo, Onur Mutlu
2023SparseFT: Sparsity-aware Fault Tolerance for Reliable CNN Inference on GPUs.
Gwangeun Byeon, Seungtae Lee, Seongwook Kim, Yongjun Kim, Prashant J. Nair, Seokin Hong
2023SpecCheck: A Tool for Systematic Identification of Vulnerable Transient Execution in gem5.
Zack McKevitt, Ashutosh Trivedi, Tamara Silbergleit Lehman
2023TSUNAMI: A GPU Implementation of the WFA Algorithm.
Giulia Gerometta, Alberto Zeni, Marco D. Santambrogio
2023Thread-to-Core Allocation in ARM Processors Building Synergistic Pairs.
Marta Navarro, Josué Feliu, Salvador Petit, María Engracia Gómez, Julio Sahuquillo
2023UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules.
Aditya Agrawal, V. Krishna Nandivada
2023Virtual PIM: Resource-Aware Dynamic DPU Allocation and Workload Scheduling Framework for Multi-DPU PIM Architecture.
Donghyeon Kim, Taehoon Kim, Inyong Hwang, Taehyeong Park, Hanjun Kim, Youngsok Kim, Yongjun Park
2023mlirSynth: Automatic, Retargetable Program Raising in Multi-Level IR Using Program Synthesis.
Alexander Brauckmann, Elizabeth Polgreen, Tobias Grosser, Michael F. P. O'Boyle