CGO A

39 papers

Year Title / Authors
2024 A Framework for Fine-Grained Synchronization of Dependent GPU Kernels.
Abhinav Jangda, Saeed Maleki, Maryam Mehri Dehnavi, Madan Musuvathi, Olli Saarikivi
2024 A System-Level Dynamic Binary Translator Using Automatically-Learned Translation Rules.
Jinhu Jiang, Chaoyi Liang, Rongchao Dong, Zhaohui Yang, Zhongjun Zhou, Wenwen Wang, Pen-Chung Yew, Weihua Zhang
2024 A Tensor Algebra Compiler for Sparse Differentiation.
Amir Shaikhha, Mathieu Huot, Shideh Hashemian
2024 AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators.
Nicolas Bohm Agostini, Jude Haris, Perry Gibson, Malith Jayaweera, Norm Rubin, Antonino Tumeo, José L. Abellán, José Cano, David R. Kaeli
2024 AskIt: Unified Programming Interface for Programming with Large Language Models.
Katsumi Okuda, Saman P. Amarasinghe
2024 BEC: Bit-Level Static Analysis for Reliability against Soft Errors.
Yousun Ko, Bernd Burgstaller
2024 Boosting the Performance of Multi-Solver IFDS Algorithms with Flow-Sensitivity Optimizations.
Haofeng Li, Jie Lu, Haining Meng, Liqing Cao, Lian Li, Lin Gao
2024 Compile-Time Analysis of Compiler Frameworks for Query Compilation.
Alexis Engelke, Tobias Schwarz
2024 Compiler Testing with Relaxed Memory Models.
Luke Geeson, Lee Smith
2024 DrPy: Pinpointing Inefficient Memory Usage in Multi-Layer Python Applications.
Jinku Cui, Qidong Zhao, Yueming Hao, Xu Liu
2024 EasyTracker: A Python Library for Controlling and Inspecting Program Execution.
Théo Barollet, Christophe Guillon, Manuel Selva, François Broquedis, Florent Bouchez-Tichadou, Fabrice Rastello
2024 EasyView: Bringing Performance Profiles into Integrated Development Environments.
Qidong Zhao, Milind Chabbi, Xu Liu
2024 Ecmas: Efficient Circuit Mapping and Scheduling for Surface Code.
Mingzheng Zhu, Hao Fu, Jun Wu, Chi Zhang, Wei Xie, Xiang-Yang Li
2024 Enabling Fine-Grained Incremental Builds by Making Compiler Stateful.
Ruobing Han, Jisheng Zhao, Hyesoon Kim
2024 Energy-Aware Tile Size Selection for Affine Programs on GPUs.
Malith Jayaweera, Martin Kong, Yanzhi Wang, David R. Kaeli
2024 Enhancing Performance Through Control-Flow Unmerging and Loop Unrolling on GPUs.
Alnis Murtovi, Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, Ignacio Laguna, Bernhard Steffen
2024 Experiences Building an MLIR-Based SYCL Compiler.
Ettore Tiotto, Victor Perez, Whitney Tsang, Lukas Sommer, Julian Oppermann, Victor Lomüller, Mehdi Goli, James Brodman
2024 High-Throughput, Formal-Methods-Assisted Fuzzing for LLVM.
Yuyou Fan, John Regehr
2024 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2024, Edinburgh, United Kingdom, March 2-6, 2024
Tobias Grosser, Christophe Dubach, Michel Steuwer, Jingling Xue, Guilherme Ottoni, Fernando Magno Quintão Pereira
2024 Instruction Scheduling for the GPU on the GPU.
Ghassan Shobaki, Pinar Muyan-Özçelik, Josh Hutton, Bruce Linck, Vladislav Malyshenko, Austin Kerbow, Ronaldo Ramirez-Ortega, Vahl Scott Gordon
2024 JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication.
Qiang Fu, Thomas B. Rolinger, H. Howie Huang
2024 Latent Idiom Recognition for a Minimalist Functional Array Language Using Equality Saturation.
Jonathan Van der Cruysse, Christophe Dubach
2024 One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution.
Luisa Cicolini, Filippo Carloni, Marco D. Santambrogio, Davide Conficconi
2024 OptiWISE: Combining Sampling and Instrumentation for Granular CPI Analysis.
Yuxin Guo, Alexandra W. Chadwick, Márton Erdos, Utpal Bora, Ilias Vougioukas, Giacomo Gabrielli, Timothy M. Jones
2024 PolyTOPS: Reconfigurable and Flexible Polyhedral Scheduler.
Gianpietro Consolaro, Zhen Zhang, Harenome Razanajato, Nelson Lossing, Nassim Tchoulak, Adilla Susungi, Artur Cesar Araujo Alves, Renwei Zhang, Denis Barthou, Corinne Ancourt, Cédric Bastoul
2024 PresCount: Effective Register Allocation for Bank Conflict Reduction.
Xiaofeng Guan, Hao Zhou, Guoqing Bao, Handong Li, Liang Zhu, Jianguo Yao
2024 Representing Data Collections in an SSA Form.
Tommy McMichen, Nathan Greiner, Peter Zhong, Federico Sossai, Atmn Patel, Simone Campanoni
2024 Retargeting and Respecializing GPU Workloads for Performance Portability.
Ivan R. Ivanov, Oleksandr Zinenko, Jens Domke, Toshio Endo, William S. Moses
2024 Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentation.
Wenlei He, Hongtao Yu, Lei Wang, Taewook Oh
2024 Revealing Compiler Heuristics Through Automated Discovery and Optimization.
Volker Seeker, Chris Cummins, Murray Cole, Björn Franke, Kim M. Hazelwood, Hugh Leather
2024 SCHEMATIC: Compile-Time Checkpoint Placement and Memory Allocation for Intermittent Systems.
Hugo Reymond, Jean-Luc Béchennec, Mikaël Briday, Sébastien Faucou, Isabelle Puaut, Erven Rohou
2024 SLaDe: A Portable Small Language Model Decompiler for Optimized Assembly.
Jordi Armengol-Estapé, Jackson Woodruff, Chris Cummins, Michael F. P. O'Boyle
2024 Seer: Predictive Runtime Kernel Selection for Irregular Problems.
Ryan Swann, Muhammad Osama, Karthik Sangaiah, Jalal Mahmud
2024 Tackling the Matrix Multiplication Micro-Kernel Generation with Exo.
Adrián Castelló, Julian Bellavita, Grace Dinh, Yuka Ikarashi, Héctor Martínez
2024 TapeFlow: Streaming Gradient Tapes in Automatic Differentiation.
Milad Hakimi, Arrvindh Shriraman
2024 Unveiling and Vanquishing Goroutine Leaks in Enterprise Microservices: A Dynamic Analysis Approach.
Georgian-Vlad Saioc, Dmitriy Shirchenko, Milind Chabbi
2024 Welcome from the Program Chairs.
Guilherme Ottoni, Fernando Pereira, Michel Steuwer, Jingling Xue
2024 Whose Baseline Compiler is it Anyway?
Ben L. Titzer
2024 oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation.
Jianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Yan Zhang, Jason Ye, Eric Lin, Dan Lavery