CGO A

57 papers

YearTitle / Authors
2026A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler.
Mohammed Tirichine, Nassim Ameur, Nazim Bendib, Iheb Nassim Aouadj, Djad Bouchama, Rafik Bouloudene, Riyadh Baghdadi
2026Accelerating App Recompilation across Android System Updates by Code Reusing.
Hongtao Wu, Yu Chen, Mengfei Xie, Futeng Yang, Jun Yan, Jiang Ma, Jianming Fu, Chun Jason Xue, Qingan Li
2026Automatic Data Enumeration for Fast Collections.
Tommy McMichen, Simone Campanoni
2026BIT: Empowering Binary Analysis through the LLVM Toolchain.
Puzhuo Liu, Peng Di, Jingling Xue, Yu Jiang
2026Binary Diffing via Library Signatures.
Andrei Rimsa, Anderson Faustino da Silva, Camilo Santana, Fernando Magno Quintão Pereira
2026Compilation of Generalized Matrix Chains with Symbolic Sizes.
Francisco López, Lars Karlsson, Paolo Bientinesi
2026Compiler-Assisted Instruction Fusion.
Ravikiran Ravindranath Reddy, Sawan Singh, Arthur Perais, Alberto Ros, Alexandra Jimborean
2026Compiler-Runtime Co-operative Chain of Verification for LLM-Based Code Optimization.
Hyunho Kwon, Sanggyu Shin, Ju Min Lee, Hoyun Youm, Seungbin Song, Seongho Kim, Hanwoong Jung, Seungwon Lee, Hanjun Kim
2026Dependence-Driven, Scalable Quantum Circuit Mapping with Affine Abstractions.
Marouane Benbetka, Merwan Bekkar, Riyadh Baghdadi, Martin Kong
2026Dr.avx: A Dynamic Compilation System for Seamlessly Executing Hardware-Unsupported Vectorization Instructions.
Yue Tang, Mianzhi Wu, Yufeng Li, Haoyu Liao, Jianmei Guo, Bo Huang
2026DyPARS: Dynamic-Shape DNN Optimization via Pareto-Aware MCTS for Graph Variants.
Hao Qian, Guangli Li, Qiuchu Yu, Xueying Wang, Jingling Xue
2026Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators.
Prasanth Chatarasi, Alex Gatea, Bardia Mahjour, Jintao Zhang, Alberto Mannari, Chris Bowler, Shubham Jain, Masoud Ataei Jaliseh, Nicole Khoun, Kamlesh Kumar, Viji Srinivasan, Swagath Venkataramani
2026Ember: A Compiler for Embedding Operations on Decoupled Access-Execute Architectures.
Marco Siracusa, Olivia Hsu, Víctor Soria Pardos, Joshua Randall, Arnaud Grasset, Eric Biscondi, Douglas J. Joseph, Randy Allen, Fredrik Kjolstad, Miquel Moretó Planas, Adrià Armejach
2026Enabling Automatic Compiler-Driven Vectorization of Transformers.
Shreya Alladi, Alberto Ros, Alexandra Jimborean
2026Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization.
Prasanth Chatarasi, Alex Gatea, Wei Wang, Chris Bowler, Shubham Jain, Masoud Ataei Jaliseh, Nicole Khoun, Alberto Mannari, Bardia Mahjour, Viji Srinivasan, Swagath Venkataramani
2026FHEFusion: Enabling Operator Fusion in FHE Compilers for Depth-Efficient DNN Inference.
Tianxiang Sui, Jianxin Lai, Long Li, Peng Yuan, Yan Liu, Qing Zhu, Xiaojing Zhang, Linjie Xiao, Mingzhe Zhang, Jingling Xue
2026FORTE: Online DataFrame Query Optimizer.
Yoonho Choi, Kyoungtae Lee, Minji Kim, Hyungsoo Jung, Hyojin Sung
2026FRUGAL: Pushing GPU Applications beyond Memory Limits.
Lingqi Zhang, Tengfei Wang, Jiajun Huang, Chen Zhuang, Ivan R. Ivanov, Peng Chen, Toshio Endo, Mohamed Wahib
2026Fast Autoscheduling for Sparse ML Frameworks.
Bobby Yan, Alexander J. Root, Trevor Gale, David Broman, Fredrik Kjolstad
2026Flow-Graph-Aware Tiling and Rescheduling for Memory-Efficient On-Device Inference.
Yeonoh Jeong, Taehyeong Park, Yongjun Park
2026From Threads to Tiles: T2T, a Compiler for CUDA-to-NPU Translation via 2D Vectorization.
Shuaijiang Li, Jiacheng Zhao, Ying Liu, Shuoming Zhang, Lei Chen, Yijin Li, Yangyu Zhang, Zhicheng Li, Runyu Zhou, Xiyu Shi, Chunwei Xia, Yuan Wen, Xiaobing Feng, Huimin Cui
2026GRANII: Selection and Ordering of Primitives in GRAph Neural Networks using Input Inspection.
Damitha Lenadora, Vimarsh Sathia, Gerasimos Gerogiannis, Serif Yesil, Josep Torrellas, Charith Mendis
2026Hexcute: A Compiler Framework for Automating Layout Synthesis in GPU Programs.
Xiao Zhang, Yaoyao Ding, Bolin Sun, Yang Hu, Tatiana Shpeisman, Gennady Pekhimenko
2026IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2026, Sydney, Australia, January 31 - Feb. 4, 2026
Stephen M. Blackburn, Albert Cohen, Timothy M. Jones
2026LEGO: A Layout Expression Language for Code Generation of Hierarchical Mapping.
Amir Mohammad Tavakkoli, Cosmin E. Oancea, Mary W. Hall
2026LLM-VeriOpt: Verification-Guided Reinforcement Learning for LLM-Based Compiler Optimization.
Xiangxin Fang, Jiaqin Kang, Rodrigo Rocha, Sam Ainsworth, Lev Mukhanov
2026Multidirectional Propagation of Sparsity Information across Tensor Slices.
Kaio Henrique Andrade Ananias, Danila Seliayeu, José Nelson Amaral, Fernando Magno Quintão Pereira
2026On the Precision of Dynamic Program Fingerprints Based on Performance Counters.
Anderson Faustino da Silva, Marcelo Borges Nogueira, Sérgio Queiroz de Medeiros, Jerónimo Castrillón, Fernando Magno Quintão Pereira
2026OpenQudit: Extensible and Accelerated Numerical Quantum Compilation via a JIT-Compiled DSL.
Ed Younis
2026PASTA: A Modular Program Analysis Tool Framework for Accelerators.
Mao Lin, Hyeran Jeon, Keren Zhou
2026PIP: Making Andersen's Points-to Analysis Sound and Practical for Incomplete C Programs.
Håvard Rognebakke Krogstie, Helge Bahmann, Magnus Själander, Nico Reissmann
2026Partial-Evaluation Templates: Accelerating Partial Evaluation with Pre-compiled Templates.
Florian Huemer, Aleksandar Prokopec, David Leopoldseder, Raphael Mosaner, Hanspeter Mössenböck
2026PolyUFC: Polyhedral Compilation Meets Roofline Analysis for Uncore Frequency Capping.
Nilesh Rajendra Shah, M. V. V. S. Manoj Kumar, Dhairya Baxi, Ramakrishna Upadrasta
2026Practical: Are Abstract-Interpreter Baseline JITs Worth It? An Empirical Evaluation through Metacompilation.
Nahuel Palumbo, Guillermo Polito, Stéphane Ducasse, Pablo Tesone
2026PriTran: Privacy-Preserving Inference for Transformer-Based Language Models under Fully Homomorphic Encryption.
Yuechen Mu, Guangli Li, Shiping Chen, Jingling Xue
2026Progressive Low-Precision Approximation of Tensor Operators on GPUs: Enabling Greater Trade-Offs between Performance and Accuracy.
Fan Luo, Guangli Li, Zhaoyang Hao, Xueying Wang, Xiaobing Feng, Huimin Cui, Jingling Xue
2026Proton: Towards Multi-level, Adaptive Profiling for Triton.
Keren Zhou, Tianle Zhong, Hao Wu, Jihyeong Lee, Yue Guan, Yufei Ding, Corbin Robeck, Yuanwei Fang, Jeff Niu, Philippe Tillet
2026Pushing Tensor Accelerators beyond MatMul in a User-Schedulable Language.
Yihong Zhang, Derek K. Gerstmann, Andrew Adams, Maaz Bin Safeer Ahmad
2026Pyls: Enabling Python Hardware Synthesis with Dynamic Polymorphism via LCRS Encoding.
Bolei Tong, Yongyan Fang, Chaorui Wang, Qingan Li, Jingling Xue, Mengting Yuan
2026QIGen: A Kernel Generator for Inference on Nonuniformly Quantized Large Language Models.
Tommaso Pegolotti, Dan Alistarh, Markus Püschel
2026SecSwift, a Compiler-Based Framework for Software Countermeasures in Cybersecurity.
François de Ferrière, Yves Janin, Sirine Mechmech
2026Selene: Cross-Level Barrier-Free Pipelining for Irregular Nested Loops in High-Level Synthesis.
Sungwoo Yun, Seonyoung Cheon, Dongkwan Kim, Heelim Choi, Kunmo Jeong, Chan Lee, Yongwoo Lee, Hanjun Kim
2026SkeleShare: Algorithmic Skeletons and Equality Saturation for Hardware Resource Sharing.
Jonathan Van der Cruysse, Tzung-Han Juang, Shakiba Bolbolian Khah, Christophe Dubach
2026Space-Time Optimisations for Early Fault-Tolerant Quantum Computation.
Sanaa Sharma, Prakash Murali
2026SparseX: Synergizing GPU Libraries for Sparse Matrix Multiplication on Heterogeneous Processors.
Ruifeng Zhang, Xiangwei Wang, Ang Li, Xipeng Shen
2026Synthesizing Instruction Selection Back-Ends from ISA Specifications Made Practical.
Florian Drescher, Alexis Engelke
2026Synthesizing Specialized Sparse Tensor Accelerators for FPGAs via High-Level Functional Abstractions.
Hamza Javed, Christophe Dubach
2026TPDE: A Fast Adaptable Compiler Back-End Framework.
Tobias Schwarz, Tobias Kamm, Alexis Engelke
2026TRACE4J: A Lightweight, Flexible, and Insightful Performance Tracing Tool for Java.
Haide He, Pengfei Su
2026Tawa: Automatic Warp Specialization for Modern GPUs with Asynchronous References.
Hongzheng Chen, Bin Fan, Alexander Collins, Bastian Hagedorn, Evghenii Gaburov, Masahiro Masuda, Matthew Brookhart, Chris Sullivan, Jason Knight, Zhiru Zhang, Vinod Grover
2026Tensor Program Superoptimization through Cost-Guided Symbolic Program Synthesis.
Alexander Brauckmann, Aarsh Chaube, José Wesley de S. Magalhães, Elizabeth Polgreen, Michael F. P. O'Boyle
2026The Parallel-Semantics Program Dependence Graph for Parallel Optimization.
Yian Su, Brian Homerding, Haocheng Gao, Federico Sossai, Yebin Chon, David I. August, Simone Campanoni
2026Thinking Fast and Correct: Automated Rewriting of Numerical Code through Compiler Augmentation.
Siyuan Brant Qian, Vimarsh Sathia, Ivan R. Ivanov, Jan Hückelheim, Paul Hovland, William S. Moses
2026Towards Path-Aware Coverage-Guided Fuzzing.
Giacomo Priamo, Daniele Cono D'Elia, Mathias Payer, Leonardo Querzoni
2026Towards Threading the Needle of Debuggable Optimized Binaries.
Cristian Assaiante, Simone Di Biasio, Snehasish Kumar, Giuseppe Antonio Di Luna, Daniele Cono D'Elia, Leonardo Querzoni
2026Unlocking Python Multithreading Capabilities using OpenMP-Based Programming with OMP4Py.
César Piñeiro, Juan Carlos Pichel
2026VFlatten: Selective Value-Object Flattening using Hybrid Static and Dynamic Analysis.
Arjun H. Kumar, Bhavya Hirani, Hang Shao, Tobi Ajila, Vijay Sundaresan, Daryl Maier, Manas Thakur