CGO A

49 papers

YearTitle / Authors
2025A Multi-level Compiler Backend for Accelerated Micro-kernels Targeting RISC-V ISA Extensions.
Alexandre Lopoukhine, Federico Ficarelli, Christos Vasiladiotis, Anton Lydike, Josse Van Delm, Alban Dutilleul, Luca Benini, Marian Verhelst, Tobias Grosser
2025A Priori Loop Nest Normalization: Automatic Loop Scheduling in Complex Applications.
Lukas Trümper, Philipp Schaad, Berke Ates, Alexandru Calotoiu, Marcin Copik, Torsten Hoefler
2025ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference.
Long Li, Jianxin Lai, Peng Yuan, Tianxiang Sui, Yan Liu, Qing Zhu, Xiaojing Zhang, Linjie Xiao, Wenguang Chen, Jingling Xue
2025ASDF: A Compiler for Qwerty, a Basis-Oriented Quantum Programming Language.
Austin J. Adams, Sharjeel Khan, Arjun S. Bhamra, Ryan R. Abusaada, Anthony M. Cabrera, Cameron C. Hoechst, Travis S. Humble, Jeffrey S. Young, Thomas M. Conte
2025Accelerating LLMs using an Efficient GEMM Library and Target-Aware Optimizations on Real-World PIM Devices.
Hyeoncheol Kim, Taehoon Kim, Taehyeong Park, Donghyeon Kim, Yongseung Yu, Hanjun Kim, Yongjun Park
2025An Efficient Polynomial Multiplication Derived Implementation of Convolution in Neural Networks.
Haoke Xu, Yulin Zhang, Zitong Cheng, Xiaoming Li
2025Automatic Synthesis of Specialized Hash Functions.
Renato B. Hoffmann, Leonardo G. Faé, Dalvan Griebler, Xinliang David Li, Fernando Magno Quintão Pereira
2025CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA Libraries.
Yoon Noh Lee, Yongseung Yu, Yongjun Park
2025Cage: Hardware-Accelerated Safe WebAssembly.
Martin Fink, Dimitrios Stavrakakis, Dennis Sprokholt, Soham Chakraborty, Jan-Erik Ekberg, Pramod Bhatotia
2025Calibro: Compilation-Assisted Linking-Time Binary Code Outlining for Code Size Reduction in Android Applications.
Zhanhao Liang, Hanming Sun, Wenhan Shang, Mengting Yuan, Jingqin Fu, Jiang Ma, Chun Jason Xue, Qingan Li
2025Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU.
Naifeng Zhang, Franz Franchetti
2025Combining MLIR Dialects with Domain-Specific Architecture for Efficient Regular Expression Matching.
Andrea Somaini, Filippo Carloni, Giovanni Agosta, Marco D. Santambrogio, Davide Conficconi
2025CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning.
Guoliang He, Eiko Yoneki
2025DialEgg: Dialect-Agnostic MLIR Optimizer using Equality Saturation with Egglog.
Abd-El-Aziz Zayed, Christophe Dubach
2025Enhancing Deployment-Time Predictive Model Robustness for Code Analysis and Optimization.
Huanting Wang, Patrick Lenihan, Zheng Wang
2025FastFlip: Compositional SDC Resiliency Analysis.
Keyur Joshi, Rahul Singh, Tommaso Bassetto, Sarita V. Adve, Darko Marinov, Sasa Misailovic
2025GoFree: Reducing Garbage Collection via Compiler-Inserted Freeing.
Haoran Peng, Yu Zhang, Michael D. Ernst, Jinbao Chen, Boyao Ding
2025GraalNN: Context-Sensitive Static Profiling with Graph Neural Networks.
Lazar Milikic, Milan Cugurovic, Vojin Jovanovic
2025Honey Potion: An eBPF Backend for Elixir.
Kael Soares Augusto, Vinícius Pacheco, Marcos A. M. Vieira, Rodrigo Geraldo Ribeiro, Fernando Magno Quintão Pereira
2025Improving Native-Image Startup Performance.
Matteo Basso, Aleksandar Prokopec, Andrea Rosà, Walter Binder
2025IntelliGen: Instruction-Level Auto-tuning for Tensor Program with Monotonic Memory Optimization.
Zixuan Ma, Haojie Wang, Jingze Xing, Shuhong Huang, Liyan Zheng, Chen Zhang, Huanqi Cao, Kezhao Huang, Mingshu Zhai, Shizhi Tang, Penghan Wang, Jidong Zhai
2025Janitizer: Rethinking Binary Tools for Practical and Comprehensive Security.
Mahwish Arif, Sam Ainsworth, Timothy M. Jones
2025LLM-Vectorizer: LLM-Based Verified Loop Vectorizer.
Jubi Taneja, Avery Laird, Cong Yan, Madan Musuvathi, Shuvendu K. Lahiri
2025MTE4JNI: A Memory Tagging Method to Protect Java Heap Memory from Illicit Native Code Access.
Huinan Chen, Jiang Ma, Chun Jason Xue, Qingan Li
2025Memory Safety Instrumentations in Practice: Usability, Performance, and Security Guarantees.
Tina Jung, Fabian Ritter, Sebastian Hack
2025Parallaft: Runtime-Based CPU Fault Tolerance via Heterogeneous Parallelism.
Boyue Zhang, Sam Ainsworth, Lev Mukhanov, Timothy M. Jones
2025Pattern Matching in AI Compilers and Its Formalization.
Joseph W. Cutler, Alex Collins, Bin Fan, Mahesh Ravishankar, Vinod Grover
2025Postiz: Extending Post-increment Addressing for Loop Optimization and Code Size Reduction.
Enming Fan, Xiaofeng Guan, Fan Hu, Heng Shi, Hao Zhou, Jianguo Yao
2025PreFix: Optimizing the Performance of Heap-Intensive Applications.
Chaitanya Mamatha Ananda, Rajiv Gupta, Sriraman Tallam, Han Shen, Xinliang David Li
2025Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization, CGO 2025, Las Vegas, NV, USA, March 1-5, 2025
Johannes Doerfert, Tobias Grosser, Hugh Leather, P. Sadayappan
2025Proteus: Portable Runtime Optimization of GPU Kernel Execution with Just-in-Time Compilation.
Giorgis Georgakoudis, Konstantinos Parasyris, David Beckingsale
2025Qiwu: Exploiting Ciphertext-Level SIMD Parallelism in Homomorphic Encryption Programs.
Zhongcheng Zhang, Ying Liu, Yuyang Zhang, Zhenchuan Chen, Jiacheng Zhao, Xiaobing Feng, Huimin Cui, Jingling Xue
2025Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors.
Enhyeok Jang, Youngmin Kim, Hyungseok Kim, Seungwoo Choi, Yipeng Huang, Won Woo Ro
2025Scalar Interpolation: A Better Balance between Vector and Scalar Execution for SuperScalar Architectures.
Reza Ghanbari, Henry Kao, João P. L. de Carvalho, Ehsan Amiri, José Nelson Amaral
2025SkipFlow: Improving the Precision of Points-to Analysis using Primitive Values and Predicate Edges.
David Kozak, Codrut Stancu, Tomás Vojnar, Christian Wimmer
2025Speeding up the Local C++ Development Cycle with Header Substitution.
Nader Al Awar, Zijian Yi, George Biros, Milos Gligoric
2025Stack Filtering: Elevating Precision and Efficiency in Rust Pointer Analysis.
Wei Li, Dongjie He, Wenguang Chen, Jingling Xue
2025Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture.
Olivia Hsu, Alexander Rucker, Tian Zhao, Varun Desai, Kunle Olukotun, Fredrik Kjolstad
2025SySTeC: A Symmetric Sparse Tensor Compiler.
Radha Patel, Willow Ahrens, Saman P. Amarasinghe
2025Synthesis of Quantum Simulators by Compilation.
Meisam Tarabkhah, Mahshid Delavar, Mina Doosti, Amir Shaikhha
2025Synthesis of Sorting Kernels.
Marcel Ullrich, Sebastian Hack
2025Teapot: Efficiently Uncovering Spectre Gadgets in COTS Binaries.
Fangzheng Lin, Zhongfa Wang, Hiroshi Sasaki
2025Tensorize: Fast Synthesis of Tensor Programs from Legacy Code using Symbolic Tracing, Sketching and Solving.
Alexander Brauckmann, Luc Jaulmes, José Wesley de Souza Magalhães, Elizabeth Polgreen, Michael F. P. O'Boyle
2025The MLIR Transform Dialect: Your Compiler Is More Powerful Than You Think.
Martin Paul Lücke, Oleksandr Zinenko, William S. Moses, Michel Steuwer, Albert Cohen
2025Towards Efficient Compiler Auto-tuning: Leveraging Synergistic Search Spaces.
Haolin Pan, Yuanyu Wei, Mingjie Xing, Yanjun Wu, Chen Zhao
2025VEGA: Automatically Generating Compiler Backends using a Pre-trained Transformer Model.
Ming Zhong, Fang Lv, Lulin Wang, Lei Qiu, Yingying Wang, Ying Liu, Huimin Cui, Xiaobing Feng, Jingling Xue
2025Vectron: A Dynamic Programming Auto-vectorization Framework.
Sourena Naser Moghaddasi, Haris Smajlovic, Ariya Shajii, Ibrahim Numanagic
2025Weaver: A Retargetable Compiler Framework for FPQA Quantum Architectures.
Oguzcan Kirmemis, Francisco Romão, Emmanouil Giortamis, Pramod Bhatotia
2025xDSL: Sidekick Compilation for SSA-Based Compilers.
Mathieu Fehr, Michel Weber, Christian Ulmann, Alexandre Lopoukhine, Martin Paul Lücke, Théo Degioanni, Christos Vasiladiotis, Michel Steuwer, Tobias Grosser