| 2025 | "My productivity is boosted, but ..." Demystifying Users' Perception on AI Coding Assistants. Yunbo Lyu, Zhou Yang, Jieke Shi, Jianming Chang, Yue Liu, David Lo |
| 2025 | 40th IEEE/ACM International Conference on Automated Software Engineering, ASE 2025, Seoul, Korea, Republic of, November 16-20, 2025 |
| 2025 | A Characterization Study of Bugs in LLM Agent Workflow Orchestration Frameworks. Ziluo Xue, Yanjie Zhao, Shenao Wang, Kai Chen, Haoyu Wang |
| 2025 | A Large Scale Study of AI-based Binary Function Similarity Detection Techniques for Security Researchers and Practitioners. Jingyi Shi, Yufeng Chen, Yang Xiao, Yuekang Li, Zhengzi Xu, Sihao Qiu, Chi Zhang, Keyu Qi, Yeting Li, Xingchu Chen, Yanyan Zou, Yang Liu, Wei Huo |
| 2025 | A Large-Scale Evolvable Dataset for Model Context Protocol Ecosystem and Security Analysis. Zhiwei Lin, Bonan Ruan, Jiahao Liu, Weibo Zhao |
| 2025 | A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems. Stefano Carlo Lambertenghi, Mirena Flores Valdez, Andrea Stocco |
| 2025 | A Secure Mocking Approach towards Software Supply Chain Security. Daisuke Yamaguchi, Shinobu Saito, Takuya Iwatsuka, Nariyoshi Chida, Tachio Terauchi |
| 2025 | ACTaint: Agent-Based Taint Analysis for Access Control Vulnerabilities in Smart Contracts. Huarui Lin, Zhipeng Gao, Jiachi Chen, Xiang Chen, Xiaohu Yang, Lingfeng Bao |
| 2025 | ADPerf: Investigating and Testing Performance in Autonomous Driving Systems. Tri Minh-Triet Pham, Diego Elias Costa, Weiyi Shang, Jinqiu Yang |
| 2025 | AMPLE: Fine-grained File Access Policies for Server Applications. Seyedhamed Ghavamnia, Julien Vanegue |
| 2025 | APIDA-Chat: Structured Synthesis of API Search Dialogues to Bootstrap Conversational Agents. Zachary Eberhart, Collin McMillan |
| 2025 | APKARMOR: Low-Cost Lightweight Anti-Decompilation Techniques for Android Apps. Jiayang Liu, Yanjie Zhao, Pengcheng Xia, Haoyu Wang |
| 2025 | ARG: Testing Query Rewriters via Abstract Rule Guided Fuzzing. Dawei Li, Yuxiao Guo, Qifan Liu, Jie Liang, Zhiyong Wu, Jingzhou Fu, Chi Zhang, Yu Jiang |
| 2025 | Acceleration of Automotive Software Development by Retrieval Augmented Integration Test Script Generation. Masashi Mizoguchi, Kentaro Yoshimura, Keita Nakazawa, Yasuomi Sato, Takahiro Iida, Fumio Narisawa |
| 2025 | AdaptEval: A Benchmark for Evaluating Large Language Models on Code Snippet Adaptation. Tanghaoran Zhang, Xinjun Mao, Shangwen Wang, Yuxin Zhao, Yao Lu, Jin Zhang, Zhang Zhang, Kang Yang, Yue Yu |
| 2025 | Adaptive Performance Regression Detection Using A Semi-Supervised Siamese Network. Yongqian Sun, Mengyao Li, Xiao Xiong, Lei Tao, Yimin Zuo, Wenwei Gu, Shenglin Zhang, Junhua Kuang, Yu Luo, Huandong Zhuang, Bowen Deng, Dan Pei |
| 2025 | AdaptiveGuard: Towards Adaptive Runtime Safety for LLM-Powered Software. Rui Yang, Michael Fu, Chakkrit Tantithamthavorn, Chetan Arora, Gunel Gulmammadova, Joselito Joey Chua |
| 2025 | Advancing Automated Ethical Profiling in SE: a Zero-Shot Evaluation of LLM Reasoning. Patrizio Migliarini, Mashal Afzal Memon, Marco Autili, Paola Inverardi |
| 2025 | Advancing Binary Code Similarity Detection via Context-Content Fusion and LLM Verification. Chaopeng Dong, Jingdong Guo, Shouguo Yang, Yi Li, Dongliang Fang, Yang Xiao, Yongle Chen, Limin Sun |
| 2025 | AgentDroid: A Multi-Agent Tool for Detecting Fraudulent Android Applications. Ruwei Pan, Hongyu Zhang, Zhonghao Jiang, Ran Hou |
| 2025 | Agentic Specification Generator for Move Programs. Yu-Fu Fu, Meng Xu, Taesoo Kim |
| 2025 | Agents in the Sandbox: End-to-End Crash Bug Reproduction for Minecraft. Eray Yapagci, Yavuz Alp Sencer Öztürk, Eray Tüzün |
| 2025 | AlertGuardian: Intelligent Alert Life-Cycle Management for Large-scale Cloud Systems. Guangba Yu, Genting Mai, Rui Wang, Ruipeng Li, Pengfei Chen, Long Pan, Ruijie Xu |
| 2025 | Algernon: A Flag-Guided Hybrid Fuzzer for Unlocking Hidden Program Paths. Peng Deng, Lei Zhang, Jingqi Long, Wenzheng Hong, Zhemin Yang, Yuan Zhang, Donglai Zhu, Min Yang |
| 2025 | AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion. Tianyue Jiang, Yanlin Wang, Yanli Wang, Daya Guo, Ensheng Shi, Yuchi Ma, Jiachi Chen, Zibin Zheng |
| 2025 | Aligning LLMs to Fully Utilize the Cross-file Context in Repository-level Code Completion. Jia Li, Hao Zhu, Huanyu Liu, Xianjie Shi, He Zong, Yihong Dong, Kechi Zhang, Siyuan Jiang, Zhi Jin, Ge Li |
| 2025 | Altered Histories in Version Control System Repositories: Evidence from the Trenches. Solal Rapaport, Laurent Pautet, Samuel Tardieu, Stefano Zacchiroli |
| 2025 | Amur: Fixing Multi-Resource Leaks Guided by Resource Flow Analysis. Jinyoung Kim, Eunseok Lee |
| 2025 | An Agent-based Evaluation Framework for Complex Code Generation. Xinchen Wang, Ruida Hu, Pengfei Gao, Chao Peng, Cuiyun Gao |
| 2025 | An Empirical Study of Knowledge Transfer in AI Pair Programming. Alisa Welter, Niklas Schneider, Tobias Dick, Kallistos Weis, Christof Tinnes, Marvin Wyrich, Sven Apel |
| 2025 | An Empirical Study of Python Library Migration Using Large Language Models. Mohayeminul Islam, Ajay Kumar Jha, May Mahmoud, Ildar Akhmetov, Sarah Nadi |
| 2025 | An Empirical Study on UI Overlap in OpenHarmony Applications. Farong Liu, Mingyi Zhou, Li Li |
| 2025 | An LLM-based multi-agent framework for agile effort estimation. Thanh-Long Bui, Hoa Khanh Dam, Rashina Hoda |
| 2025 | AndroFL: Evolutionary-Driven Fault Localization for Android Apps. Vishal Singh, Ravi Shankar Das, Prajwal H. G, Subhajit Roy |
| 2025 | AppBDS: LLM-Powered Description Synthesis for Sensitive Behaviors in Mobile Apps. Zichen Liu, Xusheng Xiao |
| 2025 | Are We SOLID Yet? An Empirical Study on Prompting LLMs to Detect Design Principle Violations. Fatih Pehlivan, Arçin Ülkü Ergüzen, Sahand Moslemi Yengejeh, Mayasah Lami, Anil Koyuncu |
| 2025 | Argus: Resilience-Oriented Safety Assurance Framework for End-to-End ADSs. Dingji Wang, You Lu, Bihuan Chen, Shuo Hao, Haowen Jiang, Yifan Tian, Xin Peng |
| 2025 | AutoFid: Adaptive and Noise-Aware Fidelity Measurement for Quantum Programs via Circuit Graph Analysis. Tingting Li, Ziming Zhao, Jianwei Yin |
| 2025 | AutoPLC: Generating Vendor-Aware Structured Text for Programmable Logic Controllers. Donghao Yang, Aolang Wu, Tianyi Zhang, Li Zhang, Xiaoli Lian, Fang Liu, Yuming Ren, Jiaji Tian, Xiaoyin Che |
| 2025 | Automated Combinatorial Test Generation for Alloy. Agustín Borda, Germán Regis, Nazareno Aguirre, Marcelo F. Frias, Pablo Ponzio |
| 2025 | Automated Detection of Web Application Navigation Barriers for Screen Reader Users. Shubhi Jain, Syed Fatiul Huq, Ziyao He, Sam Malek |
| 2025 | Automated Generation of Issue-Reproducing Tests by Combining LLMs and Search-Based Testing. Konstantinos Kitsios, Marco Castelluccio, Alberto Bacchelli |
| 2025 | Automated Inline Comment Smell Detection and Repair with Large Language Models. Hatice Kübra Çaglar, Semih Çaglar, Eray Tüzün |
| 2025 | Automated Insertion of Flushes and Fences for Persistency. Yutong Guo, Weiyu Luo, Brian Demsky |
| 2025 | Automated Proactive Logging Quality Improvement for Large-Scale Codebases. Yichen Li, Jinyang Liu, Junsong Pu, Zhihan Jiang, Zhuangbin Chen, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Rui Shi, Michael R. Lyu |
| 2025 | Automated Prompt Generation for Code Intelligence: An Empirical study and Experience in WeChat. Kexing Ji, Shiyun Fu, Cuiyun Gao, Yujia Chen, Zezhou Yang, Chaozheng Wang, Yuetang Deng |
| 2025 | Automated Repair of Ambiguous Problem Descriptions for LLM-Based Code Generation. Haoxiang Jia, Robbie Morris, He Ye, Federica Sarro, Sergey Mechtaev |
| 2025 | Automated Repair of OpenID Connect Programs. Tamjid Al Rahat, Yanju Chen, Yu Feng, Yuan Tian |
| 2025 | Automatic Fixing of Missing Dependency Errors. Jun Lyu, He Zhang, Lanxin Yang, Yue Li, Chenxing Zhong, Manuel Rigger |
| 2025 | Autonomous Agents for Accessibility: Simulating Visual Impairments in Web Interfaces. Juan Diego Yepes-Parra, Camilo Escobar-Velásquez |
| 2025 | BASHIRI: Learning Failure Oracles from Execution Features. Marius Smytzek, Martin Eberlein, Tural Mammadov, Lars Grunske, Andreas Zeller |
| 2025 | BCFuzz: Bytecode-Driven Fuzzing for JavaScript Engines. Jiming Wang, Chenggang Wu, Jikai Ren, Yuhao Hu, Yan Kang, Xiaojie Wei, Yuanming Lai, Mengyao Xie, Zhe Wang |
| 2025 | Backdoors in Code Summarizers: How Bad Is It? Chenyu Wang, Zhou Yang, Yaniv Harel, David Lo |
| 2025 | Belief Propagation with Local Structure and Its Applications in Program Analysis. Yiqian Wu, Yifan Chen, Yingfei Xiong, Xin Zhang |
| 2025 | BenGQL: An Extensible Benchmarking Framework for Automated GraphQL Testing. Abenezer Angamo, Marcello Maugeri |
| 2025 | Better Safe than Sorry: Preventing Policy Violations through Predictive Root-Cause-Analysis for IoT Systems. Michael Norris, Syed Rafiul Hussain, Gang Tan |
| 2025 | Beyond Static GUI Agent: Evolving LLM-based GUI Testing via Dynamic Memory. Mengzhuo Chen, Zhe Liu, Chunyang Chen, Junjie Wang, Yangguang Xue, Boyu Wu, Yuekai Huang, Libin Wu, Qing Wang |
| 2025 | BinStruct: Binary Structure Recovery Combining Static Analysis and Semantics. Yiran Zhang, Zhengzi Xu, Zhe Lang, Chengyue Liu, Yuqiang Sun, Wenbo Guo, Chengwei Liu, Weisong Sun, Yang Liu |
| 2025 | BitsAI-Fix: LLM-Driven Approach for Automated Lint Error Resolution in Practice. Yuanpeng Li, Qi Long, Zhiyuan Yao, Jian Xu, Lintao Xie, Xu He, Lu Geng, Xin Han, Yueyan Chen, Wenbo Duan |
| 2025 | Breaking the Traffic Barrier: Unveiling Multi-Format of Protocols via Autonomous Program Exploration. Dingzhao Xue, Yibo Qu, Bowen Jiang, Xin Chen, Shuaizong Si, Shichao Lv, Zhiqiang Shi, Limin Sun |
| 2025 | Bridging Natural Language and Formal Specification-Automated Translation of Software Requirements to LTL via Hierarchical Semantics Decomposition Using LLMs. Zhi Ma, Cheng Wen, Zhexin Su, Xiao Liang, Cong Tian, Shengchao Qin, Mengfei Yang |
| 2025 | Bridging Research and Practice in Simulation-based Testing of Industrial Robot Navigation Systems. Sajad Mazraeh Khatiri, Francisco Eli Vina Barrientos, Maximilian Wulf, Paolo Tonella, Sebastiano Panichella |
| 2025 | BuilDroid: A Self-Correcting LLM Agent for Automated Android Builds. Jaehyeon Kim, Rui Rua, Karim Ali |
| 2025 | Can Mamba Be Better? An Experimental Evaluation of Mamba in Code Intelligence. Shuo Liu, Jacky Keung, Zhen Yang, Zhenyu Mao, Yicheng Sun |
| 2025 | Characterizing Multi-Hunk Patches: Divergence, Proximity, and LLM Repair Challenges. Noor Nashid, Daniel Ding, Keheliya Gallaba, Ahmed E. Hassan, Ali Mesbah |
| 2025 | Characterizing and Repairing Color-Related Accessibility Issues in Android Apps. Jiahao Gu, Huaxun Huang |
| 2025 | Chrysalis: A Lightweight Logging and Replay Framework for Metamorphic Testing in Python. Jai Parera, Nathan Huey, Ben Limpanukorn, Miryung Kim |
| 2025 | Clara: A Developer's Companion for Code Comprehension and Analysis. Ahmed Adnan, Mushfiqur Rahman, Saad Sakib Noor, Kazi Sakib |
| 2025 | Clarifying Semantics of In-Context Examples for Unit Test Generation. Chen Yang, Lin Yang, Ziqi Wang, Dong Wang, Jianyi Zhou, Junjie Chen |
| 2025 | CoTune: Co-evolutionary Configuration Tuning. Gangda Xiong, Tao Chen |
| 2025 | Code-DiTing: Automatic Evaluation of Code Generation without References or Test Cases. Guang Yang, Yu Zhou, Xiang Chen, Wei Zheng, Xing Hu, Xin Zhou, David Lo, Taolue Chen |
| 2025 | CodeACT-R: A Cognitive Simulation Framework for Human Attention in Code Reading. Yueke Zhang, Zihan Fang, Greg Trafton, Daniel Levin, Kevin Leach, Yu Huang |
| 2025 | CodeGenLink: A Tool to Find the Likely Origin and License of Automatically Generated Code. Daniele Bifolco, Guido Annicchiarico, Pierluigi Barbiero, Massimiliano Di Penta, Fiorella Zampetti |
| 2025 | Coding-Fuse: Efficient Fusion of Code Pre-Trained Models for Classification Tasks. Yu Zhao, Lina Gong, Zhiqiu Huang, Yuchen Jin, Mingqiang Wei |
| 2025 | Comprehend, Imitate, and then Update: Unleashing the Power of LLMs in Test Suite Evolution. Tangzhi Xu, Jianhan Liu, Yuan Yao, Cong Li, Feng Xu, Xiaoxing Ma |
| 2025 | ConfuseTaint: Exploiting Vulnerabilities to Bypass Dynamic Taint Analysis. Yufei Wu, Alexandre Bartel |
| 2025 | Context-Aware CodeLLM Eviction for AI-assisted Coding. Kishanthan Thangarajah, Boyuan Chen, Shi Chang, Ahmed E. Hassan |
| 2025 | Context-Sensitive Pointer Analysis for ArkTS. Yizhuo Yang, Lingyun Xu, Mingyi Zhou, Li Li |
| 2025 | CoorLog: Efficient-Generalizable Log Anomaly Detection via Adaptive Coordinator in Software Evolution. Pei Xiao, Chiming Duan, Minghua He, Tong Jia, Yifan Wu, Jing Xu, Gege Gao, Lingzhe Zhang, Weijie Hong, Ying Li, Gang Huang |
| 2025 | Coverage-Based Harmfulness Testing for LLM Code Transformation. Honghao Tan, Haibo Wang, Diany Pressato, Yisen Xu, Shin Hwei Tan |
| 2025 | Cross2OH: Enabling Seamless Porting of C/C++ Software Libraries to OpenHarmony. Qian Zhang, Tsz-On Li, Ying Wang, Li Li, Shing-Chi Cheung |
| 2025 | Cryptbara: Dependency-Guided Detection of Python Cryptographic API Misuses. Seogyeong Cho, SeungEun Yu, Seunghoon Woo |
| 2025 | DALEQ - Explainable Equivalence for Java Bytecode. Jens Dietrich, Behnaz Hassanshahi |
| 2025 | DESIGNATOR: a Toolset for Automated GAN-enhanced Search-based Testing and Retraining of DNNs in Martian Environments. Mohammed Oualid Attaoui, Fabrizio Pastore |
| 2025 | DLBench: A Comprehensive Benchmark for SQL Translation with Large Language Models. Li Lin, Hongqiao Chen, Qinglin Zhu, Liehang Chen, Linlong Tang, Rongxin Wu |
| 2025 | DNAFuzz: Descriptor-Aware Fuzzing for USB Drivers. Zhengshu Wang, Peng He, Fuchen Ma, Yuanliang Chen, Shuoshuo Duan, Yiyuan Bai, Yu Jiang |
| 2025 | DRIFT: Debug-based Trace Inference for Firmware Testing. Changming Liu, Alejandro Mera, Meng Xu, Engin Kirda |
| 2025 | DSBox: A Data Selection Framework for Efficient Deep Code Learning. Xinyang Liu, Lili Quan, Qiang Hu |
| 2025 | Data Dependency-Aware Code Generation from Enhanced UML Sequence Diagrams. Wenxin Mao, Zhitao Wang, Long Wang, Sirong Chen, Cuiyun Gao, Luyang Cao, Ziming Liu, Qiming Zhang, Jun Zhou, Zhi Jin |
| 2025 | DebCovDiff: Differential Testing of Coverage Measurement Tools on Real-World Projects. Wentao Zhang, Jinghao Jia, Erkai Yu, Darko Marinov, Tianyin Xu |
| 2025 | Debugging the Undebuggable: Why Multi-Fault Programs Break Debugging and Repair Tools. Omar I. Al-Bataineh |
| 2025 | Debun: Detecting Bundled JavaScript Libraries on Web using Property-Order Graphs. Seojin Kim, Sungmin Park, Jihyeok Park |
| 2025 | DeepExploitor: LLM-Enhanced Automated Exploitation of DeepLink Attack in Hybrid Apps. Zhangyue Zhang, Lei Zhang, Zhibo Zhang, Yongheng Liu, Zhemin Yang, Yuan Zhang, Min Yang |
| 2025 | DeepTx: Real-Time Transaction Risk Analysis via Multi-Modal Features and LLM Reasoning. Yixuan Liu, Xinlei Li, Yi Li |
| 2025 | Defects4C: Benchmarking Large Language Model Repair Capability with C/C++ Bugs. Jian Wang, Xiaofei Xie, Qiang Hu, Shangqing Liu, Jiongchi Yu, Jiaolong Kong, Yi Li |
| 2025 | Defects4Log: Benchmarking LLMs for Logging Code Defect Detection and Reasoning. Xin Wang, Zhenhao Li, Zishuo Ding |
| 2025 | Democratizing the Cryptocurrency Ecosystem by Just-In-Time Transformation of Mining Programs. Wei Liu, Zhenhua Li, Feng Qian, Feiyu Jin, Hao Lin, Yannan Zheng, Bo Xiao, Xiaokang Qin, Tianyin Xu |
| 2025 | Demystifying Cookie Sharing Risks in WebView-based Mobile App-in-app Ecosystems. Miao Zhang, Shenao Wang, Guilin Zheng, Yanjie Zhao, Haoyu Wang |
| 2025 | Demystifying Cross-Language C/C++ Binaries: A Robust Software Component Analysis Approach. Meiqiu Xu, Ying Wang, Wei Tang, Xian Zhan, Shing-Chi Cheung, Hai Yu, Zhiliang Zhu |
| 2025 | Demystifying OpenZeppelin's Own Vulnerabilities and Analyzing Their Propagation in Smart Contracts. Han Liu, Daoyuan Wu, Yuqiang Sun, Shuai Wang, Yang Liu, Yixiang Chen |
| 2025 | Demystifying the Evolution of Neural Networks with BOM Analysis: Insights from a Large-Scale Study of 55,997 GitHub Repositories. Xiaoning Ren, Yuhang Ye, Xiongfei Wu, Yueming Wu, Yinxing Xue |
| 2025 | Destabilizing Neurons to Generate Challenging Neural Network Verification Benchmarks. Linhan Li, ThanhVu Nguyen |
| 2025 | Detecting Semantic Clones of Unseen Functionality. Konstantinos Kitsios, Francesco Sovrano, Earl T. Barr, Alberto Bacchelli |
| 2025 | Detecting Various DeFi Price Manipulations with LLM Reasoning. Juantao Zhong, Daoyuan Wu, Ye Liu, Maoyi Xie, Yang Liu, Yi Li, Ning Liu |
| 2025 | Detecting Vulnerabilities from Issue Reports for Internet-of-Things. Sogol Masoumzadeh |
| 2025 | Detecting and Mitigating Inconsistencies Between Code, Documentation and Tests. Tobias Kiecker |
| 2025 | Detecting and Repairing Incomplete Software Requirements with Multi-LLM Ensembles. Mohamad Kassab, Marwan AbdElhameed |
| 2025 | Diagnosing Performance Differences in Model Checkers via Runtime-Guided Problem Generation. Yibo Dong, Yicong Xu, Wenjing Deng, Yu Chen, Xiaoyu Zhang, Jianwen Li, Chengyu Zhang, Geguang Pu |
| 2025 | DiffFix: Incrementally Fixing AST Diffs via Context and Type Information. Guofeng Zeng, Chang-Ai Sun, Kai Gao, Huai Liu |
| 2025 | Diplomatist: What Do Cross-language Dependencies Reflect Software Ecosystem Health? Fanyi Meng, Ying Wang, Chun Yong Chong, Hai Yu, Zhiliang Zhu |
| 2025 | Do LLMs Generate Useful Test Oracles? An Empirical Study with an Unbiased Dataset. Davide Molinelli, Luca Di Grazia, Alberto Martin-Lopez, Michael D. Ernst, Mauro Pezzè |
| 2025 | Don't Mess with Bro's Cheese! An Empirical Study of Resource Conflict in Android Multi-window. Chenkai Guo, Huimin Zhao, Tianhong Wang, Naipeng Dong, Qingqing Dong, Jiarui Che, Yaqiong Qiao, Xiangyang Luo, Zheli Liu |
| 2025 | DrainCode: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning. Yanli Wang, Jiadong Wu, Tianyue Jiang, Mingwei Liu, Jiachi Chen, Chong Wang, Ensheng Shi, Xilin Liu, Yuchi Ma, Zibin Zheng |
| 2025 | DualFuzz: Detecting Vulnerability in Wi-Fi NICs through Dual-Directional Fuzzing. Yuanliang Chen, Fuchen Ma, Yanyang Zhao, Yuanyi Li, Yu Jiang |
| 2025 | Dynamic Testing of GUI Exercises in Headless Environments. Benjamin Schmitz |
| 2025 | EPSO: A Caching-Based Efficient Superoptimizer for BPF Bytecode. Qian Zhu, Yuxuan Liu, Ziyuan Zhu, Shangqing Liu, Lei Bu |
| 2025 | EditFusion: Resolving Code Merge Conflicts via Edit Selection. Changxin Wang, Lei Xu, Rundong Wang, Yiming Ma, Weifeng Zhang |
| 2025 | Effective Code Membership Inference for Code Completion Models via Adversarial Prompts. Yuan Jiang, Zehao Li, Shan Huang, Christoph Treude, Xiaohong Su, Tiantian Wang |
| 2025 | Efficient Understanding of Machine Learning Model Mispredictions. Martin Eberlein, Jürgen Cito, Lars Grunske |
| 2025 | Efficient and Verifiable Proof Logging for MaxSAT Solving. Raoul Van Doren, Timos Antonopoulos, Ruzica Piskac |
| 2025 | EfficientEdit: Accelerating Code Editing via Edit-Oriented Speculative Decoding. Peiding Wang, Li Zhang, Fang Liu, Yinghao Zhu, Wang Xu, Lin Shi, Xiaoli Lian, Minxiao Li, Bo Shen, An Fu |
| 2025 | Element-Aware Fine-Tuning of Vision-Language Models for Cost-Efficient GUI Testing in an Industrial Setting. Mengzhou Wu, Yuzhe Guo, Yuan Cao, Haochuan Lu, Hengyu Zhang, Xia Zeng, Liangchao Yao, Yuetang Deng, Dezhi Ran, Wei Yang, Tao Xie |
| 2025 | Enhancing LLM to Decompile Optimized PTX to Readable CUDA for Tensor Programs. Xinyu Sun, Fugen Tang, Yu Zhang, Han Shen, Chengru Song, Di Zhang |
| 2025 | Enhancing LLM's Ability to Generate More Repository-Aware Unit Tests Through Precise Context Injection. Xin Yin, Chao Ni, Xinrui Li, Liushan Chen, Guojun Ma, Xiaohu Yang |
| 2025 | Enhancing LLMs with Staged Grouping and Dehallucination for Header File Decomposition. Yue Wang, Jiaxuan Sun, Yanzhen Zou, Bing Xie |
| 2025 | Envisioning Intelligent Requirements Engineering via Knowledge-Guided Multi-Agent Collaboration. Jiangping Huang, Dongming Jin, Weisong Sun, Yang Liu, Zhi Jin |
| 2025 | ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems. Junsong Pu, Yichen Li, Zhuangbin Chen, Jinyang Liu, Zhihan Jiang, Jianjun Chen, Rui Shi, Zibin Zheng, Tieying Zhang |
| 2025 | Evaluating Large Language Models for Functional and Maintainable Code in Industrial Settings: A Case Study at ASML. Yash Mundhra, Max Valk, Maliheh Izadi |
| 2025 | Evaluating Large Language Models for Time Series Anomaly Detection in Aerospace Software. Yang Liu, Yixing Luo, Xiaofeng Li, Xiaogang Dong, Bin Gu, Zhi Jin |
| 2025 | Evaluating Program Coverage for Code-Model Training. Nandakishore Menon, Diptikalyan Saha |
| 2025 | Evaluating and Improving Framework-based Parallel Code Completion with Large Language Models. Ke Liu, Qinglin Wang, Xiang Chen, Guang Yang, Yigui Feng, Gencheng Liu, Jie Liu |
| 2025 | Evolution-Aware Heuristics for GR(1) Realizability Checking. Dor Ma'ayan, Shahar Maoz, Jan Oliver Ringert |
| 2025 | Exact Inference for Quantum Circuits: A Testing Oracle for Quantum Software Stacks. Kanguk Lee, Jaemin Hong, Sukyoung Ryu |
| 2025 | Execution-Aware Program Reduction for WebAssembly via Record and Replay. Doehyun Baek, Daniel Lehmann, Ben L. Titzer, Sukyoung Ryu, Michael Pradel |
| 2025 | Explainable Fault Localization for Programming Assignments via LLM-Guided Annotation. Fang Liu, Tianze Wang, Li Zhang, Zheyu Yang, Jing Jiang, Zian Sun |
| 2025 | Exploring Autonomous Agents: A Closer Look at Why They Fail When Completing Tasks. Ruofan Lu, Yichen Li, Yintong Huo |
| 2025 | Exploring Static Taint Analysis in LLMs: A Dynamic Benchmarking Framework for Measurement and Enhancement. Haoran Zhao, Lei Zhang, Keke Lian, Fute Sun, Bofei Chen, Yongheng Liu, Zhiyu Wu, Yuan Zhang, Min Yang |
| 2025 | EyeNav: Accessible Webpage Interaction and Testing using Eye-tracking and NLP. Juan Diego Yepes-Parra, Camilo Escobar-Velásquez |
| 2025 | FETT: Fault Injection as an Educational and Training Tool in Cybersecurity. Anaé De Baets, Guillaume Nguyen, Xavier Devroey, Fabian Gilson |
| 2025 | FGit: Fault-Guided Fine-Tuning for Code Generation. Lishui Fan, Zhongxin Liu, Haoye Wang, Lingfeng Bao, Xin Xia, Shanping Li |
| 2025 | Fact-Aligned and Template-Constrained Static Analyzer Rule Enhancement with LLMs. Zongze Jiang, Ming Wen, Ge Wen, Hai Jin |
| 2025 | FailMapper: Automated Generation of Unit Tests Guided by Failure Scenarios. Ruiqi Dong, Zehang Deng, Xiaogang Zhu, Xiaoning Du, Huai Liu, Shaohua Wang, Sheng Wen, Yang Xiang |
| 2025 | FastCoder: Accelerating Repository-level Code Generation via Efficient Retrieval and Verification. Qianhui Zhao, Li Zhang, Fang Liu, Xiaoli Lian, Qiaoyuanhe Meng, Ziqian Jiao, Zetong Zhou, Jia Li, Lin Shi |
| 2025 | Faster Runtime Verification during Testing via Feedback-Guided Selective Monitoring. Shinhae Kim, Saikat Dutta, Owolabi Legunsen |
| 2025 | Fault Injection for Simulink-based CPS Models: Insights and Future Directions. Drishti Yadav, Claudio Mandrioli, Ezio Bartocci, Domenico Bianculli |
| 2025 | Faultseeker: LLM-Empowered Framework for Blockchain Transaction Fault Localization. Kairan Sun, Zhengzi Xu, Kaixuan Li, Lyuye Zhang, Yuqiang Sun, Liwei Tan, Yang Liu |
| 2025 | Finding Bugs in MLIR Compiler Infrastructure via Lowering Space Exploration. Jingjing Liang, Shan Huang, Ting Su |
| 2025 | Finding Bugs in WebAssembly Interface Type Binding Generators. Ethan Stanley, Eric Eide |
| 2025 | Finding Insecure State Dependency in DApps via Multi-Source Tracing and Semantic Enrichment. Jingwen Zhang, Yuhong Nan, Wei Li, Kaiwen Ning, Zewei Lin, Zitong Yao, Yuming Feng, Weizhe Zhang, Zibin Zheng |
| 2025 | FirmProj: Detecting Firmware Leakage in IoT Update Processes via Companion App Analysis. Wenzhi Li, Jialong Guo, Jiongyi Chen, Fan Li, Yujie Xing, Yanbo Xu, Shishuai Yang, Wenrui Diao |
| 2025 | First-Order Quantified Separator in Alloy Analyzer. One An |
| 2025 | Fixing Broken Graphs: LLM-Powered Automatic Code Optimization for DNN Programs. Haotian Wang, Yicheng Sui, Yudong Xie, Yicong Liu, Yufei Sun, Changqing Shi, Yuzhi Zhang |
| 2025 | FlakyGuard: Automatically Fixing Flaky Tests at Industry Scale. Chengpeng Li, Farnaz Behrang, August Shi, Peng Liu |
| 2025 | FlowStrider: Low-Friction Continuous Threat Modeling. Bernd Gruner, Noah Erthel, Clemens-Alexander Brust |
| 2025 | Forcrat: Automatic I/O API Translation from C to Rust via Origin and Capability Analysis. Jaemin Hong, Sukyoung Ryu |
| 2025 | From Characters to Structure: Rethinking Real-Time Collaborative Programming Models. Leon Freudenthaler, Bernhard Taufner, Karl Michael Göschka |
| 2025 | From Modules to Marketplaces: A Vision for Composable Capability Sharing Across Organizations. Wei-Ji Wang |
| 2025 | From Redundancy to Efficiency: Exploiting Shared UI Interactions towards Efficient LLM-Based Testing. Xuan Wang, Yingchuan Wang, Yongxiang Hu, Yu Zhang, Hailiang Jin, Shiyu Guo, Juxing Yuan, Yangfan Zhou |
| 2025 | From Sparse to Structured: A Diffusion-Enhanced and Feature-Aligned Framework for Coincidental Correctness Detection. Huan Xie, Chunyan Liu, Yan Lei, Zhenyu Wu, Jinping Wang |
| 2025 | From Technical Excellence to Practical Adoption: Lessons Learned Building an ML-Enhanced Trace Analysis Tool. Kaveh Shahedi, Matthew Khouzam, Heng Li, Maxime Lamothe, Foutse Khomh |
| 2025 | Function Clustering-Based Fuzzing Termination: Toward Smarter Early Stopping. Liang Ding, Wenzhang Yang, Yinxing Xue |
| 2025 | GUI-ReRank: Enhancing GUI Retrieval with Multi-Modal LLM-based Reranking. Kristian Kolthoff, Felix Kretzer, Alexander Maedche, Simone Paolo Ponzetto, Christian Bartelt |
| 2025 | GUIFuzz++: Unleashing Grey-box Fuzzing on Desktop Graphical User Interfacing Applications. Dillon Otto, Tanner Rowlett, Stefan Nagy |
| 2025 | Generating Failure-Based Oracles to Support Testing of Reported Bugs in Android Apps. Jack Johnson, Junayed Mahmud, Oscar Chaparro, Kevin Moran, Mattia Fazzini |
| 2025 | GlassWing: A Tailored Static Analysis Approach for Flutter Android Apps. Xiangyu Zhang, Yucheng Su, Lingling Fan, Miaoying Cai, Sen Chen |
| 2025 | HFuzzer: Testing Large Language Models for Package Hallucinations via Phrase-based Fuzzing. Yukai Zhao, Menghan Wu, Xing Hu, Xin Xia |
| 2025 | HarmoBridge: Bridging ArkTS and C/C++ for Cross-Language Static Analysis on HarmonyOS. Jiale Wu, Jiapeng Deng, Yanjie Zhao, Li Li, Haoyu Wang |
| 2025 | Have We Solved Access Control Vulnerability Detection in Smart Contracts? A Benchmark Study. Han Liu, Daoyuan Wu, Yuqiang Sun, Shuai Wang, Yang Liu |
| 2025 | Hierarchical Knowledge Injection for Improving LLM-based Program Repair. Ramtin Ehsani, Esteban Parra, Sonia Haiduc, Preetha Chatterjee |
| 2025 | Hit The Bullseye On The First Shot: Improving LLMs Using Multi-Sample Self-Reward Feedback for Vulnerability Repair. Rui Jiao, Yue Zhang, Jinku Li, Jianfeng Ma |
| 2025 | How Big is the Automaton? Certified Lower Bounds on the Size of Presburger DFAs. Nicolas Amat, Pierre Ganty, Alessio Mansutti |
| 2025 | How Can Infrastructure as Code Accelerate Data Center Bring-ups? A Case Study at ByteDance. Xianhao Jin, Yifei Feng, Yufei Gao, Yongning Hu, Jie Huang, Kun Xia, Luchuan Guo |
| 2025 | How Does ChatGPT Make Assumptions When Creating Erroneous Programs? Sadia Jahan, Xiaoyin Wang |
| 2025 | Human-Centered Evaluation of REST API Fuzzing Tools: Bridging Academia and Industry. Fanny Febriani Susilo |
| 2025 | Human-In-The-Loop Oracle Learning for Simulation-Based Testing. Ben-Hau Chia, Eunsuk Kang, Christopher Steven Timperley |
| 2025 | HybridSIMD: A Super C++ SIMD Library with Integrated Auto-tuning Capabilities. Haolin Pan, Xulin Zhou, Mingjie Xing, Yanjun Wu |
| 2025 | Hypergraph Neural Network-based Multi-Granular Root Cause Localization for Microservice Systems. Yaxiao Li, Lu Wang, Chenxi Zhang, Qingshan Li, Siming Rong, Baiyang Wen, Xuyang Li, Kun Ma, Quanwei Du, KeYang Li, Lingfeng Pan, Xinyue Li, Mingxuan Hui |
| 2025 | IDBFuzz: Web Storage DataBase Fuzzing with Controllable Semantics. Jingyi Chen, Jinfu Chen, Saihua Cai, Shengran Wang |
| 2025 | IMUFuzzer: Resilience-based Discovery of Signal Injection Attacks on Robotic Aerial Vehicles. Sudharssan Mohan, Kyeongseok Yang, Zelun Kong, Yonghwi Kwon, Junghwan Rhee, Tyler Summers, Hongjun Choi, Heejo Lee, Chung Hwan Kim |
| 2025 | Improving LLM-based Log Parsing by Learning from Errors in Reasoning Traces. Jialai Wang, Juncheng Lu, Jie Yang, Junjie Wang, Zeyu Gao, Chao Zhang, Zhenkai Liang, Ee-Chien Chang |
| 2025 | Improving NLSAT for Nonlinear Real Arithmetic. Zhonghan Wang |
| 2025 | Improving Quality of LLM Code Generation in Low-Resource Programming Languages via Uncertainty Estimation. Georgii Andriushchenko |
| 2025 | Incremental Program Analysis in the Wild: An Empirical Study on Real-World Program Changes. Xizao Wang, Xiangrong Bin, Lanxin Huang, Shangqing Liu, Jianhua Zhao, Lei Bu |
| 2025 | Industry Practice of LLM-Assisted Protocol Fuzzing for Commercial Communication Modules. Qiang Fu, Changjian Liu, Yuan Ding, Chao Fan, Yulai Fu, Yuhan Chen, Ying Fu, Ronghua Shi, Fuchen Ma, Heyuan Shi |
| 2025 | IntelliTopo: An IaC Generation Service for Industrial Network Topology Construction. Mingyu Shao, Zhao Liu, Weihong Han, Cuiyun Gao, Jiachen Liu, Qing Liao |
| 2025 | Interaction-Aware Patch Assessment for Multi-Fault Automated Program Repair. Omar I. Al-Bataineh |
| 2025 | Interaction2Code: Benchmarking MLLM-based Interactive Webpage Code Generation from Interactive Prototyping. Jingyu Xiao, Yuxuan Wan, Yintong Huo, Zixin Wang, Xinyi Xu, Wenxuan Wang, Zhiyao Xu, Yuhang Wang, Michael R. Lyu |
| 2025 | Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR. Zeyu Sun, Jingjing Liang, Weiyi Wang, Chenyao Suo, Junjie Chen, Fanjiang Xu |
| 2025 | Interpretable Vulnerability Detection Reports. Cláudia Mamede, José Campos, Claire Le Goues, Rui Abreu |
| 2025 | Is Measurement Enough? Rethinking Output Validation in Quantum Program Testing. Jiaming Ye, Xiongfei Wu, Shangzhou Xia, Fuyuan Zhang, Jianjun Zhao |
| 2025 | Issue Localization via LLM-Driven Iterative Code Graph Searching. Zhonghao Jiang, Xiaoxue Ren, Meng Yan, Wei Jiang, Yong Li, Zhongxin Liu |
| 2025 | It's Not Easy Being Green: On the Energy Efficiency of Programming Languages. Nicolas van Kempen, Hyuk-Je Kwon, Dung Tuan Nguyen, Emery D. Berger |
| 2025 | JSidentify-V2: Leveraging Dynamic Memory Fingerprinting for Mini-Game Plagiarism Detection. Zhihao Li, Chaozheng Wang, Zongjie Li, Xinyong Peng, Qun Xia, Haochuan Lu, Ting Xiong, Shuzheng Gao, Cuiyun Gao, Shuai Wang, Yuetang Deng, Huafeng Ma |
| 2025 | KAIOPS: A Platform Solution of End-to-End Multi-Modal AIOps for AI Training at Scale. Zeying Wang, Junhong Liu, Penghao Zhang, Xiaoyang Sun, Xu Wang, Tianyu Wo, Chunming Hu, Chengru Song, Jin Ouyang, Renyu Yang |
| 2025 | Kair: A Statistical and Causal Approach to Pinpointing Stragglers in Distributed Model Training. Yitang Yang, Junhong Liu, Jiapeng Chen, Xiaoyang Sun, Tianyu Wo, Chunming Hu, Chengru Song, Jin Ouyang, Renyu Yang |
| 2025 | LAURA: Enhancing Code Review Generation with Context-Enriched Retrieval-Augmented LLM. Yuxin Zhang, Yuxia Zhang, Zeyu Sun, Yanjie Jiang, Hui Liu |
| 2025 | LLM-Assisted Synthesis of High-Assurance C Programs. Prasita Mukherjee, Minghai Lu, Benjamin Delaware |
| 2025 | LLM-Based Identification of Null Pointer Exception Patches. Tahir Ullah, Waseem Akram, Fiza Khaliq, Hui Liu |
| 2025 | LLM-Guided Genetic Improvement: Envisioning Semantic Aware Automated Software Evolution. Karine Even-Mendoza, Alexander E. I. Brownlee, Alina Geiger, Carol Hanna, Justyna Petke, Federica Sarro, Dominik Sobania |
| 2025 | LLM-Powered Fully Automated Chaos Engineering: Towards Enabling Anyone to Build Resilient Software Systems at Low Cost. Daisuke Kikuta, Hiroki Ikeuchi, Kengo Tajiri |
| 2025 | LLM-Powered Multi-Agent Collaboration for Intelligent Industrial On-Call Automation. Ruowei Fu, Yang Zhang, Zeyu Che, Xin Wu, Zhenyu Zhong, Zhiqiang Ren, Shenglin Zhang, Feng Wang, Yongqian Sun, Xiaozhou Liu, Kexin Liu, Yu Zhang |
| 2025 | LLM-assisted Industrial-Scale Differential Testing of Package Incompatibilities in Linux Distributions. Yuhao Yang, Chijin Zhou, Runzhe Wang, Weibo Zhang, Yuheng Shen, Xiaohai Shi, Tao Ma, Chang Gao, Zhe Wang, Ying Fu, Heyuan Shi |
| 2025 | LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection. Ce Lyu, Yanhao Wang, Jie Liang, Minghao Zhao |
| 2025 | LLMPort: Cross-file Patch Porting via Task Decomposition and Self-correction. Bofei Chen, Lei Zhang, Peng Deng, Nan Wang, Haoyu Xu, Mingda Guo, Yuan Zhang, Min Yang |
| 2025 | LLMorph: Automated Metamorphic Testing of Large Language Models. Steven Cho, Stefano Ruberto, Valerio Terragni |
| 2025 | LLMs for Automated Unit Test Generation and Assessment in Java: The AgoneTest Framework. Andrea Lops, Fedelucio Narducci, Azzurra Ragone, Michelantonio Trizio, Claudio Bartolini |
| 2025 | LOSVER: Line-Level Modifiability Signal-Guided Vulnerability Detection and Classification. Doha Nam, Jongmoon Baik |
| 2025 | Lares: LLM-driven Code Slice Semantic Search for Patch Presence Testing. Siyuan Li, Yaowen Zheng, Hong Li, Jingdong Guo, Chaopeng Dong, Chunpeng Yan, Weijie Wang, Yimo Ren, Limin Sun, Hongsong Zhu |
| 2025 | Latra: A Template-Based Language-Agnostic Transformation Framework for Effective Program Reduction. Zhenyang Xu, Yiran Wang, Yongqiang Tian, Mengxiao Zhang, Chengnian Sun |
| 2025 | Learning Project-wise Subsequent Code Edits via Interleaving Neural-based Induction and Tool-based Deduction. Chenyan Liu, Yun Lin, Yuhuan Huang, Jiaxin Chang, Binhang Qi, Bo Jiang, Zhiyong Huang, Jinsong Dong |
| 2025 | Learning from the Past: Real-World Exploit Migration for Smart Contract PoC Generation. Kairan Sun, Zhengzi Xu, Kaixuan Li, Lyuye Zhang, Yebo Feng, Daoyuan Wu, Yang Liu |
| 2025 | Let the Code Speak: Incorporating Program Dynamic State for Better Method-Level Fault Localization. Yihao Qin, Shangwen Wang, Bo Lin, Xin Peng, Sheng Ouyang, Liqian Chen, Xiaoguang Mao |
| 2025 | Leveraging Mixture-of-Experts Framework for Smart Contract Vulnerability Repair with Large Language Model. Hang Yuan, Xizhi Hou, Lei Yu, Li Yang, Jiayue Tang, Jiadong Xu, Yifei Liu, Fengjun Zhang, Chun Zuo |
| 2025 | LineBreaker: Finding Token-Inconsistency Bugs with Large Language Models. Hongbo Chen, Yifan Zhang, Xing Han, Tianhao Mao, Huanyao Rong, Yuheng Zhang, Xiaofeng Wang, Luyi Xing, Xun Chen, Hang Zhang |
| 2025 | Linguistic Theories Coincide with Misformalization in Temporal Logic. Colin S. Gordon |
| 2025 | LitterBox Benedikt Fein, Florian Obermüller, Gordon Fraser |
| 2025 | LogAction: Consistent Cross-system Anomaly Detection through Logs via Active Domain Adaptation. Chiming Duan, Minghua He, Pei Xiao, Tong Jia, Xin Zhang, Zhewei Zhong, Xiang Luo, Yan Niu, Lingzhe Zhang, Siyu Yu, Yifan Wu, Weijie Hong, Ying Li, Gang Huang |
| 2025 | LogMoE: Lightweight Expert Mixture for Cross-System Log Anomaly Detection. Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Yuchen Wang, Aibin Wang, Hongyu Zhang, Hailong Yang, Depei Qian |
| 2025 | LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems. Zhihan Jiang, Jinyang Liu, Yichen Li, Haiyu Huang, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Rui Shi, Michael R. Lyu |
| 2025 | LogSage: An LLM-Based Framework for CI/CD Failure Detection and Remediation with Industrial Validation. Weiyuan Xu, Juntao Luo, Tao Huang, Kaixin Sui, Jie Geng, Qijun Ma, Isami Akasaka, Xiaoxue Shi, Jing Tang, Peng Cai |
| 2025 | LongCodeZip: Compress Long Context for Code Language Models. Yuling Shi, Yichun Qian, Hongyu Zhang, Beijun Shen, Xiaodong Gu |
| 2025 | Loupe: End-to-End Learning of Loop Unrolling Heuristics for Abstract Interpretation. Maykel Mattar, Michele Alberti, Valentin Perrelle, Salah Sadou |
| 2025 | LspFuzz: Hunting Bugs in Language Servers. Hengcheng Zhu, Songqiang Chen, Valerio Terragni, Lili Wei, Yepang Liu, Jiarong Wu, Shing-Chi Cheung |
| 2025 | M2QCode: A Model-Driven Framework for Generating Multi-Platform Quantum Programs. Xiaoyu Guo, Shinobu Saito, Jianjun Zhao |
| 2025 | MCTS-Refined CoT: High-Quality Fine-Tuning Data for LLM-Based Repository Issue Resolution. Yibo Wang, Zhihao Peng, Ying Wang, Zhao Wei, Hai Yu, Zhiliang Zhu |
| 2025 | MIMIC: Integrating Diverse Personality Traits for Better Game Testing Using Large Language Model. Yifei Chen, Sarra Habchi, Lili Wei |
| 2025 | Measuring LLM Code Generation Stability via Structural Entropy. Yewei Song, Tiezhu Sun, Xunzhu Tang, Prateek Rajput, Tegawendé F. Bissyandé, Jacques Klein |
| 2025 | Measuring Software Resilience Using Socially Aware Truck Factor Estimation. Alexis Butler, Daniel O'Keeffe, Santanu Kumar Dash |
| 2025 | Metamorphic Testing for Audio Content Moderation Software. Wenxuan Wang, Yongjiang Wu, Junyuan Zhang, Shuqing Li, Yun Peng, Wenting Chen, Shuai Wang, Michael R. Lyu |
| 2025 | Metamorphic Testing of Deep Reinforcement Learning Agents with MDPMorph. Jiapeng Li, Zheng Zheng, Yuning Xing, Daixu Ren, Steven Cho, Valerio Terragni |
| 2025 | Metrics Driven Reengineering and Continuous Code Improvement at Meta. Audris Mockus, Peter C. Rigby, Rui Abreu, Anatoly Akkerman, Yogesh Bhootada, Payal Bhuptani, Gurnit Ghardhora, Lan Hoang Dao, Chris Hawley, Renzhi He, Sagar Krishnamoorthy, Sergei Krauze, Jianmin Li, Anton Lunov, Dragos Martac, François Morin, Neil Mitchell, Venus Montes, Maher Saba, Matt Steiner, Andrea Valori, Shanchao Wang, Nachiappan Nagappan |
| 2025 | Minuku: Detecting Diverse Display Issues in Mobile Apps with Small-scale Dataset. Yongxiang Hu, Ke Liu, Hailiang Jin, Shiyu Guo, Juxing Yuan, Xin Wang, Yangfan Zhou |
| 2025 | Mixture-of-Experts Low-Rank Adaptation for Multilingual Code Summarization. Tianchen Yu, Li Yuan, Hailing Huang, Jiexin Wang, Yi Cai |
| 2025 | MobileUPReg: Identifying User-Perceived Performance Regressions in Mobile OS Versions. Wei Liu, Yi Wen Heng, Feng Lin, Tse-Hsun Peter Chen, Ahmed E. Hassan |
| 2025 | Mockingbird: Efficient Excessive Data Exposures Detection via Dynamic Code Instrumentation. Chenxiao Xia, Jiazheng Sun, Jun Zheng, Yu-an Tan, Hongyi Su |
| 2025 | Multi-Modal Requirements Data-based Acceptance Criteria Generation using LLMs. Fanyu Wang, Chetan Arora, Yonghui Liu, Kaicheng Huang, Chakkrit Tantithamthavorn, Aldeida Aleti, Dishan Sambathkumar, David Lo |
| 2025 | Multi-dimensional Assessment of Crowdsourced Testing Reports via LLMs. Yue Wang, Yuan Zhang, Shengcheng Yu, Zhenyu Chen |
| 2025 | Multiple Schema-Conformant Declarative Code Generation. Mehant Kammakomati, Srikanth G. Tamilselvam |
| 2025 | NATE: A Network-Aware Testing Enhancer for Network-Related Fault Detection in Android Apps. Yuanhong Lan, Shaoheng Cao, Yifei Lu, Minxue Pan, Xuandong Li |
| 2025 | Navigating the Labyrinth: Path-Sensitive Unit Test Generation with Large Language Models. Dianshu Liao, Xin Yin, Shidong Pan, Chao Ni, Zhenchang Xing, Xiaoyu Sun |
| 2025 | Non-termination Witnesses and Their Validation. Zsófia Ádám, Paulína Ayaziová, Levente Bajczi, Dirk Beyer, Marek Jankola, Marian Lingsch Rosenfeld, Jan Strejcek |
| 2025 | Not Every Patch is an Island: LLM-Enhanced Identification of Multiple Vulnerability Patches. Yi Song, Dongchen Xie, Lin Xu, He Zhang, Chunying Zhou, Xiaoyuan Xie |
| 2025 | NovaQ: Improving Quantum Program Testing through Diversity-Guided Test Case Generation. Tiancheng Jin, Shangzhou Xia, Jianjun Zhao |
| 2025 | ORFuzz: Fuzzing the "Other Side" of LLM Safety - Testing Over-Refusal. Haonan Zhang, Dongxia Wang, Yi Liu, Kexin Chen, Jiashui Wang, Xinlei Ying, Long Liu, Wenhai Wang |
| 2025 | ORMorpher: An Interactive Framework for ORM Translation and Optimization. Milan Abrahám, Pavel Koupil |
| 2025 | OSSPREY: AI-Driven Forecasting and Intervention for OSS Project Sustainability. Nafiz Imtiaz Khan, Priyal Soni, Arjun Ashok, Vladimir Filkov |
| 2025 | On Automating Configuration Dependency Validation via Retrieval-Augmented Generation. Sebastian Simon, Alina Mailach, Johannes Dorn, Norbert Siegmund |
| 2025 | On the (In)Security of Non-resettable Device Identifiers in Custom Android Systems. Zikan Dong, Liu Wang, Guoai Xu, Haoyu Wang |
| 2025 | On the Correctness of Software Merge. Akira Mori, Masatomo Hashimoto |
| 2025 | On the Robustness Evaluation of 3D Obstacle Detection Against Specifications in Autonomous Driving. Tri Minh-Triet Pham, Bo Yang, Jinqiu Yang |
| 2025 | Out of Distribution Detection in Self-adaptive Robots with AI-powered Digital Twins. Erblin Isaku, Hassan Sartaj, Shaukat Ali, Beatriz Sanguino, Tongtong Wang, Guoyuan Li, Houxiang Zhang, Thomas Peyrucain |
| 2025 | PALM: Synergizing Program Analysis and LLMs to Enhance Rust Unit Test Coverage. Bei Chu, Yang Feng, Kui Liu, Hange Shi, Zifan Nan, Zhaoqiang Guo, Baowen Xu |
| 2025 | PAT-Agent: Autoformalization for Model Checking. Xinyue Zuo, Yifan Zhang, Hongshu Wang, Yufan Cai, Zhe Hou, Jing Sun, Jin Song Dong |
| 2025 | PEACE: Towards Efficient Project-Level Efficiency Optimization via Hybrid Code Editing. Xiaoxue Ren, Jun Wan, Yun Peng, Zhongxin Liu, Ming Liang, Dajun Chen, Wei Jiang, Yong Li |
| 2025 | PROXiFY: A Bytecode Analysis Tool for Detecting and Classifying Proxy Contracts in Ethereum Smart Contracts. Ilham A. Qasse, Mohammad Hamdaqa, Björn Þór Jónsson |
| 2025 | PoliCond: Condition-Aware Ontology-Driven LLMs for Privacy Policy Contradiction Analysis. Yalin Feng, Yifei Lu, Minxue Pan |
| 2025 | Polyglot: An Extensible Framework to Benchmark Code Translation with LLMs. Marco Vieira, Priyam Ashish Shah, Bhavain Shah, Rrezarta Krasniqi |
| 2025 | Practical Escape of Exploration Tarpits for Mini-Game Testing in an Industrial Setting. Yuan Cao, Dezhi Ran, Haochuan Lu, Chao Guo, Xuran Hao, Zhuoru Chen, Ting Xiong, Yuetang Deng, Tao Xie |
| 2025 | PrefGen: A Preference-Driven Methodology for Secure Yet Gas-Efficient Smart Contract Generation. Zhiyuan Peng, Xin Yin, Zijie Zhou, Chenhao Ying, Chao Ni, Yuan Luo |
| 2025 | PrioTestCI: Efficient Test Case Prioritization in GitHub Workflows for CI Optimization. Shubham Vasudeo Desai, Shonil Bhide, Souhaila Serbout, Luciano Marchezan, Wesley K. G. Assunção |
| 2025 | ProfMal: Detecting Malicious NPM Packages by the Synergy between Static and Dynamic Analysis. Yiheng Huang, Wen Zheng, Susheng Wu, Bihuan Chen, You Lu, Zhuotong Zhou, Yiheng Cao, Xiaoyu Li, Xin Peng |
| 2025 | Profile Coverage: Using Android Compilation Profiles to Evaluate Dynamic Testing. Jakob Bleier, Felix Kehrer, Jürgen Cito, Martina Lindorfer |
| 2025 | Programmers' Visual Attention on Function Call Graphs During Code Summarization. Samantha McLoughlin, Zachary Karas, Robert Wallace, Aakash Bansal, Collin McMillan, Yu Huang |
| 2025 | PromFuzz: Leveraging LLM-Driven and Bug-Oriented Composite Analysis for Detecting Functional Bugs in Smart Contracts. Xingshuang Lin, Qinge Xie, Binbin Zhao, Yuan Tian, Saman A. Zonouz, Na Ruan, Jiliang Li, Raheem Beyah, Shouling Ji |
| 2025 | Prompt-with-Me: in-IDE Structured Prompt Management for LLM-Driven Software Engineering. Ziyou Li, Agnia Sergeyuk, Maliheh Izadi |
| 2025 | Propagation-Based Vulnerability Impact Assessment for Software Supply Chains. Bonan Ruan, Zhiwei Lin, Jiahao Liu, Chuqi Zhang, Kaihang Ji, Zhenkai Liang |
| 2025 | Protecting Source Code Privacy When Hunting Memory Bugs. Jielun Wu, Bing Shui, Hongcheng Fan, Shengxin Wu, Rongxin Wu, Yang Feng, Baowen Xu, Qingkai Shi |
| 2025 | Provable Fairness Repair for Deep Neural Networks. Jianan Ma, Jingyi Wang, Qi Xuan, Zhen Wang |
| 2025 | PseudoFix: Refactoring Distorted Structures in Decompiled C Pseudocode. Gangyang Li, Xiuwei Shang, Shaoyin Cheng, Junqi Zhang, Li Hu, Xu Zhu, Weiming Zhang, Nenghai Yu |
| 2025 | PyGress: Tool for Analyzing the Progression of Code Proficiency in Python OSS Projects. Rujiphart Charatvaraphan, Bunradar Chatchaiyadech, Thitirat Sukijprasert, Chaiyong Ragkhitwetsagul, Morakot Choetkiertikul, Raula Gaikovina Kula, Thanwadee Sunetnanta, Kenichi Matsumoto |
| 2025 | PyTrim: A Practical Tool for Reducing Python Dependency Bloat. Konstantinos Karakatsanis, Georgios Alexopoulos, Ioannis Karyotakis, Foivos Timotheos Proestakis, Evangelos Talos, Panos Louridas, Dimitris Mitropoulos |
| 2025 | QuanBench: Benchmarking Quantum Code Generation with Large Language Models. Xiaoyu Guo, Minggu Wang, Jianjun Zhao |
| 2025 | Quantum Machine Learning-based Test Oracle for Autonomous Mobile Robots. Xinyi Wang, Qinghua Xu, Paolo Arcaini, Shaukat Ali, Thomas Peyrucain |
| 2025 | Quirx: A Mutation-Based Framework for Evaluating Prompt Robustness in LLM-based Software. Souhaila Serbout |
| 2025 | R Muzhi Yu, Zhengran Zeng, Wei Ye, Jinan Sun, Xiaolong Bai, Shikun Zhang |
| 2025 | RAML: Toward Retrieval-Augmented Localization of Malicious Payloads in Android Apps. Tiezhu Sun, Marco Alecci, Yewei Song, Xunzhu Tang, Kisub Kim, Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein |
| 2025 | RFCAudit: AI Agent for Auditing Protocol Implementations Against RFC Specifications. Mingwei Zheng, Chengpeng Wang, Xuwei Liu, Jinyao Guo, Shiwei Feng, Xiangyu Zhang |
| 2025 | RFCScope: Detecting Logical Ambiguities in Internet Protocol Specifications. Mrigank Pawagi, Lize Shao, Hyeonmin Lee, Yixin Sun, Wenxi Wang |
| 2025 | RPG: Linux Kernel Fuzzing Guided by Distribution-Specific Runtime Parameter Interfaces. Yuhan Chen, Yuheng Shen, Guoyu Yin, Fan Ding, Runzhe Wang, Tao Ma, Xiaohai Shi, Qiang Fu, Ying Fu, Heyuan Shi |
| 2025 | RSFuzz: A Robustness-Guided Swarm Fuzzing Framework Based on Behavioral Constraints. Ruoyu Zhou, Zhiwei Zhang, Haocheng Han, Xiaodong Zhang, Zehan Chen, Jun Sun, Yulong Shen, Dehai Xu |
| 2025 | ReFuzzer: Feedback-Driven Approach to Enhance Validity of LLM-Generated Test Programs. Iti Shree, Karine Even-Mendoza, Tomasz Radzik |
| 2025 | RealisticCodeBench: Towards More Realistic Evaluation of Large Language Models for Code Generation. Xiao Yu, Haoxuan Chen, Lei Liu, Xing Hu, Jacky Wai Keung, Xin Xia |
| 2025 | Rechecking Recheck Requests in Continuous Integration: An Empirical Study of OpenStack. Yelizaveta Brus, Rungroj Maipradit, Earl T. Barr, Shane McIntosh |
| 2025 | Reflective Unit Test Generation for Precise Type Error Detection with Large Language Models. Chen Yang, Ziqi Wang, Yanjie Jiang, Lin Yang, Yuteng Zheng, Jianyi Zhou, Junjie Chen |
| 2025 | Relia: Accelerating the Analysis of Cloud Access Control Policies. Dan Wang, Peng Zhang, Zhenrong Gu, Weibo Lin, Shibiao Jiang, Zhu He, Xu Du, Longfei Chen, Jun Li, Xiaohong Guan |
| 2025 | Repairing Leaks in Resource Wrappers. Sanjay Malakar, Michael D. Ernst, Martin Kellogg, Manu Sridharan |
| 2025 | RepoMasterEval: Evaluating Code Completion via Real-World Repositories. Qinyun Wu, Chao Peng, Pengfei Gao, Ruida Hu, Haoyu Gan, Bo Jiang, Jinhe Tang, Zhiwen Deng, Zhanming Guan, Cuiyun Gao, Xia Liu, Ping Yang |
| 2025 | Requirements Development and Formalization for Reliable Code Generation: A Multi-Agent Vision. Xu Lu, Weisong Sun, Yiran Zhang, Ming Hu, Cong Tian, Zhi Jin, Yang Liu |
| 2025 | Risk Estimation in Differential Fuzzing via Extreme Value Theory. Rafael Baez, Alejandro Olivas, Nathan K. Diamond, Marcelo F. Frias, Yannic Noller, Saeid Tizpaz-Niari |
| 2025 | Root Cause Analysis of RISC-V Build Failures via LLM and MCTS Reasoning. Weipeng Shuai, Jie Liu, Zhirou Ma, Liangyi Kang, Zehua Wang, Shuai Wang, Dan Ye, Hui Li, Wei Wang, Jiaxin Zhu |
| 2025 | RustAssure: Differential Symbolic Testing for LLM-Transpiled C-to-Rust Code. Yubo Bai, Tapti Palit |
| 2025 | RustRepoTrans: Repository-level Context Code Translation Benchmark Targeting Rust. Guangsheng Ou, Mingwei Liu, Yuxuan Chen, Yanlin Wang, Xin Peng, Zibin Zheng |
| 2025 | SATORI: Static Test Oracle Generation for REST APIs. Juan C. Alonso, Alberto Martin-Lopez, Sergio Segura, Gabriele Bavota, Antonio Ruiz-Cortés |
| 2025 | SCOPE: Evaluating and Enhancing Permission Explanation Transparency in Mobile Apps. Liu Wang, Tianshu Zhou, Haoyu Wang, Xiyuan Liu, Yi Wang |
| 2025 | SE-Jury: An LLM-as-Ensemble-Judge Metric for Narrowing the Gap with Human Evaluation in SE. Xin Zhou, Kisub Kim, Ting Zhang, Martin Weyssow, Luís F. Gomes, Guang Yang, Kui Liu, Xin Xia, David Lo |
| 2025 | SGCR: A Specification-Grounded Framework for Trustworthy LLM Code Review. Kai Wang, Bingcheng Mao, Shuai Jia, Yujie Ding, Dongming Han, Tianyi Ma, Bin Cao |
| 2025 | SMTgazer: Learning to Schedule SMT Algorithms via Bayesian Optimization. Chuan Luo, Shaoke Cui, Jianping Song, Xindi Zhang, Wei Wu, Chanjuan Liu, Shaowei Cai, Chunming Hu |
| 2025 | SPICE: An Automated SWE-Bench Labeling Pipeline for Issue Clarity, Test Coverage, and Effort Estimation. Gustavo Ansaldi Oliva, Gopi Krishnan Rajbahadur, Aaditya Bhatia, Haoxiang Zhang, Yihao Chen, Zhilong Chen, Arthur Leung, Dayi Lin, Boyuan Chen, Ahmed E. Hassan |
| 2025 | SSR: Safeguarding Staking Rewards by Defining and Detecting Logical Defects in DeFi Staking. Zewei Lin, Jiachi Chen, Jingwen Zhang, Zexu Wang, Yuming Feng, Weizhe Zhang, Zibin Zheng |
| 2025 | STaint: Detecting Second-Order Vulnerabilities in PHP Applications with LLM-Assisted Bi-Directional Static Taint Analysis. Yuchen Ji, Hongchen Cao, Jingzhu He |
| 2025 | SateLight: A Satellite Application Update Framework for Satellite Computing. Jinfeng Wen, Jianshu Zhao, Zixi Zhu, Xiaomin Zhang, Qi Liang, Ao Zhou, Shangguang Wang |
| 2025 | ScaleCirc: Scaling the Analysis over Circom Circuits. Jinan Jiang, Haoran Qin, Xiapu Luo |
| 2025 | Secure Transaction Semantics: Analysis, Vulnerability Detection, and Attack Modeling. Yixuan Liu |
| 2025 | Securing Millions of Decentralized Identities in Alipay Super App with End-to-End Formal Verification. Ziyu Mao, Xiaolin Ma, Lin Huang, Huan Yang, Wu Zhang, Weichao Sun, Yongtao Wang, Jingling Xue, Jingyi Wang |
| 2025 | Securing Self-Managed Third-Party Libraries. Xin Zhou, Jinwei Xu, He Zhang, Yanjing Yang, Lanxin Yang, Bohan Liu, Hongshan Tang |
| 2025 | Security Debt in LLM Agent Applications: A Measurement Study of Vulnerabilities and Mitigation Trade-offs. Zhuoxiang Shen, Jiarun Dai, Yuan Zhang, Min Yang |
| 2025 | Seeing is Fixing: Cross-Modal Reasoning with Multimodal LLMs for Visual Software Issue Repair. Kai Huang, Jian Zhang, Xiaofei Xie, Chunyang Chen |
| 2025 | SemGuard: Real-Time Semantic Evaluator for Correcting LLM-Generated Code. Qinglin Wang, Zhihong Sun, Ruyun Wang, Tao Huang, Zhi Jin, Ge Li, Chen Lyu |
| 2025 | Should We Evaluate LLM Based Security Analysis Approaches on Open Source Systems? Kohei Dozono, Jonas Engesser, Benjamin Hummel, Tobias Roehm, Alexander Pretschner |
| 2025 | Shrunk, Yet Complete: Code Shrinking-Resilient Android Third-Party Library Detection. Jingkun Zhang, Jingzheng Wu, Xiang Ling, Tianyue Luo, Bolin Zhou, Mutian Yang |
| 2025 | Sifting Truth from Coincidences: A Two-Stage Positive and Unlabeled Learning Model for Coincidental Correctness Detection. Chunyan Liu, Huan Xie, Yan Lei, Zhenyu Wu, Jinping Wang |
| 2025 | Simulated Interactive Debugging. Yannic Noller, Erick Chandra, Srinidhi Chandrashekar, Kenny T. W. Choo, Cyrille Jégourel, Oka Kurniawan, Christopher M. Poskitt |
| 2025 | SolContractEval: A Benchmark for Evaluating Contract-Level Solidity Code Generation. Zhifan Ye, Jiachi Chen, Zhenzhe Shao, Lingfeng Bao, Xiaohu Yang, Zhongxin Liu |
| 2025 | Soleker: Uncovering Vulnerabilities in Solana Smart Contracts. Kunsong Zhao, Yunpeng Tian, Zuchao Ma, Xiapu Luo |
| 2025 | Spec2Code: Mapping Protocol Specification to Function-Level Code Implementation. Yuekun Wang, Lili Quan, Xiaofei Xie, Junjie Wang, Jianjun Chen |
| 2025 | Speculative Automated Refactoring of Imperative Deep Learning Programs to Graph Execution. Raffi Khatchadourian, Tatiana Castro Vélez, Mehdi Bagherzadeh, Nan Jia, Anita Raja |
| 2025 | Spinner: Detecting Locking Violations in the eBPF Runtime. Priya Govindasamy, Joseph Bursey, Hsin-Wei Hung, Ardalan Amiri Sani |
| 2025 | StackPlagger: A System for Identifying AI-Code Plagiarism on Stack Overflow. Aman Swaraj, Harsh Goyal, Sumit Chadgal, Sandeep Kumar |
| 2025 | State Field Coverage: A Metric for Oracle Quality. Facundo Molina, Nazareno Aguirre, Alessandra Gorla |
| 2025 | Streamlining Acceptance Test Generation for Mobile Applications Through Large Language Models: An Industrial Case Study. Pedro Luís Fonseca, Bruno Lima, João Pascoal Faria |
| 2025 | Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems. Dany Moshkovich, Sergey Zeltyn |
| 2025 | TensorGuard: Gradient-Based Model Fingerprinting for LLM Similarity Detection and Family Classification. Zehao Wu, Yanjie Zhao, Haoyu Wang |
| 2025 | Tephra: Principled Discovery of Fuzzer Limitations. Vasil Sarafov, David Markvica, Stefan Brunthaler |
| 2025 | Terminator: Enabling Efficient Fuzzing of Closed-Source GUI Programs by Automatic Coverage-Guided Termination. Jonas Zabel, Philip Kolvenbach, Steven Arzt |
| 2025 | Testing Autonomous Driving Systems Through Blind-Spot Guided Fuzzing. Sali Moussa |
| 2025 | Tether: A Personalized Support Assistant for Software Engineers with ADHD. Aarsh Shah, Cleyton V. C. de Magalhães, Kiev Gama, Ronnie de Souza Santos |
| 2025 | The Cost of Downgrading Build Systems : A Case Study of Kubernetes. Gareema Ranjan, Mahmoud Alfadel, Gengyi Sun, Shane McIntosh |
| 2025 | The Fault in our Stats. Alexi Turcotte, Neev Nirav Mehta |
| 2025 | The Future of Software Transparency: Bridging Understanding, Measurement, and Practice. Gregorio Dalia, Annibale Panichella, Andrea Di Sorbo, Gerardo Canfora, Corrado Aaron Visaggio |
| 2025 | The Gold Digger in the Dark Forest: Industrial-Scale MEV Analysis in Ethereum. Ningyu He, Tianyang Chi, Xiaohui Hu, Haoyu Wang |
| 2025 | Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute. Yingwei Ma, Yongbin Li, Yihong Dong, Xue Jiang, Yanhao Li, Yue Liu, Rongyu Cao, Jue Chen, Fei Huang, Binhua Li |
| 2025 | Token Sugar: Making Source Code Sweeter for LLMs through Token-Efficient Shorthand. Zhensu Sun, Chengran Yang, Xiaoning Du, Zhou Yang, Li Li, David Lo |
| 2025 | Towards Automated Governance: A DSL for Human-Agent Collaboration in Software Projects. Adem Ait, Gwendal Jouneaux, Javier Luis Cánovas Izquierdo, Jordi Cabot |
| 2025 | Towards Context-aware Mobile Privacy Notice: Implementation of A Deployable Contextual Privacy Policies Generator. Haochen Gong, Zhen Tao, Shidong Pan, Zhenchang Xing, Xiaoyu Sun |
| 2025 | Towards Generalizable Instruction Vulnerability Prediction via LLM-Enhanced Code Representation. Bao Wen, Jingjing Gu, Jingxuan Zhang, Yang Liu, Pengfei Yu, Yanchao Zhao |
| 2025 | Towards More Accurate Static Analysis for Taint-Style Bug Detection in Linux Kernel. Haonan Li, Hang Zhang, Kexin Pei, Zhiyun Qian |
| 2025 | Towards Reliable LLM-based Exam Generation Lessons Learned and Open Challenges in an Industrial Project. Renzo Degiovanni, Jordi Cabot |
| 2025 | Training-Control-as-Code: Towards a declarative solution to control training. Padmanabha Venkatagiri Seshadri, Harikrishnan Balagopal, Mehant Kammakomati, Ashok Pon Kumar, Dushyant Behl |
| 2025 | TreeRanker: Fast and Model-Agnostic Ranking System for Code Suggestions in IDEs. Daniele Cipollone, Egor Bogomolov, Arie van Deursen, Maliheh Izadi |
| 2025 | Triangle: Empowering Incident Triage with Multi-Agent. Zhaoyang Yu, Aoyang Fang, Minghua Ma, Jaskaran Singh Walia, Chaoyun Zhang, Shu Chi, Ze Li, Murali Chintalapati, Xuchao Zhang, Rujia Wang, Chetan Bansal, Saravan Rajmohan, Qingwei Lin, Shenglin Zhang, Dan Pei, Pinjia He |
| 2025 | TrioXpert: An Automated Incident Management Framework for Microservice System. Yongqian Sun, Yu Luo, Xidao Wen, Yuan Yuan, Xiaohui Nie, Shenglin Zhang, Tong Liu, Xi Luo |
| 2025 | Tron: Fuzzing Linux Network Stack via Protocol-System Call Payload Synthesis. Qiang Zhang, Yifei Chu, Yuheng Shen, Jianzhong Liu, Heyuan Shi, Yu Jiang, Wanli Chang |
| 2025 | TrustVis: A Multi-Dimensional Trustworthiness Evaluation Framework for Large Language Models. Ruoyu Sun, Da Song, Jiayang Song, Yuheng Huang, Lei Ma |
| 2025 | Tuning LLM-based Code Optimization via Meta-Prompting: An Industrial Perspective. Jingzhi Gong, Rafail Giavrimis, Paul Brookes, Vardan Voskanyan, Fan Wu, Mari Ashiga, Matthew Truscott, Mike Basios, Leslie Kanthan, Jie Xu, Zheng Wang |
| 2025 | Uncovering Discrimination Clusters: Quantifying and Explaining Systematic Fairness Violations. Ranit Debnath Akash, Ashish Kumar, Verya Monjezi, Ashutosh Trivedi, Gang Tan, Saeid Tizpaz-Niari |
| 2025 | Uncovering Prompt Elements: Cloning System Prompts from Behavioral Traces. Yi Qian, Fei Peng, Hao Wu, Ligeng Chen, Bing Mao |
| 2025 | Uncovering Systematic Failures of LLMs in Verifying Code Against Natural Language Specifications. Haolin Jin, Huaming Chen |
| 2025 | Understanding Feature Request Practice on GitHub via a Large-Scale Empirical Study. Jiajun Li, Wenhua Yang, Minxue Pan, Yu Zhou |
| 2025 | Understanding Resource Injection Vulnerabilities in Kubernetes Ecosystems. Defang Bo, Jie Lu, Feng Li, Jingting Chen, Jinchen Wang, Chendong Yu, Yeting Li, Wei Huo |
| 2025 | Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories. Islem Bouzenia, Michael Pradel |
| 2025 | Understanding Uncertainty In LLMs. Chandan Kumar Sah |
| 2025 | Unit Test Update through LLM-Driven Context Collection and Error-Type-Aware Refinement. Yuanhe Zhang, Zhiquan Yang, Shengyi Pan, Zhongxin Liu |
| 2025 | United We Stand: Towards End-to-End Log-based Fault Diagnosis via Interactive Multi-Task Learning. Minghua He, Chiming Duan, Pei Xiao, Tong Jia, Siyu Yu, Lingzhe Zhang, Weijie Hong, Jin Han, Yifan Wu, Ying Li, Gang Huang |
| 2025 | Unlocking Reproducibility: Automating re-Build Process for Open-Source Software. Behnaz Hassanshahi, Trong Nhan Mai, Benjamin Selwyn-Smith, Nicholas Allen |
| 2025 | Unseen Data Detection using Routing Entropy in Mixture-of-Experts for Autonomous Vehicles. Sang In Lee, Donghwan Shin, Jihun Park |
| 2025 | Using Active Learning to Train Predictive Mutation Testing with Minimal Data. Miklos Borsi |
| 2025 | Using Fourier Analysis and Mutant Clustering to Accelerate DNN Mutation Testing. Ali Ghanbari, Sasan Tavakkol |
| 2025 | VERT: Polyglot Verified Equivalent Rust Transpilation with Large Language Models. Aidan Z. H. Yang, Yoshiki Takashima, Brandon Paulsen, Josiah Dodds, Daniel Kroening |
| 2025 | VRExplorer: A Model-based Approach for Semi-Automated Testing of Virtual Reality Scenes. Zhengyang Zhu, Hong-Ning Dai, Hanyang Guo, Zeqin Liao, Zibin Zheng |
| 2025 | VRTestSniffer: Test Smell Detector for Virtual Reality (VR) Software Projects. Faraz Gurramkonda, Avishak Chakroborty, Bruce Maxim, Mohamed Wiem Mkaouer, Foyzul Hassan |
| 2025 | VUSC: An Extensible Research Platform for Java-Based Static Analysis. Marc Miltenberger, Steven Arzt |
| 2025 | VeriExploit: Automatic Bug Reproduction in Smart Contracts via LLMs and Formal Methods. Chenfeng Wei, Shiyu Cai, Yiannis Charalambous, Tong Wu, Sangharatna Godboley, Lucas C. Cordeiro |
| 2025 | Verification and Classification of Exploits for Node.js Vulnerabilities. Sungmin Park |
| 2025 | Vessel: A Taxonomy of Reproducibility Issues for Container Images. Kevin Pitstick, Alex Derr, Lihan Zhan, Sebastián Echeverría |
| 2025 | Vul-R2: A Reasoning LLM for Automated Vulnerability Repair. Xin-Cheng Wen, Zirui Lin, Yijun Yang, Cuiyun Gao, Deheng Ye |
| 2025 | Vulnerability-Affected Versions Identification: How Far Are We? Xingchu Chen, Chengwei Liu, Jialun Cao, Yang Xiao, Xinyue Cai, Yeting Li, Jingyi Shi, Tianqi Sun, Haiming Chen, Wei Huo |
| 2025 | WEST: Specification-Based Test Generation for WebAssembly. Dongjun Youn, Wonho Shin, Sukyoung Ryu |
| 2025 | WIBE: Watermarks for generated Images - Benchmarking & Evaluation. Aleksey Yakushev, Aleksandr Akimenkov, Khaled Abud, Dmitry Obydenkov, Irina Serzhenko, Kirill Aistov, Egor Kovalev, Stanislav Fomin, Anastasia Antsiferova, Kirill Lukianov, Yury Markin |
| 2025 | Walk the Talk: Is Your Log-based Software Reliability Maintenance System Really Reliable? Minghua He, Tong Jia, Chiming Duan, Pei Xiao, Lingzhe Zhang, Kangjin Wang, Yifan Wu, Ying Li, Gang Huang |
| 2025 | Watson: A Cognitive Observability Framework for the Reasoning of LLM-Powered Agents. Benjamin Rombaut, Sogol Masoumzadeh, Kirill Vasilevski, Dayi Lin, Ahmed E. Hassan |
| 2025 | What Types of Code Review Comments Do Developers Most Frequently Resolve? Saul Goldman, Hong Yi Lin, Jirat Pasuksmit, Patanamon Thongtanunam, Chakkrit Kla Tantithamthavorn, Zhe Wang, Ray Zhang, Ali Behnaz, Fan Jiang, Michael Siers, Ryan Jiang, Mike Buller, Minwoo Jeong, Ming Wu |
| 2025 | What's DAT Smell? Untangling and Weaving the Disjoint Assertion Tangle Test Smell. Monil Narang, Hang Du, James A. Jones |
| 2025 | When Abstraction Breaks Physics: Rethinking Modular Design in Quantum Software. Jianjun Zhao |
| 2025 | When AllClose Fails: Round-Off Error Estimation for Deep Learning Programs. Qi Zhan, Xing Hu, Yuanyi Lin, Tongtong Xu, Xin Xia, Shanping Li |
| 2025 | When Autonomous Vehicle Meets V2X Cooperative Perception: How Far Are We? An Guo, Shuoxiao Zhang, Enyi Tang, Xinyu Gao, Haomin Pang, Haoxiang Tian, Yanzhou Mu, Wu Wen, Chunrong Fang, Zhenyu Chen |
| 2025 | When Control Flows Deviate: Directed Grey-box Fuzzing with Probabilistic Reachability Analysis. Peihong Lin, Pengfei Wang, Xu Zhou, Wei Xie, Xin Ren, Kai Lu |
| 2025 | When Does Wasm Malware Detection Fail? A Systematic Analysis of Their Robustness to Evasion. TaeYoung Kim, Sanghak Oh, Kiho Lee, Weihang Wang, Yonghwi Kwon, Sanghyun Hong, Hyoungshick Kim |
| 2025 | When Faster Isn't Greener: The Hidden Costs of LLM-Based Code Optimization. Tristan Coignion, Clément Quinton, Romain Rouvoy |
| 2025 | Which Is Better For Reducing Outdated and Vulnerable Dependencies: Pinning or Floatingƒ. Imranur Rahman, Jill Marley, William Enck, Laurie A. Williams |
| 2025 | Who's to Blame? Rethinking the Brittleness of Automated Web GUI Testing from a Pragmatic Perspective. Haonan Zhang, Kundi Yao, Zishuo Ding, Lizhi Liao, Weiyi Shang |
| 2025 | Why AI Agents Still Need You: Findings from Developer-Agent Collaborations in the Wild. Aayush Kumar, Yasharth Bajpai, Sumit Gulwani, Gustavo Soares, Emerson R. Murphy-Hill |
| 2025 | Why Is My Transaction Risky? Understanding Smart Contract Semantics and Interactions in the NFT Ecosystem. Yujing Chen, Xuanming Liu, Zhiyuan Wan, Zuobin Wang, David Lo, Difan Xie, Xiaohu Yang |
| 2025 | WingMuzz: Blackbox Testing of IoT Protocols via Two-dimensional Fuzzing Schedule. Xiaogang Zhu, Enze Dai, Xiaotao Feng, Shaohua Wang, Xin Xia, Sheng Wen, Kwok-Yan Lam, Yang Xiang |
| 2025 | Wired for Reuse: Automating Context-Aware Code Adaptation in IDEs via LLM-Based Agent. Taiming Wang, Yanjie Jiang, Chunhao Dong, Yuxia Zhang, Hui Liu |
| 2025 | XRintTest: An Automated Framework for User Interaction Testing in Extended Reality Applications. Ruizhen Gu, José Miguel Rojas, Donghwan Shin |
| 2025 | Your Build Scripts Stink: The State of Code Smells in Build Scripts. Mahzabin Tamanna, Yash Chandrani, Matthew Burrows, Brandon Wroblewski, Laurie A. Williams, Dominik Wermke |
| 2025 | ZendDiff: Differential Testing of PHP Interpreter. Yuancheng Jiang, Jianing Wang, Qiange Liu, Yeqi Fu, Jian Mao, Roland H. C. Yap, Zhenkai Liang |
| 2025 | evalSmarT: An LLM-Based Framework for Evaluating Smart Contract Generated Comments. Fatou Ndiaye Mbodji, Mame Marieme C. Sougoufara, Wendkûuni C. Ouédraogo, Alioune Diallo, Kui Liu, Jacques Klein, Tegawendé F. Bissyandé |
| 2025 | iCodeReviewer: Improving Secure Code Review with Mixture of Prompts. Yun Peng, Kisub Kim, Linghan Meng, Kui Liu |
| 2025 | iKnow: an Intent-Guided Chatbot for Cloud Operations with Retrieval-Augmented Generation. Junjie Huang, Yuedong Zhong, Guangba Yu, Zhihan Jiang, Minzhi Yan, Wenfei Luan, Tianyu Yang, Rui Ren, Michael R. Lyu |