| 2025 | "Is It Responsible?" Emerging Results on Comparing Guardrails for Harm Mitigation in LLM-Enhanced Software Applications. Manoel Veríssimo dos Santos Neto, Valdemar Vicente Graciano Neto, Arlindo Rodrigues Galvão Filho, Mohamad Kassab, Edson OliveiraJr |
| 2025 | A Defect Taxonomy for Infrastructure as Code: A Replication Study. Wendell Oliveira, Filipe Paiva, Thiago Emmanuel Pereira, João Brunet |
| 2025 | A Fully Automated Agent for End-to-End Code Translation and Validation. Eray Erer, Aysun Bozanta, Turgay Aytac, Ayse Basar |
| 2025 | A Preliminary Assessment of SLR's Reliance on Preprints, in the Area of LLMs4SE. Sarah Buckley, Abdul Razzaq, Michael English |
| 2025 | A Proposal on an AI-Based Framework for Software Defect Detection Using Multimodality in Software Industries. Shrabanti Kundu, Deepti Mishra, Alok Mishra |
| 2025 | A Vision for Debiasing Confirmation Bias in Software Testing via LLM. Iflaah Salman, Muhammad Waseem, Vladimir Mandic, Rasanjana Dhanushkha De Alwis |
| 2025 | ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2025, Honolulu, HI, USA, October 2-3, 2025 |
| 2025 | Aggregating Empirical Evidence from Data Strategies Studies: A Case on Model Quantization. Santiago del Rey, Paulo Sérgio Medeiros dos Santos, Guilherme Horta Travassos, Xavier Franch, Silverio Martínez-Fernández |
| 2025 | An Empirical Investigation into Maintenance of Load Testing Scripts. Ibuki Nakamura, Kosei Horikawa, Brittany Reid, Yutaro Kashiwa, Hajimu Iida |
| 2025 | Another Systematic Review? A Critical Analysis of Systematic Literature Reviews on Agile Effort and Cost Estimation. Henry Edison, Nauman Bin Ali |
| 2025 | Assessing Diversity in Creating Seed Set for Snowballing Search for Systematic Literature Review in Software Engineering. Kátia Romero Felizardo, Francisco Carlos M. Souza, Alinne Cristinne Corrêa Souza, Bianca Minetto Napoleão, Igor Steinmacher, Marco Aurélio Gerosa |
| 2025 | Beyond Binary Moderation: Identifying Fine-Grained Sexist and Misogynistic Behavior on GitHub with Large Language Models. Tanni Dev, Sayma Sultana, Amiangshu Bosu |
| 2025 | Beyond the Job Posting: What Hiring Managers Seek in Entry-Level Software Engineering Candidates. Spencer Baloga Loufek, Fabio Santos, Bianca Trinkenreich |
| 2025 | Can User Feedback Help Issue Detection? An Empirical Study on a One-Billion-User Online Service System. Shuyao Jiang, Jiazhen Gu, Wujie Zheng, Yangfan Zhou, Michael R. Lyu |
| 2025 | Cognitive Biases in Software Engineering: Debiasing Through Reconception. Heidi Hietala, Burak Turhan |
| 2025 | Contextual Code Retrieval for Commit Message Generation: A Preliminary Study. Bo Xiong, Linghao Zhang, Chong Wang, Peng Liang |
| 2025 | Contribution History as a Key Feature in OSS Task Recommendation: An LLM-Based Empirical Study. Md Abdul Hannan, Mohammad Habibullah Rakib, Khondaker Masfiq Reza, Fabio Santos |
| 2025 | Dealing with SonarQube Cloud: Initial Results from a Mining Software Repository Study. Sabato Nocera, Davide Fucci, Giuseppe Scanniello |
| 2025 | Developer Prompts in Practice: An Empirical Study of Bias, Security, and Optimization. Dhia Elhaq Rzig, Dhruba Jyothi Paul, Kaiser Pister, Jordan Henkel, Foyzul Hassan |
| 2025 | Empirical Insights into Microservice Language Heterogeneity in Practice. Amr S. Abdelfattah, Tomás Cerný, Marwa Elsayed |
| 2025 | Explainable AI for Identifying and Managing Test Debt in Automated Testing. Mahsa Radnejad |
| 2025 | Exploring Engagement in Hybrid Meetings. Daniela Grassi, Fabio Calefato, Darja Smite, Nicole Novielli, Filippo Lanubile |
| 2025 | Exploring LLMs for Stakeholder-Specific Insight Generation from Software Contracts. Jyoti S. Shukla, Aditya Kahol, Mohit Chaudhary, Preethu Rose Anish |
| 2025 | Exploring Large Language Models for Analyzing and Improving Method Names in Scientific Code. Gunnar Larsen, Carol Wong, Anthony Peruma |
| 2025 | Exploring the Evidence-Based SE Beliefs of Generative AI Tools. Chris Brown, Jason Cusati |
| 2025 | Exploring the Jupyter Ecosystem: An Empirical Study of Bugs and Vulnerabilities. Wenyuan Jiang, Diany Pressato, Harsh Darji, Thibaud Lutellier |
| 2025 | From Assessment to Enhancement of Pull Requests at Scale: Aligning Code Reviews with Developer Competencies Using Large Language Models. Luca Mariotto, Christian Medeiros Adriano, René Eichhorn, Daniel Burgstahler, Holger Giese |
| 2025 | Go-Oracle: Automated Test Oracle for Go Concurrency Bugs. Foivos Tsimpourlas, Chao Peng, Carlos Rosuero, Ping Yang, Ajitha Rajan |
| 2025 | How Do Programmers Evaluate AI-Generated Code? Samuli Määttä |
| 2025 | How Small is Enough? Empirical Evidence of Quantized Small Language Models for Automated Program Repair. Kazuki Kusama, Honglin Shu, Masanari Kondo, Yasutaka Kamei |
| 2025 | How do Community Smells Influence Self-Admitted Technical Debt in Machine Learning Projects? Shamse Tasnim Cynthia, Nuri Almarimi, Banani Roy |
| 2025 | Human Factors in Industrial Software Engineering - A Theory-Based Empirical Study Focused on Challenges in the Context of Software Quality and Cyber Resilience. Philipp A. Müller |
| 2025 | Identifier Name Similarities: An Exploratory Study. Carol Wong, Mai Abe, Silvia De Benedictis, Marissa Halim, Anthony Peruma |
| 2025 | Interrogative Comments Posed by Review Comment Generators: An Empirical Study of Gerrit. Farshad Kazemi, Maxime Lamothe, Shane McIntosh |
| 2025 | Invisible Risks, Visible Code: A Vision for Understanding Ethical Debt in AI-Based Coding. Dina Salah |
| 2025 | Is Diversity a Meaningful Metric in Fairness Testing? Kazuki Funamoto, Takashi Kitamura, Shingo Takada |
| 2025 | Is LLM-Generated Code More Maintainable & Reliable Than Human-Written Code? Alfred Santa Molison, Marcia Moraes, Glaucia Melo, Fabio Santos, Wesley K. G. Assunção |
| 2025 | Mapping Code Smells and Refactorings Accurately: Insights from an Empirical Study. Gautam Shetty, Tushar Sharma |
| 2025 | On the Harmfulness of Test Smells in Manual System Testing: A Controlled Experiment. Gabriela Soares, Vanessa Santos, Márcio Ribeiro, Luana Almeida Martins, Valeria Pontillo, Manoel Aranda III, Rohit Gheyi, Ivan Machado, Fabio Palomba |
| 2025 | One Size Does Not Fit All: How to Organize Hybrid Work in Agile Software Development? Fateme Broomandi, Emily Laue Christensen, Maria Paasivaara |
| 2025 | Perspectives, Needs and Challenges for Sustainable Software Engineering Teams: A FinServ Case Study. Satwik Ghanta, Peggy Gregory, Gül Çalikli |
| 2025 | Project OSCAR: Promoting CrOss-Cutting Digital Skills Through Europe-Wide Non-Conventional LeARning Experiences. Ilenia Fronza, Tommi Mikkonen |
| 2025 | ROSE: Transformer-Based Refactoring Recommendation for Architectural Smells. Samal Nursapa, Anastassiya Samuilova, Alessio Bucaioni, Phuong T. Nguyen |
| 2025 | Rethinking Code Review Workflows with LLM Assistance: An Empirical Study. Fannar Steinn Aðalsteinsson, Björn Borgar Magnússon, Mislav Milicevic, Adam Nirving Davidsson, Chih-Hong Cheng |
| 2025 | Robust or Overfitted? Investigating the Generalization of Pretrained Models in Requirement Classification. Farha Kamal, Md Rakibul Islam |
| 2025 | SESR-Eval: Dataset for Evaluating LLMs in the Title-Abstract Screening of Systematic Reviews. Aleksi Huotala, Miikka Kuutila, Mika Mäntylä |
| 2025 | SIExVulTS: Sensitive Information Exposure Vulnerability Detection System Using Transformer Models and Static Analysis. Kyler Katz, Sara Moshtari, Ibrahim Mujhid, Mehdi Mirakhorli, Derek Garcia |
| 2025 | Secret Breach Detection in Source Code with Large Language Models. Md Nafiu Rahman, Sadif Ahmed, Zahin Wahab, S. M. Sohan, Rifat Shahriyar |
| 2025 | Secure Software Engineering Through Sensible AutoMation (SESAM). Davide Fucci |
| 2025 | The Shifting Sands of Toxicity: The Evolving Nature of Interpersonal Challenges in Open Source. Sarthak Bharadwaj, Fabio Santos, Bianca Trinkenreich |
| 2025 | Threat Modeling for Large Language Model-Integrated Applications (Thremolia). Felix Viktor Jedrzejewski, Oleksandr Adamov, Davide Fucci |
| 2025 | Toward Real-Time Intrusion Detection for Autonomous Vehicles: A Vision for Deep Learning-Based Security Frameworks. Damiano Torre, Amirpasha Javid |
| 2025 | Understanding Everything as Code: A Taxonomy and Conceptual Model. Haoran Wei, Nazim H. Madhavji, John Steinbacher |
| 2025 | Using Voting and Stacking Ensemble Techniques to Optimize Software Requirements Classification. María Isabel Limaylla Lunarejo, Nelly Condori-Fernández, Miguel R. Luaces |
| 2025 | We Know What You're Looking For: Recommendation for Large-Scale Open Source Software. Xing Cui, Jingzheng Wu, Xiang Ling, Tianyue Luo |
| 2025 | What About Our Bug? A Study on the Responsiveness of NPM Package Maintainers. Mohammadreza Saeidi, Ethan Thoma, Raula Gaikovina Kula, Gema Rodríguez-Pérez |
| 2025 | When Domains Collide: An Activity Theory Exploration of Cross-Disciplinary Collaboration. Zixuan Feng, Thomas Zimmermann, Lorenzo Pisani, Christopher Gooley, Jeremiah Wander, Anita Sarma |
| 2025 | When Retriever Meets Generator: A Joint Model for Code Comment Generation. Tien P. T. Le, Anh M. T. Bui, Huy N. D. Pham, Alessio Bucaioni, Phuong T. Nguyen |
| 2025 | Where Tests Fall Short: Empirically Analyzing Oracle Gaps in Covered Code. Megan Maton, Gregory M. Kapfhammer, Phil McMinn |