| 2023 | 20th IEEE/ACM International Conference on Mining Software Repositories, MSR 2023, Melbourne, Australia, May 15-16, 2023 |
| 2023 | A Dataset of Bot and Human Activities in GitHub. Natarajan Chidambaram, Alexandre Decan, Tom Mens |
| 2023 | A Large Scale Analysis of Semantic Versioning in NPM. Donald Pinckney, Federico Cassano, Arjun Guha, Jonathan Bell |
| 2023 | A Study of Gender Discussions in Mobile Apps. Mojtaba Shahin, Mansooreh Zahedi, Hourieh Khalajzadeh, Ali Rezaei Nasab |
| 2023 | An Empirical Study of High Performance Computing (HPC) Performance Bugs. Md. Abul Kalam Azad, Nafees Iqbal, Foyzul Hassan, Probir Roy |
| 2023 | An Empirical Study on the Performance of Individual Issue Label Prediction. Jueun Heo, Seonah Lee |
| 2023 | An Empirical Study to Investigate Collaboration Among Developers in Open Source Software (OSS). Weijie Sun, Samuel Iwuchukwu, Abdul Ali Bangash, Abram Hindle |
| 2023 | An Exploratory Study on Energy Consumption of Dataframe Processing Libraries. Shriram Shanbhag, Sridhar Chimalakonda |
| 2023 | Are We Speeding Up or Slowing Down? On Temporal Aspects of Code Velocity. Gunnar Kudrjavets, Nachiappan Nagappan, Ayushi Rastogi |
| 2023 | AutoML from Software Engineering Perspective: Landscapes and Challenges. Chao Wang, Zhenpeng Chen, Minghui Zhou |
| 2023 | Automating Arduino Programming: From Hardware Setups to Sample Source Code Generation. Imam Nur Bani Yusuf, Diyanah Binte Abdul Jamal, Lingxiao Jiang |
| 2023 | Boosting Just-in-Time Defect Prediction with Specific Features of C/C++ Programming Languages in Code Changes. Chao Ni, Xiaodan Xu, Kaiwen Yang, David Lo |
| 2023 | CLEAN++: Code Smells Extraction for C++. Tom Mashiach, Bruno Sotto-Mayor, Gal A. Kaminka, Meir Kalech |
| 2023 | Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries. Nima Shiri Harzevili, Jiho Shin, Junjie Wang, Song Wang, Nachiappan Nagappan |
| 2023 | Connecting the .dotfiles: Checked-In Secret Exposure with Extra (Lateral Movement) Steps. Gerhard Jungwirth, Aakanksha Saha, Michael Schröder, Tobias Fiebig, Martina Lindorfer, Jürgen Cito |
| 2023 | Control and Data Flow in Security Smell Detection for Infrastructure as Code: Is It Worth the Effort? Ruben Opdebeeck, Ahmed Zerouali, Coen De Roover |
| 2023 | Cross-Domain Evaluation of a Deep Learning-Based Type Inference System. Bernd Gruner, Tim Sonnekalb, Thomas S. Heinze, Clemens-Alexander Brust |
| 2023 | DACOS - A Manually Annotated Dataset of Code Smells. Himesh Nandani, Mootez Saad, Tushar Sharma |
| 2023 | DGMF: Fast Generation of Comparable, Updatable Dependency Graphs for Software Repositories. Tobias Litzenberger, Johannes Düsing, Ben Hermann |
| 2023 | Dealing with Popularity Bias in Recommender Systems for Third-party Libraries: How far Are We? Phuong T. Nguyen, Riccardo Rubei, Juri Di Rocco, Claudio Di Sipio, Davide Di Ruscio, Massimiliano Di Penta |
| 2023 | DeepScenario: An Open Driving Scenario Dataset for Autonomous Driving System Testing. Chengjie Lu, Tao Yue, Shaukat Ali |
| 2023 | Defectors: A Large, Diverse Python Dataset for Defect Prediction. Parvez Mahbub, Ohiduzzaman Shuvo, Mohammad Masudur Rahman |
| 2023 | Determining Open Source Project Boundaries. Sophia Vargas |
| 2023 | Do Subjectivity and Objectivity Always Agreeƒ A Case Study with Stack Overflow Questions. Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy |
| 2023 | DocMine: A Software Documentation-Related Dataset of 950 GitHub Repositories. Akhila Sri Manasa Venigalla, Sridhar Chimalakonda |
| 2023 | Don't Forget the Exception! : Considering Robustness Changes to Identify Design Problems. Anderson Oliveira, João Lucas Correia, Leonardo da Silva Sousa, Wesley K. G. Assunção, Daniel Coutinho, Alessandro F. Garcia, Willian Nalepa Oizumi, Caio Barbosa, Anderson G. Uchôa, Juliana Alves Pereira |
| 2023 | EGAD: A moldable tool for GitHub Action analysis. Pablo Valenzuela-Toledo, Alexandre Bergel, Timo Kehrer, Oscar Nierstrasz |
| 2023 | Enabling Analysis and Reasoning on Software Systems through Knowledge Graph Representation. Satrio Adi Rukmono, Michel R. V. Chaudron |
| 2023 | Energy Consumption Estimation of API-usage in Smartphone Apps via Static Analysis. Abdul Ali Bangash, Kalvin Eng, Jamal Qasim, Karim Ali, Abram Hindle |
| 2023 | Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study. Tim van Dam, Maliheh Izadi, Arie van Deursen |
| 2023 | Evaluating Software Documentation Quality. Henry Tang, Sarah Nadi |
| 2023 | Evolution of the Practice of Software Testing in Java Projects. Anisha Islam, Nipuni Tharushika Hewage, Abdul Ali Bangash, Abram Hindle |
| 2023 | Feature Toggle Usage Patterns: A Case Study on Google Chromium. Md Tajmilur Rahman |
| 2023 | GIRT-Data: Sampling GitHub Issue Report Templates. Nafiseh Nikeghbal, Amir Hossein Kargaran, Abbas Heydarnoori, Hinrich Schütze |
| 2023 | GitHub OSS Governance File Dataset. Yibo Yan, Seth Frey, Amy X. Zhang, Vladimir Filkov, Likang Yin |
| 2023 | GiveMeLabeledIssues: An Open Source Issue Recommendation System. Joseph Vargovich, Fabio Santos, Jacob Penney, Marco Aurélio Gerosa, Igor Steinmacher |
| 2023 | HasBugs - Handpicked Haskell Bugs. Leonhard Applis, Annibale Panichella |
| 2023 | Helm Charts for Kubernetes Applications: Evolution, Outdatedness and Security Risks. Ahmed Zerouali, Ruben Opdebeeck, Coen De Roover |
| 2023 | Improving Agile Planning for Reliable Software Delivery. Jirat Pasuksmit, Fan Jiang, Kemp Thornton, Arik Friedman, Natalija Fuksmane, Isabelle Kohout, Julian Connor |
| 2023 | Insights into Female Contributions in Open-Source Projects. Arifa I. Champa, Md. Fazle Rabbi, Minhaz F. Zibran, Md. Rakibul Islam |
| 2023 | Intertwining Communities: Exploring Libraries that Cross Software Ecosystems. Kanchanok Kannee, Raula Gaikovina Kula, Supatsara Wattanakriengkrai, Kenichi Matsumoto |
| 2023 | Investigating the Resolution of Vulnerable Dependencies with Dependabot Security Updates. Hamid Mohayeji, Andrei Agaronian, Eleni Constantinou, Nicola Zannone, Alexander Serebrenik |
| 2023 | Keep the Ball Rolling: Analyzing Release Cadence in GitHub Projects. Oz Kilic, Nathaniel Bowness, Olga Baysal |
| 2023 | LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations. Catherine Tony, Markus Mutas, Nicolás E. Díaz Ferreyra, Riccardo Scandariato |
| 2023 | Large Language Models and Simple, Stupid Bugs. Kevin Jesse, Toufique Ahmed, Premkumar T. Devanbu, Emily Morgan |
| 2023 | MANDO-HGT: Heterogeneous Graph Transformers for Smart Contract Vulnerability Detection. Hoang H. Nguyen, Nhat-Minh Nguyen, Chunyao Xie, Zahra Ahmadi, Daniel Kudendo, Thanh-Nam Doan, Lingxiao Jiang |
| 2023 | Method Chaining Redux: An Empirical Study of Method Chaining in Java, Kotlin, and Python. Ali M. Keshk, Robert Dyer |
| 2023 | Model-Agnostic Syntactical Information for Pre-Trained Programming Language Models. Iman Saberi, Fatemeh H. Fard |
| 2023 | NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python. Ratnadira Widyasari, Zhou Yang, Ferdian Thung, Sheng Qin Sim, Fiona Wee, Camellia Lok, Jack Phan, Haodi Qi, Constance Tan, Qijin Tay, David Lo |
| 2023 | On Codex Prompt Engineering for OCL Generation: An Empirical Study. Seif Abukhalaf, Mohammad Hamdaqa, Foutse Khomh |
| 2023 | Optimizing Duplicate Size Thresholds in IDEs. Konstantin Grotov, Sergey Titov, Alexandr Suhinin, Yaroslav Golubev, Timofey Bryksin |
| 2023 | PENTACET data - 23 Million Contextual Code Comments and 250,000 SATD comments. Murali Sridharan, Leevi Rantala, Mika Mäntylä |
| 2023 | PTMTorrent: A Dataset for Mining Open-source Pre-trained Model Packages. Wenxin Jiang, Nicholas Synovic, Purvish Jajal, Taylor R. Schorlemmer, Arav Tewari, Bhavesh Pareek, George K. Thiruvathukal, James C. Davis |
| 2023 | Phylogenetic Analysis of Reticulate Software Evolution. Akira Mori, Masatomo Hashimoto |
| 2023 | Picaso: Enhancing API Recommendations with Relevant Stack Overflow Posts. Ivana Clairine Irsan, Ting Zhang, Ferdian Thung, Kisub Kim, David Lo |
| 2023 | Pre-trained Model Based Feature Envy Detection. Wenhao Ma, Yaoxiang Yu, Xiaoming Ruan, Bo Cai |
| 2023 | PyMigBench: A Benchmark for Python Library Migration. Mohayeminul Islam, Ajay Kumar Jha, Sarah Nadi, Ildar Akhmetov |
| 2023 | SecretBench: A Dataset of Software Secrets. Setu Kumar Basak, Lorenzo Neil, Bradley Reaves, Laurie A. Williams |
| 2023 | Semantically-enriched Jira Issue Tracking Data. Themistoklis Diamantopoulos, Dimitrios-Nikitas Nastos, Andreas L. Symeonidis |
| 2023 | She Elicits Requirements and He Tests: Software Engineering Gender Bias in Large Language Models. Christoph Treude, Hideaki Hata |
| 2023 | Snapshot Testing Dataset. Emily Bui, Henrique Rocha |
| 2023 | State of Refactoring Adoption: Better Understanding Developer Perception of Refactoring. Eman Abdullah AlOmar |
| 2023 | Tell Me Who Are You Talking to and I Will Tell You What Issues Need Your Skills. Fabio Santos, Jacob Penney, João Felipe Pimentel, Igor Wiese, Igor Steinmacher, Marco Aurélio Gerosa |
| 2023 | The ABLoTS Approach for Bug Localization: is it replicable and generalizable? Feifei Niu, Christoph Mayr-Dorn, Wesley K. G. Assunção, Liguo Huang, Jidong Ge, Bin Luo, Alexander Egyed |
| 2023 | The Atlassian Data Lake: consolidating enriched software development data in a single, queryable system. Arik Friedman, Rohan Dhupelia, Ben Jackson |
| 2023 | The Secret Life of CVEs. Piotr Przymus, Mikolaj Fejzer, Jakub Narebski, Krzysztof Stencel |
| 2023 | TypeScript's Evolution: An Analysis of Feature Adoption Over Time. Joshua D. Scarsbrook, Mark Utting, Ryan K. L. Ko |
| 2023 | UnGoML: Automated Classification of unsafe Usages in Go. Anna-Katharina Wickert, Clemens Damke, Lars Baumgärtner, Eyke Hüllermeier, Mira Mezini |
| 2023 | Understanding the Role of Images on Stack Overflow. Dong Wang, Tao Xiao, Christoph Treude, Raula Gaikovina Kula, Hideaki Hata, Yasutaka Kamei |
| 2023 | Understanding the Time to First Response in GitHub Pull Requests. Kazi Amit Hasan, Marcos Macedo, Yuan Tian, Bram Adams, Steven H. H. Ding |
| 2023 | Unveiling the Relationship Between Continuous Integration and Code Coverage. Diego Saraiva, Daniel Alencar da Costa, Uirá Kulesza, Gustavo Sizílio, José Gameleira Neto, Roberta Coelho, Meiyappan Nagappan |
| 2023 | Wasmizer: Curating WebAssembly-driven Projects on GitHub. Alexander Nicholson, Quentin Stiévenart, Arash Mazidi, Mohammad Ghafari |
| 2023 | What Do Users Ask in Open-Source AI Repositories? An Empirical Study of GitHub Issues. Zhou Yang, Chenyu Wang, Jieke Shi, Thong Hoang, Pavneet Singh Kochhar, Qinghua Lu, Zhenchang Xing, David Lo |
| 2023 | What Happens When We Fuzz? Investigating OSS-Fuzz Bug History. Brandon N. Keller, Benjamin S. Meyers, Andrew Meneely |
| 2023 | What Warnings Do Engineers Really Fix? The Compiler That Cried Wolf. Gunnar Kudrjavets, Aditya Kumar, Ayushi Rastogi |
| 2023 | Whistleblowing and Tech on Twitter. Laura Duits, Isha Kashyap, Joey Bekkink, Kousar Aslam, Emitzá Guzmán |
| 2023 | microSecEnD: A Dataset of Security-Enriched Dataflow Diagrams for Microservice Applications. Simon Schneider, Tufan Özen, Michael Chen, Riccardo Scandariato |