| 2019 | A benchmark of data loss bugs for Android apps. Oliviero Riganelli, Marco Mobilio, Daniela Micucci, Leonardo Mariani |
| 2019 | A data set of program invariants and error paths. Dirk Beyer |
| 2019 | A dataset of non-functional bugs. Aida Radu, Sarah Nadi |
| 2019 | A dataset of parametric cryptographic misuses. Anna-Katharina Wickert, Michael Reif, Michael Eichberg, Anam Dodhy, Mira Mezini |
| 2019 | A large-scale study about quality and reproducibility of jupyter notebooks. João Felipe Pimentel, Leonardo Murta, Vanessa Braganholo, Juliana Freire |
| 2019 | A manually-curated dataset of fixes to vulnerabilities of open-source software. Serena Elisa Ponta, Henrik Plate, Antonino Sabetta, Michele Bezzi, Cédric Dangremont |
| 2019 | A panel data set of cryptocurrency development activity on GitHub. Rijnard van Tonder, Asher Trockman, Claire Le Goues |
| 2019 | An empirical history of permission requests and mistakes in open source Android apps. Gian Luca Scoccia, Anthony Peruma, Virginia Pujols, Ben Christians, Daniel E. Krutz |
| 2019 | An empirical study of multiple names and email addresses in OSS version control repositories. Jiaxin Zhu, Jun Wei |
| 2019 | Analyzing comment-induced updates on stack overflow. Abhishek Soni, Sarah Nadi |
| 2019 | Assessing diffusion and perception of test smells in scala projects. Jonas De Bleser, Dario Di Nucci, Coen De Roover |
| 2019 | Automated software vulnerability assessment with concept drift. Triet Huynh Minh Le, Bushra Sabir, Muhammad Ali Babar |
| 2019 | Automatically generating documentation for lambda expressions in Java. Anwar Alqaimi, Patanamon Thongtanunam, Christoph Treude |
| 2019 | Beyond GumTree: a hybrid approach to generate edit scripts. Junnosuke Matsumoto, Yoshiki Higo, Shinji Kusumoto |
| 2019 | Boa meets python: a boa dataset of data science software in python language. Sumon Biswas, Md Johirul Islam, Yijia Huang, Hridesh Rajan |
| 2019 | Can duplicate questions on stack overflow benefit the software development community? Durham Abric, Oliver E. Clark, Matthew Caminiti, Keheliya Gallaba, Shane McIntosh |
| 2019 | Can issues reported at stack overflow questions be reproduced?: an exploratory study. Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy |
| 2019 | Challenges with responding to static analysis tool alerts. Nasif Imtiaz, Akond Rahman, Effat Farhana, Laurie A. Williams |
| 2019 | Characterizing duplicate code snippets between stack overflow and tutorials. Manziba Akanda Nishi, Agnieszka Ciborowska, Kostadin Damevski |
| 2019 | Characterizing the roles of contributors in open-source scientific software projects. Reed Milewicz, Gustavo Pinto, Paige Rodeghero |
| 2019 | Cleaning StackOverflow for machine translation. Musfiqur Rahman, Peter C. Rigby, Dharani Palani, Tien N. Nguyen |
| 2019 | ConPan: a tool to analyze packages in software containers. Ahmed Zerouali, Valerio Cosentino, Gregorio Robles, Jesús M. González-Barahona, Tom Mens |
| 2019 | Cross-language clone detection by learning over abstract syntax trees. Daniel Perez, Shigeru Chiba |
| 2019 | Crossflow: a framework for distributed mining of software repositories. Dimitris S. Kolovos, Patrick Neubauer, Konstantinos Barmpis, Nicholas Matragkas, Richard F. Paige |
| 2019 | Data-driven solutions to detect API compatibility issues in Android: an empirical study. Simone Scalabrino, Gabriele Bavota, Mario Linares-Vásquez, Michele Lanza, Rocco Oliveto |
| 2019 | DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction. Thong Hoang, Hoa Khanh Dam, Yasutaka Kamei, David Lo, Naoyasu Ubayashi |
| 2019 | Dependency versioning in the wild. Jens Dietrich, David J. Pearce, Jacob Stringer, Amjed Tahir, Kelly Blincoe |
| 2019 | Does UML modeling associate with lower defect proneness?: a preliminary empirical investigation. Adithya Raghuraman, Truong Ho-Quang, Michel R. V. Chaudron, Alexander Serebrenik, Bogdan Vasilescu |
| 2019 | Empirical study in using version histories for change risk classification. Max Kiehn, Xiangyi Pan, Fatih Camci |
| 2019 | Exploratory study of slack Q&A chats as a mining source for software engineering tools. Preetha Chatterjee, Kostadin Damevski, Lori L. Pollock, Vinay Augustine, Nicholas A. Kraft |
| 2019 | Exploring word embedding techniques to improve sentiment analysis of software engineering texts. Eeshita Biswas, K. Vijay-Shanker, Lori L. Pollock |
| 2019 | Extracting API tips from developer question and answer websites. Shaohua Wang, NhatHai Phan, Yan Wang, Yong Zhao |
| 2019 | Generating commit messages from diffs using pointer-generator network. Qin Liu, Zihe Liu, Hongming Zhu, Hongfei Fan, Bowen Du, Yu Qian |
| 2019 | GreenHub farmer: real-world data for Android energy mining. Hugo Matalonga, Bruno Cabral, Fernando Castor, Marco Couto, Rui Pereira, Simão Melo de Sousa, João Paulo Fernandes |
| 2019 | GreenSource: a large-scale collection of Android code, tests and energy metrics. Rui Rua, Marco Couto, João Saraiva |
| 2019 | How often and what StackOverflow posts do developers reference in their GitHub projects? Saraj Singh Manes, Olga Baysal |
| 2019 | Identifying experts in software libraries and frameworks among GitHub users. João Eduardo Montandon, Luciana Lourdes Silva, Marco Túlio Valente |
| 2019 | Impact of stack overflow code snippets on software cohesion: a preliminary study. Mashal Ahmad, Mel Ó Cinnéide |
| 2019 | Impacts of daylight saving time on software development. Junichi Hayashi, Yoshiki Higo, Shinsuke Matsumoto, Shinji Kusumoto |
| 2019 | Import2vec learning embeddings for software libraries. Bart Theeten, Frederik Vandeputte, Tom Van Cutsem |
| 2019 | Investigating next steps in static API-misuse detection. Sven Amann, Hoan Anh Nguyen, Sarah Nadi, Tien N. Nguyen, Mira Mezini |
| 2019 | Lessons learned from using a deep tree-based model for software defect prediction in practice. Hoa Khanh Dam, Trang Pham, Shien Wee Ng, Truyen Tran, John C. Grundy, Aditya Ghose, Taeksu Kim, Chul-Joo Kim |
| 2019 | Man vs machine: a study into language identification of stack overflow code snippets. Jens Dietrich, Markus Luczak-Rösch, Elroy Dalefield |
| 2019 | Mining rule violations in JavaScript code snippets. Uriel Campos, Guilherme Smethurst, João Pedro Moraes, Rodrigo Bonifácio, Gustavo Pinto |
| 2019 | Negative results on mining crypto-API usage rules in Android apps. Jun Gao, Pingfan Kong, Li Li, Tegawendé F. Bissyandé, Jacques Klein |
| 2019 | On the effectiveness of manual and automatic unit test generation: ten years later. Domenico Serra, Giovanni Grano, Fabio Palomba, Filomena Ferrucci, Harald C. Gall, Alberto Bacchelli |
| 2019 | PathMiner: a library for mining of path-based representations of code. Vladimir Kovalenko, Egor Bogomolov, Timofey Bryksin, Alberto Bacchelli |
| 2019 | Predicting co-changes between functionality specifications and source code in behavior driven development. Aidan Z. H. Yang, Daniel Alencar da Costa, Ying Zou |
| 2019 | Predicting good configurations for GitHub and stack overflow topic models. Christoph Treude, Markus Wagner |
| 2019 | Proceedings of the 16th International Conference on Mining Software Repositories, MSR 2019, 26-27 May 2019, Montreal, Canada. Margaret-Anne D. Storey, Bram Adams, Sonia Haiduc |
| 2019 | Python coding style compliance on stack overflow. Nikolaos Bafatakis, Niels Boecker, Wenjie Boon, Martin Cabello Salazar, Jens Krinke, Gazi Oznacar, Robert White |
| 2019 | RapidRelease: a dataset of projects and issues on github with rapid releases. Saket Dattatray Joshi, Sridhar Chimalakonda |
| 2019 | Recommending energy-efficient Java collections. Wellington Oliveira, Renato O. Santos, Fernando Castor, Benito Fernandes, Gustavo Pinto |
| 2019 | RmvDroid: towards a reliable Android malware dataset with app metadata. Haoyu Wang, Junjun Si, Hao Li, Yao Guo |
| 2019 | SCOR: source code retrieval with semantics and order. Shayan A. Akbar, Avinash C. Kak |
| 2019 | SOTorrent: studying the origin, evolution, and usage of stack overflow code snippets. Sebastian Baltes, Christoph Treude, Stephan Diehl |
| 2019 | STRAIT: a tool for automated software reliability growth analysis. Stanislav Chren, Radoslav Micko, Barbora Buhnova, Bruno Rossi |
| 2019 | STYLE-ANALYZER: fixing code style inconsistencies with interpretable unsupervised algorithms. Vadim Markovtsev, Waren Long, Hugo Mougard, Konstantin Slavnov, Egor Bulychev |
| 2019 | Scalable software merging studies with MergAnser. Moein Owhadi-Kareshk, Sarah Nadi |
| 2019 | SeSaMe: a data set of semantically similar Java methods. Marius Kamp, Patrick Kreutzer, Michael Philippsen |
| 2019 | Semantic source code models using identifier embeddings. Vasiliki Efstathiou, Diomidis Spinellis |
| 2019 | Snakes in paradise?: insecure python-related coding practices in stack overflow. Akond Rahman, Effat Farhana, Nasif Imtiaz |
| 2019 | Snoring: a noise in defect prediction datasets. Aalok Ahluwalia, Davide Falessi, Massimiliano Di Penta |
| 2019 | Splitting APIs: an exploratory study of software unbundling. Anderson S. Matos, João Bosco Ferreira Filho, Lincoln S. Rocha |
| 2019 | Standing on shoulders or feet?: the usage of the MSR data papers. Zoe Kotti, Diomidis Spinellis |
| 2019 | Striking gold in software repositories?: an econometric study of cryptocurrencies on GitHub. Asher Trockman, Rijnard van Tonder, Bogdan Vasilescu |
| 2019 | Test coverage in python programs. Hongyu Zhai, Casey Casalnuovo, Premkumar T. Devanbu |
| 2019 | The emergence of software diversity in maven central. César Soto-Valero, Amine Benelallam, Nicolas Harrand, Olivier Barais, Benoit Baudry |
| 2019 | The impact of systematic edits in history slicing. Ryosuke Funaki, Shinpei Hayashi, Motoshi Saeki |
| 2019 | The maven dependency graph: a temporal graph-based representation of maven central. Amine Benelallam, Nicolas Harrand, César Soto-Valero, Benoit Baudry, Olivier Barais |
| 2019 | The rise of Android code smells: who is to blame? Sarra Habchi, Naouel Moha, Romain Rouvoy |
| 2019 | The software heritage graph dataset: public software development under one roof. Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli |
| 2019 | Time present and time past: analyzing the evolution of JavaScript code in the wild. Dimitris Mitropoulos, Panos Louridas, Vitalis Salis, Diomidis Spinellis |
| 2019 | Towards mining answer edits to extract evolution patterns in stack overflow. Themistoklis Diamantopoulos, Maria-Ioanna Sifaki, Andreas L. Symeonidis |
| 2019 | Tracing back log data to its log statement: from research to practice. Daan Schipper, Maurício Finavaro Aniche, Arie van Deursen |
| 2019 | We need to talk about microservices: an analysis from the discussions on StackOverflow. Alan Bandeira, Carlos Alberto Medeiros, Matheus Paixão, Paulo Henrique M. Maia |
| 2019 | What do developers know about machine learning: a study of ML discussions on StackOverflow. Abdul Ali Bangash, Hareem Sahar, Shaiful Alam Chowdhury, Alexander William Wong, Abram Hindle, Karim Ali |
| 2019 | What edits are done on the highly answered questions in stack overflow?: an empirical study. Xianhao Jin, Francisco Servant |
| 2019 | World of code: an infrastructure for mining the universe of open source VCS data. Yuxing Ma, Chris Bogart, Sadika Amreen, Russell Zaretzki, Audris Mockus |
| 2019 | git2net: mining time-stamped co-editing networks from large git repositories. Christoph Gote, Ingo Scholtes, Frank Schweitzer |