MSR A

75 papers

YearTitle / Authors
202020-MAD: 20 Years of Issues and Commits of Mozilla and Apache Development.
Maëlick Claes, Mika V. Mäntylä
2020A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries.
Jiahao Fan, Yi Li, Shaohua Wang, Tien N. Nguyen
2020A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits.
Audris Mockus, Diomidis Spinellis, Zoe Kotti, Gabriel John Dusing
2020A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits.
Tanner Fry, Tapajit Dey, Andrey Karnauch, Audris Mockus
2020A Dataset for GitHub Repository Deduplication.
Diomidis Spinellis, Zoe Kotti, Audris Mockus
2020A Dataset of Dockerfiles.
Jordan Henkel, Christian Bird, Shuvendu K. Lahiri, Thomas W. Reps
2020A Dataset of Enterprise-Driven Open Source Software.
Diomidis Spinellis, Zoe Kotti, Konstantinos Kravvaritis, Georgios Theodorou, Panos Louridas
2020A Large-Scale Comparative Evaluation of IR-Based Tools for Bug Localization.
Shayan A. Akbar, Avinash C. Kak
2020A Machine Learning Approach for Vulnerability Curation.
Yang Chen, Andrew E. Santosa, Ming Yi Ang, Abhishek Sharma, Asankhaya Sharma, David Lo
2020A Mixed Graph-Relational Dataset of Socio-technical Interactions in Open Source Systems.
Usman Ashraf, Christoph Mayr-Dorn, Alexander Egyed, Sebastiano Panichella
2020A Soft Alignment Model for Bug Deduplication.
Irving Muller Rodrigues, Daniel Aloise, Eraldo Rezende Fernandes, Michel R. Dagenais
2020A Study of Potential Code Borrowing and License Violations in Java Projects on GitHub.
Yaroslav Golubev, Maria Eliseeva, Nikita Povarov, Timofey Bryksin
2020A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming Screencasts.
Abdulkarim Khormi, Mohammad Alahmadi, Sonia Haiduc
2020AIMMX: Artificial Intelligence Model Metadata Extractor.
Jason Tsay, Alan Braz, Martin Hirzel, Avraham Shinnar, Todd W. Mummert
2020An Empirical Study of Build Failures in the Docker Context.
Yiwen Wu, Yang Zhang, Tao Wang, Huaimin Wang
2020An Empirical Study of Method Chaining in Java.
Tomoki Nakamaru, Tomomasa Matsunaga, Tetsuro Yamazaki, Soramichi Akiyama, Shigeru Chiba
2020An Empirical Study on Regular Expression Bugs.
Peipei Wang, Chris Brown, Jamie A. Jennings, Kathryn T. Stolee
2020An Empirical Study on the Impact of Deimplicitization on Comprehension in Programs Using Application Frameworks.
Jürgen Cito, Jiasi Shen, Martin C. Rinard
2020An Exploratory Study to Find Motives Behind Cross-platform Forks from Software Heritage Dataset.
Avijit Bhattacharjee, Sristy Sumana Nath, Shurui Zhou, Debasish Chakroborti, Banani Roy, Chanchal K. Roy, Kevin A. Schneider
2020AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research Community.
Pei Liu, Li Li, Yanjie Zhao, Xiaoyu Sun, John Grundy
2020Automatically Granted Permissions in Android apps: An Empirical Study on their Prevalence and on the Potential Threats for Privacy.
Paolo Calciati, Konstantin Kuznetsov, Alessandra Gorla, Andreas Zeller
2020Behind the Intents: An In-depth Empirical Study on Software Refactoring in Modern Code Review.
Matheus Paixão, Anderson G. Uchôa, Ana Carla Bibiano, Daniel Oliveira, Alessandro Garcia, Jens Krinke, Emilio Arvonio
2020Beyond the Code: Mining Self-Admitted Technical Debt in Issue Tracker Systems.
Laerte Xavier, Fabio Ferreira, Rodrigo Brito, Marco Túlio Valente
2020Boa Views: Easy Modularization and Sharing of MSR Analyses.
Che Shian Hung, Robert Dyer
2020Can We Use SE-specific Sentiment Analysis Tools in a Cross-Platform Setting?
Nicole Novielli, Fabio Calefato, Davide Dongiovanni, Daniela Girardi, Filippo Lanubile
2020Capture the Feature Flag: Detecting Feature Flags in Open-Source.
Jens Meinicke, Juan Hoyos, Bogdan Vasilescu, Christian Kästner
2020Challenges in Chatbot Development: A Study of Stack Overflow Posts.
Ahmad Abdellatif, Diego Costa, Khaled Badran, Rabe Abdalkareem, Emad Shihab
2020Characterizing and Identifying Composite Refactorings: Concepts, Heuristics and Patterns.
Leonardo da Silva Sousa, Diego Cedrim, Alessandro Garcia, Willian Nalepa Oizumi, Ana Carla Bibiano, Daniel Oliveira, Miryung Kim, Anderson Oliveira
2020Cheating Death: A Statistical Survival Analysis of Publicly Available Python Projects.
Rao Hamza Ali, Chelsea Parlett-Pelleriti, Erik Linstead
2020Dataset of Video Game Development Problems.
Cristiano Politowski, Fábio Petrillo, Gabriel Cavalheiro Ullmann, Josias de Andrade Werly, Yann-Gaël Guéhéneuc
2020Detecting Video Game-Specific Bad Smells in Unity Projects.
Antonio Borrelli, Vittoria Nardone, Giuseppe A. Di Lucca, Gerardo Canfora, Massimiliano Di Penta
2020Detecting and Characterizing Bots that Commit Code.
Tapajit Dey, Sara Mousavi, Eduardo Ponce, Tanner Fry, Bogdan Vasilescu, Anna Filippova, Audris Mockus
2020Determining the Intrinsic Structure of Public Software Development History.
Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli
2020Developer-Driven Code Smell Prioritization.
Fabiano Pecorelli, Fabio Palomba, Foutse Khomh, Andrea De Lucia
2020Did You Remember To Test Your Tokens?
Danielle Gonzalez, Michael Rath, Mehdi Mirakhorli
2020Do Explicit Review Strategies Improve Code Review Performance?
Pavlína Wurzel Gonçalves, Enrico Fregnan, Tobias Baum, Kurt Schneider, Alberto Bacchelli
2020Embedding Java Classes with code2vec: Improvements from Variable Obfuscation.
Rhys Compton, Eibe Frank, Panos Patros, Abigail M. Y. Koay
2020Empirical Study of Restarted and Flaky Builds on Travis CI.
Thomas Durieux, Claire Le Goues, Michael Hilton, Rui Abreu
2020Employing Contribution and Quality Metrics for Quantifying the Software Development Process.
Themistoklis Diamantopoulos, Michail D. Papamichail, Thomas Karanikiotis, Kyriakos C. Chatzidimitriou, Andreas L. Symeonidis
2020Ethical Mining: A Case Study on MSR Mining Challenges.
Nicolas E. Gold, Jens Krinke
2020Exploring the Security Awareness of the Python and JavaScript Open Source Communities.
Gábor Antal, Márton Keleti, Péter Hegedüs
2020Forking Without Clicking: on How to Identify Software Repository Forks.
Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli
2020From Innovations to Prospects: What Is Hidden Behind Cryptocurrencies?
Ang Jia, Ming Fan, Xi Xu, Di Cui, Wenying Wei, Zijiang Yang, Kai Ye, Ting Liu
2020GitterCom: A Dataset of Open Source Developer Communications in Gitter.
Esteban Parra, Ashley Ellis, Sonia Haiduc
2020Hall-of-Apps: The Top Android Apps Metadata Archive.
Laura Bello-Jiménez, Camilo Escobar-Velásquez, Anamaria Mojica-Hanke, Santiago Cortés-Fernández, Mario Linares-Vásquez
2020How Often Do Single-Statement Bugs Occur?: The ManySStuBs4J Dataset.
Rafael-Michael Karampatsis, Charles Sutton
2020Improved Automatic Summarization of Subroutines via Attention to File Context.
Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan
2020Investigating Severity Thresholds for Test Smells.
Davide Spadini, Martin Schvarcbacher, Ana-Maria Oprescu, Magiel Bruntink, Alberto Bacchelli
2020JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing.
Federico Corò, Roberto Verdecchia, Emilio Cruciani, Breno Miranda, Antonia Bertolino
2020Large-Scale Manual Validation of Bugfixing Changes.
Steffen Herbold, Alexander Trautsch, Benjamin Ledel
2020LogChunks: A Data Set for Build Log Analysis.
Carolin E. Brandt, Annibale Panichella, Andy Zaidman, Moritz Beller
2020MSR '20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020
Sunghun Kim, Georgios Gousios, Sarah Nadi, Joseph Hejderup
2020Multi-language Design Smells: A Backstage Perspective.
Mouna Abidi, Moses Openja, Foutse Khomh
2020Need for Tweet: How Open Source Developers Talk About Their GitHub Work on Twitter.
Hongbo Fang, Daniel Klug, Hemank Lamba, James D. Herbsleb, Bogdan Vasilescu
2020On the Prevalence, Impact, and Evolution of SQL Code Smells in Data-Intensive Systems.
Biruk Asmare Muse, Mohammad Masudur Rahman, Csaba Nagy, Anthony Cleve, Foutse Khomh, Giuliano Antoniol
2020On the Relationship between User Churn and Software Issues.
Omar El Zarif, Daniel Alencar da Costa, Safwat Hassan, Ying Zou
2020On the Shoulders of Giants: A New Dataset for Pull-based Development Research.
Xunhui Zhang, Ayushi Rastogi, Yue Yu
2020PUMiner: Mining Security Posts from Developer Question and Answer Websites with PU Learning.
Triet Huynh Minh Le, David Hin, Roland Croft, Muhammad Ali Babar
2020Painting Flowers: Reasons for Using Single-State State Machines in Model-Driven Engineering.
Nan Yang, Pieter J. L. Cuijpers, Ramon R. H. Schiffelers, Johan Lukkien, Alexander Serebrenik
2020Polyglot and Distributed Software Repository Mining with Crossflow.
Konstantinos Barmpis, Patrick Neubauer, Jonathan Co, Dimitris S. Kolovos, Nicholas Matragkas, Richard F. Paige
2020RTPTorrent: An Open-source Dataset for Evaluating Regression Test Prioritization.
Toni Mattis, Patrick Rein, Falco Dürsch, Robert Hirschfeld
2020SoftMon: A Tool to Compare Similar Open-source Software from a Performance Perspective.
Shubhankar Suman Singh, Smruti R. Sarangi
2020Software-related Slack Chats with Disentangled Conversations.
Preetha Chatterjee, Kostadin Damevski, Nicholas A. Kraft, Lori L. Pollock
2020TestRoutes: A Manually Curated Method Level Dataset for Test-to-Code Traceability.
András Kicsi, László Vidács, Tibor Gyimóthy
2020The Impact of Dynamics of Collaborative Software Engineering on Introverts: A Study Protocol.
Ingrid Nunes, Christoph Treude, Fabio Calefato
2020The Impact of a Major Security Event on an Open Source Project: The Case of OpenSSL.
James Walden
2020The Scent of Deep Learning Code: An Empirical Study.
Hadhemi Jebnoun, Houssem Ben Braiek, Mohammad Masudur Rahman, Foutse Khomh
2020The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History.
Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli
2020The State of the ML-universe: 10 Years of Artificial Intelligence & Machine Learning Software Development on GitHub.
Danielle Gonzalez, Thomas Zimmermann, Nachiappan Nagappan
2020Traceability Support for Multi-Lingual Software Projects.
Yalin Liu, Jinfeng Lin, Jane Cleland-Huang
2020Using Large-Scale Anomaly Detection on Code to Improve Kotlin Compiler.
Timofey Bryksin, Victor Petukhov, Ilya Alexin, Stanislav Prikhodko, Alexey Shpilman, Vladimir Kovalenko, Nikita Povarov
2020Using Others' Tests to Identify Breaking Updates.
Suhaib Mujahid, Rabe Abdalkareem, Emad Shihab, Shane McIntosh
2020Visualization of Methods Changeability Based on VCS Data.
Sergey Svitkov, Timofey Bryksin
2020What constitutes Software?: An Empirical, Descriptive Study of Artifacts.
Rolf-Helge Pfeiffer
2020What is the Vocabulary of Flaky Tests?
Gustavo Pinto, Breno Miranda, Supun Dissanayake, Marcelo d'Amorim, Christoph Treude, Antonia Bertolino