


default search action
18th MSR 2021: Madrid, Spain
- 18th IEEE/ACM International Conference on Mining Software Repositories, MSR 2021, Madrid, Spain, May 17-19, 2021. IEEE 2021, ISBN 978-1-7281-8710-5

Technical Papers
- Huy Tu, George Papadimitriou

, Mariam Kiran, Cong Wang, Anirban Mandal
, Ewa Deelman, Tim Menzies
:
Mining Workflows for Anomalous Data Transfers. 1-12 - Egor Spirin

, Egor Bogomolov
, Vladimir Kovalenko, Timofey Bryksin:
PSIMiner: A Tool for Mining Rich Abstract Syntax Trees from Code. 13-17 - Ruchika Malhotra, Ritvik Kapoor, Deepti Aggarwal, Priya Garg:

Comparative Study of Feature Reduction Techniques in Software Change Prediction. 18-28 - Sofonias Yitagesu

, Xiaowang Zhang
, Zhiyong Feng, Xiaohong Li, Zhenchang Xing:
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions. 29-40 - Rolf-Helge Pfeiffer:

Identifying Critical Projects via PageRank and Truck Factor. 41-45 - Md. Abdullah Al Alamin, Sanjay Malakar, Gias Uddin, Sadia Afroz, Tameem Bin Haider, Anindya Iqbal:

An Empirical Study of Developer Discussions on Low-Code Software Development Challenges. 46-57 - Hendrig Sellik, Onno van Paridon, Georgios Gousios, Maurício Aniche:

Learning Off-By-One Mistakes: An Empirical Study. 58-67 - Ahmed Imam, Tapajit Dey

, Alexander Nolte
, Audris Mockus
, James D. Herbsleb:
The Secret Life of Hackathon Code Where does it come from and where does it go? 68-79 - Christoph Gote, Christian Zingg

:
gambit - An Open Source Name Disambiguation Tool for Version Control Systems. 80-84 - Samuel W. Flint

, Jigyasa Chauhan, Robert Dyer
:
Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data. 85-96 - Jiayan Pei, Yimin Wu, Zishan Qin, Yao Cong, Jingtao Guan:

Attention-based model for predicting question relatedness on Stack Overflow. 97-107 - Matteo Ciniselli, Nathan Cooper, Luca Pascarella

, Denys Poshyvanyk
, Massimiliano Di Penta, Gabriele Bavota
:
An Empirical Study on the Usage of BERT Models for Code Completion. 108-119 - Quentin Fournier

, Daniel Aloise
, Seyed Vahid Azhari, François Tetreault:
On Improving Deep Learning Trace Analysis with System Call Arguments. 120-130 - Zhen Yu Ding, Claire Le Goues

:
An Empirical Study of OSS-Fuzz Bugs. 131-142 - Jeanderson Cândido, Jan Haesen, Maurício Aniche, Arie van Deursen

:
An Exploratory Study of Log Placement Recommendation in an Enterprise System. 143-154 - Sina Gholamian, Paul A. S. Ward:

On the Naturalness and Localness of Software Logs. 155-166 - Mia Mohammad Imran

, Agnieszka Ciborowska, Kostadin Damevski
:
Automatically Selecting Follow-up Questions for Deficient Bug Reports. 167-178 - Alexandra-Maria Chaniotaki, Tushar Sharma

:
Architecture Smells and Pareto Principle: A Preliminary Empirical Exploration. 190-194 - Zadia Codabux, Melina C. Vidoni

, Fatemeh H. Fard
:
Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study. 195-206 - Diego Marcilio, Carlo A. Furia:

How Java Programmers Test Exceptional Behavior. 207-218 - Guillaume Haben

, Sarra Habchi, Mike Papadakis
, Maxime Cordy, Yves Le Traon
:
A Replication Study on the Usability of Code Vocabulary in Predicting Flaky Tests. 219-229 - Golnaz Gharachorlu, Nick Sumner

:
Leveraging Models to Reduce Test Cases in Software Repositories. 230-241 - Jean-Gabriel Young, Amanda Casari

, Katie McLaughlin
, Milo Z. Trujillo
, Laurent Hébert-Dufresne, James P. Bagrow:
Which contributions count? Analysis of attribution in open source. 242-253 - Mahmoud Alfadel, Diego Elias Costa

, Emad Shihab, Mouafak Mkhallalati:
On the Use of Dependabot Security Pull Requests. 254-265 - Aleksandr Khvorov, Roman Vasiliev, George A. Chernishev, Irving Muller Rodrigues, Dmitrij V. Koznov, Nikita Povarov:

S3M: Siamese Stack (Trace) Similarity Measure. 266-270 - Gian Luca Scoccia, Patrizio Migliarini

, Marco Autili
:
Challenges in Developing Desktop Web Apps: a Study of Stack Overflow and GitHub. 271-282 - Saraj Singh Manes, Olga Baysal:

Studying the Change Histories of Stack Overflow and GitHub Snippets. 283-294 - Nikolai Sviridov, Mikhail Evtikhiev, Vladimir Kovalenko:

TNM: A Tool for Mining of Socio-Technical Data from Git Repositories. 295-299 - Ivano Malavolta

, Katerina Chinnappan, Stan Swanborn, Grace A. Lewis
, Patricia Lago:
Mining the ROS ecosystem for Green Architectural Tactics in Robotics and an Empirical Evaluation. 300-311 - Andreas Schuler, Gabriele Kotsis

:
Mining API Interactions to Analyze Software Revisions for the Evolution of Energy Consumption. 312-316 - André C. Hora:

Googling for Software Development: What Developers Search For and What They Find. 317-328 - Alexey Svyatkovskiy, Sebastian Lee, Anna Hadjitofi

, Maik Riechert, Juliana Vicente Franco, Miltiadis Allamanis:
Fast and Memory-Efficient Neural Code Completion. 329-340 - Ahmed Zerouali, Camilo Velázquez-Rodríguez

, Coen De Roover
:
Identifying Versions of Libraries used in Stack Overflow Code Snippets. 341-345 - Fabio Santos, Igor Wiese, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma, Marco Aurélio Gerosa:

Can I Solve It? Identifying APIs Required to Complete OSS Tasks. 346-257 - Murali Sridharan, Mika Mäntylä, Leevi Rantala, Maëlick Claes:

Data Balancing Improves Self-Admitted Technical Debt Detection. 358-368 - Chanathip Pornprasit, Chakkrit Tantithamthavorn:

JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction. 369-379 - Saikat Mondal, Gias Uddin, Chanchal K. Roy:

Rollback Edit Inconsistencies in Developer Forum. 380-391 - André C. Hora:

What Code Is Deliberately Excluded from Test Coverage and Why? 392-402 - Gianmarco Fucci, Nathan Cassee, Fiorella Zampetti, Nicole Novielli, Alexander Serebrenik, Massimiliano Di Penta:

Waiting around or job half-done? Sentiment in self-admitted technical debt. 403-414 - Maria Papoutsoglou, Johannes Wachs

, Georgia M. Kapitsaki:
Mining DEV for social and technical insights about software development. 415-419 - Timothy Kinsman, Mairieli Santos Wessel, Marco Aurélio Gerosa, Christoph Treude

:
How Do Software Developers Use GitHub Actions to Automate Their Workflows? 420-431 - Jirayus Jiarpakdee, Chakkrit Tantithamthavorn, John C. Grundy:

Practitioners' Perceptions of the Goals and Visual Explanations of Defect Prediction Models. 432-443 - Panyawut Sri-Iesaranusorn

, Raula Gaikovina Kula
, Takashi Ishio
:
Does Code Review Promote Conformance? A Study of OpenStack Patches. 444-448 - Kalvin Eng, Abram Hindle:

Revisiting Dockerfiles in Open Source Software Over Time. 449-459 - Mahfouth Alghamdi, Shinpei Hayashi, Takashi Kobayashi, Christoph Treude

:
Characterising the Knowledge about Primitive Variables in Java Code Comments. 460-470 - Anderson G. Uchôa

, Caio Barbosa, Daniel Coutinho
, Willian Nalepa Oizumi
, Wesley K. G. Assunção
, Silvia Regina Vergilio, Juliana Alves Pereira, Anderson Oliveira, Alessandro F. Garcia:
Predicting Design Impactful Changes in Modern Code Review: A Large-Scale Empirical Study. 471-482 - Michel Albonico, Ivano Malavolta

, Gustavo Pinto, Emitza Guzman, Katerina Chinnappan, Patricia Lago:
Mining Energy-Related Practices in Robotics Software. 483-494
MSR Challenge
- Balázs Mosolygó, Norbert Vándor, Gábor Antal, Péter Hegedüs:

On the Rise and Fall of Simple Stupid Bugs: a Life-Cycle Analysis of SStuBs. 495-499 - Jasmine Latendresse, Rabe Abdalkareem

, Diego Elias Costa
, Emad Shihab:
How Effective is Continuous Integration in Indicating Single-Statement Bugs? 500-504 - Ehsan Mashhadi, Hadi Hemmati:

Applying CodeBERT for Automated Program Repair of Java Simple Bugs. 505-509 - Fernanda Madeiral

, Thomas Durieux
:
A large-scale study on human-cloned changes for automated program repair. 510-514 - Wenhan Zhu, Michael W. Godfrey:

Mea culpa: How developers fix their own simple bugs differently from other developers. 515-519 - Arthur V. Kamienski, Luisa Palechor, Cor-Paul Bezemer

, Abram Hindle:
PySStuBs: Characterizing Single-Statement Bugs in Popular Open-Source Python Projects. 520-524 - Anthony Peruma, Christian D. Newman:

On the Distribution of "Simple Stupid Bugs" in Unit Test Files: An Exploratory Study. 525-529 - Jiayi Hua, Haoyu Wang:

On the Effectiveness of Deep Vulnerability Detectors to Simple Stupid Bug Detection. 530-534
MSR Data
- Sebastian Nielebock, Paul Blockhaus, Jacob Krüger, Frank Ortmeier:

AndroidCompass: A Dataset of Android Compatibility Checks in Code Repositories. 535-539 - Misoo Kim

, Youngkyoung Kim
, Eunseok Lee
:
Denchmark: A Bug Benchmark of Deep Learning-related Software. 540-544 - Thomas Durieux

, César Soto-Valero, Benoit Baudry:
Duets: A Dataset of Reproducible Pairs of Java Library-Clients. 545-549 - Luigi Quaranta

, Fabio Calefato, Filippo Lanubile:
KGTorrent: A Dataset of Python Jupyter Notebooks from Kaggle. 550-554 - Mouna Hammoudi, Christoph Mayr-Dorn, Atif Mashkoor, Alexander Egyed:

A Traceability Dataset for Open Source Systems. 555-559 - Ozren Dabic

, Emad Aghajani, Gabriele Bavota
:
Sampling Projects in GitHub for MSR Studies. 560-564 - Nafise Eskandani, Guido Salvaneschi

:
The Wonderless Dataset for Serverless Computing. 565-569 - Wen Li, Xiaoqin Fu, Haipeng Cai

:
AndroCT: Ten Years of App Call Traces in Android. 570-574 - Nikitha Rao, Chetan Bansal, Joe Guan:

Search4Code: Code Search Intent Classification Using Weak Supervision. 575-579 - Ruben Opdebeeck

, Ahmed Zerouali, Coen De Roover
:
Andromeda: A Dataset of Ansible Galaxy Roles and Their Evolution. 580-584 - Amir M. Mir

, Evaldas Latoskinas, Georgios Gousios:
ManyTypes4Py: A Benchmark Python Dataset for Machine Learning-based Type Inference. 585-589 - Tushar Sharma

, Marouane Kessentini:
QScored: A Large Dataset of Code Smells and Quality Metrics. 590-594 - Likang Yin, Zhiyuan Zhang, Qi Xuan, Vladimir Filkov:

Apache Software Foundation Incubator Project Sustainability Dataset. 595-599 - Tyler Wendland, Jingyang Sun, Junayed Mahmud, S. M. Hasan Mansur, Steven Huang, Kevin Moran, Julia Rubin, Mattia Fazzini

:
Andror2: A Dataset of Manually-Reproduced Bug Reports for Android apps. 600-604 - Dheeraj Vagavolu, Vartika Agrahari, Sridhar Chimalakonda, Akhila Sri Manasa Venigalla:

GE526: A Dataset of Open-Source Game Engines. 605-609 - Sahar Badihi, Yi Li, Julia Rubin:

EqBench: A Dataset of Equivalent and Non-equivalent Program Pairs. 610-614
MSR Data Hackathon
- Ahmed Imam, Tapajit Dey

:
Tracking Hackathon Code Creation and Reuse. 615-617 - Elena Lyulina, Mahmoud Jahanshahi

:
Building the Collaboration Graph of Open-Source Software Ecosystem. 618-620 - David Reid, Kalvin Eng, Chris Bogart

, Adam Tutko:
Tracing Vulnerable Code Lineage. 621-623 - James Walden

, Noah Burgin, Kuljit Kaur:
An Exploratory Study of Project Activity Changepoints in Open Source Software Evolution. 624-626 - Mengchen Sam Yong, Lavínia Paganini

, Huilian Sophie Qiu
, José Bayoán Santiago Calderón:
The Diversity-Innovation Paradox in Open-Source Software. 627-629

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














