Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study (MSR 2021 - Technical Papers)

Who

Zadia Codabux, Melina Vidoni, Fatemeh Hendijani Fard

Track

MSR 2021 Technical Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 18 May 2021 03:11 - 03:15 at MSR Room 1 - Technical Debt and Smells Chair(s): Gema Rodríguez-Pérez

Abstract

Context: Technical Debt (TD) is a metaphor used to describe code that is ``not quite right." Although TD studies have gained momentum in the last decade, TD has yet to be studied as thoroughly in non-object-oriented or scientific software such as R. The latter is a multi-paradigm, package-based programming language, whose popularity in data science and statistical applications has amplified in recent years. Due to R’s inherent ability to expand through user-contributed packages, several community-led organizations were created to organize and peer-review packages in a concerted effort to increase their quality. Nonetheless, it is well-known that most R users do not have a technical programming background, being from multiple disciplines. Objective: The goal of this study is to investigate TD in the documentation of the peer-review of R packages led by rOpenSci. Method: We collected over 5,000 comments from 157 packages that had been reviewed and approved to be published at rOpenSci. We manually analyzed a sample dataset of these comments posted by package authors, editors of rOpenSci, and reviewers during the review process to investigate the types of TD present in these reviews. Results: The findings of our study include (i) a taxonomy of TD derived from our analysis of the peer-reviews (ii) documentation debt as being the most prevalent type of debt (iii) different user roles are concerned with different types of TD. For instance, reviewers tend to report some types of TD more than other roles, and the types of TD they report are different from those reported by the authors of a package. Conclusion: TD analysis in scientific software or peer-review is almost non-existent. Our study is a pioneer, but within the context of R packages. However, our findings can serve as a starting point for replication studies, given our public datasets, to perform similar analyses in other scientific software or to investigate the rationale behind our findings.

Link to Preprint

https://arxiv.org/pdf/2103.09340.pdf

Zadia Codabux

University of Saskatchewan

Melina Vidoni

RMIT University

Australia

Fatemeh Hendijani Fard

University of British Columbia

Canada

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 18 May
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

03:10 - 04:00	Technical Debt and SmellsTechnical Papers / Data Showcase at MSR Room 1 Chair(s): Gema Rodríguez-Pérez University of Waterloo

03:11 4m Talk		Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study Technical Papers Zadia Codabux University of Saskatchewan, Melina Vidoni RMIT University, Fatemeh Hendijani Fard University of British Columbia Pre-print
03:15 3m Talk		QScored: A Large Dataset of Code Smells and Quality Metrics Data Showcase Tushar Sharma Siemens Research, Marouane Kessentini University of Michigan Pre-print
03:18 3m Talk		Architecture Smells and Pareto Principle: A Preliminary Empirical Exploration Technical Papers Alexandra-Maria Chaniotaki , Tushar Sharma Siemens Research Pre-print
03:21 4m Talk		Self-Admitted Technical Debt in R Packages: An Exploratory Study Technical Papers Melina Vidoni RMIT University Pre-print
03:25 4m Full-paper		An Empirical Study of Developer Discussions on Low Code Software Development Challenges Technical Papers Md Abdullah Al Alamin University of Calgary, Sanjay Malakar Bangladesh University of Engineering and Technology, Gias Uddin University of Calgary, Canada, Sadia Afroz Bangladesh University of Engineering and Technology, Tameem Bin Haider Bangladesh University of Engineering and Technology, Anindya Iqbal Bangladesh University of Engineering and Technology Dhaka, Bangladesh Pre-print
03:29 31m Live Q&A		Discussions and Q&A Technical Papers

Information for Participants

Tue 18 May 2021 03:10 - 04:00 at MSR Room 1 - Technical Debt and Smells Chair(s): Gema Rodríguez-Pérez

Info for room MSR Room 1:

Go directly to this room on Clowdr