MSR 2021
Mon 17 - Wed 19 May 2021
co-located with ICSE 2021
Mon 17 May 2021 10:16 - 10:19 at MSR Room 1 - Resources for MSR Research Chair(s): Felipe Ebert

Name disambiguation is a complex but highly relevant challenge whenever analysing real-world user data, such as data from version control systems. We propose gambit, a rule-based disambiguation tool that only relies on name and email information. We evaluate its performance against two commonly used algorithms with similar characteristics, on manually disambiguated ground-truth data from the Gnome GTK project. Our results show that gambit significantly outperforms both algorithms in terms of precision as well as F1 score.
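To make the reported metrics concrete, the following is a minimal sketch of how precision, recall, and F1 could be computed for a disambiguation result against ground truth. It assumes a pairwise evaluation (two aliases count as a match when they are assigned the same identity); the abstract does not state which evaluation scheme was used. The function names, alias keys, and identity labels are hypothetical and are not taken from gambit.

```python
from itertools import combinations


def same_identity_pairs(labels):
    """Return all alias pairs that are assigned the same identity label."""
    return {
        (a, b)
        for a, b in combinations(sorted(labels), 2)
        if labels[a] == labels[b]
    }


def pairwise_scores(truth, predicted):
    """Compute pairwise precision, recall, and F1 (assumed evaluation scheme)."""
    true_pairs = same_identity_pairs(truth)
    pred_pairs = same_identity_pairs(predicted)
    true_positives = len(true_pairs & pred_pairs)

    precision = true_positives / len(pred_pairs) if pred_pairs else 0.0
    recall = true_positives / len(true_pairs) if true_pairs else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical data: four aliases, two real identities (0 and 1).
truth = {"alias1": 0, "alias2": 0, "alias3": 1, "alias4": 1}
# A hypothetical tool output that wrongly merges alias3 into identity 0.
predicted = {"alias1": 0, "alias2": 0, "alias3": 0, "alias4": 1}

precision, recall, f1 = pairwise_scores(truth, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this example, the wrong merge creates two false-positive pairs and loses one true pair, so precision falls to 1/3 and recall to 1/2. This illustrates why precision is reported separately from F1 when comparing disambiguation tools.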

Conference Day
Mon 17 May

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna

10:00 - 10:50
Resources for MSR Research (Technical Papers / Data Showcase) at MSR Room 1
Chair(s): Felipe Ebert (Eindhoven University of Technology)
10:01
3m
Talk
PSIMiner: A Tool for Mining Rich Abstract Syntax Trees from Code
Technical Papers
Egor Spirin (JetBrains Research; National Research University Higher School of Economics), Egor Bogomolov (JetBrains Research), Vladimir Kovalenko (JetBrains Research), Timofey Bryksin (JetBrains Research, Saint Petersburg State University)
Pre-print
10:04
3m
Talk
Mining DEV for social and technical insights about software development
Technical Papers
Maria Papoutsoglou (Aristotle University of Thessaloniki), Johannes Wachs (Vienna University of Economics and Business & Complexity Science Hub Vienna), Georgia Kapitsaki (University of Cyprus)
Pre-print
10:07
3m
Talk
TNM: A Tool for Mining of Socio-Technical Data from Git Repositories
Technical Papers
Nikolai Sviridov (ITMO University), Mikhail Evtikhiev (JetBrains Research), Vladimir Kovalenko (JetBrains Research)
Pre-print
10:10
3m
Talk
Identifying Versions of Libraries used in Stack Overflow Code Snippets
Technical Papers
Ahmed Zerouali (Vrije Universiteit Brussel), Camilo Velázquez-Rodríguez (Vrije Universiteit Brussel), Coen De Roover (Vrije Universiteit Brussel)
Pre-print Media Attached
10:13
3m
Talk
Sampling Projects in GitHub for MSR Studies
Data Showcase
Ozren Dabic (Software Institute, Università della Svizzera italiana (USI), Switzerland), Emad Aghajani (Software Institute, USI Università della Svizzera italiana), Gabriele Bavota (Software Institute, USI Università della Svizzera italiana)
Pre-print
10:16
3m
Talk
gambit – An Open Source Name Disambiguation Tool for Version Control Systems
Technical Papers
Christoph Gote (Chair of Systems Design, ETH Zurich), Christian Zingg (Chair of Systems Design, ETH Zurich)
Pre-print Media Attached
10:19
31m
Live Q&A
Discussions and Q&A
Technical Papers

