%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
MSR 2016 data showcase - Mining Code Review Repositories
1. Mining the Modern Code Review Repositories:
A Dataset of People, Process and Product
Xin Yang Raula G. Kula Norihiro Yoshida Hajimu Iida
May 14–15, 2016. Austin, Texas
MSR 2016 data showcase
Osaka University
Japan
Nagoya University
Japan
NAIST
Japan
NAIST
Japan
2. A Code Review Dataset
1
Code Review
Source Code
Human / Social
(anonymized usernames and email addresses)
3. Why we made this dataset?
2
*
Hamasaki et al., “Who does what during a code review? datasets of OSS peer review
repositories”. MSR '13
Our previous work
(Hamasaki et al. MSR '13)*
4. Our previous work
(Hamasaki et al. MSR '13)*
Why we made this dataset?
2
Some feedback:
“Hard to query...”
“Hard to convert...”
“Unable to access the source
code...”
*
Hamasaki et al., “Who does what during a code review? datasets of OSS peer review
repositories”. MSR '13
5. Our previous work
(Hamasaki et al. MSR '13)*
Why we made this dataset?
2
Some feedback:
“Hard to query...”
“Hard to convert...”
“Unable to access the source
code...”
*
Hamasaki et al., “Who does what during a code review? datasets of OSS peer review
repositories”. MSR '13
6. *
Hamasaki et al., “Who does what during a code review? datasets of OSS peer review
repositories”. MSR '13
Our previous work
(Hamasaki et al. MSR '13)*
Some feedback:
“Hard to query...”
“Hard to convert...”
“Unable to access the source
code...”
Why we made this dataset?
2
★ Easy to query / analyze
★ Easy to export / convert
★ Able to access the source code
7. 3
Modern Code Review (MCR) Key Attributes
Large
Codebases
(Repositories)
High Volume
Submissions
(Patches)
Large
Communities
(Participants)
</></></></></></>
</></></>
10. 4 years 3 years 7 years 4 years 3 years
611 20 567 111 189
173,749 13,597 63,610 110,172 9,168
5,091 437 3,334 1,437 759
Dataset Statistics (updated to May 2015)
5
</></></>
12. ★ Promote peer review research and link to
other research topics
★ Encourage researchers to use this as a
benchmark of techniques and different
approaches
7
Our Goals