Mining the Modern Code Review Repositories: A Dataset of People, Process and Product
Xin Yang, Raula G. Kula, Norihiro Yoshida, Hajimu Iida
Osaka University, Japan; Nagoya University, Japan; NAIST, Japan
MSR 2016 data showcase, May 14–15, 2016, Austin, Texas
A Code Review Dataset
★ Code Review
★ Source Code
★ Human / Social (anonymized usernames and email addresses)
Why did we make this dataset?
Our previous work (Hamasaki et al., MSR '13)*
Some feedback:
“Hard to query...”
“Hard to convert...”
“Unable to access the source code...”
★ Easy to query / analyze
★ Easy to export / convert
★ Able to access the source code
* Hamasaki et al., “Who does what during a code review? Datasets of OSS peer review repositories”. MSR '13
Modern Code Review (MCR) Key Attributes
★ Large codebases (repositories)
★ High-volume submissions (patches)
★ Large communities (participants)
The Concept
★ People
★ Process
★ Product
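Dataset Statistics (updated to May 2015)
One column per project (five projects); row labels follow the MCR key attributes.
Time span:              4 years | 3 years | 7 years | 4 years | 3 years
Repositories:           611 | 20 | 567 | 111 | 189
Submissions (patches):  173,749 | 13,597 | 63,610 | 110,172 | 9,168
Participants:           5,091 | 437 | 3,334 | 1,437 | 759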
Dataset Schema (check our wiki for details)
Our Goals
★ Promote peer review research and link it to other research topics
★ Encourage researchers to use this dataset as a benchmark for techniques and different approaches
Get your copy now: goo.gl/Wi4UoJ
Thanks!
Any questions?
Contact: Xin Yang
kin-y@is.naist.jp
@seeleather
