SlideShare uma empresa Scribd logo
1 de 21
Towards Mining Software Repositories
Research that Matters
Tao Xie
Department of Computer Science
University of Illinois at Urbana-Champaign, USA
taoxie@illinois.edu
Machine Learning that Matters
“The basic argument in her paper is that machine learning
might be in danger of losing its impact because the
community as a whole has become quite self-referential.
People are probably solving real-world problems using ML
methods, but there is little sharing of these results within
the community. Instead, people focus on existing
benchmarks which might have originally had some
connection to real-world problems which has been long
forgotten, however.”
“She proposes a number of tasks like $100M solved
through ML based decision making or a human life saved
through a diagnosis or an intervention recommended by
an ML system to get ML back on track.”
ICML’12
http://icml.cc/2012/papers/298.pdf
http://blog.mikiobraun.de/2012/06/is-machine-learning-losing-impact.html
Redwine and Riddle Study (1985)
• From idea to “the point it can be popularized and
disseminated to the technical community at
large”
– Worst case: 23 years
– Best case: 11 years
– Mean: 17 years
• 7.5 years from developed technology to wide
availability
Source©S. L. Pfleeger
Sam Redwine Jr., William Riddle: Software Technology Maturation, In Proc. ICSE 1985.
Technology Maturation: Middleware
Source©A. Wolfhttp://www.sigsoft.org/impact/docs/ImpactWolfBCS2008.pdf
15-20 years between first
publication of an idea and
widespread availability in
products
Technology Maturation: Middleware
Source©A.http://www.sigsoft.org/impact/docs/ImpactWolfBCS2008.pdf
15-20 years between first
publication of an idea and
widespread availability in
products
Shall we just stay in our comfort
zone to wait for 15-20 years for
our research to (or not to)
produce practice impact??
How about the research that we
did 15-20 years ago??
[Caveat: don’t forget the need of
long-term/blue-sky research!!]
2012 NSF Workshop on Formal Methods
• Goal: to identify the future directions in research in
formal methods and its transition to industrial
practice.
• Success examples mentioned by the attendees
– SLAM/SDV
– ASTREE
– SMT-based tools
– …
http://goto.ucsd.edu/~rjhala/NSFWorkshop/
“What Happened to the Promise
of Software Tools?” – Jim Larus
http://www.srl.inf.ethz.ch/workshop2014/eth-larus.pdf
https://www.youtube.com/watch?v=kO9OYnkeRTM
Impacts, Impacts, Impacts, …
Image source: http://engage.synecoretech.com/marketing-technology-for-growth/bid/155279/How-Online-Content-Impacts-Your-Social-Media-Marketing-Strategy
Research Impacts
99319
22786
32987
Research Impacts
SIGSOFT Impact Paper Awards, ICSE MIP awards, …
…
Practice Impacts ACM Software System Awards
31 Awardees
http://awards.acm.org/software_system/
Practice Impacts ACM Software System Awards
• Development Environments/Tools
– 2013: Coq
– 2012: LLVM
– 2011: Eclipse
– 2007: Statemate
– 2006: Eiffel
– 2005: The Boyer-Moore Theorem Prover (ACL2)
– 2003: MAKE
– 2001: SPIN
– 1992: Interlisp
• Languages
– 2002: Java
– 1998: The S System (R statistical analysis)
– 1997: Tcl/Tk
– 1987: SMALLTALK
2012 LLVM born at Illinois
• The openness of the LLVM technology and the quality of its
architecture and engineering design are key factors in
understanding the success it has had both in academia and
industry
Vikram Adve Chris Lattner Evan Cheng
http://llvm.org/
Practice Impacts commercialization/industrial adoption
…
SAGE
ASTRÉE
Statechart
SPIN
Moles
Microsoft Research
…
…
Practice Impacts
research publications  industrial adoption done by others
…
• ICSE 00 Daikon paper by Ernst et al.  Agitar Agitator
– https://homes.cs.washington.edu/~mernst/pubs/invariants-relevance-icse2000.pdf
• ASE 04 Rostra paper by Xie et al.  Parasoft Jtest improvement
– http://web.engr.illinois.edu/~taoxie/publications/ase04.pdf
• PLDI/FSE 05 DART/CUTE papers by Sen et al.  MSR SAGE, Pex
– http://srl.cs.berkeley.edu/~ksen/papers/dart.pdf
– http://srl.cs.berkeley.edu/~ksen/papers/C159-sen.pdf
HOW???
• Are these impact goals too far from you?
• Can you plan for that?
• What if you are in a university research
group?
• …
(How) Can A University Group Do It?
• Aim for research impacts more commonly
– but sometimes/often may not be predicted well,
e.g., Whyper [USENIX SEC 13] http://web.engr.illinois.edu/~taoxie/publications/usenixsec13-whyper.pdf
• Start a startup
– but desirable to have right people (e.g., former students) to start
– but software engineering tools may not sell crazily
• Collaborate with industrial research labs
– but many research lab projects may look like univ. projects
• Collaborate with industrial product groups
– but many probs faced by product groups may not be “researchy”
• At least focus on problems that matter (now or future)!
(How) Can A University Group Do It?
• Need to balance/unify producing great
students vs./and great (high practice-impact)
research
http://www.cs.washington.edu/people/faculty/notkin/students
conts.
Experience Reports on Successful Tool Transfer
• Nikolai Tillmann, Jonathan de Halleux, and Tao Xie. Transferring an Automated Test
Generation Tool to Practice: From Pex to Fakes and Code Digger. In Proceedings of ASE
2014, Experience Papers. http://web.engr.illinois.edu/~taoxie/publications/ase14-
pexexperiences.pdf
• Jian-Guang Lou, Qingwei Lin, Rui Ding, Qiang Fu, Dongmei Zhang, and Tao Xie. Software
Analytics for Incident Management of Online Services: An Experience Report. In
Proceedings ASE 2013, Experience Paper.
http://web.engr.illinois.edu/~taoxie/publications/ase13-sas.pdf
• Dongmei Zhang, Shi Han, Yingnong Dang, Jian-Guang Lou, Haidong Zhang, and Tao Xie.
Software Analytics in Practice. IEEE Software, Special Issue on the Many Faces of Software
Analytics, 2013. http://web.engr.illinois.edu/~taoxie/publications/ieeesoft13-softanalytics.pdf
• Yingnong Dang, Dongmei Zhang, Song Ge, Chengyun Chu, Yingjun Qiu, and Tao Xie. XIAO:
Tuning Code Clones at Hands of Engineers in Practice. In Proceedings of ACSAC 2012.
http://web.engr.illinois.edu/~taoxie/publications/acsac12-xiao.pdf
Q & A
http://www.cs.illinois.edu/homes/taoxie/
Contact: taoxie@illinois.edu
Supported in part by a Microsoft Research Award, NSF grants CCF-1349666, CNS-1434582, CCF-1434596, CCF-
1434590, CNS-1439481, and the USA National Security Agency (NSA) Science of Security Lablet.
Discussion
Discussion Topics: HOW???
• Are these impact goals too far from you?
• Can you plan for that?
• What if you are in a university research
group?
• …

Mais conteúdo relacionado

Mais procurados

Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSS
webuploader
 
Using electronic laboratory notebooks in the academic life sciences: a group ...
Using electronic laboratory notebooks in the academic life sciences: a group ...Using electronic laboratory notebooks in the academic life sciences: a group ...
Using electronic laboratory notebooks in the academic life sciences: a group ...
SC CTSI at USC and CHLA
 
Agile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & OftenAgile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & Often
David Rico
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
dgarijo
 

Mais procurados (20)

Software Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and SecuritySoftware Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and Security
 
DSML 2021 Keynote: Intelligent Software Engineering: Working at the Intersect...
DSML 2021 Keynote: Intelligent Software Engineering: Working at the Intersect...DSML 2021 Keynote: Intelligent Software Engineering: Working at the Intersect...
DSML 2021 Keynote: Intelligent Software Engineering: Working at the Intersect...
 
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
SETTA'18 Keynote: Intelligent Software Engineering: Synergy between AI and So...
 
Transferring Software Testing Tools to Practice (AST 2017 Keynote)
Transferring Software Testing Tools to Practice (AST 2017 Keynote)Transferring Software Testing Tools to Practice (AST 2017 Keynote)
Transferring Software Testing Tools to Practice (AST 2017 Keynote)
 
"Writing a Story" Electronic Laboratory Notebooks - ORC Seminar Univeristy ...
"Writing a Story"   Electronic Laboratory Notebooks - ORC Seminar Univeristy ..."Writing a Story"   Electronic Laboratory Notebooks - ORC Seminar Univeristy ...
"Writing a Story" Electronic Laboratory Notebooks - ORC Seminar Univeristy ...
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Most important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebookMost important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebook
 
Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSS
 
Using electronic laboratory notebooks in the academic life sciences: a group ...
Using electronic laboratory notebooks in the academic life sciences: a group ...Using electronic laboratory notebooks in the academic life sciences: a group ...
Using electronic laboratory notebooks in the academic life sciences: a group ...
 
2010 ICGSE - Challenges and Solutions in Distributed Software Development Pro...
2010 ICGSE - Challenges and Solutions in Distributed Software Development Pro...2010 ICGSE - Challenges and Solutions in Distributed Software Development Pro...
2010 ICGSE - Challenges and Solutions in Distributed Software Development Pro...
 
The Curious Case of Fuzzing for Automated Software Testing
The Curious Case of Fuzzing for Automated Software TestingThe Curious Case of Fuzzing for Automated Software Testing
The Curious Case of Fuzzing for Automated Software Testing
 
Customer Success Story: IEEE Provides Ongoing Education
Customer Success Story: IEEE Provides Ongoing EducationCustomer Success Story: IEEE Provides Ongoing Education
Customer Success Story: IEEE Provides Ongoing Education
 
Customer Success Story: IEEE Xplore Saves Time
Customer Success Story: IEEE Xplore Saves TimeCustomer Success Story: IEEE Xplore Saves Time
Customer Success Story: IEEE Xplore Saves Time
 
Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?Scientific Software - what happens after the grant?
Scientific Software - what happens after the grant?
 
Agile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & OftenAgile Methods Cost of Quality: Benefits of Testing Early & Often
Agile Methods Cost of Quality: Benefits of Testing Early & Often
 
Human-centric Software Development Tools
Human-centric Software Development ToolsHuman-centric Software Development Tools
Human-centric Software Development Tools
 
Fuzzing: Challenges and Reflections
Fuzzing: Challenges and ReflectionsFuzzing: Challenges and Reflections
Fuzzing: Challenges and Reflections
 
IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)IEEE augmented reality learning experience model (ARLEM)
IEEE augmented reality learning experience model (ARLEM)
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
 
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
IEEE p1589 'ARLEM' virtual meeting, September 9, 2015
 

Semelhante a Towards Mining Software Repositories Research that Matters

Inuse seminar Nov 20, 2012 Salovaara
Inuse seminar Nov 20, 2012 SalovaaraInuse seminar Nov 20, 2012 Salovaara
Inuse seminar Nov 20, 2012 Salovaara
inuseproject
 

Semelhante a Towards Mining Software Repositories Research that Matters (20)

Storyboard moores2
Storyboard moores2Storyboard moores2
Storyboard moores2
 
Top Trends Guiding Tech Use in Your Career Practice
Top Trends Guiding Tech Use in Your Career PracticeTop Trends Guiding Tech Use in Your Career Practice
Top Trends Guiding Tech Use in Your Career Practice
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and Challenges
 
SBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and AnalysisSBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and Analysis
 
Inuse seminar Nov 20, 2012 Salovaara
Inuse seminar Nov 20, 2012 SalovaaraInuse seminar Nov 20, 2012 Salovaara
Inuse seminar Nov 20, 2012 Salovaara
 
Storyboard moores2
Storyboard moores2Storyboard moores2
Storyboard moores2
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)
 
Analysis and Design of Information Systems
Analysis and Design of Information SystemsAnalysis and Design of Information Systems
Analysis and Design of Information Systems
 
"From Making to Learning" : Dev Camps as a Blueprint for Re-inventing Project...
"From Making to Learning" : Dev Camps as a Blueprint for Re-inventing Project..."From Making to Learning" : Dev Camps as a Blueprint for Re-inventing Project...
"From Making to Learning" : Dev Camps as a Blueprint for Re-inventing Project...
 
Storyboard moores week 11 multimedia presentation
Storyboard moores week 11 multimedia presentationStoryboard moores week 11 multimedia presentation
Storyboard moores week 11 multimedia presentation
 
Entrepreneur way of thinking
Entrepreneur way of thinkingEntrepreneur way of thinking
Entrepreneur way of thinking
 
Synergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringSynergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software Engineering
 
From DevOps to Operations Science
From DevOps to Operations Science From DevOps to Operations Science
From DevOps to Operations Science
 
Wcet Denver: Re-Thinking E-Learning Research
Wcet Denver: Re-Thinking E-Learning ResearchWcet Denver: Re-Thinking E-Learning Research
Wcet Denver: Re-Thinking E-Learning Research
 
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
 
Aect 2018 workshop
Aect 2018 workshopAect 2018 workshop
Aect 2018 workshop
 
Aect2018 workshop-v6ij-compressed
Aect2018 workshop-v6ij-compressedAect2018 workshop-v6ij-compressed
Aect2018 workshop-v6ij-compressed
 
2015-11-11 research seminar
2015-11-11 research seminar2015-11-11 research seminar
2015-11-11 research seminar
 
Social media marketing loss of control
Social media marketing   loss of controlSocial media marketing   loss of control
Social media marketing loss of control
 
Data-X-v3.1
Data-X-v3.1Data-X-v3.1
Data-X-v3.1
 

Mais de Tao Xie

Csise15 codehunt
Csise15 codehuntCsise15 codehunt
Csise15 codehunt
Tao Xie
 

Mais de Tao Xie (18)

MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
 
Intelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software EngineeringIntelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software Engineering
 
Diversity and Computing/Engineering: Perspectives from Allies
Diversity and Computing/Engineering: Perspectives from AlliesDiversity and Computing/Engineering: Perspectives from Allies
Diversity and Computing/Engineering: Perspectives from Allies
 
Intelligent Software Engineering: Synergy between AI and Software Engineering...
Intelligent Software Engineering: Synergy between AI and Software Engineering...Intelligent Software Engineering: Synergy between AI and Software Engineering...
Intelligent Software Engineering: Synergy between AI and Software Engineering...
 
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
 
Advances in Unit Testing: Theory and Practice
Advances in Unit Testing: Theory and PracticeAdvances in Unit Testing: Theory and Practice
Advances in Unit Testing: Theory and Practice
 
Common Technical Writing Issues
Common Technical Writing IssuesCommon Technical Writing Issues
Common Technical Writing Issues
 
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William EnckHotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
 
Transferring Software Testing and Analytics Tools to Practice
Transferring Software Testing and Analytics Tools to PracticeTransferring Software Testing and Analytics Tools to Practice
Transferring Software Testing and Analytics Tools to Practice
 
User Expectations in Mobile App Security
User Expectations in Mobile App SecurityUser Expectations in Mobile App Security
User Expectations in Mobile App Security
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and Challenges
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software Datasets
 
Next Generation Developer Testing: Parameterized Testing
Next Generation Developer Testing: Parameterized TestingNext Generation Developer Testing: Parameterized Testing
Next Generation Developer Testing: Parameterized Testing
 
Csise15 codehunt
Csise15 codehuntCsise15 codehunt
Csise15 codehunt
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for Security
 
Gamifying Teaching and Learning of Software Engineering and Programming
Gamifying Teaching and Learning of Software Engineering and ProgrammingGamifying Teaching and Learning of Software Engineering and Programming
Gamifying Teaching and Learning of Software Engineering and Programming
 
Tutorial: Text Analytics for Security
Tutorial: Text Analytics for SecurityTutorial: Text Analytics for Security
Tutorial: Text Analytics for Security
 
Teaching and Learning Programming and Software Engineering via Interactive Ga...
Teaching and Learning Programming and Software Engineering via Interactive Ga...Teaching and Learning Programming and Software Engineering via Interactive Ga...
Teaching and Learning Programming and Software Engineering via Interactive Ga...
 

Towards Mining Software Repositories Research that Matters

  • 1. Towards Mining Software Repositories Research that Matters Tao Xie Department of Computer Science University of Illinois at Urbana-Champaign, USA taoxie@illinois.edu
  • 2. Machine Learning that Matters “The basic argument in her paper is that machine learning might be in danger of losing its impact because the community as a whole has become quite self-referential. People are probably solving real-world problems using ML methods, but there is little sharing of these results within the community. Instead, people focus on existing benchmarks which might have originally had some connection to real-world problems which has been long forgotten, however.” “She proposes a number of tasks like $100M solved through ML based decision making or a human life saved through a diagnosis or an intervention recommended by an ML system to get ML back on track.” ICML’12 http://icml.cc/2012/papers/298.pdf http://blog.mikiobraun.de/2012/06/is-machine-learning-losing-impact.html
  • 3. Redwine and Riddle Study (1985) • From idea to “the point it can be popularized and disseminated to the technical community at large” – Worst case: 23 years – Best case: 11 years – Mean: 17 years • 7.5 years from developed technology to wide availability Source©S. L. Pfleeger Sam Redwine Jr., William Riddle: Software Technology Maturation, In Proc. ICSE 1985.
  • 4. Technology Maturation: Middleware Source©A. Wolfhttp://www.sigsoft.org/impact/docs/ImpactWolfBCS2008.pdf 15-20 years between first publication of an idea and widespread availability in products
  • 5. Technology Maturation: Middleware Source©A.http://www.sigsoft.org/impact/docs/ImpactWolfBCS2008.pdf 15-20 years between first publication of an idea and widespread availability in products Shall we just stay in our comfort zone to wait for 15-20 years for our research to (or not to) produce practice impact?? How about the research that we did 15-20 years ago?? [Caveat: don’t forget the need of long-term/blue-sky research!!]
  • 6. 2012 NSF Workshop on Formal Methods • Goal: to identify the future directions in research in formal methods and its transition to industrial practice. • Success examples mentioned by the attendees – SLAM/SDV – ASTREE – SMT-based tools – … http://goto.ucsd.edu/~rjhala/NSFWorkshop/
  • 7. “What Happened to the Promise of Software Tools?” – Jim Larus http://www.srl.inf.ethz.ch/workshop2014/eth-larus.pdf https://www.youtube.com/watch?v=kO9OYnkeRTM
  • 8. Impacts, Impacts, Impacts, … Image source: http://engage.synecoretech.com/marketing-technology-for-growth/bid/155279/How-Online-Content-Impacts-Your-Social-Media-Marketing-Strategy
  • 10. Research Impacts SIGSOFT Impact Paper Awards, ICSE MIP awards, … …
  • 11. Practice Impacts ACM Software System Awards 31 Awardees http://awards.acm.org/software_system/
  • 12. Practice Impacts ACM Software System Awards • Development Environments/Tools – 2013: Coq – 2012: LLVM – 2011: Eclipse – 2007: Statemate – 2006: Eiffel – 2005: The Boyer-Moore Theorem Prover (ACL2) – 2003: MAKE – 2001: SPIN – 1992: Interlisp • Languages – 2002: Java – 1998: The S System (R statistical analysis) – 1997: Tcl/Tk – 1987: SMALLTALK
  • 13. 2012 LLVM born at Illinois • The openness of the LLVM technology and the quality of its architecture and engineering design are key factors in understanding the success it has had both in academia and industry Vikram Adve Chris Lattner Evan Cheng http://llvm.org/
  • 14. Practice Impacts commercialization/industrial adoption … SAGE ASTRÉE Statechart SPIN Moles Microsoft Research … …
  • 15. Practice Impacts research publications  industrial adoption done by others … • ICSE 00 Daikon paper by Ernst et al.  Agitar Agitator – https://homes.cs.washington.edu/~mernst/pubs/invariants-relevance-icse2000.pdf • ASE 04 Rostra paper by Xie et al.  Parasoft Jtest improvement – http://web.engr.illinois.edu/~taoxie/publications/ase04.pdf • PLDI/FSE 05 DART/CUTE papers by Sen et al.  MSR SAGE, Pex – http://srl.cs.berkeley.edu/~ksen/papers/dart.pdf – http://srl.cs.berkeley.edu/~ksen/papers/C159-sen.pdf
  • 16. HOW??? • Are these impact goals too far from you? • Can you plan for that? • What if you are in a university research group? • …
  • 17. (How) Can A University Group Do It? • Aim for research impacts more commonly – but sometimes/often may not be predicted well, e.g., Whyper [USENIX SEC 13] http://web.engr.illinois.edu/~taoxie/publications/usenixsec13-whyper.pdf • Start a startup – but desirable to have right people (e.g., former students) to start – but software engineering tools may not sell crazily • Collaborate with industrial research labs – but many research lab projects may look like univ. projects • Collaborate with industrial product groups – but many probs faced by product groups may not be “researchy” • At least focus on problems that matter (now or future)!
  • 18. (How) Can A University Group Do It? • Need to balance/unify producing great students vs./and great (high practice-impact) research http://www.cs.washington.edu/people/faculty/notkin/students conts.
  • 19. Experience Reports on Successful Tool Transfer • Nikolai Tillmann, Jonathan de Halleux, and Tao Xie. Transferring an Automated Test Generation Tool to Practice: From Pex to Fakes and Code Digger. In Proceedings of ASE 2014, Experience Papers. http://web.engr.illinois.edu/~taoxie/publications/ase14- pexexperiences.pdf • Jian-Guang Lou, Qingwei Lin, Rui Ding, Qiang Fu, Dongmei Zhang, and Tao Xie. Software Analytics for Incident Management of Online Services: An Experience Report. In Proceedings ASE 2013, Experience Paper. http://web.engr.illinois.edu/~taoxie/publications/ase13-sas.pdf • Dongmei Zhang, Shi Han, Yingnong Dang, Jian-Guang Lou, Haidong Zhang, and Tao Xie. Software Analytics in Practice. IEEE Software, Special Issue on the Many Faces of Software Analytics, 2013. http://web.engr.illinois.edu/~taoxie/publications/ieeesoft13-softanalytics.pdf • Yingnong Dang, Dongmei Zhang, Song Ge, Chengyun Chu, Yingjun Qiu, and Tao Xie. XIAO: Tuning Code Clones at Hands of Engineers in Practice. In Proceedings of ACSAC 2012. http://web.engr.illinois.edu/~taoxie/publications/acsac12-xiao.pdf
  • 20. Q & A http://www.cs.illinois.edu/homes/taoxie/ Contact: taoxie@illinois.edu Supported in part by a Microsoft Research Award, NSF grants CCF-1349666, CNS-1434582, CCF-1434596, CCF- 1434590, CNS-1439481, and the USA National Security Agency (NSA) Science of Security Lablet. Discussion
  • 21. Discussion Topics: HOW??? • Are these impact goals too far from you? • Can you plan for that? • What if you are in a university research group? • …