Code coverage for MSR Researches [Work in Progress]

•

1 gostou•912 visualizações

Maurício Aniche

My talk about the tool and heuristic I am developing to calculate code coverage statically.

Tecnologia Educação

Code Coverage
for MSR Researchers
Mauricio Aniche
aniche@ime.usp.br
Monday, November 18, 13

What is Code Coverage?
• Describes how much a production code is
tested by the test suite.
• It basically counts the numbers of executed
lines (when running the test suite) divided
by the number of total lines.
Monday, November 18, 13

Why Do We Need This?
• It is hard to calculate code coverage when
studying a large quantity of repositories.
• Compiled code needed
• Test suite execution needed
• As we know, every project contains a
different way to compile/run.
Monday, November 18, 13

Statical Analysis
• Statical Analysis would solve this problem.
• It is impossible to execute the code
statically.
• We need heuristics!
Monday, November 18, 13

Our idea
• A production method contains a certain
level of complexity (which can be measured
by McCabe’s number)
• public void a() {
if(x) return 1; else return 2;
}
• If a method contains 2 different paths, then
it probably needs two different tests.
Monday, November 18, 13

Our formula
• Method-level: Qty of tests / McCabe’s
number
• Class-Level: Sum(Qty of tests per
method) / Sum(McCabe’s number per
method)
Monday, November 18, 13

Identifying test
methods
@Test
public void testaOMetodo2() {
A a = new A();

int resultado = a.fazAlgo();

Assert.assertEquals(1, resultado);
}
@Test
public void testaOMetodo() {
A a = new A();

int resultado = a.getB().fazAlgo();

Assert.assertEquals(1, resultado);
}
1st impl.
tests fazAlgo()
2nd impl.
tests getB(),
fazAlgo()
Monday, November 18, 13

Comparing the solution
• I want to compare to Emma (a tool that
does dynamic analysis on the source code)
• I don’t want to replace the tool (it does
not make sense)
• I want to discover the error average
• If it is small, then we can use it.
Monday, November 18, 13

Calculating the
difference
• All charts were based on the difference
between our calculated number minus
Emma’s number.
• It means that a “0” means that the
numbers were the same
• A negative number indicates the our tool
calculates a smaller code coverage than
Emma.
• A positive number, the other way around.
Monday, November 18, 13

Spearman Correlation
Monday, November 18, 13

Discussion
• Looks like the tool can differ from dynamic
analysis by 25%~30%.
• Questions:
• How can I eliminate big mistakes?
• How can I determine if the tool is valid
or not?
Monday, November 18, 13

Advantages
• Really fast. It does not need to compile and
run the tests.
• If the test fails, dynamic analysis may fail.
Static analysis do not.
Monday, November 18, 13

Disadvantages
• It is an heuristic.
• The implementation is very complicated.
• There might be bugs on the
implementation.
• There are a few things that is pretty hard
to identify. Mainly inheritance and
polymorphism.
• AOP code.
Monday, November 18, 13

Wanna help?
• github.com/mauricioaniche/gelato2
• github.com/metricminer-msr/codemetrics
• My goal: MSR MSR MSR!
Monday, November 18, 13

Thanks!
• aniche@ime.usp.br
Monday, November 18, 13

Mais conteúdo relacionado

Destaque

Minicurso sobre Evolução de Software no CBSoft 2011Maurício Aniche

O que estamos temos feito com mineração de repositório de código no IME?Maurício Aniche

Does the Act of Refactoring Really Make Code Simpler? A Preliminary Study - W...Maurício Aniche

Test-Driven Development serve pra mim?Maurício Aniche

TDD depois do mainstream. E agora?Maurício Aniche

Como eu aprendi que testar software é importante?Maurício Aniche

Efeitos da Prática de Revisão de Código na Caelum: Um Estudo Preliminar em Du...Maurício Aniche

Você tem um xerife olhando seu código?Maurício Aniche

DNAD 2015 - Métricas de código, pra que te quero?Maurício Aniche

Proposta: Métricas e Heurísticas para Detecção de Problemas em Aplicações WebMaurício Aniche

Mining the Modern Code Review Repositories: A Dataset of People, Process and ...Norihiro Yoshida

MSR mining challenge 2015 - Quick TriggerXin Yang

MSR 2016 data showcase - Mining Code Review RepositoriesXin Yang

Affiliate Marketing Group Cguestd3d19d

The 25 Th Anniversary Of The Surgeon General’S Workshop On Breastfeeding And ...Biblioteca Virtual

Taking Care of BusinessIFS

Health System Factors Contributing To Breastfeeding SuccessBiblioteca Virtual

ComunicaCity call4ideas2015 ForumPAComunicaCity

Panic Anxiety Attacksguesta985eb

How To Support A Breastfeeding Mother NotasBiblioteca Virtual

Destaque (20)

Minicurso sobre Evolução de Software no CBSoft 2011

O que estamos temos feito com mineração de repositório de código no IME?

Does the Act of Refactoring Really Make Code Simpler? A Preliminary Study - W...

Test-Driven Development serve pra mim?

TDD depois do mainstream. E agora?

Como eu aprendi que testar software é importante?

Efeitos da Prática de Revisão de Código na Caelum: Um Estudo Preliminar em Du...

Você tem um xerife olhando seu código?

DNAD 2015 - Métricas de código, pra que te quero?

Proposta: Métricas e Heurísticas para Detecção de Problemas em Aplicações Web

Mining the Modern Code Review Repositories: A Dataset of People, Process and ...

MSR mining challenge 2015 - Quick Trigger

MSR 2016 data showcase - Mining Code Review Repositories

Affiliate Marketing Group C

The 25 Th Anniversary Of The Surgeon General’S Workshop On Breastfeeding And ...

Taking Care of Business

Health System Factors Contributing To Breastfeeding Success

ComunicaCity call4ideas2015 ForumPA

Panic Anxiety Attacks

How To Support A Breastfeeding Mother Notas

Semelhante a Code coverage for MSR Researches [Work in Progress]

Lecture01 algorithm analysisZara Nawaz

Computational thinkingr123457

daa unit 1.pptxLakshayYadav46

CS8461 - Design and Analysis of AlgorithmsKrishnan MuthuManickam

Basic Problems and Solving AlgorithmsNopadon Juneam

DAA 1 ppt.pptxRAJESH S

DAA ppt.pptxRAJESH S

DAA Slides for Multiple topics such as different algorithmsDEVARSHHIRENBHAIPARM

Algo_Lecture01.pptxShaistaRiaz4

Lecture01.pptInamUllahKhan961803

Chapter1.1 Introduction.pptTekle12

Chapter1.1 Introduction to design and analysis of algorithm.pptTekle12

Data Structures - Lecture 1 [introduction]Muhammad Hammad Waseem

Algorithm evaluation using Item Response TheoryCSIRO

Analysis and Design of AlgorithmsBulbul Agrawal

Algorithm analysis (All in one)jehan1987

DA lecture 3.pptxSayanSen36

Lecture 2 role of algorithms in computingjayavignesh86

CH-1.1 Introduction (1).pptxsatvikkushwaha1

VSSML18. Advanced WhizzML WorkflowsBigML, Inc

Semelhante a Code coverage for MSR Researches [Work in Progress] (20)

Lecture01 algorithm analysis

Computational thinking

daa unit 1.pptx

CS8461 - Design and Analysis of Algorithms

Basic Problems and Solving Algorithms

DAA 1 ppt.pptx

DAA ppt.pptx

DAA Slides for Multiple topics such as different algorithms

Algo_Lecture01.pptx

Lecture01.ppt

Chapter1.1 Introduction.ppt

Chapter1.1 Introduction to design and analysis of algorithm.ppt

Data Structures - Lecture 1 [introduction]

Algorithm evaluation using Item Response Theory

Analysis and Design of Algorithms

Algorithm analysis (All in one)

DA lecture 3.pptx

Lecture 2 role of algorithms in computing

CH-1.1 Introduction (1).pptx

VSSML18. Advanced WhizzML Workflows

Mais de Maurício Aniche

Can ML help software developers? (TEQnation 2022)Maurício Aniche

Tracing Back Log Data to its Log Statement: From Research to PracticeMaurício Aniche

Pragmatic software testing education - SIGCSE 2019Maurício Aniche

Test Automation Day 2018Maurício Aniche

Software Testing with Caipirinhas and StroopwafelsMaurício Aniche

Code smells in MVC applications (Dutch Spring meetup)Maurício Aniche

A Collaborative Approach to Teach Software Architecture - SIGCSE 2017Maurício Aniche

[TDC 2014] Métricas de código, pra que te quero?Maurício Aniche

Métricas de código, pra que te quero?Maurício Aniche

Defesa de mestrado: Como a prática de TDD influencia no projeto de classes em...Maurício Aniche

O que é código bonito?Maurício Aniche

Métodos Ágeis de Desenvolvimento de Software: Uma introduçãoMaurício Aniche

The relationship between test and production code quality (@ SIG)Maurício Aniche

What Do the Asserts in a Unit Test Tell Us About Code Quality? (CSMR2013)Maurício Aniche

Mais de Maurício Aniche (14)

Can ML help software developers? (TEQnation 2022)

Tracing Back Log Data to its Log Statement: From Research to Practice

Pragmatic software testing education - SIGCSE 2019

Test Automation Day 2018

Software Testing with Caipirinhas and Stroopwafels

Code smells in MVC applications (Dutch Spring meetup)

A Collaborative Approach to Teach Software Architecture - SIGCSE 2017

[TDC 2014] Métricas de código, pra que te quero?

Métricas de código, pra que te quero?

Defesa de mestrado: Como a prática de TDD influencia no projeto de classes em...

O que é código bonito?

Métodos Ágeis de Desenvolvimento de Software: Uma introdução

The relationship between test and production code quality (@ SIG)

What Do the Asserts in a Unit Test Tell Us About Code Quality? (CSMR2013)

Último

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

DBX First Quarter 2024 Investor PresentationDropbox

Why Teams call analytics are critical to your entire businesspanagenda

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

ICT role in 21st century education and its challengesrafiqahmad00786416

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Apidays New York 2024 - The value of a flexible API Management solution for O...apidays

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Code coverage for MSR Researches [Work in Progress]

1. Code Coverage for MSR Researchers Mauricio Aniche aniche@ime.usp.br Monday, November 18, 13

2. What is Code Coverage? • Describes how much a production code is tested by the test suite. • It basically counts the numbers of executed lines (when running the test suite) divided by the number of total lines. Monday, November 18, 13

3. Why Do We Need This? • It is hard to calculate code coverage when studying a large quantity of repositories. • Compiled code needed • Test suite execution needed • As we know, every project contains a different way to compile/run. Monday, November 18, 13

4. Statical Analysis • Statical Analysis would solve this problem. • It is impossible to execute the code statically. • We need heuristics! Monday, November 18, 13

5. Our idea • A production method contains a certain level of complexity (which can be measured by McCabe’s number) • public void a() { if(x) return 1; else return 2; } • If a method contains 2 different paths, then it probably needs two different tests. Monday, November 18, 13

6. Our formula • Method-level: Qty of tests / McCabe’s number • Class-Level: Sum(Qty of tests per method) / Sum(McCabe’s number per method) Monday, November 18, 13

7. Identifying test methods @Test public void testaOMetodo2() { A a = new A(); int resultado = a.fazAlgo(); Assert.assertEquals(1, resultado); } @Test public void testaOMetodo() { A a = new A(); int resultado = a.getB().fazAlgo(); Assert.assertEquals(1, resultado); } 1st impl. tests fazAlgo() 2nd impl. tests getB(), fazAlgo() Monday, November 18, 13

8. Comparing the solution • I want to compare to Emma (a tool that does dynamic analysis on the source code) • I don’t want to replace the tool (it does not make sense) • I want to discover the error average • If it is small, then we can use it. Monday, November 18, 13

9. Calculating the difference • All charts were based on the difference between our calculated number minus Emma’s number. • It means that a “0” means that the numbers were the same • A negative number indicates the our tool calculates a smaller code coverage than Emma. • A positive number, the other way around. Monday, November 18, 13

10. A few examples Monday, November 18, 13

11. Spearman Correlation Monday, November 18, 13

12. Metric Miner Monday, November 18, 13

13. Gnarus Monday, November 18, 13

14. Caelum Web Monday, November 18, 13

15. Discussion • Looks like the tool can differ from dynamic analysis by 25%~30%. • Questions: • How can I eliminate big mistakes? • How can I determine if the tool is valid or not? Monday, November 18, 13

16. Advantages • Really fast. It does not need to compile and run the tests. • If the test fails, dynamic analysis may fail. Static analysis do not. Monday, November 18, 13

17. Disadvantages • It is an heuristic. • The implementation is very complicated. • There might be bugs on the implementation. • There are a few things that is pretty hard to identify. Mainly inheritance and polymorphism. • AOP code. Monday, November 18, 13

18. Wanna help? • github.com/mauricioaniche/gelato2 • github.com/metricminer-msr/codemetrics • My goal: MSR MSR MSR! Monday, November 18, 13

19. Thanks! • aniche@ime.usp.br Monday, November 18, 13

Code coverage for MSR Researches [Work in Progress]

Recomendados

Recomendados

Mais conteúdo relacionado

Destaque

Destaque (20)

Semelhante a Code coverage for MSR Researches [Work in Progress]

Semelhante a Code coverage for MSR Researches [Work in Progress] (20)

Mais de Maurício Aniche

Mais de Maurício Aniche (14)

Último

Último (20)

Code coverage for MSR Researches [Work in Progress]