The document describes a study that aimed to predict the severity of reported software bugs by analyzing their textual descriptions. The study used a Bayesian classifier trained on bug reports from Mozilla, Eclipse, and GNOME to classify bug severity. Results showed the approach could predict severity with reasonable precision and recall when using short descriptions and training per component. Combining components required a larger training set.
1. Proceedings of the 2010 7th IEEE Working Conference on Mining Software Repositories, p.1-10
Predicting the Severity of a Reported Bug
Ahmed Lamkanfi, Serge Demeyer | Emanuel Giger | Bart Goethals
Ansymo | s.e.a.l. | ADReM
5. Severity of a bug is important
✓ Critical factor in deciding how soon it needs to be fixed, i.e. when prioritizing bugs
10. ✓ Severity varies:
➡ trivial, minor, normal, major, critical and blocker
➡ clear guidelines exist to classify the severity of bug reports
✓ Both a short and a longer description of the problem
✓ Bugs are grouped according to products and components
➡ e.g.: plug-ins and bookmarks are components of the product Firefox
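The report structure sketched above can be modeled with a small record type. This is an illustrative sketch only: the field names and the example report are invented, not the actual Bugzilla schema.

```python
# Illustrative model of a Bugzilla-style bug report as described in the slides:
# a severity level, a short and a longer description, and a product/component
# grouping. Field names are assumptions, not the real Bugzilla field names.
from dataclasses import dataclass

@dataclass
class BugReport:
    product: str      # e.g. "Firefox"
    component: str    # e.g. "Bookmarks"
    severity: str     # trivial | minor | normal | major | critical | blocker
    short_desc: str   # one-line summary
    long_desc: str    # full description of the problem

# Hypothetical example report
report = BugReport(
    product="Firefox",
    component="Bookmarks",
    severity="major",
    short_desc="Bookmarks toolbar disappears after restart",
    long_desc="After restarting the browser, the bookmarks toolbar ...",
)
```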
15. Can we accurately predict the severity of a reported bug by analyzing its textual descriptions?
Also the following questions:
➡ Potential indicators?
➡ Short versus long description?
➡ Per component versus cross-component?
21. We use text mining to classify bug reports
• Bayesian classifier, based on the probabilistic occurrence of words
• a training and an evaluation period
• in the first instance, per component
Reports are mapped onto two classes, with normal bugs left undecided:
➡ Non-severe bugs: trivial, minor
➡ Default (undecided): normal
➡ Severe bugs: major, critical, blocker
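The classifier set-up above can be sketched as a minimal Naive Bayes text classifier over the words of a bug description. This is an assumption-laden sketch, not the paper's implementation: the toy training reports, the Laplace smoothing, and the two-class labels are illustrative only.

```python
# Minimal Naive Bayes sketch of the per-component set-up described above.
# The toy reports below are invented examples, not data from the study.
import math
from collections import Counter, defaultdict

def train(reports):
    """reports: list of (description, label) pairs."""
    word_counts = defaultdict(Counter)   # label -> word frequencies
    label_counts = Counter()             # label -> number of reports
    vocab = set()
    for text, label in reports:
        words = text.lower().split()
        word_counts[label].update(words)
        label_counts[label] += 1
        vocab.update(words)
    return word_counts, label_counts, vocab

def classify(text, model):
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best, best_score = None, float("-inf")
    for label in label_counts:
        # log P(label) + sum of log P(word | label), Laplace-smoothed
        score = math.log(label_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for w in text.lower().split():
            score += math.log((word_counts[label][w] + 1) / denom)
        if score > best_score:
            best, best_score = label, score
    return best

# Toy training set (invented, not from the paper's data)
reports = [
    ("crash on startup data loss", "severe"),
    ("browser hangs and crashes", "severe"),
    ("typo in preferences dialog", "non-severe"),
    ("minor cosmetic misalignment in toolbar", "non-severe"),
]
model = train(reports)
print(classify("crashes when loading page", model))  # → severe
```

Training per component, as the slides propose, simply means building one such model from the reports of a single component.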
22. Evaluation of the approach:
✓ precision and recall
✓ cases drawn from the open-source community: Mozilla, Eclipse and GNOME
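The two evaluation measures can be computed per severity class from the prediction outcomes. The sketch below shows the standard definitions (precision = TP/(TP+FP), recall = TP/(TP+FN)); the example predictions are hypothetical, not results from the study.

```python
# Precision and recall for one class, as used to evaluate the classifier.
def precision_recall(predicted, actual, positive="severe"):
    tp = sum(1 for p, a in zip(predicted, actual) if p == positive and a == positive)
    fp = sum(1 for p, a in zip(predicted, actual) if p == positive and a != positive)
    fn = sum(1 for p, a in zip(predicted, actual) if p != positive and a == positive)
    precision = tp / (tp + fp) if tp + fp else 0.0  # of predicted severe, how many truly are
    recall = tp / (tp + fn) if tp + fn else 0.0     # of truly severe, how many were found
    return precision, recall

# Hypothetical predictions over five reports
pred   = ["severe", "severe", "non-severe", "severe", "non-severe"]
actual = ["severe", "non-severe", "non-severe", "severe", "severe"]
print(precision_recall(pred, actual))  # precision 2/3, recall 2/3
```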
29. How does the approach perform when using the longer description?

component                 Non-severe            Severe
                          precision   recall    precision   recall
Mozilla: Layout           0.583       0.961     0.890       0.314
Mozilla: Bookmarks        0.536       0.963     0.820       0.166
Mozilla: Firefox general  0.578       0.948     0.856       0.308
Eclipse: UI               0.548       0.976     0.892       0.197
Eclipse: JDT-UI           0.547       0.973     0.881       0.195
Eclipse: JDT-Text         0.570       0.988     0.955       0.257
33. How does the approach perform when combining bugs from different components?

project       Non-severe            Severe
              precision   recall    precision   recall
Mozilla       0.704       0.750     0.733       0.685
Eclipse       0.693       0.553     0.628       0.755
GNOME         0.817       0.737     0.760       0.835

Much larger training set necessary:
✓ ± 2000 reports instead of ± 500 per severity!
34. Conclusions
✓ It is possible to predict the severity of a reported bug
✓ The short description is the better source for predictions
✓ The cross-component approach works, but requires more training samples