Sarcia idoese08

An Approach to Improving Parametric Estimation Models in case of Violation of Assumptions 1 Dept. of Informatica, Sistemi e Produzione University of Rome “Tor Vergata” S. Alessandro Sarcià 1,2 [email_address] Giovanni Cantone 1 Victor R. Basili 2,3 2 Dept. of Computer Science University of Maryland and 2 Fraunhofer Center for ESE Maryland Author Advisors

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Outline

Predicting software engineering variables accurately is the basis for success of mature organizations. This is still an unsolved problem. Our point of view: Prediction is about estimating values based on mathematical and statistical approaches (no guessing), e.g., regression functions Variables are cost, effort, size, defects, fault proneness, number of test cases and so forth Success refers to delivering software systems on time, on budget, and on quality as initially required. In software estimation , success is about providing estimates as close to the actual values as possible (the error is less than a stated threshold). Focus: We consider a wider meaning of it as keeping prediction uncertainty within acceptable thresholds (risk analysis on the estimation model) Organizations that we refer to are learning organizations that aim at improving their success over time.

Objectives ,[object Object],[object Object],[object Object],EM  Estimation Model

An overview on the approach ,[object Object],[object Object],[object Object],[object Object],[object Object],To analyze the uncertainty … To implement our solution To apply our solution The Problem The Solution The Application

Regression functions EM: y = f (x,  ) +  , E(  ) = 0 and cov(  ) = I  2 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ŷ = f(x, B ) with B   and y  ŷ ; r = (y- ŷ)   e.g., Least Squares estimates

Regression assumptions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

In case of violations, when we estimate the uncertainty on the next estimate the prediction interval may be unreliable (type I – II errors). Violation of Regression assumptions If normality does not hold we cannot use t-Student’s percentiles This is no longer constant This is not the standard error This is not the spread It may be correct Estimate Prediction Interval

Violation of Regression assumptions

The mathematical solution ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The Quality Improvement Paradigm

The Estimation Improvement Process

Building the BDF Non-linear x-dependent median Class A Class B BDF 0 1 0.5 RE KSLOC (Posterior) Probability RE RE (P1) RE (P2) fixing  A family

Inverting the BDF (Sigmoid is smooth and monotonic) Inv(BDF) Fixing the probability RE KSLOC (fixed) 0 0.975 0.5 (Posterior) Probability RE Me UP Fixing a credibility range (95%) 1 0.025 Me DOWN (Bayesian) Error Prediction Interval

Analyzing the model behavior 0 Flatter Steeper Biased Biased Unbiased Unbiased KSLOC = 0.95 KSLOC = 0.55 KSLOC = 0.32 KSLOC = 0.11

Estimate Prediction Interval (M. Jørgensen ) RE = (Act – Est)/Act To estimate the Estimate Prediction Interval from the Error Prediction Interval, we can substitute and inverting the formula: [Me DOWN , Me UP ] = (Act – Est)/ Act O N+1 DOWN = Act DOWN = Est/(1 – Me DOWN ) O N+1 UP = Act UP = Est/(1 – Me UP ) Estimate Prediction Interval

Scope Error (similarity analysis with estimated data)

Assumption Error (estimated data)

Improving the model (actual data) Scope extension

Improving the model (actual data) Error magnitude and bias What we need to be worried about is the relative error magnitude not the bias

Improving the model (actual data) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The NASA COCOMO data set [PROMISE] UB BS UB BS -0.9 -2.4 Relative Error EXT EXT EXT UB UB UB UB UB UB 77 historical projects (before 1985), 16 projects being estimated (from 1985 to 1987)

Benefits of using this approach ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Sarcia idoese08

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (18)

Destaque

Destaque (8)

Semelhante a Sarcia idoese08

Semelhante a Sarcia idoese08 (20)

Último

Último (20)

Sarcia idoese08