SlideShare uma empresa Scribd logo
1 de 25
Multiple Regression
Introduction
• Describe some of the differences between
the multiple regression and bi-variate
regression
• Assess the importance of the R-squared
statistic.
• Examine the F-test and distribution
• Show how we can use the F-test to
determine joint significance.
Multiple Regression
• In general the regression estimates are
more reliable if:
i) n is large (large dataset)
ii) The sample variance of the explanatory
variable is high.
iii) the variance of the error term is small
iv) The less closely related are the
explanatory variables.
Multiple Regression
• The constant and parameters are derived
in the same way as with the bi-variate
model. It involves minimising the sum of
the error terms. The equation for the slope
parameters (α) contains an expression for
the covariance between the explanatory
variables.
• When a new variable is added it affects
the coefficients of the existing variables
Regression
)
tan
,
45
(
56
.
1
,
3
.
0
R
(0.3)
(0.4)
(0.1)
9
.
0
4
.
0
6
.
0
ˆ
2
brackets
in
errors
dard
s
ns
observatio
DW
z
x
y t
t
t





Regression
• In the previous slide, a unit rise in x produces
0.4 of a unit rise in y, with z held constant.
• Interpretation of the t-statistics remains the
same, i.e. 0.4-0/0.4=1 (critical value is 2.02), so
we fail to reject the null and x is not significant.
• The R-squared statistic indicates 30% of the
variance of y is explained
• DW statistic indicates we are not sure if there is
autocorrelation, as the DW statistic lies in the
zone of indecision (Dl=1.43, Du=1.62)
Adjusted R-squared Statistic
• This statistic is used in a multiple regression
analysis, because it does not automatically
rise when an extra explanatory variable is
added.
• Its value depends on the number of
explanatory variables
• It is usually written as (R-bar squared):
2
R
Adjusted R-squared
• In generally rises when the t-statistic of an
extra variable exceeds unity (1),so does not
necessarily imply the extra variable is
significant.
• It has the following formula (n-number of
observations, k-number of parameters):
)
1
(
1 2
2
2
R
k
n
k
R
R 




The F-test
• The F-test is an analysis of the variance of a regression
• It can be used to test for the significance of a group of
variables or for a restriction
• It has a different distribution to the t-test, but can be used
to test at different levels of significance
• When determining the F-statistic we need to collect
either the residual sum of squares (RSS) or the R-
squared statistic
• The formula for the F-test of a group of variables can be
expressed in terms of either the residual sum of squares
(RSS) or explained sum of squares (ESS)
F-test of explanatory power
• This is the F-test for the goodness of fit of
a regression and in effect tests for the joint
significance of the explanatory variables.
• It is based on the R-squared statistic
• It is routinely produced by most computer
software packages
• It follows the F-distribution, which is quite
different to the t-test
F-test formula
• The formula for the F-test of the goodness
of fit is:
1
2
2
)
/(
)
1
(
1
/






k
k
n
F
k
n
R
k
R
F
F-distribution
• To find the critical value of the F-
distribution, in general you need to know
the number of parameters and the
degrees of freedom
• The number of parameters is then read
across the top of the table, the d of f. from
the side. Where these two values
intersect, we find the critical value.
F-test critical value
5.1
5.2
5.4
5.8
6.6
5
6.3
6.4
6.6
7.0
7.7
4
9.0
9.1
9.3
9.6
10.1
3
19.3
19.3
19.2
19.0
18.5
2
230.2
224.6
215.7
199.5
161.4
1
5
4
3
2
1
F-distribution
• Both go up to infinity
• If we wanted to find the critical value for
F(3,4), it would be 6.6
• The first value (3) is often termed the
numerator, whilst the second (4) the
denominator.
• It is often written as:
3
4
F
F-statistic
• When testing for the significance of the
goodness of fit, our null hypothesis is that
the explanatory variables jointly equal 0.
• If our F-statistic is below the critical value
we fail to reject the null and therefore we
say the goodness of fit is not significant.
Joint Significance
• The F-test is useful for testing a number of
hypotheses and is often used to test for
the joint significance of a group of
variables
• In this type of test, we often refer to
‘testing a restriction’
• This restriction is that a group of
explanatory variables are jointly equal to 0
F-test for joint significance
• The formula for this test can be viewed as:
remaining
freedom
of
Degrees
remaining/
squares
of
sum
Residual
up
used
freedom
of
degrees
Extra
fit/
in
t
Improvemen
F-tests
• The test for joint significance has its own
formula, which takes the following form:
RSS
restricted
RSS
RSS
ed
unrestrict
RSS
el
ed
unrestrict
in
parameters
k
ns
restrictio
of
number
m
k
n
RSS
m
RSS
RSS
F
R
u
u
u
R
mod
/
/







Joint Significance of a group of
variables
• To carry out this test you need to conduct two
separate OLS regression, one with all the
explanatory variables in (unrestricted equation),
the other with the variables whose joint
significance is being tested, removed.
• Then collect the RSS from both equations.
• Put the values in the formula
• Find the critical value and compare with the test
statistic. The null hypothesis is that the variables
jointly equal 0.
Joint Significance
• If we have a 3 explanatory variable model and
wish to test for the joint significance of 2 of the
variables (x and z), we need to run the following
restricted and unrestricted models:
ed
unrestrict
u
z
x
w
y
restricted
u
w
y
t
t
t
t
t
t
t
t










3
2
1
0
1
0






Example of the F-test for joint
significance
• Given the following model, we wish to test the joint
significance of w and z. Having estimated them, we
collect their respective RSSs (n=60).
5
.
1
75
.
0
1
0
3
2
1
0














R
t
t
t
u
t
t
t
t
t
RSS
restricted
v
x
y
RSS
ed
unrestrict
u
z
w
x
y






Joint significance
s
y variable
explanator
are
,
,
s
error term
are
,
.
parameters
slope
are
,
....
constants.
are
,
:
1
3
1
0
0
t
t
t
t
t
z
w
x
v
u
where





Joint significance
• Having obtained the RSSs, we need to input
the values into the earlier formula (slide 18):
15
.
3
:
value
critical
28
0134
.
0
375
.
0
4
60
/
75
.
0
2
/
75
.
0
5
.
1
2
56 




F
Joint significance
• As the F statistic is greater than the critical
value (28>3.15), we reject the null hypothesis
and conclude that the variables w and z are
jointly significant and should remain in the
model.
0
:
0
:
3
2
1
3
2
0








H
H
Conclusion
• Multiple regression analysis is similar to bi-
variate analysis, however correlation between
the x variables needs to be taken into account
• The adjusted R-squared statistic tends to be
used in this case
• The F-test is used to test for joint explanatory
power of the whole regression or a sub-set of
the variables
• We often use the F-test when testing for things
like seasonal effects in the data.

Mais conteúdo relacionado

Semelhante a Multiple Regression.ppt

Business Quantitative Lecture 3
Business Quantitative Lecture 3Business Quantitative Lecture 3
Business Quantitative Lecture 3
saark
 
Research method ch09 statistical methods 3 estimation np
Research method ch09 statistical methods 3 estimation npResearch method ch09 statistical methods 3 estimation np
Research method ch09 statistical methods 3 estimation np
naranbatn
 
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdfStability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
MUST
 
IBM401 Lecture 5
IBM401 Lecture 5IBM401 Lecture 5
IBM401 Lecture 5
saark
 
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
MUST
 

Semelhante a Multiple Regression.ppt (20)

Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Practice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anovaPractice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anova
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Stats chapter 3
Stats chapter 3Stats chapter 3
Stats chapter 3
 
Contingency Table Test, M. Asad Hayat, UET Taxila
Contingency Table Test, M. Asad Hayat, UET TaxilaContingency Table Test, M. Asad Hayat, UET Taxila
Contingency Table Test, M. Asad Hayat, UET Taxila
 
Correlation
CorrelationCorrelation
Correlation
 
Business Quantitative Lecture 3
Business Quantitative Lecture 3Business Quantitative Lecture 3
Business Quantitative Lecture 3
 
An intrduction to the Multiple Regression.pptx
An intrduction to the Multiple Regression.pptxAn intrduction to the Multiple Regression.pptx
An intrduction to the Multiple Regression.pptx
 
Research method ch09 statistical methods 3 estimation np
Research method ch09 statistical methods 3 estimation npResearch method ch09 statistical methods 3 estimation np
Research method ch09 statistical methods 3 estimation np
 
CORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptxCORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptx
 
Diagnostic Tests.ppt
Diagnostic Tests.pptDiagnostic Tests.ppt
Diagnostic Tests.ppt
 
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
 
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdfStability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
Stability criteria and Analysis_Control Systems Engineering_MEB 4101.pdf
 
UNIT 5.pptx
UNIT 5.pptxUNIT 5.pptx
UNIT 5.pptx
 
IBM401 Lecture 5
IBM401 Lecture 5IBM401 Lecture 5
IBM401 Lecture 5
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
 
Linearity cochran test
Linearity cochran testLinearity cochran test
Linearity cochran test
 
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
LECTURE 4. Stability criteria and Analysis_Control Systems Engineering_MEB 41...
 
Effect Sizes (ES) for Meta-Analyses.pdf
Effect Sizes (ES) for Meta-Analyses.pdfEffect Sizes (ES) for Meta-Analyses.pdf
Effect Sizes (ES) for Meta-Analyses.pdf
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptx
 

Último

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Último (20)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 

Multiple Regression.ppt

  • 2. Introduction • Describe some of the differences between the multiple regression and bi-variate regression • Assess the importance of the R-squared statistic. • Examine the F-test and distribution • Show how we can use the F-test to determine joint significance.
  • 3. Multiple Regression • In general the regression estimates are more reliable if: i) n is large (large dataset) ii) The sample variance of the explanatory variable is high. iii) the variance of the error term is small iv) The less closely related are the explanatory variables.
  • 4. Multiple Regression • The constant and parameters are derived in the same way as with the bi-variate model. It involves minimising the sum of the error terms. The equation for the slope parameters (α) contains an expression for the covariance between the explanatory variables. • When a new variable is added it affects the coefficients of the existing variables
  • 6. Regression • In the previous slide, a unit rise in x produces 0.4 of a unit rise in y, with z held constant. • Interpretation of the t-statistics remains the same, i.e. 0.4-0/0.4=1 (critical value is 2.02), so we fail to reject the null and x is not significant. • The R-squared statistic indicates 30% of the variance of y is explained • DW statistic indicates we are not sure if there is autocorrelation, as the DW statistic lies in the zone of indecision (Dl=1.43, Du=1.62)
  • 7. Adjusted R-squared Statistic • This statistic is used in a multiple regression analysis, because it does not automatically rise when an extra explanatory variable is added. • Its value depends on the number of explanatory variables • It is usually written as (R-bar squared): 2 R
  • 8. Adjusted R-squared • In generally rises when the t-statistic of an extra variable exceeds unity (1),so does not necessarily imply the extra variable is significant. • It has the following formula (n-number of observations, k-number of parameters): ) 1 ( 1 2 2 2 R k n k R R     
  • 9. The F-test • The F-test is an analysis of the variance of a regression • It can be used to test for the significance of a group of variables or for a restriction • It has a different distribution to the t-test, but can be used to test at different levels of significance • When determining the F-statistic we need to collect either the residual sum of squares (RSS) or the R- squared statistic • The formula for the F-test of a group of variables can be expressed in terms of either the residual sum of squares (RSS) or explained sum of squares (ESS)
  • 10. F-test of explanatory power • This is the F-test for the goodness of fit of a regression and in effect tests for the joint significance of the explanatory variables. • It is based on the R-squared statistic • It is routinely produced by most computer software packages • It follows the F-distribution, which is quite different to the t-test
  • 11. F-test formula • The formula for the F-test of the goodness of fit is: 1 2 2 ) /( ) 1 ( 1 /       k k n F k n R k R F
  • 12. F-distribution • To find the critical value of the F- distribution, in general you need to know the number of parameters and the degrees of freedom • The number of parameters is then read across the top of the table, the d of f. from the side. Where these two values intersect, we find the critical value.
  • 14. F-distribution • Both go up to infinity • If we wanted to find the critical value for F(3,4), it would be 6.6 • The first value (3) is often termed the numerator, whilst the second (4) the denominator. • It is often written as: 3 4 F
  • 15. F-statistic • When testing for the significance of the goodness of fit, our null hypothesis is that the explanatory variables jointly equal 0. • If our F-statistic is below the critical value we fail to reject the null and therefore we say the goodness of fit is not significant.
  • 16. Joint Significance • The F-test is useful for testing a number of hypotheses and is often used to test for the joint significance of a group of variables • In this type of test, we often refer to ‘testing a restriction’ • This restriction is that a group of explanatory variables are jointly equal to 0
  • 17. F-test for joint significance • The formula for this test can be viewed as: remaining freedom of Degrees remaining/ squares of sum Residual up used freedom of degrees Extra fit/ in t Improvemen
  • 18. F-tests • The test for joint significance has its own formula, which takes the following form: RSS restricted RSS RSS ed unrestrict RSS el ed unrestrict in parameters k ns restrictio of number m k n RSS m RSS RSS F R u u u R mod / /       
  • 19. Joint Significance of a group of variables • To carry out this test you need to conduct two separate OLS regression, one with all the explanatory variables in (unrestricted equation), the other with the variables whose joint significance is being tested, removed. • Then collect the RSS from both equations. • Put the values in the formula • Find the critical value and compare with the test statistic. The null hypothesis is that the variables jointly equal 0.
  • 20. Joint Significance • If we have a 3 explanatory variable model and wish to test for the joint significance of 2 of the variables (x and z), we need to run the following restricted and unrestricted models: ed unrestrict u z x w y restricted u w y t t t t t t t t           3 2 1 0 1 0      
  • 21. Example of the F-test for joint significance • Given the following model, we wish to test the joint significance of w and z. Having estimated them, we collect their respective RSSs (n=60). 5 . 1 75 . 0 1 0 3 2 1 0               R t t t u t t t t t RSS restricted v x y RSS ed unrestrict u z w x y      
  • 22. Joint significance s y variable explanator are , , s error term are , . parameters slope are , .... constants. are , : 1 3 1 0 0 t t t t t z w x v u where     
  • 23. Joint significance • Having obtained the RSSs, we need to input the values into the earlier formula (slide 18): 15 . 3 : value critical 28 0134 . 0 375 . 0 4 60 / 75 . 0 2 / 75 . 0 5 . 1 2 56      F
  • 24. Joint significance • As the F statistic is greater than the critical value (28>3.15), we reject the null hypothesis and conclude that the variables w and z are jointly significant and should remain in the model. 0 : 0 : 3 2 1 3 2 0         H H
  • 25. Conclusion • Multiple regression analysis is similar to bi- variate analysis, however correlation between the x variables needs to be taken into account • The adjusted R-squared statistic tends to be used in this case • The F-test is used to test for joint explanatory power of the whole regression or a sub-set of the variables • We often use the F-test when testing for things like seasonal effects in the data.