SlideShare uma empresa Scribd logo
1 de 85
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 1
Chapter 9
Modelling volatility and correlation
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 2
An Excursion into Non-linearity Land
• Motivation: the linear structural (and time series) models cannot
explain a number of important features common to much financial data
- leptokurtosis
- volatility clustering or volatility pooling
- leverage effects
• Our “traditional” structural model could be something like:
yt = 1 + 2x2t + ... + kxkt + ut, or more compactly y = X + u
• We also assumed that ut  N(0,2).
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 3
A Sample Financial Asset Returns Time Series
Daily S&P 500 Returns for August 2003 – August 2013
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 4
Non-linear Models: A Definition
• Campbell, Lo and MacKinlay (1997) define a non-linear data
generating process as one that can be written
yt = f(ut, ut-1, ut-2, …)
where ut is an iid error term and f is a non-linear function.
• They also give a slightly more specific definition as
yt = g(ut-1, ut-2, …)+ ut2(ut-1, ut-2, …)
where g is a function of past error terms only and 2 is a variance
term.
• Models with nonlinear g(•) are “non-linear in mean”, while those with
nonlinear 2(•) are “non-linear in variance”.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 5
Types of non-linear models
• The linear paradigm is a useful one.
• Many apparently non-linear relationships can be made linear by a
suitable transformation.
• On the other hand, it is likely that many relationships in finance are
intrinsically non-linear.
• There are many types of non-linear models, e.g.
- ARCH / GARCH
- switching models
- bilinear models
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 6
Testing for Non-linearity – The RESET Test
• The “traditional” tools of time series analysis (acf’s, spectral analysis)
may find no evidence that we could use a linear model, but the data
may still not be independent.
• Portmanteau tests for non-linear dependence have been developed.
• The simplest is Ramsey’s RESET test, which took the form:
• Here the dependent variable is the residual series and the independent
variables are the squares, cubes, …, of the fitted values.
   ... 
u y y y v
t t t p t
p
t
     

   
0 1
2
2
3
1
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 7
Testing for Non-linearity – The BDS Test
• Many other non-linearity tests are available - e.g., the BDS and
bispectrum test
• BDS is a pure hypothesis test. That is, it has as its null hypothesis that
the data are pure noise (completely random)
• It has been argued to have power to detect a variety of departures from
randomness – linear or non-linear stochastic processes, deterministic
chaos, etc)
• The BDS test follows a standard normal distribution under the null
• The test can also be used as a model diagnostic on the residuals to ‘see
what is left’
• If the proposed model is adequate, the standardised residuals should be
white noise.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 8
Chaos Theory
• Chaos theory is a notion taken from the physical sciences
• It suggests that there could be a deterministic, non-linear set of
equations underlying the behaviour of financial series or markets
• Such behaviour will appear completely random to the standard
statistical tests
• A positive sighting of chaos implies that while, by definition, long-
term forecasting would be futile, short-term forecastability and
controllability are possible, at least in theory, since there is some
deterministic structure underlying the data
• Varying definitions of what actually constitutes chaos can be found in
the literature.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 9
Detecting Chaos
• A system is chaotic if it exhibits sensitive dependence on initial
conditions (SDIC)
• So an infinitesimal change is made to the initial conditions (the initial
state of the system), then the corresponding change iterated through the
system for some arbitrary length of time will grow exponentially
• The largest Lyapunov exponent is a test for chaos
• It measures the rate at which information is lost from a system
• A positive largest Lyapunov exponent implies sensitive dependence,
and therefore that evidence of chaos has been obtained
• Almost without exception, applications of chaos theory to financial
markets have been unsuccessful
• This is probably because financial and economic data are usually far
noisier and ‘more random’ than data from other disciplines
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 10
Neural Networks
• Artificial neural networks (ANNs) are a class of models whose
structure is broadly motivated by the way that the brain performs
computation
• ANNs have been widely employed in finance for tackling time series
and classification problems
• Applications have included forecasting financial asset returns,
volatility, bankruptcy and takeover prediction
• Neural networks have virtually no theoretical motivation in finance
(they are often termed a ‘black box’)
• They can fit any functional relationship in the data to an arbitrary
degree of accuracy.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 11
Feedforward Neural Networks
• The most common class of ANN models in finance are known as
feedforward network models
• These have a set of inputs (akin to regressors) linked to one or more
outputs (akin to the regressand) via one or more ‘hidden’ or
intermediate layers
• The size and number of hidden layers can be modified to give a closer
or less close fit to the data sample
• A feedforward network with no hidden layers is simply a standard
linear regression model
• Neural network models work best where financial theory has virtually
nothing to say about the likely functional form for the relationship
between a set of variables.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 12
Neural Networks – Some Disadvantages
• Neural networks are not very popular in finance and suffer from
several problems:
– The coefficient estimates from neural networks do not have any real
theoretical interpretation
– Virtually no diagnostic or specification tests are available for estimated
models
– They can provide excellent fits in-sample to a given set of ‘training’ data,
but typically provide poor out-of-sample forecast accuracy
– This usually arises from the tendency of neural networks to fit closely to
sample-specific data features and ‘noise’, and so they cannot ‘generalise’
– The non-linear estimation of neural network models can be cumbersome
and computationally time-intensive, particularly, for example, if the model
must be estimated repeatedly when rolling through a sample.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 13
Models for Volatility
• Modelling and forecasting stock market volatility has been the subject
of vast empirical and theoretical investigation
• There are a number of motivations for this line of inquiry:
– Volatility is one of the most important concepts in finance
– Volatility, as measured by the standard deviation or variance of returns, is
often used as a crude measure of the total risk of financial assets
– Many value-at-risk models for measuring market risk require the
estimation or forecast of a volatility parameter
– The volatility of stock market prices also enters directly into the Black–
Scholes formula for deriving the prices of traded options
• We will now examine several volatility models.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 14
Historical Volatility
• The simplest model for volatility is the historical estimate
• Historical volatility simply involves calculating the variance (or
standard deviation) of returns in the usual way over some historical
period
• This then becomes the volatility forecast for all future periods
• Evidence suggests that the use of volatility predicted from more
sophisticated time series models will lead to more accurate forecasts
and option valuations
• Historical volatility is still useful as a benchmark for comparing the
forecasting ability of more complex time models
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 15
Heteroscedasticity Revisited
• An example of a structural model is
with ut  N(0, ).
• The assumption that the variance of the errors is constant is known as
homoscedasticity, i.e. Var (ut) = .
• What if the variance of the errors is not constant?
- heteroscedasticity
- would imply that standard error estimates could be wrong.
• Is the variance of the errors likely to be constant over time? Not for
financial data.
u
2
u
2
yt = 1 + 2x2t + 3x3t + 4x4t + ut
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 16
Autoregressive Conditionally Heteroscedastic
(ARCH) Models
• So use a model which does not assume that the variance is constant.
• Recall the definition of the variance of ut:
= Var(ut ut-1, ut-2,...) = E[(ut-E(ut))2 ut-1, ut-2,...]
We usually assume that E(ut) = 0
so = Var(ut  ut-1, ut-2,...) = E[ut
2 ut-1, ut-2,...].
What could the current value of the variance of the errors plausibly
depend upon?
– Previous squared error terms.
• This leads to the autoregressive conditionally heteroscedastic model
for the variance of the errors:
= 0 + 1
• This is known as an ARCH(1) model
• The ARCH model due to Engle (1982) has proved very useful in
finance.
t
2
t
2
t
2 ut1
2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 17
Autoregressive Conditionally Heteroscedastic
(ARCH) Models (cont’d)
• The full model would be
yt = 1 + 2x2t + ... + kxkt + ut , ut  N(0, )
where = 0 + 1
• We can easily extend this to the general case where the error variance
depends on q lags of squared errors:
= 0 + 1 +2 +...+q
• This is an ARCH(q) model.
• Instead of calling the variance , in the literature it is usually called ht,
so the model is
yt = 1 + 2x2t + ... + kxkt + ut , ut  N(0,ht)
where ht = 0 + 1 +2 +...+q
t
2
t
2
t
2
ut1
2
ut q

2
ut q

2
t
2
2
1

t
u
2
2

t
u
2
1

t
u
2
2

t
u
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 18
Another Way of Writing ARCH Models
• For illustration, consider an ARCH(1). Instead of the above, we can
write
yt = 1 + 2x2t + ... + kxkt + ut , ut = vtt
, vt  N(0,1)
• The two are different ways of expressing exactly the same model. The
first form is easier to understand while the second form is required for
simulating from an ARCH model, for example.
  
t t
u
  
0 1 1
2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 19
Testing for “ARCH Effects”
1. First, run any postulated linear regression of the form given in the equation
above, e.g. yt = 1 + 2x2t + ... + kxkt + ut
saving the residuals, .
2. Then square the residuals, and regress them on q own lags to test for ARCH
of order q, i.e. run the regression
where vt is iid.
Obtain R2 from this regression
3. The test statistic is defined as TR2 (the number of observations multiplied
by the coefficient of multiple correlation) from the last regression, and is
distributed as a 2(q).
t
û
t
q
t
q
t
t
t v
u
u
u
u 




 


2
2
2
2
2
1
1
0
2
ˆ
...
ˆ
ˆ
ˆ 



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 20
Testing for “ARCH Effects” (cont’d)
4. The null and alternative hypotheses are
H0 : 1 = 0 and 2 = 0 and 3 = 0 and ... and q = 0
H1 : 1  0 or 2  0 or 3  0 or ... or q  0.
If the value of the test statistic is greater than the critical value from the
2 distribution, then reject the null hypothesis.
• Note that the ARCH test is also sometimes applied directly to returns
instead of the residuals from Stage 1 above.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 21
Problems with ARCH(q) Models
• How do we decide on q?
• The required value of q might be very large
• Non-negativity constraints might be violated.
– When we estimate an ARCH model, we require i >0  i=1,2,...,q
(since variance cannot be negative)
• A natural extension of an ARCH(q) model which gets around some of
these problems is a GARCH model.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 22
Generalised ARCH (GARCH) Models
• Due to Bollerslev (1986). Allow the conditional variance to be dependent
upon previous own lags
• The variance equation is now
(1)
• This is a GARCH(1,1) model, which is like an ARMA(1,1) model for the
variance equation.
• We could also write
• Substituting into (1) for t-1
2 :
t
2
= 0 + 1
2
1

t
u +t-1
2
t-1
2
= 0 + 1
2
2

t
u +t-2
2
t-2
2
= 0 + 1
2
3

t
u +t-3
2
t
2
= 0 + 1
2
1

t
u +(0 + 1
2
2

t
u +t-2
2
)
= 0 + 1
2
1

t
u +0 + 1 2
2

t
u +t-2
2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 23
Generalised ARCH (GARCH) Models (cont’d)
• Now substituting into (2) for t-2
2
• An infinite number of successive substitutions would yield
• So the GARCH(1,1) model can be written as an infinite order ARCH model.
• We can again extend the GARCH(1,1) model to a GARCH(p,q):
t
2
=0 + 1
2
1

t
u +0 + 1 2
2

t
u +2
(0 + 1
2
3

t
u +t-3
2
)
t
2
= 0 + 1
2
1

t
u +0 + 1 2
2

t
u +02
+ 12 2
3

t
u +3
t-3
2
t
2
= 0 (1++2
) + 1
2
1

t
u (1+L+2
L2
) + 3
t-3
2
t
2
= 0 (1++2
+...) + 1
2
1

t
u (1+L+2
L2
+...) + 
0
2
t
2
= 0+1
2
1

t
u +2
2
2

t
u +...+q
2
q
t
u  +1t-1
2
+2t-2
2
+...+pt-p
2
t
2
=  
 

 

q
i
p
j
j
t
j
i
t
iu
1 1
2
2
0 



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 24
Generalised ARCH (GARCH) Models (cont’d)
• But in general a GARCH(1,1) model will be sufficient to capture the
volatility clustering in the data.
• Why is GARCH Better than ARCH?
- more parsimonious - avoids overfitting
- less likely to breech non-negativity constraints
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 25
The Unconditional Variance under the GARCH
Specification
• The unconditional variance of ut is given by
when
• is termed “non-stationarity” in variance
• is termed intergrated GARCH
• For non-stationarity in variance, the conditional variance forecasts will
not converge on their unconditional value as the horizon increases.
Var(ut) =
)
(
1 1
0






 
1 < 1

 
1  1

 
1 = 1
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 26
Estimation of ARCH / GARCH Models
• Since the model is no longer of the usual linear form, we cannot use
OLS.
• We use another technique known as maximum likelihood.
• The method works by finding the most likely values of the parameters
given the actual data.
• More specifically, we form a log-likelihood function and maximise it.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 27
Estimation of ARCH / GARCH Models (cont’d)
• The steps involved in actually estimating an ARCH or GARCH model
are as follows
1. Specify the appropriate equations for the mean and the variance - e.g. an
AR(1)- GARCH(1,1) model:
2. Specify the log-likelihood function to maximise:
3. The computer will maximise the function and give parameter values and
their standard errors
yt =  + yt-1 + ut , ut  N(0,t
2
)
t
2
= 0 + 1
2
1

t
u +t-1
2

 








T
t
t
t
t
T
t
t y
y
T
L
1
2
2
1
1
2
/
)
(
2
1
)
log(
2
1
)
2
log(
2





‘Introductory Econometrics for Finance’ © Chris Brooks 2013 28
Parameter Estimation using Maximum Likelihood
• Consider the bivariate regression case with homoscedastic errors for
simplicity:
• Assuming that ut  N(0,2), then yt  N( , 2) so that the
probability density function for a normally distributed random variable
with this mean and variance is given by
(1)
• Successive values of yt would trace out the familiar bell-shaped curve.
• Assuming that ut are iid, then yt will also be iid.
t
t
t u
x
y 

 2
1 

t
x
2
1 
 





 



 2
2
2
1
2
2
1
)
(
2
1
exp
2
1
)
,
(







 t
t
t
t
x
y
x
y
f
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 29
Parameter Estimation using Maximum Likelihood
(cont’d)
• Then the joint pdf for all the y’s can be expressed as a product of the
individual density functions
(2)
• Substituting into equation (2) for every yt from equation (1),
(3)





 



 

T
t
t
t
T
T
t
T
x
y
x
y
y
y
f
1
2
2
2
1
2
2
1
2
1
)
(
2
1
exp
)
2
(
1
)
,
,...,
,
(

















T
t
t
t
T
t
T
X
y
f
X
y
f
X
y
f
X
y
f
X
y
y
y
f
1
2
2
1
2
4
2
1
2
2
2
1
2
2
1
2
1
1
2
2
1
2
1
)
,
(
)
,
(
)...
,
(
)
,
(
)
,
,...,
,
(















‘Introductory Econometrics for Finance’ © Chris Brooks 2013 30
Parameter Estimation using Maximum Likelihood
(cont’d)
• The typical situation we have is that the xt and yt are given and we want to
estimate 1, 2, 2. If this is the case, then f() is known as the likelihood
function, denoted LF(1, 2, 2), so we write
(4)
• Maximum likelihood estimation involves choosing parameter values (1,
2,2) that maximise this function.
• We want to differentiate (4) w.r.t. 1, 2,2, but (4) is a product containing
T terms.





 


 

T
t
t
t
T
T
x
y
LF
1
2
2
2
1
2
2
1
)
(
2
1
exp
)
2
(
1
)
,
,
(








‘Introductory Econometrics for Finance’ © Chris Brooks 2013 31
• Since , we can take logs of (4).
• Then, using the various laws for transforming functions containing
logarithms, we obtain the log-likelihood function, LLF:
• which is equivalent to
(5)
• Differentiating (5) w.r.t. 1, 2,2, we obtain
(6)
max ( ) maxlog( ( ))
x x
f x f x

Parameter Estimation using Maximum Likelihood
(cont’d)








T
t
t
t x
y
T
T
LLF
1
2
2
2
1 )
(
2
1
)
2
log(
2
log













T
t
t
t x
y
T
T
LLF
1
2
2
2
1
2 )
(
2
1
)
2
log(
2
log
2 









 2
2
1
1
1
.
2
).
(
2
1




 t
t x
y
LLF
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 32
(7)
(8)
• Setting (6)-(8) to zero to minimise the functions, and putting hats above
the parameters to denote the maximum likelihood estimators,
• From (6),
(9)
Parameter Estimation using Maximum Likelihood
(cont’d)





 4
2
2
1
2
2
)
(
2
1
1
2 




 t
t x
y
T
LLF





 2
2
1
2
.
2
).
(
2
1




 t
t
t x
x
y
LLF
 

 0
)
ˆ
ˆ
( 2
1 t
t x
y 

   

 0
ˆ
ˆ
2
1 t
t x
y 

  

 0
ˆ
ˆ
2
1 t
t x
T
y 

  

 0
1
ˆ
ˆ
1
2
1 t
t x
T
y
T


x
y 2
1
ˆ
ˆ 
 

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 33
• From (7),
(10)
• From (8),
Parameter Estimation using Maximum Likelihood
(cont’d)
 

 0
)
ˆ
ˆ
( 2
1 t
t
t x
x
y 

 
 

 0
ˆ
ˆ 2
2
1 t
t
t
t x
x
x
y 

 
 

 0
ˆ
ˆ 2
2
1 t
t
t
t x
x
x
y 

 
 

 t
t
t
t x
x
y
x
y
x )
ˆ
(
ˆ
2
2
2 


 

 2
2
2
2
ˆ
ˆ x
T
y
x
T
x
y
x t
t
t 


 

 y
x
T
x
y
x
T
x t
t
t )
(
ˆ 2
2
2

)
(
ˆ
2
2
2
x
T
x
y
x
T
x
y
t
t
t






 

 2
2
1
4
2
)
ˆ
ˆ
(
ˆ
1
ˆ t
t x
y
T




‘Introductory Econometrics for Finance’ © Chris Brooks 2013 34
• Rearranging,
(11)
• How do these formulae compare with the OLS estimators?
(9) & (10) are identical to OLS
(11) is different. The OLS estimator was
• Therefore the ML estimator of the variance of the disturbances is biased,
although it is consistent.
• But how does this help us in estimating heteroscedastic models?
 
2 2
1
 
T
ut
 
2 2
1



T k
ut
Parameter Estimation using Maximum Likelihood
(cont’d)
 

 2
2
1
2
)
ˆ
ˆ
(
1
ˆ t
t x
y
T



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 35
Estimation of GARCH Models Using
Maximum Likelihood
• Now we have yt =  + yt-1 + ut , ut  N(0, )
• Unfortunately, the LLF for a model with time-varying variances cannot be
maximised analytically, except in the simplest of cases. So a numerical
procedure is used to maximise the log-likelihood function. A potential
problem: local optima or multimodalities in the likelihood surface.
• The way we do the optimisation is:
1. Set up LLF.
2. Use regression to get initial guesses for the mean parameters.
3. Choose some initial guesses for the conditional variance parameters.
4. Specify a convergence criterion - either by criterion or by value.
t
2
t
2
= 0 + 1
2
1

t
u +t-1
2

 








T
t
t
t
t
T
t
t y
y
T
L
1
2
2
1
1
2
/
)
(
2
1
)
log(
2
1
)
2
log(
2





‘Introductory Econometrics for Finance’ © Chris Brooks 2013 36
Non-Normality and Maximum Likelihood
• Recall that the conditional normality assumption for ut is essential.
• We can test for normality using the following representation
ut = vtt vt  N(0,1)
• The sample counterpart is
• Are the normal? Typically are still leptokurtic, although less so than
the . Is this a problem? Not really, as we can use the ML with a robust
variance/covariance estimator. ML with robust standard errors is called Quasi-
Maximum Likelihood or QML.
    
t t t
u
  
 
0 1 1
2
2 1
2 v
u
t
t
t


t
t
t
u
v

ˆ
ˆ
ˆ 
t
v̂ t
v̂
t
û
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 37
Extensions to the Basic GARCH Model
• Since the GARCH model was developed, a huge number of extensions
and variants have been proposed. Three of the most important
examples are EGARCH, GJR, and GARCH-M models.
• Problems with GARCH(p,q) Models:
- Non-negativity constraints may still be violated
- GARCH models cannot account for leverage effects
• Possible solutions: the exponential GARCH (EGARCH) model or the
GJR model, which are asymmetric GARCH models.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 38
The EGARCH Model
• Suggested by Nelson (1991). The variance equation is given by
• Advantages of the model
- Since we model the log(t
2), then even if the parameters are negative, t
2
will be positive.
- We can account for the leverage effect: if the relationship between
volatility and returns is negative, , will be negative.



























2
)
log(
)
log(
2
1
1
2
1
1
2
1
2
t
t
t
t
t
t
u
u
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 39
The GJR Model
• Due to Glosten, Jaganathan and Runkle
where It-1 = 1 if ut-1 < 0
= 0 otherwise
• For a leverage effect, we would see  > 0.
• We require 1 +   0 and 1  0 for non-negativity.
t
2
= 0 + 1
2
1

t
u +t-1
2
+ut-1
2
It-1
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 40
An Example of the use of a GJR Model
• Using monthly S&P 500 returns, December 1979- June 1998
• Estimating a GJR model, we obtain the following results.
)
198
.
3
(
172
.
0

t
y
)
772
.
5
(
)
999
.
14
(
)
437
.
0
(
)
372
.
16
(
604
.
0
498
.
0
015
.
0
243
.
1 1
2
1
2
1
2
1
2



 


 t
t
t
t
t I
u
u 

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 41
News Impact Curves
The news impact curve plots the next period volatility (ht) that would arise from various
positive and negative values of ut-1, given an estimated model.
News Impact Curves for S&P 500 Returns using Coefficients from GARCH and GJR
Model Estimates:
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
-1 -0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Value of Lagged Shock
Value
of
Conditional
Variance GARCH
GJR
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 42
GARCH-in Mean
• We expect a risk to be compensated by a higher return. So why not let
the return of a security be partly determined by its risk?
• Engle, Lilien and Robins (1987) suggested the ARCH-M specification.
A GARCH-M model would be
•  can be interpreted as a sort of risk premium.
• It is possible to combine all or some of these models together to get
more complex “hybrid” models - e.g. an ARMA-EGARCH(1,1)-M
model.
yt =  + t-1+ ut , ut N(0,t
2
)
t
2
= 0 + 1
2
1

t
u +t-1
2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 43
What Use Are GARCH-type Models?
• GARCH can model the volatility clustering effect since the conditional
variance is autoregressive. Such models can be used to forecast volatility.
• We could show that
Var (yt  yt-1, yt-2, ...) = Var (ut  ut-1, ut-2, ...)
• So modelling t
2 will give us models and forecasts for yt as well.
• Variance forecasts are additive over time.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 44
Forecasting Variances using GARCH Models
• Producing conditional variance forecasts from GARCH models uses a
very similar approach to producing forecasts from ARMA models.
• It is again an exercise in iterating with the conditional expectations
operator.
• Consider the following GARCH(1,1) model:
, ut  N(0,t
2),
• What is needed is to generate forecasts of T+1
2 T, T+2
2 T, ...,
T+s
2 T where T denotes all information available up to and
including observation T.
• Adding one to each of the time subscripts of the above conditional
variance equation, and then two, and then three would yield the
following equations
T+1
2 = 0 + 1uT
2 +T
2 , T+2
2 = 0 + 1uT+1
2 +T+1
2 , T+3
2 = 0 + 1
uT+2+T+2
2
t
t u
y 
  2
1
2
1
1
0
2

 

 t
t
t u 



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 45
Forecasting Variances
using GARCH Models (Cont’d)
• Let be the one step ahead forecast for 2 made at time T. This is
easy to calculate since, at time T, the values of all the terms on the
RHS are known.
• would be obtained by taking the conditional expectation of the
first equation at the bottom of slide 36:
• Given, how is , the 2-step ahead forecast for 2 made at time T,
calculated? Taking the conditional expectation of the second equation
at the bottom of slide 36:
= 0 + 1E(  T) +
• where E(  T) is the expectation, made at time T, of , which is
the squared disturbance term.
2
,
1
f
T

2
,
1
f
T

2
,
1
f
T
 = 0 + 1
2
T
u +T
2
2
,
1
f
T

2
,
2
f
T

2
,
2
f
T

2
1

T
u
2
,
1
f
T

2
1

T
u 2
1

T
u
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 46
Forecasting Variances
using GARCH Models (Cont’d)
• We can write
E(uT+1
2  t) = T+1
2
• But T+1
2 is not known at time T, so it is replaced with the forecast for
it, , so that the 2-step ahead forecast is given by
= 0 + 1 +
= 0 + (1+)
• By similar arguments, the 3-step ahead forecast will be given by
= ET(0 + 1uT+2
2 + T+2
2)
= 0 + (1+)
= 0 + (1+)[ 0 + (1+) ]
= 0 + 0(1+) + (1+)2
• Any s-step ahead forecast (s  2) would be produced by
2
,
1
f
T
 2
,
2
f
T

2
,
1
f
T

2
,
1
f
T

2
,
2
f
T

2
,
1
f
T

2
,
3
f
T

2
,
2
f
T

2
,
1
f
T

2
,
1
f
T

2
,
1
1
1
1
1
1
1
0
2
, )
(
)
( f
T
s
s
i
i
f
T
s 





 






 
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 47
What Use Are Volatility Forecasts?
1. Option pricing
C = f(S, X, 2, T, rf)
2. Conditional betas
3. Dynamic hedge ratios
The Hedge Ratio - the size of the futures position to the size of the
underlying exposure, i.e. the number of futures contracts to buy or sell per
unit of the spot good.



i t
im t
m t
,
,
,
 2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 48
What Use Are Volatility Forecasts? (Cont’d)
• What is the optimal value of the hedge ratio?
• Assuming that the objective of hedging is to minimise the variance of the
hedged portfolio, the optimal hedge ratio will be given by
where h = hedge ratio
p = correlation coefficient between change in spot price (S) and
change in futures price (F)
S = standard deviation of S
F = standard deviation of F
• What if the standard deviations and correlation are changing over time?
Use
h p s
F



t
F
t
s
t
t p
h
,
,



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 49
Testing Non-linear Restrictions or
Testing Hypotheses about Non-linear Models
• Usual t- and F-tests are still valid in non-linear models, but they are
not flexible enough.
• There are three hypothesis testing procedures based on maximum
likelihood principles: Wald, Likelihood Ratio, Lagrange Multiplier.
• Consider a single parameter,  to be estimated, Denote the MLE as
and a restricted estimate as .
~

ˆ
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 50
Likelihood Ratio Tests
• Estimate under the null hypothesis and under the alternative.
• Then compare the maximised values of the LLF.
• So we estimate the unconstrained model and achieve a given maximised
value of the LLF, denoted Lu
• Then estimate the model imposing the constraint(s) and get a new value of
the LLF denoted Lr.
• Which will be bigger?
• Lr  Lu comparable to RRSS  URSS
• The LR test statistic is given by
LR = -2(Lr - Lu)  2(m)
where m = number of restrictions
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 51
Likelihood Ratio Tests (cont’d)
• Example: We estimate a GARCH model and obtain a maximised LLF of
66.85. We are interested in testing whether  = 0 in the following equation.
yt =  + yt-1 + ut , ut  N(0, )
= 0 + 1 + 
• We estimate the model imposing the restriction and observe the maximised
LLF falls to 64.54. Can we accept the restriction?
• LR = -2(64.54-66.85) = 4.62.
• The test follows a 2(1) = 3.84 at 5%, so reject the null.
• Denoting the maximised value of the LLF by unconstrained ML as L( )
and the constrained optimum as . Then we can illustrate the 3 testing
procedures in the following diagram:
t
2
t
2
ut1
2
L(
~
)

2
1

t

ˆ
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 52
Comparison of Testing Procedures under Maximum
Likelihood: Diagramatic Representation
 

L
A
 
ˆ
L
B
 

~
L

~
ˆ 
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 53
Hypothesis Testing under Maximum Likelihood
• The vertical distance forms the basis of the LR test.
• The Wald test is based on a comparison of the horizontal distance.
• The LM test compares the slopes of the curve at A and B.
• We know at the unrestricted MLE, L( ), the slope of the curve is zero.
• But is it “significantly steep” at ?
• This formulation of the test is usually easiest to estimate.
L(
~
)

ˆ
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 54
An Example of the Application of GARCH Models
- Day & Lewis (1992)
• Purpose
• To consider the out of sample forecasting performance of GARCH and
EGARCH Models for predicting stock index volatility.
• Implied volatility is the markets expectation of the “average” level of
volatility of an option:
• Which is better, GARCH or implied volatility?
• Data
• Weekly closing prices (Wednesday to Wednesday, and Friday to Friday)
for the S&P100 Index option and the underlying 11 March 83 - 31 Dec. 89
• Implied volatility is calculated using a non-linear iterative procedure.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 55
The Models
• The “Base” Models
For the conditional mean
(1)
And for the variance (2)
or (3)
where
RMt denotes the return on the market portfolio
RFt denotes the risk-free rate
ht denotes the conditional variance from the GARCH-type models while
t
2 denotes the implied variance from option prices.
t
t
Ft
Mt u
h
R
R 


 1
0 

1
1
2
1
1
0 
 

 t
t
t h
u
h 


)
2
(
)
ln(
)
ln(
2
/
1
1
1
1
1
1
1
1
0






























t
t
t
t
t
t
h
u
h
u
h
h
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 56
The Models (cont’d)
• Add in a lagged value of the implied volatility parameter to equations (2)
and (3).
(2) becomes
(4)
and (3) becomes
(5)
• We are interested in testing H0 :  = 0 in (4) or (5).
• Also, we want to test H0 : 1 = 0 and 1 = 0 in (4),
• and H0 : 1 = 0 and 1 = 0 and  = 0 and  = 0 in (5).
2
1
1
1
2
1
1
0 

 


 t
t
t
t h
u
h 



)
ln(
)
2
(
)
ln(
)
ln( 2
1
2
/
1
1
1
1
1
1
1
1
0 




 


















 t
t
t
t
t
t
t
h
u
h
u
h
h 







‘Introductory Econometrics for Finance’ © Chris Brooks 2013 57
The Models (cont’d)
• If this second set of restrictions holds, then (4) & (5) collapse to
(4’)
• and (3) becomes
(5’)
• We can test all of these restrictions using a likelihood ratio test.
2
1
0
2


 t
t
h 

)
ln(
)
ln( 2
1
0
2


 t
t
h 


‘Introductory Econometrics for Finance’ © Chris Brooks 2013 58
In-sample Likelihood Ratio Test Results:
GARCH Versus Implied Volatility
t
t
Ft
Mt u
h
R
R 


 1
0 
 (8.78)
1
1
2
1
1
0 
 

 t
t
t h
u
h 

 (8.79)
2
1
1
1
2
1
1
0 

 


 t
t
t
t h
u
h 


 (8.81)
2
1
0
2


 t
t
h 
 (8.81)
Equation for
Variance
specification
0 1 010-4
1 1  Log-L 2
(8.79) 0.0072
(0.005)
0.071
(0.01)
5.428
(1.65)
0.093
(0.84)
0.854
(8.17)
- 767.321 17.77
(8.81) 0.0015
(0.028)
0.043
(0.02)
2.065
(2.98)
0.266
(1.17)
-0.068
(-0.59)
0.318
(3.00)
776.204 -
(8.81) 0.0056
(0.001)
-0.184
(-0.001)
0.993
(1.50)
- - 0.581
(2.94)
764.394 23.62
Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in
each case. 2
denotes the value of the test statistic, which follows a 2
(1) in the case of (8.81) restricted
to (8.79), and a 2
(2) in the case of (8.81) restricted to (8.81). Source: Day and Lewis (1992).
Reprinted with the permission of Elsevier Science.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 59
In-sample Likelihood Ratio Test Results:
EGARCH Versus Implied Volatility
t
t
Ft
Mt u
h
R
R 


 1
0 
 (8.78)
)
2
(
)
ln(
)
ln(
2
/
1
1
1
1
1
1
1
1
0






























t
t
t
t
t
t
h
u
h
u
h
h (8.80)
)
ln(
)
2
(
)
ln(
)
ln( 2
1
2
/
1
1
1
1
1
1
1
1
0 




 


















 t
t
t
t
t
t
t
h
u
h
u
h
h 






 (8.82)
)
ln(
)
ln( 2
1
0
2


 t
t
h 

 (8.82)
Equation for
Variance
specification
0 1 010-4
1    Log-L 2
(c) -0.0026
(-0.03)
0.094
(0.25)
-3.62
(-2.90)
0.529
(3.26)
-0.273
(-4.13)
0.357
(3.17)
- 776.436 8.09
(e) 0.0035
(0.56)
-0.076
(-0.24)
-2.28
(-1.82)
0.373
(1.48)
-0.282
(-4.34)
0.210
(1.89)
0.351
(1.82)
780.480 -
(e) 0.0047
(0.71)
-0.139
(-0.43)
-2.76
(-2.30)
- - - 0.667
(4.01)
765.034 30.89
Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in
each case. 2
denotes the value of the test statistic, which follows a 2
(1) in the case of (8.82) restricted
to (8.80), and a 2
(2) in the case of (8.82) restricted to (8.82). Source: Day and Lewis (1992).
Reprinted with the permission of Elsevier Science.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 60
Conclusions for In-sample Model Comparisons &
Out-of-Sample Procedure
• IV has extra incremental power for modelling stock volatility beyond
GARCH.
• But the models do not represent a true test of the predictive ability of
IV.
• So the authors conduct an out of sample forecasting test.
• There are 729 data points. They use the first 410 to estimate the
models, and then make a 1-step ahead forecast of the following week’s
volatility.
• Then they roll the sample forward one observation at a time,
constructing a new one step ahead forecast at each step.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 61
Out-of-Sample Forecast Evaluation
• They evaluate the forecasts in two ways:
• The first is by regressing the realised volatility series on the forecasts plus
a constant:
(7)
where is the “actual” value of volatility, and is the value forecasted
for it during period t.
• Perfectly accurate forecasts imply b0 = 0 and b1 = 1.
• But what is the “true” value of volatility at time t ?
Day & Lewis use 2 measures
1. The square of the weekly return on the index, which they call SR.
2. The variance of the week’s daily returns multiplied by the number
of trading days in that week.
  
t ft t
b b
 
  
1
2
0 1
2
1
t1
2
ft
2
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 62
Out-of Sample Model Comparisons
1
2
1
0
2
1 
 

 t
ft
t b
b 

 (8.83)
Forecasting Model Proxy for ex
post volatility
b0 b1 R2
Historic SR 0.0004
(5.60)
0.129
(21.18)
0.094
Historic WV 0.0005
(2.90)
0.154
(7.58)
0.024
GARCH SR 0.0002
(1.02)
0.671
(2.10)
0.039
GARCH WV 0.0002
(1.07)
1.074
(3.34)
0.018
EGARCH SR 0.0000
(0.05)
1.075
(2.06)
0.022
EGARCH WV -0.0001
(-0.48)
1.529
(2.58)
0.008
Implied Volatility SR 0.0022
(2.22)
0.357
(1.82)
0.037
Implied Volatility WV 0.0005
(0.389)
0.718
(1.95)
0.026
Notes: Historic refers to the use of a simple historical average of the squared returns to forecast
volatility; t-ratios in parentheses; SR and WV refer to the square of the weekly return on the S&P 100,
and the variance of the week’s daily returns multiplied by the number of trading days in that week,
respectively. Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 63
Encompassing Test Results: Do the IV Forecasts
Encompass those of the GARCH Models?
1
2
4
2
3
2
2
2
1
0
2
1 
 




 t
Ht
Et
Gt
It
t b
b
b
b
b 




 (8.86)
Forecast comparison b0 b1 b2 b3 b4 R2
Implied vs. GARCH
-0.00010
(-0.09)
0.601
(1.03)
0.298
(0.42)
- - 0.027
Implied vs. GARCH
vs. Historical
0.00018
(1.15)
0.632
(1.02)
-0.243
(-0.28)
- 0.123
(7.01)
0.038
Implied vs. EGARCH -0.00001
(-0.07)
0.695
(1.62)
- 0.176
(0.27)
- 0.026
Implied vs. EGARCH
vs. Historical
0.00026
(1.37)
0.590
(1.45)
-0.374
(-0.57)
- 0.118
(7.74)
0.038
GARCH vs. EGARCH 0.00005
(0.37)
- 1.070
(2.78)
-0.001
(-0.00)
- 0.018
Notes: t-ratios in parentheses; the ex post measure used in this table is the variance of the week’s daily
returns multiplied by the number of trading days in that week. Source: Day and Lewis (1992).
Reprinted with the permission of Elsevier Science.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 64
Conclusions of Paper
• Within sample results suggest that IV contains extra information not
contained in the GARCH / EGARCH specifications.
• Out of sample results suggest that nothing can accurately predict
volatility!
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 65
Stochastic Volatility Models
• It is a common misconception that GARCH-type specifications are
stochastic volatility models
• However, as the name suggests, stochastic volatility models differ
from GARCH principally in that the conditional variance equation of a
GARCH specification is completely deterministic given all
information available up to that of the previous period
• There is no error term in the variance equation of a GARCH model,
only in the mean equation
• Stochastic volatility models contain a second error term, which enters
into the conditional variance equation.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 66
Autoregressive Volatility Models
• A simple example of a stochastic volatility model is the autoregressive
volatility specification
• This model is simple to understand and simple to estimate, because it
requires that we have an observable measure of volatility which is then
simply used as any other variable in an autoregressive model
• The standard Box-Jenkins-type procedures for estimating
autoregressive (or ARMA) models can then be applied to this series
• For example, if the quantity of interest is a daily volatility estimate, we
could use squared daily returns, which trivially involves taking a
column of observed returns and squaring each observation
• The model estimated for volatility, t
2, is then
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 67
A Stochastic Volatility Model Specification
• The term ‘stochastic volatility’ is usually associated with a different
formulation to the autoregressive volatility model, a possible example
of which would be
where ηt is another N(0,1) random variable that is independent of ut
• The volatility is latent rather than observed, and so is modelled
indirectly
• Stochastic volatility models are superior in theory compared with
GARCH-type models, but the former are much more complex to
estimate.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 68
Covariance Modelling: Motivation
• A limitation of univariate volatility models is that the fitted conditional
variance of each series is entirely independent of all others
• This is potentially an important limitation for two reasons:
– If there are ‘volatility spillovers’ between markets or assets, the univariate
model will be mis-specified
– It is often the case that the covariances between series are of interest too
– The calculation of hedge ratios, portfolio value at risk estimates, CAPM
betas, and so on, all require covariances as inputs
• Multivariate GARCH models can be used for estimation of:
– Conditional CAPM betas
– Dynamic hedge ratios
– Portfolio variances
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 69
Simple Covariance Models:
Historical and Implied
• In exactly the same fashion as for volatility, the historical covariance
or correlation between two series can be calculated from a set of
historical data
• Implied covariances can be calculated using options whose payoffs are
dependent on more than one underlying asset
• The relatively small number of such options that exist limits the
circumstances in which implied covariances can be calculated
• Examples include rainbow options, ‘crack spread’ options for different
grades of oil, and currency options.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 70
Implied Covariance Models
• To give an illustration for currency options, the implied variance of the
cross-currency returns is given by
where and are the implied variances of the x and y
returns respectively, and is the implied covariance between
x and y
• So if the implied covariance between USD/DEM and USD/JPY is of
interest, then the implied variances of the returns of USD/DEM and
USD/JPY and the returns of the cross-currency DEM/JPY are required.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 71
EWMA Covariance Models
• A EWMA specification gives more weight in the covariance to recent
observations than an estimate based on the simple average
• The EWMA model estimates for variances and covariances at time t in
the bivariate setup with two returns series x and y may be written as
hij,t = λhij,t−1 + (1 − λ)xt−1yt−1
where i  j for the covariances and i = j; x = y for the variances
• The fitted values for h also become the forecasts for subsequent
periods
• λ (0 < λ < 1) denotes the decay factor determining the relative weights
attached to recent versus less recent observations
• This parameter could be estimated but is often set arbitrarily (e.g.,
Riskmetrics use a decay factor of 0.97 for monthly data but 0.94 for
daily).
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 72
EWMA Covariance Models - Limitations
• This equation can be rewritten as an infinite order function of only the
returns by successively substituting out the covariances:
• The EWMA model is a restricted version of an integrated GARCH
(IGARCH) specification, and it does not guarantee the fitted variance-
covariance matrix to be positive definite
• EWMA models also cannot allow for the observed mean reversion in
the volatilities or covariances of asset returns that is particularly
prevalent at lower frequencies.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 73
Multivariate GARCH Models
• Multivariate GARCH models are used to estimate and to forecast
covariances and correlations.
• The basic formulation is similar to that of the GARCH model, but where the
covariances as well as the variances are permitted to be time-varying.
• There are 3 main classes of multivariate GARCH formulation that are widely
used: VECH, diagonal VECH and BEKK.
VECH and Diagonal VECH
• e.g. suppose that there are two variables used in the model. The conditional
covariance matrix is denoted Ht, and would be 2  2. Ht and VECH(Ht) are











t
t
t
t
h
h
h
H
VECH
12
22
11
)
(







t
t
t
t
t
h
h
h
h
H
22
21
12
11
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 74
VECH and Diagonal VECH
• In the case of the VECH, the conditional variances and covariances would
each depend upon lagged values of all of the variances and covariances
and on lags of the squares of both error terms and their cross products.
• In matrix form, it would be written
• Writing out all of the elements gives the 3 equations as
• Such a model would be hard to estimate. The diagonal VECH is much
simpler and is specified, in the 2 variable case, as follows:
1
12
2
1
2
1
1
1
0
12
1
22
2
2
1
2
1
0
22
1
11
2
2
1
1
1
0
11
















t
t
t
t
t
t
t
t
t
t
h
u
u
h
h
u
h
h
u
h









     
1
1
1 

 



 t
t
t
t H
VECH
B
VECH
A
C
H
VECH  
t
t
t H
N ,
0
~
1

 
1
12
33
1
22
32
1
11
31
2
1
33
2
2
32
2
1
31
31
12
1
12
23
1
22
22
1
11
21
2
1
23
2
2
22
2
1
21
21
22
1
12
13
1
22
12
1
11
11
2
1
13
2
2
12
2
1
11
11
11






























t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
t
h
b
h
b
h
b
u
u
a
u
a
u
a
c
h
h
b
h
b
h
b
u
u
a
u
a
u
a
c
h
h
b
h
b
h
b
u
u
a
u
a
u
a
c
h
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 75
BEKK and Model Estimation for M-GARCH
• Neither the VECH nor the diagonal VECH ensure a positive definite
variance-covariance matrix.
• An alternative approach is the BEKK model (Engle & Kroner, 1995).
• The BEKK Model uses a Quadratic form for the parameter matrices to
ensure a positive definite variance / covariance matrix Ht.
• In matrix form, the BEKK model is
• Model estimation for all classes of multivariate GARCH model is
again performed using maximum likelihood with the following LLF:
where N is the number of variables in the system (assumed 2 above), 
is a vector containing all of the parameters, and T is the number of obs.
B
B
A
H
A
W
W
H t
t
t
t 1
1
1 

 







   









T
t
t
t
t
t H
H
TN
1
1
'
log
2
1
2
log
2



‘Introductory Econometrics for Finance’ © Chris Brooks 2013 76
Correlation Models and the CCC
• The correlations between a pair of series at each point in time can be
constructed by dividing the conditional covariances by the product of
the conditional standard deviations from a VECH or BEKK model
• A subtly different approach would be to model the dynamics for the
correlations directly
• In the constant conditional correlation (CCC) model, the correlations
between the disturbances to be fixed through time
• Thus, although the conditional covariances are not fixed, they are tied
to the variances
• The conditional variances in the fixed correlation model are identical
to those of a set of univariate GARCH specifications (although they
are estimated jointly):
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 77
More on the CCC
• The off-diagonal elements of Ht, hij,t (i  j), are defined indirectly via
the correlations, denoted ρij:
• Is it empirically plausible to assume that the correlations are constant
through time?
• Several tests of this assumption have been developed, including a test
based on the information matrix due and a Lagrange Multiplier test
• There is evidence against constant correlations, particularly in the
context of stock returns.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 78
The Dynamic Conditional Correlation Model
• Several different formulations of the dynamic conditional correlation
(DCC) model are available, but a popular specification is due to Engle
(2002)
• The model is related to the CCC formulation but where the correlations
are allowed to vary over time.
• Define the variance-covariance matrix, Ht, as Ht = DtRtDt
• Dt is a diagonal matrix containing the conditional standard deviations
(i.e. the square roots of the conditional variances from univariate
GARCH model estimations on each of the N individual series) on the
leading diagonal
• Rt is the conditional correlation matrix
• Numerous parameterisations of Rt are possible, including an
exponential smoothing approach
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 79
The DCC Model – A Possible Specification
• A possible specification is of the MGARCH form:
where:
• S is the unconditional correlation matrix of the vector of standardised
residuals (from the first stage estimation), ut = Dt
−1ϵt
• ι is a vector of ones
• Qt is an N × N symmetric positive definite variance-covariance matrix
• ◦ denotes the Hadamard or element-by-element matrix multiplication
procedure
• This specification for the intercept term simplifies estimation and
reduces the number of parameters.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 80
The DCC Model – A Possible Specification
• Engle (2002) proposes a GARCH-esque formulation for dynamically
modelling Qt with the conditional correlation matrix, Rt then constructed
as
where diag(·) denotes a matrix comprising the main diagonal elements
of (·) and Q∗ is a matrix that takes the square roots of each element in Q
• This operation is effectively taking the covariances in Qt and dividing
them by the product of the appropriate standard deviations in Qt
∗ to
create a matrix of correlations.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 81
DCC Model Estimation
• The model may be estimated in a single stage using ML although this will
be difficult. So Engle advocates a two-stage procedure where each variable
in the system is first modelled separately as a univariate GARCH
• A joint log-likelihood function for this stage could be constructed, which
would simply be the sum (over N) of all of the log-likelihoods for the
individual GARCH models
• In the second stage, the conditional likelihood is maximised with respect
to any unknown parameters in the correlation matrix
• The log-likelihood function for the second stage estimation will be of the
form
• where θ1 and θ2 denote the parameters to be estimated in the 1st and 2nd
stages respectively.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 82
Asymmetric Multivariate GARCH
• Asymmetric models have become very popular in empirical applications,
where the conditional variances and / or covariances are permitted to react
differently to positive and negative innovations of the same magnitude
• In the multivariate context, this is usually achieved in the Glosten et al.
(1993) framework
• Kroner and Ng (1998), for example, suggest the following extension to the
BEKK formulation (with obvious related modifications for the VECH or
diagonal VECH models)
where zt−1 is an N-dimensional column vector with elements taking the
value −ϵt−1 if the corresponding element of ϵt−1 is negative and zero
otherwise.
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 83
An Example: Estimating a Time-Varying Hedge Ratio
for FTSE Stock Index Returns
(Brooks, Henry and Persand, 2002).
• Data comprises 3580 daily observations on the FTSE 100 stock index and
stock index futures contract spanning the period 1 January 1985 - 9 April 1999.
• Several competing models for determining the optimal hedge ratio are
constructed. Define the hedge ratio as .
– No hedge (=0)
– Naïve hedge (=1)
– Multivariate GARCH hedges:
• Symmetric BEKK
• Asymmetric BEKK
In both cases, estimating the OHR involves forming a 1-step ahead
forecast and computing
t
t
F
t
CF
t
h
h
OHR 





1
,
1
,
1
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 84
OHR Results
In Sample
Unhedged
 = 0
Naïve Hedge
 = 1
Symmetric Time
Varying
Hedge
t
F
t
FC
t
h
h
,
,


Asymmetric
Time Varying
Hedge
t
F
t
FC
t
h
h
,
,


Return 0.0389
{2.3713}
-0.0003
{-0.0351}
0.0061
{0.9562}
0.0060
{0.9580}
Variance 0.8286 0.1718 0.1240 0.1211
Out of Sample
Unhedged
 = 0
Naïve Hedge
 = 1
Symmetric Time
Varying
Hedge
t
F
t
FC
t
h
h
,
,


Asymmetric
Time Varying
Hedge
t
F
t
FC
t
h
h
,
,


Return 0.0819
{1.4958}
-0.0004
{0.0216}
0.0120
{0.7761}
0.0140
{0.9083}
Variance 1.4972 0.1696 0.1186 0.1188
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 85
Plot of the OHR from Multivariate GARCH
Conclusions
- OHR is time-varying and less
than 1
- M-GARCH OHR provides a
better hedge, both in-sample
and out-of-sample.
- No role in calculating OHR for
asymmetries
Symmetric BEKK
Asymmetric BEKK
Time Varying Hedge Ratios
500 1000 1500 2000 2500 3000
0.65
0.70
0.75
0.80
0.85
0.90
0.95
1.00

Mais conteúdo relacionado

Semelhante a Ch9_slides.ppt

Sensitivity analysis
Sensitivity analysisSensitivity analysis
Sensitivity analysis
sunilgv06
 
Jacobs Kiefer Bayes Guide 3 10 V1
Jacobs Kiefer Bayes Guide 3 10 V1Jacobs Kiefer Bayes Guide 3 10 V1
Jacobs Kiefer Bayes Guide 3 10 V1
Michael Jacobs, Jr.
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
South West Data Meetup
 
Informs presentation new ppt
Informs presentation new pptInforms presentation new ppt
Informs presentation new ppt
Salford Systems
 

Semelhante a Ch9_slides.ppt (20)

Ch5_slides Qwertr12234543234433444344.ppt
Ch5_slides Qwertr12234543234433444344.pptCh5_slides Qwertr12234543234433444344.ppt
Ch5_slides Qwertr12234543234433444344.ppt
 
Ch5_slides.ppt
Ch5_slides.pptCh5_slides.ppt
Ch5_slides.ppt
 
Time series
Time seriesTime series
Time series
 
Chapters 14 and 15 presentation
Chapters 14 and 15 presentationChapters 14 and 15 presentation
Chapters 14 and 15 presentation
 
Sensitivity analysis
Sensitivity analysisSensitivity analysis
Sensitivity analysis
 
205250 crystall ball
205250 crystall ball205250 crystall ball
205250 crystall ball
 
Ch2_slides.ppt
Ch2_slides.pptCh2_slides.ppt
Ch2_slides.ppt
 
Regression
Regression Regression
Regression
 
FA_Unit 4 (Time Series Analysis E-views).pdf
FA_Unit 4 (Time Series Analysis E-views).pdfFA_Unit 4 (Time Series Analysis E-views).pdf
FA_Unit 4 (Time Series Analysis E-views).pdf
 
M 3 iot
M 3 iotM 3 iot
M 3 iot
 
Introduction to Statistics and Probability:
Introduction to Statistics and Probability:Introduction to Statistics and Probability:
Introduction to Statistics and Probability:
 
20150404 rm - autocorrelation
20150404   rm - autocorrelation20150404   rm - autocorrelation
20150404 rm - autocorrelation
 
Ch8 slides
Ch8 slidesCh8 slides
Ch8 slides
 
Ch8_slides.ppt
Ch8_slides.pptCh8_slides.ppt
Ch8_slides.ppt
 
Econometrics ch12
Econometrics ch12Econometrics ch12
Econometrics ch12
 
Jacobs Kiefer Bayes Guide 3 10 V1
Jacobs Kiefer Bayes Guide 3 10 V1Jacobs Kiefer Bayes Guide 3 10 V1
Jacobs Kiefer Bayes Guide 3 10 V1
 
1.1.Introduction Econometrics.pptx
1.1.Introduction Econometrics.pptx1.1.Introduction Econometrics.pptx
1.1.Introduction Econometrics.pptx
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
 
Optimal design & Population mod pyn.pptx
Optimal design & Population mod pyn.pptxOptimal design & Population mod pyn.pptx
Optimal design & Population mod pyn.pptx
 
Informs presentation new ppt
Informs presentation new pptInforms presentation new ppt
Informs presentation new ppt
 

Último

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 

Ch9_slides.ppt

  • 1. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 1 Chapter 9 Modelling volatility and correlation
  • 2. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 2 An Excursion into Non-linearity Land • Motivation: the linear structural (and time series) models cannot explain a number of important features common to much financial data - leptokurtosis - volatility clustering or volatility pooling - leverage effects • Our “traditional” structural model could be something like: yt = 1 + 2x2t + ... + kxkt + ut, or more compactly y = X + u • We also assumed that ut  N(0,2).
  • 3. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 3 A Sample Financial Asset Returns Time Series Daily S&P 500 Returns for August 2003 – August 2013
  • 4. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 4 Non-linear Models: A Definition • Campbell, Lo and MacKinlay (1997) define a non-linear data generating process as one that can be written yt = f(ut, ut-1, ut-2, …) where ut is an iid error term and f is a non-linear function. • They also give a slightly more specific definition as yt = g(ut-1, ut-2, …)+ ut2(ut-1, ut-2, …) where g is a function of past error terms only and 2 is a variance term. • Models with nonlinear g(•) are “non-linear in mean”, while those with nonlinear 2(•) are “non-linear in variance”.
  • 5. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 5 Types of non-linear models • The linear paradigm is a useful one. • Many apparently non-linear relationships can be made linear by a suitable transformation. • On the other hand, it is likely that many relationships in finance are intrinsically non-linear. • There are many types of non-linear models, e.g. - ARCH / GARCH - switching models - bilinear models
  • 6. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 6 Testing for Non-linearity – The RESET Test • The “traditional” tools of time series analysis (acf’s, spectral analysis) may find no evidence that we could use a linear model, but the data may still not be independent. • Portmanteau tests for non-linear dependence have been developed. • The simplest is Ramsey’s RESET test, which took the form: • Here the dependent variable is the residual series and the independent variables are the squares, cubes, …, of the fitted values.    ...  u y y y v t t t p t p t            0 1 2 2 3 1
  • 7. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 7 Testing for Non-linearity – The BDS Test • Many other non-linearity tests are available - e.g., the BDS and bispectrum test • BDS is a pure hypothesis test. That is, it has as its null hypothesis that the data are pure noise (completely random) • It has been argued to have power to detect a variety of departures from randomness – linear or non-linear stochastic processes, deterministic chaos, etc) • The BDS test follows a standard normal distribution under the null • The test can also be used as a model diagnostic on the residuals to ‘see what is left’ • If the proposed model is adequate, the standardised residuals should be white noise.
  • 8. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 8 Chaos Theory • Chaos theory is a notion taken from the physical sciences • It suggests that there could be a deterministic, non-linear set of equations underlying the behaviour of financial series or markets • Such behaviour will appear completely random to the standard statistical tests • A positive sighting of chaos implies that while, by definition, long- term forecasting would be futile, short-term forecastability and controllability are possible, at least in theory, since there is some deterministic structure underlying the data • Varying definitions of what actually constitutes chaos can be found in the literature.
  • 9. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 9 Detecting Chaos • A system is chaotic if it exhibits sensitive dependence on initial conditions (SDIC) • So an infinitesimal change is made to the initial conditions (the initial state of the system), then the corresponding change iterated through the system for some arbitrary length of time will grow exponentially • The largest Lyapunov exponent is a test for chaos • It measures the rate at which information is lost from a system • A positive largest Lyapunov exponent implies sensitive dependence, and therefore that evidence of chaos has been obtained • Almost without exception, applications of chaos theory to financial markets have been unsuccessful • This is probably because financial and economic data are usually far noisier and ‘more random’ than data from other disciplines
  • 10. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 10 Neural Networks • Artificial neural networks (ANNs) are a class of models whose structure is broadly motivated by the way that the brain performs computation • ANNs have been widely employed in finance for tackling time series and classification problems • Applications have included forecasting financial asset returns, volatility, bankruptcy and takeover prediction • Neural networks have virtually no theoretical motivation in finance (they are often termed a ‘black box’) • They can fit any functional relationship in the data to an arbitrary degree of accuracy.
  • 11. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 11 Feedforward Neural Networks • The most common class of ANN models in finance are known as feedforward network models • These have a set of inputs (akin to regressors) linked to one or more outputs (akin to the regressand) via one or more ‘hidden’ or intermediate layers • The size and number of hidden layers can be modified to give a closer or less close fit to the data sample • A feedforward network with no hidden layers is simply a standard linear regression model • Neural network models work best where financial theory has virtually nothing to say about the likely functional form for the relationship between a set of variables.
  • 12. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 12 Neural Networks – Some Disadvantages • Neural networks are not very popular in finance and suffer from several problems: – The coefficient estimates from neural networks do not have any real theoretical interpretation – Virtually no diagnostic or specification tests are available for estimated models – They can provide excellent fits in-sample to a given set of ‘training’ data, but typically provide poor out-of-sample forecast accuracy – This usually arises from the tendency of neural networks to fit closely to sample-specific data features and ‘noise’, and so they cannot ‘generalise’ – The non-linear estimation of neural network models can be cumbersome and computationally time-intensive, particularly, for example, if the model must be estimated repeatedly when rolling through a sample.
  • 13. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 13 Models for Volatility • Modelling and forecasting stock market volatility has been the subject of vast empirical and theoretical investigation • There are a number of motivations for this line of inquiry: – Volatility is one of the most important concepts in finance – Volatility, as measured by the standard deviation or variance of returns, is often used as a crude measure of the total risk of financial assets – Many value-at-risk models for measuring market risk require the estimation or forecast of a volatility parameter – The volatility of stock market prices also enters directly into the Black– Scholes formula for deriving the prices of traded options • We will now examine several volatility models.
  • 14. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 14 Historical Volatility • The simplest model for volatility is the historical estimate • Historical volatility simply involves calculating the variance (or standard deviation) of returns in the usual way over some historical period • This then becomes the volatility forecast for all future periods • Evidence suggests that the use of volatility predicted from more sophisticated time series models will lead to more accurate forecasts and option valuations • Historical volatility is still useful as a benchmark for comparing the forecasting ability of more complex time models
  • 15. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 15 Heteroscedasticity Revisited • An example of a structural model is with ut  N(0, ). • The assumption that the variance of the errors is constant is known as homoscedasticity, i.e. Var (ut) = . • What if the variance of the errors is not constant? - heteroscedasticity - would imply that standard error estimates could be wrong. • Is the variance of the errors likely to be constant over time? Not for financial data. u 2 u 2 yt = 1 + 2x2t + 3x3t + 4x4t + ut
  • 16. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 16 Autoregressive Conditionally Heteroscedastic (ARCH) Models • So use a model which does not assume that the variance is constant. • Recall the definition of the variance of ut: = Var(ut ut-1, ut-2,...) = E[(ut-E(ut))2 ut-1, ut-2,...] We usually assume that E(ut) = 0 so = Var(ut  ut-1, ut-2,...) = E[ut 2 ut-1, ut-2,...]. What could the current value of the variance of the errors plausibly depend upon? – Previous squared error terms. • This leads to the autoregressive conditionally heteroscedastic model for the variance of the errors: = 0 + 1 • This is known as an ARCH(1) model • The ARCH model due to Engle (1982) has proved very useful in finance. t 2 t 2 t 2 ut1 2
  • 17. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 17 Autoregressive Conditionally Heteroscedastic (ARCH) Models (cont’d) • The full model would be yt = 1 + 2x2t + ... + kxkt + ut , ut  N(0, ) where = 0 + 1 • We can easily extend this to the general case where the error variance depends on q lags of squared errors: = 0 + 1 +2 +...+q • This is an ARCH(q) model. • Instead of calling the variance , in the literature it is usually called ht, so the model is yt = 1 + 2x2t + ... + kxkt + ut , ut  N(0,ht) where ht = 0 + 1 +2 +...+q t 2 t 2 t 2 ut1 2 ut q  2 ut q  2 t 2 2 1  t u 2 2  t u 2 1  t u 2 2  t u
  • 18. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 18 Another Way of Writing ARCH Models • For illustration, consider an ARCH(1). Instead of the above, we can write yt = 1 + 2x2t + ... + kxkt + ut , ut = vtt , vt  N(0,1) • The two are different ways of expressing exactly the same model. The first form is easier to understand while the second form is required for simulating from an ARCH model, for example.    t t u    0 1 1 2
  • 19. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 19 Testing for “ARCH Effects” 1. First, run any postulated linear regression of the form given in the equation above, e.g. yt = 1 + 2x2t + ... + kxkt + ut saving the residuals, . 2. Then square the residuals, and regress them on q own lags to test for ARCH of order q, i.e. run the regression where vt is iid. Obtain R2 from this regression 3. The test statistic is defined as TR2 (the number of observations multiplied by the coefficient of multiple correlation) from the last regression, and is distributed as a 2(q). t û t q t q t t t v u u u u          2 2 2 2 2 1 1 0 2 ˆ ... ˆ ˆ ˆ    
  • 20. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 20 Testing for “ARCH Effects” (cont’d) 4. The null and alternative hypotheses are H0 : 1 = 0 and 2 = 0 and 3 = 0 and ... and q = 0 H1 : 1  0 or 2  0 or 3  0 or ... or q  0. If the value of the test statistic is greater than the critical value from the 2 distribution, then reject the null hypothesis. • Note that the ARCH test is also sometimes applied directly to returns instead of the residuals from Stage 1 above.
  • 21. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 21 Problems with ARCH(q) Models • How do we decide on q? • The required value of q might be very large • Non-negativity constraints might be violated. – When we estimate an ARCH model, we require i >0  i=1,2,...,q (since variance cannot be negative) • A natural extension of an ARCH(q) model which gets around some of these problems is a GARCH model.
  • 22. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 22 Generalised ARCH (GARCH) Models • Due to Bollerslev (1986). Allow the conditional variance to be dependent upon previous own lags • The variance equation is now (1) • This is a GARCH(1,1) model, which is like an ARMA(1,1) model for the variance equation. • We could also write • Substituting into (1) for t-1 2 : t 2 = 0 + 1 2 1  t u +t-1 2 t-1 2 = 0 + 1 2 2  t u +t-2 2 t-2 2 = 0 + 1 2 3  t u +t-3 2 t 2 = 0 + 1 2 1  t u +(0 + 1 2 2  t u +t-2 2 ) = 0 + 1 2 1  t u +0 + 1 2 2  t u +t-2 2
  • 23. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 23 Generalised ARCH (GARCH) Models (cont’d) • Now substituting into (2) for t-2 2 • An infinite number of successive substitutions would yield • So the GARCH(1,1) model can be written as an infinite order ARCH model. • We can again extend the GARCH(1,1) model to a GARCH(p,q): t 2 =0 + 1 2 1  t u +0 + 1 2 2  t u +2 (0 + 1 2 3  t u +t-3 2 ) t 2 = 0 + 1 2 1  t u +0 + 1 2 2  t u +02 + 12 2 3  t u +3 t-3 2 t 2 = 0 (1++2 ) + 1 2 1  t u (1+L+2 L2 ) + 3 t-3 2 t 2 = 0 (1++2 +...) + 1 2 1  t u (1+L+2 L2 +...) +  0 2 t 2 = 0+1 2 1  t u +2 2 2  t u +...+q 2 q t u  +1t-1 2 +2t-2 2 +...+pt-p 2 t 2 =         q i p j j t j i t iu 1 1 2 2 0    
  • 24. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 24 Generalised ARCH (GARCH) Models (cont’d) • But in general a GARCH(1,1) model will be sufficient to capture the volatility clustering in the data. • Why is GARCH Better than ARCH? - more parsimonious - avoids overfitting - less likely to breech non-negativity constraints
  • 25. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 25 The Unconditional Variance under the GARCH Specification • The unconditional variance of ut is given by when • is termed “non-stationarity” in variance • is termed intergrated GARCH • For non-stationarity in variance, the conditional variance forecasts will not converge on their unconditional value as the horizon increases. Var(ut) = ) ( 1 1 0         1 < 1    1  1    1 = 1
  • 26. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 26 Estimation of ARCH / GARCH Models • Since the model is no longer of the usual linear form, we cannot use OLS. • We use another technique known as maximum likelihood. • The method works by finding the most likely values of the parameters given the actual data. • More specifically, we form a log-likelihood function and maximise it.
  • 27. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 27 Estimation of ARCH / GARCH Models (cont’d) • The steps involved in actually estimating an ARCH or GARCH model are as follows 1. Specify the appropriate equations for the mean and the variance - e.g. an AR(1)- GARCH(1,1) model: 2. Specify the log-likelihood function to maximise: 3. The computer will maximise the function and give parameter values and their standard errors yt =  + yt-1 + ut , ut  N(0,t 2 ) t 2 = 0 + 1 2 1  t u +t-1 2            T t t t t T t t y y T L 1 2 2 1 1 2 / ) ( 2 1 ) log( 2 1 ) 2 log( 2     
  • 28. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 28 Parameter Estimation using Maximum Likelihood • Consider the bivariate regression case with homoscedastic errors for simplicity: • Assuming that ut  N(0,2), then yt  N( , 2) so that the probability density function for a normally distributed random variable with this mean and variance is given by (1) • Successive values of yt would trace out the familiar bell-shaped curve. • Assuming that ut are iid, then yt will also be iid. t t t u x y    2 1   t x 2 1               2 2 2 1 2 2 1 ) ( 2 1 exp 2 1 ) , (         t t t t x y x y f
  • 29. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 29 Parameter Estimation using Maximum Likelihood (cont’d) • Then the joint pdf for all the y’s can be expressed as a product of the individual density functions (2) • Substituting into equation (2) for every yt from equation (1), (3)              T t t t T T t T x y x y y y f 1 2 2 2 1 2 2 1 2 1 ) ( 2 1 exp ) 2 ( 1 ) , ,..., , (                  T t t t T t T X y f X y f X y f X y f X y y y f 1 2 2 1 2 4 2 1 2 2 2 1 2 2 1 2 1 1 2 2 1 2 1 ) , ( ) , ( )... , ( ) , ( ) , ,..., , (               
  • 30. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 30 Parameter Estimation using Maximum Likelihood (cont’d) • The typical situation we have is that the xt and yt are given and we want to estimate 1, 2, 2. If this is the case, then f() is known as the likelihood function, denoted LF(1, 2, 2), so we write (4) • Maximum likelihood estimation involves choosing parameter values (1, 2,2) that maximise this function. • We want to differentiate (4) w.r.t. 1, 2,2, but (4) is a product containing T terms.             T t t t T T x y LF 1 2 2 2 1 2 2 1 ) ( 2 1 exp ) 2 ( 1 ) , , (        
  • 31. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 31 • Since , we can take logs of (4). • Then, using the various laws for transforming functions containing logarithms, we obtain the log-likelihood function, LLF: • which is equivalent to (5) • Differentiating (5) w.r.t. 1, 2,2, we obtain (6) max ( ) maxlog( ( )) x x f x f x  Parameter Estimation using Maximum Likelihood (cont’d)         T t t t x y T T LLF 1 2 2 2 1 ) ( 2 1 ) 2 log( 2 log              T t t t x y T T LLF 1 2 2 2 1 2 ) ( 2 1 ) 2 log( 2 log 2            2 2 1 1 1 . 2 ). ( 2 1      t t x y LLF
  • 32. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 32 (7) (8) • Setting (6)-(8) to zero to minimise the functions, and putting hats above the parameters to denote the maximum likelihood estimators, • From (6), (9) Parameter Estimation using Maximum Likelihood (cont’d)       4 2 2 1 2 2 ) ( 2 1 1 2       t t x y T LLF       2 2 1 2 . 2 ). ( 2 1      t t t x x y LLF     0 ) ˆ ˆ ( 2 1 t t x y         0 ˆ ˆ 2 1 t t x y        0 ˆ ˆ 2 1 t t x T y        0 1 ˆ ˆ 1 2 1 t t x T y T   x y 2 1 ˆ ˆ    
  • 33. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 33 • From (7), (10) • From (8), Parameter Estimation using Maximum Likelihood (cont’d)     0 ) ˆ ˆ ( 2 1 t t t x x y         0 ˆ ˆ 2 2 1 t t t t x x x y         0 ˆ ˆ 2 2 1 t t t t x x x y         t t t t x x y x y x ) ˆ ( ˆ 2 2 2        2 2 2 2 ˆ ˆ x T y x T x y x t t t        y x T x y x T x t t t ) ( ˆ 2 2 2  ) ( ˆ 2 2 2 x T x y x T x y t t t           2 2 1 4 2 ) ˆ ˆ ( ˆ 1 ˆ t t x y T    
  • 34. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 34 • Rearranging, (11) • How do these formulae compare with the OLS estimators? (9) & (10) are identical to OLS (11) is different. The OLS estimator was • Therefore the ML estimator of the variance of the disturbances is biased, although it is consistent. • But how does this help us in estimating heteroscedastic models?   2 2 1   T ut   2 2 1    T k ut Parameter Estimation using Maximum Likelihood (cont’d)     2 2 1 2 ) ˆ ˆ ( 1 ˆ t t x y T   
  • 35. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 35 Estimation of GARCH Models Using Maximum Likelihood • Now we have yt =  + yt-1 + ut , ut  N(0, ) • Unfortunately, the LLF for a model with time-varying variances cannot be maximised analytically, except in the simplest of cases. So a numerical procedure is used to maximise the log-likelihood function. A potential problem: local optima or multimodalities in the likelihood surface. • The way we do the optimisation is: 1. Set up LLF. 2. Use regression to get initial guesses for the mean parameters. 3. Choose some initial guesses for the conditional variance parameters. 4. Specify a convergence criterion - either by criterion or by value. t 2 t 2 = 0 + 1 2 1  t u +t-1 2            T t t t t T t t y y T L 1 2 2 1 1 2 / ) ( 2 1 ) log( 2 1 ) 2 log( 2     
  • 36. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 36 Non-Normality and Maximum Likelihood • Recall that the conditional normality assumption for ut is essential. • We can test for normality using the following representation ut = vtt vt  N(0,1) • The sample counterpart is • Are the normal? Typically are still leptokurtic, although less so than the . Is this a problem? Not really, as we can use the ML with a robust variance/covariance estimator. ML with robust standard errors is called Quasi- Maximum Likelihood or QML.      t t t u      0 1 1 2 2 1 2 v u t t t   t t t u v  ˆ ˆ ˆ  t v̂ t v̂ t û
  • 37. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 37 Extensions to the Basic GARCH Model • Since the GARCH model was developed, a huge number of extensions and variants have been proposed. Three of the most important examples are EGARCH, GJR, and GARCH-M models. • Problems with GARCH(p,q) Models: - Non-negativity constraints may still be violated - GARCH models cannot account for leverage effects • Possible solutions: the exponential GARCH (EGARCH) model or the GJR model, which are asymmetric GARCH models.
  • 38. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 38 The EGARCH Model • Suggested by Nelson (1991). The variance equation is given by • Advantages of the model - Since we model the log(t 2), then even if the parameters are negative, t 2 will be positive. - We can account for the leverage effect: if the relationship between volatility and returns is negative, , will be negative.                            2 ) log( ) log( 2 1 1 2 1 1 2 1 2 t t t t t t u u
  • 39. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 39 The GJR Model • Due to Glosten, Jaganathan and Runkle where It-1 = 1 if ut-1 < 0 = 0 otherwise • For a leverage effect, we would see  > 0. • We require 1 +   0 and 1  0 for non-negativity. t 2 = 0 + 1 2 1  t u +t-1 2 +ut-1 2 It-1
  • 40. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 40 An Example of the use of a GJR Model • Using monthly S&P 500 returns, December 1979- June 1998 • Estimating a GJR model, we obtain the following results. ) 198 . 3 ( 172 . 0  t y ) 772 . 5 ( ) 999 . 14 ( ) 437 . 0 ( ) 372 . 16 ( 604 . 0 498 . 0 015 . 0 243 . 1 1 2 1 2 1 2 1 2         t t t t t I u u  
  • 41. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 41 News Impact Curves The news impact curve plots the next period volatility (ht) that would arise from various positive and negative values of ut-1, given an estimated model. News Impact Curves for S&P 500 Returns using Coefficients from GARCH and GJR Model Estimates: 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 -1 -0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Value of Lagged Shock Value of Conditional Variance GARCH GJR
  • 42. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 42 GARCH-in Mean • We expect a risk to be compensated by a higher return. So why not let the return of a security be partly determined by its risk? • Engle, Lilien and Robins (1987) suggested the ARCH-M specification. A GARCH-M model would be •  can be interpreted as a sort of risk premium. • It is possible to combine all or some of these models together to get more complex “hybrid” models - e.g. an ARMA-EGARCH(1,1)-M model. yt =  + t-1+ ut , ut N(0,t 2 ) t 2 = 0 + 1 2 1  t u +t-1 2
  • 43. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 43 What Use Are GARCH-type Models? • GARCH can model the volatility clustering effect since the conditional variance is autoregressive. Such models can be used to forecast volatility. • We could show that Var (yt  yt-1, yt-2, ...) = Var (ut  ut-1, ut-2, ...) • So modelling t 2 will give us models and forecasts for yt as well. • Variance forecasts are additive over time.
  • 44. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 44 Forecasting Variances using GARCH Models • Producing conditional variance forecasts from GARCH models uses a very similar approach to producing forecasts from ARMA models. • It is again an exercise in iterating with the conditional expectations operator. • Consider the following GARCH(1,1) model: , ut  N(0,t 2), • What is needed is to generate forecasts of T+1 2 T, T+2 2 T, ..., T+s 2 T where T denotes all information available up to and including observation T. • Adding one to each of the time subscripts of the above conditional variance equation, and then two, and then three would yield the following equations T+1 2 = 0 + 1uT 2 +T 2 , T+2 2 = 0 + 1uT+1 2 +T+1 2 , T+3 2 = 0 + 1 uT+2+T+2 2 t t u y    2 1 2 1 1 0 2      t t t u    
  • 45. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 45 Forecasting Variances using GARCH Models (Cont’d) • Let be the one step ahead forecast for 2 made at time T. This is easy to calculate since, at time T, the values of all the terms on the RHS are known. • would be obtained by taking the conditional expectation of the first equation at the bottom of slide 36: • Given, how is , the 2-step ahead forecast for 2 made at time T, calculated? Taking the conditional expectation of the second equation at the bottom of slide 36: = 0 + 1E(  T) + • where E(  T) is the expectation, made at time T, of , which is the squared disturbance term. 2 , 1 f T  2 , 1 f T  2 , 1 f T  = 0 + 1 2 T u +T 2 2 , 1 f T  2 , 2 f T  2 , 2 f T  2 1  T u 2 , 1 f T  2 1  T u 2 1  T u
  • 46. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 46 Forecasting Variances using GARCH Models (Cont’d) • We can write E(uT+1 2  t) = T+1 2 • But T+1 2 is not known at time T, so it is replaced with the forecast for it, , so that the 2-step ahead forecast is given by = 0 + 1 + = 0 + (1+) • By similar arguments, the 3-step ahead forecast will be given by = ET(0 + 1uT+2 2 + T+2 2) = 0 + (1+) = 0 + (1+)[ 0 + (1+) ] = 0 + 0(1+) + (1+)2 • Any s-step ahead forecast (s  2) would be produced by 2 , 1 f T  2 , 2 f T  2 , 1 f T  2 , 1 f T  2 , 2 f T  2 , 1 f T  2 , 3 f T  2 , 2 f T  2 , 1 f T  2 , 1 f T  2 , 1 1 1 1 1 1 1 0 2 , ) ( ) ( f T s s i i f T s                
  • 47. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 47 What Use Are Volatility Forecasts? 1. Option pricing C = f(S, X, 2, T, rf) 2. Conditional betas 3. Dynamic hedge ratios The Hedge Ratio - the size of the futures position to the size of the underlying exposure, i.e. the number of futures contracts to buy or sell per unit of the spot good.    i t im t m t , , ,  2
  • 48. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 48 What Use Are Volatility Forecasts? (Cont’d) • What is the optimal value of the hedge ratio? • Assuming that the objective of hedging is to minimise the variance of the hedged portfolio, the optimal hedge ratio will be given by where h = hedge ratio p = correlation coefficient between change in spot price (S) and change in futures price (F) S = standard deviation of S F = standard deviation of F • What if the standard deviations and correlation are changing over time? Use h p s F    t F t s t t p h , ,   
  • 49. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 49 Testing Non-linear Restrictions or Testing Hypotheses about Non-linear Models • Usual t- and F-tests are still valid in non-linear models, but they are not flexible enough. • There are three hypothesis testing procedures based on maximum likelihood principles: Wald, Likelihood Ratio, Lagrange Multiplier. • Consider a single parameter,  to be estimated, Denote the MLE as and a restricted estimate as . ~  ˆ
  • 50. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 50 Likelihood Ratio Tests • Estimate under the null hypothesis and under the alternative. • Then compare the maximised values of the LLF. • So we estimate the unconstrained model and achieve a given maximised value of the LLF, denoted Lu • Then estimate the model imposing the constraint(s) and get a new value of the LLF denoted Lr. • Which will be bigger? • Lr  Lu comparable to RRSS  URSS • The LR test statistic is given by LR = -2(Lr - Lu)  2(m) where m = number of restrictions
  • 51. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 51 Likelihood Ratio Tests (cont’d) • Example: We estimate a GARCH model and obtain a maximised LLF of 66.85. We are interested in testing whether  = 0 in the following equation. yt =  + yt-1 + ut , ut  N(0, ) = 0 + 1 +  • We estimate the model imposing the restriction and observe the maximised LLF falls to 64.54. Can we accept the restriction? • LR = -2(64.54-66.85) = 4.62. • The test follows a 2(1) = 3.84 at 5%, so reject the null. • Denoting the maximised value of the LLF by unconstrained ML as L( ) and the constrained optimum as . Then we can illustrate the 3 testing procedures in the following diagram: t 2 t 2 ut1 2 L( ~ )  2 1  t  ˆ
  • 52. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 52 Comparison of Testing Procedures under Maximum Likelihood: Diagramatic Representation    L A   ˆ L B    ~ L  ~ ˆ 
  • 53. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 53 Hypothesis Testing under Maximum Likelihood • The vertical distance forms the basis of the LR test. • The Wald test is based on a comparison of the horizontal distance. • The LM test compares the slopes of the curve at A and B. • We know at the unrestricted MLE, L( ), the slope of the curve is zero. • But is it “significantly steep” at ? • This formulation of the test is usually easiest to estimate. L( ~ )  ˆ
  • 54. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 54 An Example of the Application of GARCH Models - Day & Lewis (1992) • Purpose • To consider the out of sample forecasting performance of GARCH and EGARCH Models for predicting stock index volatility. • Implied volatility is the markets expectation of the “average” level of volatility of an option: • Which is better, GARCH or implied volatility? • Data • Weekly closing prices (Wednesday to Wednesday, and Friday to Friday) for the S&P100 Index option and the underlying 11 March 83 - 31 Dec. 89 • Implied volatility is calculated using a non-linear iterative procedure.
  • 55. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 55 The Models • The “Base” Models For the conditional mean (1) And for the variance (2) or (3) where RMt denotes the return on the market portfolio RFt denotes the risk-free rate ht denotes the conditional variance from the GARCH-type models while t 2 denotes the implied variance from option prices. t t Ft Mt u h R R     1 0   1 1 2 1 1 0      t t t h u h    ) 2 ( ) ln( ) ln( 2 / 1 1 1 1 1 1 1 1 0                               t t t t t t h u h u h h
  • 56. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 56 The Models (cont’d) • Add in a lagged value of the implied volatility parameter to equations (2) and (3). (2) becomes (4) and (3) becomes (5) • We are interested in testing H0 :  = 0 in (4) or (5). • Also, we want to test H0 : 1 = 0 and 1 = 0 in (4), • and H0 : 1 = 0 and 1 = 0 and  = 0 and  = 0 in (5). 2 1 1 1 2 1 1 0        t t t t h u h     ) ln( ) 2 ( ) ln( ) ln( 2 1 2 / 1 1 1 1 1 1 1 1 0                           t t t t t t t h u h u h h        
  • 57. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 57 The Models (cont’d) • If this second set of restrictions holds, then (4) & (5) collapse to (4’) • and (3) becomes (5’) • We can test all of these restrictions using a likelihood ratio test. 2 1 0 2    t t h   ) ln( ) ln( 2 1 0 2    t t h   
  • 58. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 58 In-sample Likelihood Ratio Test Results: GARCH Versus Implied Volatility t t Ft Mt u h R R     1 0   (8.78) 1 1 2 1 1 0      t t t h u h    (8.79) 2 1 1 1 2 1 1 0        t t t t h u h     (8.81) 2 1 0 2    t t h   (8.81) Equation for Variance specification 0 1 010-4 1 1  Log-L 2 (8.79) 0.0072 (0.005) 0.071 (0.01) 5.428 (1.65) 0.093 (0.84) 0.854 (8.17) - 767.321 17.77 (8.81) 0.0015 (0.028) 0.043 (0.02) 2.065 (2.98) 0.266 (1.17) -0.068 (-0.59) 0.318 (3.00) 776.204 - (8.81) 0.0056 (0.001) -0.184 (-0.001) 0.993 (1.50) - - 0.581 (2.94) 764.394 23.62 Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in each case. 2 denotes the value of the test statistic, which follows a 2 (1) in the case of (8.81) restricted to (8.79), and a 2 (2) in the case of (8.81) restricted to (8.81). Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
  • 59. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 59 In-sample Likelihood Ratio Test Results: EGARCH Versus Implied Volatility t t Ft Mt u h R R     1 0   (8.78) ) 2 ( ) ln( ) ln( 2 / 1 1 1 1 1 1 1 1 0                               t t t t t t h u h u h h (8.80) ) ln( ) 2 ( ) ln( ) ln( 2 1 2 / 1 1 1 1 1 1 1 1 0                           t t t t t t t h u h u h h         (8.82) ) ln( ) ln( 2 1 0 2    t t h    (8.82) Equation for Variance specification 0 1 010-4 1    Log-L 2 (c) -0.0026 (-0.03) 0.094 (0.25) -3.62 (-2.90) 0.529 (3.26) -0.273 (-4.13) 0.357 (3.17) - 776.436 8.09 (e) 0.0035 (0.56) -0.076 (-0.24) -2.28 (-1.82) 0.373 (1.48) -0.282 (-4.34) 0.210 (1.89) 0.351 (1.82) 780.480 - (e) 0.0047 (0.71) -0.139 (-0.43) -2.76 (-2.30) - - - 0.667 (4.01) 765.034 30.89 Notes: t-ratios in parentheses, Log-L denotes the maximised value of the log-likelihood function in each case. 2 denotes the value of the test statistic, which follows a 2 (1) in the case of (8.82) restricted to (8.80), and a 2 (2) in the case of (8.82) restricted to (8.82). Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
  • 60. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 60 Conclusions for In-sample Model Comparisons & Out-of-Sample Procedure • IV has extra incremental power for modelling stock volatility beyond GARCH. • But the models do not represent a true test of the predictive ability of IV. • So the authors conduct an out of sample forecasting test. • There are 729 data points. They use the first 410 to estimate the models, and then make a 1-step ahead forecast of the following week’s volatility. • Then they roll the sample forward one observation at a time, constructing a new one step ahead forecast at each step.
  • 61. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 61 Out-of-Sample Forecast Evaluation • They evaluate the forecasts in two ways: • The first is by regressing the realised volatility series on the forecasts plus a constant: (7) where is the “actual” value of volatility, and is the value forecasted for it during period t. • Perfectly accurate forecasts imply b0 = 0 and b1 = 1. • But what is the “true” value of volatility at time t ? Day & Lewis use 2 measures 1. The square of the weekly return on the index, which they call SR. 2. The variance of the week’s daily returns multiplied by the number of trading days in that week.    t ft t b b      1 2 0 1 2 1 t1 2 ft 2
  • 62. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 62 Out-of Sample Model Comparisons 1 2 1 0 2 1      t ft t b b    (8.83) Forecasting Model Proxy for ex post volatility b0 b1 R2 Historic SR 0.0004 (5.60) 0.129 (21.18) 0.094 Historic WV 0.0005 (2.90) 0.154 (7.58) 0.024 GARCH SR 0.0002 (1.02) 0.671 (2.10) 0.039 GARCH WV 0.0002 (1.07) 1.074 (3.34) 0.018 EGARCH SR 0.0000 (0.05) 1.075 (2.06) 0.022 EGARCH WV -0.0001 (-0.48) 1.529 (2.58) 0.008 Implied Volatility SR 0.0022 (2.22) 0.357 (1.82) 0.037 Implied Volatility WV 0.0005 (0.389) 0.718 (1.95) 0.026 Notes: Historic refers to the use of a simple historical average of the squared returns to forecast volatility; t-ratios in parentheses; SR and WV refer to the square of the weekly return on the S&P 100, and the variance of the week’s daily returns multiplied by the number of trading days in that week, respectively. Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
  • 63. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 63 Encompassing Test Results: Do the IV Forecasts Encompass those of the GARCH Models? 1 2 4 2 3 2 2 2 1 0 2 1         t Ht Et Gt It t b b b b b       (8.86) Forecast comparison b0 b1 b2 b3 b4 R2 Implied vs. GARCH -0.00010 (-0.09) 0.601 (1.03) 0.298 (0.42) - - 0.027 Implied vs. GARCH vs. Historical 0.00018 (1.15) 0.632 (1.02) -0.243 (-0.28) - 0.123 (7.01) 0.038 Implied vs. EGARCH -0.00001 (-0.07) 0.695 (1.62) - 0.176 (0.27) - 0.026 Implied vs. EGARCH vs. Historical 0.00026 (1.37) 0.590 (1.45) -0.374 (-0.57) - 0.118 (7.74) 0.038 GARCH vs. EGARCH 0.00005 (0.37) - 1.070 (2.78) -0.001 (-0.00) - 0.018 Notes: t-ratios in parentheses; the ex post measure used in this table is the variance of the week’s daily returns multiplied by the number of trading days in that week. Source: Day and Lewis (1992). Reprinted with the permission of Elsevier Science.
  • 64. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 64 Conclusions of Paper • Within sample results suggest that IV contains extra information not contained in the GARCH / EGARCH specifications. • Out of sample results suggest that nothing can accurately predict volatility!
  • 65. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 65 Stochastic Volatility Models • It is a common misconception that GARCH-type specifications are stochastic volatility models • However, as the name suggests, stochastic volatility models differ from GARCH principally in that the conditional variance equation of a GARCH specification is completely deterministic given all information available up to that of the previous period • There is no error term in the variance equation of a GARCH model, only in the mean equation • Stochastic volatility models contain a second error term, which enters into the conditional variance equation.
  • 66. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 66 Autoregressive Volatility Models • A simple example of a stochastic volatility model is the autoregressive volatility specification • This model is simple to understand and simple to estimate, because it requires that we have an observable measure of volatility which is then simply used as any other variable in an autoregressive model • The standard Box-Jenkins-type procedures for estimating autoregressive (or ARMA) models can then be applied to this series • For example, if the quantity of interest is a daily volatility estimate, we could use squared daily returns, which trivially involves taking a column of observed returns and squaring each observation • The model estimated for volatility, t 2, is then
  • 67. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 67 A Stochastic Volatility Model Specification • The term ‘stochastic volatility’ is usually associated with a different formulation to the autoregressive volatility model, a possible example of which would be where ηt is another N(0,1) random variable that is independent of ut • The volatility is latent rather than observed, and so is modelled indirectly • Stochastic volatility models are superior in theory compared with GARCH-type models, but the former are much more complex to estimate.
  • 68. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 68 Covariance Modelling: Motivation • A limitation of univariate volatility models is that the fitted conditional variance of each series is entirely independent of all others • This is potentially an important limitation for two reasons: – If there are ‘volatility spillovers’ between markets or assets, the univariate model will be mis-specified – It is often the case that the covariances between series are of interest too – The calculation of hedge ratios, portfolio value at risk estimates, CAPM betas, and so on, all require covariances as inputs • Multivariate GARCH models can be used for estimation of: – Conditional CAPM betas – Dynamic hedge ratios – Portfolio variances
  • 69. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 69 Simple Covariance Models: Historical and Implied • In exactly the same fashion as for volatility, the historical covariance or correlation between two series can be calculated from a set of historical data • Implied covariances can be calculated using options whose payoffs are dependent on more than one underlying asset • The relatively small number of such options that exist limits the circumstances in which implied covariances can be calculated • Examples include rainbow options, ‘crack spread’ options for different grades of oil, and currency options.
  • 70. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 70 Implied Covariance Models • To give an illustration for currency options, the implied variance of the cross-currency returns is given by where and are the implied variances of the x and y returns respectively, and is the implied covariance between x and y • So if the implied covariance between USD/DEM and USD/JPY is of interest, then the implied variances of the returns of USD/DEM and USD/JPY and the returns of the cross-currency DEM/JPY are required.
  • 71. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 71 EWMA Covariance Models • A EWMA specification gives more weight in the covariance to recent observations than an estimate based on the simple average • The EWMA model estimates for variances and covariances at time t in the bivariate setup with two returns series x and y may be written as hij,t = λhij,t−1 + (1 − λ)xt−1yt−1 where i  j for the covariances and i = j; x = y for the variances • The fitted values for h also become the forecasts for subsequent periods • λ (0 < λ < 1) denotes the decay factor determining the relative weights attached to recent versus less recent observations • This parameter could be estimated but is often set arbitrarily (e.g., Riskmetrics use a decay factor of 0.97 for monthly data but 0.94 for daily).
  • 72. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 72 EWMA Covariance Models - Limitations • This equation can be rewritten as an infinite order function of only the returns by successively substituting out the covariances: • The EWMA model is a restricted version of an integrated GARCH (IGARCH) specification, and it does not guarantee the fitted variance- covariance matrix to be positive definite • EWMA models also cannot allow for the observed mean reversion in the volatilities or covariances of asset returns that is particularly prevalent at lower frequencies.
  • 73. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 73 Multivariate GARCH Models • Multivariate GARCH models are used to estimate and to forecast covariances and correlations. • The basic formulation is similar to that of the GARCH model, but where the covariances as well as the variances are permitted to be time-varying. • There are 3 main classes of multivariate GARCH formulation that are widely used: VECH, diagonal VECH and BEKK. VECH and Diagonal VECH • e.g. suppose that there are two variables used in the model. The conditional covariance matrix is denoted Ht, and would be 2  2. Ht and VECH(Ht) are            t t t t h h h H VECH 12 22 11 ) (        t t t t t h h h h H 22 21 12 11
  • 74. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 74 VECH and Diagonal VECH • In the case of the VECH, the conditional variances and covariances would each depend upon lagged values of all of the variances and covariances and on lags of the squares of both error terms and their cross products. • In matrix form, it would be written • Writing out all of the elements gives the 3 equations as • Such a model would be hard to estimate. The diagonal VECH is much simpler and is specified, in the 2 variable case, as follows: 1 12 2 1 2 1 1 1 0 12 1 22 2 2 1 2 1 0 22 1 11 2 2 1 1 1 0 11                 t t t t t t t t t t h u u h h u h h u h                1 1 1         t t t t H VECH B VECH A C H VECH   t t t H N , 0 ~ 1    1 12 33 1 22 32 1 11 31 2 1 33 2 2 32 2 1 31 31 12 1 12 23 1 22 22 1 11 21 2 1 23 2 2 22 2 1 21 21 22 1 12 13 1 22 12 1 11 11 2 1 13 2 2 12 2 1 11 11 11                               t t t t t t t t t t t t t t t t t t t t t t t t h b h b h b u u a u a u a c h h b h b h b u u a u a u a c h h b h b h b u u a u a u a c h
  • 75. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 75 BEKK and Model Estimation for M-GARCH • Neither the VECH nor the diagonal VECH ensure a positive definite variance-covariance matrix. • An alternative approach is the BEKK model (Engle & Kroner, 1995). • The BEKK Model uses a Quadratic form for the parameter matrices to ensure a positive definite variance / covariance matrix Ht. • In matrix form, the BEKK model is • Model estimation for all classes of multivariate GARCH model is again performed using maximum likelihood with the following LLF: where N is the number of variables in the system (assumed 2 above),  is a vector containing all of the parameters, and T is the number of obs. B B A H A W W H t t t t 1 1 1                         T t t t t t H H TN 1 1 ' log 2 1 2 log 2   
  • 76. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 76 Correlation Models and the CCC • The correlations between a pair of series at each point in time can be constructed by dividing the conditional covariances by the product of the conditional standard deviations from a VECH or BEKK model • A subtly different approach would be to model the dynamics for the correlations directly • In the constant conditional correlation (CCC) model, the correlations between the disturbances to be fixed through time • Thus, although the conditional covariances are not fixed, they are tied to the variances • The conditional variances in the fixed correlation model are identical to those of a set of univariate GARCH specifications (although they are estimated jointly):
  • 77. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 77 More on the CCC • The off-diagonal elements of Ht, hij,t (i  j), are defined indirectly via the correlations, denoted ρij: • Is it empirically plausible to assume that the correlations are constant through time? • Several tests of this assumption have been developed, including a test based on the information matrix due and a Lagrange Multiplier test • There is evidence against constant correlations, particularly in the context of stock returns.
  • 78. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 78 The Dynamic Conditional Correlation Model • Several different formulations of the dynamic conditional correlation (DCC) model are available, but a popular specification is due to Engle (2002) • The model is related to the CCC formulation but where the correlations are allowed to vary over time. • Define the variance-covariance matrix, Ht, as Ht = DtRtDt • Dt is a diagonal matrix containing the conditional standard deviations (i.e. the square roots of the conditional variances from univariate GARCH model estimations on each of the N individual series) on the leading diagonal • Rt is the conditional correlation matrix • Numerous parameterisations of Rt are possible, including an exponential smoothing approach
  • 79. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 79 The DCC Model – A Possible Specification • A possible specification is of the MGARCH form: where: • S is the unconditional correlation matrix of the vector of standardised residuals (from the first stage estimation), ut = Dt −1ϵt • ι is a vector of ones • Qt is an N × N symmetric positive definite variance-covariance matrix • ◦ denotes the Hadamard or element-by-element matrix multiplication procedure • This specification for the intercept term simplifies estimation and reduces the number of parameters.
  • 80. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 80 The DCC Model – A Possible Specification • Engle (2002) proposes a GARCH-esque formulation for dynamically modelling Qt with the conditional correlation matrix, Rt then constructed as where diag(·) denotes a matrix comprising the main diagonal elements of (·) and Q∗ is a matrix that takes the square roots of each element in Q • This operation is effectively taking the covariances in Qt and dividing them by the product of the appropriate standard deviations in Qt ∗ to create a matrix of correlations.
  • 81. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 81 DCC Model Estimation • The model may be estimated in a single stage using ML although this will be difficult. So Engle advocates a two-stage procedure where each variable in the system is first modelled separately as a univariate GARCH • A joint log-likelihood function for this stage could be constructed, which would simply be the sum (over N) of all of the log-likelihoods for the individual GARCH models • In the second stage, the conditional likelihood is maximised with respect to any unknown parameters in the correlation matrix • The log-likelihood function for the second stage estimation will be of the form • where θ1 and θ2 denote the parameters to be estimated in the 1st and 2nd stages respectively.
  • 82. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 82 Asymmetric Multivariate GARCH • Asymmetric models have become very popular in empirical applications, where the conditional variances and / or covariances are permitted to react differently to positive and negative innovations of the same magnitude • In the multivariate context, this is usually achieved in the Glosten et al. (1993) framework • Kroner and Ng (1998), for example, suggest the following extension to the BEKK formulation (with obvious related modifications for the VECH or diagonal VECH models) where zt−1 is an N-dimensional column vector with elements taking the value −ϵt−1 if the corresponding element of ϵt−1 is negative and zero otherwise.
  • 83. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 83 An Example: Estimating a Time-Varying Hedge Ratio for FTSE Stock Index Returns (Brooks, Henry and Persand, 2002). • Data comprises 3580 daily observations on the FTSE 100 stock index and stock index futures contract spanning the period 1 January 1985 - 9 April 1999. • Several competing models for determining the optimal hedge ratio are constructed. Define the hedge ratio as . – No hedge (=0) – Naïve hedge (=1) – Multivariate GARCH hedges: • Symmetric BEKK • Asymmetric BEKK In both cases, estimating the OHR involves forming a 1-step ahead forecast and computing t t F t CF t h h OHR       1 , 1 , 1
  • 84. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 84 OHR Results In Sample Unhedged  = 0 Naïve Hedge  = 1 Symmetric Time Varying Hedge t F t FC t h h , ,   Asymmetric Time Varying Hedge t F t FC t h h , ,   Return 0.0389 {2.3713} -0.0003 {-0.0351} 0.0061 {0.9562} 0.0060 {0.9580} Variance 0.8286 0.1718 0.1240 0.1211 Out of Sample Unhedged  = 0 Naïve Hedge  = 1 Symmetric Time Varying Hedge t F t FC t h h , ,   Asymmetric Time Varying Hedge t F t FC t h h , ,   Return 0.0819 {1.4958} -0.0004 {0.0216} 0.0120 {0.7761} 0.0140 {0.9083} Variance 1.4972 0.1696 0.1186 0.1188
  • 85. ‘Introductory Econometrics for Finance’ © Chris Brooks 2013 85 Plot of the OHR from Multivariate GARCH Conclusions - OHR is time-varying and less than 1 - M-GARCH OHR provides a better hedge, both in-sample and out-of-sample. - No role in calculating OHR for asymmetries Symmetric BEKK Asymmetric BEKK Time Varying Hedge Ratios 500 1000 1500 2000 2500 3000 0.65 0.70 0.75 0.80 0.85 0.90 0.95 1.00