Beruflich Dokumente
Kultur Dokumente
Theologos Pantelidis National University of Ireland, Maynooth Preliminary Version Nikitas Pittis University of Piraeus
Abstract This paper aims at explaining the poor forecasting performance of the GARCH(1,1) model reported in many empirical studies. A number of previous applied studies compare the conditional volatility forecasts of the GARCH(1,1) model to those of a model that assumes constant conditional variance (homoscedasticity) and conclude that the GARCH(1,1) model does not outperform the homoscedastic one. In these studies, the comparison of the relative forecasting accuracy of the two models is usually based on a selected formal statistical test that compares the Mean Squared Errors (MSEs) of the volatility forecasts of the two models. Some recent theoretical studies argue that the poor forecasting performance of the GARCH(1,1) model found in empirical studies is deceptive, since the utilization of the squared shocks as a proxy for the unobserved true conditional volatility increases the MSE of the volatility forecasts (MSE-ination), rendering the MSE criterion invalid for proper evaluation of the forecasting accuracy of GARCH(1,1). However, the analytical results presented in this paper show that the utilization of the squared shocks as a proxy for the unobserved true conditional volatility "inates" equally the MSEs of the forecasts of the GARCH(1,1) and the homoscedastic models and thus the MSE criterion remains capable to evaluate properly the relative forecasting performance of the two models. We also provide useful suggestions for proper statistical evaluation of the volatility forecasts of GARCH(1,1), together with an empirical illustration based on ve bilateral exchange rates for the US dollar. Keywords : GARCH, forecasting accuracy, volatility, MSE, nested models, exchange rates. JEL Classication : C22, C53
1 Introduction
During the last two decades, emphasis has been given in forecasting the volatility of nancial time series. This issue is crucial for policy makers, option traders and investors, since volatility forecasts are important for many nancial decisions, such as implementation and evaluation of asset pricing theories and risk management (i.e. calculation of the Value-at-Risk). The ARCH class of models, pioneered by Engle (1982) and generalized by Bollerslev (1986), is by far the most popular class of econometric models for describing a series with time-varying conditional variance. The out-of-sample forecasting accuracy of the GARCH models is questioned by a number of studies (see, e.g. West et al., 1993; West and Cho, 1995; Franses and van Dijk, 2000). In most of these papers, the evaluation of the GARCH models is based on the Mean Squared Error (MSE) of the forecasts. However, Andersen and Bollerslev (1998) and Christodoulakis and Satchell (1998, 2003) have recently argued that the poor forecasting performance of the GARCH models in empirical studies is deceptive. More specically, these studies reveal that the seemingly poor out-of-sample forecasting behavior of the GARCH models is due to the utilization of the squared shocks as a proxy for the unobserved true conditional volatility. This approximation results in an increase in the MSE of the forecasts (MSE-ination), delivering a misleading evaluation of the GARCH models. In this paper, we examine this issue more thoroughly by deriving analytical formulas of the MSE of the volatility forecasts of the popular GARCH(1,1) model when (i) the true conditional volatility is used and (ii) the squared shocks are used as a proxy for the true conditional volatility. Thus, we provide an analytical formula for the MSE-ination in the case of a GARCH(1,1) model. Furthermore, we go one step forward and examine whether the poor forecasting performance of the GARCH(1,1) model reported in many empirical studies can be attributed to the utilization of a proxy for the unobserved conditional volatility. In most empirical studies, the evaluation of the forecasting accuracy of GARCH(1,1) is based on the comparison of the volatility forecasts of GARCH(1,1) to the volatility forecasts of a model that assumes homoscedasticity, which serves as a benchmark. The comparison of the two models (i.e. GARCH(1,1) and homoscedastic) is usually performed in the context of a formal statistical test for evaluating the relative forecasting performance of alternative econometric models. In order to reveal the eect of the utilization of the squared shocks as a proxy for the unobserved conditional volatility on the
comparison of the relative forecasting performance of the GARCH(1,1) and the homoscedastic models, we derive analytical formulas for the MSE of the forecasts of an homoscedastic model. We reach to two important conclusions with signicant implications for empirical studies. First, the dierence in the forecast accuracy between the GARCH(1,1) model and the homoscedastic model in terms of the MSE criterion is not aected by the utilization of the squared shocks as a proxy for the conditional variance. This means that the MSE criterion remains capable to evaluate properly the relative forecast performance of the two models. Second, the dierence between the two models is maximized for one-step ahead forecasts, while the two models become equivalent as the forecast horizon tends to innity. Our second nding suggests that applied researchers should base the statistical testing of the relative forecast accuracy of the two models on sequences of one-step ahead forecasts, rather than sequences of sstep ahead forecasts (s > 1), in order to increase the power of the tests. Moreover, by means of Monte Carlo simulations, we calculate the empirical size and the size-adjusted power of a variety of statistical procedures to select the GARCH(1,1) model over the misspecied homoscedastic one when the MSE is used for the evaluation of the volatility forecasts. In the literature, there are numerous statistical procedures properly designed to compare either non-nested models, such as the Diebold-Mariano statistic (Diebold and Mariano, 1995), or nested models, such as the test statistics proposed by McCracken (2004). In our case, we are interested in comparing two nested models. However, we consider tests designed for comparing both nested and non-nested models for comparison reasons.1 The simulation results indicate that the majority of the test statistics under scrutiny suer from signicant size distortions, with the exception of the two out of the three statistics proposed by McCracken (2004). For these two statistics only minor nite sample size distortions are observed. As far as the power of the statistics is concerned, the simulation results reveal two important issues. First, the power of the tests is only marginally aected by the sample size used in the estimation procedure as long as the number of observations is sucient to ensure reliable estimates of the GARCH parameters. On the other hand, the power of the tests heavily depends on the number of out-of-sample forecasts. More specically, the power of the tests increases substantially with the number of the out-of-sample forecasts. This suggests that applied researchers should use as many observations as possible for the out-of-sample
Christodoulakis and Satchell (2003) show that the utilization of the squared shocks as a proxy for the true but unobserved conditional volatility invalidates the Diebold-Mariano statistic, even if we are interested in comparing two non-nested GARCH models.
1
forecast exercise. Second, the power of the tests increases with the persistence of the series under scrutiny. This is a favorable result for practitioners, since conditional volatilities are usually highly persistent in empirical studies. In summary, the simulation results show that McCrackens statistics are in general able to indicate whether the CARCH(1,1) model provides more accurate volatility forecasts than the homoscedastic model. Finally, we illustrate the usefulness of our results with an application to ve bilateral exchange rates for the US dollar. We consider the data used by West and Cho (1995). Contrary to the ndings of West and Cho (1995), our results indicate the superiority of the GARCH(1,1) model over the homoscedastic model for forecasting the conditional variance in three out of the ve exchange rates considered. The rest of the paper is organized as follows. In Section 2, we derive the MSEs of the volatility forecasts for the GARCH(1,1) model and the homoscedastic model. Section 3 describes briey selected statistical procedures for comparing the forecast accuracy of two models and presents the results of a Monte Carlo simulation that examines the empirical size and the size-adjusted power of these tests to choose the GARCH(1,1) model over the homoscedastic one. Section 4 contains an illustrative example and Section 5 concludes the paper.
et = ht zt
2 2 h2 t = w + aet1 + bht1
(1)
where w, a and b are positive and zt iid(0, 1). The unit variance assumption for the innovation process, zt , ensures that h2 t is the variance of et , conditional on the information set, Ft1 , that contains the past history of the process up to period t 1. We do not assume a specic distribution for the innovations. However, similarly to Baillie and Bollerslev (1992) (BB henceforth) and for simplicity, we assume that the conditional distribution of et given Ft1 is symmetric with all the existing even-ordered moments2 proportional to the corresponding powers of the
2
r = 0, 1, ..., K 1
The optimal predictor, b h2 t+s , of the conditional variance for forecast horizon s is the condi2 b h2 t+s = Et (ht+s ) = w s1 X (a + b)i1 + (a + b)s1 h2 t+1 i=1
(2)
2 2 where h2 t+1 = w + aet + bht is known at time t. The necessary and sucient condition for the
w 1ab
2 s1 2 b h2 (ht+1 2 ) t+s = + (a + b)
It is obvious that as s , the predictor tends to the unconditional variance 2 . On the other hand, if a + b = 1, we obtain the following predictor:
2 b h2 t+s = w (s 1) + ht+1
2.1 Non-Feasible and Feasible MSEs of the Forecasts of the GARCH(1,1) Model
NF (s) := E [(h2 b2 2 Initially, we calculate the non-feasible conditional MSE, M SEG,t t t+s ht+s ) ], of NF of the volatility forecasts based on provide the following formula for the calculation of M SEG,t
5 the GARCH(1,1) model for speriods ahead based on the true conditional variance h2 t+s . BB
b h2 t+s :
3
NF M SEG,t (s)
i=1
1 1 n tn distribution, kr = (n 2)r (r + 1 ( 2 )1 ( n 2 )( 2 r ) 2 ), r = 1, 2, .., K where () denotes the gamma n function and K = int( 2 ). 4 2 By denition Et (h2 t+s ) = Et (et+s ) for s > 0. 5 In practice, the calculation of this MSE is non-feasible, since h2 t+s is unobservable.
2 X 2 2 i 2 2 where Et (h4 i w2i Et (h2 t+s ) = w + t+s1 ), 1 = (a + b) and 2 = k2 a + ab + b . i i=1 Upon repeated substitution, we obtain that
NF M SEG,t (s) s1 s i1 X X 1 si1 4 2(i1) 2 = (k2 1)a (a + b) { 2 ht+1 + w q + 2 2 i=1 q =1
+2 1 w
s i1 X q =1
1 { q 2 w
sX iq 1 p=1
need to assume that the fourth moment of et is nite. Among others, Hafner (2003) proves
2 2 that the necessary and sucient condition for the niteness of E (e4 t ) is that 3ca + 2ab + b < 1
NF := E [(h2 b2 2 In order to obtain the non-feasible unconditional MSE, M SEG t+s ht+s ) ], we
where c :=
4) E (zt 3 .
In that case:
2 E (e4 t ) = 3cw
NF as follows: M SE NF (s) := We can now derive the formula for the unconditional M SEG G s 1 X 2 2 b2 2 b2 2 E [(h2 (a + b)2(i1) Et (h4 t+s ht+s ) ] = E {Et [(ht+s ht+s ) ]} = E {(k2 1)a t+si )}. After
i=1
So far, we have calculated the M SE of the forecasts of the conditional variance assuming that
2 h2 t+s is known. However, we cannot observe ht+s . Therefore, in empirical studies we evaluate
the forecasting accuracy of GARCH models by using the squared shocks e2 t+s as a proxy for h2 t+s . We will now show that the use of this proxy results in higher M SEs of the volatility forecasts, since e2 t+s is a noisy estimator of the actual conditional volatility. For example, Lopez (2001) states that "...even if one is willing to accept a proxy that is up to 50% dierent from
2 h2 t+s , et+s would fulll this condition only 25% of the time". To be more specic, he shows that 1 2 3 2 7 Pr(e2 t+s [ 2 ht+s , 2 ht+s ]) = 0.2588.
6 7
4 We use the following equality: Et (e4 t+s ) = k2 Et (ht+s ) for s > 0. Lopez (2001) assumes normal innovations, that is zt N (0, 1).
F (s) := E [(e2 b 2 First of all, we calculate the feasible conditional MSE, M SEG,t h2 t t+s t+s ) ]. We
+2 1 w
2 2 {[ 2 + (a + b)s1 (h2 t+1 )] } F (s) := E [(e2 2 b2 2 The feasible unconditional MSE, dened as M SEG t+s ht+s ) ] = E {Et [(et+s
s1 X i=1
It is easy to see that the MSE-ination of the volatility forecasts of CARCH(1,1), M SEIG :=
F (s) M SE NF (s), which corresponds to the increase in the M SE due to the use of e2 M SEG t+s G
Remark (1): M SEIG is independent of s and an increasing function of the GARCH parameters (w, a and b). Finally, M SEIG increases with k2 , suggesting that the eect of
2 approximating h2 t+s with et+s is more pronounced when zt follows a leptokurtic distribution.
This is illustrated in Figure 1, which shows M SEIG for the Gaussian and t(5)-distributions respectively.
2.2 Non-Feasible and Feasible MSEs of the Forecasts of the Homoscedastic Model
In the previous section, we showed that squared residuals e2 t+s are noisy estimators of the unobserved conditional variance h2 t+s . Therefore, the evaluation of the forecast accuracy of
F (s), can be misleading. In GARCH models based on the feasible unconditional MSE, M SEG
this section, we move one step forward and examine whether the poor volatility forecasting performance of GARCH models reported in empirical studies is the result of the utilization of a noisy proxy for the unobserved actual volatility. The utilization of a GARCH model in empirical studies would be justied if this model could at least produce more accurate forecasts
8
Once again, we assume niteness of the fourth moment of et . That is, we assume that E (e4 t ) < , t.
than a model that assumes constant conditional volatility (homoscedasticity). For this reason, many empirical studies compare the volatility forecasting accuracy of GARCH(1,1) to that of an homoscedastic model that serves as a benchmark. In order to reveal the eect of the utilization of the squared shocks as a proxy for the unobserved conditional volatility on the comparison of the relative forecasting performance of the GARCH(1,1) and the homoscedastic models, we need to derive analytical formulas for the MSE of the forecasts of an homoscedastic model. Assume that a researcher ignores the existence of time heterogeneity in the conditional variance and she chooses to estimate a model with constant variance. That is, she erroneously
2 2 b2 assumes that h2 t = , t and she utilizes instead of ht+s to forecast the volatility. We will
calculate the feasible and non-feasible M SEs of the volatility forecasts in this case. First, we
NF (s) := E [(h2 2 )2 ], which is based on the unobserved derive the non-feasible M SE , M SEH t+s
F (s) := E [(e2 2 2 Similarly, we derive the feasible M SE , M SEH t+s ) ]: F (s) = 2 [3cw M SEH
2 Remark (2): Interestingly, the employment of e2 t+s as a proxy for ht+s in the deni-
That is,
M SEIH = 3cw 2
2.3 The Relative Forecasting Performance of the GARCH(1,1) Model and the Homoscedastic Model
We now compare the volatility forecasts of the GARCH(1,1) model to those of the naive homoscedastic model. The following proposition is rather trivial stating that the forecasts of the conditional variance based on the GARCH model are more accurate (in terms of the M SE criterion) than those based on an homoscedastic model. Proposition 1Let R1(s) :=