Beruflich Dokumente
Kultur Dokumente
April 2017
PRELIMINARY DRAFT
Abstract
This paper proposes a new methodology for macroeconomic forecasting in a small open
economy environment based on dynamic factor models (DFMs). Unlike in existing appli-
cations of DFMs for forecasting in small open economies, the proposed methodology allows
for block exogeneity restrictions reflecting that external variables can be taken as exogenous
for domestic variables in a small open economy. The proposed methodology is applied to
forecast real GDP growth and consumer price inflation in Chile, using a large dataset of
domestic and external macroeconomic time series. Out-of-sample forecast comparisons sug-
gest considerable gains in terms of forecast accuracy from incorporating block exogeneity
restrictions, as the restricted DFM produces significantly smaller forecast errors than its
unrestricted version at a medium-term horizon relevant for macroeconomic policy.
Keywords: Factor models; Time series models; Forecasting; Small open economies.
We would like to thank Elas Albagli, Javier Garca-Cicco, Alberto Naudon and seminar participants at the
Central Bank of Chile for useful comments. The views expressed are those of the authors and do not necessarily
represent official positions of the Central Bank of Chile or its Board members.
Phone: +56 22 670 2731; e-mail address: dchernin@bcentral.cl.
Phone: +56 22 670 2193; e-mail address: mkirchner@bcentral.cl.
Address: Agustinas 1180, Santiago, Chile.
1 Introduction
In this paper we propose a new methodology for macroeconomic forecasting in a small open
economy environment based on dynamic factor models (DFMs). DFMs are often adopted for
macroeconomic forecasting (see Eickmeier and Ziegler, 2008). These models allow to capture
the dynamics of a large number of variables through a few latent factors. Thereby, the number
of estimated parameters does not increase as quickly with the number of variables as in other
multivariate time series models, implying potential gains in terms of forecast accuracy. However,
most existing applications of DFMs for forecasting in small open economies do not impose the
theoretical restriction that foreign variables can be taken as exogenous for domestic variables in
a small open economy, and might thus leave additional accuracy gains unexploited. We therefore
aim to assess the benefits of explicitly incorporating this kind of small open economy restrictions
through an application of the proposed methodology for Chile.
The application focuses on forecasting quarterly real gross domestic product (GDP) growth
and consumer price index (CPI) inflation in Chile. Our dataset starts with the introduction of
the Chilean inflation targeting monetary framework in 2001, and includes a broad set of quarterly
macroeconomic indicators with 114 domestic and 64 international variables. These variables are
modelled using a DFM. Only 3 domestic and 3 foreign factors are sufficient to explain about 50%
of the variability of the underlying macroeconomic indicators. However, while the unrestricted
version of our specified DFM would have more than 1,100 parameters to be estimated, the small
open economy restrictions that we impose allow to reduce the number of parameters by about a
third and thereby yields a significantly more parsimonious framework.
We compare the restricted DFM with its unrestricted version and several simple univariate
benchmark models in terms of their out-of-sample forecast performance for real GDP growth
and CPI inflation, conducting recursive regressions and computing the implied forecasts up to 8
quarters ahead in a pseudo real-time setup. We find that the restricted DFM significantly out-
performs the unrestricted model in terms of forecast accuracy, according to standard statistical
tests on the root mean squared errors for the point forecasts. In addition, the restricted DFM
beats the univariate benchmarks for GDP forecasts at medium-term horizons, while it tends to
produce similar forecast errors as the benchmarks for CPI inflation.
Several previous studies have applied DFM frameworks and other types of multivariate factor
models for macroeconomic forecasting in small open economies; see, for instance, Marcellino,
Stock, and Watson (2003) for several euro area countries, Brisson, Campbell, and Galbraith
(2003), Cheung and Demers (2007) and Gosselin and Tkacz (2010) for Canada, Camacho and
Sancho (2003) for Spain, Artis, Banerjee, and Marcellino (2005) for the United Kingdom, Ferreira,
1
Bierens, and Castelar (2005) for Brazil, Matheson (2006) and Giannone and Matheson (2007)
for New Zealand, Moser, Rumler, and Scharler (2007) for Austria, Bandt, Michaux, Bruneau,
and Flageollet (2007) for France, Chow and Choy (2009) for Singapore, Gupta and Kabundi
(2010) for South Africa, Martinsen, Ravazzolo, and Wulfsberg (2014) for Norway, and Aguirre
and Cspedes (2004) and Echavarra and Gonzlez (2011) for Chile. However, while theoretical
small open economy (i.e., block exogeneity) restrictions have often been applied in studies based
on structural factor-augmented vector autoregressive (FAVAR) models (see, e.g., Boivin and
Giannoni, 2007; Charnavoki and Dolado, 2014; Kamber, Karagedikli, Ryan, and Vehbi, 2016), to
our knowledge none of the available studies on forecasting has explored the usefulness of block
exogeneity restrictions in a small open economy environment, as we do in this paper.
The result that (restricted) DFMs can be useful devices for macroeconomic forecasting at
medium-term horizons, even in a relatively short sample such as ours, is especially relevant for a
small open emerging economy with an inflation targeting monetary framework such as Chile. In
particular, the Central Bank of Chile (CBCH) is conducting forecasts of real GDP and inflation
up to a two-year horizon, which is the maximum period during which the CBCH normally
attempts to take the projected annual consumer price inflation rate back to 3% (CBCH, 2007),
an operational definition of price stability that is paralleled (with different specified targets and
policy horizons) in many other small open economies (see Hammond, 2012).1 Reliable forecasts
of macroeconomic variables are therefore a key requirement for a successful implementation of
inflation targeting frameworks. However, given the relatively short available time series in many
emerging economies compared to most advanced economies and the relatively recent adoption of
inflation targeting regimes in most cases, it seems important that the models used for forecasting
in such an environment are specified in a parsimonious way, for instance, by exploiting potentially
useful theoretical (e.g., small open economy) restrictions in popular forecasting models such as
DFMs. Our results support that hypothesis for the case of Chile.
The rest of the paper is structured as follows. Section 2 describes the model and its estimation.
Section 3 presents our empirical application and results. Finally, Section 4 concludes.
2 Econometric Model
In this section we describe the dynamic factor model with block exogeneity restrictions and its
estimation. The model assumes that the latent factor dynamics follow a vector autoregressive
(VAR) process and that the (explicit) relationship between the latent factors and the observed
1
This kind of central bank decision-making process is often called inflation-forecast targeting (Svensson,
2002).
2
set of macroeconomic indicators is contemporaneous. Since our goal is to produce forecasts for
a small open economy, we consider two blocks: domestic and external. We impose a structure
that allows spillover effects from the external block onto the domestic block, but not vice-versa.
2.1 The Dynamic Factor Model (DFM) with block exogeneity restrictions
Let XtD represent an N D 1 vector of domestic macroeconomic time series (domestic block) and
XtE an N E 1 vector of external macroeconomic time series (external block). Data is available
for t = 1, ..., T, and all elements in XtD and XtE are assumed to be standardized and properly
differentiated, such that they are rendered to stationary processes. Also, let the information
in XtD and XtE be summarized by the latent factors FtD and FtE , respectively, where FtD is a
K D 1 vector of domestic factors and FtE is a K E 1 vector of external factors. The dimensions
of FtD and FtE are such that K D N D and K E N E . We assume that XtD and XtE admit a
factor model representation given by:
XtE EE 0 FtE eE
t
= + , (1)
XtD ED DD FtD eD
t
where EE , ED and DD are conformable matrices of factor loadings, and the idiosyncratic
innovations in eE D
t and et are allowed to follow univariate autoregressions as in Stock and Watson
(2010):
i (L) eji,t = j + vi,t , i = 1, . . . , N j , (2)
where j {E, D}, i (L) are conformable lag polynomials of finite order, j are constants,
and vi,t are i.i.d. disturbances such that vt N (0, v ) with v diagonal. The presence (and
structure) of serial correlation in the idiosyncratic innovations of (1) will be determined on a
case-by-case basis, using standard univariate serial correlation tests. This type of specification is
often called an exact factor model, and can be thought of as a version of the model introduced
by Chamberlain and Rothschild (1983) without cross-correlation in the error terms.
We further assume that the joint dynamics of the latent factors are given by the following
reduced-form VAR:
EE
(L) 0 FtE E E
t
= + , (3)
ED DD
(L) (L) FtD D D
t
where (L)EE , (L)ED and (L)DD are conformable lag polynomials of finite order p, E and
D are constant terms, and the reduced-form innovations E D
t and t may be cross-correlated,
3
such that [E D
t , t ] N (0, ). Finally, we assume that E[et ts ] = 0 s.
Equations (1), (2) and (3) represent our DFM framework, which we will use to construct
forecasts for the Chilean economy. The specification of (1) ensures that we capture the contem-
poraneous effects that the external factors exert upon the domestic block of observables through
the matrix ED . In turn, the block of zeros in the upper right region of the factor loadings
matrix reflects our small open economy assumption, imposing that the evolution of the domestic
factors has no impact on the external block of observables. Similarly, the specification of (3)
ensures that there is no dependence of the external factors with respect to the domestic ones.
Most of the available literature for small open economies that employs DFMs with block
exogeneity restrictions such as in (1) and (3) addresses structural impulse response analysis
based on identified FAVARs. However, the imposition of block exogeneity restrictions may
also be useful in a forecasting application of DFMs: by focusing on a small open economy
we may assume absence of spillovers (both past and present) from the domestic factors onto
the external factors, and from the domestic factors onto the external block of macroeconomic
indicators. This should result in efficiency gains for the parameter estimates, due to the associated
dimensionality reduction of the econometric problem. On the other hand, the risk of potential
model misspecification (and thus presence of biased and inconsistent estimates) is relatively low
given the factual nature of the small open economy assumption.
2.2 Estimation
We now discuss the main issues regarding parameter estimation involving (1), (2) and (3). For
this, it is useful to define X E = [X1E , . . . , XTE ] as the T N E matrix of external time series,
F E = [F1E , . . . , FTE ] as the T K E matrix of external factors, X D = [X1D , ..., XTD ] as the
T N D matrix of domestic time series, and F D = [F1D , ..., FTD ] as the T K D matrix of domestic
factors. Furthermore, for the version of the model without block exogeneity restrictions, to which
we will refer to as unrestricted DFM, we define a T N matrix X = [X1 , . . . , XT ] including
the full set of external and domestic observables in Xt = [XtE , XtD ] for t = 1, . . . , T (with
N = NE + ND ), and accordingly a T K factor matrix F = [F1 , ..., FT ] and an unrestricted
N K matrix of factor loadings (where we will set K = KE + KD in our application).
2.2.1 Identification
The model defined by (1), (2) and (3) is econometrically not identified and thus cannot be
Ft }T be a solution to the estimation
estimated. To see this, without loss of generality let {, t=1
of the unrestricted version of (1). This implies that there always exists an arbitrary non-singular
4
Ft }T defined by Ft = H 1 Ft for t = 1, . . . , T and = H
matrix H such that the set {,
t=1
following most the DFM literature. A useful feature of this identification scheme is that, even
though the contemporary factors are orthogonal, serial and lagged cross-correlation between
them is still present, maintaining therefore their suitability for VAR forecasting.2
The estimation via principal components of the external factors and their associated loadings
relies on the covariance matrix of the observables. More specifically, the covariance matrix
d E ) = X E X E /(T 1). Then, EE
of the standarized variables X E is estimated as Cov(X
d E ), normalized and ordered decreasingly. It can be
is estimated as the eigenvectors of Cov(X
shown that the eigenvalues associated with these eigenvectors represent the portion of the total
variance explained by each of the factors.3 Once EE is ready, the factors are estimated as linear
combinations of X E , where the weights correspond to the factor loadings: F E = X E EE .
Up to this point, F E spans the same space as X E . The previous procedure simply constitues a
rotation of X E , and thus, generates the same number of factors as observables (i.e., K E = N E ).
However, since our goal is to summarize information, it is necessary to remove factors and
loadings, while minimizing the loss of explained variance. Since the eigenvalues associated with
the eigenvectors in EE represent the portion of the total variance in X E explained by each
of the factors, the first obvious candidate for elimination is the eigenvector associated with the
lowest eigenvalue (which corresponds to the last column of EE and FE , respectively). This
2
For further references regarding identification see Bai and Ng (2013).
3 d
Since Cov(X E
) is by construction a real symmetric matrix, its eigenvectors will be real and orthogonal,
d
which alongside their normalization, render EE to be an orthonormal basis of the space spanned by Cov(X E
).
d
Furthermore, since Cov(X E
) is positive definite, all its eigenvalues will be positive.
5
elimination step is repeated until the desired number of factors is achieved (see Section 3.2.1).
Once the external factors and loadings are estimated, it is possible to estimate the domestic block
imposing the block exogeneity restrictions present in (1). This is done following the iterative
approach proposed by Boivin and Giannoni (2007); see also Charnavoki and Dolado (2014) and
Kamber et al. (2016). The idea behind this approach is that the estimated external factors
can be used to orthogonalize (or externally correct) the estimated domestic factors, so as to
control for the effect of foreign factors on the domestic block. This guarantees that the estimated
domestic factors capture only the dynamics in the domestic variables not captured by the foreign
factors. Formally, this is achieved using the following iterative approach:
D(0) T
1. Estimate {EE , FtE }Tt=1 and {DD(0) , Ft }t=1 via principal components independently
following the estimation procedure described in Section 2.2.2 (suitably adapted for the
domestic block) to obtain K E external and K D domestic factors.
D(i)
2. For i = 0, regress XtD on FtE and Ft using Ordinary Least Squares (OLS) to obtain
ED(i) (which corresponds to the OLS coefficients associated with FtE ).
D(i) = X D ED(i) F E .
3. Compute Xt t t
D(i+1) T D(i) .
4. Estimate {DD(i+1) , Ft }t=1 using the first K D principal components of Xt
D(i+1) T
5. Repeat steps 2-4 for i = 1, 2, 3, . . . , until convergence of {Ft }t=1 .
The block exogeneity restrictions present in the VAR from (3) are imposed by a reparametrization
of the unrestricted version of (3) that contains the desired linear constraints (i.e., exogeneity of the
external factors with respect to the domestic factors). As a consequence of this reparametrization,
not all the regressors are shared throughout all the equations present in the system. Thus,
standard equation-by-equation OLS does not yield efficient parameter estimates.4 This issue can
be tackled by applying Feasible Generalized Least Squares (FGLS), which yields asymptotically
efficient and consistent parameter estimates by estimating the covariance matrix of the residuals
via an iterative approach. Also, since the VAR under consideration is stationary, standard
asymptotic inference continues to apply. For further details see, e.g., Ltkepohl (2007).
4
See Zellner (1962) for a complete discussion.
6
2.2.5 Unrestricted DFM
The estimation of the unrestricted DFM encompasses similar steps as the ones described pre-
viously. Under this case, however, the estimated loadings are extracted from the complete
standarized dataset X, which includes all external and domestic observables. This yields the
resulting estimated eigenvalues (and their corresponding eigenvectors) to be associated with the
joint dispersion of all the variables under consideration. Consequently, F constitutes linear
combinations of both external and domestic observables. Once the factors F are available, the
unrestricted VAR which models their joint dynamics is estimated. Unlike for the VAR with block
exogeneity restrictions, equation-by-equation OLS yields asymptotically efficient and consistent
parameter estimates, given that the regressors are shared across all equations (Zellner, 1962).
2.3 Forecasts
Forecasts from (3) are obtained using a standard recursive scheme, in which the point forecast
corresponds to the conditional mean of each variable given the current information set. The
sample under consideration is quarterly, and forecasts for time horizons ranging from 1 up to 8
quarters ahead are computed. Point forecasts for the observables are then obtained by projecting
the forecasts from (3) onto the observables using the estimated loadings matrix, and adding the
forecasts for the idiosyncratic innovations from (2).
Formally, consider for simplicity the forecasts of the unrestricted DFM, in which the lag
polynomials, the idiosyncratic innovations of (1) and the cross-correlated error terms of (3) are
given by (L), et and t respectively, and are analogous to their counterparts from the restricted
DFM. Then, the forecasts of the observable i are determined by:
= E[i (L)Ft |Xi,t1 , ...] + E[i (L)(Xi,t i (L)Ft )|Xi,t1 , ...], (4)
The last equality above follows from the assumption that E[et tj ] = 0 j, alongside the i.i.d.
assumptions for vi,t and t . As (4) shows, under this framework, the correlation among the series
of observables in Xt only occurs through the factors Ft . However, in those cases in which ei,t is
serially correlated, the term E [ei,t |Xt1 , ...] leads to the appearance of lagged terms of Xi,t in
the forecast equation.
7
3 Empirical Implementation and Results
In this section we present our empirical implementation and results for Chile, focusing on real
GDP and CPI forecasts. The model specification, which includes determining the number of
factors in (1), the number of lags in (3) and the autocorrelation structure of (2) is determined
considering the complete sample. In the pseudo real-time setup for the forecast accuracy assess-
ment, we maintain the model specification selected for the complete sample.
3.1 Data
The data used for the estimation of the DFM is a balanced panel of 178 quarterly series for the
period from 2001Q1 to 2016Q3. The starting point for this panel coincides with the kick-off of
Chiles inflation targeting monetary framework. The data series are divided into external and
domestic variables. The external block comprises 64 international series that are categorized
into 14 groups: GDP, consumption, investment, government expenditure, prices, unemployment,
foreign trade, monetary policy rates, real exchange rates, sovereign bond yields, stock market
indicators, sectoral economic indicators, commodity prices, and volatility indexes. With the ex-
ception of the latter two, all other groups include variables (when available) from China, the
United States, Japan and the euro area, which combined constitute 63% of Chiles total foreign
trade.5 The domestic block includes 114 series, divided into 13 groups: GDP, sectoral economic
indicators, expenditure, prices, employment, other labor market indicators, wages, credit, mon-
etary aggregates, interest rates, stock market, exchange rates and economic surveys. All data
series are transformed to induce stationarity, demeaned and standardized prior to the estima-
tion. Finally, the variables present in the external block were mostly retrieved using Bloomberg,
whereas the domestic variables were obtained through the CBCHs statistical database.6 The
complete list of data series and description of data transformations is provided in the appendix.
We determine the total number of factors jointly based on the criteria proposed by Bai and Ng
(2002), which we compute for different combinations of domestic and foreign factors, as well as
the implied explained variance. Bai and Ng (2002) propose selecting the K factors of X based on
information criteria. The idea is to assess the trade-off between the benefit in increased goodness-
5
Source: Ministry of Foreign Affairs of Chile, General Directorate for International Economic Relations (DI-
RECON), Foreign Trade Quarterly Report, February 2017.
6
See http://si3.bcentral.cl/Siete/secure/cuadros/home.aspx?Idioma=en-US.
8
Table 1: Bai and Ng (2002) selection criteria for the DFM with block exogeneity.
Note: KD and KE , respectively, indicate the number of domestic and external factors considered.
of-fit through including an additional factor, against the associated cost in parametric variability
arising from its inclusion. This is done by minimizing the following penalized likelihood (or log
sum of squares):
1
IC (K) = ln tr X F X F + Kg (N, T ) , (5)
NT
where g(N, T ) is a penalty factor such that g(N, T ) 0 and min{N, T }g(N, T ) as N, T
. Bai and Ng (2002) show that the value of K that minimizes an information criterion for
is a consistent estimate for the true
given g(N, T ) satisfying these conditions, denoted by K,
value of K, assuming the latter is finite and does not increase with N, T . Following Stock and
Watson (2002), we use three of the penalty functions suggested by Bai and Ng (2002):
N +T NT
g1 (N, T ) = ln , (6)
NT N +T
N +T
g2 (N, T ) = ln (min {N, T }) , (7)
NT
9
Table 2: Bai and Ng (2002) selection criteria for the unrestricted DFM.
Penalty functions
g1 (N, T ) g2 (N, T ) g3 (N, T )
K =1 -0.105 -0.098 -0.121
K =2 -0.156 -0.143 -0.189
K =3 -0.201 -0.181 -0.251
K =4 -0.206 -0.180 -0.272
K =5 -0.215 -0.183 -0.299
K =6 -0.210 -0.171 -0.310
K =7 -0.205 -0.159 -0.321
K =8 -0.184 -0.132 -0.318
K =9 -0.164 -0.106 -0.314
K = 10 -0.142 -0.077 -0.309
ln min {N, T }2
g3 (N, T ) = . (8)
min {N, T }2
For the case of the restricted DFM, we estimate the joint information criteria of Bai and Ng
(2002) applying the subsequent iterative steps:
E(i)
1. For i = 1, compute {EE(i) , Ft }Tt=1 via principal components, following the estimation
procedure described in Section 2.2.2 to obtain i external factors.
E(i) D(j) T
2. Given {EE(i) , Ft }Tt=1 , for j = 1 apply steps 2-5 of Section 2.2.3 to obtain {DD(j) , Ft }t=1 ,
the set of externally-corrected domestic factors and loadings.
E(i) D(j) T
3. Join {EE(i) , Ft }Tt=1 and {DD(j) , Ft Ft }T and compute the information
}t=1 in {, t=1
criterion from (5), using the penalty functions specified in (6), (7) and (8).
4. Repeat steps 1-3 for i = 2, 3, 4, . . . and j = 2, 3, 4, . . . until the desired maximum number
of external/domestic factors is reached.
Table 1 documents the resulting statistics for the restricted DFM. The joint information
criteria suggest fairly parsimonious specifications: all three penalty functions generate elliptic
paraboloids (within the range analyzed) with minimums at relatively low values of KE and KD .
For the penalty function g1 , the preferred specification has 3 external and 2 domestic factors.
Penalty function g2 implies the most parsimonious specification, with 2 external factors and 1
domestic factor. Finally, penalty function g3 suggests 4 external and 2 domestic factors. For the
case of the unrestricted DFM, Table 2 shows that specifications with 5 factors (functions g1 and
g2 ) or 7 factors (function g3 ) are preferred according to the information criteria.
10
External Block
Explanied variance by PC
0.5
0
10 20 30 40 50 60
Domestic Block
1
0.5
0
10 20 30 40 50 60 70 80 90 100 110
Unrestricted
1
0.5
0
20 40 60 80 100 120 140 160
Number of factors
Figure 1: Explained variance for the external and domestic observables (NE = 64 and ND = 114)
in the restricted DFM, and all observables (N = 178) in the unrestricted DFM. Note: The
selected number of factors, highlighted by the horizontal lines, corresponds to KE = 3, KD = 3
and K = 6, respectively. Under these specifications, the amount of total explained variance is:
45,1% for the external block and 49.5% for the domestic (externally-corrected) block in the
restricted DFM, and 50% for the complete dataset in the unrestricted DFM.
Although in general desirable, parsimonious specifications have some caveats in DFMs. The
overall level of explained variance attained by the principal components estimator is a monotoni-
cally increasing and concave function of the number of factors. Thus, specifications that consider
a very small number of factors are only able to explain a small fraction of the total variance of
the observables. This leads to potential presence of strong cross-correlations in the idiosyncratic
innovations of (1) and lower forecasting power. We amend this issue by defining an ad-hoc
minimum threshold level for explained variance in each block, which we combine with the infor-
mation of Tables 1 and 2. Given that we want to maintain the models structure as parsimonious
as possible, we set that ad-hoc level of minimum explained variance at around 50%. Under this
constraint, the specification that achieves the optimal averaged value for the information criteria
over the three penalty functions is: KE = 3 and KD = 3. For the latter, the total explained
variances for the external and domestic blocks reach 45.1% and 49.5% respectively. This can be
11
Table 3: Schwarz information criteria for vector autoregression.
DF Mr DF Mu
p=0 13.34 15.53
p=1 9.52 8.91
p=2 9.91 10.25
p=3 10.95 11.36
p=4 10.59 11.70
Note: p indicates the lag order of the VAR, and DF Mr and DF Mu refer to the restricted and
unrestricted DFM, respectively.
seen in Figure 1, which shows the level of explained variance for every combination of factors in
each block, alongside the resulting explained variance for the selected specification (horizontal
line). For the unrestricted DFM, we set K = 6 to achieve a level of explained variance of 50%.
The number of lags p considered for the VAR from (2) is determined using the Schwarz-Bayesian
information criterion. We choose this criterion over other common alternatives present in the
literature (e.g., Akaike information criterion) based on: (1) its consistency and (2) its tendency
towards choosing parsimonious specifications. The results are shown in Table 3 and suggest that
the optimal specification includes only 1 lag for both versions of the model.
Once the factors and loadings of (1) are computed, the series of idiosyncratic innovations for
observable i can be obtained as: eE E EE FtE if i belongs to the external block (the
i,t = Xi,t i
the domestic block.7 Then, standard serial correlation tests are conducted, so as to determine
(on a case-by-case basis) if the imposition of an autoregressive structure for the idiosyncratic
innovations is warranted. In that case, the parameters of the univariate model i (L) are esti-
mated using OLS. We choose the Lagrange multiplier (LM) test, setting the lag order pLM of
the auxiliary regression to 4. We compute the LM-statistic,8 which is asymptotically distributed
as 2 (pLM ), and assess its statistical significance considering a 5% confidence level.9
Table 4 exhibits the LM test results under different specifications for the idiosyncratic inno-
vations of the real GDP growth rate and the CPI inflation rate. The results for the real GDP
7
The subscript i used in the loadings matrices denotes their i-th row.
8
The LM-statistic is defined as obs R2 , where obs denotes the number of observations and R2 the uncentered
coefficient of determination of the auxiliary regression.
9
This confidence level is maintained throughout this paper.
12
Table 4: Serial correlation (Lagrange multiplier) tests for idiosyncratic innovations.
GDP CPI
DF Mr DF Mu DF Mr DF Mu
AR(0) AR(0) AR(0) AR(1) AR(0)
LM-statistic 3.95 0.78 10.80 6.21 5.64
Prob. 2 (4) 0.41 0.94 0.03 0.18 0.23
Note: The null hypothesis of the Lagrange multiplier (LM) test is absence of residual serial
correlation. The test statistic has a chi-squared distribution with 4 degrees of freedom. AR(0)
indicates a white noise specification, while AR(1) indicates an autoregressive specification of
order 1. DF Mr and DF Mu refer to the restricted and unrestricted DFM, respectively.
growth rate suggest that there is no need to impose additional innovation structure for that vari-
able. The null (absence of serial correlation) is not rejected for both versions of the DFM. Hence,
we use a white noise specification for real GDP growth. In turn, the p-values associated with the
test for the CPI inflation rate under the restricted DFM fall inside the rejection region of the
null at a 5% confidence level. Consequently, an AR(1) structure is imposed for CPI inflation,
which effectively renders the innovations to white noise according to the test results for that
specification. Finally, the unrestricted DFM exhibits white noise innovations for both variables
considered according to the LM test, and therefore, no further residual autocorrelation structure
is included in the case of the unrestricted DFM.
Recursive forecasts are computed using 24 samples, going from 2001Q1-2010Q4 up to 2001Q1-
2016Q3 (the latter corresponds to the complete dataset). We use an incremental scheme, given
the relatively narrow set of observations at hand. The model is reestimated at every iteration,
and recursive out-of-sample forecasts of 1 up to 8 quarters ahead are generated at each step.
We compute the implied root mean squared errors (RMSEs) of the real GDP growth and CPI
inflation forecasts for each of the time horizons under consideration. We conduct this recursive
forecasting exercise for both versions of the DFM, as well as for five univariate benchmark models:
AR(1), AR(2), AR(3), AR(4) and a random walk (RW) specification.
In order to formally assess the forecast accuracy of the aforementioned set of models, we
apply the modified version of the Diebold-Mariano (DM) test proposed by Harvey, Leybourne,
and Newbold (1997). Through simulation experiments, Diebold and Mariano (1995) showed that
the normal distribution can be a very poor approximation of the DM tests finite-sample null
distribution. Their results indicate that the DM test can have the wrong size and generate a
13
GDP growth - restricted
0.04
0.02
0.02
Figure 2: Recursive forecasts for real GDP growth with KE = 3, KD = 3 (restricted DFM) and
K = 6 (unrestricted DFM). Note: Recursive forecasts generated using an incremental window
scheme for 1-8 steps ahead.
bias towards type I errors depending on the degree of serial correlation among the forecast errors
and the sample size. In turn, the modified DM test has better small-sample properties by: (1)
making a bias correction to the DM test statistic and (2) comparing the corrected statistic with
a Students t-distribution. We refer to Harvey et al. (1997) for a complete discussion.
Figure 2 displays the results (point forecasts) from the recursive exercise for real GDP growth.
The results show that the unrestricted DFM tends to overpredict the observed growth rates
towards the end of the sample, unlike the restricted DFM. In general, the unrestricted model
seems to produce more dispersed forecasts compared to the restricted model, which might be
due to the relatively heavy parameterization of the unrestricted model.
The implied RMSEs and prediction equality tests for real GDP growth are exhibited in the
top panels of Tables 5 and 6.10 These results suggest that for GDP forecasts, the restricted DFM
10
Table 6 exhibits blank values for forecast horizons in which the RMSEs implied by the restricted DFM are
14
Table 5: Root mean squared errors (103 ) of recursive forecasts.
Note: Root mean squared errors (1-8 steps ahead) for real GDP growth and CPI inflation
forecasts. These values where generated using an incremental window scheme ranging from
2001Q1-2010Q3 to 2001Q1-2016Q3.
(DF Mr ) outperforms the unrestricted DFM (DF Mu ) in terms of forecast accuracy at medium-
term forecast horizons. In particular, for forecast horizons of more than a few quarters the
RMSEs implied by the restricted DFM are considerably lower than the RMSEs implied by the
unrestricted model. The p-values from Table 6 show that statistically significant differences start
to appear for forecast horizons from 3 quarters onwards. Similarly, the restricted DFM beats all
univariate benchmark models considered starting from 4-step-ahead forecasts and onwards. In
addition, the results from Table 6 indicate that the improved performance of the restricted DFM
with respect to the univariate benchmarks is statistically significant.
Figure 3 displays the results from the recursive exercise for CPI inflation. The results are similar
to the ones for real GDP growth in that the restricted DFM seems to produce more accurate
forecasts. The larger dispersion and excessive persistence of the forecasts from the unrestricted
greater than the RMSEs from the unrestricted DFM.
15
Table 6: Harvey et al. (1997) prediction equality tests for RMSEs, DF Mr vs. benchmarks.
Note: p-values for the Harvey et al. (1997) prediction equality tests. The null hypothesis is
equal predictive ability between the restrictive DFM and the respective benchmark. Blanks are
left in those cases in which the RMSE implied by the benchmark resulted lower than the RMSE
from the restrictive DFM.
model is even more notorious in this case, generating apparent upward and downward biases in
the first and second half of the evaluation sample, respectively. These issues are corrected in the
more parsimonious restricted model specification.
The bottom panels of Tables 5 and 6 show the RMSEs and prediction equality tests for
the case of CPI inflation forecasts. The results suggest that the restricted DFM beats the
unrestricted DFM at almost every forecast horizon considered. Nevertheless, the differences
are only statistically significant for 3-step-ahead forecasts, partly due to the relatively large
dispersion of the RMSEs for CPI inflation. Similarly, the restricted DFM beats several of the
univariate benchmarks at short horizons of less than a year, although the differences are not
found to be statistically significant in most cases, with the exception of the random walk (where
the differences are significant for horizons from 2 quarters onwards).
16
CPI inflation - restricted
0.02
0.01
-0.01
2002 2004 2006 2008 2010 2012 2014 2016 2018
0.02
0.01
-0.01
2002 2004 2006 2008 2010 2012 2014 2016 2018
Actuals Forecasts
Figure 3: Recursive forecasts for CPI inflation with KE = 3, KD = 3 (restricted DFM) and
K = 6 (unrestricted DFM). Note: Recursive forecasts generated using an incremental window
scheme for 1-8 steps ahead.
3.4 Robustness
In order to assess the robustness of the results presented in Section 3.3, we compute the RMSEs
for several variations of our baseline specification of the DFM (KE = 3, KD = 3 for the restricted
version and K = 6 for the unrestricted version), for 1-8 step-ahead forecasts. The alternative
specifications are selected using the results of Tables 1 and 2 as reference. In both cases it
can seen that the lowest values for the information criteria are achieved for combinations of 4
and 5 factors. The results, exhibited in Figure 4, suggest that the restricted DFM continues to
outperform its unrestricted counterpart, even when (slightly) changing the factor specification of
the model. Specifically, for real GDP forecasts, Figure 4 reveals a consistent gap in the RMSEs
between the restricted and the unrestricted model, for medium-term forecast horizons. Regarding
the CPI inflation forecasts, a slight advantage is seen for the restricted model for relatively short
horizons.
17
10-3 GDP growth
1 2 3 4 5 6 7 8
4.5
1 2 3 4 5 6 7 8
Forecast horizon
base r base u alt1 r alt1 u alt2 r alt2 u alt3 r
Figure 4: RMSEs for different specifications of the DFM. Note: base r: KE = 3, KD = 3; base
u: K = 6; alt1 r: KE = 2, KD = 2; alt1 u: K = 4; alt2 r: KE = 2, KD = 3; alt2 u:
K = 5; alt3 r: KE = 3, KD = 2.
4 Conclusions
This paper has proposed a new methodology for macroeconomic forecasting in a small open
economy environment based on a dynamic factor model. The proposed methodology allows to
incorporate block exogeneity restrictions of external with respect to domestic observables, both
in the observation equation and the transition equation of the model, in order to replicate the
theoretical interaction among these two groups of variables in small open economies. In compar-
ison to standard unrestricted DFMs, this requires a number of modifications in the estimation
of the model and adaptations of specification tests.
An application for Chile focusing on forecasting real GDP growth and CPI inflation shows
that the restricted DFM significantly outperforms the unrestricted version of the model in terms
of the accuracy of point forecasts at medium-term horizons, according to standard statistical
tests. In addition, by incorporating block exogeneity restrictions the DFM is able to beat several
simple univariate benchmark models for real GDP forecasts and tends to produce similar fore-
18
cast errors for CPI inflation, whereas without the restrictions the model would produce inferior
forecasts than most of the benchmark models considered. Hence, based on this application we
conclude that small open economy restrictions can be a useful means to improve the forecast
performance of DFMs in a small open economy environment.
References
Aguirre, A., and L. F. Cspedes (2004): Uso de Anlisis Factorial Dinmico para Proyecciones
Macroeconmicas, Working Papers Central Bank of Chile 274, Central Bank of Chile.
Artis, M. J., A. Banerjee, and M. Marcellino (2005): Factor forecasts for the UK, Journal of
Forecasting, 24(4), 279298.
Bai, J., and S. Ng (2002): Determining the Number of Factors in Approximate Factor Models,
Econometrica, 70(1), 191221.
(2013): Principal components estimation and identification of static factors, Journal of Econo-
metrics, 176(1), 1829.
Bandt, O. D., E. Michaux, C. Bruneau, and A. Flageollet (2007): Forecasting inflation using
economic indicators: the case of France, Journal of Forecasting, 26(1), 122.
Boivin, J., and M. P. Giannoni (2007): Global Forces and Monetary Policy Effectiveness, in Inter-
national Dimensions of Monetary Policy, NBER Chapters, pp. 429478. National Bureau of Economic
Research, Inc.
Brisson, M., B. Campbell, and J. W. Galbraith (2003): Forecasting some low-predictability time
series using diffusion indices, Journal of Forecasting, 22(6-7), 515531.
Camacho, M., and I. Sancho (2003): Spanish diffusion indexes, Spanish Economic Review, 5(3),
173203.
CBCH (2007): Central Bank of Chile: Monetary Policy in an Inflation Targeting Framework, Central
Bank of Chile.
Chamberlain, G., and M. Rothschild (1983): Arbitrage, Factor Structure, and Mean-Variance
Analysis on Large Asset Markets, Econometrica, 51(5), 12811304.
Charnavoki, V., and J. Dolado (2014): The Effects of Global Shocks on Small Commodity-
Exporting Economies: Lessons from Canada, American Economic Journal: Macroeconomics, 6(2),
20737.
Cheung, C., and F. Demers (2007): Evaluating Forecasts from Factor Models for Canadian GDP
Growth and Core Inflation, Staff Working Papers 07-8, Bank of Canada.
19
Chow, H. K., and K. M. Choy (2009): Analyzing and forecasting business cycles in a small open
economy: A dynamic factor model for Singapore, OECD Journal: Journal of Business Cycle Mea-
surement and Analysis, 2009(1), 1941.
Diebold, F. X., and R. S. Mariano (1995): Comparing Predictive Accuracy, Journal of Business
& Economic Statistics, 13(3), 253263.
Echavarra, G., and W. Gonzlez (2011): Un Modelo de Factores Dinmicos de Pequea Escala
para el Imacec, Notas de Investigacin Journal Economa Chilena (The Chilean Economy), 14(2),
109118.
Eickmeier, S., and C. Ziegler (2008): How successful are dynamic factor models at forecasting
output and inflation? A meta-analytic approach, Journal of Forecasting, 27(3), 237265.
Ferreira, R. T., H. Bierens, and I. Castelar (2005): Forecasting Quarterly Brazilian GDP
Growth Rate With Linear and NonLinear Diffusion Index Models, Economia, 6(3), 261292.
Giannone, D., and T. D. Matheson (2007): A New Core Inflation Indicator for New Zealand,
International Journal of Central Banking, 3(4), 145180.
Gosselin, M.-A., and G. Tkacz (2010): Using dynamic factor models to forecast Canadian inflation:
the role of US variables-super-1, Applied Economics Letters, 17(1), 1518.
Gupta, R., and A. Kabundi (2010): Forecasting Macroeconomic Variables in a Small Open Economy:
A Comparison between Small- and Large-Scale Models, Journal of Forecasting, 29(1-2), 168185.
Hammond, G. (2012): State of the art of inflation targeting, no. 29 in Handbooks. Centre for Central
Banking Studies, Bank of England.
Harvey, D., S. Leybourne, and P. Newbold (1997): Testing the equality of prediction mean
squared errors, International Journal of Forecasting, 13(2), 281291.
Kamber, G., O. Karagedikli, M. Ryan, and T. Vehbi (2016): International spill-overs of un-
certainty shocks: Evidence from a FAVAR, CAMA Working Papers 2016-61, Centre for Applied
Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
Ltkepohl, H. (2007): New Introduction to Multiple Time Series Analysis. Springer Berlin Heidelberg.
Marcellino, M., J. H. Stock, and M. W. Watson (2003): Macroeconomic forecasting in the Euro
area: Country specific versus area-wide information, European Economic Review, 47(1), 118.
Matheson, T. D. (2006): Factor Model Forecasts for New Zealand, International Journal of Central
Banking, 2(2), 169237.
20
Moser, G., F. Rumler, and J. Scharler (2007): Forecasting Austrian inflation, Economic Mod-
elling, 24(3), 470480.
Stock, J. H., and M. W. Watson (2002): Forecasting Using Principal Components From a Large
Number of Predictors, Journal of the American Statistical Association, 97, 11671179.
Zellner, A. (1962): An Efficient Method of Estimating Seemingly Unrelated Regressions and Tests for
Aggregation Bias, Journal of the American Statistical Association, 57(298), 348368.
21
A Data Appendix
The external series where directly taken from Bloomberg. In particular, country-specific data was ob-
tained through its World Economic Statistics feature. The domestic series were taken from the CBCHs
database. Table 7 summarizes this information. The transformation codes are: 0 - no transformation, 1 -
first differences, 2 - natural logarithms and 3 - first differences of natural logarithms. The sample ranges
from 2001Q1 to 2016Q3.
22
Table 7: Data description.
23
Table 7: Data description.
24
Table 7: Data description.
25
Table 7: Data description.
26