Beruflich Dokumente
Kultur Dokumente
Forecasting
Regression overview
Relationships between variables
Model building
Least Squares estimation
Introduction to Regression
Inference
Dynamics
Prediction
2
Relationships Between
Regression Overview Variables
Regression is a useful tool for
Economic variables often do not exist in
Quantification of business process isolation
Policy analysis
There exist relationships between them
However, regression analysis requires
Sales = f(Advertising)
* Model building
More data collection
Consumption = f(Income)
Price inflation = f(Interest rates)
Regression based forecasting requires
Changes in one variable may precede or
* Prior forecasts of explanatory series
* Sources of uncertainty & error
cause changes in the other
3 4
2.5 180
How might it be described?
2.0 150
1.5
120
1.0 Which is the best trend to
0.5 90 describe this data?
0.0
60
0% 5% 10% 15% 20% 25% 30% 35%
Annual Price Inflation 1955 1965 1975 1985 1995 2005
5 6
1
Model Building Model Building
25 A linear relationship
Suppose we have a dependent variable, y
When x increases/decreases by one unit,
which we wish to explain in terms of another 20
y increases/decreases by 2 units
variable x y = 3 + 2x
2
15
As a first approximation we might assume
y
Unfortunately exact relationships are The model is amended to reflect the impact
extremely rare of these random departures from the
Deviations from the relationship may be due relationship as follows
to E(y) = 0 + 1x
Unforeseen (random) events or
Data errors
y = 0 + 1x +
Model errors
where represents the sum of random
elements in the model
11 12
2
Linear Regression Linear Regression
30 Number of Heads when coins are tossed Because of the random factors (errors) in the
relationship we can not determine exactly the
y = Number of heads
25
20
{ true values of 0 & 1
15 E(y) = 0 + x We will denote the estimated relationship
10
y = b0 + b1 x
5 How should we choose b0 & b1 to give an
0
estimated relationship that best describes the
15 20 25 30 35 40 45 data?
x = Number of coins tossed
13 14
30 Number of Heads when coins are tossed The best fitted line is defined as the one
define residuals which minimises the (sum of) squared
y = Number of heads
25
20
e = y (b0 + b1x) { residuals (SSR)
15 This is the principle of Ordinary Least
10
Squares
5 Which values of b0 & b1 define a
line that best fits the data?
0
15 20 25 30 35 40 45
x = Number of coins tossed
15 16
3
Least Squares Regression Least Squares Regression
We may calculate the SSR 30 Number of Heads when coins are tossed
The estimated relationship is not
y = Number of heads
25
x y y e = y y e necessarily equal to the actual
20 8 7 1 1 20 average relationship
Because the parameters are estimated using Given the Least Squares Assumptions, we can
random data, they are themselves random describe the random nature of our estimates
Given our assumptions, Least Squares Let bk be the estimate of k in
estimates are BLU yi = 0 + 1xi + 2wi + + K-1zi + i ; i = 1,,n
Best (minimum variance) of all Linear Unbiased
estimators bk ~ N( k , var(bk) )
The (estimated) standard deviation of bk is
called the standard error
23 24
4
Least Squares Regression Inference
(y i y ) ANOVA analysis
2
i =1 General restrictions
SSR
An estimator of var() = 2 2
is s =
n K
25 26
5
Confidence Intervals Hypothesis Testing
31 32
If u R0 = {u: |u| < c} we accept H0 ANOVA analysis allows us to test the total
explanatory power of the model
If u RA = {u: |u| > c} we accept HA with
H0 the dependent variable is not significantly
(1 ) confidence
related to any of the regressors
c is the /2 value from tnK tables H0 all of k = 0 for k = 1, K-1
HA the dependent variable is significantly
related to at least one regressor
HA at least one k 0 for k = 1, K-1
33 34
6
General Restrictions Dynamics
The test statistic may be calculated using
In time series analysis, we allow for variables
n K 1 SSR 2 SSR1 leading or lagging each other using
u = dynamics
K1 K2 SSR1
Let yt denote UK Real GDP measured at time t
If u R0 = {u: u < c} we accept H0 Let rt denote UK interest rates measured at
If u RA = {u: u > c} we accept HA with time t
(1 ) confidence
c is the value from F(K 1 K 2 ,n K 1 ) tables
37 38
Dynamics Dynamics
It is argued by the Bank of England that a Lags are easily created in our data set by
change in r will not impact on y for 18 shifting the observations through time
months. If the data is collected quarterly, this t UK GDP (M) = yt yt-1 yt-2 yt-3
Mar-90 132951
is six quarters Jun-90 132552 132951
Thus, instead of regressing yt on rt we should Sep-90 139408 132552 132951
Dec-90 146207 139408 132552 132951
regress it on rt-6 Mar-91 136716 146207 139408 132552
Jun-91 139950 136716 146207 139408
Sep-91 146830 139950 136716 146207
Dec-91 152178 146830 139950 136716
39 40
7
Prediction
43