Lecture 2.1 Regression

Lecture 2.
1
BEA653
Applied Quantitative Finance
Vladimir Volkov
Tasmanian School of Business and Economics
Sandy Bay, Hobart
Chapter 3
A brief overview of the

classical linear regression model
‘Introductory Econometrics for Finance’ © Chris Brooks 2013 2

Regression
• Regression is probably the single most important tool at the

econometrician’s disposal.
But what is regression analysis?
• It is concerned with describing and evaluating the relationship between

a given variable (usually called the dependent variable) and one or
more other variables (usually known as the independent variable(s)).

Some Notation
• Denote the dependent variable by y and the independent variable(s) by x1, x2,
... , xk where there are k independent variables.
• Some alternative names for the y and x variables:

y x
dependent variable independent variables
regressand regressors
effect variable causal variables
explained variable explanatory variable
• Note that there can be many x variables but we will limit ourselves to the
case where there is only one x variable to start with. In our set-up, there is
only one y variable.
Regression is different from Correlation
• If we say y and x are correlated, it means that we are treating y and x in

a completely symmetrical way.
• In regression, we treat the dependent variable (y) and the independent

variable(s) (x’s) very differently. The y variable is assumed to be
random or “stochastic” in some way, i.e. to have a probability
distribution. The x variables are, however, assumed to have fixed
(“non-stochastic”) values in repeated samples.

Simple Regression
• For simplicity, say k=1. This is the situation where y depends on only one x
variable.
• Examples of the kind of relationship that may be of interest include:

– How asset returns vary with their level of market risk
– Measuring the long-term relationship between stock prices and
dividends.
– Constructing an optimal hedge ratio

Simple Regression: An Example
• Suppose that we have the following data on the excess returns on a fund
manager’s portfolio (“fund XXX”) together with the excess returns on a
market index:
Year, t Excess return Excess return on market index
= rXXX,t – rft = rmt - rft
1 17.8 13.7
2 39.0 23.2
3 12.8 6.9
4 24.2 16.8
5 17.2 12.3
• We have some intuition that the beta on this fund is positive, and we
therefore want to find whether there appears to be a relationship between
x and y given the data that we have. The first stage would be to form a
scatter plot of the two variables.

Graph (Scatter Diagram)
45
Excess return on fund XXX
40
35
30
25
20
15
10
5
0
0 5 10 15 20 25
Excess return on market portfolio

Finding a Line of Best Fit
• We can use the general equation for a straight line,

y=a+bx
to get the line that best “fits” the data.
• However, this equation (y=a+bx) is completely deterministic.
• Is this realistic? No. So what we do is to add a random disturbance

term, u into the equation.
yt = α + βxt + ut
where t = 1,2,3,4,5

Why do we include a Disturbance term?
• The disturbance term can capture a number of features:
- We always leave out some determinants of yt

- There may be errors in the measurement of yt that cannot be
modelled.
- Random outside influences on yt which we cannot model

Determining the Regression Coefficients
• So how do we determine what α and β are?

• Choose α and β so that the (vertical) distances from the data points to the
fitted lines are minimised (so that the line fits the data as closely as
possible):

Ordinary Least Squares
• The most common method used to fit a line to the data is known as
OLS (ordinary least squares).
• What we actually do is take each distance and square it (i.e. take the
area of each of the squares in the diagram) and minimise the total sum
of the squares (hence least squares).
• Tightening up the notation, let

yt denote the actual data point t
ŷt denote the fitted value from the regression line
ût denote the residual, yt - ŷt

Actual and Fitted Value
• - y
yi
û i
ŷi
xi x

How OLS Works
• So min. uˆ1 + uˆ 2 + uˆ3 + uˆ 4 + uˆ5 , or minimise

2 2 2 2 2
∑ uˆ
t =1
2
t . This is known
as the residual sum of squares.
• But what was ût ? It was the difference between the actual point and
the line, yt - ŷt .
• So minimising ( y − ˆ
y )
∑ t t is equivalent to minimising
2
∑ t
ˆ
u 2
with respect to α and β .

Deriving the OLS Estimator
ˆ t = αˆ + βˆxt , so let
• But y L = ∑ ( yt − yˆ t ) 2 = ∑ ( yt − αˆ − βˆxt ) 2
t i
• Want to minimise L with respect to (w.r.t.) α and β , so differentiate L

w.r.t. α and β
∂L
∂αˆ t
∑
= −2 ( yt − αˆ − βˆxt ) = 0 (1)
∂L (2)
= −2∑ xt ( yt − αˆ − βˆxt ) = 0
∂βˆ t
• From (1), ∑ ( y t − αˆ − βˆx t ) = 0⇔ ∑ y t − Tαˆ − βˆ ∑ x t = 0

t
• But ∑ y t = Ty and ∑ x t = Tx .

Deriving the OLS Estimator (cont’d)
• So we can write Ty − Tαˆ − Tβˆx = 0 or y − αˆ − βˆx = 0 (3)

• From (2), ∑ x ( y − αˆ − βˆx ) = 0
t t t
(4)
t
• From (3), αˆ = y − βˆx (5)

• Substitute into (4) for α from (5),
∑ xt ( yt − y + βˆx − βˆxt ) = 0
t
∑ t t ∑ t
x y − y x + β
ˆx ∑ t
x − β
ˆ ∑ t =0
x
2
∑ t t
x y − T y x + β
ˆT x 2
− β
ˆ ∑ t =0
x
2

Deriving the OLS Estimator (cont’d)
• Rearranging for β ,
βˆ (Tx 2 − ∑ xt2 ) = Tyx − ∑ xt yt
• So overall we have
β=
ˆ ∑ xt yt − Tx y
andαˆ = y − β
ˆx
∑ xt2 − Tx 2
• This method of finding the optimum is known as ordinary least squares.

What do We Use α and β For?
• In the CAPM example, plugging the 5 observations in to make up the formulae

given above would lead to the estimates
α = -1.74 and β= 1.64. We would write the fitted line as:
yˆ t = −1.74 + 1.64 x t
• Question: If an analyst tells you that she expects the market to yield a return
20% higher than the risk-free rate next year, what would you expect the return
on fund XXX to be?
• Solution: We can say that the expected value of y = “-1.74 + 1.64 * value of x”,
so plug x = 20 into the equation to get the expected value for y:
yˆ i = −1.74 + 1.64 ×20 = 31.06
Linearity
• In order to use OLS, we need a model which is linear in the parameters (α

and β ). It does not necessarily have to be linear in the variables (y and x).
• Linear in the parameters means that the parameters are not multiplied
together, divided, squared or cubed etc.
• Some models can be transformed to linear ones by a suitable substitution

or manipulation, e.g. the exponential regression model
Yt = eα X tβ e ut ⇔ln Yt = α + β ln X t + ut
• Then let yt=ln Yt and xt=ln Xt
yt = α + βxt + ut
Linear and Non-linear Models
• This is known as the exponential regression model. Here, the coefficients

can be interpreted as elasticities.
• Similarly, if theory suggests that y and x should be inversely related:

β
yt = α + + ut
xt
then the regression can be estimated using OLS by substituting
1
zt =
xt
• But some models are intrinsically non-linear, e.g.
β
yt = α + xt + ut

Lecture 2.1 Regression

Hochgeladen von

Dokumentinformationen

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Lecture 2.1 Regression

Hochgeladen von

Copyright:

Verfügbare Formate

Lecture 2.

Applied Quantitative Finance

A brief overview of the

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 2

• Regression is probably the single most important tool at the

But what is regression analysis?

• It is concerned with describing and evaluating the relationship between

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 3

• Some alternative names for the y and x variables:

• If we say y and x are correlated, it means that we are treating y and x in

• In regression, we treat the dependent variable (y) and the independent

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 5

• Examples of the kind of relationship that may be of interest include:

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 6

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 7

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 8

• We can use the general equation for a straight line,

• However, this equation (y=a+bx) is completely deterministic.

• Is this realistic? No. So what we do is to add a random disturbance

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 9

• The disturbance term can capture a number of features:

- We always leave out some determinants of yt

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 10

• So how do we determine what α and β are?

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 11

• Tightening up the notation, let

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 12

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 13

• So min. uˆ1 + uˆ 2 + uˆ3 + uˆ 4 + uˆ5 , or minimise

with respect to α and β .

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 14

• Want to minimise L with respect to (w.r.t.) α and β , so differentiate L

• From (1), ∑ ( y t − αˆ − βˆx t ) = 0⇔ ∑ y t − Tαˆ − βˆ ∑ x t = 0

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 15

• So we can write Ty − Tαˆ − Tβˆx = 0 or y − αˆ − βˆx = 0 (3)

• From (3), αˆ = y − βˆx (5)

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 16

βˆ (Tx 2 − ∑ xt2 ) = Tyx − ∑ xt yt

• This method of finding the optimum is known as ordinary least squares.

‘Introductory Econometrics for Finance’ © Chris Brooks 2013 17

• In the CAPM example, plugging the 5 observations in to make up the formulae

• In order to use OLS, we need a model which is linear in the parameters (α

• Some models can be transformed to linear ones by a suitable substitution

• This is known as the exponential regression model. Here, the coefficients

• Similarly, if theory suggests that y and x should be inversely related:

Das könnte Ihnen auch gefallen