Beruflich Dokumente
Kultur Dokumente
Qualitative Response
Regression Models
House ownership, Y
12
16
20
24
Income, in thousand, X
Houseownership, Y
12
16
20
24
Shortcomings:
The LPM is plagued by
several problems, some are
surmountable and can be
overcome but some are
fundamental.
Yi = 0 . 8423 + 0 . 0915 X i
X = 8, Y =
Yi = 0 . 8423 + 0 . 0915 ( 8 )
= 0 . 1103
1
Hosue ownership, Y
1.
X = 20, Y =
Yi = 0 . 8423 + 0 . 0915 ( 22 )
= 1 . 1707
12
16
20
24
10
4.
5.
11
Nonlinear models!
12
13
Index Models
General form of the models:
Pr(Y=1|x) = G(,X)
Where G is usually a cumulative distribution function (CDF).
Through a link function, a variable which is (0,1) can be
transformed into a continuous variable.
Two wisely used link function
In logit models: G is the CDF of a logistic distribution.
In Probit Models: G is the CDF of a standard normal
distribution.
14
Index Models
Since y* is unobserved, we use do not know the
distribution of the errors,
In order to use maximum likelihood estimation (ML),
we need to make some assumption about the
distribution of the errors.
The difference between Logistic and Probit models lies
in this assumption about the distribution of the errors
15
Index Models
Probit:
standard
Normal
distribution
logit:
Standard
logistic
distribution
16
Index Models
17
How it works?
In the logit model the probability p
that the observed value y takes the
value 1 is
Pi = E (Y = 1 X i ) =
1
1+ e
-( 0 + i X i )
2.0
1.5
1.0
0.5
1
1 + e
then
=
-0.5
1
= 1
1
1
1 + e
-1.0
2
If 0 + 1 X 1i = then
P
0.0
10
12
P_LPM
14
16
18
20
22
24
P_LOGIT
= 0
18
Log-Odds Ratio:
Note that :
Probability of yes:
Pi = E (Y = 1 X i ) =
1
1+ e
( 0 + i X i )
Probability of No:
1 Pi = E (Y = 1 X i ) =
1
1 + e ( 0 + i X i )
ln
( )=
Pi
1 Pi
0 + X + ui
i i
( 0 + 1 X i )
pi
1 + e ( 0 + i X i )
=
=
e
1 pi
1 + e (0 + i X i )
19
Ni
ni
40
50
12
10
60
18
13
80
28
15
100
45
20
70
36
25
65
39
30
50
33
35
40
30
40
25
20
20
LOG_ODDS
10
5
0
-5
-10
-15
-20
2
10
12
14
16
18
20
22
24
16
18
20
22
24
X ($'000)
P_LOGIT
1.0
0.8
Prob
0.6
0.4
0.2
0.0
2
10
12
14
X ($'000)
21
5
0
-5
-10
-15
-20
2
10
12
14
16
18
20
22
24
16
18
20
22
24
X ($'000)
P_LOGIT
1.0
0.8
Prob
0.6
0.4
0.2
0.0
2
10
12
14
X ($'000)
22
23
24
1
1 + e -( 0 + i X i )
25
N(0,2)
26
27
28
Model Evaluation
Goodness of Fit
McFadden R-Squared
Overall significant
Likelihood Ratio TEST
1-LnUR/LnLR
LR=2(lnLU R lnLR)
Where the LUR is the calculated
likelihood for the full model and
LR is the calculated likelihood for
the restricted model.
The test is distributed according
to a chi-squares distributed, with
the
degree
of
freedom
corresponding to the number of
restrictions.
29