Beruflich Dokumente
Kultur Dokumente
Linearity
Normality
Constant variance
Influential points
Covariate overlap
1
If model is correct
residuals have mean zero at every value of predictor
Lowess smooth
linear fit
E[y|x]
Lowess smooth
-2
-5
-2
2
x
linear fit
E[y|x]
-2
Lowess smooth
2
x
linear fit
E[y|x]
Lowess smooth
-5
-2
-2
2
x
-2
2
x
-.4
-.2
BMD Residual
0
.2
.4
.6
50
100
150
weight (kg)
Residuals
50
100
150
weight (kg)
Predictor transformations
square of x
0
0
log of x
square root of x
1
0
0
10
3.5
4.5
11
-.4
-.2
BMD Residual
0
.2
.4
.6
3.5
4.5
12
BMD (gm/cm^2)
.5
1
1.5
10
20
BMD
30
BMI (kg/m^2)
Categorical Fit
40
50
Lowess Fit
14
BMD (gm/cm^2)
.5
1
1.5
A better tradeoff
10
20
BMD
30
BMI (kg/m^2)
Categorical Fit
40
50
Lowess Fit
15
17
.4
.6
BMD (gm/cm^2)
.8
1.2
10
20
30
BMI (kg/m^2)
BMD
40
50
18
1)
2)
3)
4)
F(
bmi1
bmi1
bmi1
bmi1
+
+
+
+
bmi2
bmi3
bmi4
bmi5
=
=
=
=
4,
272) =
Prob > F =
0
0
0
0
2.24
0.0654
19
5.5
200
400
Days Since HIV Infection
Wild Type
600
800
Any Resistance
21
24
25
100
Residuals
200
300
-100
100
Residuals
200
300
-100
Density
.005
.01
Residuals
100 200
300
.015
-100
-100
Density
.005
.01
Residuals
100
200
300
.015
-200
-100
0
Inverse Normal
100
200
27
-1.00e+07
-5000000 0 5000000
1.00e+07
1.50e+07
identity
0 100200300400
0 50000
100000
150000
-20000
40000
60000
10
15
20
-.05
5
5.5
-.12
-.1
-.08
-.06
-.04
1/cubic
-.0008
-.0006
-.0004
-.00020
-.005
300
-.1
4.5
1/square
-.01
200
-.15
4
inverse
-.015
100
1/sqrt
15
10
5
5
log
20
sqrt
20000
.00005
-.00002
-.000015
-.00001
-5.00e-06
0
-2.00e+07
0
2.00e+07
4.00e+07
6.00e+07
cubic
-3.00e-06
-2.00e-06
-1.00e-06 0
1.00e-062.00e-06
29
.4
Fraction
.3
.2
.1
-1
0
-1
0
Residuals
Density
Residuals
Inverse Normal
Residuals
Density
1.5
1
.5
-1
0
-2
-1
Residuals
1
-1
-.5
0
Inverse Normal
.5
30
(1)
34
36
37
38
20
10
Residuals
0
10
20
Fitted values
39
transformation
square root
log
arcsin
log[(1 + )/(1 )]
40
Residuals
0
1.5
2.5
Fitted values
41
42
Variance-to-Mean
Relationship
2 constant
2 = n(1 )
2 n(1 )
2 =
2
2 = + 2/k
Outcome
Continuous
Successes in n trials
Clustered successes
Counts
Counts
Counts
Continuous
over-dispersed
46
40
35
30
.
.
20
. . .
.. .
.
.
.
. .
.. ...
.
..
30
25
20
15
10
30
35
40
x
45
leverage = 0.04
. .....
.
. . .
.
.
. .
. .
.
50
dfbeta = -0.25
.
30
. ..
40
leverage = 0.52
50
60
dfbeta = -.61
.
25
20
15
.
30
. ..
. .....
.
.
.
. .
.
.
.
. .
. .
40
leverage = 0.52
50
60
dfbeta = -2.09
48
49
.2
.1
.1
.2
.3
DFbmi
DFnonwhite
DFdrinkany
DFage10
DFsmoking
50
Solution
Identify up to 10 observations with biggest DFbetas
All observations
P -Value
Omitting 4 points
P -Value
BMI
Age
Nonwhite
Smoking
Alcohol Use
0.36
1.89
5.22
4.75
2.72
0.34
1.86
4.19
3.78
2.64
0.007
0.090
0.025
0.032
0.069
0.010
0.090
0.066
0.072
0.072
52
53
30
40
50
Age
60
70
55
Number of obs
F( 3,
27)
Prob > F
R-squared
Adj R-squared
Root MSE
=
=
=
=
=
=
31
15.42
0.0000
0.6315
0.5906
1.0011
-----------------------------------------------------------------------------del_bdi |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------1.treatment |
3.217112
1.88746
1.70
0.100
-.6556366
7.08986
age |
.1247361
.0194101
6.43
0.000
.0849098
.1645623
|
treatment#|
c.age |
1 | -.0429515
.0445653
-0.96
0.344
-.1343918
.0484889
|
_cons | -1.483581
.9770828
-1.52
0.141
-3.488389
.5212275
-----------------------------------------------------------------------------56
58
.5
Density
1
1.5
-2
-1.5
-1
-.5
Logit Propensity Score
Treated
.5
Untreated
59
60
30
40
50
Age
60
70
Inference region
61
62
63
Non-normality:
Diagnostics: curvature in QQ-plot
Solutions: transform outcome, use bootstrap CIs, GLM
or ordinal model
64