Beruflich Dokumente
Kultur Dokumente
BY
RAHUL A. PARSA
DRAKE UNIVERSITY
&
STUART A. KLUGMAN
SOCIETY OF ACTUARIES
Outline of Talk
OLS Regression
Generalized Linear Models (GLM)
Copula Regression
Continuous case
Discrete Case
Examples
Notation
Notation:
Y Dependent Variable
X 1 , X 2 , X k Independent Variables
Assumption
Y is related to Xs in some functional form
E[ Y | X 1 x1 X n xn ] f ( X 1 , X 2 , X n )
OLS Regression
Y is linearly related to Xs
OLS Model
Yi 0 1 X 1i 2 X 2i k X ki i
OLS Regression
2
min (Yi Yi )
Estimated Model
1
Y X ' X X 'Y
Yi 0 1 X 1i k X ki
OLS
Multivariate Normal Distribution
Assume Y , X 1 , X 2 , X k
Jointly follow a multivariate normal distribution
Then the conditional distribution of Y | X follows
normal distribution with mean and variance given by
E(Y | X x) y YX ( x x )
1
XX
Variance YY YX
1
XX
YX
GLM
Y belongs to an exponential family of distributions
E (Y | X x) g ( 0 1 x1 k xk )
1
methods
GLM
Generalization of GLM: Y can be any distribution
Copula Regression
Y can have any distribution
Each Xi can have any distribution
The joint distribution is described by a Copula
Estimate Y by E(Y|X=x) conditional mean
Copula
Ideal Copulas will have the following properties:
ease of simulation
closed form for conditional density
different degrees of association available for different
pairs of variables.
MVN Copula
CDF for MVN is Copula is
1
F ( x1 , x2 , , xn ) G ( [ F ( x1 )], [ F ( xn )])
Where G is the multivariate normal cdf with zero
Conditional Distribution in
MVN Copula
The conditional distribution of xn given x1 .xn-1 is
{F ( xn ) r T Rn11vn1}2
1
2
T 1
0.5
f ( xn | x1 xn1 ) f ( xn ) * exp 0.5 *
[
F
(
x
)]}
*
(
1
r
R
r
)
n
n
T 1
(
1
r
Rn1r )
Where
vn 1 (v1 , vn 1 )
Rn1
R T
r
Copula Regression
Continuous Case
If Y , X 1 , X k
Copula Regression
Discrete Case
When one of the covariates is discrete
Problem:
determining discrete probabilities from the Gaussian
copula requires computing many multivariate
normal distribution function values and thus
computing the likelihood function is difficult
Solution:
Replace discrete distribution by a continuous
distribution using a uniform kernel.
I (q ) n * E 2 ln( f ( X ,q ))
q
Examples
All examples have three variables
R Matrix :
0.7
0.7
0.7
0.7
0.7
0.7
Error measured by
(Yi Yi )
Example 1
Dependent X3 - Gamma
X1-Pareto
X2-Pareto
X3-Gamma
Parameters
3, 100
4, 300
3, 100
3.44, 161.11
1.04, 112.003
3.77, 85.93
MLE
Error:
Copula
59000.5
OLS
637172.8
Ex 1 - Standard Errors
Diagonal terms are standard deviations and off-
Theta1
X2 Gamma
Alpha2
Theta2
X3 Gamma
Alpha3
Theta3
R(2,1)
R(3,1)
R(3,2)
Alpha1
Theta1
Alpha2
Theta2
Alpha3
Theta3
R(2,1)
R(3,1)
R(3,2)
0.35608 0.470009
Example 1 - Cont
Maximum likelihood Estimate of Correlation Matrix
R-hat =
0.711
0.699
0.711
0.713
0.699
0.713
Example 2
Dependent X3 - Gamma
X1-Pareto
X2-Pareto
X3-Gamma
Parameters
3, 100
4, 300
3, 100
4.03, 81.04
MLE
Error:
Copula
595,947.5
OLS
637,172.8
GLM
814,264.754
Example 3
Dependent X3 - Gamma
X1-Poisson
X2-Pareto
X3-Gamma
Parameters
4, 300
3, 100
5.65
119.39
3.67, 88.98
MLE
Error:
Copula
574,968
OLS
582,459.5
Example 4
Dependent X3 - Gamma
X1-Poisson
X2-Pareto
X3-Gamma
Parameters
4, 300
3, 100
3.96, 82.48
MLE
Error:
Copula
OLS
GLM
559,888.8
582,459.5
652,708.98
Example 5
Dependent X1 - Poisson
X1-Poisson
X2-Pareto
X3-Gamma
Parameters
4, 300
3, 100
5.65
119.39
3.66, 88.98
MLE
Error:
Copula
108.97
OLS
114.66
Example 6
Dependent X1 - Poisson
Variables
X1-Poisson
X2-Pareto
X3-Gamma
Parameters
4, 300
3, 100
5.67
MLE
Error:
Copula
110.04
OLS
114.66