
Appendix 14: Box's M Test


Box's M statistic is used to test for homogeneity of covariance matrices. The jth set of r dependent variables in the ith cell is

$$\mathbf{y}_{ij} = \mathbf{B}\mathbf{x}_{ij} + \mathbf{e}_{ij}, \qquad \text{where } \mathbf{e}_{ij} \sim N_r\!\left(\mathbf{0},\; w_{ij}^{-1}\boldsymbol{\Sigma}_i\right),$$

for $i = 1, \ldots, g$ and $j = 1, \ldots, n_i$. The null hypothesis of the test for homogeneity of covariance matrices is $H_0\colon \boldsymbol{\Sigma}_1 = \cdots = \boldsymbol{\Sigma}_g$. Box (1949) derived a test statistic based on the likelihood-ratio test. The test statistic is called Box's M statistic. For moderate to small sample sizes, an F approximation is used to compute its significance.
Box's M statistic is not designed to be used in a linear model context;¹ therefore, the observed cell means are used in computing the statistic.
Notation
The following notation is used throughout this chapter, unless otherwise stated:

$g$         Number of cells with non-singular covariance matrices.
$n_i$       Number of cases in the ith cell.
$n$         Total sample size, $n = n_1 + \cdots + n_g$.
$\mathbf{y}_{ij}$    The jth set of dependent variables in the ith cell. A column vector of length r.
$w_{ij}$    Regression weight associated with $\mathbf{y}_{ij}$. It is assumed that $w_{ij} > 0$.

¹ Although Anderson (1958, Section 10.2) mentioned that the population cell means can be expressed as linear combinations of parameters, he assumed that the combination coefficients are different for different cells, which is not the model assumed for GLM.

Statistics
Means
$$\bar{\mathbf{y}}_i = \frac{1}{n_i} \sum_{j=1}^{n_i} \mathbf{y}_{ij}$$

Cell Covariance Matrix
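$$\mathbf{S}_i = \begin{cases} \dfrac{1}{n_i - 1} \displaystyle\sum_{j=1}^{n_i} w_{ij} \left(\mathbf{y}_{ij} - \bar{\mathbf{y}}_i\right)\left(\mathbf{y}_{ij} - \bar{\mathbf{y}}_i\right)' & \text{if } n_i > 1 \\[2ex] \mathbf{0} & \text{if } n_i \le 1 \end{cases}$$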

Pooled Covariance Matrix
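$$\mathbf{S} = \begin{cases} \dfrac{1}{n - g} \displaystyle\sum_{i=1}^{g} (n_i - 1)\, \mathbf{S}_i & \text{if } n > g \\[2ex] \mathbf{0} & \text{if } n \le g \end{cases}$$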

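Box's M Statistic
$$M = \begin{cases} (n - g) \log\lvert\mathbf{S}\rvert - \displaystyle\sum_{i=1}^{g} (n_i - 1) \log\lvert\mathbf{S}_i\rvert & \text{if } \lvert\mathbf{S}\rvert > 0 \\[2ex] \text{SYSMIS} & \text{if } \lvert\mathbf{S}\rvert \le 0 \end{cases}$$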
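The statistics above can be put together in a few lines of code. The following is a minimal sketch in plain Python, not the SPSS implementation: the function names `det`, `cell_cov`, and `box_m` are hypothetical, `None` stands in for SYSMIS, and the caller is assumed to pass only cells whose covariance matrices are non-singular (the cells counted by g).

```python
import math

def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in a]          # work on a copy
    size, d = len(a), 1.0
    for k in range(size):
        p = max(range(k, size), key=lambda i: abs(a[i][k]))
        if a[p][k] == 0.0:
            return 0.0                 # singular matrix
        if p != k:
            a[k], a[p] = a[p], a[k]    # a row swap flips the sign
            d = -d
        d *= a[k][k]
        for i in range(k + 1, size):
            m = a[i][k] / a[k][k]
            for j in range(k, size):
                a[i][j] -= m * a[k][j]
    return d

def cell_cov(y, w):
    """S_i: weighted covariance about the observed cell mean; zero if n_i <= 1."""
    n_i, r = len(y), len(y[0])
    mean = [sum(row[k] for row in y) / n_i for k in range(r)]
    s = [[0.0] * r for _ in range(r)]
    if n_i > 1:
        for row, w_ij in zip(y, w):
            dev = [row[k] - mean[k] for k in range(r)]
            for p in range(r):
                for q in range(r):
                    s[p][q] += w_ij * dev[p] * dev[q] / (n_i - 1)
    return s

def box_m(cells, weights=None):
    """Box's M for a list of cells (each a list of length-r rows)."""
    if weights is None:                # assumption: unit weights by default
        weights = [[1.0] * len(y) for y in cells]
    g, n, r = len(cells), sum(len(y) for y in cells), len(cells[0][0])
    covs = [cell_cov(y, w) for y, w in zip(cells, weights)]
    if any(det(s) <= 0.0 for s in covs):
        raise ValueError("every cell passed in must have a non-singular S_i")
    pooled = [[sum((len(y) - 1) * s[p][q] for y, s in zip(cells, covs)) / (n - g)
               for q in range(r)] for p in range(r)]
    det_s = det(pooled)
    if det_s <= 0.0:
        return None                    # SYSMIS
    return (n - g) * math.log(det_s) - sum(
        (len(y) - 1) * math.log(det(s)) for y, s in zip(cells, covs))
```

As a hand-checkable case with r = 1, the cells `[[0.0], [2.0]]` and `[[0.0], [4.0]]` have variances 2 and 8, the pooled variance is 5, and `box_m` returns 2 log 5 - log 2 - log 8 = log(25/16).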


Significance
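$$1 - \mathrm{CDF.F}\left(\gamma M,\; f_1,\; f_2\right)$$

where CDF.F is the SPSS function for the cumulative F distribution and

$$f_1 = \frac{(g - 1)\, r\, (r + 1)}{2}$$

$$\rho = 1 - \frac{2r^2 + 3r - 1}{6 (r + 1)(g - 1)} \left( \sum_{i=1}^{g} \frac{1}{n_i - 1} - \frac{1}{n - g} \right)$$

$$\tau = \frac{(r - 1)(r + 2)}{6 (g - 1)} \left( \sum_{i=1}^{g} \frac{1}{(n_i - 1)^2} - \frac{1}{(n - g)^2} \right)$$

$$f_2 = \frac{f_1 + 2}{\left\lvert \tau - (1 - \rho)^2 \right\rvert}$$

$$\gamma = \frac{\rho - f_1 / f_2}{f_1}$$
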

The significance is a system-missing value whenever the denominator is zero in the
above expression.
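The degrees of freedom and the scaling of M can be sketched as follows. This is a minimal illustration that assumes the standard form of Box's F approximation; the function name `box_m_f_approx` and the variable names `rho`, `tau`, and `gamma` for the intermediate quantities are choices made for this sketch, and `None` stands in for a system-missing result.

```python
def box_m_f_approx(m, r, cell_sizes):
    """Return (scaled M, f1, f2), or None when a denominator is zero.

    m          : Box's M statistic
    r          : number of dependent variables
    cell_sizes : the n_i of the g cells with non-singular covariance matrices
    """
    g, n = len(cell_sizes), sum(cell_sizes)
    f1 = (g - 1) * r * (r + 1) / 2
    # Zero denominators (g = 1, n = g, or some n_i = 1) give a missing value.
    if f1 == 0 or n == g or any(n_i == 1 for n_i in cell_sizes):
        return None
    s1 = sum(1 / (n_i - 1) for n_i in cell_sizes) - 1 / (n - g)
    s2 = sum(1 / (n_i - 1) ** 2 for n_i in cell_sizes) - 1 / (n - g) ** 2
    rho = 1 - (2 * r * r + 3 * r - 1) / (6 * (r + 1) * (g - 1)) * s1
    tau = (r - 1) * (r + 2) / (6 * (g - 1)) * s2
    denom = abs(tau - (1 - rho) ** 2)
    if denom == 0:
        return None
    f2 = (f1 + 2) / denom
    gamma = (rho - f1 / f2) / f1
    return gamma * m, f1, f2
```

The significance is then the upper tail of the F distribution at the scaled statistic. If SciPy is available, `scipy.stats.f.sf(scaled_m, f1, f2)` evaluates that tail, which plays the role of 1 - CDF.F here.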
References
Anderson, T. W. 1958. Introduction to multivariate statistical analysis. New York:
John Wiley & Sons, Inc.
Box, G. E. P. 1949. A general distribution theory for a class of likelihood criteria.
Biometrika, 36: 317–346.
Seber, G. A. F. 1984. Multivariate observations. New York: John Wiley & Sons,
Inc. (Section 9.2.6).