Beruflich Dokumente
Kultur Dokumente
2
3
4
5
6
Mean
18.56 14.64 12.39 19.48 7.34 11.99 12.67
o
1
o
2
o
3
o
4
o
5
o
6
o
EFS: Total
Alcoholic
Beverages,
Tobacco
St. Dev.
19.0 18.5 15.0 19.7 14.6 19.1 17.8
Are there significant difference across the means of
these groups?
Or do the differences depend on the different levels of
variability across the groups?
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
5
Analysis of Variance
Here the target variable is alcohol
expenditure, the factor is the economic
position of the HRP
Basically we investigate the attribution of a
variation in the metric target variable to
variations in one on more categorical
explanatory variables (the factors)
A treatment is a combination of different
factors in n-way analysis of variance
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
6
One-way ANOVA
Only one categorical variable (a single factor)
Several levels (categories) for that factor
The typical hypothesis tested through ANOVA is
that the factor is irrelevant to explain differences
in the target variable (i.e. the means are equal, as
in bivariate mean comparisons/t-tests)
Apart from the tested factor(s), the groups should
be safely considered homogeneous between each
other
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
7
Null and alternative hypothesis for
ANOVA
Null hypothesis (H
0
): all the means are equal
Alternative hypothesis (H
1
): at least two
means are different
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
8
Arranging data for ANOVA
Economic position of Household Reference Person
Group (g)
1 2 3 4 5 6
Self-
employed
Fulltime
employee
Pt
employee
Unempl.
Ret unoc
over min
ni age
Unoc -
under
min ni
age
Observations
x
11
x
21
x
13
x
14
x
15
x
16
x
21
x
22
x
23
x
24
x
25
x
26
x
31
x
32
x
33
x
34
x
35
x
36
Number of observations (n)
n
1
n
2
n
3
n
4
n
5
n
6
Means
x
1
x
2
x
3
x
4
x
5
x
6
Overall mean x
t
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
9
The statistical distribution to carry out
ANOVA
1. Decompose the total variation (sum of
squares corrected for the mean)
2. Compute the F-test statistic
3. Choose the critical value
4. Interpret the result
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
10
One-way ANOVA: data
Suppose that we have n observation within each group and g group
Obs.
Group (factor level)
1 2 j g
1 x
11
x
12
x
1j
x
1g
2 x
21
x
22
x
2j
x
2g
i x
i1
x
i2
x
ij
x
ig
n x
n1
x
n2
x
nj
x
nn
Group mean
1
x
2
x
j
x
g
x
1
1
g
j
j
x x
g
=
=
TOTAL MEAN
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
11
Measuring and decomposing the total
variation
SUM OF SQUARES (corrected for the mean)
VARIATION BETWEEN THE GROUPS +
VARIATION WITHIN EACH GROUP =
________________________________
TOTAL VARIATION
Statistics for Marketing & Consumer Research
Copyright 2008 - Mario Mazzocchi
12
Variance decomposition
( )
2
2
1 1
1
c
n g
rc
c r
T
x x
s
n
= =
=
( )
2
2
1
1
g
c c
c
BW
x x n
s
g
=
=
( )
2
2
1 1
1
c
n g
rc c
W
c r
c
x x
s
n
= =
(TOTAL VARIANCE)
(VARIANCE BETWEEN GROUPS)
(VARIANCE WITHIN GROUPS)
( ) ( ) ( )
2 2 2
1 1 1 1 1
c r
n n g g g
rc rc c c c
c r c r c
x x x x x x n
= = = = =
= +
DEGREES OF FREEDOM
1
1 1 ( 1) 1
g
c
c
n g n g n g
=
= + = +
DEGREES OF FREEDOM
1
1 1 ( 1) 1
g
c
c
n g n g n g
=
= + = +