Beruflich Dokumente
Kultur Dokumente
Outline
Features of Multi-stage Sample Designs Selection probabilities in multistage sampling Estimation of parameters Calculation of standard errors Efficiency of multi-stage samples
2
Introduction
Multi-stage sampling means what its name suggests -> there are multiple stages in the sampling process The number of stages can be numerous, although it is rare to have more than 3 For this topic we will concentrate on two-stage sampling
EAs
Dwelling
A
Persons
C
5
Your Examples
Estimation Domains Stratification Number of stages Sampling units for each stage Sample selection scheme in each stage Sampling frames used in each stage
Two-Stage Sampling
Stage One. Select sample of clusters from population of clusters.
Using any sampling scheme, usually: SRSWOR, PPSWR, LSS
Stage Two. Select sample of elements within each of the sample clusters.
Language: also referred to as subsample of elements within a cluster Subsampling can be done also using any sampling scheme
8
Most Large-Scale Surveys Use Multi-stage Sampling Because Sampling frames are available at higher
stages but not for the ultmate sampling units. Construction of sampling frames at each lower stage becomes less costly. Cost efficiency with use of clusters at higher stages of selection Flexibility in choice of sampling units and methods of selection at different stages Contributions of different stages towards 9 sampling variance may be estimated
Probabilities of Selection
Probability that an element in the population is selected in a 2-stage sample is the product of
Probability that the cluster to which it belongs is selected at the first stage Probability that the element is selected at the second stage given that the cluster to which it belongs is selected at the first stage
10
11
SRS at stage 1 and SRS at stage 2 SRS at stage 1 and LSS at stage 2 (b from B) PPSWR at stage 1 and SRS at stage 2 (b from B)
12
Y1 s
A a A a = Y = B y a =1 a =1
1 A
()
1 1 A A 1 1 1 1 Sb + B S A a =1 b B
1 Sb = 1( Y Y ) A 1=
1 B
1 S = =1(Y Y ) B 1
13
()
1 1 A A 1 1 1 1 Sb + B S A a =1 b B
( )
+
a
1 1 B b B
1
1 1 sb = 1 Y a 1Y a 1= =
1 a 1 b b
1 1 s = y b y b 1 1 = 1 =
15
()
A
where
1 S = (Y Y ) A 1 1 = 1A 1 1 B 1 1 S w = S ; S = Y 1 ( Y A =1 B 1 =
1 b
16
()
1 s = =1 a1
1 b
s b a
y Y 1 1 s = ( b 1
b
b sw + 1 B ab
1a 1 1 sw = s ; a =1
1 =
y )
17
) =( 1 1 + 1 V (Y1 )S 11 b a A aA
1 B 1 1 (1 B )S B b =1
) = ( 1 1 s 1 + 1 B ( 1 1) s 1 v(Y1 )1 B b B b a A aA =1
a
1 1
18
1 A )= bias( Y1 1 ( B B)Y AB =
) = ( 1 1 S1 + 1 ( 1 1) S1 V (Y1 ) 1 1 b B b a A aA =
A
19
Y1=
B
=1 a =1
1 1 1 1 A 1 1 1 1 V (Y1) = ( ) S1 + 1 B ( ) S b a A B aA =1 b B
20
1 1 A1 1 1 1 V Y = b s + 1 ( =A a A ab B 1
()
1 S
( ))
1 + b
s v Y a
( )
1 b
21
; Y = B y
a 1
Y 1 v Y = Y p a ( a 1 =1 )
( )
22
If is positive, the design effect decreases as the subsample size b decreases. For fixed n=ab, the smaller the sub-sample size and, hence, the larger the number of clusters included in the sample, the more
23
25
c1 w / c1 b1 S1 S
a* =
Sb1 * (C C1) c1 cS + c S
1 1 b 1 1 w
28
1 w 1 b
29