Beruflich Dokumente
Kultur Dokumente
Survey elements
SAMPLING BREAKDOWN
Sampling error
Difference between sample and population
Biased sample does not represent population
some groups are over-represented; others are
under-represented
sources of bias
Probability sampling
The four stage process
1. Identify sampling frame from research
objectives
2. Decide on a suitable sample size
3. Select the appropriate technique and the
sample
4. Check that the sample is representative
4 types of probability
sample
1. Simple random sample
2. Systematic sample
3. Stratified random
sample
4. Multi-stage cluster
sample
Sample size
Choice of sample size is influenced by
10
Systematic sampleing
Select units directly from sampling frame
From a random starting point, choose every nth unit
(e.g. every 4th name)
a starting point is chosen at random , choose every nth
unit , and choices thereafter are at regular intervals. For
example, suppose you want to sample 450 employees
from 9000. 9000/450=20, so every 20 employee is
chosen after a random starting point between 1 and 20.
If the random starting point is 16, then the employees
selected are 16, 36, 56, 76, 96, 116, etc.
Make sure sampling frame has no inherent ordering if
it has, rearrange it to remove bias
12
Stratified random
sampling
Starting point is to categorise population into strata
(relevant divisions, or departments of companies for
example) i.e. stratifying the population by criterion,
Five departments, will result in 5 strata
So the sample can be proportionately
representative of each stratum
Then, randomly select within each category as for
a simple random sample
This approach will ensure the resulting sample will
be distributed in the same as the population
We can also stratify by several criteria, i.e. by both
department and gender and whether or not
employees are above or below a certain salary
level or occupational grade.
Note we can only stratify the sample if we have
the relevant information accessible.
13
14
TATISTICAL SAMPLING
Sample size
required from
a population
5,000 given a
95%
confidence
level for 2.5
margin of
error you
need a
sample size
of 1176
15
Multi-stage cluster
sampleing
Useful for widely dispersed populations
First, divide population into groups (clusters) of
units, like geographic areas, or industries,
for example
Sub-clusters (sub-groups) can then be sampled
from these clusters, if appropriate
Now randomly select units from each
(sub)cluster
Collect data from each cluster of units,
consecutively
16
Primary
Secondary
Clusters
Clusters
1
2
2
3
3
4
5
4
5
6
7
8
10
11
8
9
12
13
14
10
15
17
Multi-stage cluster
sampleing
Example: We want a nationally
representative sample of 5,000 employees
who are working for the 100 largest
companies in the UK.
Problem: Using simple random or
systematic sampling would yield a widely
dispersed sample, which would result in a
great deal of travel for interviewers.
One solution to sample companies and then
employees from each company. We
randomly sample ten companies from the
entire population of 100 largest companies
in the UK, resulting in ten clusters, and we
would then interview 500 randomly selected
Qualities of a probability
sample
Good Representative Sampleallows for generalization from
sample to population
Use inferential statistical tests
to generalize
Sample means can be used to
estimate population means
19
Sample size
Absolute size matters more than
relative size
The larger the sample, the more
precise and representative it is likely
to be
As sample size increases, sampling
error decreases
Important to be honest about the
limitations of your sample
20
Non-response
response rate = % of sample who agree to
participate (or % who provide usable data)
responders and non-responders may differ
on a crucial variable
22
Types of non-probability
sampling: 1
1. Convenience sampling
the most easily accessible individuals
useful when piloting a research instrument
may be a chance to collect data that is too good
to miss
2. Snowball sampling
researcher makes initial contact with a small
group
these respondents introduce others in their
network
23
24
Limits to generalization
findings can only be generalized to
the population from which the
sample was selected
be wary of over-generalizing in terms of
locality
Sampling-related error
inadequate sampling frame; non-response
makes it difficult to generalize findings
26