Beruflich Dokumente
Kultur Dokumente
Estimator
• A point estimate is a single value (point) derived
from a sample and used to estimate a population
value.
• A confidence interval estimate is a range of values
constructed from sample data so that the
population parameter is likely to occur within
that range at a specified probability.
• The specified probability is called the level of
confidence.
Interval Estimates - Interpretation
For a 95% confidence interval about 95% of the similarly
constructed intervals will contain the parameter being
estimated. Also 95% of the sample means for a specified
sample size will lie within 1.96 standard deviations of the
hypothesized population
How to Obtain z value for a Given
Confidence Level
The 95 percent confidence refers to
the middle 95 percent of the
observations. Therefore, the
remaining 5 percent are
equally divided between the
two tails.
Following is a portion of Appendix B.1.
Point Estimates and Confidence Intervals for a
Mean – σ Known
x sample mean
z z - value for a particular confidence level
σ the population standard deviation
n the number of observations in the sample
Sample
Difference 1 2 X1 X 2
© 2002 Prentice-Hall, Inc.
Chap 8-10
Interval Estimates
• Provides range of values
– Takes into consideration variation in sample
statistics from sample to sample
– Is based on observation from one sample
– Gives information about closeness to unknown
population parameters
– Is stated in terms of level of confidence
• Never 100% certain
Confidence
Intervals
Mean Proportion
Known Unknown
X Z / 2 X Z / 2
n n
© 2002 Prentice-Hall, Inc.
Chap 8-13
Elements of
Confidence Interval Estimation
• Level of confidence
– Confidence in which the interval will contain the
unknown population parameter
• Precision (range)
– Closeness to the unknown parameter
• Cost
– Cost required to obtain a sample of size n
Intervals X
X
extend from of1intervals
100%
X Z X constructed
to contain ;
X Z X do not.
100 %
© 2002 Prentice-Hall, Inc. Confidence Intervals
Chap 8-16
Factors Affecting
Interval Width (Precision)
• Data variation
Intervals Extend from
– Measured by
X - Z to X + Z
• Sample size x x
–
X
n
• Level of confidence
– 100 1 %
• Requires • Won’t do
the job
too many
resources
Z 1.645 45
2 2 2 2
219.2 220
n 2
2
Error 5
Round Up
© 2002 Prentice-Hall, Inc.
Chap 8-19
Sample Size Determination
20
Sample Size Determination
An essential part of planning any study is to decide how
many people need to be studied
22
Sample Size------
Generally sample size determination depends on
the:
• objective of the study;
27
1.The Level of Precision
The level of precision, sometimes called sampling error, is
the range in which the true value of the population is
estimated to be.
This range is often expressed in percentage (e.g., ±5
percent)
The absolute precision (d) is half of the confidence
interval. d = Z α /2 x SE
29
The Level of Precision
idea of how precise or narrow you want the
confidence interval
706 people
32
The Confidence Level
33
The Confidence Level
The key idea encompassed in the Central Limit Theorem
is that when a population is repeatedly sampled, the
average value of the attribute obtained by those samples
is equal to the true population value
34
Degree of Variability
The third criterion, the degree of variability in the
attributes being measured, refers to the distribution of
attributes in the population.
35
Degree of Variability
Note that a proportion of 50% indicates a greater
level of variability than either 20% or 80%. This is
because 20% & 80% indicate that a large majority
do not or do, respectively, have the attribute of
interest.
estimate ± d units
37
Estimating the number needed to estimate a single
proportion
n = (z α/2)2 p(1-p)
d2
n=number required
P=population proportion (you need to guess this
as if you knew it you would not need to do
the calculation!)
d=precision of the estimate (how close do you
want to be?)
38
Sample size for single sample
includes:
d2
40
1. Suppose that you are interested to know the
proportion of infants who breastfed >18 months of
age in a rural area. Suppose that in a similar area, the
proportion (p) of breastfed infants was found to be
0.20. What sample size is required to estimate the true
proportion within ±3% points with 95% confidence.
Let p=0.20, d=0.03, α=5%
• Suppose there is no prior information about
the proportion (p) who breastfeed
• Assume p=q=0.5 (most conservative)
• Then the required sample size increases
An estimate of p is not always available.
• However, the formula may also be used for
sample size calculation based on various
assumptions for the values of p.
• P = 0.1 n = (1.96)2(0.1)(0.9)/(0.05)2 = 138
P = 0.2 n = (1.96)2(0.2)(0.8)/(0.05)2 = 246
P = 0.3 n = (1.96)2(0.3)(0.7)/(0.05)2 = 323
P = 0.5 n = (1.96)2(0.5)(0.5)/(0.05)2 = 384
P = 0.7 n = (1.96)2(0.7)(0.3)/(0.05)2 = 323
P = 0.8 n = (1.96)2(0.8)(0.2)/(0.05)2 = 246
• For a fixed absolute precision (d), the required
sample size increases as P increases form 0 to
0.5, and then decreases in the same way as
the prevalence approaches 1.
2. A survey is planned to determine what proportion
of the medical students have regularly chewed
khat. If no estimate of p is available and a pilot
sample cannot be drawn, what sample size would
be required if a 95% confidence is desired, and
d=0.04 is to be used.
Ans: 600 students
2. Sample Size: Two Samples
A. Estimation of the difference between two
population means
B. Estimation of the difference between two
population proportions
Comparison of two proportions
• n (in each region) = (p1q1 + p2q2) (f(,)) / ((p1 - p2)²
• = type I error (level of significance)
• = type II error ( 1- = power of the study)
2 2 2 2
• Use σ1 , σ2 or estimate using s1 and s2
B. Sample size for estimating a difference
in two proportions
• Aim: Estimate p1-p2
• Want: within ± d units
where d = Zα/2•SE
(95% CI of width = w = 2d)
• If equal sample sizes in both groups, then:
n0
n=
n0
1 +
N
where n0 is the sample from an infinite population. When N is large in
comparison to n, (i.e., n/N ≤ 0.05), the finite population correction
may be ignored.