Beruflich Dokumente
Kultur Dokumente
Estimation
Estimation is a procedure by which a numerical value or values are assigned to a
population parameter based on the information collected from a sample.
Estimation The assignment of value(s) to a population parameter based on a value of the
corresponding sample statistic is called estimation.
The value(s) assigned to a population parameter based on the value of a sample statistic is
called an estimate. The sample statistic used to estimate a population parameter is called
an estimator.
Point Estimate
The value of a sample statistic that is used to estimate a population parameter is called a point
estimate.
Thus, the value computed for the sample mean 𝑥̅ , from a sample is a point estimate of the
corresponding population mean, 𝜇 .
Interval Estimation
In interval estimation, an interval is constructed around the point estimate, and it is stated that this
interval is likely to contain the corresponding population parameter.
The confidence level associated with a confidence interval states how much confidence we have
that this interval contains the true population parameter. The confidence level is denoted by
(1−∝)100%.
1
Although any value of the confidence level can be chosen to construct a confidence interval, the
more common values are 90%, 95%, and 99%. The corresponding confidence coefficients are .90,
.95, and .99, respectively.
𝜎 Is Known
2
Confidence Interval for 𝝁
𝑥̅ ± 𝑧𝜎𝑥̅
𝜎
where 𝜎𝑥̅ = √𝑛
The value of z used here is obtained from the standard normal distribution table.
Margin of Error The margin of error for the estimate for , denoted by E, is the quantity that is
subtracted from and added to the value of 𝑥̅ to obtain a confidence interval for 𝜇. Thus,
𝐸 = 𝑧𝜎𝑥̅
Problem
A publishing company has just published a new college textbook. Before the company decides the
price at which to sell this textbook, it wants to know the average price of all such textbooks in the
market. The research department at the company took a sample of 25 comparable textbooks and
collected information on their prices. This information produced a mean price of $145 for this
sample. It is known that the standard deviation of the prices of all such textbooks is $35 and the
population of such prices is normal.
(a) What is the point estimate of the mean price of all such college textbooks?
(b) Construct a 90% confidence interval for the mean price of all such college textbooks.
Solution
Here, 𝜎 is known and, although n < 30, the population is normally distributed.
Hence, we can use the normal distribution to find confidence interval for 𝜇. From the given
information,
(a) The point estimate of the mean price of all such college textbooks is $145; that is,
3
Point estimate of 𝜇 = 𝑥̅ = $145
(b) The confidence level is 90%, or .90. First we find the z value for a 90% confidence level.
The value is z =1.65 (from Table).
Next, we substitute all the values in the confidence interval formula for 𝜇. The 90% confidence
interval for 𝜇 is
𝑥̅ ± 𝑧𝜎𝑥̅ = 145 ± 1.65(7.0) = 145 ± 11.55
= (145 − 11.55)𝑡𝑜 (145 + 11.55) = $133.45 to $156.55
Thus, we are 90% confident that the mean price of all such college textbooks is between $133.45
and $156.55.
Problem: A sample of 1500 homes sold recently in a state gave the mean price of homes equal to
$299,720. The population standard deviation of the prices of homes in this state is $68,650.
Construct a 99% confidence interval for the mean price of all homes in this state.
Solution:
From the given information,
Problem
4
An alumni association wants to estimate the mean debt of this year’s college graduates. It is known
that the population standard deviation of the debts of this year’s college graduates is $11,800. How
large a sample should be selected so that the estimate with a 99% confidence level is within $800
of the population mean?
Solution
The alumni association wants the 99% confidence interval for the mean debt of
this year’s college graduates to be
𝑥̅ ± 800
Hence, the maximum size of the margin of error of estimate is to be $800; that is,
The value of z for a 99% confidence level is 2.58. The value of 𝜎 is given to be $11,800.
5
The following chart summarizes the above three cases.
𝜎 Is not Known
Confidence Interval for 𝝁 The (1−∝)100% confidence interval for 𝜇 under Cases I and II
is 𝑥̅ ± 𝑡𝑠𝑥̅
𝑠
where 𝑠𝑥̅ = √𝑛
The value of z used here is obtained from the standard normal distribution table.
Margin of Error The margin of error for the estimate for , denoted by E, is the quantity that is
subtracted from and added to the value of 𝑥̅ to obtain a confidence interval for 𝜇. Thus,
𝐸 = 𝑡𝑠𝑥̅
The t Distribution
The t distribution is a specific type of bell-shaped distribution with a lower height and a wider
spread than the standard normal distribution. As the sample size becomes larger, the t distribution
approaches the standard normal distribution. The t distribution has only one parameter, called the
degrees of freedom (df ). The mean of the t distribution is equal to 0, and its standard deviation is
(df)/(df –2).
EXAMPLE
Dr. Moore wanted to estimate the mean cholesterol level for all adult men living in Hartford. He
took a sample of 25 adult men from Hartford and found that the mean cholesterol level for this
sample is 186 mg/dL with a standard deviation of 12 mg/dL. Assume that the cholesterol levels
for all adult men in Hartford are (approximately) normally distributed. Construct a 95% confidence
interval for the population mean𝜇.
6
Solution
Here, 𝜎 is not known, n <30, and the population is normally distributed. Therefore, we will use
the t distribution to make a confidence interval for 𝜇. From the given information,
When we substitute all values in the formula for the confidence interval for 𝜇, the 95% confidence
interval is
Thus, we can state with 95% confidence that the mean cholesterol level for all adult men
living in Hartford is between 181.05 and 190.95 mg/dL.
EXAMPLE
Sixty-four randomly selected adults who buy books for general reading were asked how much they
usually spend on books per year. The sample produced a mean of $1450 and a standard deviation
of $300 for such annual expenses. Determine a 99% confidence interval for the corresponding
population mean.
Solution
From the given information,
Here is not known, but the sample size is large (n >30). Hence, we will use the t distribution to
make a confidence interval for 𝜇. First we calculate the standard deviation of 𝑥̅ , the number of
degrees of freedom, and the area in each tail of the t distribution:
𝑠 300
𝑠𝑥̅ = = = 37.50
√𝑛 √64
df= n−1=64 − 1 = 63
From the t distribution table, the value of t for df = 63 and .005 area in the right tail is 2.656.
7
When we substitute all values in the formula for the confidence interval for 𝜇, the 95% confidence
interval is
Thus, we can state with 99% confidence that based on this sample the mean annual expenditure
on books by all adults who buy books for general reading is between $1350.40 and $1549.60.
Recall that the population proportion is denoted by p, and the sample proportion is denoted by 𝑝̂ .
The sample proportion 𝑝̂ , is a sample statistic, and it possesses a sampling distribution. We know
that for large samples:
Note:
In the case of a proportion, a sample is considered to be large if np and nq are both greater than 5.
If p and q are not known, then 𝑛𝑝̂ and 𝑛𝑞̂ should each be greater than 5 for the sample to be large.
𝑝̂𝑞̂
𝑠𝑝̂ = √
𝑛
𝑝̂ ± 𝑧𝑠𝑝̂
The value of z used here is obtained from the standard normal distribution table for the given
8
𝑝̂𝑞̂
confidence level 𝛼, 𝑠𝑝̂ = √ 𝑛 . The term 𝑧𝑠𝑝̂ is called the margin of error, E.
Problem:
According to a survey conducted by Pew Research Center in June 2009, 44% of people aged 18
to 29 years said that religion is very important to them. Suppose this result is based on a sample of
1000 people aged 18 to 29 years.
Solution:
Let p be the proportion of all people aged 18 to 29 years who will say that religion is very important
to them, and let 𝑝̂ be the corresponding sample proportion. From the
given information,
First, we calculate the value of the standard deviation of the sample proportion as follows:
𝑝̂𝑞̂ .44×.56
𝑠𝑝̂ = √ 𝑛 = √ =.01569713
1000
Note that 𝑛𝑝̂ and 𝑛𝑞̂ are both greater than 5. Consequently, the sampling distribution of is
approximately normal, and we will use the normal distribution to make a confidence interval
about p.
(a) The point estimate of the proportion of all people aged 18 to 29 years who will say that
religion is very important to them is equal to .44; that is,
Point estimate of p =𝑝̂ =.44
(b) The confidence level is 99%, or .99. To find z for a 99% confidence level, first we find the
1−.99
area in each of the two tails of the normal distribution curve, which is 2 = .005. Then,
we look for .0050 and .0050 + .99 =.9950 areas in the normal distribution table to find the
two values of z. These two z values are (approximately) -2.58 and 2.58. Thus, we will use
z = 2.58 in the confidence interval formula. Substituting all the values in the confidence
interval formula for p, we obtain
𝑝̂ ± 𝑧𝑠𝑝̂ = .44 ± 2.58(. 01569713) = .44 ± .04 = .40 𝑡𝑜 .48 𝑜𝑟 40% 𝑡𝑜 48%
Thus, we can state with 99% confidence that .40 to .48, or 40% to 48%, of all people aged 18 to
29 years will say that religion is very important to them.
(c) The margin of error associated with this estimate of p is .04 or 4%.