HYPOTHESIS TESTING
3.1 BASIC CONCEPTS IN HYPOTHESIS TESTING
Statistical Decisions
Very often in practice we are called upon to make decisions about populations on the basis of
sample information. Such decisions are called statistical decisions.
For example, we may wish to decide on the basis of sample data whether a new serum is really
effective in curing a disease, whether one educational procedure is better than another, etc.
                        Actual situation
Decision                $H_0$ is true                     $H_0$ is false
Do not reject $H_0$     Correct decision                  Type II error ($\beta$ error)
Reject $H_0$            Type I error ($\alpha$ error)     Correct decision
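The meaning of a Type I error can be illustrated by simulation. The sketch below is only an illustration under stated assumptions: it uses the right-tailed z-test with known standard deviation that is derived later in section 3.1.2, and the function name `type_i_error_rate`, the population parameters, the sample size and the seed are all hypothetical choices. It repeatedly draws samples from a population for which $H_0$ is true and counts how often $H_0$ is wrongly rejected.

```python
import random
from math import sqrt
from statistics import NormalDist


def type_i_error_rate(mu0=0.0, sigma=1.0, n=25, alpha=0.05, trials=10_000, seed=0):
    """Estimate how often a right-tailed z-test rejects H0 when H0 is true."""
    rng = random.Random(seed)                    # fixed seed so the run is repeatable
    z_crit = NormalDist().inv_cdf(1 - alpha)     # critical value z_alpha
    rejections = 0
    for _ in range(trials):
        # Sample from a population whose true mean equals mu0 (H0 holds).
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        x_bar = sum(sample) / n
        z_cal = (x_bar - mu0) / (sigma / sqrt(n))
        if z_cal > z_crit:                       # rejecting a true H0: Type I error
            rejections += 1
    return rejections / trials


rate = type_i_error_rate()
print(f"estimated Type I error rate: {rate:.3f}")
```

The estimated rate should be close to the chosen level of significance, since the level of significance is the probability of a Type I error.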
Type of Tests
Based on the form of the null and alternative hypotheses, we have two types of tests: one-sided
(one tailed) tests and two-sided (two tailed) tests.
i) Tests of the form

$$H_0: \mu = \mu_0 \ \text{ or } \ H_0: \mu \ge \mu_0 \quad \text{against} \quad H_A: \mu < \mu_0$$

OR

$$H_0: \mu = \mu_0 \ \text{ or } \ H_0: \mu \le \mu_0 \quad \text{against} \quad H_A: \mu > \mu_0,$$

in which the alternative hypothesis is a strict inequality in one direction, are called one-sided (one-tailed) tests.
Here $\mu_0$ is the hypothesized (assumed) mean.
ii) Tests of the form

$$H_0: \mu = \mu_0 \quad \text{against} \quad H_A: \mu \ne \mu_0,$$

in which the critical region (rejection region) includes both large and small values of the test
statistic, are called two-sided or two-tailed tests.
3.1.2 TESTS ABOUT MEAN
i) Population normal, $\sigma^2$ known, sample large or small
Suppose we have a random sample of size $n$ (small or large) from a normal population with mean
$\mu$ and variance $\sigma^2$, where $\sigma^2$ is known.
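1. To test the hypothesis

$$H_0: \mu = \mu_0 \quad \text{against} \quad H_A: \mu > \mu_0,$$

where $\mu_0$ is a specific value; i.e., under $H_0$ there is no difference between the population mean $\mu$ and the specified value $\mu_0$. Since
the best estimator of $\mu$ is $\bar{X}$, the test statistic must depend on $\bar{X}$.

We know that $\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$, so

$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0, 1).$$

If $H_0$ is true, then

$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} \sim N(0, 1).$$

$Z_{\text{calculated}} = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}$ is called the test statistic for testing a single mean.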
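Let $\alpha$ = level of significance (the probability of a type I error).

[Figure: standard normal curve with area $1-\alpha$ to the left of $z_\alpha$ and area $\alpha$ to the right of $z_\alpha$.]

$$P\left(\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} > z_\alpha\right) = \alpha$$

The critical region ($H_0$ rejected) is

$$\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} > z_\alpha,$$

where $z_\alpha = z_{\text{tabulated}}$ is the critical value that can be obtained from the standard normal
distribution table, and $\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}$ is the calculated test statistic.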
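The tabulated critical value $z_\alpha$ can also be computed instead of read from a table. The following is a minimal sketch using `statistics.NormalDist` from the Python standard library; the helper name `z_upper` is an assumption made for illustration.

```python
from statistics import NormalDist


def z_upper(alpha):
    """Return z_alpha, the point with upper-tail area alpha under N(0, 1)."""
    return NormalDist().inv_cdf(1 - alpha)


for alpha in (0.05, 0.025, 0.01):
    print(f"alpha = {alpha}: z_alpha = {z_upper(alpha):.3f}")
# alpha = 0.05: z_alpha = 1.645
# alpha = 0.025: z_alpha = 1.960
# alpha = 0.01: z_alpha = 2.326
```

Printed tables often round these values, for example 2.326 to 2.33.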
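2. If

$$H_0: \mu = \mu_0 \quad \text{against} \quad H_A: \mu < \mu_0,$$

[Figure: standard normal curve with the critical region of area $\alpha$ to the left of $-z_\alpha$ and the acceptance region of area $1-\alpha$ to the right of $-z_\alpha$.]

then

$$P\left(\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_\alpha\right) = \alpha.$$

The critical region is

$$\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_\alpha.$$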
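3. For the two-tailed test

$$H_0: \mu = \mu_0 \quad \text{against} \quad H_A: \mu \ne \mu_0,$$

[Figure: standard normal curve with critical regions of area $\alpha/2$ in each tail, to the left of $-z_{\alpha/2}$ and to the right of $z_{\alpha/2}$, and the acceptance region of area $1-\alpha$ between them.]

$$P\left(-z_{\alpha/2} < \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < z_{\alpha/2}\right) = 1 - \alpha \quad \text{(acceptance region)}$$

$$P\left(\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_{\alpha/2}\right) = \frac{\alpha}{2} \quad \text{and} \quad P\left(\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} > z_{\alpha/2}\right) = \frac{\alpha}{2}$$

$$P\left(\left|\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}\right| > z_{\alpha/2}\right) = \alpha$$

Thus the critical region ($H_0$ rejected) is

$$\left|\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}\right| > z_{\alpha/2}.$$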
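The three critical regions above can be collected into one small routine. This is a sketch under stated assumptions, not a standard library API: the function name `z_test_mean`, its argument names and the `tail` labels are hypothetical, and the numbers in the usage lines are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist


def z_test_mean(x_bar, mu0, sigma, n, alpha, tail):
    """One-sample z-test for a mean when sigma is known.

    tail: "right" (H_A: mu > mu0), "left" (H_A: mu < mu0) or "two" (H_A: mu != mu0).
    Returns (z_cal, z_crit, reject), where z_crit is the positive critical value.
    """
    z_cal = (x_bar - mu0) / (sigma / sqrt(n))
    std_normal = NormalDist()
    if tail == "right":
        z_crit = std_normal.inv_cdf(1 - alpha)
        reject = z_cal > z_crit
    elif tail == "left":
        z_crit = std_normal.inv_cdf(1 - alpha)
        reject = z_cal < -z_crit
    elif tail == "two":
        z_crit = std_normal.inv_cdf(1 - alpha / 2)   # alpha is split between the tails
        reject = abs(z_cal) > z_crit
    else:
        raise ValueError("tail must be 'right', 'left' or 'two'")
    return z_cal, z_crit, reject


# Hypothetical data: x_bar = 52, mu0 = 50, sigma = 5, n = 25, so z_cal = 2.0.
print(z_test_mean(52, 50, 5, 25, alpha=0.05, tail="two"))    # rejects H0
print(z_test_mean(52, 50, 5, 25, alpha=0.01, tail="right"))  # does not reject H0
```

The same sample can lead to different decisions depending on the form of the alternative hypothesis and the level of significance, which is why both are fixed before the data are examined.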
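a) Step 1: $H_0: \mu = \mu_0 = 1800$
$H_A: \mu \ne \mu_0 = 1800$
Step 2: $\alpha = 5\% = 0.05$
Step 3: The population is normal, $\sigma = 200$ (known), $n = 400$.
Then the test statistic is

$$Z_{cal} = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}$$

Step 4: The critical region is

$$\left|\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}\right| > z_{\alpha/2}$$

But $\bar{X} = 1780$, $\sigma = 200$, $n = 400$ and $\mu_0 = 1800$. Then

$$Z_{cal} = \frac{1780 - 1800}{200/\sqrt{400}} = -2, \qquad |Z_{cal}| = 2$$

$$Z_{tab} = z_{\alpha/2} = z_{0.05/2} = z_{0.025} = 1.96 \quad \text{(from a table)}$$

Step 5: Since $|Z_{cal}| > Z_{tab}$, reject $H_0$. That is, the claim of the producer is not correct.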
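b) $H_0: \mu = 1800$
$H_A: \mu < 1800$
$\alpha = 0.01$
The test statistic is

$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}$$

The critical region ($H_0$ is rejected) is

$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_\alpha$$

$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} = \frac{1780 - 1800}{200/\sqrt{400}} = -2$$

$$Z_{tab} = -z_\alpha = -z_{0.01} = -2.33$$

Since $-2 > -2.33$, we do not reject $H_0$. That is, at the 1% level there is no sufficient evidence that the average life length is less than the producer's
claim.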
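The arithmetic of both tests in this example (the two-tailed test at $\alpha = 0.05$ and the left-tailed test at $\alpha = 0.01$) can be checked with a few lines of Python. This is only a verification sketch; the variable names are chosen for illustration, and the critical values are computed rather than read from a table.

```python
from math import sqrt
from statistics import NormalDist

x_bar, mu0, sigma, n = 1780, 1800, 200, 400
z_cal = (x_bar - mu0) / (sigma / sqrt(n))       # (1780 - 1800) / (200 / 20) = -2.0

# Two-tailed test at alpha = 0.05: reject H0 when |z_cal| > z_{alpha/2}.
z_half = NormalDist().inv_cdf(1 - 0.05 / 2)     # about 1.96
reject_two_tailed = abs(z_cal) > z_half

# Left-tailed test at alpha = 0.01: reject H0 when z_cal < -z_alpha.
z_01 = NormalDist().inv_cdf(1 - 0.01)           # about 2.33
reject_left_tailed = z_cal < -z_01

print(z_cal, reject_two_tailed, reject_left_tailed)   # -2.0 True False
```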
Example 8.15: According to the advertisement of a car manufacturing company, its cars
average at least 32 miles per gallon (mpg) in the city. From past records it is known that
mileage is normally distributed with a standard deviation of 2.5 mpg. Tests on 16 cars showed
that the mean mileage in the city is 31.5 mpg. Do the data support the advertisement at the 99
percent confidence level?
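Solution: Given: $\mu_0 = 32$ mpg, $\sigma = 2.5$ mpg,
$\bar{X} = 31.5$ mpg, $n = 16$
Step 1: $H_0: \mu \ge 32$ mpg
$H_A: \mu < 32$ mpg
Step 2: $\alpha = 0.01$
Step 3: The test statistic is:

$$Z_{cal} = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} = \frac{31.5 - 32}{2.5/\sqrt{16}} = -0.8$$

Step 4: The critical region is

$$\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_\alpha$$

$z_{cal} = -0.8$, $z_{tab} = -z_\alpha = -2.33$
Step 5: Since $-0.8 > -2.33$, we do not reject $H_0$. That is, the data do not contradict the claim that the mean mileage is at least 32 mpg.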
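The critical region of this example can equivalently be written in terms of the sample mean itself: rearranging $\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_\alpha$ gives $\bar{X} < \mu_0 - z_\alpha \, \sigma/\sqrt{n}$. The sketch below computes both forms; the variable names are illustrative assumptions, and the critical value is computed rather than taken from a table.

```python
from math import sqrt
from statistics import NormalDist

x_bar, mu0, sigma, n, alpha = 31.5, 32, 2.5, 16, 0.01

std_error = sigma / sqrt(n)                    # 2.5 / 4 = 0.625
z_cal = (x_bar - mu0) / std_error              # -0.8
z_alpha = NormalDist().inv_cdf(1 - alpha)      # about 2.33

# Smallest sample mean that would still NOT lead to rejecting H0.
x_bar_threshold = mu0 - z_alpha * std_error    # about 30.55 mpg

reject = z_cal < -z_alpha                      # same decision as x_bar < x_bar_threshold
print(z_cal, round(x_bar_threshold, 2), reject)
```

A sample mean of 31.5 mpg lies above the threshold, so both forms of the rule give the same decision: $H_0$ is not rejected.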
ii) Non-normal population, large sample, $\sigma$ known/unknown
We wish to test

$$H_0: \mu = \mu_0 \quad \text{against one of} \quad H_A: \mu > \mu_0, \quad H_A: \mu < \mu_0, \quad H_A: \mu \ne \mu_0.$$

By the central limit theorem, if $n$ is large ($n \ge 30$) the sample mean $\bar{X}$ is approximately normal,
i.e. $\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$ approximately, so that

$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0, 1).$$

The critical regions for all these tests are the same as in case (i) above. If $\sigma$ is
unknown, estimate it by the sample standard deviation $S$, where

$$S = \sqrt{\frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1}},$$

and then

$$Z = \frac{\bar{X} - \mu}{S/\sqrt{n}} \sim N(0, 1).$$
Example 8.16: A company has a computer system that can process at most 1200 bills per hour.
A new system is tested which processes an average of 1260 bills per hour with a standard
deviation of 215 bills in a sample of 40 hours. Test if the new system is significantly better than
the old one at the 5% level of significance.
Solution:
Step 1: $H_0: \mu \le 1200$
$H_A: \mu > 1200$
Step 2: $\alpha = 5\% = 0.05$
Step 3: The population is non-normal and $\sigma$ is unknown,
but $n = 40 \ge 30$ is large, with $S = 215$ and $\bar{x} = 1260$.
Then by the CLT the test statistic is

$$Z_{cal} = \frac{\bar{X} - \mu_0}{S/\sqrt{n}} = \frac{1260 - 1200}{215/\sqrt{40}} = \frac{60}{215/6.32} = 1.76$$

Step 4: The critical region is

$$\frac{\bar{X} - \mu_0}{S/\sqrt{n}} > z_\alpha,$$

with $Z_{tab} = z_\alpha = z_{0.05} = 1.645$.
Step 5: Since $z_{cal} > z_{tab}$, we reject $H_0$. That is, we conclude that the new system represents an
improvement over the old system at the $\alpha = 0.05$ level of significance.
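The large-sample computation of Example 8.16, where the sample standard deviation $S$ replaces the unknown $\sigma$, can be checked as follows. This is a verification sketch with illustrative variable names, not part of the original solution.

```python
from math import sqrt
from statistics import NormalDist

x_bar, mu0, s, n, alpha = 1260, 1200, 215, 40, 0.05

# n >= 30, so by the CLT the statistic is treated as approximately N(0, 1).
z_cal = (x_bar - mu0) / (s / sqrt(n))        # 60 / (215 / 6.32...), about 1.76
z_alpha = NormalDist().inv_cdf(1 - alpha)    # about 1.645

reject = z_cal > z_alpha                     # right-tailed critical region
print(round(z_cal, 3), round(z_alpha, 3), reject)
```

The margin is narrow here: the calculated value exceeds the critical value by only about 0.12, so the decision would change at a stricter level such as $\alpha = 0.01$.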
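When the population is normal, $\sigma$ is unknown and the sample is small ($n < 30$), the critical value is taken from the Student-t distribution with $n - 1$ degrees of freedom instead of the standard normal distribution.
i) To test the hypothesis $H_0: \mu = \mu_0$ against $H_A: \mu \ne \mu_0$, the critical region is:

$$\left|\frac{\bar{X} - \mu_0}{S/\sqrt{n}}\right| > t_{\alpha/2}(n-1)$$

ii) To test the hypothesis $H_0: \mu \ge \mu_0$ against $H_A: \mu < \mu_0$, the critical region is:

$$\frac{\bar{X} - \mu_0}{S/\sqrt{n}} < -t_\alpha(n-1)$$

iii) To test the hypothesis $H_0: \mu \le \mu_0$ against $H_A: \mu > \mu_0$, the critical region is:

$$\frac{\bar{X} - \mu_0}{S/\sqrt{n}} > t_\alpha(n-1)$$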
Example 8.17: The labour management contract calls for the mean daily output of a particular
production section to be no less than 50 units. A random sample of 22 days reveals a mean of
48.2 units with a standard deviation of 4 units. Assume that the daily output levels are
approximately normally distributed. Is the contract provision fulfilled? Test it at the 5%
significance level.
Solution:
Given $\mu_0 = 50$ units, $S = 4$ units, $\bar{X} = 48.2$ units, $n = 22$.
The null and alternative hypotheses are
$H_0: \mu \ge 50$ units
$H_A: \mu < 50$ units
$\alpha = 5\% = 0.05$
Since $n < 30$ and the population standard deviation $\sigma$ is unknown, we use the Student-t
distribution table to find the critical value.
The test statistic is:

$$t_{cal} = \frac{\bar{X} - \mu_0}{S/\sqrt{n}} = \frac{48.2 - 50}{4/\sqrt{22}} = -2.11$$

The critical region is

$$\frac{\bar{X} - \mu_0}{S/\sqrt{n}} < -t_\alpha(n-1).$$

But $t_\alpha(n-1) = t_{0.05}(21) = 1.72$, so
$-t_{0.05}(21) = -1.72$.
Since $t_{cal} = -2.11 < -1.72 = t_{tab}$, we reject $H_0$ (accept $H_A$), and conclude that the provision is not fulfilled.
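The calculation in Example 8.17 can be reproduced as below. The Python standard library has no Student-t quantile function, so this sketch takes the critical value 1.72 for 21 degrees of freedom directly from the table value quoted in the solution; that hard-coded constant and the variable names are assumptions of the sketch.

```python
from math import sqrt

x_bar, mu0, s, n = 48.2, 50, 4, 22

t_cal = (x_bar - mu0) / (s / sqrt(n))   # -1.8 / (4 / 4.69...), about -2.11

# Table value t_{0.05}(21) as quoted in the solution, not computed here.
t_tab = 1.72

reject = t_cal < -t_tab                 # left-tailed critical region
print(round(t_cal, 2), reject)          # -2.11 True
```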
Exercise:
It is believed that the average number of hours of study per day needed to pass a certain exam is a normal variable
with mean 4 hrs. A sample of 16 students was asked and gave the following figures for their own
hours:
3, 4, 4, 5, 2, 5, 4, 3, 3, 2, 6, 6, 2, 1, 7, 4
a) Test at the 5% level of significance whether the data are consistent with the specified study
hours.
b) Test at the 1% level of significance whether students study fewer hours per day to pass the same
exam.
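As a starting point for the exercise, the sample mean, the sample standard deviation $S$ (with divisor $n - 1$) and the resulting test statistic can be computed from the raw data. The sketch assumes that, because $n = 16$ is small and $\sigma$ is not given, the statistic $\frac{\bar{X} - \mu_0}{S/\sqrt{n}}$ is the appropriate one; the comparison with the tabulated critical values is left to the reader.

```python
from math import sqrt
from statistics import mean, stdev

hours = [3, 4, 4, 5, 2, 5, 4, 3, 3, 2, 6, 6, 2, 1, 7, 4]
mu0 = 4

n = len(hours)
x_bar = mean(hours)                      # sample mean
s = stdev(hours)                         # sample standard deviation, divisor n - 1
t_cal = (x_bar - mu0) / (s / sqrt(n))    # statistic to compare with the table value

print(n, x_bar, round(s, 3), round(t_cal, 3))
```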
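Case I
If the hypothesis to be tested is

$$H_0: P = P_0 \ \text{ or } \ H_0: P \le P_0 \quad \text{against} \quad H_A: P > P_0,$$

then, with $\hat{p}$ the sample proportion and $Q_0 = 1 - P_0$,

$$P\left(\frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}} > z_\alpha\right) = \alpha,$$

so the critical region is

$$\frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}} > z_\alpha.$$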
Case II

$$H_0: P = P_0 \ \text{ or } \ H_0: P \ge P_0 \quad \text{against} \quad H_A: P < P_0$$

The critical region is

$$\frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}} < -z_\alpha$$
Case III
For the two-tailed test

$$H_0: P = P_0 \quad \text{against} \quad H_A: P \ne P_0$$

The critical region is

$$\left|\frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}}\right| > z_{\alpha/2}$$
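Cases I to III differ only in which tail of the standard normal distribution forms the critical region, so they can be sketched as one routine. The function name `z_test_proportion`, its arguments and the `tail` labels are hypothetical, and the usage numbers are invented for illustration.

```python
from math import sqrt
from statistics import NormalDist


def z_test_proportion(x, n, p0, alpha, tail):
    """Large-sample z-test for a proportion.

    x successes in n trials; tail is "right" (Case I), "left" (Case II)
    or "two" (Case III). Returns (z_cal, reject).
    """
    p_hat = x / n                                  # sample proportion
    q0 = 1 - p0
    z_cal = (p_hat - p0) / sqrt(p0 * q0 / n)       # standard error uses P0, Q0
    std_normal = NormalDist()
    if tail == "right":
        reject = z_cal > std_normal.inv_cdf(1 - alpha)
    elif tail == "left":
        reject = z_cal < -std_normal.inv_cdf(1 - alpha)
    elif tail == "two":
        reject = abs(z_cal) > std_normal.inv_cdf(1 - alpha / 2)
    else:
        raise ValueError("tail must be 'right', 'left' or 'two'")
    return z_cal, reject


# Hypothetical data: 60 successes in 100 trials, tested against P0 = 0.5.
z, reject = z_test_proportion(60, 100, 0.5, alpha=0.05, tail="two")
print(round(z, 2), reject)   # 2.0 True
```

Note that the standard error is computed from the hypothesized value $P_0$, not from the sample proportion, because the distribution of the statistic is derived under $H_0$.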
Example 8.11: A television manufacturer claims that at least 90% of his TV sets do not require
any repair during the first two years of operation. A consumer protection agency selects a random
sample of 100 sets and finds that 14 sets required some repair within the first two years of
operation. At the 1% level of significance, what conclusion should be reached by the consumer
protection agency?
Solution: Given $P_0 = 90\% = 0.9$, $n = 100$.
14 sets required repair, which means that $100 - 14 = 86$ did not require repair. Hence, the proportion of TV
sets in the sample that did not require repair was

$$\hat{p} = \frac{x}{n} = \frac{86}{100} = 0.86.$$

The null and alternative hypotheses are
$H_0: P \ge 0.90$
$H_A: P < 0.90$
$nP_0 = 100 \times 0.9 = 90 > 5$
$nQ_0 = 100 \times 0.1 = 10 > 5$
Then by the CLT we use the Z-distribution to test the hypothesis.
The test statistic is:

$$Z = \frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}}$$

The critical region is

$$\frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}} < -z_\alpha$$

$$z_{cal} = \frac{\hat{p} - P_0}{\sqrt{P_0 Q_0 / n}} = \frac{0.86 - 0.9}{\sqrt{0.9 \times 0.1 / 100}} = -1.33$$

$$z_{tab} = -z_\alpha = -z_{0.01} = -2.33$$

Since $z_{cal} > z_{tab}$, we do not reject $H_0$. Therefore, at the 1% level of significance there is no sufficient evidence against the manufacturer's claim.
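The numbers in Example 8.11 can be verified with the short sketch below, which also checks the sample-size conditions $nP_0 > 5$ and $nQ_0 > 5$ used to justify the normal approximation. The variable names are illustrative, and the critical value is computed rather than read from a table.

```python
from math import sqrt
from statistics import NormalDist

n, repaired, p0, alpha = 100, 14, 0.9, 0.01
q0 = 1 - p0

p_hat = (n - repaired) / n                     # 86 / 100 = 0.86 needed no repair
large_enough = n * p0 > 5 and n * q0 > 5       # conditions for the normal approximation

z_cal = (p_hat - p0) / sqrt(p0 * q0 / n)       # -0.04 / 0.03, about -1.33
z_alpha = NormalDist().inv_cdf(1 - alpha)      # about 2.33

reject = z_cal < -z_alpha                      # left-tailed critical region
print(large_enough, round(z_cal, 2), reject)   # True -1.33 False
```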