Beruflich Dokumente
Kultur Dokumente
11.1
Introduction
In addition to estimation, hypothesis testing is a procedure for making inferences about a population.
Population Sample
Inference
Statistic Parameter
Hypothesis testing allows us to determine whether enough statistical evidence exists to conclude that a belief (i.e. hypothesis) about a parameter is supported by the data.
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.2
H0: the null hypothesis H1: the alternative or research hypothesis The null hypothesis (H0) will always state that the parameter equals the value specified in the alternative hypothesis (H1)
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.3
interested in determining
11.4
11.5
11.6
11.7
11.8
11.9
Types of Errors
A Type I error occurs when we reject a true null hypothesis (i.e. Reject H0 when it is TRUE)
H0
Reject Reject
T I
II
A Type II error occurs when we dont reject a false null hypothesis (i.e. Do NOT reject H0 when it is FALSE)
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.10
Types of Errors
Back to our example, we would commit a Type I error if: Reject H0 when it is TRUE We reject H0 ( = 350) in favor of H1 ( 350) when in fact the real value of is 350. We would commit a Type II error in the case where: Do NOT reject H0 when it is FALSE We believe H0 is correct ( = 350), when in fact the real value of is something other than 350.
11.11
Recap I
1) Two hypotheses: H0 & H1 2) ASSUME H0 is TRUE 3) GOAL: determine if there is enough evidence to infer that H1 is TRUE 4) Two possible decisions: Reject H0 in favor of H1 NOT Reject H0 in favor of H1 5) Two possible types of errors: Type I: reject a true H0 [P(Type I)= ] Type II: not reject a false H0 [P(Type II)= ]
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.12
Recap II
The null hypothesis must specify a single value of the parameter (e.g. =___) Assume the null hypothesis is TRUE. Sample from the population, and build a statistic related to the parameter hypothesized (e.g. the sample mean, ) Compare the statistic with the value specified in the first step
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.13
Example 11.1
A department store manager determines that a new billing system will be cost-effective only if the mean monthly account is more than $170. A random sample of 400 monthly accounts is drawn, for which the sample mean is $178. The accounts are approximately normally distributed with a standard deviation of $65. Can we conclude that the new system will be cost-effective?
11.14
Example 11.1
The system will be cost effective if the mean account balance for all customers is greater than $170. We express this belief as a our research hypothesis, that is: H1: > 170 (this is what we want to determine) Thus, our null hypothesis becomes: H0: = 170 (this specifies a single value for the parameter of interest)
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.15
Example 11.1
What we want to show: H1: > 170 H0: = 170 (well assume this is true) We know: n = 400, = 178, and = 65 Hmm. What to do next?!
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.16
Example 11.1
To test our hypotheses, we can use two different approaches: The rejection region approach (typically used when computing statistics manually), and The p-value approach (which is generally used with a computer and statistical software). We will explore both in turn
11.17
to reject H0.
11.18
Example 11.1
It seems reasonable to reject the null hypothesis in favor of the alternative if the value of the sample mean is large relative to 170, that is if > .
11.19
Example 11.1
All thats left to do is calculate and compare it to 170.
11.20
Example 11.1
At a 5% significance level (i.e. value of = 175.34 =0.05), we get a critical
Since our sample mean (178) is greater than the critical value we calculated (175.34), we reject the null hypothesis in favor of H1, i.e. that: > 170 and that it is cost effective to install the new billing system
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.21
=175.34 =178
11.22
11.23
p-Value
The p-value of a test is the probability of observing a test statistic at least as extreme as the one computed given that the null hypothesis is true. In the case of our department store example, what is the probability of observing a sample mean at least as extreme as the one already observed (i.e. = 178), given that the null hypothesis (H0: = 170) is true?
p-value
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.24
11.25
.01
.05
.10
11.26
11.27
p-value
11.28
11.29
11.30
11.31
Right-Tail Testing
Calculate the critical value of the mean ( ) and compare against the observed value of the sample mean ( )
11.32
Left-Tail Testing
Calculate the critical value of the mean ( ) and compare against the observed value of the sample mean ( )
11.33
TwoTail Testing
Two tail testing is used when we want to test a research hypothesis that a parameter is not equal () to some value
11.34
Example 11.2
AT&Ts argues that its rates are such that customers wont see a difference in their phone bills between them and their competitors. They calculate the mean and standard deviation for all their customers at $17.09 and $3.87 (respectively). They then sample 100 customers at random and recalculate a monthly phone bill based on competitors rates. What we want to show is whether or not: H1: 17.09. We do this by assuming that: H0: = 17.09
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.35
Example 11.2
The rejection region is set up so we can reject the null hypothesis when the test statistic is large or when it is small.
stat is small
stat is large
That is, we set up a two-tail rejection region. The total area in the rejection region must sum to , so we divide this probability by 2.
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.36
Example 11.2
At a 5% significance level (i.e. = .05), we have /2 = .025. Thus, z.025 = 1.96 and our rejection region is: z < 1.96 -orz > 1.96
-z.025
+z.025
11.37
Example 11.2
From the data, we calculate = 17.55
We find that: Since z = 1.19 is not greater than 1.96, nor less than 1.96 we cannot reject the null hypothesis in favor of H1. That is there is insufficient evidence to infer that there is a difference between the bills of AT&T and the competitor.
Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.
11.38
11.39
Two-Tail Test
11.40
11.41
11.42
11.43
11.44
Effects on
of Changing
Decreasing the significance level , increases the value of and vice versa. Consider this diagram again. Shifting the critical value line to the right (to decrease ) will mean a larger area under the lower curve for (and vice versa)
11.45
11.46
11.47
11.48
Beta-mean.xls
The Beta-mean workbook is a handy tool for calculating for any test of hypothesis. For example, comparing n=400 to n=1,000 for our department store example
the power of the test has increased by increasing the same size
11.49