Beruflich Dokumente
Kultur Dokumente
NONPARAMETRIC
METHODS
Opening Example
Definition
The sign test is used to make hypothesis
tests about preferences, a single median,
and the median of paired differences for two
dependent populations. We use only plus
and minus signs to perform these tests.
Example 15-1
Example 15-1
Example 15-1
Example 15-1
Step 1:
Step 3:
True
sample size n = 9
Figure 15.1
Critical Value(s) of X
Table 15.1
Step 4:
Observed Value of X
( X .5)
z
np
and
npq
Example 15-2
Example 15-2
Step 1:
Figure 15.2
Step 4:
np 70(.50) 35
npq 70(.50)(.50) 17.5 4.18330013
n 70
X 40 and
35
2 2
( X .5) (40 .5) 35
z
1.08
4.18330013
Prem Mann, Introductory Statistics, 7/E
Copyright 2010 John Wiley & Sons. All right reserved
Example 15-3
Example 15-3
Table 15.2
Figure 15.3
Observed Value of X
Example 15-4
Figure 15.4
Step 4:
n 89
p .50
q 1 p .50
np 89(.50) 44.50
npq
89(.50)(.50) 4.71699057
n 89
X 38 and
44.5
2
2
( X .5)
(38 .5) 44.50
z
1.27
4.71699057
Prem Mann, Introductory Statistics, 7/E
Copyright 2010 John Wiley & Sons. All right reserved
Example 15-5
Example 15-5
Figure 15.5
Hence, we reject H0
Example 15-6
Example 15-6
Figure 15.6
Step 4:
np 40(.50) 20
npq 40(.50)(.50) 3.16227766
X 7
( X .5)
(7 .5) 20
z
3.95
3.16227766
Step 5:
Consequently, we reject H0
Example 15-7
Example 15-7
Figure 15.7
Decision Rule
Step 4:
Step 4:
Observed value of T
= sum of the positive ranks = 3
Example 15-8
Example 15-8
Example 15-8
H1: MA > MB
Figure 15.8
Observed Value of z
In a Wilcoxon signed-rank test for two
dependent samples, when the sample size
is large (n > 15), the observed value of z
for the test statistic T is calculated as
where
T T
z
T
n n 1
T
4
and
n(n 1)(2n 1)
24
Observed Value of z
The value of T that is used to calculate the
value of z is determined based on the
alternative hypothesis, as explained next.
Observed Value of z
1. If the test is two-tailed with the alternative
hypothesis that the two distributions are not
the same, then the value of T may be equal
to either of the two sums, the sum of the
positive ranks or the sum of the absolute
values of the negative ranks. We will reject
H0 if the observed value of z falls in either of
the rejection regions.
Observed Value of z
2. If the test is right-tailed with the alternative
hypothesis that the distribution of after
values is to the right of the distribution of
before values, then the value of T is equal to
the sum of the absolute values of the
negative ranks. We will reject H0 if the
observed value of z is greater than or equal
to the critical value of z.
Observed Value of z
3. If the test is left-tailed with the alternative
hypothesis that the distribution of after
values is to the left of the distribution of
before values, then the value of T is equal to
the sum of the absolute values of the
negative ranks. We will reject H0 if the
observed value of z is less than or equal to
the critical value of z.
Observed Value of z
Remember, for the above to be true, the
paired difference is defined as the before
value minus the after value. In other words,
the differences are obtained by subtracting
the after values from the before values.
Also, note that whether the test is righttailed or left-tailed, the value of T in both
cases is equal to the sum of the absolute
values of the negative ranks.
Prem Mann, Introductory Statistics, 7/E
Copyright 2010 John Wiley & Sons. All right reserved
Step 4:
n n 1 23 23 1
T
138
4
4
n n 1 2n 1
23 23 1 46 1
T
32.87856445
24
24
T T
218 138
z
2.43
T
32.87856445
Example 15-9
A researcher wants to determine whether
the distributions of daily crimes in two cities
are identical. The following data give the
numbers of violent crimes on eight randomly
selected days for City A and on nine days for
City B.
Example 15-9
Figure 15.9
Step 4:
It is between TL = 51 and TU = 93
Observed Value of z
In the case of a large sample, the observed
value of z is calculated as
T T
z
T
Observed Value of z
n1(n1 n2 1)
T
2
and
n1n2 (n1 n2 1)
T
12
Example 15-10
A researcher wanted to find out whether
job-related stress is lower for college and
university professors than for physicians.
She took random samples of 14 professors
and 11 physicians and tested them for jobrelated stress. The following data give the
stress levels for professors and physicians
on a scale of 1 to 20, where 1 is the lowest
level of stress and 20 is the highest.
Prem Mann, Introductory Statistics, 7/E
Copyright 2010 John Wiley & Sons. All right reserved
Example 15-10
Figure 15.10
Step 4:
143
2
2
n1n2 (n1 n2 1)
11(14)(11 14 1)
T
18.26654501
12
12
T T
188.5 143
z
2.49
T
18.26654501
Hence, we reject H0
and
n1n2 (n1 n2 1)
T
12
Rk2
12 R12 R22
H
... 3(n 1)
n(n 1) n1 n2
nk
Example 15-11
A researcher wanted to find out whether the
population distributions of salaries of
computer programmers are identical in
three cities, Boston, San Francisco, and
Atlanta. Three different samples one from
each city - produced the following data on
the annual salaries (in thousands of dollars)
of computer programmers.
Example 15-11
Example 15-11
Using the 2.5% significance level, can you
conclude that the population distributions
of salaries for computer programmers in
these three cities are all identical?
Figure 15.11
Step 4:
... 3(n 1)
n(n 1) n1 n2
nk
(75)2 (60.5)2 (95.5)2
12
3(21 1)
21(21 1) 6
7
8
1.543
rs 1
6 d
n(n 2 1)
Example 15-12
Suppose we want to investigate the
relationship between the per capita income
(in thousands of dollars) and the infant
mortality rate (in percent) for different
states. The following table gives data on
these two variables for a random sample of
eight states.
Example 15-12
Figure 15.12
Critical Value of rs
The critical value of rs is obtained from
Table XI for the given sample size and
significance level. If the test is two-tailed,
we use two critical values, one negative
and one positive. However, we use only the
negative value of rs if the test is left-tailed,
and only the positive value of rs if the test
is right-tailed.
Step 4:
Hence, we reject H0
Example 15-13
A college admissions office is interested in
knowing whether applications for admission
arrive randomly with respect to gender. The
gender of 25 consecutively arriving
applications were found to arrive in the
following order (here M denotes a male
applicant and F a female applicant).
Example 15-13
M F M M F F F M F M M M
F F F F M M M F F M F M M
Can you conclude that the applications for
admission arrive randomly with respect to
gender? Use = .05.
n1 = 13
and
n2 = 12
Figure 15.13
It is between 9 and 18
Observed Value of z
For large values of n1 and n2, the distribution
of R (the number of runs in the sample) is
approximately normal with its mean and
standard deviation given as
2n1n2
R
1
n1 n2
and
2n1n2 (2n1n2 n1 n2 )
R
(n1 n2 )2 (n1 n2 1)
Observed Value of z
The observed value of z for R is calculated
using the formula
R R
z
R
Example 15-14
Refer to Example 15-13. Suppose that the
admissions officer examines 50 consecutive
applications and observes that n1 = 22,
n2
= 28, and R = 20, where n1 is the number
of male applicants, n2 the number of female
applicants, and R the number of runs. Can
we conclude that the applications for
admission arrive randomly with respect to
gender? Use = .01.
Figure 15.14
Step 4:
R
2n1n2
2(22)(28)
1
1 25.64
n1 n2
22 28
2n1n2 (2n1n2 n1 n2 )
2(22)(28)(2 22 28 22 28)
R
2
(n1 n2 ) (n1 n2 1)
(22 28)2 (22 28 1)
3.44783162
z
R R
20 25.64
1.64
R
3.44783162
TI-84
Minitab
Minitab
Minitab
Minitab
Minitab
Minitab
Minitab
Minitab
Minitab
Minitab
Excel