Beruflich Dokumente
Kultur Dokumente
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/
info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content
in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship.
For more information about JSTOR, please contact support@jstor.org.
Sage Publications, Inc., American Statistical Association and American Educational Research Association are
collaborating with JSTOR to digitize, preserve and extend access to Journal of Educational and Behavioral Statistics.
http://www.jstor.org
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
TEACHER'S CORNER
oy.
u_ = ug + op- 2p7o o"
The correlationterm reduces the variance of the difference between means and
increases the t ratio. In the context of interval estimation, the reduced standard
This researchwas supportedby a CarletonUniversity researchgrant.A listing of the
computerprogram,writtenin TurboBASIC, Version 1.0 (Borland,Inc.) can be obtained
by writing to the authorat 15078 Eagle Place, Surrey,BC V3R 4W2, Canada.
349
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
of the two significance tests when the correlationis quite small. In the case of
naturally paired data, even correlations of .10, .15, or .20 make the pairedsamples t test mandatoryin orderto protectagainstdistortionof the significance
level. These conclusions are based on examinationof power functions as well as
degrees of freedom and Type I errors.
The presentstudy also comparesthe two tests from anotherpoint of view and
focuses attentionon an aspect of the problem which has been overlooked. The
comparison of the two proceduresfrequently made in textbooks fails to take
account of an important effect: Nonindependence of observations depresses
both Type I errorprobabilitiesand the power of the test to detect differences. In
other words, a correlationbetween samples that are believed to be independent
compromisesnot only the efficiency but also the validityof the significance test.
Furthermore,the change that occurs is quite large.
Many years ago, Cochran(1974), Scheff6 (1959), Walsh (1947), and others
discovered that violation of the independenceassumptionunderlyingthe t and F
tests distorts Type I and Type II errorprobabilities.(See also a recent study by
Zimmerman,Williams, & Zumbo, 1993.) However, investigatorshave not considered these results in the context of paired-samplesexperimentaldesigns. The
present note examines some implications of nonindependenceof observations,
as investigated in these studies, for interpretationof the paired-samplest statistic.
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
n = 20
0.12
0.11
S0.10
O
0.09
0.08
t-Independent
t-Paired
0.07
0.06
4
0
0.05
0.04
QQ3
-Q
0
L
0.02
0.01
0.00
-0.5 -0.4 -0.3 -0.2 -0.1 0.0 0.1 0.2 0.3 0.4 0.5
Correlation
n = 20
0.55
0.5o
0.45
t-Paired
0.40
0.35
t-Independent
0.30
- -- -- -- -- -- -
-?--------
0.25
S0.20
S0.15
0
0.10
0.05
0.00
-0.5
-0.4
-0.3
-0.2
-0.1
0.0
0.1
0.2
0.3
0.4
0.5
Correlation
FIGURE 1. Probability of rejecting Ho by the independent samples t test and the
paired-samplest test as a function of correlation
Note. The differencebetween populationmeans is zero in the uppersection and 1.5arin the lower
section.
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
n = 20
p = 0
1.0
O
0
0.9
t-Independent
t-Paired
-1- 0.8
0.7
.C. 0.6
S
*
0.6
0.5
S0.4
0.3
c3
-0
0
L
0.2
0.1
I
0.0
0
I
3
Difference
1.0
0.9
n = 20
0.8
in Standard
Units
p = .30
t-Independent
t-Paired
t-Adjusted
0.7
0.6
>
0.4
.-0
0.3
0.2
L-
0.1
0.0
0
Difference
in Standard
Units
FIGURE 2. Probability of rejecting Ho by the independent-samplest test, the pairedsamples t test, and the paired-samples t test with an adjusted significance level as a
function of the differencebetween means (incrementsof .5cr)
Note. The correlationis zero in the uppersection and .30 in the lower section.
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher'sCorner
comparable.The modified curve remainsslightly below that of the independentsamples test over the entire range of differences between means. This slight
disparityof the two curves apparentlyreflects differences in degrees of freedom.
It is evident from these figures that even a moderate correlation between
observations has a stronger influence on the probability of Type II errors and
power than does reductionof degrees of freedom from 18 to 9.
Table 1 provides simulation data for sample sizes of 10, 20, 40, and 80 for
both one-tailed and two-tailed tests. The differencebetween means increasedin
incrementsof 1.25r. It is evident that depressionof the Type I errorprobability
of the independent-samplestest occurs consistently for all sample sizes examined. Furthermore,the relative advantageof the paired-samplestest for correlated samples is apparentfor all sample sizes.
Conclusions
Inspection of Figures 1 and 2 and Table 1 certainly confirms the widespread
belief among researchersand applied statisticiansthat one should substitutethe
paired-samplest test for the independent-samplestest whenever subjects are
coupled or matched in some way in an experimentaldesign. The magnitudeof
the effect producedby slight correlationsprobablyis greaterthan most researchers realize. The present results disclose that even a correlationof .10 or .20
seriously distorts the significance level of the t statistic based on independent
samples. When power functions are examined, it is apparentthat advantagesof
the paired-samplestest are not negligible for small correlationsand are exceptional for correlationsas high as .40 or .50.
We now examine the problemfrom anotherpoint of view. In makingcomparisons in the presentcontext, one can ask two distinctquestions.The first question
is, What gain in efficiency results from using a matched-pairsexperimental
design instead of an independent-samplesdesign, if matchinginduces a correlation? The answer to this question is found by comparingthe curve representing
the paired-samplestest in the lower section of Figure 1 with the horizontal
broken line. The line represents a constant probability of .308, which is the
power of the independent-samplestest when the correlationis zero. This comparison makes it clear that the advantageof the paired-samplesdesign becomes
greateras the correlationincreases from 0 to .50, and that the advantageis quite
large for higher correlations.This outcome is consistent with the usual interpretation of the two tests. Of course, the amountof gain depends on the parameters
chosen for this particular example. The figure also reveals that a negative
correlationresults in a loss ratherthan a gain.
A second question is, What loss occurs if one performs the independentsamples t test inappropriatelyon measureswhich are correlated?This question is
somewhat more complicated, but it has significant practical applications.The
answer can be found by inspecting the two curves (open circles and filled
circles) in the lower section of Figure 1. These curves reveal that the difference
in the probabilities of rejecting the null hypothesis for the two tests becomes
355
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
TABLE 1
Probability of rejecting Ho by independent-samplest test and paired-samples t test for
various numbersof pairs (n) and degrees of correlationbetween samples (p)-one-tailed
and two-tailed tests
p= .20
p=0
n
t
indep.
t
paired
t
indep.
p= .40
t
paired
t
indep.
t
paired
.051
.331
.766
.968
.052
.345
.797
.984
.051
.350
.807
.986
.048
.346
.812
.986
.022
.249
.762
.980
.019
.261
.786
.988
.019
.258
.792
.988
.017
.267
.791
.989
.051
.391
.860
.993
.051
.417
.891
.997
.050
.426
.898
.996
.051
.435
.899
.997
.048
.211
.634
.931
.051
.229
.684
.953
.050
.238
.710
.963
.049
.246
.719
.969
.016
.149
.609
.946
.014
.145
.636
.961
.013
.143
.652
.966
.0 13
.149
.672
.972
.051
.270
.758
.976
.049
.288
.798
.988
.052
.302
.820
.991
.051
.316
.834
.992
One-tailedtests
10
20
40
80
0
1
2
3
0
1
2
3
0
1
2
3
0
1
2
3
.049
.297
.716
.952
.048
.295
.724
.957
.052
.309
.744
.962
.050
.317
.743
.962
.050
.283
.683
.937
.047
.285
.711
.951
.051
.304
.735
.961
.050
.314
.736
.958
.035
.284
.734
.964
.034
.288
.748
.977
.034
.288
.755
.977
.033
.279
.761
.978
Two-tailedtests
10
20
40
80
0
1
2
3
0
1
2
3
0
1
2
3
0
1
2
3
.051
.199
.583
.899
.050
.195
.603
.921
.051
.218
.630
.930
.051
.215
.627
.926
.050
.182
.531
.863
.048
.185
.580
.903
.051
.215
.620
.923
.051
.210
.617
.922
.030
.170
.601
.926
.031
.175
.622
.941
.030
.178
.636
.944
.030
.185
.649
.952
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
TABLE 2
Example of a design in which initially there is an undetected correspondenceof values
t paired
t indep.
X
Pair
25
32
43
16
34
25
17
18
29
24
34
36
34
39
34
30
35
46
23
27
43
45
28
29
1
2
3
4
5
6
7
8
9
10
11
12
17
25
16
24
43
18
34
25
36
34
32
29
45
35
27
34
46
30
29
39
23
43
34
28
D=Y-
28
10
11
10
3
12
-5
14
-13
9
2
-1
Note.An independent-samples
withoutconsideration
of possible
Studentt test was firstperformed
pairingof scores.Then,pairingwas recognized,anda one-sampleStudentt test (i.e., a pairedon differencescores.
samplest test)wasperformed
Independent:t = 2.052, df-= 22, p > .05. Paired:t = 2.2 10, df = 11,p < .05. Pxy= .100. rxy = .139.
358
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions
Teacher's Corner
the range of .30 to .40 or higher," and the answer to the second is "in the range
of .05 to .15 or lower." In any case, choice of an appropriatetest cannot be
determined solely by the error term in the denominatorof the t statistic and
calculation of the power of the test. It must also take into consideration a
possible alterationof the significance level by a correlationbetween observations.
References
Box, G. E. P., & Muller,M. (1958). A note on the generationof normaldeviates. Annals
of MathematicalStatistics,29, 610-611.
Cochran, W. G. (1947). Some consequences when the assumptions in the analysis of
varianceare not satisfied.Biometrics,3, 22-38.
Coren, S., & Hakstian,A. R. (1990). Methodological implications of interauralcorrelation: Count heads not ears. Perception and Psychophysics,48, 291-294.
Edwards,A. L. (1979). Multipleregression and the analysis of variance and covariance.
New York: Freeman.
Hays, W. L. (1988). Statistics (4th ed.). New York: Holt, Rinehart,& Winston.
Howell, D. C. (1987). Statisticalmethodsfor psychology. Boston: DuxburyPress.
Kurtz, K. H. (1965). Foundations of psychological research. Boston: Allyn & Bacon.
Loether, H. J., & McTavish, D. G. (1993). Descriptive and inferential statistics: An
introduction(4th ed.). Boston: Allyn & Bacon.
Pagano, R. R. (1986). Understandingstatistics in the behavioral sciences (2nd ed.). St.
Paul, MN: West.
Scheff6, H. (1959). The analysis of variance. New York: Wiley.
Walsh,J. E. (1947). Concerningthe effect of intraclasscorrelationon certainsignificance
tests. Annals of MathematicalStatistics, 18, 88-96.
Zimmerman,D. W., Williams,R. H., & Zumbo,B. D. (1993). Effect of nonindependence
of sample observationson some parametricand nonparametricstatistical tests. Communicationsin Statistics: Simulationand Computation,22, 779-789.
Zumbo, B. D. (1996). Randomizationtest for coupled data. Perception and Psychophysics, 58, 471-478.
Author
DONALD W. ZIMMERMANis Professor of Psychology (Retired),CarletonUniversity,
Ottawa, Canada,and can currentlybe reached at 15078 Eagle Place, Surrey,BC V3R
4W2, Canada;zimmerma@direct.ca.He specializes in psychology.
Received August 10, 1996
Revision received September3, 1996
Second revision received October 20, 1996
Accepted December 3, 1996
360
This content downloaded from 128.240.233.146 on Fri, 25 Dec 2015 18:58:05 UTC
All use subject to JSTOR Terms and Conditions