Beruflich Dokumente
Kultur Dokumente
a
,
3
2
1
0
.1
.2
.3
T#e normality plot
The problem with the normality of this
variable*s distribution is reinforced by the
normality plot.
%f the variable were normally distributed,
the red dots would fit the green line very
closely. %n this case, the red points in the
upper right of the chart indicate the
severe skewing caused by the extremely
large data values.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
6
Tests of Normality
.246 93 .000 .606 93 .000
TOTAL TIME SPENT
ON THE INTERNET
Statisti& (/ Sig. Statisti& (/ Sig.
4o,ogoro*.Sir%o*
a
S5a3iro.6i,7
Li,,i"/ors Sig%i/i&a%&" 8orr"&tio%
a.
T#e test of normality
(roblem + asks about the results of the test of normality. Since the sample
si,e is larger than -., we use the 'olmogorov$Smirnov test. %f the sample
si,e were -. or less, we would use the Shapiro$/ilk statistic instead.
The null hypothesis for the test of normality states that the actual
distribution of the variable is e!ual to the expected distribution, i.e., the
variable is normally distributed. Since the probability associated with the
test of normality is 0 ....+ is less than or e!ual to the level of significance
1...+2, we re3ect the null hypothesis and conclude that total hours spent on
the %nternet is not normally distributed. 14ote5 we report the probability as
0....+ instead of .... to be clear that the probability is not really ,ero.2
The answer to problem + is false.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
7
T#e assumption of normality script
)n S(SS script to produce all
of the output that we have
produced manually is
available on the course web
site.
)fter downloading the script,
run it to test the assumption
of linearity.
Select Run Script
from the 6tilities
menu.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
8
Selectin) t#e assumption of normality script
First, navigate to the folder containing your
scripts and highlight the
4ormality)ssumption)ndTransformations.S"S
script.
Second, click on
the Run button to
activate the script.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
9
Specifications for normality script
The default output is to do all of the
transformations of the variable. To
exclude some transformations from the
calculations, clear the checkboxes.
Third, click on the OK
button to run the script.
First, move variables from
the list of variables in the
data set to the aria!les to
"est list box.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
2!
Tests of Normality
.246 93 .000 .606 93 .000
TOTAL TIME SPENT
ON THE INTERNET
Statisti& (/ Sig. Statisti& (/ Sig.
4o,ogoro*.Sir%o*
a
S5a3iro.6i,7
Li,,i"/ors Sig%i/i&a%&" 8orr"&tio%
a.
T#e test of normality
The script produces the same output that we
computed manually, in this example, the tests
of normality.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
2
Prolem 3
In t#e dataset +SS3...'sa&( is t#e follo$in)
statement true( false( or an incorrect application of a
statistic4
6ased on t#e rule of t#um for t#e allo$ale
ma)nitude of s,e$ness and ,urtosis( total #ours
spent on t#e Internet is normally distriuted'
1' True
3' True $it# caution
3' 7alse
8' Incorrect application of a statistic
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
22
Descriptives
10.731 1.5918
7.570
13.893
8.295
5.500
235.655
15.3511
.2
102.0
101.8
10.200
3.532 .250
15.614 .495
M"a%
Lo9"r :o$%(
;33"r :o$%(
95< 8o%/i("%&"
I%t"r*a, /or M"a%
5< Tri"( M"a%
M"(ia%
1aria%&"
St(. )"*iatio%
Mi%i$
Ma2i$
Ra%g"
I%t"r#$arti," Ra%g"
S7"9%"ss
4$rtosis
TOTAL TIME SPENT
ON THE INTERNET
Statisti& St(. Error
Tale of descripti&e statistics
To answer problem
7, we look at the
values for skewness
and kurtosis in the
Descriptives table.
The skewness and kurtosis for the variable both exceed the rule of
thumb criteria of +... The variable is not normally distributed.
The answer to problem 7 if false.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
23
Prolem 3
In t#e dataset +SS3...'sa&( is t#e follo$in) statement
true( false( or an incorrect application of a statistic4
5se .'.1 as t#e le&el of si)nificance'
6ased on a dia)nostic #ypot#esis test of normality(
;total #ours spent on t#e Internet; is not normally
distriuted' A lo)arit#mic transformation of ;total
#ours spent on t#e Internet; results in a &ariale t#at
is normally distriuted'
1' True
3' True $it# caution
3' 7alse
8' Incorrect application of a statistic
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
24
Tests of Normality
.047 93 .200= .994 93 .951
.118 93 .003 .868 93 .000
.288 93 .000 .495 93 .000
Logarit5 o/ NETIME
>L?10@NETIMEAB
S#$ar" Root o/ NETIME
>S-RT@NETIMEAB
I%*"rs" o/ NETIME
>1C@NETIMEAB
Statisti& (/ Sig. Statisti& (/ Sig.
4o,ogoro*.Sir%o*
a
S5a3iro.6i,7
T5is is a ,o9"r 0o$%( o/ t5" tr$" sig%i/i&a%&".
=.
Li,,i"/ors Sig%i/i&a%&" 8orr"&tio%
a.
T#e test of normality
(roblem 8 specifically asks about the results of the test of
normality for the logarithmic transformation. Since our sample
si,e is larger than -., we use the 'olmogorov$Smirnov test.
The null hypothesis for the 'olmogorov$Smirnov test of
normality states that the actual distribution of the transformed
variable is e!ual to the expected distribution, i.e., the
transformed variable is normally distributed. Since the
probability associated with the test of normality 1..7..2 is
greater than the level of significance, we fail to re3ect the null
hypothesis and conclude that the logarithmic transformation of
total hours spent on the %nternet is normally distributed.
The answer to problem 8 is true.
SW388
R7
Data
Analysi
s &
Compu
ters II
Slide
25
<t#er prolems on assumption of normality