Descriptive Statistics:
Measures of Dispersion & Skewness
[Figure: data in ascending order; the IQR is the span between the 25% and 75% points]
$$ s^2 = \frac{\sum_i (X_i - \bar{X})^2}{n}, \qquad SD = \sqrt{\frac{\sum_i (X_i - \bar{X})^2}{n}} $$
SD is the square root of the average squared deviation from
the mean.
Standard Deviation: B&O Prices
(The Harder Way)
The variance is

$$ s^2 = \frac{687969.84}{8} = 85996.23 $$

Units are difficult to interpret: the variance is measured in squared price units.
Solution: use the standard deviation, which is in the original price units.

$$ SD = \sqrt{\frac{687969.84}{8}} = 293.25 $$

  X_i     X_i - mean   (X_i - mean)^2
  800      -231.38        53536.70
  891      -140.38        19706.54
 1295       263.62        69495.50
 1451       419.62       176080.94
  580      -451.38       203743.90
 1192       160.62        25798.78
 1285       253.62        64323.10
  757      -274.38        75284.38
 ----      -------       ---------
 8251         0          687969.84   (column sums)

Mean = 8251/8 = 1031.38
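The table above can be reproduced in a few lines of Python. This is only a sketch of the "harder way" using the eight B&O prices from the slide; the variable names are illustrative. Because the code keeps the unrounded mean (1031.375), its sum of squared deviations comes out near 687969.88 rather than the slide's 687969.84, which was built from deviations rounded to two decimals. The variance and SD agree to two decimals.

```python
import math

# The eight B&O prices from the slide.
prices = [800, 891, 1295, 1451, 580, 1192, 1285, 757]
n = len(prices)

# Step 1: the mean.
mean = sum(prices) / n

# Step 2: deviation of each price from the mean, then square it.
deviations = [x - mean for x in prices]
squared_deviations = [d ** 2 for d in deviations]

# Step 3: the variance is the average squared deviation (divide by n,
# because the slides treat the data as a population).
variance = sum(squared_deviations) / n

# Step 4: the SD is the square root of the variance.
sd = math.sqrt(variance)

print(f"mean = {mean:.2f}")
print(f"variance = {variance:.2f}")
print(f"SD = {sd:.2f}")
```

Note that the deviations always sum to zero, which is why they have to be squared before averaging.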
Standard Deviation: B&O Prices
(The Easier Way)
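SD = square root of (Mean of the Squares minus Square of the Mean)

$$ SD = \sqrt{\frac{\sum X^2}{n} - \left(\frac{\sum X}{n}\right)^2} $$

$$ \sum X^2 = (X_1)^2 + (X_2)^2 + \dots + (X_n)^2 $$

$$ \left(\sum X\right)^2 = (X_1 + X_2 + \dots + X_n)^2 $$

Formula looks more complex, but needs fewer calculations.

 Price    (X_i)^2
  800      640000
  891      793881
 1295     1677025
 1451     2105401
  580      336400
 1192     1420864
 1285     1651225
  757      573049
 ----     -------
 8251     9197845   (column sums)

$$ SD = \sqrt{\frac{9197845}{8} - \left(\frac{8251}{8}\right)^2} = 293.25 \quad \text{(trust me!)} $$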
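For readers who would rather check than trust, here is a minimal Python sketch of the shortcut formula on the same eight prices. The names `sum_x` and `sum_x2` are illustrative, standing for the two column totals in the table.

```python
import math

# The eight B&O prices from the slide.
prices = [800, 891, 1295, 1451, 580, 1192, 1285, 757]
n = len(prices)

# Only two running totals are needed: the sum and the sum of squares.
sum_x = sum(prices)                   # column total of the prices
sum_x2 = sum(x * x for x in prices)   # column total of the squared prices

# Mean of the squares minus square of the mean, then the square root.
mean_of_squares = sum_x2 / n
square_of_mean = (sum_x / n) ** 2
sd_shortcut = math.sqrt(mean_of_squares - square_of_mean)

print(sum_x, sum_x2)
print(f"SD = {sd_shortcut:.2f}")
```

The shortcut avoids computing eight separate deviations, which is why it is easier by hand. In floating-point code with very large values the subtraction can lose precision, so the deviation-based version is often the safer choice on a computer.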
Interpreting the Standard Deviation
Q: B&O's prices have an SD of about 300. Is that high or low?
Difficult to say with only one data series. It is easier to think in
comparative terms, but only if we have two or more variables.
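$$ s_{n-1} = \sqrt{\frac{\sum (X_i - \bar{X})^2}{n-1}} = \sqrt{\left[\frac{\sum X^2}{n} - \left(\frac{\sum X}{n}\right)^2\right]\frac{n}{n-1}} = SD \times \sqrt{\frac{n}{n-1}} $$

$$ s_{n-1} = 293.25 \times \sqrt{\frac{8}{7}} = 293.25 \times 1.069 = 313.48 $$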
Be careful: Excel has two commands. STDEV divides by n - 1 (sample); STDEVP divides by n (population).
To avoid confusion: we will treat data as a population!
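The same population/sample distinction exists in Python's standard `statistics` module: `pstdev` divides by n and `stdev` divides by n - 1. The sketch below applies both to the eight B&O prices and checks the conversion factor; it is an illustration, not part of the original slides. Without intermediate rounding the sample figure is about 313.50; the slide's 313.48 comes from rounding the factor to 1.069.

```python
import math
import statistics

# The eight B&O prices from the slide.
prices = [800, 891, 1295, 1451, 580, 1192, 1285, 757]
n = len(prices)

# Population SD: divides the sum of squared deviations by n.
sd_population = statistics.pstdev(prices)

# Sample SD: divides the sum of squared deviations by n - 1.
sd_sample = statistics.stdev(prices)

# The two differ only by the factor sqrt(n / (n - 1)).
factor = math.sqrt(n / (n - 1))

print(f"population SD = {sd_population:.2f}")
print(f"sample SD     = {sd_sample:.2f}")
print(f"factor        = {factor:.3f}")
```

As n grows the factor approaches 1, so the choice matters most for small data sets like this one.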
Comparing Measures of Dispersion
Histogram: a useful graphical way to plot the frequency of values against a
numerical scale.
Q: What is the relative position
[Figure: three histograms, Frequency (0 to 10) against values 1 to 13; panels labelled "Central Tendency" and "Measures of Dispersion"]
[Figure: box plot labelled Lower Quartile, Median, Upper Quartile]
Building the Box Plot Cont.
Whiskers are the max. and min. data values between the
upper and lower fences (not always shown but should be!)
Upper: Q3 + (1.5 x IQR) = 580 + (1.5 x 343) = 1094.5
Lower: Q1 - (1.5 x IQR) = 237 - (1.5 x 343) = -277.5, taken as 0 since a price cannot be negative.
The most expensive television (1217) is above the upper
fence, so it is a possible outlier (*).
The 2nd most expensive TV (904) is inside the fence, so it is not an outlier.
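The fence rule can be sketched in Python as follows. The quartiles are the rounded values used on the slide (237 and 580); the function name `is_possible_outlier` is illustrative and not taken from any package.

```python
# Rounded quartiles from the slide.
q1 = 237
q3 = 580

# Interquartile range: the spread of the middle 50% of the data.
iqr = q3 - q1

# Fences sit 1.5 IQRs beyond each quartile.
upper_fence = q3 + 1.5 * iqr
lower_fence = q1 - 1.5 * iqr


def is_possible_outlier(value):
    """Return True if the value lies outside either fence."""
    return value < lower_fence or value > upper_fence


print(iqr, upper_fence, lower_fence)
print(is_possible_outlier(1217))  # most expensive TV
print(is_possible_outlier(904))   # 2nd most expensive TV
```

The lower fence is negative here, so no television price can fall below it; only the upper fence is relevant for this data set.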
[Figure: box plot of television prices on a 0 to 1400 scale. The whiskers run from the cheapest TV to the most expensive TV inside the fence; the upper fence is marked and the outlier is shown as *]
SPSS Summary Statistics:
Price of Selected Televisions
Statistic                  Value
Mean                      436.97
Median                    417.82
Mode                      150.19a
Variance                62785.59
Std. Deviation (pop)      246.50
Minimum                   150.19
Maximum                  1216.95
Range                    1066.76
Interquartile Range       342.27
Percentiles   25          237.47
              50          417.82
              75          579.74
Television Monthly Sales (Dataset#1)
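[Figure: box plot of monthly television sales on a 0 to 2000 scale, with the upper fence marked]
[Figure: side-by-side box plots for two groups, Junior (N = 261) and Senior (N = 296), on a 1.0 to 3.5 scale, with individual points labelled 530, 423, 309, 157 and 426]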