Beruflich Dokumente
Kultur Dokumente
6.1 Introduction
In the previous units we studied about, the tendency of data to cluster
around a figure which is in central location. In this unit we will study about
different measures of dispersion. Dispersion means scatteredness. So it is
mean deviation or spread of values from their central values. It can also be
defined as the difference between two extreme values of the series.
Objectives:
At the end of this unit the student should be able to:
Explain the concept of dispersion and the significance of measuring it
Describe and compute different measures of dispersion
Decide about the use of an appropriate measure of dispersion under
different situations.
Range
Range is the simplest method of studying variation. It represents the
differences between the values of the extremes. That is it is the difference
between the highest and the lowest values in the series. It is denoted by R.
In a frequency distribution, range is the difference between the upper limit of
the largest class interval and the lower limit of the smallest class interval.
Range can be computed using following equation.
Range = Large value Small value
L arg e value Small value
Coefficien t of Range
L arg e value Small value
Uses:
In Statistical Quality control
When the study does not require deep analysis
When data has no abnormal values
Example: Find the range for the continuous series of data shown in table.
Frequency table for data
Class Interval 0-5 5-10 10-15 15-20 20-25
Frequency 10 15 25 12 8
Example: Compute the range and also the co-efficient of range of the given
series of state which one is more dispersed and which is more uniform.
R = LV SV = 21 9 = 12 R = LV SV = 29 1 = 28
12 12 R 28
CR = = 0.4 CR = = 0.933
21 9 30 LV SV 30
A: 10 11 12 13 14
B: 40 41 42 43 44
R =LV SV
= 14 10 R = 44 - 40 R = 104 - 100
= 4 = 4 = 4
R R R
CR = CR = CR =
LV SV LV SV LV SV
4 4 4
= = =
24 84 204
= 0.166 = 0.0476 = 0.0196
SAQ 1: Compute range and coefficient of range for the following data.
x: 6 12 18 24 30 36 42
f: 20 130 16 14 20 15 40
SAQ 2: From the following distribution find the value of range and its
coefficient:
X: 14, 16, 16, 14, 22, 13, 15, 24, 12, 12, 23, 14, 20, 17, 21, 22
Interquartile range = Q3 Q1
1
Quartile deviation is defined as QD = (Q3 Q1).
2
Q 3 Q1
Relative measure of dispersion coefficient of QD =
Q 3 Q1
Note:
1. Q3-Q1 gives the middle 50% of reading. Q3 and Q1 are also known as
upper and lower limit of middle 50% of readings.
2. Quartile range is not capable of further algebraic treatment.
Merits Demerits
It is easy to understand and to It is not based on all values.
compute.
It is rigidly defined. It is affected by sampling fluctuations.
It is not affected by extreme values. It is not capable of further algebraic
treatment.
Example: Find quartile deviation and coefficient of quartile deviation for the
given grouped data also compute middle quartile.
Class f
1 10 3
11 20 16
21 30 26
31 40 31
41 50 16
51 60 8
f = N = 100
Solution:
Class f Cf
1 10 3 3
11 20 16 19
21 30 26 45 Q1 Class
31 40 31 76 Q2 & Q3 Class
41 50 16 92
51 60 8 100
N = 100
N 100
Q1 = 25
4 4
N / 4 Cf p
Q1 L.L. C.I .
fc
Where L.L= lower limit of the quartile class, C.I = Class width, fc = frequency
of quartile class, N = total frequency and C.fp= Cumulative frequency of
class preceding the quartile class.
Q1 = 20.5
10
25 19
26
Q1 = 22.80
N / 2 Cf p
Q 2 L.L. C.I .
fc
Q2 = 30.5
10
50 45
31
Q2 = 32.11
3N / 4 Cf p
Q3 L.L. C.I
fc
Q3 = 30.5
10
75 45
31
Q3 = 40.17
1
Q.D = (Q3 Q1) = 0.5 (Q3 Q1)
2
1
= (40.17 22.80)
2
= 8.685
Q 3 Q1
Coef. of Q.D =
Q 3 Q1
40.17 22.80
=
40.17 22.80
17.37
62.97
= 0.275
25, 30, 37, 43, 48, 54, 61, 67, 72, 80, 84, 89
S. No. Marks
1. 25
2. 30
3. 37
4. 43
5. 48
6. 54
7. 61
8. 67
9. 72
10. 80
11. 84
12. 89
N 1 12 1
th th
4 4
= 3rd item + 0.25 of item
= 37 + 0.25 (43 - 37)
Q1 = 38.5
Q3 = = = 9.75 item
th
4 4
= 9 + 0.75rd item
= 72 + 0.75 (80- 72)
Q3 = 78
1
QD = (Q3 Q1)
2
1
= (78 38.3)
2
QD = 19.75
Q 3 Q1
Coef. QD =
Q 3 Q1
78 38.5
= = 0.339
78 38.5
Sikkim Manipal University Page No.: 173
Probability and Statistics Unit 6
Example: Compute quartile deviation and its Coefficient for the data given
below:
X 58 59 60 61 62 63 64 65 66
f 15 20 32 35 33 22 20 10 8
Solution:
X f Cf
58 15 15
59 20 35
60 32 67 Q1 Class
61 35 102
62 33 135
63 22 157 Q3 Class
64 20 177
65 10 187
66 8 195
N = 195
( N 1) th
Q1 = size
4
195 1th
= size
4
Q1 = 48.78th size and corresponding to cf 67, which gives
Q1 = 60
Q3 =
3
N 1th size
4
=
3
196th 146.33th size.
4
SAQ 3: Compute the inter quartile range, Q.D and coefficient of Q.D for the
age distributions shown in table.
Age (Years) 18 21 22 24 27 30 32
Frequency 7 13 20 36 14 8 2
M .D( X )
(X i X)
N
M .D( X )
f (X X )
i
f
In case of continuous series X represents mid value of class-interval.
Merits Demerits
It is based on all values. It is not capable of further algebraic
treatment.
It is less affected by extreme values. It does not take into account negative
signs.
It is not affected much by sampling
fluctuations.
Example: Compute MD and Coeff. 0f MD from mean for the following data
21, 32, 38, 41, 49, 54, 59, 66, 68
Solution:
X d= xi x
21 26.55
32 15.55
38 9.55
41 6.55
49 1.45
54 6.45
59 11.45
66 18.45
68 20.45
x = 428 x i x = d= 116.45
x
i 1
i
428
x 47.35
N 9
xi x 116.45
MD = = 12.938
N 9
MD 12.938
Coefficient of MD = = = 0.272
x 47.55
Example: Following are the wages of workers. Find mean deviation from
median and its coefficient.
X 59 32 67 43 22 17 64 55 47 80 25
Wages 17 22 25 32 43 47 55 59 64 67 80
X Wages (f) x i Me = x i 47 f . xi Me
59 17 30 510
32 22 25 550
67 25 22 550
43 32 15 480
22 43 4 172
17 47 M 0 0
64 55 8 440
55 59 12 708
47 64 17 1088
80 67 20 1340
25 80 33 2640
11 1 th 11 1
Median = item = = 6 item = 47
th
2 2
8478
MD= = = 16.591
511
MD 16.591
Coefficient of MD = = 0.353
M edian 47
Solution:
X F d= x i Mode fd
20 6 100 600
40 19 80 1520
60 40 60 2400
80 23 40 920
100 65 20 1300
140 55 20 1100
160 20 40 800
180 9 60 5401
f xi Mode 9180
MD= = = 28.68
N 320
28.68
Coefficient of MD = = 0.239
120
Example: Find out the mean deviation from the data given below about its
median.
Salaries 40 50 50-100 100-200 200-400
No. of Employees 22 18 10 8 2
Solution:
No. of x(mid
X
Employees value)
cf d= x i Me fd
40 22 40 22 10 220
50 18 50 40 0 0
50-100 10 75 50 25 250
100-200 8 150 58 100 800
200-400 2 300 60 250 500
N= f = 60 f x i Me = 1770
N 1 60 1 61
th
f xi Median 1770
MD = = = 29.5
N 60
MD 29.5
Coefficient of MD = = = 0.59
M edian 50
The standard deviation is Root Mean Square (RMS) average of all the
deviations from the mean. It is denoted by sigma ().
Variance =
(X X ) 2
N (A)
= ( Variance)
Variance =
(X X ) 2
f
f ..(B)
= ( Variance)
In case of grouped data, alternative form for (A) & (B) are the followings
(d ) 2
F
= ( Variance)
fd 2 fd 2
For (B), Variance =
(C.F .) 2
f f
= ( Variance)
X A
Where, d : here, A is assumed mean
C.F
And C.F.= Class Width
Sikkim Manipal University Page No.: 181
Probability and Statistics Unit 6
n1 (12 d12 ) n 2 ( 2 2 d2 2 )
Variance = ; Variance
n1 n 2
Where, d1 = X X1 and d2 = X X 2
d1 = x 1 x
d2 = x 2 x
Example: The means of two samples of sizes 50 and 100 respectively are
54.1 and 50.3 and there standard deviations are 8 and 7 respectively.
x 2 = 50.3, 2 = 7
Solution:
n1 x1 n 2 x 2
x
(n 1 n 2 )
(50 x 54.1) (100 x 50.3)
x = 51.56
50 100
2 (n1 + n2) = n1 (12 + d12) + n2 (22 + d22)
d1 = x 1 x , d2 = x 2 x
d1 = 54.1 51.56
d1 = 2.54, d12 = 6.45
d2 = 50.3 51.56
d2 = - 1.26, d22 = 1.56
2 150 = 50 (82 + 6.45) + 100 (72 + 1.58)
32 = (64 + 6.45) + 2 (49 + 1.58)
32 = 70.45 + 2 x 50.58
= 7.56
Example: The mean wage is Rs. 75 per day, SD wage is Rs. 5 per day for a
group of 1000 workers and the same is Rs. 60 and Rs. 4.5 for the other
group of 1500 workers. Find mean and standard deviation for the entire
group.
n1 x1 n 2 x 2
Consider x
n1 n 2
1000 75 1500 60
i.e., x 66
1000 1500
Also we have,
(n1 + n2) 2 = n1 (12 + d12) + n2 (22 + d22),
where d1 = x 1 - x = 75 66 = 9; d2 = x 2 - x = 60 66 = -6
fd 2 fd 2
=
2 (C.F ) 2
f f
122 32 2
2 = (2) 2 1.525 0.16 4 5.46 (mm) = Variance
80 80
x 385
x = = 38.5
N 10
=
xi x
2
=
5320.5
= 23.066
N 10
SAQ 7: The average weight of 100 apples from area A is 150gms with
standard deviation of 10gms. Similarly the average weight of 200 apples
from area B is 200gms with standard deviation of 15gms. Find the
combined standard deviation.
Example: The runs scored by 3 batsman are 50, 48 and 12. Arithmetic
means respectively. The SD of there runs are 15, 12 and 2 respectively.
Who is the most consistent of the three batsman?
A B C
AM ( x ) 50 48 12
SD() 15 12 2
A 15
Solution: CVA = x 100 = x 100 = 30%
xA 50
B 12
CVB = x 100 = x 100 = 25%
xB 48
C 2
CVC = x 100 = x 100 = 16.66%
xC 12
Evaluation Criteria
Less CV indicates more constant player and hence more consistent player
is (Player C)
Example: Goals scored by two teams A & B in a foot ball season are as
shown below. By calculating CV in each, find which team may be
considered as more consistent.
Solution:
fx 56
xA= = = 1.056
N 53
fx 48
xB= = = 1.2
N 40
A
2 fx 2
N
x
2
=
150
53
1.056 1.715 = 1.30
2
A
B
2 fx 2
N
x
2
=
126
40
1.2 1.95 = 1.30
2
B
A 1.30
CVA = x 100 = x 100 = 123.8%
xA 1.056
B 1.30
CVB = x 100 = x 100 = 109%
xB 1.2
Example: The prices of x and y share A & B respectively state which share
more stable in its value.
x 530
xA= = = 53
N 10
x 1050
xB= = = 105
N 10
70
A 2.64
10
40
B B 2
10
A 2.64
CVA = x 100 = 100 = 4.98%
x 53
B 2
CVB = x 100 = 100 = 1.903%
x 105
x x
Solution: x i.e. 40
n 100
x (incorrect) = 4000
Now correct x = 4000 50 + 40 = 3990
3990
correct x = 39.9
100
Let us consider 2
x2
n
x 2
x2
5.12 40
2
100
x2 x2
i.e. 40 5.1 1626.01
2 2
or
100 100
x2 (incorrect) = 100 x 1626.01 = 162601
Now correct x2 = 162601 (50)2 + (40)2 = 161701
correct 2 = correct
x2
n
correct x 2
39.9 25
161701
i.e., correct 2 =
2
100
5
Now correct efficient of variation = 100 100 12.56%
x 39.9
Hence correct C.V. = 12.53%
x x
Solution: x i.e. 30 or x 630
n 21
incorrect x = 630
Now omitting the incorrect value 10,
New x = 630 10 = 620
n = 21 1 = 20
620
New x 31
20
Next consider 2
x2
n
x 2
x2
52 30
2
100
x2
i.e. 900 25
21
incorrect x 2 925 21 19425
Again omitting the incorrect value 10.
New x = 19425 (10)2 = 19325, n = 20
19325
(31) 2 5.25
20
New = 5.25 = 2.29
2.29
Now correct efficient of variation = 100 100 7.38%
x 31
Example: Find standard deviations of the two series shown in table. State
which series is more stable?
Series A 192, 288, 236, 229, 184, 160, 384, 291, 330, 243
Series B 31, 48, 13, 51, 38, 43, 50, 36, 47, 82
37
260 263.7
10
2
2 = 34247 37 (58.4) 2
10 10
58.4
Sikkim Manipal University Page No.: 191
Probability and Statistics Unit 6
58.4
CV% = 100 22.15%
263.7
The coefficient of variation for the data of series A is 22.15%.
The table displays the values required to calculate coefficient of variation for
data of series B.
2
Series B ( X) X
31 961
48 2304
13 169
51 2601
38 1444
43 1849
50 2500
36 1296
47 2209
82 6724
Total 439 22057
X 43.9
2
22057 439 2
=
2
2205 .7 ( 43.9) 2205 .7 1927 .21 278.49
10 10
278.49 16.68802
16.69
CV.% = 100 38.0154 %
43.9
The series A is more stable, since the CV for series A (22.15) is less than
the CV for series B (38.02).
SAQ 9: In two factories A and B engaged in the same industry, the average
monthly wages and standard derivatives are as follows:
Factory Average Monthly wages S.D. of wages No. of wage Earners
A 4600 500 100
B 4900 400 80
SAQ 10: Find the sum and sum of squares of 100 items whose mean is 50
and S.D. is 5. Also find the coefficient of S.D. and the co-efficient of variation
6.4 Summary
In this unit, we studied about different measures of dispersion like Range
(R), Quartile Deviations (Q.D), Mean Deviations (M.D), Standard Deviations
(S.D), Coefficient of Range, Coefficient of Quartile Deviations,
Coefficient of Mean Deviations, Coefficient of Variations with examples.
2. From the following data given below determine the value of Range and
its coefficient.
Marks 5-10 10-15 15-20 20-25 25-30 30-35 35-40 40-45
No. of 2 2 4 5 5 6 15 10
students
3. From the following frequency distribution find the quartile deviation and
its coefficient.
Wages 12.50- 17.50- 22.50- 27.50- 32.50- 37.50- 42.50- 47.50-
17.50 22.50 27.50 32.50 37.50 42.50 47.50 52.50
Freq. 1 6 4 3 14 19 22 2
4. From the following random variable find the range, coefficient of range,
quartile deviation and its coefficient.
X 0 1 2 3 4 5 6 7 8
F 1 3 5 7 9 11 13 15 17
5. From the following series find the mean deviation from the mean and its
co-efficient
Grades 0-5 5-10 10-15 15-20 20-25 25-30 30-35
Freq. 2 7 10 12 9 6 4
6. Determine the mean deviation and its coefficient from median for the
following continuous series
Marks 0-10 10-20 20-30 30-40 40-50 50-60 60-70
No. of
4 8 11 15 12 6 3
students
Calculate the average breaking strength of the alloy and the standard
deviation.
10. The following distribution shows the turnover of the branches of a group
of a group f multiple shops in March 2010
Turnover (in Rs. Lakhs) No. of shops
5 and under 10 8
10 and under 15 18
15 and under 20 42
20 and under 25 62
25 and under 30 30
30 and under 35 10
35 and over 4
Using assumed mean of Rs. 22,500, calculate (i) mean, (ii) S.D.,
(iii) Coefficient of variation.
6.6 Answers
Self Assessment Questions
R 36
1. Range = LV SV = 42 6 = 36 , CR = = = 0.75
LV SV 48
2. Largest value is 24 and smallest value is 12
Range = LV SV = 24 12 = 12,
R 12
Coefficient of range = = = 0.33
LV SV 36
3. The table shows the cumulative frequency distributions for the age
distributions.
Age (Years) Frequency Cumulative Frequency
18 7 7
21 13 20
22 20 40
24 36 76
27 14 90
30 8 98
32 2 100
Total 100
100 1th
Q1 value 25.25th value
4
Q1 22
3(100 1)th
Q3 value 75.75th value
4
Q3 = 24
Therefore, the inter quartile range, Q3 Q1 = 24-22 = 2 Yrs.
24 22
Q . D. = 1 year
2
24 22 2
Coefficient of Q.D.
24 22 26
4.
Weekly wages No. of workers c.f
Below 1350 8 8
1350-1370 16 24
1370-1390 39 63
1390-1410 58 121
1410-1430 60 181
1430-1450 40 221
1450-1470 22 243
1470-1490 15 258
1490-1510 15 273
1510-1530 9 282
1530 & above 10 292
N = 292
Where L.L= lower limit of the quartile class, C.I = Class width,
fc = frequency of quartile class, N = total frequency and C.fp=
Cumulative frequency of class preceding the quartile class.
Q2 = 1410
20
146 121
60
Q2 = 1418.333
N
Q1 =
4
Q1 lies in the class 1390-1410
N / 4 Cf p
Q1 L.L. C.I .
fc
Q1 = 1390
20
73 63
58
Q1 = 1393.448
Q3 = Size of 3N / 4 th observation = 3(292) / 4 = 219 th observation
So, Q3 lies in the class 1430 - 1450
3N / 4 Cf p
Q3 L.L. C.I
fc
Q3 = 1430
20
219 181 = 1449
40
1
Q.D = (Q3 Q1) = 0.5 (1449 1393.448)= 27.776
2
Q 3 Q1
Coef. of Q.D =
Q 3 Q1
1449 1393.448 55.552
= 0.020
1449 1393.448 2842.448
5.
1152
Mean( X ) 144 cms.
8
22
Mean deviation from mean = N
8
2.75
2.75
Coefficient of MD ( X) = 0.0191
144
(8 1)th
Median is value = 4.5th value
2
Median = 143 + 0.5(144 143) = 143.5 cms
20
Mean deviation from median = 2.5
8
2.5
Coefficient of MD ( X) = 0.01742
143.5
The mean deviation from median (2.5 cms) is less than that of the mean
deviation from mean (3.75 cms).
6.
Mid X 28
Efficiency
Frequency value d fd f X - 24 Cf X Med)
fX
Index 4 Med)
X
( X) A
fd h 28 (65) 4 24
f 65
160
M.D.( X) 2.46
65
N 65
Median th Value th Value 32.5th Value
2 2
Median class is 22 26
32.5 20 12.5 50
Median 22 4 22 4 22 22 1.66 23.66
30 30 30
168
M .D. ( Median ) 2.58
65
2.46
Coefficien t of M..D.(X) 0.1025
24
2.58
Coefficient of M.D.( from Median ) 0.1091
23.6
Therefore, the mean deviation and coefficient of mean deviation from
mean are 2.46 and 0.1025 respectively. The mean deviation and
coefficient of mean deviation from median are 2.58 and 0.1091
respectively.
Combined Average = n1 X1 n 2 X 2
n1 n 2
fd 2 fd 2
Variance =
(C.F .) 2
f f
= = 45.2434
Standard deviation =
9. (i) For finding which factory A or B pays larger amount as monthly
wages, we have to compare the total wage bill.
Factory A: Total wage bill = (4600) (100) = Rs 4, 60,000
Factory B: Total wage bill = (4900) (80) = Rs 3, 92, 000
Hence factory A pays larger amount as monthly wages.
(ii) For determining which factory shows greater variation in the
distribution of wages, we have to compare coefficient of variation
C. V (factory A) = = = 10.87
C. V (factory B) = = = 8.16
Or
And C. V (2) =
Or 22.5
11. Given
We know,
(iii) Co-efficient of =
Terminal Questions
1. Range = 15, Coefficient of range = 0.42
2. Range = 45 - 5 = 40, Coeff. Of Range = 0.80
3. Q.D. = 5.04, Coefficient of Q.D. = 0.13
4. Range = 8, Coefficient of range = 1, Quartile Deviation = 1.5,
Coefficient of Quartile deviation = 0.27
5. 6.372, 0.36
6. 12.35, 0.36
7. Average breaking strength = 49.025, S.D = 1.96
8. 4.99
9. 34.6%
10. Mean = 21.35, S.D. = 6.375, C.V. = 29.86%