Beruflich Dokumente
Kultur Dokumente
STATISTICAL METHODS
Secular trends
Seasonal Variation
Cyclical variation
Random variation
Secular trend refers to the general tendency of the data and it is
knows as long period or secular trend. This can be upward or
downward, depending upon the behaviour.
Seasonal variations refer to changes which occur during a climatic
season or a festival season. It may be that of festival season like
1986
1987
1988
1989
1990
45
52
48
55
60
In the graph, it can be seen that sales have fallen in 1988 and then have
increased. It shows that sales are fluctuating. But during the whole
period, the sales trend is upward.
The common technique used in constructing the line of best fit is by the
method of least squares. The trend is assumed to be linear.
The equation for the straight-line trend is: Y = a+ bx, where a is the
intercept and b shows the impact of the independent variable. We
have to estimate the values of a and b. This we do by least square
equations which incidentally enables us to fit the trend line as well.
In the above table, which is a time series data set, sales are the
dependent variable Y and t or time or the years are the independent
variable. Since sales vary with time, the time periods will be the
independent variable X.
The Y intercept and the slope of the line are found by making
appropriate substitutions in the following normal (least square)
equations.
Y = na + b x. (1)
XY = a x + b x2. (2)
Year
1986
1987
1988
1989
1990
n=5
Sales Rs. In
Lakhs
45
52
48
55
60
y 260
X
1
2
3
4
5
x=15
X2
XY
1
4
9
16
25
2
x =55
45
104
114
220
300
xy=813
(2)
Moving Averages
The MA can also be used to develop trend lines.
The calculations depend upon whether the period should be odd or
even.
In the case of odd periods like (5,7,9), the average of the observations is
calculated for a given period and the calculated value is written in front
of central variable for the period, say 5 years. The average of the values
of five years is calculated and recorded against the third year. In the
case of five yearly moving averages, the first two years and the last two
years of the data will not have any average value.
If the period of observation is even, say four years, then the average of
the four yearly observations is written between second and third year
values. After this, centering is done by finding the average of paired
values.
Take the following examples:
The following are the annual sales of clothes during the period of 19932003. How will we find out the trend of the sales using (1) 3 yearly
moving averages, (2) 4 yearly moving averages and (3) forecast the
value for 2005.
Year
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
Demand Analysis
Solution: 3 yearly period:
e.g: The value of 1994 + value of 1995 + value of 1996
12 + 15 +14 = 41 written at the capital period 1995 of the years 1994,
1995 and 1996
Year
Sales in
Rs. Lakhs
3 Yearly
moving total
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
12
15
14
16
18
17
19
20
22
25
24
(-)
41
45
48
51
54
56
61
67
71
-
3 Yearly moving
Average
trend values
41/3=13.7
45/3=15
48/3=16
51/3=17
54/3=18
56/3=18.7
61/3=20.2
67/3=22.3
71/3=23.7
-
Year
Sales in Rs.
Lakhs
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
12
15
14
16
18
17
19
20
22
25
24
4 yearly
Moving total
moving total of pairs of
yearly total
57
63
65
70
74
78
86
91
-
120
128
135
144
152
164
177
-
4 yearly
moving
average
trend values
120/8 =15
128/8 =16
135/8 =16.9
144/8 =18
152/8 =19
164/8 =20.5
177/8 =22.1
-
Regression method
The sales of any commodity does not depend only on time . It may be
associated with competitors, advertising, ones own advertising, change
in population, income and size of the family, environmental factors, etc.
The nature of relationship between these factors can be used and future
sales can be forecast. Regression analysis denotes methods by which the
relationship between quantity demanded and one or more independent
variables (income price of the commodity, prices of related commodities
advertisement expenses, etc.) can be estimated.
Trend Projection by Regrestion Method
1998
1999
2000
2001
2002
240
280
240
300
340
From this data project the sales for 2003, 2004, 2005.
Solution
10
First we have to calculate required values. They are (i) Time Deviation,
(ii) Deviation Squares, (iii) Product of time deviation and sales.
Year
N
Sales
(Rs. In
crores)
(y)
1998
240
1999
280
2000
240
2001
300
2002
340
X=5
y=1400
The equation is y=a+bx.
Time
deviation
TD squared
from middle
year 2000
(x)
(x2)
-2
4
-1
1
0
0
+1
1
+2
4
x =0
x2=10
Product of
Time
Deviation and
sales
(xy)
-480
-280
-760
0
+300
+680 +980
xy=220
12
Example
Fit a linear regression line to the following data and estimate the
demand at price Rs.30.
Year 1981 82
Price
15 15
Pi
Sales
Si in 52
46
1000
Units
83
12
84
26
85
18
86
12
87
8
88
38
89
26
90
19
91
29
92
22
38
37
37
37
34
25
22
22
20
14
Solution:
Find the values of a and b and then the following table is constituted:
Pi
15
15
12
26
18
12
8
38
26
19
29
22
Pi 240
Si
52
46
38
37
37
37
34
25
22
22
20
14
Si 384
Pi2
225
225
144
676
324
144
64
1444
676
361
841
484
2
Pi 5708
Si2
2704
2116
1444
1369
1369
1369
1156
625
484
484
400
196
2
Si 13716
SiPi
780
690
456
962
666
444
272
950
572
418
580
308
SiPi7098
Leading Indicators
There are leading indicators, which serve as useful guide for forecasting
the demand for some products. The sales of baby milk can be forecast
with the help of birth of children in the past five years. There is a
correlation between the demand for baby milk and the birth rate of
children. If the indicator is known, the sale forecast may be made on
that basis.
Coincident Indicators
14
There are certain indicators which coincide with the rise or fall of
general economic activity. The coinciding indicators are the gross
national product index of industrial production, retail sales and labour
force in the economy.
Diffusion Indices
These indices help the forecasters in relying on the leading indicators
used. These indices move up and down behind some other series.
The following indicators are used in assessing the future demand for
certain products:
Indicator
Demand for the Product
Construction Contracts
Demand for building materials
Increased prices of agricultural Demand for agricultural inputs
commodities
A rise in disposable income
Demand for consumer goods
Automobile registration
Demand for petrol
In the barometric method, it is necessary to find out the existence of any
relation between the indicator and the demand for the product. After
establishing the relationship through the method of least squares, we
have to derive the regression equation. If the relationships is linear, the
equation will be Y = a +bx. Once the equation is derived, the demand
for the product can be estimated.
15
From this data, compute the possible demand for the year 2005, if the
index would be 250
Solution
Regression equation of y on x (Y = Sales & X = DPI index)
Y= a +bx Adopting simultaneous equations, find out the value of a
and b
y =na +b x (i)
xy = ax + b x2 (ii)
Year
DPI Index
(x)
1994
1995
1996
1997
1998
N=5
100
120
150
160
220
x =750
Sales of
Square of
Product of
2
Refrigerator
DP II (X )
DPI and
(y)
Sales (xy)
100
10,000
10,000
140
14,400
16,800
150
22,500
22,500
170
25,600
27,000
200
48,400
44,000
2
y= 760
x =1,20,900 xy=1,20,500
xy = ax + b x2 (ii)
Computation of Values
By attributing values to the above equations
760 = 5a +750b (1)
1,20,500 =750a + 120, 900 b (2)
to bring equations to common base multiply it by 150
114000 =750a + 112500 b
120500 =750a + 120900 b
-6500 = 8400b
thus b = 6500 8400 = 0.77
Substituting the value of b in equation (iii)
760 =5a + 750 (0.77)
760 = 5a + 577.5
5a = 182.5
a =36.5
Thus Y = 36.5 +0.77 X
If the index would be 250 for 1999 the number of refrigerators sold
would be: =36.5 + 0.77 (250)=36.5 + 192.5 = 229 numbers
---------------------------
17
18
where
S
^
Residual (Si - Si)
A
B
^ ^ ^
Si = b0 + b1 Pi
19
This means that we cannot reject the hypothesis that the true parameter
b equals 0.
Therefore we may conclude that our estimate is not statistically
significant.
However if the t statistic is greater than 1.96 in absolute value we
reject the hypothesis that b = 0 and call our estimate statistically
significant.
- 0.42P
(0.13)
- 3.23
+0.046I
(0.006)
7.67
- 0.84R
(0.32)
- 2.63
We can calculate the SD from this and also comment on its statistical
significance
Interpretation
The SE is given within brackets. For instance for P, SE = 0.13
which is very small relative to coefficient 0.42. In fact we can be 95%
certain that the true value of the price coefficient is 0.42 + or 1.96
SE or 1.96 * 0.13 = 0.25 +. (The true value of the coefficient could
range between 0.17 and 0.67.)
In this range there is no 0. Therefore the effect of price is both
significantly different from 0 and negative. (It is also statistically
significant)
The t value of 3.23 for P is equal to 0.42 / 0.13. Because this
exceeds 1.96 in absolute value, we conclude that the price is a significant
determinant of sales.
21
1991
1992
1993
1994
1995
45
52
48
55
60
Y = 42.1 + 3.3X
(1)
Here we can estimate the SE of 3.3 by first finding out the SD of the
estimated parameters and dividing the SD by 5 (number of
22
23