Beruflich Dokumente
Kultur Dokumente
Many engineering design and analysis problems involve factors that are
interrelated and dependent. E.g., (1) runoff volume, rainfall; (2) evaporation,
temperature, wind speed; (3) peak discharge, drainage area, rainfall intensity;
(4) crop yield, irrigated water, fertilizer.
Due to inherent complexity of system behaviors and lack of full understanding
of the procedure involved, the relationship among the various relevant factors
or variables are established empirically or semi-empirically.
Regression analysis is a useful and widely used statistical tool dealing with
investigation of the relationship between two or more variables related in a
non-deterministic fashion.
If a variable Y is related to several variables X 1, X2, , XK and their
relationships can be expressed, in general, as
Y = g(X1, X2, , XK)
where g(.) = general expression for a function;
Y = Dependent (or response) variable;
X1, X2,, XK = Independent (or explanatory) variables.
Correlation
where <Cov(X,Y)<
and
(X,Y) .
Cases of Correlation
Perfectly linearly
correlated in opposite
direction
Uncorrelated in
linear fashion
Perfectly correlated in
nonlinear fashion, but
uncorrelated linearly.
Auto-correlation
Consider following daily stream flows (in 1000 m 3) in June 2001 at Chung Mei
Upper Station (610 ha) located upstream of a river feeding to Plover Cove
Reservoir. Determine its 1-day auto-correlation coefficient, i.e., (Qt, Qt+1).
Day (t) Flow Q(t)
1
8.35
2
6.78
3
6.32
4
17.36
5
191.62
6
82.33
7
524.45
8
196.77
9
785.09
10
562.05
600
500
400
300
200
100
0
Day
10
20
30
200
400
600
Autocorrelation
700
900
800
700
600
500
400
300
200
100
0
1.0
0.8
0.6
0.4
0.2
0.0
-0.2
-0.4
-0.6
-0.8
-1.0
800
1000
Regression Models
due to the presence of uncertainties a deterministic functional
relationship generally is not very appropriate or realistic.
The deterministic model form can be modified to account for
uncertainties in the model as
Y = g(X1, X2, , XK) +
where = model error term with E()=0, Var()=2.
In engineering applications, functional forms commonly used for
establishing empirical relationships are
Additive: Y = 0 + 1X1 + 2X2 + + KXK +
1
2
K
Multiplicative: Y 0 X1 X 2 ... X K
^ = error (residual)
ei = y i y
i
^
yi
yi
0
xi
n
D
2 y i 0 1 xi 1 0
i 1
0
n
D 2 y x x 0
i
0
1 i
i
1
i 1
i 1
x y
n
i 1
i 1
i 1
x
n
i 1
2
i
Normal equations:
x
i 1
yi
i 1
i 1
n
n
xi2 xi y i
i 1
i 1
i 1
xi 0
x y
i 1
xi 0
LS Solution (2 Unknowns)
i 1
i 1
y x
n
n
n
1
xi y i xi y i
n i 1 i 1
i 1
x 1
n
i 1
n
2
i
i 1
x y
n
i 1
n
x
i 1
2
i
nx y
nx
D= yi xi xi2 xik
n
minimize
i 1
, ,
D
0 , for j 0,1,2, , k
j
Set
i 1
i 1
i 1
2
i
i 1
n
k
i
i 1
yi
k 1
i
i 1
i 1
y i xi
x
i 1
k
i
x
i 1
k 1
i
x
i 1
2k
i
i 1
y i xik
Minimize D= yi xi x1 xk
, 1 ,, k
Set
i 1
D
0 , for j 0, 1, 2, , k
j
Normal equations:
i 1
i 1
i 1
i1
2
i1
x
i 1
ik
x
i 1
x
i 1
ik
x
i 1
ik
xi1
yi
i 1
x y i xi1
i 1
i1 ik
x
i 1
2
ik
i 1
y i xik
1
x
x
x
21
22
2
k
1 xn1 x n 2 x nk
y1
y
2
yn
1
2
k n
y=X+
in short
LS criterion is:
n
Set
X' ( y - X ) 0
X' X X' y
Measure of Goodness-of-Fit
R2 = Coefficient of Determination
n 2
i
i
1
1
n
2
y
y
i
i 1
LS Example