Beruflich Dokumente
Kultur Dokumente
Control Systems Research Group, School of Engineering, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, UK
b
Department of Automation, Northeast University at Qinhuangdao, China
Available online 29 March 2007
Abstract
A stable self-learning PID proportional integral derivative control scheme for multivariable nonlinear systems with unknown
dynamics is proposed in this paper. The control scheme is based on a neural network (NN) model of the plant. The NN model is adapted
by an extended Kalman lter (EKF) to learn plant dynamic change, while the PID control parameters are adapted by the Lyapunov
method to minimize squared tracking error. Therefore, the model output is guaranteed to converge to the desired trajectory
asymptotically, and the plant output also tracks the desired trajectory due to model adaptation. The proposed scheme is evaluated by
applying it to a simulated multivariable continuous stirred tank reactor (CSTR). The self-learning PID controller is also compared with a
xed parameter PID controller for a single-input single-output CSTR and the superiority of the self-learning PID is demonstrated.
r 2007 Elsevier Ltd. All rights reserved.
Keywords: Self-learning PID; Adaptive NN models; Nonlinear systems; EKF; CSTR
1. Introduction
It is well known that PID proportional integral
derivative controllers have dominated industrial control
applications for a half of century, although there has been
considerable research interest in the implementation of
advanced controllers. This is due to the fact that the PID
control has a simple structure that is easily understood by
eld engineers and is robust against disturbance and system
uncertainty. As most of the industrial processes demonstrate nonlinearity in the system dynamics in a wide
operating range, different self-tuning PID control strategies
have been investigated in the past two decades.
Relay feedback is a simple and reliable test that keeps the
process output under closed-loop control and makes it
close to the operating point. Astrom and Hagglund (1984)
combined the strengths of both PID and relay control and
invented the relay auto-tuner for a single-input singleoutput (SISO) PID controller. The tuner has been widely
and successfully applied in industry, and further research
Corresponding author. Tel.: +44 151 231 2229; fax: +44 151 298 2624.
ARTICLE IN PRESS
1578
1
.
1 ezk
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
wh1 T
..
.
whq T
7
7
7;
5
6
6
y^ y 6
4
wy1 T
..
.
wyp T
3
7
7
7
5
(6)
yk 1 yk,
(7)
^ em k,
yk yk
(8)
where
^
^
yk
gyk;
xk W
"
C w k
.
qyy yyk1
^
^
qyh yyk1
qy yyk1
^
(17)
The rst term in the matrix in right-hand side of Eq. (17),
can be expressed as
^
^
qyk
qyk
qhk qzk
where whi is the ith row vector in W h and wyi is the ith row
vector in W y . Let the desired network model, for which the
network to be trained is learning to achieve, have a
parameter vector y that is dened in the same way as the
above, then this target network model is described as
1579
1 eW
#T
T
1
xT k 1T
(9)
and em k 2 Rp is the modelling error vector. The parameter vector of the target network, y is estimated by y^ that
is updated in every sample step with the EKF algorithm to
Eqs. (8) and (9) as
^
^ 1 K w kyk yj
^ kjk1 ,
yk
yk
(10)
E w k I K w kC w kE w k 1,
(11)
^
qyk
C w k
qy yyk1
^
hkjyyk1
^
1
^
Rk Rk 1 fyk ykj
^
yyk
k
T
^
yk ykj
^ Rk 1g.
yyk
1
1e
zkjyyk1
^
(13)
^ h k 1
zkjyyk1
W
^
15
16
6
6
6
6
6
4
^
qyk
^
qyy jyyk1
hkjyyk1
^
1
xk
(14)
qhi k
hi kjyyk1
1 hi kjyyk1
,
^
^
qzi k yyk1
^
12
where
^ 1; xk,
^ kjk1 g
^ yk
yj
where
i 1; . . . ; q,
K w k E w k 1C w kT
Rjkjk1 C w kE w k 1C w kT 1 ,
qq:n1
.
(19)
can be expressed as
qyy
T
^
h kjyyk1
..
.
01q1
1
01q1
..
..
.
^
h kjyyk1
1
7
7
7
7
7
5
.
ppq1
20
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
1580
ci qi t ct 2Ahko c t exp
,
dt
Ah
Rv
RT r t
(21)
ci, Ti(t),
h(t)
A
Rv
c(t), Tr(t)
Fig. 1. CSTR process.
dT r t
1
h
r sr qi tT i t T r t
dt
rr sr Ah r
Rv
EA
2
DHAhko c t exp
RT r t
U 1 T h T r t U 2 T r t T x .
22
T i t 2 273; 480(K).
(23)
ARTICLE IN PRESS
0.17
0.12
0.11
0.06
0.07
1
201
401
601
801
1001
1581
0.16
c(k) mol/l
c(k) mol/l
201
401
601
401
601
801
1001 sample
801
1001
sample
630
Tr(k) K
Tr(k) K
530
430
530
430
330
330
201
401
601
Process output
801
1001
NN output
sample
0.01
(24)
0.005
0
401
601
801
1001
eTr(k)
60
40
20
0
1
k4300,
201
401
601
801
1001
sample
0.008
ec(k)
201
sample
0.004
0
1
201
401
601
801
1001 sample
201
401
601
801
1001 sample
40
eTr(k)
sample
NN output
201
process output
ec(k)
20
0
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
1582
k
X
eT i
i1
eT k eT k 1
,
25
T
where eT k 2 Rp is the process tracking error dened as
K d k
eT k yd k yk
with yd k 2 Rp being the desired trajectory. T is the
sampling time, K p ; K i ; K d 2 Rmp are PID controller
parameter matrices and m is the number of input.
The objective of the proposed method is to achieve
tracking control when system dynamics change. To realize
this objective, the conguration of the control system is
proposed and displayed in Fig. 6.
In this system, a MLP model is updated on-line to
capture the time-varying dynamics of the process. Then,
the updated model is used in an iterative algorithm for
auto-learning of the PID controller. In this way an optimal
control is achieved by minimizing the squared error
between the desired trajectory and the model prediction.
The thick line in Fig. 6 denotes the information sharing
between the two MLP models.
In order to estimate the optimum PID parameters,
K p k, K i k and K d k, a parameter vector ypid k 2 R3mp
is formulated as
2
3
kp1 k ki1 k kd 1 k T
6
7
..
6
7
(26)
ypid k 6
7
.
4
5
T
kpm k kim k kd m k
where kpj k denotes the jth row in K p k and it is the same
for K i k and K d k, j 1; . . . ; m. An iterative algorithm is
to be developed and used to obtain the optimal control
variable in each sample period k by minimizing an
objective function of the predictive tracking error. Such
tuned PID parameters will be used at the end of the sample
period to produce a control variable. A new argument, i is
introduced to denote the iterative step within one sample
NN-based PID auto-tuning controller
eT (k)
eT|k
yd(k) yd
+-
PID
et(i) Auto-tuning u(i)
algorithm
y(i)
NN
model
u(k)
y(k)
Process
NN
model
(27)
(28)
^ g
^ ui;
^ X jk .
yi
(29)
^ is
yd jk 2 Rp is the desired output at sample time k and yi
the NN model output in iterative step i within the sample
^ is a function of the predicted optimum control
period. yi
^ 2 Rm ,
variable, ui
^ K p ieT jk K i iE pid jk K d i
ui
eT jk eT jk1
.
T
(30)
(31)
(32)
Fault, noise
z-1
y(k)
-
em(k)
DV i V i 1 V i
2dDet iT et i dDet i
"
#T
^
qyi
2d
K pid iet i
qy^ pid i
"
#
^
qyi
K pid iet i
et i d
qy^ pid i
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
"
#T
^
qyi
K pid i
2det i
qy^ pid i
"
#
^
qyi
K pid i et i.
I d
qy^ pid i
1583
34
(35)
P
T jk1
q K p ieT jk K i i eT jk K d i eT jk e
^
qui
T
qy^ pid i
qy^ pid i
3
2
Qpid 013p
7
6
6 .
.. 7
..
..
39
6
. 7
.
7
6
5
4
013p Qpid
with
.
^
^
^ qypid i
qypid i qui
(37)
^
^ for
The rst term in the right-hand side of (37), qyi=q
ui
the MLP network model is derived as follows:
^
^ qoi qzi
qyi
qyi
^
^
qui
qoi qzi qui
2
qo1 i
6 qz1 i
6
6
qW y oi
..
.
1 6
6 .
.
qoi 6 .
6
4
0
2
wy1;1
6
6 .
.
6
6 .
4
wyp;1
2
wh1;1
6
6 .
.
6
6 .
4
whq;1
..
..
^
ui
6
7
h
X jk 7
7 qW 6
4
5
7
7
1
.. 7
. 7
7
^
qui
qoq i 7
5
qzq i
3
2
qo1 i
y 3
0
w1;q 6 qz1 i
7
7
76
7
6
7
.. 76 ..
.. 7
..
7
. 76
.
.
.
7
56
6
qoq i 7
5
wyp;q 4 0
qzq i
3
wh1;m
7
.. 7
38
. 7
7,
5
whq;m
0
where
2
qoj i
oj i1 oj i;
qzj i
(36)
^
ui
6
7
j 1; . . . ; q; zi W h 4 X jk 5.
1
Qpid
3T
eT jk
P
6
7
eT jk
6
7
6
7 .
4 eT jk eT jk1 5
T
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
4
qi(k) l/sec
1584
3.5
3
2.5
c(k) mol
0.14
0.12
0.1
101
201
301
401
501
601
sample
201
301
401
501
601 sample
101
201
301
401
501
601 sample
Ti(k) K
360
260
Cai(t), q, Ti
v
h(t)
qc(t), T ci
C a(t), q, T
Fig. 9. SISO CSTR process.
500
Tr(k) K
101
460
460
420
380
1
101
201
301
desire trajectory
401
501
601
sample
process output
(40)
41
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
1585
ut qc t;
where umax and umin are the maximal and minimal values of
the input in the data set and the same applies to the output
y. The NN model is trained using the EKF algorithm
described in Section 2 in the iterative but off-line mode.
These initial training is the preparation for on-line training
of the model and the model-based PID control.
yt C a t.
T 448:752 (K)
qc(k) l/min
115
105
95
85
601
1201
1801
ys k
yk ymin
,
ymax ymin
with
eT t yd t yt
being the tracking error. The controller parameters K P , T I
and T D are designed initially using the ZieglerNichols
method and then are ne tuned with trial and error method
to minimize the integrated quadratic tracking error around
the middle operating point in the operating region,
C a 0:105. The PID controller parameters are obtained
as follows:
K P 6;
T I 39;
T D 9:75.
sample
0.15
uk umin
;
umax umin
0.12
Ca(k) mol/l
Ca(k) mol/l
^
yk
g^ 6:10:1 uk 1; . . . ; uk 3; yk 1; . . . ; yk 3.
0.11
0.1
0.1
0.09
1
0.05
1
601
1201
1801 sample
101
201
301
desire trajectory
401
501
601
Process output
sample
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
1586
0.15
Ca(k) mol/l
Ca(k) mol/l
0.12
0.11
0.1
0.09
101
201
301
desire trajectory
401
qc k 0:08qc k;
k4250.
0.11
0.09
501
601
sample
Process output
101
201
301
desire trajectory
401
501
601
sample
Process output
0.15
Ca(k) mol/l
0.13
0.13
0.11
0.09
1
101
201
301
desire trajectory
401
501
601
sample
Process output
ARTICLE IN PRESS
D.L. Yu et al. / Control Engineering Practice 15 (2007) 15771587
Davidson, E. J. (1976). Multivariable tuning regulators: The feed-forward
and robust control of general servomechanism problem. IEEE
Transactions on Automatic Control, 21, 3547.
Funahashi, K. (1989). On the approximate realization of continuous
mappings by neural networks. Neural Networks, 2, 303314.
Gawthrop, P. J. (1986). Self-tuning PID controller: Algorithms and
implementation. IEEE Transactions on Automatic Control, 31(3),
201209.
Gyongy, I. J., & Clarke, D. W. (2006). On the automatic tuning and
adaptation of PID controllers. Control Engineering Practice, 14(2),
149163.
Halevi, Y., Palmor, Z. J., & Efrati, T. (1997). Automatic tuning of
decentralized PID controllers for MIMO processes. Journal of Process
Control, 7(2), 119128.
Hang, C. C., Astrom, K. J., & Ho, W. K. (1993). Relay auto-tuning in the
presence of static load disturbance. Automatica, 29(2), 563564.
Hang, C. C., Astrom, K. J., & Wang, Q. G. (2002). Relay feedback autotuning of process controllersa tutorial review. Journal of Process
Control, 12(1), 143162.
Ho, W. K., Hang, C. C., & Cao, L. S. (1995). Tuning of PID controllers based
on gain and phase margin specications. Automatica, 31(3), 497502.
Ho, W. K., Hong, Y., Hansson, A., Hjalmarsson, H., & Deng, J. W.
(2003). Relay auto-tuning of PID controllers using iterative feedback
tuning. Automatica, 39(1), 149157.
Iiguni, Y., Sakai, H., & Tokumaru, H. (1992). A real-time algorithm for a
multi-layered neural network based on the extended Kalman lter.
IEEE Transactions on Signal Processing, 40(4), 959966.
Kim, S. M., & Han, W. Y. (2006). Induction motor servo drive using
robust PID-like neuro-fuzzy controller. Control Engineering Practice,
14(5), 481487.
Koivo, H. N., & Tanttu, J. T. (1991). Tuning of PID controllersurvey of
SISO and MIMO. Preprints of IFAC international symposium on
intelligent tuning and adaptive control, Singapore.
Lee, T. H., Hang, C. C., Ho, W. K., & Yue, P. K. (1993). Implementation of a knowledge-based PID auto-tuner. Automatica, 29(4),
11071113.
1587