Beruflich Dokumente
Kultur Dokumente
Communicated by H. J. Pesch
1. Introduction
Germany.
595
0022-3239/04/1200-0595/0 © 2004 Springer Science+Business Media, Inc.
596 JOTA: VOL. 123, NO. 3, DECEMBER 2004
(SSC) in a possibly weak form which takes into account the strongly active
constraints. In the analysis, the regularity of the control as well as of the
associated Lagrange multipliers plays a crucial role. We consider the case
where the control is a continuous function and we impose constraint qual-
ifications which ensure existence, uniqueness, and regularity of the nor-
mal Lagrange multipliers. By these constraints qualifications, the analysis
is confined to first-order state constraints (cf. Hartl, Sethi, and Vickson,
Ref. 3). The relevant regularity results in Malanowski (Ref. 4) are recalled
in Section 2. Note that, in the regularity analysis, a Lagrangian in the
indirect form is more convenient, while in the analysis of sufficiency in
Sections 3 and 4, a Lagrangian in the direct form is used; see Ref. 3 for
the definitions.
Following the approach in Refs. 1–2, we formulate in Section 3 a
second-order sufficient optimality condition in terms of the solution to a
Hamilton-Jacobi inequality. A quadratic form is chosen as a candidate for
the solution to the HJ inequality and conditions are derived under which
this quadratic form is the needed solution. These conditions involve the
previous constraint qualifications as well as some pointwise coercivity con-
ditions with respect to the control (Legendre-Clebsch condition) and state
variable. The latter is expressed in terms of the solvability of an auxiliary
matrix Riccati equation. These conditions have to be satisfied uniformly
with respect to the time, which is treated as a parameter. An important
point is that the obtained coercivity conditions are weakened by taking
into account all the strongly active constraints. At that point, for mixed
constraints, we follow the approach used in Maurer and Pickenhain (Ref.
2); see also Zeidan (Ref. 5) and Malanowski and Maurer (Ref. 6). How-
ever, this approach is not applicable to pure state-space constraints. To
cope with them, a different concept is used, which leads in Section 4 to a
modified form of the Riccati equation. We believe that this form is orig-
inal; it constitutes the main contribution of this paper to the sufficiency
analysis for optimal control problems. In Section 5, two numerical exam-
ples are presented.
2. Preliminaries
where
ϕ : Rn × Rm → R, ψ : Rn × Rn → R, f : Rn × Rm → Rn ,
ξ : Rn × Rn → Rd , θ : Rn × Rm → Rk , ϑ : Rn → Rl .
Remark 2.1. The Lagrangian (2) is in normal form; i.e., the Lag-
range multiplier corresponding to the functional F (x, u) is equal to one.
The Lagrangian is in the so-called indirect or Pontryagin form with an
absolutely continuous adjoint function q; cf. Section 7 in Hartl, Sethi, and
Vickson (Ref. 3) as well as Hager (Ref. 7) and Neustadt (Ref. 8). The state
constraints are considered in W 1,∞ (0, T ; Rl ), rather than in C(0, 1; Rl ).
This form is convenient in the analysis of existence and regularity of the
Lagrange multipliers (cf. Malanowski, Ref. 4). In the sequel, we still intro-
duce another form of the Lagrangian.
Let (x0 , u0 ) with u0 ∈ C(0, 1; Rm ) be a fixed pair that is admissible for
problem (O). We are going to analyze conditions under which (x0 , u0 ) is
598 JOTA: VOL. 123, NO. 3, DECEMBER 2004
a locally isolated local solution to problem (O). For the sake of simplic-
ity, the functions evaluated at (x0 , u0 ) will be denoted by the subscript 0,
e.g.,
Remark 2.2. Similar techniques can be used in the more general sit-
uation of piece-wise continuous control. In that case, jumps can appear in
the solutions of the adjoint equation and the Hamilton-Jacobi inequality.
Accordingly, the assumed regularity of the matrix function Q in (26) has
to be modified.
Di ξ := Dx(i) ξ, i = 0, 1.
JOTA: VOL. 123, NO. 3, DECEMBER 2004 599
Similarly,
Dij2 ξ := Dx(i)x(j
2
) ξ, i, j = 0, 1.
i.e., pointwise along the trajectory (x0 , u0 ), the gradients of all the active
constraints (control-state and state) are linearly independent, uniformly on
[0,1].
The following result is proved in Ref. 4, Theorem 4.3.
Dx L̃(x0 , u0 , q0 , κ0 , ρ0 , µ0 ) = 0, (9)
Du L̃(x0 , u0 , q0 , κ0 , ρ0 , µ0 ) = 0, (10)
κ0 ∈ K + ,
(κ0 , θ (x0 , u0 )) = 0, (11a)
µ0 (0), ϑ(x(0)) + (µ̇0 , Dx ϑ(x0 )f (x0 , u0 )) = 0, µ0 ∈ L̃+ , (11b)
where
Corollary 2.1. If (A1), (A2), (B) hold, then ẋ0 , u0 , q̇0 , κ0 , µ̇0 are Lips-
chitz continuous on (0, 1).
: R n × Rm × R n × Rk × Rl →
We introduce also still another Hamiltonian H
R,
0 , u0 , p0 , κ0 , ν0 )
0 = p˙0 + Dx H(x
= ṗ0 + Dx f (x0 , u0 )∗ p0 + Dx ϕ(x0 , u0 )
+Dx θ (x0 , u0 )∗ κ0 + Dx ϑ(x0 )∗ ν0 , (16)
0 = p0 (0) + Dx(0) ψ(x0 (0), x0 (1))
+Dx(0) ξ(x0 (0), x0 (1))∗ ρ0 + Dx ϑ(x0 (0))∗ σ00 , (17)
0 = −p0 (1) + Dx(1) ψ(x0 (0), x0 (1))
+Dx(1) ξ(x0 (0), x0 (1))∗ ρ0 + Dx ϑ(x0 (1))∗ σ01 , (18)
0 , u0 , p0 , κ0 , ν0 )
0 = Du H(x
= Du ϕ(x0 , u0 ) + Du f (x0 , u0 )∗ p0 + Du θ (x0 , u0 )∗ κ0 , (19)
κ0 , θ (x0 , u0 ) = 0, (ν0 , ϑ(x0 )) = 0,
σ00 , ϑ(x0 (0)) = 0, σ01 , ϑ(x0 (1)) = 0,
where
In particular,
that is,
0 , u0 , p0 , κ0 , ν0 ),
Du H̃(x0 , u0 , q0 , κ0 , µ˙0 ) = Du H(x (20b)
2 2
Duu H̃(x0 , u0 , q0 , κ0 , µ˙0 ) = Duu H(x0 , u0 , p0 , κ0 , ν0 ). (20c)
BX (x) := {y ∈ X|
y − x
X ≤ },
The following theorem can be proved in the same way as Theorem 3.1
in Ref. 2; see also Assertion 2 in Ref. 1.
F (x, u) ≥ F (x0 , u0 ) + c
(x, u) − (x0 , u0 )
2X2 (25)
∞
holds for all admissible pairs (x, u) ∈ BX (x0 , u0 ); i.e., (x0 , u0 ) is a locally
isolated weak local minimizer.
where e ∈ W 1,∞ (0, 1; R), p0 ∈ W 1,∞ (0, 1; Rn ) is the adjoint function given in
(16), while Q(t) is a symmetric matrix and Q ∈ W 1,∞ (0, 1; Rn×n ). To sim-
plify notation, introduce the function F0 : R1+n+m → R defined by
The function e is chosen in such a way that the HJ equality (23) holds
along (x0 , u0 ):
i.e.,
ė(t) = p0 (t)∗ x˙0 (t) − H0 (t).
We are going to find conditions under which (28) is satisfied. To this end,
for almost all t ∈ [0, 1], consider the following mathematical program:
Hence, choosing
i.e., the Lagrangian l assumes a stationary point at (x0 (t),u0 (t),κ0 (t),ν0 (t)).
On the other hand, condition (8) ensures that the gradients of all con-
straints of (P(t)) active at (x0 (t), u0 (t)) are linearly independent, uniformly
with respect to t ∈ [0, 1]. Hence, the multipliers κ0 (t) and ν0 (t) are defined
uniquely. For a fixed α ≥ 0, introduce the following sets:
where I0 (t) and J0 (t) are given in (4). These are the sets of those active
constraints for which pointwise strict complementarity is satisfied with
margin α. In view of definitions (27) and (29), we obtain
2
D(x,u),(x,u) l(t, x0 (t), u0 (t), κ0 (t), ν0 (t))
Q̇ + QDx f0 + Dx f0∗ Q + Dxx 2 H 0 Dxu 0 + QDu f0
2 H
= 2 H0 + Du f ∗ Q 0
2 H (t). (31)
Dux 0 Duu
Obviously,
Moreover, Conditions (A1) and (A2) imply (see Lemma 4.1 in Ref. 4) that
the matrix
D0 ξ(x0 (0), x0 (1)) D1 ξ(x0 (0), x0 (1))
has full row-rank; (36)
ϒ0 (0) ϒ0 (1)
i.e., the gradients of all constraints of (P(0,1) ) active at (x0 (0), x0 (1)) are
linearly independent. Note that, in view of (34) and (35), we have
2 2
D00 0 = Q(0) + D00 (ψ(x0 (0), x0 (1))
+ξ(x0 (0), x0 (1))∗ ρ0 + ϑ(x0 (0))∗ σ00 ), (37a)
2
D01 2
0 = D01 (ψ(x0 (0), x0 (1)) + ξ(x0 (0), x0 (1))∗ ρ0 ), (37b)
2 2
D11 0 = −Q(1) + D11 (ψ(x0 (0), x0 (1))
+ξ(x0 (0), x0 (1))∗ ρ0 + ϑ(x0 (1))∗ σ01 ). (37c)
In view of (36), Condition (A3) implies that there exist > 0 and c > 0
such that
Theorem 3.2. Suppose that (A1), (A2), (B) hold and there exists a
symmetric matrix function Q ∈ W 1,∞ (0, 1; Rn×n ) such that Conditions (C)
and (A3), together with (31) and (37), are satisfied. Then, there exist c > 0
and > 0 such that
F (x, u) − F (x0 , u0 ) ≥ c
(x, u) − (x0 , u0 )
2X2 ,
∞
for all admissible (x, u) ∈ BX (x0 , u0 ); i.e., (x0 , u0 ) is a locally isolated
strong local minimizer of problem (O).
606 JOTA: VOL. 123, NO. 3, DECEMBER 2004
In this section, we are going to show that Conditions (C) and (A3) of
positive definiteness can be expressed in terms of the solvability of an aux-
iliary matrix Riccati differential equation along with appropriate boundary
conditions. In view of (31), conditions (32) and (33) in (C) take the form
∗ ∗ Q̇ + QDx f0 + Dx f0∗ Q + Dxx
2 H0 Dxu 0 + QDu f0
2 H y
[y , v ] 2 H0 + Du f ∗ Q
2 H (t)
Dux 0 D uu 0 v
≥ δ(|y|2 + |v|2 ), ∀(y, v) ∈ Rn+m such that (38)
α (t)v = 0,
α (t)y +
(39)
ϒα (t)y = 0, (40)
where
α (t) := Dx θ0i (t) , (41a)
+
i∈Iα (t)
α (t) := Du θ i (t)
, (41b)
0 +
i∈Iα (t)
α (t) := Dx ϑ j (t)
ϒ . (41c)
0 +
j ∈Jα (t)
We will derive a weak form of the Riccati equation, where the con-
straints (39) and (40) are taken into account. For this purpose, we need
the following lemma; see Theorem 1 in Haynsworth (Ref. 11). Here, π(S)
denotes the number of positive eigenvalues of a symmetric matrix S.
First, let us recall the strong form of the Riccati equation, which is
obtained if we require that the inequality (38) is valid for all (y, v) ∈ Rn+m
ignoring the constraints (39) and (40). To this end, we need the following
strong form of the Legendre Clebsch condition;
2
v, Duu H0 (t)v ≥ γ |v|2 , for some γ > 0 and ∀v ∈ Rn , t ∈ [0, 1]. (42)
JOTA: VOL. 123, NO. 3, DECEMBER 2004 607
2
A = Q̇ + QDx f0 + Dx f0∗ Q + Dxx H0 ,
∗ 2
B = Dxu H0 + QDu f0 ,
2
C = Duu H0 ,
and applying Lemma 4.1, we find that (32) is satisfied for all (y, v) ∈ Rn+m
if there exists δ > 0 such that
But this holds if the following Riccati equation has a bounded solution:
2
Q̇ + QDx f0 + Dx f0∗ Q + Dxx H0
2 2 −1 2
−(Dxu H0 + QDu f0 )(Duu H0 ) (Dux H0 + Du f ∗ Q) = 0. 0 (43)
This property follows from a well-known stability result for ODEs (see
the proof of Proposition 4.1 below). Essentially, this strong form of the
Riccati equation has been used in all the examples where a test for SSC
has been performed and also in sensitivity analysis (Refs. 12–15). To get a
weak form of the Riccati equation, we take into account the constraints
(39) and (40). This can be done by projecting the matrix in (38) onto the
subspace defined by (39) and (40). Note that, in view of (A1), the matrices
α (t) and ϒ
2 ı(t)
in terms of
v2 ∈ Rn−ı(t) and y:
where
−1 −1
R1 (t) = −
1α (t)
α (t), R2 (t) = −
1α (t) 2α (t).
where
P ∗ [Q̇ + QA11 + A∗11 Q + B11 ]P P ∗ [QA12 + B12 ]
M= , (46a)
[A∗12 Q + B12
∗ ]P C22
A11 = Dx f0 + Du1 f0 R1 , (46b)
2
B11 = Dxx 2
H0 + Dxu1
H0 R1 + R1∗ Du21 x H0 , (46c)
A12 = Du1 f0 R1 + Du2 f0 , (46d)
2
B12 = Dxu1
2
H0 R2 + Dxu2
H0 + R1∗ Du21 u1 H0 R2 + R1∗ Du21 u2 H0 , (46e)
C22 = R2∗ Du21 u1 H0 R2 + R2∗ Du21 u2 H0 + Du22 u1 H0 R2 + Du22 u2 H0 . (46f )
v2 , C22 (t)v2 ≥ γ |v2 |2 , ∀v2 ∈ Rm−ı(t) and a.a. t ∈ [0, 1]. (47)
Proposition 4.1. Suppose that (A1), (A2), (A4) hold. Let S ∈ L∞ (0, 1;
Rn×n
) be a symmetric matrix function, such that
−1
Q̇ + QA11 + A∗11 Q + B11 + (QA12 + B12 )C22 (A∗12 Q + B12
∗
)+S =0 (49)
has a solution uniformly bounded on (0, 1). Then, (45) is satisfied for a.a.
t ∈ (0, 1).
JOTA: VOL. 123, NO. 3, DECEMBER 2004 609
Simple calculations show that R(t) is the Schur complement of the subm-
atrix C22 − δI of the matrix M(t) − δI, where M(t) is defined in (46).
Clearly, (51) yields that
π(R(t)) = n − (t).
F (x, u) − F (x0 , u0 ) ≥ c
(x, u) − (x0 , u0 )
2X2 ,
∞
for all admissible (x, u) ∈ BX ((x0 , u0 ));
5. Numerical Examples
Example 5.1. Weak Riccati Test for SSC. Consider the following var-
iational problem in a time interval [0, T ] with fixed endtime T > 0 but free
endpoint x(T ),
T
min F (x, u) = 0.5 [u(t)2 − x(t)2 ]dt, (52)
0
s.t. ẋ(t) = u(t), x(0) = x0 , t ∈ [0, T ], (53)
−a ≤ x(t) ≤ b, t ∈ [0, T ], 0 < a ≤ b. (54)
−a < x0 < b.
This is the same control problem as in Example 7.1 of Kawasaki and Ze-
idan (Ref. 16), except that we consider a free endpoint. The Hamiltonian
(15) takes the form
ṗ = x + νa − νb .
In view of
u = u + p = 0,
Du H
u = −p.
JOTA: VOL. 123, NO. 3, DECEMBER 2004 611
To determine the unknown entry time t1 , we use the continuity of the con-
trol and obtain the entry conditions
Thus, the following boundary value problem has to be solved on the inte-
rior arc:
The solution is
with
β = x0 , α = x0 tan(t1 ).
Q̇ − Q2 − 1 = 0.
612 JOTA: VOL. 123, NO. 3, DECEMBER 2004
S(t) ≡ Q(t1 )2 + 1,
we find that (49) reduces to Q̇ = 0. The boundary condition for Q(0) and
Q(T ) are both vacuous, since the only admissible variations in (A3) are
y 0 = y 1 = 0. Hence, the solution of (49) needed in Theorem 4.1 takes the
form
tan(t − t1/2 ), t ∈ [0, t1 ],
Q(t) =
tan(t1/2 ), t ∈ [t1 , T ],
and by that theorem the extremal is the solution of the optimal control
problem.
Example 5.2. Optimal Control of the van der Pol Oscillator. The fol-
lowing optimal control model for the van der Pol oscillator has been dis-
cussed in Augustin and Maurer (Ref. 14) using the multiplier µ in the
Lagrangian (2). It is instructive to discuss the same example on the basis
of the multiplier ν in the Hamiltonian (15), which was considered through-
out the paper. Moreover, we shall verify the SSC test in its weak form
as given in Theorem 4.3. The state variables are the voltage x1 (t) and the
electric current x2 (t); the control u(t) represents the voltage at the gener-
ator. The optimal control problem is
5
min F (x, u) = [u2 (t) + x12 (t) + x22 (t)] dt, (58)
0
s.t. ẋ1 = x2 , ẋ2 = −x1 + x2 (1 − x12 ) + u, t ∈ [0, 5], (59)
x1 (0) = 1, x2 (0) = 0, (60)
−x2 (t) − 0.4 ≤ 0, t ∈ [0, 5]. (61)
Since
the state constraint has order one and satisfies the regularity condition (6).
The Hamiltonian (15) takes the form
(x1 , x2 , u, p1 , p2 , ν) = u2 + x 2 + x 2 + p1 x2 + p2 [−x1 + x2 (1 − x 2 ) + u]
H 1 2 1
+ν(−x2 − 0.4). (62)
u = 2u + p2 = 0
Du H ⇒ u = −p2 /2. (64)
u = −p2 /2,
Fig. 1. State variables x1 (t), x2 (t) (top row) and adjoint variables p1 (t), p2 (t) (bottom row).
Due to the continuity of the optimal control and the control law u =
−p2 /2, the junction conditions at the entry and exit point t1 and t2 are
x2 (t1 ) = −0.4, x12 (ti ) − 1 x2 (ti ) + x1 (ti ) + 0.5p2 (ti ) = 0, i = 1, 2.
(68)
F (x, u) = 2.95370134
Thus, we conclude that there exists α > 0 such that the index sets Jα+ (t) =
J0 (t) in (30) coincide for all t ∈ [t1 , t2 ]. This allows us to verify the SSC
JOTA: VOL. 123, NO. 3, DECEMBER 2004 615
The boundary conditions (A3) together with (31) and (37) lead to
the terminal condition Q(5) < 0 (negative definite). Obviously, it suffices to
require the condition
Q(5) = 0,
i.e.,
In view of (48), on the boundary interval [t1 , t2 ] we can choose the matrix
function S in (49) in the form
0 S12 (t)
S= ,
S12 (t) S22 (t)
where S12 and S22 are arbitrary essentially bounded functions. We choose
these functions in such a way that Q̇12 (t) ≡ 0 and Q̇22 (t) ≡ 0 holds in
(t1 , t2 ), i.e.,
Q12 (t) ≡ Q12 (t2 ) and Q22 (t) ≡ Q22 (t2 ) in [t1 , t2 ].
S12 (t) = −Q11 (t) − Q12 (t2 )(1 − x1 (t)2 ) + Q22 (t2 )(1 + 2x1 (t)x2 (t))
+2p2 (t)x1 (t) + 0.5Q12 (t2 )Q22 (t2 ), (70a)
2 2
S22 (t) = −2Q12 (t2 ) − 2Q22 (t2 )(1 − x1 (t) ) − 2 + 0.5Q22 (t2 ) . (70b)
The first equation in (69) then becomes the following linear equation:
Q̇11 (t) − 2Q12 (t2 )(1 + 2x1 (t)x2 (t)) + 2 − 2p2 (t)x2 (t) − 0.5Q12 (t2 )2 = 0.
(71)
616 JOTA: VOL. 123, NO. 3, DECEMBER 2004
Fig. 2. Solutions Q11 , Q12 , Q22 to the Riccati equations (69) and (71).
To calculate S12 (t) and S22 (t), we integrate first the Riccati equation (69)
on [t2 , 5], backward from Q(5) = 0. We find numerically
References
1. Pickenhain, S., and Tammer, K., Sufficient Conditions for Local Optimality
in Multidimensional Control Problems with State Restrictions, Zeitschrift für
Analysis und ihre Anwendungen, Vol. 10, pp. 397–405, 1991.
2. Maurer, H., and Pickenhain, S., Second-Order Sufficient Conditions for Con-
trol Problems with Mixed Control-State Constraints, Journal of Optimization
Theory and Applications, Vol. 86, pp. 649–667, 1995.
3. Hartl, R. F., Sethi, S. P., and Vickson, R. G., A Survey of the Maximum
Principle for Optimal Control Problems with State Constraints, SIAM Review,
Vol. 37, pp. 181–218, 1995.
4. Malanowski, K., On Normality of Lagrange Multipliers for State-Constrained
Optimal Control Problems, Optimization, Vol. 52, pp. 75–91, 2003.
5. Zeidan, V., The Riccati Equation for Optimal Control Problems with Mixed
State-Control Problems: Necessity and Sufficiency, SIAM Journal on Control
and Optimization, Vol. 32, pp. 1297–1321, 1994.
JOTA: VOL. 123, NO. 3, DECEMBER 2004 617
6. Malanowski, K., and Maurer, H., Sensitivity Analysis for Parametric Prob-
lems with Control-State Constraints, Computational Optimization and Appli-
cations, Vol. 5, pp. 253–283, 1996.
7. Hager, W. W., Lipschitz Continuity for Constrained Processes, SIAM Journal
on Control and Optimization, Vol. 17, pp. 321–338, 1979.
8. Neustadt, L. W., Optimization: A Theory of Necessary Conditions, Princeton
University Press, Princeton, New Jersey, 1976.
9. Fiacco, A. V., and McCormick, G. P., Nonlinear Programming: Sequential
Unconstrained Minimization Techniques, John Wiley, New York, NY, 1968.
10. Maurer, H., and Zowe, J., First and Second-Order Necessary and Sufficient
Optimality Conditions for Infinite-Dimensional Programming Problems, Mathe-
matical Programming, Vol. 16, pp. 98–110, 1979.
11. Haynsworth, E. V., Determination of the Inertia of a Partitioned Hermitian
Matrix, Linear Algebra and Applications, Vol. 1, pp. 73–81, 1968.
12. Maurer, H., First and Second-Order Sufficient Optimality Conditions in
Mathematical Programming and Optimal Control, Mathematical Programming
Study, Vol. 14, pp. 163–177, 1981.
13. Malanowski, K., and Maurer, H., Sensitivity Analysis for State–Constrained
Optimal Control Problems, Discrete and Continuous Dynamical Systems,
Vol. 4, pp. 241–272, 1998.
14. Augustin, D., and Maurer, H., Computational Sensitivity Analysis for State-
Constrained Control Problems, Annals of Operations Research, Vol. 101,
pp. 75–99, 2001.
15. Augustin, D., and Maurer, H., Second-Order Sufficient Conditions and Sensi-
tivity Analysis for the Optimal Control of a Container Crane under State Con-
straints, Optimization, Vol. 49, pp. 351–368, 2001.
16. Kawasaki, H., and Zeidan, V., Conjugate Points for Variational Problems
with Equality State Constraints, SIAM Journal on Control and Optimization,
Vol. 39, pp. 433–456, 2000.
17. Oberle, H. J., and Grimm, W., BNDSCO : A Program for the Numerical Solu-
tion of Optimal Control Problems, Report 515-89/22, Institute for Flight Sys-
tems Dynamics, DLR, Oberpfaffenhofen, Germany, 1989.