Beruflich Dokumente
Kultur Dokumente
Robert Stengel! Optimal Control and Estimation, MAE 546 ! Princeton University, 2013 " !! Nonlinear systems with random inputs and perfect measurements" !! Nonlinear systems with random inputs and imperfect measurements" !! Certainty equivalence and separation" !! Stochastic neighboring-optimal control" !! Linear-quadratic-Gaussian (LQG) control"
Copyright 2013 by Robert Stengel. All rights reserved. For educational use only.! http://www.princeton.edu/~stengel/MAE546.html! http://www.princeton.edu/~stengel/OptConEst.html!
! (t ) = f ! x " x (t ), u (t ), w (t ), t # $ z (t ) = x (t )
E! " x ( 0 )# $ = x (0)
T
E ! " x ( 0 ) % x ( 0 )# $! " x ( 0 ) % x ( 0 )# $
}= 0
E! " w (t )# $=0
Assume that random disturbance effects are small and additive "
! (t ) = f ! x " x (t ), u (t ), t # $ + L (t ) w (t )
tf
( ' H [ x(t ), u(t ), ! (t ), t ] + ! 2) E " , # ! (t ) $ % = &E ) 'x * # ! H [ x(t ), u(t ), " (t ), t ] & 3) E $ '=0 !u % (
t1 ) + + $ V * ( t1 ) = E *! " x *( t ) & L x *( ' ), u *( ' ) d ' [ ] . f # % ( t + + f , / t 1 ) + + $ = min E *! " x *( t ) & L x *( ' ), u ( ' ) d ' [ ] . f # % ( u tf + + , /
dV * = ! L [ x * (t1 ), u * (t1 )] dt t = t1
) "V * , dV * "V * 1 # T "2 V * & 2 ! !t + % x ! ! ( !t + ". !V * = !t = E + !t + x x 2 dt " t " x 2 " x $ ' * 2 ) "V * , & T " V * "V * 1# = E+ !t + f (.) + Lw (.)) !t + % ( f (.) + Lw (.)) f . + Lw (.))( !t 2 + ". ( 2 ( ( ) " t " x 2 " x $ ' * -
* "V * "V * T& 1 # "2 V * = E, + f (.) + Lw (.)) + Tr % f . + Lw (.)) ( f (.) + Lw (.)) ( )t / ( 2 ( ( ) "x 2 $ "x ' . + "t
Cancel !t
E! " w (t )# $=0
Uncertain disturbance input can only increase the value function rate of change "
( !2 V * $ + dV * !V * !V * 1 T T = + f (.) + lim Tr ) E f (.) f (.) "t + LE w (.) w (.) LT & "t , ' dt !t !x 2 "t#0 * !x 2 % = & !V * !V * 1 $ !2 V * (t ) + (t ) f (.) + Tr . 2 (t ) L (t ) W (t ) L (t )T / !t !x 2 % !x '
!! Control has no effect on the disturbance input" !! Criterion for optimality is the same as for the deterministic case" !! Disturbance uncertainty increases the magnitude of the total optimal value function, V*(0)"
( )
( )
I [ t o , t1 ] = {z [ t o , t1 ], u [ t o , t1 ]}
!! Plus available model structure, parameters, and statistics"
I [ t o , t1 ] = {z [ t o , t1 ], u [ t o , t1 ], f ( ) , Q, R,!}
( t1 ) , P ( t1 ) , u ( t1 )} I MD ( t1 ) = {x
!! Multiple model derived information set"
!! Parallel estimates of current mean, covariance, and hypothesis probability mass function"
[ t o , t1 ], P [ t o , t1 ], u [ t o , t1 ]} I D [ t o , t1 ] = {x
A ( t1 ) , PA ( t1 ) , u ( t1 ) , Pr ( H A ) # B ( t1 ) , PB ( t1 ) , u ( t1 ) , Pr ( H B ) # I MM ( t1 ) = ! "x $, ! "x $ ,!
I! "t o , t f # $
!! Separate information set into knowable and predictable parts"
(t )# (t )# E ! " x (t ) % x $! " x (t ) % x $ | I D = P (t )
T
I! "t o , t f # $ = I [ t o , t1 ] + I ! "t1 , t f # $
!! !! Knowable information has been received" Predictable information is to come"
(t ) E! " x (t ) | I D # $=x
"... where the conditional expected values are estimates from an optimal lter "
( )
(t ) x (t ) ! x (t ) x (t ) + x (t ) x (t )$ =E " # x (t ) x (t ) ! x % | ID
T T T T
(t )$ (t )$ P (t ) = E " # x (t ) ! x %" # x (t ) ! x % | ID
T
! J CE + J S
}
J CE =
T (t )# ( t ) xT ( t ) # (t ) x T (t ) E ! " x (t ) x $ | ID = E ! "x $ | ID = x
} {
(t ) x (t ) P (t ) = E ! " x (t ) x (t )# $ | ID % x or
T T
(t ) x (t ) E ! " x (t ) x (t )# $ | I D = P (t ) + x
T T
"... where the conditional expected values are obtained from an optimal lter "
JS =
( )
( )
) {
) {
) {
V (t ) =
1 T x (t )S(t )x(t ) + v ( t ) 2
to " Q(t ) M(t ) $ " x(t ) $ 1 ) + + 10 E *xT (t f )S(t f )x(t f ) & ( " xT (t ) uT (t ) $ 0 1 dt # % 0 MT (t ) R(t ) 1 0 u(t ) 1 . 2 + tf % + # %# , /
VCE ( t ) !
1 T x (t )S(t )x(t ) 2
VCE ( t ) !
1 T x (t )S(t )x(t ) 2
!V (t ) = xT (t )S(t ) !x
Hessian with respect to the state "
!V (t ) = S(t ) !x 2
2
V tf =
( )
1 T x (t f )S(t f )x(t f ) 2
! ( !V !t ) =0=" xT M + uT R + xT SG $ # % !u
! !C (t ) x (t )
Zero-mean, white-noise disturbance has no effect on the structure and gains of the LQ feedback control law "
! !C (t ) x (t )
1 T T T Tr S(t f )E ! " x(t )x (t ) # $ + R(t )E ! " u(t )u (t ) # $ dt " x(t f )x (t f ) # $ + ' Q(t )E ! 2 to
tf
( )
( )
where"
J=
!= v
1 Tr SLWLT 2
T U (t ) ! E ! " u(t )u (t ) # $
(t ) u (t ) = !C (t ) x
Optimal control covariance "
= R !1 ( t ) GT ( t ) S ( t ) P ( t ) S ( t ) G ( t ) R !1 ( t ) U ( t ) = C ( t ) P ( t ) CT ( t )
! ( t ) = F ( t ) P ( t ) + P ( t ) FT ( t ) + L ( t ) W ( t ) LT ( t ) , P ( t o ) given P
!! Adjoint covariance response to terminal cost"
! (t ) = ! F (t ) S (t ) ! S (t ) F (t ) ! Q (t ), S t f S
T
( )
! ( t ) = ! FT ( t ) S ( t ) ! S ( t ) F ( t ) ! Q ( t ) ! S ( t ) G ( t ) R !1 ( t ) GT ( t ) S ( t ) S
Independent of P(t)"
given
J no control
!! With optimal control, the equation for the cost is the same"
tf % 1 " = Tr $S(t o )P ( t o ) + ! S ( t ) L ( t ) W ( t ) LT ( t ) dt ' 2 $ ' to # &
J optimal control
!! ... but evolutions of S(t) and S(to) are different in each case"
Supplemental Material
Immune Response with Full-State Stochastic Optimal Feedback Control " (Random Disturbance and Measurement Error Not Simulated) "
Low-Bandwidth Estimator (|W| < |N|) " High-Bandwidth Estimator (|W| > |N|) "
W = I4" N = I2 / 20"
!!
!!
Immune Response to Random Disturbance with Two-Measurement Stochastic Neighboring-Optimal Control "
! Disturbance due to"
! Re-infection" ! Sequestered pockets of pathogen"
! ! !
Noisy measurements" Closed-loop therapy is robust " ... but not robust enough:"
! Organ death occurs in one case"
u( t ) = c[ x ( t ), a, y * ( t )]
! On-line adaptive critic controller"
! Initially, c[x, a, y*] is unknown" ! Design PI-LQ controllers with integral compensation that satisfy requirements at n operating points " ! Scheduling variable, a"
! Nonlinear control law ( action network )" ! Criticizes non-optimal performance via critic network "
! Adapts control gains to improve performance" ! Adapts cost model to improve estimate"
!V [x a (t )] = NNC[x a (t ), a (t )] !x a ( t )
xa(t) a(t)
NNA
Aircraft Model
! Transition Matrices ! State Prediction
xa(t) a(t)
NNA
Aircraft Model
! Transition Matrices ! State Prediction
NNC
NNC Target
Target Generation
Target Generation