Beruflich Dokumente
Kultur Dokumente
Robot
Manipulators
A Unified Regressor-Free Approach
Adaptive Control of
Robot
Manipulators
A Unified
Regressor-Free
Approach
An-Chyau Huang
Ming-Chih Chien
National Taiwan University of Science and Technology, Taiwan
World Scientific
NEW JERSEY
LONDON
SINGAPORE
BEIJING
SHANGHAI
HONG KONG
TA I P E I
CHENNAI
Published by
World Scientific Publishing Co. Pte. Ltd.
5 Toh Tuck Link, Singapore 596224
USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601
UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE
For photocopying of material in this volume, please pay a copying fee through the Copyright
Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to
photocopy is not required from the publisher.
ISBN-13 978-981-4307-41-3
ISBN-10 981-4307-41-6
Printed in Singapore.
To Our Families
Preface
The traditional computed torque design is only adequate for the control
of robot manipulators with precisely known dynamics. In the industrial
environment, however, an accurate robot model is not generally available, and
most robots are limited to be operated under slow motion conditions so that
some system dynamics can be ignored. When performing the tasks with precise
tracking of fast trajectories under time-varying payloads, several considerations
such as the joint flexibility and actuator dynamics are unavoidable. This will
generally lead to some extremely complex robot model which greatly increases
the difficulty in the controller design. What is worse is that estimation of the
system parameters in this complex model becomes more challenging. It is
reasonable to regard some system dynamics as uncertainties to simplify the
modeling tasks. The robust controls and adaptive designs are then utilized to
deal with these uncertainties. However, the former needs the knowledge of the
variation bounds for the uncertainties, while the later requires the linear
parameterization of the uncertainties into a known regressor multiplied by an
unknown constant parameter vector. When the system contains time-varying
uncertainties whose variation bounds are not given (defined later as general
uncertainties), both the robust control and adaptive design are not feasible in
general.
In the conventional adaptive control of robot manipulators, the robot
model is assumed to be linearly parameterizable into the regressor form. But the
derivation of the regressor matrix is tedious in most cases, and computation of
the regressor matrix during each sampling period in the real-time realization is
too time-consuming. This suggests the need for some regressor-free adaptive
designs.
The aim of this book is to address recent developments of the unified
regressor-free adaptive controller designs for robot manipulators with consideration
of joint flexibility and actuator dynamics. The unified approach is still valid for
vii
viii
Preface
the robot control in the compliant motion environment. The main tool used in
this new design is the function approximation technique which represents the
general uncertainties in the robot model as finite combinations of basis functions
weighted with unknown constant coefficients.
The book has been written as a text suitable for postgraduate students in the
advanced course for the control of robot manipulators. In addition, it is also
intended to provide valuable tools for researches and practicing engineers who
currently employ the regressor-based algorithms but would benefit significantly
from the use of the regressor-free strategies.
We would like to thank all of our colleagues and students with whom we
have discussed the basic problems in the control of robot manipulators over the
last few years. Without them, many issues would never have been clarified. This
work was partially supported by the National Science Council of the Republic of
China government and by the National Taiwan University of Science and
Technology. The authors are grateful for their support.
An-Chyau Huang,
Ming-Chih Chien
Mechanical Engineering Department
National Taiwan University of Science and Technology
Contents
Preface ...........................................................................................................
vii
Introduction............................................................................................
Preliminaries ..........................................................................................
2.1 Introduction....................................................................................
2.2 Vector Spaces ................................................................................
2.2.1 Metric space ......................................................................
2.2.2 Normed vector space.........................................................
2.3 Best Approximation Problem in Hilbert Space..............................
2.4 Orthogonal Functions.....................................................................
2.5 Vector and Matrix Analysis ...........................................................
2.5.1 Properties of matrices........................................................
2.5.2 Differential calculus of vectors and matrices ....................
2.6 Various Norms...............................................................................
2.6.1 Vector norms.....................................................................
2.6.2 Matrix norms.....................................................................
2.6.3 Function norms and normed function spaces....................
2.7 Representations for Approximation ...............................................
2.8 Lyapunov Stability Theory ............................................................
2.8.1 Concepts of stability..........................................................
2.8.2 Lyapunov stability theorem...............................................
2.9 Sliding Control...............................................................................
2.10 Model Reference Adaptive Control ...............................................
2.10.1 MRAC of LTI scalar systems ...........................................
2.10.2 MRAC of LTI systems: vector case..................................
2.10.3 Persistent excitation ..........................................................
2.10.4 Robust adaptive control ....................................................
11
11
11
13
14
15
17
24
24
26
28
28
29
30
31
35
35
37
39
45
45
48
50
54
ix
Contents
57
58
71
71
71
73
75
76
77
78
79
59
61
80
129
129
130
134
136
146
147
148
Contents xi
163
163
164
167
168
181
184
186
201
201
202
205
207
220
223
224
Chapter 1
Introduction
Introduction
computation of the regressor matrix. Derivation of the regressor matrix is wellknown to be very tedious for a robot manipulator with more than four joints.
The regressor-free strategy greatly simplified the controller design process.
Because, for the traditional robot adaptive control, the complex regressor matrix
has to be updated in every control cycle in the real-time implementation,
the regressor-free algorithm also effectively simplified the programming
complexity.
When the robot end-effector contacts with the environment, the controllers
designed for performing the free space tracking tasks cannot provide appropriate
control performance. In this book, the renowned impedance control strategy will
be incorporated into the FAT-based design so that some regressor-free adaptive
controllers with consideration of the actuator dynamics for the robot manipulators
can be obtained.
To have a better understanding of the problems we are going to deal with,
Table 1.1 presents the systems considered in this book. The abbreviations listed
will be used throughout this book to simplify the presentation.
Table 1.1 Systems considered in this book
1
2
3
4
5
6
7
8
Systems
Rigid robot in the free space
Rigid robot interacting with the environment
Electrically-driven rigid robot in the free space
Electrically-driven rigid robot interacting with the environment
Flexible-joint robot in the free space
Flexible-joint robot interacting with the environment
Electrically-driven flexible-joint robot in the free space
Electrically-driven flexible-joint robot interacting with environment
Abbreviation
RR
RRE
EDRR
EDRRE
FJR
FJRE
EDFJR
EDFJRE
Introduction 3
but a regressor can be derived such that all uncertain parameters are collected
into an unknown vector. Conventional adaptive strategies can thus be applied to
give update laws to this unknown parameter vector, and closed loop stability can
also be proved easily. However, implementation of this scheme requires the
information of joint accelerations which is impractical in most industrial
applications. What is worse is that the estimation in the inertia matrix might
suffer the singularity problem. A well-known design proposed by Slotine and Li
is then reviewed to get rid of the need for joint acceleration feedback and avoid
the singularity problem. In the above designs, the robot dynamics should be
linearly parameterized into a known regressor matrix multiplied by an unknown
parameter vector. The regressor matrix is known to be tedious in its derivation
for a robot with degree of freedom more than 4. The regressor matrix is not
unique for a given robot, but depending on the selection of the parameter vector.
The entries of this vector should be constants that are combinations of unknown
system parameters. However, these parameters are mostly easier to be found
than the derivation of the regressor matrix. For example, the weight, length,
moment of inertia and gravity center of a link are frequently seen in the
parameter vector and their values are very easy to measure in practice. It is not
reasonable to construct a controller whose design needs to know an extremely
complex regressor matrix but to update an easy-to-obtain parameter vector.
Motivated by this reasoning, the regressor-free adaptive control approach is
developed. The uncertain matrices and vectors in the robot model will be
represented as finite combinations of basis functions. Update laws for the
weighting matrices can be obtained by the Lyapunov-like design. The effect
of the approximation error is investigated with rigorous mathematical
justifications. The output error can thus be proved to be uniformly ultimately
bounded. Finally, the trajectory of the output error is bounded by a weighted
exponential function plus some constant. With proper adjustment of controller
parameters, both the transient performance and the steady state error can be
modified.
Compliant motion control of a rigid robot
Item 2, 4, 6 and 8 in Table 1.1 relate to the compliant motion control of
robot manipulators whose dynamic model include the effect of the external force
vector exerted by the environment. Many control strategies are available for
rigid robots to give closed loop stability in the compliant motion applications.
Among them, the impedance control employed in this book is the most widely
used one which is a unified approach for controlling robot manipulators in both
Introduction
free space tracking and compliant motion phases. The impedance controller
makes the robot system behave like a target impedance in the Cartesian space,
and the target impedance is specified as a mass-spring-damper system. For rigid
robots, we start with the case when the robot system and the environment are
known and the impedance controller is designed. The regressor-based adaptive
impedance controller is then derived for robot systems containing uncertainties.
To avoid derivation of the regressor matrix, the regressor-free adaptive
impedance controller using function approximation techniques is introduced.
For the impedance controller of EDRRE, FJRE and EDFJRE, much more
complex derivations will be involved due to the complexity in the system model.
Unlike the rigid robots, the regressor-based designs of these robots need
additional information such as the derivative of the regressor matrix, the joint
accelerations and derivative of the external force. All of these are not generally
available, and hence regressor-free designs are introduced to get rid of their
necessity. The unified approach in the FAT-based regressor-free adaptive
impedance controller designs for RRE, EDRRE, FJRE and EDFJRE can all give
uniformly ultimately bounded performance to the output error and the transient
performance can also be evaluated by using the bound for the output error
signal.
Consideration of the actuator dynamics
The control problem of rigid robot manipulators has been well developed
under the assumption that all actuator dynamics are neglected. However, it had
been reported that the robot control problem should carefully consider the
actuator dynamics to have good tracking performance, especially in the cases of
high-velocity movement and highly varying loads. Therefore, in item 3, 4, 7 and
8 of Table 1.1, we include considerations of actuator dynamics in the system
equations to investigate their effects in performance improvement. The input
vector to a robot without consideration of the actuator dynamics contains
torques to the joints, while the input vector to electrically-driven robots is with
signals in voltages. This special cascade structure connecting the actuator and
the robot dynamics enables us to employ backstepping-like designs to eliminate
uncertainties entering the system in a mismatched fashion. The regressor-based
adaptive designs are introduced first for items 3, 4, 7 and 8 followed by their
regressor-free counterparts. Implementations of most regressor-based methods
introduced here need the information of the derivative of the regressor matrix
and joint accelerations, but the regressor-free designs do not. Besides, the
uniformly ultimately bounded performance can be proved to be maintained
Introduction 5
Introduction
unknown parameter vector need to be done based on the system model. With
proper definitions of the entries in the parameter vector, the regressor matrix can
then be determined. Since these definitions are not unique, the regressor matrix
for a given robot is not unique either. Some definitions will give relatively
simple forms for the regressor matrix, while some will become very complex.
When the degree of freedom of the robot is more than four, the derivation of the
regressor matrix becomes very tedious. In general, the entries in the parameter
vector are combinations of the quantities such as the link masses, dimensions of
the links, and moments of inertia. These quantities are relatively easier to
measure compared with the derivation of the regressor matrix. However, the
traditional adaptive designs are only capable of updating these easy-to-measure
parameters, but require the complex regressor matrix to be known. In addition,
in every control cycle of the real-time implementation, the calculation of the
regressor matrix is also time consuming which largely limits the computation
hardware selections, especially in the embedded applications. In this book, a
unified approach for the design of regressor-free adaptive controllers for robot
manipulators is introduced which is feasible for robots with considerations of
the actuator dynamics, joint flexibilities as well as the interaction with the
environment. All of these designs will end up with the uniformly ultimately
bounded closed loop performance via the proofs using the Lyapunov-like
techniques.
The FAT-based design
Two main approaches are available for dealing with uncertainties in control
systems. The robust strategies need to know the worst case of the system so that
a fixed controller is able to be constructed to cover the uncertainties. In most
cases, the worst case of the system is evaluated by proper modeling of the
uncertainties either in the time domain or frequency domain. The variation
bounds estimated from the uncertainty model are then used to design the robust
terms in the controller. In some practical cases, however, these variation bounds
are not available, and hence most robust strategies are infeasible. The other
approach for dealing with system uncertainties is the adaptive method. Although
intuitively we think that an adaptive controller should be able to give good
performance to a system with time-varying uncertainties, conventional adaptive
designs can actually be useful to systems with constant uncertainties. Therefore,
to be feasible for the adaptive designs all time-varying parts in the system
dynamics should be collected into a known regressor matrix, while the unknown
constant parameters are put into a parameter vector. This process is called the
Introduction 7
Introduction
orthonormal functions are also listed with their effective ranges for the
convenience in the selection of basis functions for the FAT-based designs. Then
the Lyapunov stability theory and the Lyapunov-like methods are reviewed in
detail followed by the introduction of the control theories such as the sliding
control and model reference adaptive control. After these conventional robust
and adaptive designs, the concept of general uncertainties is presented.
Limitations in the sliding controller designs when the variation bounds for the
uncertainties are not available are investigated. Likewise, the problem for the
model reference adaptive control when the system contains time-varying
parameters is illustrated. Finally, the FAT-based adaptive controller is designed
for these systems with general uncertainties in detail.
Chapter 3 collects all dynamic equations for systems listed in Table 1.1.
These equations will be used in later chapters for controller designs. Examples
for these systems will also be presented, and they will also be used in the
simulation studies later. Adaptive control strategies for the rigid robots are
introduced in Chapter 4. Traditional regressor-based adaptive rules will be
derived first followed by some investigation into the detail of the regressor
matrix and the parameter vector. This justifies the necessity for the regressorfree adaptive designs. A FAT-based regressor-free adaptive controller is then
derived for the rigid robot with consideration of the approximation errors. The
rigorous proof for the closed loop stability is presented to give uniformly
ultimately bounded performance. Next, the actuator dynamics is included into
the system model and adaptive controllers are derived using regressor-based
designs and regressor-free designs. Significant amount of simulation results are
provided to justify the efficacy of the controllers when actuator dynamics are
considered.
Chapter 5 considers the compliant motion control of rigid robot
manipulators. The impedance controller is employed to enable the robot to
interact with the environment compliantly while maintaining good performance
in the free space tracking. The traditional impedance controller is reviewed first
for the system with known dynamics. A regressor-based and a regressor-free
adaptive controller are then derived. Finally, the actuator dynamics is considered
to improve the control performance.
Chapter 6 includes joint flexibility into consideration such that the order of
the system model is doubled compared with its rigid joint counterpart. Control
of a known FJR is firstly reviewed. The regressor-based adaptive controller is
then introduced followed by the derivation of the regressor-free controller. The
actuator dynamics will be considered in the last section of this chapter. A 5th
Introduction 9
order differential equation should be used to describe a single link in this case
which makes the controller design problem become extremely challenging.
The last chapter deals with the problem of the adaptive impedance control
for FJR. We review the control of a known robot first to have some basic
understanding of this problem. The regressor-based adaptive controller is then
designed, but it requires some impractical knowledge in the real-time
implementation. The regressor-free adaptive controller is derived without any
requirements on additional information. Consideration of the actuator dynamics
further complicated the problem, and the regressor-free adaptive design is still
able to give good performance to the closed loop system.
Chapter 2
Preliminaries
2.1 Introduction
Some mathematical backgrounds are reviewed in this chapter. They can be
found in most elementary mathematics books; therefore, most results are
provided without proof. On the other hand, some preliminaries in control
theories will also be presented in this chapter as the background for the
theoretical development introduced in the later chapters. In Section 2.2, some
notions of vector spaces are introduced. Some best approximation problems in
the Hilbert space will be mentioned in Section 2.3. Various orthogonal functions
are collected in Section 2.4 to facilitate the selection of basis functions in FAT
applications. The vector and matrix analysis is reviewed in Section 2.5 which
includes their differential calculus operations. Norms for functions, vectors and
matrices are summarized in Section 2.6, and some normed spaces are also
introduced. Section 2.7 illustrates the approximation representations of
functions, vectors and matrices. The Lyapunov stability theory is reviewed in
Section 2.8. The concept of sliding control is provided in Section 2.9 as an
introduction to the robust design for a system containing uncertainties defined in
compact sets. Section 2.10 gives the basics of the model reference adaptive
control to linear time-invariant systems. To robustify the adaptive control loop,
some modifications of the adaptive designs are also presented in Section 2.10.
The concept of general uncertainties is clarified in Section 2.11. The limitations
for traditional MRAC and robust designs will also be discussed. Finally, the
FAT-based adaptive controller is introduced in Section 2.12 to cover the general
uncertainties.
12
Chapter 2 Preliminaries
x + y X , x, y X
x + y = y + x , x, y X
( x + y ) + z = x + ( y + z ) , x, y , z X
There is a unique vector 0 X such that x + 0 = x , x X
x X , x X ,
( x) = ( )x , x X , ,
1x = x , x X
( + )x = x + x , x X , ,
(x + y ) = x + y , x, y X ,
For example, the set of all n-tuples of real numbers is a real vector space and is
known as n . The set of all real-valued functions defined over an interval
[a, b] is also a vector space. The set of all functions maps the interval
[a, b] into n can also be proved to be a vector space. In some literature,
the real vector space is also known as the real linear space.
n
If x1 ,..., x k and c1 ,..., c k , a vector of the form c1x1 + ... + ck x k
is said to be a linear combination of the vectors x1 ,..., x k . The set of vectors
x1 ,..., x k n is said to be independent if the relation c1x1 + ... + ck x k = 0
implies ci = 0, i = 1,..., k ; otherwise, the set of vectors is dependent. If S n
and if R is the set of all linear combinations of elements of S, then S spans R, or
we may say that R is the span of S. An independent subset of a vector space
X n which spans X is called a basis of X. A vector space is n-dimensional
if it contains an independent set of n vectors, but every set of n+1 vectors is a
dependent set.
d ( p, q) > 0 if p q ;
d ( p, p ) = 0 ;
d ( p, q ) = d ( q, p ) ;
d ( p, q ) d ( p, r ) + d (r , q ) for any r X .
The distance function on a metric space can be thought of as the length of a
vector; therefore, many useful concepts can be defined. A set E X is said
to be open if for every x E , there is a ball B ( x, r ) = { y X d ( y , x ) < r}
such that B ( x, r ) E for some positive r. A set is closed if and only if its
complement in X is open. A set E is bounded if r > 0 such that
d ( x, y ) < r x, y E . Let S T be two subsets of X. S is said to be dense in
T if for each element t in T and each > 0 , there exists an element s in S such
that d ( s, t ) < . Thus every element of T can be approximated to arbitrary
precision by elements of S.
Let X and Y be metric spaces with distance functions d X and d Y ,
respectively. A function f : X Y is said to be continuous at a point x X if
f ( x + x) f ( x) whenever x 0 . Or, we may say, f is continuous at x if
given > 0 , there exist > 0 such that d X ( x, y ) < d Y ( f ( x ), f ( y )) < .
A function f is continuous on E X if it is continuous at every points of E,
and it is uniformly continuous on E if given > 0 , there exist ( ) > 0 such
that d X ( x, y ) < d Y ( f ( x ), f ( y )) < for all x, y E .
The distance function generates the notion of convergence: A sequence
{xi} in a metric space X is said to be convergent to an element x if for each
> 0 there exists an integer n such that d ( x, xi ) whenever i > n . A set
E X is compact if each sequence of points in E contains a subsequence
which converges to a point in E. In particular, a compact subset of n is
necessarily closed and bounded. The sequence {xi} is a Cauchy sequence if for
each > 0 there exists an integer n such that p, q > n d ( x p , x q ) .
Clearly, every convergent sequence is a Cauchy sequence, but the converse is
14
Chapter 2 Preliminaries
not true in general. A complete metric space X is a space where every Cauchy
sequence converges to a point in X.
x > 0, x X , x 0
0 =0
is said to converge to x X if
i =1
the sequence of partial sums converges to x, i.e., if > 0, n > 0 such that
m
i =1
x, y = y , x
x, y = x, y
x + y , z = x, z + y , z , z X
x, x > 0 x 0
A real vector space with an inner product defined is called a real inner product
space. An inner product space is a normed vector space and hence a metric
space with
d ( x, y ) = x y =
x y, x y
For any two vectors x and y in an inner product space, we have the Schwarz
inequality in the form x, y x y . The equality holds if and only if x and
y are dependent. Two vectors x, y are orthogonal if x, y = 0 . Let {xi} be a
set of elements in an inner product space X. {xi} is an orthogonal set if
x i , x j = 0, i j . If in addition every vector in the set has unit norm, the set
is orthonormal.
A Hilbert space is a complete inner product space with the norm induced
by its inner product. For example, n is a Hilbert space with inner product
n
x, y =
x y
i i
i =1
x(t ) y (t )dt .
D
16
Chapter 2 Preliminaries
n
that
c x
i i
i =1
n
2
= x +
ci e i
i =1
x, e i c i
i =1
x, e i
(1)
i =1
x, e
i =1
2
x, e i e i
n
2
= x
i =1
x, e i
(2)
i =1
x, e
i =1
the previous one except that one extra term x, e i +1 e i +1 is added. This implies
that previously calculated Fourier coefficients do not need to be recalculated. It
can also be observed that the right hand side of (2) gets smaller when the
orthonormal set is taken larger. Hence, the best approximation to x improves as
we use more terms in the orthonormal set. As the number of terms used goes to
x, e
i =1
ei .
x, e i
(3)
i =1
which is known as the Parsevals identity. Convergence of the Fourier series can
be proved by using the Riesz-Fischer theorem with the fact that the partial sum
of the series
x, e i
i =1
=
( x ) ( x ) dx
i
j
0 i j
0 i= j
(1)
b
a
i2 ( x)dx = 1 for
= 0 i j
p( x) i ( x) j ( x)dx
0 i = j
(2)
18
Chapter 2 Preliminaries
f ( x) = c11 ( x) + c 2 2 ( x) + + c n n ( x) +
(3)
b
a
f ( x) n ( x)dx = cn
b
a
n2 ( x)dx
(4)
cn =
b
a
f ( x) n ( x) dx
b
a
(5)
2
n ( x ) dx
b
a
1. Taylor polynomials
In the calculus courses, it is well known that given a function f ( x) and a
point c in the domain of f, suppose the function is n-times differentiable at c,
then we can construct a polynomial
Pn ( x) = f (c) + f '(c)( x c) +
+ +
f (c)
( x c) 2
2!
f ( n ) (c )
( x c) n
n!
(6)
Tn +1 ( x) = 2 xTn ( x) Tn 1 ( x)
for all n = 1, 2,... For convenience, we list the first 7 polynomials below
T0 ( x) = 1
T1 ( x) = x
T2 ( x) = 2 x 2 1
T3 ( x) = 4 x 3 3 x
T4 ( x) = 8 x 4 8 x 2 + 1
T5 ( x) = 16 x 5 20 x 3 + 5 x
(7)
20
Chapter 2 Preliminaries
T6 ( x) = 32 x 6 48 x 4 + 18 x 2 1
T7 ( x) = 64 x 7 112 x 5 + 56 x 3 7 x
3. Legendre polynomials
The set of Legendre polynomials is orthogonal with respect to the weight
function p( x) = 1 on the interval [-1,1]. The first two polynomials are
L0 ( x) = 1 and L1 ( x) = x , and the remaining polynomials can be determined by
the recurrence relation
(8)
for all n = 1, 2,... Here, we list the first 7 polynomials for convenience
L0 ( x) = 1
L1 ( x) = x
L2 ( x ) =
L3 ( x) =
1
(3 x 2 1)
2
1
(5 x 3 3 x)
2
1
L4 ( x) = (35 x 4 30 x 2 + 3)
8
1
L5 ( x) = (63 x 5 70 x 3 + 15 x)
8
L6 ( x) =
L7 ( x) =
1
(231x 6 315 x 4 + 105 x 2 5)
16
1
(429 x 7 693 x 5 + 315 x 3 35 x)
16
4. Hermite polynomials
The set of Hermite
polynomials is orthogonal with respect to the weight
2
function p( x) = e x on the interval ( , ) . The first two polynomials are
H n +1 ( x) = 2 xH n ( x) 2nH n 1 ( x)
(9)
H 0 ( x) = 1
H 1 ( x) = 2 x
H 2 ( x) = 4 x 2 2
H 3 ( x) = 8 x 3 12 x
H 4 ( x) = 16 x 4 48 x 2 + 12
H 5 ( x) = 32 x 5 160 x 3 + 120 x
H 6 ( x) = 64 x 6 480 x 4 + 720 x 2 120
H 7 ( x) = 128 x 7 1344 x 5 + 3360 x 3 1680 x
5. Laguerre polynomials
The set of Laguerre polynomials is orthogonal with respect to the weight
function p( x) = e x on the interval [0, ) . The first two polynomials are
L0 ( x) = 1 and L1 ( x) = x + 1 , and the remaining polynomials can be determined
by the recurrence relation
Ln +1 ( x) = (2n + 1 x) Ln ( x) n 2 Ln 1 ( x)
for all n = 1, 2,... The following are the first 7 polynomials
L0 ( x) = 1
L1 ( x) = x + 1
L2 ( x ) = x 2 4 x + 2
(10)
22
Chapter 2 Preliminaries
L3 ( x) = x 3 + 9 x 2 18 x + 6
L4 ( x) = x 4 16 x 3 + 72 x 2 96 x + 24
L5 ( x) = x 5 + 25 x 4 200 x 3 + 600 x 2 600 x + 120
L6 ( x) = x 6 36 x 5 + 450 x 4 2400 x 3 + 5400 x 2 4320 x + 720
L7 ( x) = x 7 + 49 x 6 882 x 5 + 7350 x 4 29400 x 3
+52920 x 2 35280 x + 5040
6. Bessel polynomials
The set of Bessel polynomials is orthogonal with respect to the weight
function p( x) = x on the interval [0,b] in the form
xJ n (k i x) J n ( k j x)dx = 0
(11)
J n ( x) = x n
(1) m x 2 m
2 2 m + n m !(n + m)!
m=0
(12)
J 0 ( x) = 1
J 1 ( x) =
x2
x4
x6
+
+
22 22 42 22 42 62
x
x3
x5
x7
2 + 2 2 2 2 2 +
2 2 4 2 4 6 2 4 6 8
These two series converge very rapidly, so that they are useful in computations.
The recurrence relation below can also be used to find other Bessel polynomials
based on J 0 ( x) and J 1 ( x) given above.
J n +1 ( x) = J n 1 ( x) +
2n
J n ( x)
x
(13)
f ( x) =
c J
i
n (k i x)
(14)
i =1
f ( x) =
a0
+
2
a
n =1
cos
n x
n x
+ bn sin
T
T
(15)
if in any one period it has at most a finite number of local extreme values and a
finite number of discontinuities. (15) is called the Fourier series of function
f ( x) . The constants a0 , a n and bn , n =1,2,3, are called Fourier coefficients,
and the value 2T is the period of f ( x) . It can be proved that the Fourier series
converges to f ( x) at all points where f ( x) is continuous and converges to the
average of the right- and left-hand limits of f ( x) at each point where it is
discontinuous.
Table 2.1 summarizes the orthonormal functions introduced in this section.
When using these functions in approximation applications, it is very important
that the valid range for the functions to be orthonormal is ensured.
24
Chapter 2 Preliminaries
Table 2.1 Some useful orthonormal functions
Polynomial
Taylor
Valid
Interval
[a,b]
Forms
Pn ( x) = f (c) + f '(c)( x c) +
f (c )
( x c) 2
2!
f ( n ) (c)
( x c) n
n!
+ +
Chebyshev
[-1,1]
T0 ( x) = 1
T1 ( x) = x
Tn +1 ( x) = 2 xTn ( x) Tn 1 ( x)
Legendre
[-1,1]
L0 ( x) = 1
L1 ( x) = x
(n + 1) Ln +1 ( x) = (2n + 1) xLn ( x) nLn 1 ( x)
Hermite
(, )
H 0 ( x) = 1
H 1 ( x) = 2 x
H n +1 ( x) = 2 xH n ( x) 2nH n 1 ( x)
Laguerre
[0, )
L0 ( x) = 1
L1 ( x) = x + 1
Ln +1 ( x) = (2n + 1 x) Ln ( x) n 2 Ln 1 ( x)
Bessel
[0,b]
Fourier
series
One
period
x2
x4
x6
+ 2 2 2 2 2 +
2
2
2 4
2 4 6
x
x3
x5
J 1 ( x) = 2 + 2 2
2 2 4 2 4 6
2n
J n +1 ( x) = J n 1 ( x) +
J n ( x)
x
J 0 ( x) = 1
f ( x) =
a0
+
2
a
n =1
cos
n x
n x
+ bn sin
T
T
( A T )T = A
(1a)
( A + B) T = A T + B T B n m
(1b)
( AB) T = B T A T B m n
(1c)
( A 1 ) 1 = A
(2a)
( A T ) 1 = ( A 1 )T
(2b)
( A) 1 =
A 1 , 0
(2c)
(2d)
Tr ( A ) =
a
i =1
ii
(3)
26
Chapter 2 Preliminaries
where aii is the ith diagonal element of A. The trace operation has the following
properties:
Tr ( A ) = Tr ( A T )
(4a)
Tr ( A) = Tr ( A)
(4b)
Tr ( A + B) = Tr ( A ) + Tr (B) B n n
(4c)
Tr ( AB) = Tr (BA) = Tr ( A T B T ) A n m , B m n
(4d)
x T y = Tr (xy T ) = Tr (yx T ) = y T x x, y n
(4e)
f
f
f ( x) =
x
xn
x1
(5)
2 f
2 f
2
x1 xn
x1
2
f ( x) =
x 2
2
2
f f
xn x1
xn2
(6)
da1m ( x)
da11 ( x)
dx
dx
d
A( x) =
dx
da n1 ( x)
da nm ( x)
dx
dx
(7)
f
f
a
a1m
11
f ( A) =
A
f
f
a n1
a nm
(8)
The following basic properties of matrix calculus are useful in this book.
d
dA dB
( A + B) =
+
A, B n m
dx
dx dx
(9a)
d
dA
dB
( AB) =
B+A
A n m , B m n
dx
dx
dx
(9b)
d 1
dA 1
A = A 1
A
A n n
dx
dx
(9c)
T
(x y ) = y x, y n
x
(9d)
T
(x y ) = x x, y n
y
(9e)
( Ax) = A T
x
T
(x Ax) = xx T
A
A n m , x m
A n n , x n
T T
(x A Ax) = 2 Axx T
A
A n n , x n
(9f)
(9g)
(9h)
T
(x Ax) = Ax + A T x = 2 Ax A n n , x n
x
(9i)
T
(y Ax) = Ax A n m , x m , y n
y
(9j)
28
Chapter 2 Preliminaries
T
(y Ax) = A T y A n m , x m , y n
x
Tr ( AB) =
Tr ( A T B T ) =
Tr (B T A T )
A
A
A
=
Tr (BA ) = B T A n m , B m n
A
Tr (BAC) =
Tr (B T A T CT ) =
Tr (CT A T B T )
A
A
A
Tr ( ACB) =
Tr (CBA )
=
A
A
Tr ( A T B T CT ) = B T CT A , B ,C n n
=
A
(9k)
(9l)
(9m)
p
n
p
=
xi
i =1
(1)
l1 norm on n :
x1=
(2a)
i =1
n
l 2 norm on n :
l norm on n :
(2b)
i =1
= max xi
1 i n
(2c)
p1
inequality
min ( A) x x T Ax max ( A) x .
2
(3)
= sup
x0
Ax
x
= sup Ax
x
=1
(4)
A 1 = max
1 j n
ij
(5a)
i =1
= max ( A T A)
(5b)
= max
1 i n
ij
(5c)
i =1
AB A B
(6a)
A+B A + B
(6b)
AB A B
(6c)
30
Chapter 2 Preliminaries
p
= 0 f (t ) dt p for p [1, )
(7a)
= sup f (t ) for p =
(7b)
t [0, )
L1 = f (t ) : + f
L2 = f (t ) : + f
L = f (t ) : + f
= 0 f (t ) dt <
(8a)
2
f (t ) dt <
= sup f (t ) <
t [0, )
(8b)
(8c)
L1n
Ln2
= f (t ) : + n f
= f (t ) : + n f (t )
=
1
0 i =1
0 i =1
Ln = f (t ) : + n f (t )
f i (t ) dt <
f i (t ) dt <
= max f i (t ) <
1 i n
(9a)
(9b)
(9c)
t2
t1
0, i j
z i (t ) z j (t )dt =
1, i = j
(1)
t2
t1
f (t ) =
w z (t )
(2)
i i
i =1
where wi =< f , z i > is the Fourier coefficient, and the series converges in the
sense of mean square as
t
2
lim
n
t1
f (t )
w z (t )
i i
dt = 0 .
(3)
i =1
This implies that any function f(t) in the current Hilbert space can be
approximated to arbitrarily prescribed accuracy by finite linear combinations of
the orthonormal basis {zi(t)} as
k
f (t )
w z (t ) .
i i
(4a)
i =1
f (t ) w T z (t )
(4b)
32
Chapter 2 Preliminaries
f (t ) = w T z (t )
(4c)
provided a sufficient number of the basis functions are used. In this book,
equation (4) is used to represent time-varying parameters in the system dynamic
equation. The time-varying vector z(t) is known while w is an unknown
constant vector. With this approximation, the estimation of the unknown timevarying function f(t) is reduced to the estimation of a vector of unknown
constants w.
In the following, three representations are introduced for approximating a
matrix M (t ) p q . By letting q=1, the same technique can be used to
approximate vectors.
Representation 1: We may use the technique in (4) to represent individual
matrix elements. Let w ij , z ij k for all i, j, then matrix M is represented
to be
m11 m12
m
m22
21
M=
m p1 m p 2
T
m1q w 11z 11
m2 q w T21z 21
=
m pq w Tp1z p1
T
w 12
z 12
w T22 z 22
w Tp 2 z p 2
w 1Tq z1q
w T2 q z 2 q
(5)
w Tpq z pq
T
T
w p1z p1 w p 2 z p 2
w 1Tq z1q
w T2 q z 2 q
w Tpq z pq
T
T
w 11
w 12
T
T
w 21 w 22
=
T
w p1 w Tp 2
w 1Tq z 11
w T2 q z 21
w Tpq z p1
z 12
z 22
z p2
z 1q
z 2q
z pq
M = WT Z
(6)
where W is a matrix containing all wij and Z is a matrix of all zij. Since this is
not a conventional operation of matrices, dimensions of all involved matrices do
T
not follow the rule for matrix multiplication. Here, W is a p kq matrix and
Z is a kp q matrix, but the dimension of M after the operation is still p q .
This notation can be used to facilitate the derivation of update laws.
Representation 2: Let us assume that all matrix elements are approximated
using the same number, say , of orthonormal functions, and then the matrix
M (t ) p q can be represented in the conventional form for matrix
multiplications
M = WT Z
where M , Z
T
w 11
0
T
0
w
21
WT =
0
0
T
z11
0
ZT =
pq p
w Tp1
|
|
|
z T21 z Tp1 |
0
(7)
0
w T22
| |
| |
w T2 q
| |
w Tpq
| | w 1Tq
w Tp 2
| |
T
z 12
z T22
z Tp 2
| |
| |
| | z 1Tq
z T2 q
z Tpq
The matrix elements wij and zij are 1 vectors. It can be easily check that
T
w 11
z11
T
w 21z 21
M = WT Z =
w Tp1z p1
T
w 12
z12
w T22 z 22
w Tp 2 z p 2
w 1Tq z 1q
w T2 q z 2 q
w Tpq z pq
(8)
In this representation, we use the usual matrix operation to represent M, but the
sizes of W and Z are apparently much larger than those in the representation 1.
34
Chapter 2 Preliminaries
f ( x) = [ f1 ( x )
f m (x)]T
f 2 ( x)
(9)
f i (x) = w Tf i z f i
(10)
f ( x) = [w Tf1 z f1
w Tf 2 z f 2
w Tf m z f m ]T
21 21
22 22
2 p2 2 p2
wm1 z m1 + wm 2 z m 2 + + wmpm z mp m
(11)
Define p max = max pi and let wij = 0 for all i=1,...,m and j > pi , then
i =1,, m
w1 pmax
z11
0
z
21 + +
wm1 z m1
0
0
w21
0
0
0
0
w2 pmax
z1 pmax
z
2 pmax (12)
wmpmax z mpmax
0
0
Define
w1i
0
Wi =
0
0
, z i = [ z1i
wmi
0
w2i
z 2i z mi ]T ,
(13)
35
f ( x) =
Wz
(14)
i i
i =1
p1max
M=
W1i z 1i
i =1
p q max
i =1
Wqi z qi
(15)
x = f (x, t )
(1)
36
Chapter 2 Preliminaries
x = f (x)
(2)
37
(0) = 0
( r ) > 0 r > 0
(r1 ) (r2 ) r1 > r2 .
A continuous function V ( x, t ) : n + is locally positive definite if
there exists a class K function () such that V ( x, t ) ( x ) for all t 0 in
the neighborhood N of the origin of n . It is positive definite if N = n . A
continuous function V ( x, t ) : n + is locally decrescent if there exists
a class K function () such that V ( x, t ) ( x ) for all t 0 in the
neighborhood N of the origin of n . It is decrescent if N = n . A continuous
function V ( x, t ) : n + is radially unbounded if V ( x, t )
uniformly in time as x . If function V ( x, t ) is locally positive definite
and has continuous partial derivatives, and if its time derivative along the
trajectory of (1) is negative semi-definite then it is called a Lyapunov function
for system (1).
Lyapunov stability theorem for autonomous systems
Given the autonomous system (2) with an equilibrium point at the origin,
and let N be a neighborhood of the origin, then the origin is (i) stable if there
exists a scalar function V ( x) > 0 x N such that V ( x) 0 ; (ii) asymptotic
stable if V ( x) > 0 and V ( x) < 0 ; (iii) globally asymptotically stable if
V (x) > 0 , V (x) < 0 and V (x) is radially unbounded.
LaSalles theorem (Invariant set theorem)
Given the autonomous system (2) suppose V ( x) > 0 and V ( x) 0 along
the system trajectory. Then (2) is asymptotically stable if V does not vanish
identically along any trajectory of (2) other than the trivial solution x = 0 . The
result is global if the properties hold for the entire state space and V ( x) is
radially unbounded.
Lyapunov stability theorem for non-autonomous systems
Given the non-autonomous system (1) with an equilibrium point at the
origin, and let N be a neighborhood of the origin, then the origin is (i) stable if
38
Chapter 2 Preliminaries
x N , there exists a scalar function V (x, t ) such that V (x, t ) > 0 and
V (x, t ) 0 ; (ii) uniformly stable if V (x, t ) > 0 and decrescent and V (x, t ) 0 ;
(iii) asymptotically stable if V (x, t ) > 0 and V (x, t ) < 0 ; (iv) globally
n
asymptotically stable if x , there exists a scalar function V (x, t ) such
that V (x, t ) > 0 and V (x, t ) < 0 and V (x, t ) is radially unbounded; (v)
uniformly asymptotically stable if x N , there exists a scalar function
V (x, t ) such that V (x, t ) > 0 and decrescent and V (x, t ) < 0 ; (vi) globally
n
uniformly asymptotically stable if x , there exists a scalar function
V (x, t ) such that V (x, t ) > 0 and decrescent and is radially unbounded and
V (x, t ) < 0 ; (vii) exponentially stable if there exits , , > 0 such that
2
2
2
x N , x V ( x, t ) x
and V (x, t ) x ; (viii) globally
exponentially stable if it is exponentially stable and V (x, t ) is radially
unbounded.
Barbalats lemma
La Salles theorem is very useful in the stability analysis of autonomous
systems when asymptotic stability is desired but only with negative semidefinite result for the time derivative of the Lyapunov function. Unfortunately,
La Salles theorem does not apply to non-autonomous systems. Therefore, to
conclude asymptotic stability of a non-autonomous system with V 0 , we need
to find a new approach. A simple and powerful tool called Barbalats lemma can
be used to partially remedy this situation. Let f (t ) be a differentiable function,
then Barbalats lemma states that if lim f (t ) = k < and f (t ) is uniformly
t
continuous, then lim f (t ) = 0 . It can be proved that a differentiable function is
t
uniformly continuous if its derivative is bounded. Hence, the lemma can be
rewritten as: if lim f (t ) = k < and
f (t ) exists and is bounded, then
t
f 0 as t . In the Lyapunov stability analysis, Barbalats lemma can be
applied in the fashion similar to La Salles theorem: If V (x, t ) is lower
bounded, V 0 , and V is bounded, then V 0 as t . It should be noted
that the Lyapunov function is only required to be lower bounded in stead of
positive definite. In addition, we can only conclude convergence of V , not the
states. In this book, we would like to use the other form of Barbalats lemma to
prove closed loop stability. If we can prove that a time function e is bounded
and square integrable, and its time derivative is also bounded, then e is going to
converge to zero asymptotically. It can be restated as: if e L L2 and
e L , then e 0 as t .
x ( n ) = f (x, t ) + g (x, t )u + d (t )
(1)
f = f m + f
(2a)
d = d m + d
(2b)
f (x, t )
(3a)
d (x, t )
(3b)
40
Chapter 2 Preliminaries
u=
1
[ f ( x, t ) d (t ) + v ]
g ( x, t )
(4)
is not realizable. We would like to design a tracking controller so that the output
d
+ ) n 1 e
dt
(5)
where > 0 determines the behavior of the error dynamics. Selection of the
sliding surface is not unique, but the one in (5) is preferable simply because it is
linear and it will result in a relative degree one dynamics, i.e. u appears when we
differentiate s once. One way to achieve output error convergence is to find a
control u such that the state trajectory converges to the sliding surface. Once on
the surface, the system behaves like a stable linear system (
d
+ ) n 1 e = 0 ;
dt
ss < 0
(6)
Using (5), the sliding surface for system (1) is of the form
d
+ ) n 1 e
dt
= c1e + c 2 e + + cn 1e ( n 2) + c n e ( n 1)
s=(
(7)
(n 1)! n k
, k=1,...,n-1, and cn =1. Let us differentiate s with
(n k )!(k 1)!
respect to time once to make u appear
where ck =
s = c1e + + c n 1e ( n 1) + cn e ( n )
= c1e + + c n 1e ( n 1) + x ( n ) x d( n )
= c1e + + c n 1e ( n 1) + f + gu + d x d( n )
(8)
u=
1
[c1e cn 1e ( n 1) f m d m + xd( n ) 1 sgn( s )]
g
(9)
s = f + d 1 sgn( s )
(10)
To satisfy the sliding condition (6), let us multiply both sides of (10) with s as
ss = (f + d ) s 1 s
( + ) s 1 s
(11)
ss s
(12)
Therefore, with the controller (9), the sliding surface (7) is attractive, and the
tracking error converges asymptotically regardless of the uncertainties in f and
d, once the sliding surface is reached. Now, let us consider the case when
g (x, t ) is not available, but we do know that it is non-singular for all admissible
state and time t, and its variation bound is known with 0 < g min g g max . In
stead of the additive uncertainty model we used for f and d, a multiplicative
model is chosen for g as
g = g m g
(13)
42
Chapter 2 Preliminaries
0 min
g min
g
g max max .
gm
gm
(14)
u=
1
[ c1e c n 1e ( n 1) f m d m + x d( n ) 1 sgn( s)]
gm
(15)
s = (1 g )[c1e + + c n 1e ( n 1) + f m + d m x d( n ) ]
+f + d g1 sgn( s )
(16)
ss = (1 g )[c1e + + cn 1e ( n 1) + f m + d m xd( n ) ]s
+ ( f + d ) s g1 s
(1 min ) c1e + + c n 1e ( n 1) + f m + d m x d( n ) s
+ ( + ) s min1 s
(17)
1 =
(18)
where is a positive number. Therefore, we can also have the result in (12).
Smoothed sliding control law
Both controllers (9) and (15) contain the switching function sgn(s). In
practical implementation, the switching induced from this function will
sometimes result in control chattering. Consequently, the tracking performance
degrades, and the high-frequency unmodeled dynamics may be excited. In some
cases, the switching controller has to be modified with a continuous
if
sat( ) =
sgn( ) if >
(19)
where > 0 is called the boundary layer of the sliding surface. When s is
outside the boundary layer, i.e., s > , the sliding controller with sgn(s) is
exactly the same as the one with sat( ) . Hence, the boundary layer is also
s + 1
= f + d
(20)
This implies that the signal s is the output of a stable first-order filter whose
input is the bounded model error f + d . Thus, the chattering behavior can
indeed be eliminated with proper selection of the filer bandwidth and as long as
the high-frequency unmodeled dynamics is not excited. One drawback of this
smoothed sliding controller is the degradation of the tracking accuracy. At best
we can say that once the signal s converges to the boundary layer, the output
tracking error is bounded by the value .
Instead of the saturation function, we may also use
to have a smoothed
u=
1
s
[c1e cn 1e ( n 1) f m d m + xd( n ) 1 ]
g
(21)
s = f + d 1
(22)
44
Chapter 2 Preliminaries
ss = (f + d ) s 1
s2
( + ) s ( + + )
s2
s
s2
= ( + ) s 1
(23)
Since s > , i.e., when outside the boundary layer, we may have the result
ss
s2
u=
as
1
s
[u + xd( n ) 1 ]
gm
(24)
s2
ss [(1 min ) u + + ] s 1
If s is outside the boundary layer, (25) implies ss
s2
(25)
will eventually converge to the boundary layer even though the system contains
uncertainties. Since inside the boundary layer there is also an equivalent first
order filter dynamics, the chattering activity can be effectively eliminated. The
output tracking error, however, can only be concluded to be uniformly bounded.
There is one more drawback for the smoothing using
x p = a p x p + b p u
(1)
where x p is the state of the plant and u the control input. The
parameters ap and bp are unknown constants, but sgn(bp) is available. The pair
(ap, bp) is controllable. The problem is to design a control u and an update law
so that all signals in the closed loop plant are bounded and the system output xp
tracks the output xm of the reference model
x m = a m xm + bm r
(2)
46
Chapter 2 Preliminaries
u = ax p + br
(3)
am a p
bm
and b =
are perfect gains for transforming dynamics in
bp
bp
(1) into (2). Since the values of ap and bp are not given, we may not select these
where a =
perfect gains to complete the MRC design in (3) and the model reference
adaptive control (MRAC) rule is constructed instead
u = a(t ) x p + b(t )r
(4)
where a and b are estimates of a and b, respectively, and proper update laws
are to be selected to give a a and b b . Define the output tracking error as
e = x p xm
(5)
e = a m e + b p (a a ) x p + b p (b b) r
(6)
p + b p br
e = a m e + b p ax
(7)
This is the dynamics of the output error e, which is a stable linear system driven
by the parameter errors. Therefore, if update laws are found to have convergence
of these parameter errors, convergence of the output error is then ensured. To
find these update laws for a and b , let us define a Lyapunov function
candidate
V (e, a , b ) =
1 2 1
e + b p (a 2 + b 2 )
2
2
(8)
)
+ bb
V = ee + b p (aa
) + b p (aa
)
p + b p br
+ bb
= e(a m e + b p ax
(9)
a = sgn(b p )ex p
(10a)
b = sgn(b p )er
(10b)
V = a m e 2 0
(11)
e 2 dt = a m1
= a m1 (V0 V ) <
Vdt
=0
p + br
ax
(12)
bm
am
is the d.c. gain of the reference model (2). Equation (12) further implies
ka + b = 0
(13)
which is exactly a straight line in the parameter error space. Therefore, for a
constant reference input r, both the estimated parameters do not necessarily
converge to zero. To investigate the problem of parameter convergence, we need
the concept of persistent excitation. A signal v n is said to satisfy the
persistent excitation (PE) condition if , T > 0 such that
48
Chapter 2 Preliminaries
t +T
vv dt I
(14)
Define v = [ x p r ]T and = [ a
represented into the vector form
a
r] = 0
v T = [ x p
(15)
t +T
T
vv dt = 0
(16)
t +T
vv dt = 0
(17)
x p = A p x p + B p u
(18)
x m = A m x m + B m r
(19)
nm
where A m
and B m
are known and r m is a bounded
reference input vector. All eigenvalues of Am are assumed to be with strictly
negative real parts. A control law for this problem can be designed in the form
u = BAx p + Br
(20)
where A and B are feedback gain matrices. Substituting (20) into (18), the
closed loop system is derived as
x p = ( A p + B p BA) x p + B p Br
(21)
A p + B p BA = A m
(22a)
B pB = B m
(22b)
then the behavior of system (18) is identical to that of the reference model. The
control in (20) is called the MRC rule if A and B satisfy (22). Since the values
of Ap and Bp are not given, the MRC rule is not realizable. Now, let us replace
(t ) and B (t ) , respectively, to
the values A and B in (20) with their estimates A
have the MRAC rule
(t )x p + B (t )r
u = B (t ) A
(23)
A and B B .
We would like to design proper update laws to have A
Define the tracking error
e = x p xm
(24)
A)x p
e = A m (x p x m ) + B m ( A
p + (B p B B m )r
+ (B p B B m ) Ax
p + (B p B B m )( Ax
p + r)
= A me + B m Ax
(25)
50
Chapter 2 Preliminaries
=A
A . Using the relations in (22), the above equation is reduced to
where A
p + B m (B 1B I )B 1 (BAx
p + Br
)
e = A me + B m Ax
p + B m (B 1 B 1 )u
= A me + B m Ax
(26)
(27)
, B 1 ) = eT Pe + Tr ( A
TA
+ B 1T B 1 )
V (e, A
(28)
A
= B Tm Pex Tp
(29a)
B 1 = B Tm Peu T
(29b)
Tm Peu T B
B = BB
(29c)
, B 1 ) = 0 is uniformly stable
Therefore, we have proved that the origin (e, A
using controller (23) with update laws in (29). It should be noted that the control
, B }
scheme can only guarantee uniform stability of the origin in the {e, A
space. Since the Lyapunov function in (28) is not radially unbounded, the global
behavior cannot be concluded. If the reference input is PE, convergence of the
estimated parameters can further be proved.
x = Ax + T (t )z
(30a)
z = (t )Px
(30b)
V (x, z ) = x T Px + z T z
(31)
Along the trajectory of (30), the time derivative of V can be computed to have
V = x T ( A T P + PA )x = x T Qx 0
(32)
x T Qxdt =
= V0 V <
Vdt
52
Chapter 2 Preliminaries
x T (T )Px(T ) + z T (T )z (T ) x T (t )Px(t ) + z T (t )z (t )
z T (t )z (t )
(33)
max (P ) x(T ) + z (T ) z (t )
z (t ) max (P ) x(T ) + z (T )
(34)
x(t )
2max (P )
(35)
z (t )
2
2
+ z (T )
(36)
If we may claim that > 0 , T > 0 and for any initial conditions of x and z,
t > T such that z (t ) < , then we may use this property to say that > 0 ,
2
2
2
2
z 0 as t , if the claim is justified. Since all properties hold for all x and
z, and are uniform respect to the initial time, the equilibrium point (x, z ) = 0 is
globally uniformly stable. Since the system is linear, it is also globally
exponentially stable.
Now let us prove the above claim by contradiction, i.e., to prove that
> 0 , we cannot find t1>0 such that z (t ) > for all t t1 . Suppose there
exists a t1 > 0 such that z (t ) > for all t t1 . The PE condition says that
there exist T, k>0, such that
t +T
( )T ( )d kI > 0 t t 0 .
(37)
t +T
Let w =
w T ( )PT ( )w d k min (P ) t t 0
(38)
z
, then (38) becomes
z
t +T
t t 0
t t1
(39)
1
2
T (z (t ), t ) = [z T (t + T )z (t + T ) z T (t )z (t )]
(40)
T = z T (t + T ) z (t + T ) z T (t )z (t )
=
t +T
d T
[z ( )z ( )] d
d
(41)
T =
t +T
Px z T PAx] d
[ x T PT Px z T
t +T
t +T
z T PT z d
Px z T PAx] d
[x T PT Px z T
k min (P ) 2
t t1
(42)
54
Chapter 2 Preliminaries
t +T
Px z T PAx] d
[x T PT Px z T
2
[ max
(P ) 2 x + max ( P) z + max (P ) z A ]
t +T
x( ) d
(43)
t +T
Px z T PAx] d 1 k min (P ) 2
[x T PT Px z T
2
(44)
1
2
1
2
Dead-Zone
Consider the uncertain linear time-invariant system
x p = a p x p + b p u + d (t )
(45)
x m = a m xm + bm r
where a m < 0 and bm = b p . Let a =
(46)
am a p
and b = 1 be ideal feedback
bp
gains for the MRAC law (4). Since ap is not given, a practical feedback law is
designed to be
u = a(t ) x p + br
(47)
p + d (t )
e = a m e + b p ax
(48)
V (e, a ) =
1 2
(e + a 2 )
2
V = a m e 2 + a (b p ex p + a ) + ed
(49)
a = b p ex p
(50)
(49) becomes
V = a m e 2 + ed
= ( d a m e )e
( a m e ) e
(51)
56
Chapter 2 Preliminaries
am
;
a m
ex p if (e, a ) D c
a =
if (e, a ) D
0
(52)
assures boundedness of all signals in the system. The notation D c denotes the
complement of D. The modified update law (52) implies that when the error e is
within the dead-zone D, the update law is inactive to avoid possible parameter
drift. It should be noted that, however, the asymptotic convergence of the error
signal e is no longer valid after the dead-zone modification even when the
disturbance is removed.
-modification
In applying the dead-zone modification, the upper bound of the disturbance
signal is required to be given. Here, a technique called -modification is
introduced which does not need the information of disturbance bounds.
Instead of (52), the update law (50) is modified as
a = b p ex p a
(53)
+ ed
V = a m e 2 aa
+ e
a m e 2 a 2 aa
(54)
for some unknown > 0 . Rewrite the two terms in (54) involving e as
1
am e + e = am e
2
1
1 2
am e 2 +
2
2 am
a m
1
1 2
am e 2 +
2
2 am
(55)
a 2 + a a
a 2 aa
1
1
2
a 2 + a
2
2
(56)
1
1 2 1
1
2
V a m e 2 +
a 2 + a
2
2 am 2
2
(57)
1 2 1
1
1
2
V V +
+ a + [ a m ] e 2 + [ ] a 2
2 am 2
2
2
(58)
1 2 1
2
V V +
+ a
2 am 2
(59)
Therefore, V 0 , if
1 2
1
2
a .
+
2 a m 2
(60)
This implies that signals in the closed loop system are uniformly bounded.
Hence, the additional term a in the update law makes the adaptive control
system robust to bounded external disturbances, although bounds of these
disturbances are not given. One drawback of this method is that the origin of the
system (48) and (53) is no longer an equilibrium point, i.e., the error signal e
will not converge to zero even when the disturbance is removed.
58
Chapter 2 Preliminaries
the knowledge of the uncertainty variation bounds is a must for almost all robust
control strategies. This is because the robust controllers need to cover system
uncertainties even for the worst case. One the other hand, we know from
Section 2.10 that for the adaptive controller to be feasible the unknown
parameters should be time-invariant. This is also almost true for most adaptive
control schemes. Let us now consider the case when a system contains timevarying uncertainties whose variation bounds are not known. Since it is timevarying, traditional adaptive design is not feasible. Because the variation bounds
are not given, the robust strategies fail. We would like to call this kind of
uncertainties the general uncertainties. It is challenging to design controllers for
systems containing general uncertainties.
In Section 2.11.1, we are going to have some investigation on the
difficulties for the design of adaptive controllers when the system has timevarying parameters. In Section 2.11.2, we will look at the problem in designing
robust controllers for systems containing uncertain parameters without knowing
their bounds.
x p = a p (t ) x p + b p u
(1)
x m = a m xm + bm r
(2)
a m a p (t )
and b = 1 be ideal feedback
bp
gains for the MRC law u = a (t ) x p + br . Since ap(t) is not given, a practical
u = a(t ) x p + br
(3)
p
e = a m e + b p ax
(4)
1
1
V (e, a ) = e 2 + b p a 2
2
2
(5)
V = a m e 2 + b p a (ex p + a a )
(6)
a = ex p
(7)
V = a m e 2 b p aa
(8)
Since a and a are not available, the definiteness of V can not be determined.
Therefore, we are not able to conclude anything about the properties of the
signals in the closed loop system. From here, we know that the assumption for
the unknown parameters to be time-invariant is very important for the feasibility
of the design of adaptive controllers. It is equivalent to say that the traditional
MRAC fails in controlling systems with time-varying uncertainties.
x = f ( x, t ) + g ( x, t )u
(9)
60
Chapter 2 Preliminaries
f ( x, t ) = f m + f
(10)
(11)
s = f + gu x d
(12)
u=
1
[ f m + xd 1 sgn( s )]
g
(13)
s = f 1 sgn( s)
(14)
ss = fs 1 s
( 1 ) s
(15)
Since is not given, we may not select 1 similar to the one we have in (2.911). Therefore, the sliding condition can not be satisfied and the sliding control
fails in this case.
Bound estimation for uncertain parameter
An intuitive attempt in circumvent the difficulty here is to estimate by
using conventional adaptive strategies. Since itself is a constant, it might be
possible to design a proper update law for its estimate .
Let be a positive number, then we may pick 1 = + so that
equation (15) becomes
ss s s
(16)
61
V=
1 2 1 2
s +
2
2
(17)
V = ss
s + ( s )
(18)
= s
(19)
V s
(20)
It seems that the estimation of the uncertainty bound can result in closed loop
stability. However, in practical applications, the error signal s will never be zero,
and the update law (19) implies an unbounded . Therefore, the concept in
estimation of the uncertainty bound is not realizable.
62
Chapter 2 Preliminaries
x p = a p (t ) x p + b p u
(1)
e = a m e + b p (a a ) x p
where a (t ) =
(2)
a m a p (t )
is the perfect gain in the MRAC rule. Since it is
bp
a = wT z +
Tz
a = w
(3)
63
(4)
1
1
) = e2 + bpw
Tw
V (e, w
2
2
(5)
Tw
V = ee b p w
)
T ( zx p e w
= ame 2 + b p w
(6)
= zx e
w
p
(7)
V = a m e 2 0
(8)
we may have
64
Chapter 2 Preliminaries
x1 = x 2
x 2 = x3
x n 1 = x n
x n = f (x, t ) + g (x, t )u
(9)
0 < min
g min
g
g max max
gm
gm
We would like to design a controller such that the system state x tracks the
desired trajectory x d d , where d is a compact subset of . Define the
tracking error vector as e = x x d = [ x1 x1d x 2 x 2 d x n x nd ]T .
The control law can be selected as
u=
1
( f + u r )
gm
(10)
and = x nd
k e
i i +1
i=0
A=
0
k 0
1
0
0
1
0
k1
0
k2
n n
1
k n 1
0
0
65
is Hurwitz. With the controller (10), the last line of (9) becomes
g
( f + u r )
gm
= ( f f ) + (1 g )( f ) + gu r
x n = f +
en +
k e
i i +1
= ( f f ) + (1 g )( f ) gu r
i=0
e = Ae + b[( f f ) + (1 g )( f ) gu r ]
(11)
f = wT z +
Tz
f = w
Then (11) becomes
T z + + (1 g )( f ) gu r ]
e = Ae + b[w
(12)
=ww
. To find the update law, let us consider the Lyapunov-like
where w
function candidate
T w
V = e T Pe + w
(13)
66
Chapter 2 Preliminaries
V = e T ( A T P + PA) e + 2[(1 g )( f ) gu r ] b T Pe
T (zb T Pe w
)
+ 2 b T Pe + 2w
eT Qe + 2(1 + max ) f b T Pe 2 min u r b T Pe
)
T (zb T Pe w
+ 2 b T Pe + 2w
We may thus select
1 + max
f sgn(b T Pe)
(14a)
= 1 (zb T Pe w
), > 0
w
(14b)
ur =
min
The signum function in (14a) might induce chattering control activity which
would excite un-modeled system dynamics. Some modifications can be used to
smooth out the control law. The most intuitive way is to replace the signum
function with the saturation function as
ur =
1 + max
f sat(b T Pe)
(14c)
min
One drawback for this modification is the reduction in the output tracking
accuracy. It is also noted that the -modification term in (14b) is to robustify the
adaptive loop. With (14), the time derivative of V becomes
Tw
V eT Qe + 2 b T Pe + 2 w
T (w w
)
= eT Qe + 2 b T Pe + 2 w
Tw w
]
min (Q ) e + 2 max (P) e + 2 [w
2
(a)
(b )
(15)
67
2 max (P )
1
= min (Q) e
2
min (Q)
1
4 2 (P) 2
2
min (Q ) e max
2
min (Q)
1
2 2 (P) 2
2
min (Q ) e + max
min (Q)
2
Likewise, part (b) can also be written as
Tw w
w w
1
1
w )2 ( w
= ( w
2
2
1
2
2
w )
( w
2
w )
1
2 2 (P) 2
2
2
V min (Q) e + max
w + w
2
min (Q)
1
2
2
+ w
= min (Q) e w
2
2
2 max
(P) 2
min (Q)
(16)
(c )
T w
max (P ) e + max () w
V = e T Pe + w
2
(17)
68
Chapter 2 Preliminaries
(Q )
V V + max (P ) min
e
2
+ w
+[max () ] w
2
2
2 max
(P) 2
min (Q)
min (Q )
,
, then we have
2 max (P ) max ()
Pick min
2 (P ) 2
2
V V + w + max
min (Q)
2
(18)
1
2 2 (P)
2
) E (e, w
) V > w + max
(e, w
sup 2 ( ) .
min (Q) t0
2 2 (P)
2
w + max
sup 2 ( )
min (Q) t0 t
T w
min (P ) e + min ( ) w
V = e T Pe + w
2
1
1
2
]
[V min () w
V
min (P )
min (P )
(19)
69
V (t 0 )
e
min (P )
e
+
(t t0 )
2
min (P )
2
2max
(P)
sup ( )
min (P ) min (Q) t0 t
(20)
ur =
1 + max
f sgn(b T Pe) +
sgn(b T Pe)
min
min
If the control law and the update law are still selected as (10) and (14b) with
V eT Qe + 2 b T Pe 2 b T Pe
eT Qe 0
Therefore, we may also have asymptotical convergence of the output error by
using Barbalats lemma.
Chapter 3
3.1 Introduction
In this chapter, we review the mathematical models for the robot
manipulators considered in this book. For their detailed derivation, please refer
to any robotics textbooks. In Section 3.2, a set of 2n coupled nonlinear ordinary
differential equations are used to describe the dynamics of an n-link rigid robot.
When interacting with the environment, the external force is included into the
dynamics equation which is presented in Section 3.3. The actuator dynamics is
considered in Section 3.4, and a motor model is coupled to each joint dynamics,
resulting in a set of 3n differential equations in its dynamics. In Section 3.5, the
dynamics for an electrically driven rigid robot interacting with the environment
is presented. It is composed of 3n differential equations with the inclusion of the
external force. Section 3.6 takes the joint flexibility into account where a set of
4n differential equations are used to represent the dynamics of an n-link flexiblejoint robot. To investigate the effect when interacting with the environment, the
external force is added into the dynamics equation in Section 3.7. With
consideration of the motor dynamics in each joint of the flexible joint robot, its
model becomes a set of 5n differential equations in Section 3.8. Finally, we
include the external force in Section 3.9 to have the most complex dynamics
considered in this book, i.e., the dynamics for an electrically driven flexible joint
robot interacting with the environment.
+ C(q, q )q + g(q) =
D(q)q
71
(1)
72
(q ) 2C(q, q ) is skew-symmetric.
Property 2: D
Property 3: The left-hand side of (1) can be linearly parameterized as the
) n r with a parameter
multiplication of a known regressor matrix Y (q, q , q
vector p r , i.e.
(2)
Consider a planar robot with two rigid links and two rigid revolute joints
shown in Figure 3.1. Its governing equation can be represented by (Slotine and
Li 1991)
73
(3)
d 22 = m2lc22 + I 2
c11 = m2l1lc 2 sin q 2 q 2
c12 = m2l1l c 2 sin q 2 ( q1 + q 2 )
c21 = m2l1l c 2 sin q 2 q1
c22 = 0
g1 = m1l c1 g cos q1 + m2 g[lc 2 cos(q1 + q 2 ) + l1 cos q1 ]
g 2 = m2lc 2 g cos(q1 + q 2 )
Property 1 and 2 can be confirmed easily by direct derivation, while property 3
will further be investigated in Chapter 4.
(1)
D x (x)
x + C x (x, x )x + g x (x) = J a T (q) Fext
(2)
74
where x n is the coordinate in the Cartesian space, and other symbols are
defined as
(3)
d x11
d
x 21
d x12
x c x11 c x12 x g x1
+
+
d x 22
y c x 21 c x 22 y g x 2
J a11
=
J a 21
J a12
J a 22
1 f ext
+ 0
2
(4)
where
J a11
J
a 21
d x11
d
x 21
d 21
J a12
J a 22
J a12
J a 22
c11
c
22
d12 J a11
d 22 J a 21
21 d 22 J a 21
-1
c12
c22
-1
J a12 J a11
J a 22 J a 21
g x1 J a11
g = J
x 2 a 21
J a12
J a 22
J a12
J a 22
J a12 J a11
J a 22 J a 21
g1
g
2
J a12
J a 22
-1
75
+ C(q, q )q + g (q) = Hi
D(q)q
(1a)
Li + Ri + K bq = u
(1b)
76
Example 3.3: With the consideration of the motor dynamics, the 2-D robot
introduced in Example 3.1 is now rewritten as
L1
0
0 i1 r1
+
L2 i2 0
0 i1 k b1
+
r2 i2 0
0 q1 u1
=
k b2 q 2 u 2
(2a)
(2b)
(1a)
Li + Ri + K b q = u
(1b)
We may also use equation (3.3-3) to transform (1) to the Cartesian space
D x (x)
x + C x (x, x )x + g x (x) = J a T (q)Hi Fext
(2a)
Li + Ri + K b q = u
(2b)
Example 3.4: The robot in Example 3.3 can be modified to include the
effect of the interaction force with the environment in the Cartesian space as
d x11
d
x 21
d x12
x c x11 c x12 x g x1
+
+
=
d x 22
y c x 21 c x 22 y g x 2
J a11
J
a 21
L1
0
0 i1 r1
+
L2 i2 0
J a12
J a 22
h1 0 i1 f ext
0 h i + 0
2 2
0 i1 k b1
+
r2 i2 0
0 q1 u1
=
k b2 q 2 u 2
(3a)
(3b)
+ C(q, q )q + g(q) = K( q)
D(q)q
(1a)
+ B + K( q) = a
J
(1b)
78
x1 = x2
x 2 =
MgL
K
sin x1 ( x1 x3 )
I
I
x3 = x 4
x 4 =
(2)
1
K
( x1 x3 ) + u
J
J
where xi , i=1,,4 are state variables, and the output y=x1 is the link
angular displacement, i.e., 2 in the figure. The link inertial I, the rotor inertia J,
the stiffness K, the link mass M, the gravity constant g, and the center of mass L
are positive numbers. The control u is the torque delivered by the motor.
Example 3.6: A 2-D rigid-link flexible-joint robot
The equation of motion for the robot introduced in Example 3.1 with
consideration of joint flexibility can be represented as
(3a)
21 d 22 q2 c21 c 22 q 2 g 2 0 k 2 2 q 2
j1
0
0 1 b1 0 1 k1 0 1 q1 a1
=
+
+
j 2 2 0 b2 2 0 k 2 2 q 2 a2
(3b)
D x (x)
x + C x (x, x )x + g x (x) = J a T (q)K( q) Fext
(1a)
+ B + K ( q) = a
J
(1b)
79
When the robot in Example 3.6 performs compliant motion control, its
dynamic equation in the Cartesian space becomes
d x11
d
x 21
d x12
x c x11 c x12 x g x1
+
+
=
d x 22
y c x 21 c x 22 y g x 2
j1
0
k1 0 1 q1 f ext
0 k q + 0
2 2
2
(2a)
0 1 b1 0 1 k1 0 1 q1 a1
=
+
+
j 2 2 0 b2 2 0 k 2 2 q 2 a2
(2b)
J a11
J
a 21
J a12
J a 22
+ C(q, q )q + g(q) = K ( q)
D(q)q
(1a)
+ B + K ( q) = Hi
J
(1b)
Li + Ri + K b q = u
(1c)
21 d 22 q2 c21 c 22 q 2 g 2 0 k 2 2 q 2
j1
0
(2a)
0 1 b1 0 1 k1 0 1 q1 h1 0 i1
=
(2b)
+
+
j 2 2 0 b2 2 0 k 2 2 q 2 0 h2 i2
L1
0
0 i1 r1
+
L2 i2 0
0 i1 k b1
+
r2 i2 0
0 q1 u1
=
k b2 q 2 u 2
(2c)
80
D x (x)
x + C x (x, x )x + g x (x) = J a T (q)K( q) Fext
(1a)
+ B + K ( q) = Hi
J
(1b)
Li + Ri + K b q = u
(1c)
It is the most complex system considered in this book. For each joint, a 5th
order differential equation is needed to model its dynamics.
Example 3.9: A 2-D electrically-driven flexible-joint robot interacting with
environment
When considering the actuator dynamics, the robot in Example 3.7 can be
rewritten into the form
d x11
d
x 21
d x12
x c x11 c x12 x g x1
+
+
=
d x 22
y c x 21 c x 22 y g x 2
J a11
J
a 21
j1
0
J a12
J a 22
k1 0 1 q1 f ext
0 k q + 0
2 2
2
0 1 b1 0 1 k1 0 1 q1 h1 0 i1
=
+
+
j 2 2 0 b2 2 0 k 2 2 q 2 0 h2 i2
L1
0
0 i1 r1
+
L2 i2 0
0 i1 k b1
+
r2 i2 0
0 q1 u1
=
k b2 q 2 u 2
(2a)
(2b)
(2c)
3.10 Conclusions
In this chapter, eight robot models have been introduced. Some of them
consider the actuator dynamics, some take the joint flexibility into account, and
some allow the robot to interact with the environment. These robot models are
summarized in Table 3-1 for comparison. In the following chapters, controllers
3.10 Conclusions 81
will be designed for these robots when the models are known followed by
derivations of adaptive controllers for uncertain robots.
Table 3.1 Robot models considered in this book
Systems
Dynamics Models
Equation
Numbers
RR
+ C(q, q )q + g (q) =
D(q)q
(3.2-1)
RRE
D x (x)
x + C x (x, x )x + g x (x) = J aT (q) Fext
(3.3-2)
EDRR
+ C(q, q )q + g (q ) = Hi
D (q )q
Li + Ri + K bq = u
(3.4-1)
EDRRE
D x (x)
x + C x (x, x )x + g x (x) = J aT (q)Hi Fext
Li + Ri + K bq = u
(3.5-2)
FJR
+ C(q, q )q + g(q ) = K ( q)
D (q )q
J + B + K( q) = a
(3.6-1)
FJRE
D x (x)
x + C x (x, x )x + g x (x)
(3.7-1)
J aT (q)K( q) Fext
J + B + K ( q) = a
EDFJR
+ C(q, q )q + g(q) = K ( q)
D(q)q
+ B + K ( q) = Hi
J
(3.8-1)
Li + Ri + K bq = u
EDFJRE
D x (x)
x + C x (x, x )x + g x (x)
= J aT (q)K( q) Fext
+ B + K( q) = Hi
J
Li + Ri + K bq = u
(3.9-1)
Chapter 4
4.1 Introduction
The dynamics of a rigid robot is well-known to be modeled by a set of
coupled highly nonlinear differential equations. Its controller design is generally
not easy even when the system model is precisely known. In practical operations
of an industrial robot, since the mathematical model inevitably contains various
uncertainties and disturbances, the widely used computed-torque controller may
not give high precision performance. Under this circumstance, several robust
control schemes (Abdallah et. al. 1991) and adaptive control strategies (Ortega
and Spong 1988, Pagilla and Tomizuka 2001) are suggested.
For the adaptive approaches, although these control laws can give proper
tracking performance under various uncertainties, most of them require
computation of the regressor matrix. This is because, with the regressor matrix,
the robot dynamics is able to be expressed in a linearly parameterized form so
that a proper Lyapunov function candidate can be found to give stable update
laws for uncertain parameters. Since the regressor matrix depends on the joint
position, velocity and acceleration, it should be updated in every control cycle.
Due to the complexity in the regressor computation, these approaches may have
difficulties in practical implementation. Sadegh and Horowitz (1990) proposed a
method to allow off-line computation of the regressor using the desired
trajectories instead of actual measurements. Sometimes a large memory space
should be allocated to store the look-up table containing the regressor. Lu and
Meng (1991a, 1993) proposed some recursive algorithms for general n DOF
robots. Kawasaki et al. (1996) presented a model-based adaptive control for a
robot manipulator whose regressor was computed explicitly by a recursive
algorithm based on the Newton-Euler formulation. Yang (1999) proposed a
robust adaptive tracking controller for manipulators whose regressor depends
only on the desired trajectory and hence can be calculated off-line.
83
84
+ C(q, q )q + g(q) =
D(q)q
(1)
d K d (q q d ) K p (q q d )] + Cq + g
= D[q
(2)
e + K d e + K p e = 0, e = q q d
(3)
(4)
+ g
[q
d K d (q q d ) K p (q q d )] + Cq
=D
(5)
(6)
= CC
, and g = g g are estimation errors. By defining
= DD
,C
where D
r
is
p = p p with p an estimate of p in (4) and assuming that D
invertible for all time t 0 , equation (6) can be further written to be
1Y (q, q , q
e + K d e + K p e = D
)p
(7)
86
The above error dynamics implies that if we may find a proper update law for p
such that p p asymptotically, then (7) converges to (3) as t , and
hence, we may have convergence of the output tracking error. To this end, let us
denote x = [e T e T ]T 2 n to represent equation (7) in its state space form
1Yp
x = Ax BD
0
K p
where A =
(8)
In
0
2 n 2 n and B = 2 n n . To design an
K d
I n
V (x, p ) =
1 T
1
x Px + p T p
2
2
(9)
1
1Y) T B T Px + p ]
V = x T Qx p T [(D
2
(10)
1Y(q, q , q
)]T B T Px
p = 1[ D
(11)
1
V = x T Qx 0
2
(12)
( Qx)T ( Qx)dt =
x T Qxdt =
Hence, (11) might suffer the singularity problem when the determinant of D
gets very close to 0, and some projection modification should be applied. To
solve these problems, a well-known strategy proposed by Slotine and Li (1988,
1991) based on the passivity design for rigid robots will be introduced in next
section.
d De + Cq d C e =
Ds + Cs + g + Dq
(1)
Suppose the robot model is precisely known, then we may pick an intuitive
controller for (1) as
d De + Cq d C e + g K d s
= Dq
(2)
where Kd is a positive definite matrix. Hence, the closed loop system becomes
Ds + Cs + K d s = 0
(3)
1 T
s Ds. Its time derivative along the trajectory of (3)
2
can be computed as
1
2C)s
V = s T K d s + s T (D
2
88
V = s T K d s 0
(4)
d C
e + g K d s
d D
e + Cq
= Dq
(5)
(6)
where v = q d e is a known signal vector. With this control law, the closed
loop system can be represented in the form
g
Cv
Ds + Cs + K d s = Dv
(7)
The right hand side of the above equation can be further expressed in the
linearly parameterized form
Ds + Cs + K d s = Y(q, q , v, v )p
(8)
) in (4.2-7), the
It is worth to mention that unlike the regressor matrix Y (q, q , q
regressor matrix Y (q, q , v, v ) in (8) is independent to the joint accelerations.
On the other hand, if we may find an appropriate update law for p such that
p 0 as t , then (8) converges to (3) asymptotically, and the closed loop
stability can be ensured. To find the update law, let us define a Lyapunov-like
function candidate as
1
1
V (x, p ) = s T Ds + p T p
2
2
(9)
V = s T K d s p T (p + Y T s)
(10)
p = 1Y T (q, q , v, v )s
(11)
V = s T K d s 0.
(12)
This is the same result we have in (4) and same convergence performance can
thus be concluded for the tracking error. To implement the control law (6) and
update law (11), we do not need the information of joint accelerations and it is
free from the singularity problem in the estimation of the inertia matrix.
m1lc21
2
m2l1
2
m2l c 2
m2 I1lc 2
p = I1
I2
m1lc1 g
m2l1 g
m2lc 2 g
(1)
90
q1 q2
) =
Y (q, q , q
0 0
q1 + q2
q1 + q2
y14
y 24
q1
q2
q1 + q2
q2
where
v2
v1
Y (q, q , v, v ) =
0 v1 + v2
v2 cos q 2
0
y14
y 24
cos q1 cos(q1 + q 2 )
(3)
0
cos(q1 + q 2 )
where
y14
= v2 cos q 2 + v1q 2 sin q 2 + (q1 + q 2 )v 2 sin q 2
y 24 = v1 cos q 2 + v1q1 sin q 2
and the corresponding parameter vector is
m2l c 2 2 + I 2
2m2l1lc 2
p=
m2l1lc 2
m1lc1 g + m2l1 g
m 2l c 2 g
(4)
In (1) and (4), the entries are some combinations of system parameters such as
link lengths, masses, and moments of inertia,, etc. Obviously, these quantities
are relatively easy to measure in practical applications compared to the
derivation of the regressor matrix. However, in traditional robot adaptive control
designs, we are required to know the complex regressor matrix, but update
the easy-to-obtain parameter vector. To ease the controller design and
implementation, it is suggested to consider the regressor-free adaptive control
+ g K d s
+ Cv
= Dv
(1a)
g
Cv
Ds + Cs + K d s = Dv
(1b)
This implies that s is an output of a stable first order filter driven by the
approximation errors and tracking errors. If some proper update laws can be
C and g g , then e 0 can be concluded
D, C
found so that D
from (1b). Since D, C and g are functions of states and hence functions of time,
traditional adaptive controllers are not applicable to give proper update laws
except that the linearly parameterization assumption as shown in (4.3-8) is
feasible. On the other hand, since their variation bounds are not given,
conventional robust designs do not work either. Here, we would like to use FAT
to representation D, C and g with the assumption that proper numbers of basis
functions are employed
D = WDT Z D + D
C = WCT Z C + C
(2a)
g = WgT z g + g
2
n n
where WD n D n ,2 WC n C 2n and Wg g
are weighting
n 1
matrices and Z D n D n , Z C n C n and z g g are matrices of
basis functions. The number () represents the number of basis functions used.
Using the same set of basis functions, the corresponding estimates can be
represented as
=W
DT Z D
D
=W
CT Z C
C
gT z g
g = W
(2b)
92
DT Z D v + W
CT Z C v + W
gT z g K d s
=W
(3)
DT Z D v W
CT Z C v W
gT z g + 1
Ds + Cs + K d s = W
(4)
() = W() W
() and 1 = 1 ( D , C , g , s, q
d ) n is a lumped
where W
approximation error vector. Since W() are constant vectors, their update laws
can be easily found by proper selection of a Lyapunov-like function. Let us
consider a candidate
D, W
C, W
g ) = 1 s T Ds
V (s, W
2
1
DT Q D W
D+W
CT Q C W
C+W
gT Q g W
g)
+ Tr ( W
2
(5)
n n
g
where Q D n D n D , Q C n C n C and Q g g
are positive
definite weighting matrices. The time derivative of V along the trajectory of (4)
can be computed as
DT Z D v W
CT Z C v W
gT z g ]
V = s T [Cs K d s W
1
T
DT Q D W
D + W
CT Q C W
T
+ s T Ds
Tr ( W
C + Wg Q g Wg ) + s 1
2
2C is skew-symmetric, we further have
Using the fact that the matrix D
DT (Z D vs
T + Q DW
V = s T K d s Tr[ W
D )]
CT (Z C vs T + Q C W
C )] Tr[ W
gT (z g s T + Q g W
g )] + s T 1 (6)
Tr[ W
Let us select the update laws with -modifications to be
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
W
C = Q C ( Z C vs + C WC )
1
T
W
g = Q g ( z g s + g Wg )
(7)
DT W
D)
V = s T K d s + s T 1 + DTr ( W
CT W
C ) + gTr ( W
gT W
g)
+ CTr ( W
(8)
Remark 1: Suppose a sufficient number of basis functions are used and the
approximation error can be ignored, then it is not necessary to include the modification terms in (7). Hence, (8) can be reduced to (4.3-12), and
convergence of s can be further proved by Barbalats lemma.
Remark 2: If the approximation error cannot be ignored, but we can find a
positive number such that 1 , then a robust term robust can be added
into (1a) to have a new control law
+ g K d s + robust
+ Cv
= Dv
(9)
Consider the Lyapunov-like function candidate (5) again, and the update law (7)
without -modification; then the time derivative of V becomes
V s T K d s + s + s T robust
(10)
1
1
2
s T K d s + s T 1 min (K d ) s
min (K d )
2
(11a)
(T) W
() ) 1 Tr ( W(T) W() ) 1 Tr ( W
(T) W
( ) )
Tr ( W
2
2
(11b)
The proof for the first inequality is straightforward, and the proof for the second
one can be found in the Appendix. Together with the relationship
94
1
DT Q D W
D+W
CT Q C W
C +W
gT Q g W
g )]
V = [s T Ds + Tr ( W
2
1
2
DT W
D)
[ max (D) s + max (Q D )Tr ( W
2
CT W
C ) + max (Q g )Tr ( W
gT W
g )]
+ max (Q C )Tr ( W
(12)
V V +
2 min (K d )
1
2
+ {[max (D) min (K d )] s + [max (Q D )
2
DT W
D )} + 1 { [max (Q C ) C ]Tr ( W
CT W
C)
D ]Tr ( W
2
gT W
g ) } + 1 [ DTr ( WDT WD )
+[max (Q g ) g ]Tr ( W
2
T
T
+ CTr ( WC WC ) + gTr ( Wg Wg )]
(13)
min (K d )
min
(14)
V V +
1
1
+ [ DTr ( WDT WD ) +
2 min (K d ) 2
(15)
V>
sup 1 ( ) +
2min (K d ) t 0
1
[ DTr ( WDT WD )
2
(16)
min (K d )
min
Since will not be used in the realization of the control law or the update law,
we are going to use similar treatment in (14) in later chapters to simplify the
derivation.
It can be seen that all terms in the right side of (16) are constants. By
proper selection of the parameters there, we may adjust the set where V 0 to
D, W
C and W
g are
be sufficiently small. Hence, we have proved that s, W
uniformly ultimately bounded. In addition, we may also compute the upper
bound for V by solving the differential inequality of V in (15) as
V (t ) e (t t 0 )V (t 0 ) +
+
sup 1 ( )
1
[ DTr ( WDT WD ) + CTr ( WCT WC ) + gTr ( WgT Wg )]
2
(17)
1
2
DT W
D)
V [ min (D) s + min (Q D )Tr ( W
2
CT W
C ) + min (Q g )Tr ( W
gT W
g )]
+ min (Q C )Tr ( W
we may find the upper bound for s
as
1
DT W
D)
[2V min (Q D )Tr ( W
min ( D)
CT W
C ) min (Q g )Tr ( W
gT W
g )]
min (Q C )Tr ( W
2
V
min ( D)
(18)
96
min (D)
e (t t 0 )V (t 0 ) +
min (D)
sup 1 ( )
(19)
s
+
2V (t 0 ) 2 (t t 0 )
1
+
e
sup 1 ( )
min (D)
min (D) min (K d ) t0 < < t
1
min (D)
(20)
Inequality (20) implies that the time history of the error signal s is bounded by
an exponential function plus some constants. This completes the transient
performance analysis.
Table 4.1 summarizes the adaptive control laws derived in this section.
Two columns are arranged to present the regressor-based and regressor-free
designs respectively according to their controller forms, update laws, and
implementation issues.
Table 4.1 Summary of the adaptive control for RR
Rigid-Link Rigid-Joint Robot
Controller
Regressor-based
Regressor-free
+ g K d s
+ Cv
= Dv
= Y(q, q , v, v )p K d s
+ g K d s
+ Cv
= Dv
T
g zg + W
DT Z D v
=W
(4.3-6)
CT Z C v K d s
+W
(4.5-1)
Adaptive Law
p = 1Y T s
(4.3-11)
Realization
Issue
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
W
C = Q C ( Z C vs + C WC )
1
T
W
g = Q g ( z g s + g Wg )
(4.5-7)
Does not need regressor matrix
Example 4.1:
Consider a 2-DOF planar robot represented in example 3.1, and we are
going to verify the control strategy developed in this section by using computer
simulations. Actual values of link parameters are selected as m1 =m2 =0.5(kg),
l1 =l2 =0.75(m), lc1 =lc2 =0.375(m), and I1 =I2 =0.0234(kg-m2). We would
like the endpoint to track a 0.2m radius circle centered at (0.8m, 1.0m) in 10
seconds without knowing its precise model. The initial condition of the
generalized coordinate vector is set to be q(0) = [0.0022 1.5019 0 0]T ,
i.e., the endpoint is initially at (0.8m, 0,75m). Since it is away from the desired
initial endpoint position (0.8m, 0,8m), some significant transient response can
be observed. The controller in (4.5-1a) is applied with the gain matrices
20 0
10 0
, and =
Kd =
.
0 20
0 10
Since we have assumed that the entries of D, C and g are all unavailable, and
their variation bounds are not known, we employ the FAT to have the
representations in (2). The 11-term Fourier series is selected as the basis
D and W
C are in 44 2 , and
function for the approximation. Therefore, W
22
2
g is in
W
. The initial weighting vectors for the entries are assigned to be
Q D1 = I 44 , Q C1 = I 44 and Q g1 = 100I 22 .
98
In this simulation, we assume that the approximation error can be neglected, and
hence the -modification parameters are chosen as () = 0 . The simulation
results are shown in Figure 4.1 to 4.6. Figure 4.1 shows the tracking
performance of the robot endpoint and its desired trajectory in the Cartesian
space. It is observed that the endpoint trajectory converges nicely to the desired
trajectory, although the initial position error is quite large. After the transient
state, the tracking error is small regardless of the time-varying uncertainties in
D, C and g. Computation of the complex regressor is avoided in this strategy
which greatly simplifies the design and implementation of the control law.
Figure 4.2 presents the time history of the joint space tracking performance. The
transient states converge very fast without unwanted oscillations. The control
efforts to the two joints are reasonable that can be verified in Figure 4.3. Figure
4.4 to 4.6 are the performance of function approximation. Although most
parameters do not converge to their actual values, they still remain bounded as
desired.
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
0.8
0.6
0.4
0.2
5
Time(sec)
10
5
Time(sec)
10
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
14
12
10
8
6
4
2
0
5
Time(sec)
10
5
Time(sec)
10
5
0
5
10
15
20
Figure 4.3 The control efforts for both joints are all reasonable
0.3
0.8
0.2
0.6
0.1
0.2
0
0.1
0.2
0.2
0.3
0.4
D(21)
D(12)
0.4
4
6
Time(sec)
0.4
10
0.6
0.4
0.5
0.35
0.4
0.3
0.3
0.25
D(22)
D(11)
0.2
0.1
4
6
Time(sec)
10
4
6
Time(sec)
10
0.2
0.15
0.1
0.1
0.05
4
6
Time(sec)
10
0.1
0.1
0.05
C(12)
C(11)
0.05
0.05
0.05
4
6
Time(sec)
0.1
10
0.1
4
6
Time(sec)
10
4
6
Time(sec)
10
0.2
0.05
0.15
C(22)
C(21)
100
0.05
0.1
0.05
0.1
0.15
0
0
4
6
Time(sec)
10
101
10
8
g(1)
6
4
2
0
5
Time(sec)
10
5
Time(sec)
10
3
2
g(2)
1
0
1
2
3
+ C(q, q )q + g (q) = Hi
D(q)q
(1a)
Li + Ri + K bq = u
(1b)
We firstly consider the case when all robot parameters are known, and then the
regressor-based adaptive controller is derived if the robot parameters are not
available. Finally, a regressor-free adaptive controller is introduced. Several
simulation results will be presented to justify the necessity for the consideration
of the actuator dynamics, and to evaluate the performance of the controllers
designed here.
With the definitions of s and v in Section 4.2, we may rewrite (1a) into the
form
Ds + Cs + g + Dv + Cv = Hi
(2)
102
Suppose all of the robot parameters are known, then we may design a proper
control law u such that the current i in (1b) follows the trajectory
i = H 1 (g + Dv + Cv K d s)
(3)
where K d is a positive definite matrix. Substituting (3) into (2), the closed
loop dynamics becomes Ds + Cs + K d s = 0. It is exactly the same as the
one in (4.3-3), and hence convergence of s follows. To realize the perfect
current vector in (3), we have to design a control input u in (1b) to ensure that
the actual current can converge to the perfect one. Since all parameters are
assumed to be known at the present stage, we may construct the control input in
the form
u = Li d + Ri + K bq K c e i
(4)
Le i + K c e i = 0
(5)
On the other hand, since the desired current is defined according to (3) as
i d = H 1 (g + Dv + Cv K d s)
(6)
we may rewrite (2) into the form below to represent the dynamics of the output
tracking loop
Ds + Cs + K d s = H(i i d )
(7)
To prove the close loop stability, let us consider the Lyapunov-like function
candidate
1
1
V (s, ei ) = sT Ds + eTi Lei
2
2
(8)
103
Along the trajectory of (5) and (7), we may compute the time derivative of V as
1
2C)s + sT Hei eTi K cei
V = sT K d s + sT (D
2
s
= [sT eTi ]Q 0
ei
(9)
H
Kd
2
where Q =
is positive definite by proper selection of Kc and
1
H K
c
2
Kd. It is easy to further prove that s and ei are also square integrable, and their
time derivatives are uniformly bounded. Hence, by Barbalats lemma, we may
conclude asymptotic convergence of s and ei.
In summary, if all parameters in the EDRR (1) are available, the controller
(4) with the perfect current trajectory (6) can give asymptotic convergence of the
output error.
Remark 1: It has to be noted that in controller (4), we have to find the time
derivative of the desired current trajectory which implies that we need to
feedback the joint accelerations to complete that computation. This necessity
will be eliminated in the following design of the FAT-based regressor-free
adaptive controller in section 4.6.2.
In next step, we would like to derive a regressor-based adaptive controller
for EDRR. The regressor-free design will be developed in section 4.6.2.
Ds + Cs + g + Dv + Cv = Hi
(10a)
Li + Ri + K b q = u
(10b)
Suppose D, C, g, L, R, and Kb are not available, and we may not realize the
control laws in (4) and (6). Instead, let us consider the desired current trajectory
104
K d s)
+ Cv
i d = H 1 (g + Dv
= H 1[ Y (q, q , v , v )p K d s ]
(11)
(12)
=CC
, g = g g and p = p p . If we may design a
= DD
,C
where D
controller u and an update law such that i i d and p p, then (12) implies
convergence of the output error vector. Let us consider the controller which is a
modification of (4) as
d + Ri
+K
bq K c e i
u = Li
(13)
iT
q T ]T 3n
p i = [LT
RT
K Tb ]T 3n n
p i = [L T
T
R
Tb ]T 3n n
K
u = p Ti K c e i
(14)
Substituting (14) into (10b), we may have the dynamics in the current tracking
loop
Le i + K ce i = p Ti
(15)
1
1
1
1
V (s, ei , p , p i ) = sT Ds + eTi Lei + p T p + Tr(p Ti ip i )
2
2
2
2
(16)
105
3n 3n
(17)
p = 1Y T s
p i = i1eTi
(18)
V = [s T
s
e Ti ]Q 0
e i
(19)
H
Kd
2
where Q =
is positive definite by proper selection of K d
1 H K
c
2
and K c . Equation (19) implies that s and ei are uniformly bounded and square
integrable. Their time derivatives can also be proved to be bounded. Hence,
asymptotic convergence of s and ei can be obtained by the Barbalats lemma.
This further implies q q d and i i d as t .
Remark 2: Realization of controller (14) and update law (18) needs to know the
time derivative of the desired current which implies the need for the joint
acceleration feedback and the time derivative of the regressor matrix. All of
these requirements will be eliminated in the following design of the FAT-based
regressor-free adaptive controller.
106
K d s)
+ Cv
i d = H 1 (g + Dv
(20)
(21)
u = f K c e i
(22)
Le i + K ce i = f f
(23)
D = WDT Z D + D
C = WCT Z C + C
(24)
g = WgT z g + g
f = WfT z f + f
n n
n n
where WD n D n , WC n C n , Wg g , and Wf f
2
2
weighting matrices, ZD n D n ,
Z C n C n , z g n g 1 ,
2
are
and
z f
107
n f 1
=W
DT Z D
D
=W
CT Z C
C
(25)
gT z g
g = W
fT z f
f = W
then equation (21) and (23) becomes
DT Z D v W
CT Z C v W
gT z g + 1
Ds + Cs + K d s = H(i i d ) W
(26a)
fT z f + 2
Le i + K c e i = W
(26b)
(27)
n n
g
The matrices Q D n D n D , Q C n C n C , Q g g
, and
n f n f
Q f
are all positive definite. The time derivative of V along the
trajectory of (26) can be computed as
2
T
DT (Z Dvs
T
T + Q DW
Tr[W
D ) + WD (ZC vs + QC WC )
gT (z gsT + Qg W
g ) + W
fT (z f eTi + Qf W
f )]
+W
(28)
108
1
D)
T + DW
W
D = Q D ( Z D vs
C = Q C1 (Z C vs T + C W
C)
W
g = Q g1 (z g s T + g W
g)
W
(29)
f = Q f1 (z f e Ti + f W
f)
W
where (.) are positive numbers. With these selections, equation (28) becomes
s
1
DT W
D)
e Ti ]Q + [s T e Ti ] + DTr ( W
e
i
2
T
T
fT W
f)
+ CTr ( WC WC ) + gTr ( Wg Wg ) + f Tr ( W
V = [s T
(30)
H
Kd
2
where Q =
is positive definite which can be achieved by
1
H K
c
2
K d s + robust 1 )
+ Cv
i d = H 1 (g + Dv
u = f K c e i + robust 2
109
Consider the Lyapunov-like function candidate (27) again, and the update law
(29) without -modification; then the time derivative of V becomes
V [s T
s
eTi ]Q + 1 s + 2 e i + s T robust 1 + e Ti robust 2
e i
[s T
s
e Ti ]Q + [s T
e i
1
e Ti ]
2
1
min (Q)
2
s
1
e (Q)
min
i
1
2
(31)
(T) W
() ) 1 Tr ( W(T) W() ) 1 Tr ( W
(T) W
( ) )
Tr ( W
2
2
Together with the relationship
1
DT Q D W
D +W
CT Q C W
C
V = [s T Ds + e Ti Le i + Tr ( W
2
gT Q g W
g +W
fT Q f W
f )]
+W
2
s
T
e + max (Q D )Tr ( WD WD )
i
CT W
C ) + max (Q g )Tr ( W
gT W
g)
+ max (Q C )Tr ( W
fT W
f )]
+ max (Q f )Tr ( W
1
[ max ( A )
2
D 0
, we may rewrite (30) into the form
0 L
where A =
(32)
110
s
1
1
e + 2 (Q)
min
i
2
1
DT W
D ) + 1 [max (Q C ) C ]Tr ( W
CT W
C)
+ [max (Q D ) D ]Tr ( W
2
2
1
gT W
g ) + 1 [max (Q f ) f ]Tr ( W
fT W
f)
+ [max (Q g ) g ]Tr ( W
2
2
1
+ [ DTr ( WDT WD ) + CTr ( WCT WC ) + gTr ( WgT Wg ) + f Tr ( WfT Wf )] (33)
2
1
V V + [max ( A) min (Q)]
2
min (Q)
(34)
max ( A ) max (Q D ) max (Q C ) max (Q g ) max (Q f )
min
V V +
1
1
+ [ DTr ( WDT WD )
2 min (Q) 2
2
1
(35)
V>
1 ( )
1
sup
+
[ DTr ( WDT WD )
(
)
2min (Q) t 0 2
2
1
(36)
D, W
C, W
g and W
f are uniformly
Hence, we have proved that s, ei, W
ultimately bounded. From (35), we may also compute the upper bound for V as
V (t ) e
1 ( )
V (t 0 ) +
sup
2min (Q) t 0 < < t 2 ( )
( t t 0 )
1
[ DTr ( WDT WD ) + CTr ( WCT WC )
2
+ gTr ( WgT Wg ) + f Tr ( WfT Wf )]
(37)
111
1
min ( A)
2
s
1
T
e + 2 [ min (Q D )Tr ( WD WD )
i
CT W
C ) + min (Q g )Tr ( W
gT W
g)
+ min (Q C )Tr ( W
fT W
f )]
+ min (Q f )Tr ( W
we may find the upper bound for [s T
s
e
i
e Ti ]T
(38)
2
as
2
1
sup
e (t t 0 )V (t 0 ) +
min ( A)
min ( A) min (Q) t0 < < t
1
min ( A)
1 ( )
( )
2
s
e
i
+
1 ( )
2V (t 0 ) 2 (t t 0 )
1
+
e
sup
min ( A)
min ( A) min (Q) t0 < < t 2 ( )
min ( A)
112
Ds + Cs + g + Dv + Cv = Hi
(4.6-1), (4.6-2)
Li + Ri + K bq = u
Regressor-based
1
Controller
Regressor-free
1
i d = H [Y (q, q , v , v )p K d s]
u = p Ti K c e i
K d s)
+ Cv
i d = H (g + Dv
u = f K ce i
(4.6-11) (4.6-14)
(4.6-20) (4.6-22)
p = 1Y T s
p = 1e T
Adaptive
Law
(4.6-18)
W
D
WC
W
g
Wf
D)
T + DW
= Q D1 (Z D vs
C)
= Q C1 (Z C vs T + C W
g)
= Q g1 (Z gs T + g W
f)
= Q f1 (Z f eTi + f W
(4.6-29)
Realization
Issue
Example 4.2:
Consider the same 2-DOF planar robot in example 4.1 with the inclusion of
the actuator dynamics, and we are going to verify the control strategy developed
in this section by using computer simulations. Actual values of link parameters
are selected as m1 =m2 =0.5(kg), l1 =l2 =0.75(m), lc1 =lc2 =0.375(m), and
I1 =I2 =0.0234(kg-m2). Parameters related to the actuator dynamics are
given with h1 =h2 =10(N-m/A), L1 =L2 =0.025(H), r1 =r2 =1(), and
kb1 =kb2 =1(Vol/rad/sec). In order to observe the effect of the actuator
dynamics, the endpoint is required to track a 0.2m radius circle centered at
(0.8m, 1.0m) in 2 seconds which is much faster then the case in example
4.1. The initial conditions of the generalized coordinate vector is
q(0) = [0.0022 1.5019 0 0]T , i.e., the endpoint is still at (0.8m, 0,75m)
initially. Three cases will be investigated in this example to clarify the
significance of the actuator dynamics in the closed loop stability. In case 1, an
adaptive controller designed for EDRR robots is applied to a EDRR robot. Since
the actuator dynamics is considered in the controller, good performance is to be
expected. However, if an adaptive controller for RR is applied to the same
113
EDRR with the same set of controller parameters, the performance would, of
course, be unsatisfactory which will be presented in case 2. In the last case, we
consider the same configuration in case 2, but with improvements in the tracking
performance via controller gain adjustments. It is seen that although the
tracking error can be limited to some range, the control effort would become
impractically huge. Hence, we may arrive at a conclusion later that
consideration of the actuator dynamics is very important if good performance is
required. Table 4.3 summarizes the configuration of the simulation cases.
Table 4.3 Simulation cases
Plant
Controller
Remark
Case 1
EDRR
Case 2
EDRR
Designed for RR
Case 3
EDRR
Designed for RR
It has to be emphasized that, in case 2 and 3, the robot models are in the
voltage level, i.e., the input to the joint is voltage (Figure 4.7). However, the
controllers designed for RR are in the torque level, i.e., their outputs are torque.
Hence, some modification is needed so that the robot and the controller are
compatible.
Torque
RR
Voltage
EDRR
Figure 4.7 The input signal for a RR is in the torque level and its controller should also be in
the torque level. The input for the EDRR is in the voltage level implying that its controller
must be with output in voltage
The adaptive controller in (4.5-1a) is designed for the rigid robot in the
torque level. Now, let us introduce a conversion matrix K which satisfies
K u( ) = Hi( ) = ( ) so that the controller becomes in the voltage level
+ g K d s)
+ Cv
u = K 1 (Dv
DT Z D v + W
CT Z C v + W
gT z g K d s)
= K 1 ( W
(39)
114
1
D)
T + DW
W
D = Q D ( Z D vs
C = Q C1 (Z C vs T + C W
C)
W
(40)
g = Q g1 (z g s T + g W
g)
W
Therefore, some more detail in the simulation cases can be summarized as
shown in Table 4.4.
Case 1: Controller for EDRR applied to EDRR
The controller in (22) is applied with the gain matrices
20 0
10 0
50 0
, =
and K c =
Kd =
.
0 20
0 10
0 50
Table 4.4 Realization details in simulation cases
Plant
Case 1
EDRR
(4.6-1)
Controller
K d s) W
+ Cv
1
D)
i d = H (g + Dv
T + DW
D = Q D ( Z D vs
1 T
T
= H ( Wg z g + WD Z D v
C = Q C1 (Z C vs T + C W
C)
W
T
+ WC Z C v K d s )
g = Q g1 (z g s T + g W
g)
W
u = f K c e i
f = Q f1 (z f e Ti + f W
f)
W
fT z f K c e i
=W
(4.6-20), (4.6-22)
Case 2
EDRR
(4.6-1)
+ g K d s)
+ Cv
u = K 1 (Dv
1 T
CT Z C v
= K (WD Z D v + W
gT z g K d s)
+W
(4.6-39)
Case 3
EDRR
(4.6-1)
Update Laws
+ g K d s)
+ Cv
u = K (Dv
1 T
CT Z C v
= K (WD Z D v + W
T
g z g K d s)
+W
(4.6-39)
(4.6-29)
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
C)
WC = Q C (Z C vs + C W
1
T
g)
Wg = Q g (z g s + g W
(4.6-40)
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
W
C = Q C ( Z C vs + C WC )
1
T
W
g = Q g ( z g s + g Wg )
(4.6-40)
115
The initial value for the desired current can be found by calculation from (20) as
i d (0) = i(0) = [1.5498 3.3570]T . The 11-term Fourier series is selected as
D and W
C are in 44 2 ,
the basis function for the approximation so that W
22 2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
116
1.2
1
0.8
0.6
0.4
0.2
1.4
i(1)
1.2
1
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1
0
i(2)
1
2
3
4
(
2
1.5
1
0.5
0
0.5
1
1.5
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
117
0.3
0.2
0.4
0.2
0
0.1
0.5
1
Time(sec)
1.5
0.2
0.3
0.5
0.2
0.4
0.1
0.3
D(22)
D(21)
0.1
D(12)
D(11)
0.6
0
0.1
0.2
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.2
0.1
0.5
1
Time(sec)
1.5
0.3
0.2
0.2
0.15
0.1
0.1
0.05
C(12)
C(11)
0.1
0.2
0
0.05
0.3
0.1
0.4
0.15
0.5
1
Time(sec)
1.5
0.15
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.2
0.1
0.15
0.05
0
C(22)
C(21)
118
0.05
0.1
0.1
0.05
0.15
0.2
0
0
0.5
1
Time(sec)
1.5
g(1)
6
4
2
0
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
2
1
g(2)
0
1
2
3
4
f(1)
0.5
0
0.5
1
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
f(2)
119
120
10
0
(N-m/Vol). All other required
10
parameters for the controller and update law (40) are the same as those in the
previous case. The purpose of using the same set of parameters is to have an
effective comparison. In the following figures we may observe that the
controller designed for RR is not able to give acceptable performance to a
EDRR under the conditions when the actuator dynamics is important such as the
fast motion trajectory tested here. The simulation results are presented in
Figure 4.16 to 4.22. Figure 4.16 shows that the endpoint motion does not
converge to the desired trajectory. Figure 4.17 indicates that the joint space
motion deviates from the desired trajectory after 0.6 seconds. Figure 4.18 and 19
presents the motor current and the control effort respectively, and impractically
large values can be seen in both curves. Function approximation results shown
in Figure 20 to 22 are not satisfactory either.
1.4
1.3
1.2
1.1
0.9
0.8
0.7
0.6
0.5
0.6
0.7
0.8
0.9
1.1
1.2
Figure 4.16 Tracking performance in the Cartesian space. It can be observed that the
controller designed for RR is not able to give satisfactory performance when applied to
EDRR under fast motion condition
0.6
0.4
0.2
0
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
i(1)
200
0
200
400
600
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
i(2)
500
500
Figure 4.18 Motor currents go to impractically large values after 1.8 seconds
121
x 10
0.5
0.5
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
x 10
1.5
1
0.5
0
0.5
1
Figure 4.19 The control efforts become very large after 1.8 seconds
150
150
100
D(12)
D(11)
100
50
50
0
0
0
0.5
1
Time(sec)
1.5
50
20
300
250
20
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
200
40
D(22)
D(21)
122
60
150
100
80
50
100
120
0
0
0.5
1
Time(sec)
1.5
15
C(12)
C(11)
5
10
5
10
15
0.5
1
Time(sec)
1.5
25
10
80
60
40
C(22)
C(21)
20
5
10
15
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
20
0
0.5
1
Time(sec)
1.5
20
g(1)
200
0
200
400
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1500
1000
g(2)
500
0
500
1000
123
124
200 0
100 0
and =
Kd =
.
0 200
0 100
The gain matrices in the update laws are selected as
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
Figure 4.23 Robot endpoint tracking performance in the Cartesian space. After proper gain
adjustment, significant improvement in the tracking performance can be observed. However,
the tracking error is still unacceptable
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
i(1)
5000
5000
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
4000
i(2)
2000
2000
4000
125
1.5
x 10
1
0.5
0
0.5
1
1.5
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
x 10
2
1
0
1
2
5
0
60
D(12)
D(11)
5
40
20
10
15
20
0
0
0.5
1
Time(sec)
1.5
25
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
150
0
100
D(22)
D(21)
126
10
50
15
20
25
0
0
0.5
1
Time(sec)
1.5
C(12)
C(11)
0.5
1
Time(sec)
1.5
0.2
0.15
0.1
C(22)
C(21)
0.05
0
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.05
0.1
g(1)
4
2
0
2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
5
0
g(2)
5
10
15
20
127
128
4.7 Conclusions
In this chapter, we consider the adaptive control of rigid robots in the free
space. In Section 4.2, a regressor based adaptive controller is derived. To
implement the update law, the estimate of the inertia matrix is required to be
nonsingular and the joint accelerations should be known. Slotine and Lis
approach derived in Section 4.3 is well-known to be free from the singularity
problem in the estimate of the inertia matrix, and its realization does not need
the information of the joint accelerations. All of these approaches need to
calculate of the regressor matrix. We have seen in Section 4.4 that computation
of the regressor matrix is tedious in general; a regressor-free adaptive controller
is derived in Section 4.5 based on the FAT. In some operation conditions, the
actuator dynamics should be carefully considered to give better control
performance. A regressor-based adaptive controller is derived in Section 4.6.1
for a EDRR. However, it not only needs the joint acceleration feedback but also
the calculation of the time derivative of the regressor matrix. A regressor-free
adaptive controller for EDRR is then introduced in Section 4.6.2, whose
implementation is similar to those in Section 4.4. Finally, example 4.2
investigates the necessity for the consideration of the actuator dynamics in three
cases. The simulation results show that, under the fast motion condition, only
the controller designed with consideration of actuator dynamics can give good
performance with reasonable control efforts for a EDRR.
Chapter 5
5.1 Introduction
The impedance control of robot manipulators is to maintain a desired
dynamic relationship between the end-effector and the environment where a
second order mass-spring-damper system is used to specify the target behavior.
It gives a unified approach for controlling the robot in both free space and
constrained motion phases. Following the work of Hogan (1985), several studies
of the impedance control have been proposed. Anderson and Spong (1988)
combined the impedance control with the hybrid control. Goldenberg (1988)
used feedback and feedforward compensation for both force and impedance
control. Mills and Liu (1991) proposed an impedance control method to control
the generalized contact force and position. Gonzalez and Widmann (1995)
presented a hybrid impedance control scheme which uses force commands to
replace desired trajectory. Yoshikawa (2000) surveyed the force control for
robot manipulators.
In Hogans design, the entire robot dynamics is required to be known, and
the impedance controller is derived so that the closed loop system behaves like
the target impedance which can interact with the environment compliantly. In
practical applications, however, the dynamics of the robot manipulator and the
environment inevitably contains various uncertainties and disturbances. Under
this circumstance, one of the effective ways to deal with this difficulty is to
apply the adaptive strategy to the impedance control. Slotine and Li (1987)
extended the adaptive free motion control to the constrained manipulators. Kelly
et al. (1987) suggested two adaptive impedance controllers to reduce model
uncertainties. Lu and Meng (1991b) presented a concept of target-impedance
reference trajectories to deal with the problems of imperfect sensor feedback and
uncertain robot parameters. Based on singular motion robot representation,
Carelli and Kelly (1991) designed an adaptive position/force controller for
constrained robots to achieve global stability results. Colbaugh et al. (1991)
129
130
(1)
D x (x)
x + C x (x, x )x + g x (x) = J a T Fext
(2a)
where
D x (x) = J a T D(q)J a1
C x (x, x ) = J aT [C(q, q ) D(q)J a1J a ]J a1
g x (x) = J aT g (q)
(2b)
In the traditional impedance control, all system parameters are given and a
controller is designed so that the closed-loop system behaves like the target
impedance
M i (
x
x d ) + B i (x x d ) + K i (x x d ) = Fext
nn
(3)
nn
= J Ta (Fext + C x x + g x )
+ J Ta D x {
x d M i1[B i (x x d ) + K i (x x d ) + Fext ]}
(4)
where the terms in the first parenthesis is to cancel the corresponding dynamics
in the robot model, while the rest of the terms are for completing the target
impedance in (3). It is obvious that by plugging (4) into (2), the closed loop
system becomes exactly the target impedance (3).
Suppose some of the system parameters are not available and the above
impedance controller cannot be realized. An adaptive controller can thus be
designed by referring (4) as
x x + g x )
= J Ta (Fext + C
x {
+ J Ta D
x d M i1[B i (x x d ) + K i (x x d ) + Fext ]}
(5)
where quantities with hats are respective estimates. Substituting (5) into (2) and
after some straightforward manipulations, we may have
132
M i (
x
x d ) + B i (x x d ) + K i (x x d )
x C x )x + (g x g x )] Fext
x 1[(D
x D x )
x + (C
= MiD
(6)
x = Cx C
x and g x = g x g x , then (6) can be
x = Dx D
x, C
Define D
further written as
x x + g x ) M i1Fext
x 1 (D
x
e + M i1B i e + M i1K i e = D
x+C
(7)
Represent the right side of (7) into a linearly parameterized regressor form to
have
x 1Y(x, x ,
e + M i1B i e + M i1K i e = D
x)p x M i1Fext
(8)
x 1Yp x M i1Fext )
x = A x x B x (D
0n n
where A x =
1
M i K i
In
M i1B i
(9)
0 n n
2 n 2 n and B x =
2 n n . To
In
V (x, p ) =
1 T
1
x Px + p Tx p x
2
2
(10)
1
x 1Y ) T B Tx Px + p x ] + x T PB x M i1Fext
V = x T Qx p Tx [(D
2
(11)
In the free space tracking phase, the external force is zero, i.e. Fext = 0 , and the
selection of the update law
x 1Y ) T B Tx Px
p x = 1 (D
(12)
1
V = x T Qx 0 .
2
(13)
2n
1
V = x T Qx + x T PB x M i1Fext
2
(14)
Therefore, we may not conclude any stability property for the system states. A
possible modification to the controller (5) is to include an additional term as
x x + g x ) + J Ta D
x {
= J Ta (Fext + C
x d M i1[B i (x x d )
+ K i (x x d ) Fext ]} + J Ta 1
(15)
M i (
x
x d ) + B i (x x d ) + K i (x x d )
x C x )x + (g x g x )] Fext + M i D
x 1[(D
x D x )
x 11 (16)
= MiD
x + (C
1
(17)
x 1Yp x
x = A x x B x D
(18)
Select the same Lyapunov-like function candidate in (10), and then we may
have
1
x 1Y) T B Tx Px + p x ]
V = x T Qx p Tx [(D
2
(19)
134
With the selection of the update law in (12), equation (19) becomes (13).
Therefore, the system is stable for both the free space tracking and constrained
motion phases. However, it should be noted that, according to (16), the modified
controller (15) can only ensure convergence of the closed loop system to the
dynamics
M i (
x
x d ) + B i (x x d ) + K i ( x x d ) = 0
(20)
M i (
x i
x d ) + B i (x i x d ) + K i (x i x d ) = Fext
(1a)
M i
x i + B i x i + K i x i = M i
x d + B i x d + K i x d Fext
(1b)
D x s + C x s + g x + D x v + C x v = J a T Fext
(2)
x v + Fext K d s)
x v + C
= J Ta (g x + D
(3)
x v g x
x v C
D xs + C x s + K d s = D
(4)
Represent the right side of (4) into a linearly parameterized regressor form to
have
D xs + C x s + K d s = Y( x, x , v, v )p x
(5)
It is noted that the regressor matrix here is not a function of Cartesian space
accelerations. To find the update law, define the Lyapunov-like function
candidate as
1
1
V (x, p x ) = s T D xs + p Tx p x
2
2
(6)
V = s T K d s p Tx (p x + Y T s)
(7)
p x = 1Y T (x, x , v, v )s
(8)
V = s T K d s 0.
(9)
136
reference model in (1b), and hence the closed loop system will converge to the
target impedance in (5.2-3). This solves one of the difficulties encountered in
previous section.
Remark 2: To implement the control (3) and update law (8), we do not need the
information of Cartesian space accelerations and there is no singularity problem
in the inertia matrix estimation. However, the regressor matrix is still needed in
the update law.
x v + Fext K d s)
x v + C
= J Ta (g x + D
(1)
x v g x
x v C
D xs + C x s + K d s = D
(2)
x C x , and
x Dx , C
If we may design appropriate update laws such that D
g x g x , then (2) becomes
D xs + C x s + K d s = 0
(3)
D x = WDT x Z D x + D x
C x = WCT x Z C x + C x
(4)
g x = WgTx z g x + g x
n n
where WD x n D2 n , WC x n 2 C n and Wg x g
are weighting
n 1
matrices, ZDx n D n , Z C x n C n and z g x g are matrices of
2
x=W
DT Z D
D
x
x
x=W
CT Z C
C
x
x
(5)
gT z g
g x = W
x
x
D ,W
C and W
g are respectively the estimates of WD , WC and
where W
x
x
x
x
x
Wg x . With these representations, equation (2) becomes
DT Z D v W
CT Z C v W
gT z g + 1
D xs + C x s + K d s = W
x
x
x
x
x
x
(6)
() = W() W
() and 1 = 1 ( D , C , g , s, x i ) n is a lumped
where W
x
x
x
vector of approximation errors. Since W() are constant vectors, their update
laws can be easily found by proper selection of the Lyapunov-like function. Let
us consider a candidate
D ,W
C ,W
g ) = 1 sT D xs
V (s, W
x
x
x
2
1
DT Q D W
D +W
CT Q C W
C +W
gT Q g W
g )
+ Tr ( W
x
x
x
x
x
x
x
x
x
2
(7)
n n
g
where Q D x n D n D , Q C x n C n C and Q g x g
are positive
definite weighting matrices. The time derivative of V along the trajectory of (6)
can be computed as
2
DT ( Z D vs
T + Q Dx W
V = s T K d s + s T 1 Tr[ W
Dx )
x
x
T
CT (Z C vs T + Q C W
T
+W
C x ) + Wg x ( z g x s + Q g x Wg x )]
x
x
x
(8)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
C = Q C1 (Z C vs T + C W
C )
W
x
x
x
x
x
g )
Wg x = Q g1x ( z g x s T + g x W
x
(9)
138
DT W
D )
V = s T K d s + s T 1 + D x Tr ( W
x
x
T
T
+ C x Tr ( WC x WC x ) + g x Tr ( Wg x Wg x )
(10)
1
2
DT W
D )
V V + {[max (D x ) min (K d )] s + [max (Q D x ) D x ]Tr ( W
x
x
2
CT W
C ) + [max (Q g ) g ]Tr ( W
gT W
g )
+ [max (Q C x ) C x ]Tr ( W
x
x
x
x
x
x
2
1
+ [ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )]}
min (K d )
By selecting
min (K d )
,
max (D x ) max (Q D x ) max (Q C x ) max (Q g x )
min
Dx
Cx
gx
we may have
2
V V +
1
1
+ [ D x Tr ( WDT x WD x )
2 min (K d )
2
+ C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )]
(11)
sup 1 ( )
V>
t0
2min (K d )
1
[ D x Tr ( WDT x WD x )
2
+ C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )]
D ,W
C and W
g are uniformly ultimately
and we have proved that s, W
x
x
x
bounded. On the other hand, differential inequality (11) implies
V (t ) e (t t 0 )V (t 0 ) +
+
sup 1 ( )
1
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )]
2
1
2
DT W
D )
V [ min (D x ) s + min (Q D x )Tr ( W
x
x
2
CT W
C ) + min (Q g )Tr ( W
gT W
g )]
+ min (Q C x )Tr ( W
x
x
x
x
x
we may derive the bound for s as
2V (t 0 ) 2 (t t 0 )
1
e
+
sup 1 ( )
min (D x )
min (D x ) min (K d ) t0 < < t
s(t )
+
min (D x )
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
1
+ g x Tr ( WgTx Wg x )] 2
Remark 3: If a sufficient number of basis functions are employed in the
function approximation so that 1 0 , the -modification terms in (9) can be
T
n
n
eliminated and (10) becomes V = s K d s which implies s L L2 . It is
straightforward to prove s to be uniformly bounded, and hence convergence of
s can be concluded by Barbalats lemma.
Remark 4: Suppose 1 cannot be ignored and there exist a positive number
such that 1 for all t 0 , then, instead of (1), a new controller can be
constructed as
x v K d s + robust )
x v + C
= J Ta (Fext + g x + D
(12)
where robust is the robust term to be designed. Let us consider the Lyapunov
function candidate (7) and the update law (9) again but with () = 0 . The time
derivative of V can be computed as
V s T K d s + s + s T robust
By picking robust = [sgn( s1 ) sgn( s n )]T , where si, i =1,, n are the
i-th element of the vector s, we may have V 0 , and asymptotic convergence
of s can be concluded by Barbalats lemma.
140
D x ( x)
x + C x ( x, x ) x + g x ( x) = J aT Fext (5.2-2a)
Controller
Adaptive Law
Regressor-based
Regressor-free
xv
x v + C
= J Ta (g x + D
xv
x v + C
= J Ta (g x + D
+ Fext K d s)
+ Fext K d s)
(5.3-3)
(5.4-1)
p x = 1Y T ( x, x , v, v )s
(5.3-8)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
C = Q C1 (Z C vs T + C W
C )
W
x
x
x
x
x
1
T
Wg x = Q g x (z g x s + g x Wg x )
(5.4-9)
Realization
Example 5.1:
The 2-DOF planar robot (3.3-4) is considered in this example to verify the
efficacy of the strategy developed in this section by computer simulation. Actual
values of system parameters are the same as those in example 4.1, i.e.
m1 =m2 =0.5(kg), l1 =l2 =0.75(m), lc1 =lc2 =0.375(m), and I1 =I2 =0.0234(kg-m2). The
endpoint starts from x(0) = x i (0) = [0.8m 0.75m 0 0]T to track a 0.2m
radius circle centered at (0.8m, 1.0m) in 10 seconds without knowing its precise
model. The constraint surface is smooth and can be modeled as a linear spring
(Spong and Vidyasagr 1989) fext =kw(x-xw) where fext is the force acting on the
surface, kw =5000N/m is the environmental stiffness, x is the coordinate of the
end-point in the X direction, and xw =0.95m is the position of the surface.
Hence, the external force vector becomes Fext = [ f ext 0]T . Since the surface is
away from the desired initial endpoint position (0.8m, 0,8m), different phases of
operations can be observed. The controller in (1) is applied with the gain
matrices
50 0
20 0
, and =
Kd =
.
0 50
0 20
Parameter matrices in the target impedance are selected as
0
0.5 0
100 0
1500
, Bi =
and K i =
.
Mi =
1500
0 0.5
0 100
0
The 11-term Fourier series is selected as the basis function for the approximation
D and W
C are in 44 2 , and W
g
of entries in D x , C x and g x . Therefore, W
22 2
is in
. The initial weighting vectors for the entries are assigned to be
142
space nicely. Afterwards, the endpoint contacts with the constraint surface at
xw =0.95(m) compliantly. When entering the free space again, the endpoint
follows the desired trajectory with very small tracking error regardless of the
system uncertainties. Computation of the complex regressor is avoided in this
strategy which greatly simplifies the design and implementation of the control
law. Figure 5.2 presents the time history of the joint space tracking performance.
The transient states converge very fast without unwanted oscillations. The joint
space trajectory in the constraint motion phase is smooth. The control efforts to
the two joints are reasonable that can be verified in Figure 5.3. The external
forces exerted on the endpoint during the constraint motion phase are shown in
Figure 5.4. Figure 5.5 to 5.7 are the performance of function approximation.
Although most parameters do not converge to their actual values, they still
remain bounded as desired.
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
Figure 5.1 Robot endpoint tracking performance in the Cartesian space. After some transient
the endpoint converges to the desired trajectory in the free space nicely. Afterwards, the
endpoint contacts with the constraint surface compliantly. When entering the free space
again, the endpoint follows the desired trajectory with very small tracking error regardless
of the system uncertainties
0.8
0.6
0.4
0.2
5
Time(sec)
10
5
Time(sec)
10
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
Figure 5.2 The joint space tracking performance. The transient is very fast and the constraint
motion phase is smooth
20
0
20
40
60
80
5
Time(sec)
10
5
Time(sec)
10
10
control input 2 (N.m)
0
10
20
30
40
50
60
Figure 5.3 The control efforts for both joints are all reasonable
80
60
40
20
5
Time(sec)
10
5
Time(sec)
10
0.5
0.5
0.8
1.5
0.6
1
Dx(12)
Dx(11)
Figure 5.4 Time histories of the external forces in the Cartesian space
0.4
0.5
0.2
4
6
Time(sec)
10
0.5
1.5
4
6
Time(sec)
10
4
6
Time(sec)
10
3.5
3
2.5
Dx(22)
Dx(21)
144
0.5
2
1.5
1
0.5
0.5
4
6
Time(sec)
10
0.6
0.5
Cx(12)
Cx(11)
0.4
0.2
0
0.2
0.4
0.6
4
6
Time(sec)
10
0.5
4
6
Time(sec)
10
4
6
Time(sec)
10
1.5
2
Cx(22)
Cx(21)
1
0.5
0
1
0
0.5
1
1
1.5
4
6
Time(sec)
10
gx(1)
4
3
2
1
0
1
5
Time(sec)
10
5
Time(sec)
10
14
12
gx(2)
10
8
6
4
2
0
146
(1a)
Li + Ri + K bq = u
(1b)
In this section, we firstly consider the case when all parameters in (1) are known
and a controller is developed so that the closed loop system behaves like the
target impedance in (5.2-3). Then, a regressor-based adaptive controller is
designed for (1) under the assumption that the system parameters contain
uncertainties. Finally, we will derive a regressor-free adaptive controller for the
impedance control of system (1).
With the same definitions of s and v in (5.3-2), equation (1a) can be
represented in the Cartesian space as
D x s + C x s + g x + D x v + C x v = J a T Hi Fext
(2)
Suppose D x , C x and g x are known, and we may design a proper control law
such that motor armature current i follows the trajectory
i = H 1J Ta (g x + D x v + C x v K d s + Fext )
(3)
where K d is a positive definite matrix. Then the closed loop dynamics becomes
D xs + C x s + K d s = 0. With proper selection of Kd, we may have asymptotic
convergence of s which further implies convergence of the closed loop system
to the target impedance. To make the actual current i converge to the perfect
current in (3), let us select the control input in (1b) as
u = Li d + Ri + K bq K c e i
(4)
147
x v K d s + Fext )
x v + C
i d = H 1J Ta (g x + D
= H 1J Ta [ Y ( x, x , v, v )p x K d s + Fext ]
(5)
where quantities with hats are respectively the estimated values, and regressor
matrix Y is defined similarly to the one in (5.2-8). The output tracking loop
dynamics can be obtained from (2) and (5) as
x v g x + J a T H(i i d )
x v C
D xs + C x s + K d s = D
= Y(x, x , v, v )p x + J aT H(i i d )
(6)
d + Ri
+K
bq K c e i
u = Li
= p Ti K c e i
(7)
Le i + K ce i = p Ti
(8)
1
1
1
V (s, e i , p x , p i ) = s T D xs + e Ti Le i + p Tx p x + Tr (p Ti ip i )
2
2
2
(9)
148
V = s T K d s + s T J a T He i e Ti K ce i
p T (p + Y T s) Tr[p T ( p + e T )]
x
(10)
p x = 1Y T s
p i = i1eTi
(11)
V = [s T
s
e Ti ]Q 0
e i
(12)
J a T H
Kd
2
where Q =
is positive definite by proper selection
1
J T H
Kc
a
2
of Kd and Kc. Equation (12) implies that s and ei are uniformly bounded and
square integrable. It can also be proved that s and e i are uniformly bounded.
Hence, asymptotic convergence in the current tracking loop and output tracking
loop can be concluded from Barbalats lemma.
Remark 6: Realization of controller (7) and update laws in (11) require the time
ext and Y
. Availability
derivative of id in (5) which implies the needs for x , F
of all of these quantities is impractical in general; therefore, some more feasible
controllers are to be developed.
149
Consider the robot model in (2) and (1b) again. Since most robot
parameters are unavailable, controller (4) and perfect current trajectory (3) are
not realizable. The perfect current trajectory can be modified similar to (5) as
x v K d s)
x v + C
i d = H 1J Ta (Fext + g x + D
(13)
Substituting (13) into (2), we may obtain the output tracking loop dynamics
x v g x + J aT H(i i d )
x v C
D xs + C x s + K d s = D
(14)
If controller u can be designed such that i i d , and there are some update
x C x and g x g x , then (14) reduces to
x Dx , C
laws to have D
D xs + C x s + K d s = 0, and convergence of the system dynamics to the target
impedance can be obtained. According to (7), let us select the control input in
(1b) as
u = f K c e i
(15)
Le i + K ce i = f f
(16)
D x = WDT x Z D x + D x
(17a)
C x = WCT x Z C x + C x
(17b)
g x = WgTx z g x + g x
(17c)
f = WfT z f + f
(17d)
n n
n n
where WD x n D n , WC x 2 n C n , Wg x 2 g
and Wf f
n 1
n D n
n C n
are weighting matrices, ZDx
, Z C x
, z g x g
and
n f 1
z f
are matrices of basis functions, and () are approximation error
2
150
matrices. The number () represents the number of basis functions used. Using
the same set of basis functions, the corresponding estimates can also be
represented as
x=W
DT Z D
D
x
x
(17e)
x=W
CT Z C
C
x
x
(17f)
gT z g
g x = W
x
x
(17g)
fT z f
f = W
(17h)
( ) = W
() W() , then equation (14) and (16) become
Define W
DT Z D v
D xs + C x s + K d s = J aT H (i i d ) W
x
x
T
T
WC Z C v Wg z g + 1
(18a)
fT z f + 2
Le i + K c e i = W
(18b)
D ,W
C ,W
g ,W
f ) = 1 s T D xs + 1 eTi Le i + 1 Tr ( W
DT Q D W
D
V (s, e i , W
x
x
x
x
x
x
2
2
2
CT Q C W
C +W
gT Q g W
g +W
fT Q f W
f)
+W
(19)
x
x
x
x
x
x
2
n n
g
The matrices Q D x n D n D , Q C x n C n C , Q g x g
and
n f n f
Q f
are all positive definite. The time derivative of V along the
trajectory of (18) can be computed as
V = s T K d s + s T J aT He i e Ti K c e i + s T 1 + e Ti 2
DT (Z D vs
D ) + W
CT (Z C vs T + Q C W
C )
T + Q Dx W
Tr[ W
x
x
x
x
x
x
x
T
T
T
T
g (z g s + Q g W
g )+W
f (z f e i + Q f W
f )]
+W
x
x
x
x
(20)
151
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
(21a)
1
T
W
C x = Q C x ( Z C x vs + C x WC x )
(21b)
1
T
W
g x = Q g x ( z g x s + g x Wg x )
(21c)
1
T
W
f = Q f ( z f e i + f Wf )
(21d)
s
1
DT W
D )
e Ti ]Q + [s T e Ti ] + D x Tr ( W
x
x
e
i
2
CT W
gT W
fT W
C ) + g Tr ( W
g ) + f Tr ( W
f)
+ C x Tr ( W
x
x
x
x
x
V = [s T
(22)
J a T H
Kd
2
where Q =
is positive definite by proper selection of
1 J T H
Kc
2 a
s
1
T
e + 2 [ max (Q D x )Tr ( WD x WD x )
i
CT W
C ) + max (Q g )Tr ( W
gT W
g )
+ max (Q C x )Tr ( W
x
x
x
x
x
T
+ max (Q f )Tr ( Wf Wf )]
1
V max ( A )
2
D x
0
where A =
0
, and the inequalities
L
[s T
s
e Ti ]Q + [s T
e i
1
min (Q)
2
1
e Ti ]
2
2
s
1
e (Q)
min
i
1
2
152
(T) W
() ) 1 Tr ( W(T) W() ) 1 Tr ( W
(T) W
( ) )
Tr ( W
2
2
then we may rewrite (22) into
2
s
1
1
DT W
D )
V V + [max ( A ) min (Q)] + {[max (Q D x ) D x ]Tr ( W
x
x
2
2
e i
CT W
C ) + [max (Q g ) g ]Tr ( W
gT W
g )
+[max (Q C ) C ]Tr ( W
x
fT W
f )} +
+[max (Q f ) f ]Tr ( W
1
1
+ { D x Tr ( WDT x WD x )
2 min (Q) 2
2
1
min (Q)
.
max ( A) max (Q D x ) max (Q C x ) max (Q g x ) max (Q f )
min
Dx
Cx
gx
V V +
1
1
+ [ D x Tr ( WDT x WD x )
2 min (Q ) 2
2
1
(23)
1 ( )
1
V>
sup
+
[ D x Tr ( WDT x WD x )
2min (Q) t 0 2 ( )
2
1
(24)
D , W
C , W
g and W
f are uniformly
Hence, we have proved that s, e i , W
x
x
x
ultimately bounded. On the other hand, (23) also implies
V (t ) e (t t 0 )V (t 0 ) +
+
1 ( )
sup
2min (Q) t 0 < < t 2 ( )
1
153
1
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
2
+ g x Tr ( WgTx Wg x ) + f Tr ( WfT Wf )]
(25)
s
1
1
DT W
D )
V min (A) + [min (QDx )Tr (W
x
x
2
2
ei
CT W
C ) + min (Qg )Tr(W
gT W
g ) + min (Qf )Tr(W
fT W
f )]
+min (QCx )Tr(W
x
x
x
x
x
s
e i
2V
. Together with (25), we have the bound for
min ( A)
This implies
the error signal vector as
s
e
i
1 ( )
2V (t 0 ) 2 (t t 0 )
1
e
+
sup
min ( A )
2min ( A) min (Q) t 0 < < t 2 ( )
1
2min ( A )
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
1
+ g x Tr ( WgTx Wg x ) + f Tr ( WfT Wf )] 2
Remark 7: If the number of basis functions is chosen to be sufficiently large
such that 1 0 and 2 0 , then (22) becomes
V = [s T
s
e Ti ]Q 0
e i
This implies that s and ei are uniformly bounded and square integrable. It is also
easy to prove that s and e i are uniformly bounded; as a result, asymptotic
154
x v K d s + robust 1 )
x v + C
i d = H 1J Ta (Fext + g x + D
u = f K c e i + robust 2
where robust 1 and robust 2 are robust terms to be designed. Let us consider
the Lyapunov-like function candidate (19) and the update law (21) without
-modification again. The time derivative of V can be computed as
V = [s T
s
e Ti ] Q + 1 s + 2 e i + s T robust 1 + e Ti robust 2
e i
155
D x s + C x s + g x + D x v + C x v = J a T Hi Fext
Li + Ri + K bq = u
(5.5-1), (5.5-2)
Controller
Regressor-based
Regressor-free
xv
x v + C
i d = H 1J Ta (g x + D
K d s + Fext )
x v
i d = H 1J Ta (Fext + g x + D
x v K d s)
+C
= H 1J Ta [ Y ( x, x , v , v )p x
K d s + Fext ]
u = f K ce i
(5.5-13), (5.5-15)
d + Ri
+K
b q K c e i
u = Li
= p Ti K c e i
(5.5-5), (5.5-7)
Adaptive Law
p x = 1Y T s
p = 1e T
i
(5.5-11)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
1
T
WC x = Q C x (Z C x vs + C x WC x )
1
T
W
g x = Q g x ( z g x s + g x Wg x )
1
T
W
f = Q f ( z f e i + f Wf )
(5.5-21)
Realization Issue
Example 5.2:
Consider the same 2-DOF planar robot in example 5.1 with the inclusion of
the actuator dynamics, and we would like to verify the controller developed in this
section by computer simulations. Actual values of link parameters are selected as
m1 =m2 =0.5(kg), l1 =l2 =0.75(m), lc1 =lc2 =0.375(m), and I1 =I2 =0.0234(kg-m2).
Parameters related to the actuator dynamics are the same with those used in
chapter 4 and are given with h1 =h2 =10(N-m/A), L1 = L2 =0.025(H), r1 =r2 =1(),
and kb1 =kb2 =1(Vol/rad/sec). In order to observe the effect of the actuator
dynamics, the endpoint is required to track a 0.2m radius circle centered at
156
(0.8m, 1.0m) in 2 seconds which is much faster than the case in example
5.1. The initial conditions of the generalized coordinate vector is
q(0) = [0.0022 1.5019 0 0]T , i.e., the endpoint is still at (0.8m, 0,75m)
initially. The controller gain matrices are selected as
50 0
20 0
100 0
, =
and K c =
Kd =
.
0 50
0 20
0 100
The initial value for the desired current can be found by calculation as
0
0.5 0
100 0
1500
, Bi =
and K i =
.
Mi =
1500
0 0.5
0 100
0
The 11-term Fourier series is selected as the basis function for the
f are
D and W
C are in 44 2 , while W
g and W
approximation so that W
x
x
x
22 2
. The initial weighting vectors for the entries are assigned to be
in
157
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
2
0
2
4
6
8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
158
0
1
2
3
4
5
6
8
6
4
2
0
2
4
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
4
2
0
2
4
6
80
60
40
20
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.5
0.5
159
0.8
1.5
0.6
Dx(12)
Dx(11)
0.4
0.5
0.2
0.5
1
Time(sec)
1.5
0.5
1.5
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
3.5
3
2.5
Dx(22)
Dx(21)
0.5
2
1.5
1
0.5
0.5
0.5
1
Time(sec)
1.5
Cx(12)
Cx(11)
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
10
4
5
Cx(22)
Cx(21)
160
2
5
4
6
0.5
1
Time(sec)
1.5
10
gx(1)
4
3
2
1
0
1
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
14
12
gx(2)
10
8
6
4
2
0
f(1)
3
2
1
0
1
2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
3
2
f(2)
1
0
1
2
3
4
161
162
5.6 Conclusions
Compliant interaction between the robot and environment is very important
in the industrial applications. In this chapter, we consider the adaptive
impedance control of rigid robots where the impedance control enables the robot
to have good performance in the free space tracking phase and to behave like the
target impedance in the constrained motion phase. In Section 5.2, a regressorbased impedance controller is derived, but the closed loop system may not
converge to the target impedance even when all parameters converge to their
actual values. In Section 5.3, an adaptive impedance control is constructed by
following the design introduced in Section 4.3. However, implementation of this
controller requires the knowledge of not only the regressor matrix and its time
derivative, but also the joint accelerations and time derivative of the external
force. Therefore, it is not feasible for practical applications. A regressor-free
adaptive impedance controller is thus designed in Section 5.4 which does not
need the availability of the additional information required in the previous
section. Finally, the actuator dynamics is considered in Section 5.5. Simulation
cases show that the robot can be operated at a much higher speed with good
performance for both the free space tracking and compliant motion control.
Chapter 6
6.1 Introduction
Most controllers for industrial robots are designed based on the rigid robot
assumption. Consideration of the joint flexibility in the controller design is one
of the approaches to increase the control performance. For a robot with n links,
we need to use 2n generalized coordinates to describe its whole dynamic
behavior when taking the joint flexibility into account. Therefore, the modeling
of the flexible joint robot is far more complex than that of the rigid robot. Since
the mathematical model is only an approximation of the real system, the
simplified representation of the system behavior will contain model inaccuracies
such as parametric uncertainties, unmodeled dynamics and external
disturbances. Because these inaccuracies may degrade the performance of the
closed-loop system, any practical design should consider their effects. The
inherent highly nonlinear coupling and model inaccuracies make the controller
design for a flexible joint robot extremely difficult. Spong (1989, 1995)
proposed one of the first adaptive controllers for flexible joint robots based on
the singular perturbation formulation of the robot dynamics. A simple composite
controller was designed such that the joint elastic forces were stabilized by a fast
feedback control and link variables were controlled by a slow control law. Ge
(1996) suggested a robust adaptive controller based on a new singular
perturbation model where the motor tracking error was modeled as the fast
variables instead of the joint elastic forces. This innovative approach leads to a
controller more robust to the case of load sensor failure, gives a new insight into
the control design problem of flexible joint robots and presents an alternative
singular perturbation model for controller design. Ott et. al. (2000) verified a
singular perturbation based adaptive controller for flexible joint robots
experimentally. The tracking quality was improved significantly by the use of
the adaptive control law compared to the non-adaptive one. Dixon et. al. (1999)
proposed an adaptive partial state feedback controller for flexible joint robots
163
164
based on the backstepping design with the knowledge of the regression matrix.
A backstepping design based output feedback adaptive controller for flexible
joint robot was suggested in Yim (2001). Kozlowski and Sauer (1999a, 1999b)
suggested an adaptive controller to have semi-global convergence to an
arbitrarily small neighborhood of the equilibrium point in the presence of
bounded disturbances. Like other adaptive strategies, the uncertain parameters
are required to be time-invariant and capable of being collected to form a
parameter vector. Huang and Chen (2004a) proposed an adaptive backsteppinglike controller based on FAT for single-link flexible-joint robots with
mismatched uncertainties. Similar to most backstepping designs, the derivation
is too complex to robots with more joints. Chien and Huang (2006b) suggested a
FAT-based adaptive controller for general flexible-joint robots without requiring
the computation of the regressor matrix. Chien and Huang (2006a) designed an
adaptive impedance controller for the flexible-joint robots to give good
performance in both the free space tracking and compliant motion phase. Chien
and Huang (2007a) included the actuator into consideration in the design of an
adaptive controller for flexible-joint robots.
In this chapter, we would like to study the FAT-based adaptive controller
designs for n-link flexible-joint robots. The tedious computation of the regressor
matrix is avoided. This chapter is organized as following: in Section 6.2, we
consider the control of a known flexible-joint robot. Section 6.3 derives the
regressor-based adaptive controller and Section 6.4 presents regressor-free
adaptive controller. Section 6.5 considers the actuator dynamics.
+ C(q, q )q + g (q) = K ( q)
D(q)q
(1a)
+ B + K ( q) = a
J
(1b)
+ Cq + g = t
Dq
(2a)
)
J t
t + B t t + t = a q(q , q
(2b)
) = Jq
+ Bq . Since (2) is in a
where J t = JK , B t = BK , and q (q , q
cascade form connected by the torque t, a backstepping-like design procedure
can be employed by regarding t as a control signal to (2a), and a desired
trajectory td is designed for the convergence of q. If a proper a can
be constructed such that t td , then we may have q q d . Define the
tracking error e = q q d and the error signals s = e + e and v = q d e .
Equation (2a) becomes
Ds + Cs + g + Dv + Cv = t
(3)
Suppose that all parameters in the system model are available, then the desired
torque td can be defined as
td = g + Dv + Cv K d s
(4)
With this desired torque, (3) gives the output tracking dynamics
Ds + Cs + K d s = t td
(5a)
Ds + Cs + K d s = 0
(5b)
and convergence of the output error follows. To this end, we would like to
employ the model reference control (MRC) rule below. Let us consider a
reference model
J r
r + B r r + K r r = J r td + B r td + K r td
nn
(6)
nn
, and
where r n is the state vector, and matrices J r , B r
nn
K r
are selected to give proper dynamics for the convergence of r to
td . Define td ( td ,
td ) = K r 1 (B r td + J r
td ) , and rewrite (2b) and (6) into
the state space form
x p = A p x p + B p a B p q
(7a)
x m = A m x m + B m ( td + td )
(7b)
166
where x p = [ Tt Tt ]T 2n and x m = [ Tr
vectors. A p =
1
J t
are
augmented
Inn
I nn
0
2n 2n
2n 2n
and A m = 1
1
1
J t B t
J r K r J r B r
0
system matrices, and B p = 1 2 n n
and
J t
0
B m = 1 2 n n are augmented input gain matrices. Let t = C p x p
J r K r
and r = C m x m be respectively the output signal vector for (7a) and (7b),
where C p = C m = [I n n 0] n 2 n are augmented output signal matrices. It
is noted that ( A m , B m ) is controllable, ( A m , C m ) is observable, and the
transfer function C m ( sI A m ) 1 B m is strictly positive real. According to the
MRC rule, the control torque a is selected as
a = x p + td + h( td , q)
(8)
where
x p = A m x p + B m ( td + td )
(9)
e m = A me m
(10a)
e = C m e m
(10b)
To prove stability in the output tracking loop (5a) and the torque tracking loop
(10), a Lyapunov-like function candidate is designed as
1
V (s, e m ) = s T Ds + e Tm Pt e m
2
2n 2n
(11)
where Pt = Pt
is a positive definite matrix satisfying the Lyapunov
T
T
equation A m Pt + Pt A m = C mC m . Along the trajectory (5a) and (10), the time
derivative of V is computed as
T
V = [s T
s
eT ]Q 0
e
(12)
Kd
where Q =
1 I
2 n n
I nn
2
is positive definite by proper selection of Kd.
I nn
Ds + Cs + g + Dv + Cv = t
(1a)
)
J t
t + B t t + t = a q(q , q
(1b)
K ds
+ Cv
td = g + Dv
= Y (q, q , v, v )p K d s
(2)
The dynamics of the output tracking loop can then be obtained by plugging (2)
into (1a) as
g + ( t td )
Cv
Ds + Cs + K d s = Dv
= Y (q, q , v, v )p + ( t td )
(3)
168
1
1
V (s, e m , p ) = s T Ds + e Tm Pt e m + p T p
2
2
T
(4)
2n 2n
V = s T K d s + s T e eT e p T (p + Y T s)
(5)
p = 1Y T s
(6)
Hence, (5) becomes exactly (6.2-12), and all stability properties are the same
there.
Remark 2: To implement the strategy designed in this section, we do not need
the knowledge of D, C and g. However, it still needs the feedback of joint
accelerations and their higher order time derivatives. In addition, computation of
the regressor matrix and its time derivatives are necessary here.
Ds + Cs + g + Dv + Cv = t
(1a)
)
J t
t + B t t + t = a q(q , q
(1b)
and g(q) are not available and their variation bounds are
like to design a desired transmission torque td so that a
a can be constructed to have convergence in the torque
td . Since D(q), C(q, q ) and g(q) are not available,
169
the traditional adaptive control and robust control are not easy to be applied
here. In the following, we would like to use the function approximation
technique to design an adaptive controller for the rigid-link flexible-joint robot
without the knowledge of the regressor matrix. Since the adaptive control of
flexible-joint robots is much more difficult than that for its rigid-joint
counterpart, avoidance of the regressor computation in the FAT-based design
largely simplifies the implementation in the real-time environment.
The desired transmission torque d can be designed as
K ds
+ Cv
td = g + Dv
(2)
(3)
=CC
, and g = g g . If a proper controller a and
= DD
, C
where D
(4)
x p = A p x p + B p a B p q
(5a)
x m = A m x m + B m ( td + td )
(5b)
a = x p + td + h
(6)
170
x p = A m x p + B m ( td + td ) + B p (h h)
(7)
e m = A me m + B p (h h)
(8a)
e = C m e m
(8b)
If we may design an appropriate update law such that h h , then (8) implies
e m 0 as t . To proceed further, let us apply the function approximation
representation
D = WDT Z D + D
C = WCT Z C + C
(9)
g = WgT z g + g
h = WhT z h + h
n n
n n
=W
DT Z D
D
=W
CT Z C
C
gT z g
g = W
(10)
hT z h
h = W
() = W() W
() , then the output error tracking dynamics (3) and the
Define W
torque tracking dynamics (8a) become
171
DT Z D v
Ds + Cs + K d s = ( t td ) W
CT Z C v W
gT z g + 1
W
hT z h + B p 2
e m = A m e m B p W
(11a)
(11b)
where Pt = Pt
is a positive definite matrix satisfying the Lyapunov
2
2
T
T
equation A m Pt + Pt A m = C mC m . The matrices Q D n D n D ,
2
2
Q C n C n C , Q g n g n g and Q h n h n h are positive definite.
The time derivative of V along the trajectory of (11) can be computed as
T
V = s T K d s + s T e eT e + s T 1 + e Tm Pt B p 2
T
DT ( Z D vs
T
T + QDW
Tr[ W
D ) + WC ( Z C vs + Q C WC )]
T
gT ( z g s T + Q g W
T
Tr[ W
g ) + Wh ( z h e m Pt B p + Q h Wh )]
(13)
1
D)
T + DW
W
D = Q D ( Z D vs
(14a)
1
T
W
C = Q C ( Z C vs + C WC )
(14b)
1
T
W
g = Q g ( z g s + g Wg )
(14c)
1
T
W
h = Q h ( z h e + h Wh )
(14d)
172
s
1
DT W
D)
eT ]Q + [s T eT ] + DTr ( W
e
T
T
hT W
h)
+ CTr ( WC WC ) + gTr ( Wg Wg ) + hTr ( W
V = [s T
Kd
where Q =
1 I
2 n n
gain matrix Kd.
(15)
I nn
2
is positive definite due to the selection of
I nn
V = [s T
s
eT ] Q 0
e
Therefore it implies that that e and s are uniformly bounded and square
integrable. Furthermore, e and s can be shown to be uniformly bounded; as a
result, asymptotic convergence of e and s can easily be concluded by
Barbalats lemma. This further implies that d and q q d as t
even though D, C, g and h are all unknown.
Remark 4: Suppose 1 and 2 cannot be ignored but their variation bounds are
available i.e. there exists positive constants 1 , 2 > 0 such that 1 1 and
2 2 . To cover the effect of these bounded approximation errors, the
desired transmission torque (2) and actuator input (6) are modified to be
+ g K d s + robust 1
+ Cv
td = Dv
a = x p + td + h + robust 2
where robust 1 and robust 2 are robust terms to be designed. Let us consider the
Lyapunov-like function candidate (12) and update laws (14) without the
-modification terms again. The time derivative of V can be computed as
V [s T
173
s
eT ]Q + 1 s + 2 e + s T robust 1 + eT robust 2
e
define A =
2CTm Pt C m
1
V V + [max ( A ) min (Q)]
2
s
1
e + 2 (Q)
min
1
2
1
DT W
D ) + [max (Q C )
+ {[max (Q D ) D ]Tr ( W
2
CT W
C ) + [max (Q g ) g ]Tr ( W
gT W
g)
C ]Tr ( W
hT W
h )} + 1 [ DTr ( WDT WD )
+[max (Q h ) h ]Tr ( W
2
T
T
+ CTr ( WC WC ) + gTr ( Wg Wg ) + hTr ( WhT Wh )]
(16)
min (Q)
min
so that we have
V V +
1
1
+ [ DTr ( WDT WD )
2 min (Q ) 2
2
1
(17)
174
1 ( )
1
V>
sup
[ DTr ( WDT WD )
+
2min (Q) t 0 2 ( )
2
1
(18)
D,W
C,W
g , and W
h are uniformly ultimately
This verifies that s, e , W
bounded. Differential inequality (17) can be solved to have the upper bound for
V(t) as
V (t ) e
1 ( )
V (t 0 ) +
sup
2min (Q) t 0 < < t 2 ( )
1
( t t 0 )
1
[ DTr ( WDT WD ) + CTr ( WCT WC )
2
+ gTr ( WgT Wg ) + hTr ( WhT Wh )]
+
(19)
s
1
From (12), we may estimate the lower bound for V as V min ( A )
2
e
s
e
2V (t )
min ( A)
Together with (19), we may have the bound for the error signals as
s
e
+
2V (t 0 ) 2 (t t0 )
1
e
+
sup
min ( A)
min ( A) min (Q) t0 < < t
1
min ( A)
1 ( )
( )
2
175
Remark 5: To implement the desired transmission torque (2), actuator input (6)
and update law (14), we do not need the regressor matrix, the knowledge of joint
accelerations, or their derivatives. Therefore, it is feasible for realization.
Table 6.1 summarizes the adaptive control laws derived in this section
based on their controller forms, update laws and implementation issues.
Table 6.1 Summary of the adaptive control for FJR
Flexible-Joint Robot
Ds + Cs + g + Dv + Cv = t
)
J t
t + B t t + t = a q (q , q
Regressor-based
K ds
+ Cv
td = g + Dv
= Y (q, q , v, v )p K d s
a = x p + td + h ( td , q )
Controller
(6.3-1)
Regressor-free
Kds
+ Cv
td = g + Dv
a = x p + td + h
(6.4-2), (6.4-6)
(6.3-2), (6.2-8)
p = 1Y T s
Adaptive Law
(6.3-6)
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
W
C = Q C ( Z C vs + C WC )
1
T
W
g = Q g ( z g s + g Wg )
1
T
W
h = Q h ( z h e + h Wh )
(6.4-14)
Realization
Issue
Example 6.1:
Consider the flexible-joint robot in (3.6-3), and we are going to verify the
regressor-free adaptive control strategy developed in this section by computer
simulations. Actual values of link parameters are selected as m1 =m2 =0.5(kg),
l1 =l2 =0.75(m), lc1 =lc2 =0.375(m), I1 =I2 =0.0234(kg-m2), and k1 =k2 =100(N-m/rad).
Parameters for the actuator part are chosen as j1 =0.02(kg-m2), j2 =0.01(kg-m2),
b1 =5(N-m-sec/rad), and b2 =4(N-m-sec/rad)(Chien and Huang 2007a). We
would like the endpoint to track a 0.2m radius circle centered at (0.8m, 1.0m) in
176
10 seconds without knowing its precise model. The initial condition for the
generalized coordinate is at q(0) = (0) = [0.0022 1.5019 0 0]T , i.e., the
endpoint is initially at (0.8m, 0,75m). It is away form the desired initial endpoint
position (0.8m, 0,8m) for observation of the transient. The initial state for the
reference model is r (0) = [1.8 2.8 0 0]T , which is the same as the
initial state for the desired torque. The controller in (6) is applied with the gain
matrices
5 0
5 0
, and =
Kd =
.
0 5
0 5
The 11-term Fourier series is selected as the basis function for the
D and W
C are in 44 2 , while W
h are
g and W
approximation. Therefore, W
22 2
in
. The initial weighting vectors for the entries are assigned to be
177
The simulation results are shown in Figure 6.1 to 6.8. Figure 6.1 shows the
tracking performance of the robot endpoint and its desired trajectory in the
Cartesian space. It is observed that the endpoint trajectory converges nicely to
the desired trajectory, although the initial position error is quite large. After the
transient state, the tracking error is small regardless of the time-varying
uncertainties in D, C and g. Computation of the complex regressor is avoided in
this strategy which greatly simplifies the design and implementation of the
control law. Figure 6.2 presents the time history of the joint space tracking
performance. The transient states converge very fast and the tracking errors are
small. The control efforts to the two joints are reasonable that can be verified in
Figure 6.3. The control torque for both joints can be seen in Figure 6.4.
Figure 6.5 to 6.8 are the performance of function approximation. Although most
parameters do not converge to their actual values, they still remain bounded as
desired.
1.2
1.15
1.1
1.05
1
0.95
0.9
0.85
0.8
0.75
0.7
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
0.6
0.4
0.2
0
0.2
5
Time(sec)
10
5
Time(sec)
10
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
Figure 6.2 Joint space tracking performance. It can be seen that the transient is fast,
and the tracking error is very small
10
8
6
4
2
0
5
Time(sec)
10
5
Time(sec)
10
1
torque output 2 (N.m)
178
0
1
2
3
4
Figure 6.3 Torque tracking performance. It can be seen that the torque errors for
both joints are small
80
60
40
20
5
Time(sec)
10
5
Time(sec)
10
10
0
10
20
30
40
50
60
1.4
0.6
1.2
0.4
0.2
0.8
D(12)
D(11)
Figure 6.4 The control torques for both joints are reasonable
0.6
0
0.2
0.4
0.4
0.2
0.6
4
6
Time(sec)
10
0.8
2.5
1.4
1.2
D(22)
D(21)
4
6
Time(sec)
10
4
6
Time(sec)
10
1.5
1
0.5
0.8
0.6
0.4
0
0.5
0.2
0
4
6
Time(sec)
10
179
0.2
0.15
0.6
0.1
C(12)
C(11)
0.4
0.2
0.05
0
0.05
0
0.2
0.1
0
4
6
Time(sec)
0.15
10
0.2
4
6
Time(sec)
10
4
6
Time(sec)
10
0.25
0.2
0
0.2
C(22)
C(21)
0.15
0.4
0.1
0.05
0
0.6
0.8
0.05
0
4
6
Time(sec)
10
0.1
g(1)
5
Time(sec)
10
5
Time(sec)
10
g(2)
180
181
10
8
h(1)
6
4
2
0
2
5
Time(sec)
10
5
Time(sec)
10
5
0
h(2)
5
10
15
20
Ds + Cs + g + Dv + Cv = t
(1a)
)
J t
t + B t t + t = Hi q(q , q
(1b)
Li + Ri + K b q = u
(1c)
This system is in a cascade form with the configuration shown in Figure 6.9.
u
i
Equation (1c)
Equation (1b)
s
Equation (1a)
182
td = g + Dv + Cv K d s
(2)
Ds + Cs + K d s = t td
(3)
To ensure torque tracking in (1b), the MRC rule is applied with the reference
model
J r
r + B r r + K r r = J r td + B r td + K r td
(4)
nn
x p = A p x p + B p Hi B p q
(5a)
x m = A m x m + B m ( td + td )
(5b)
where x p = [ Tt Tt ]T 2n and x m = [ Tr
vectors. A p =
1
J t
are
augmented
I nn
I nn
0
2 n 2n
2n 2n
and A m = 1
1
1
J t B t
J
K
J
B
r
r
r r
0
system matrices, and B p = 1 2 n n
and
J t
0
B m = 1 2 n n are augmented input gain matrices. Let t = C p x p
J r K r
and r = C m x m be respectively the output signal vector for (5a) and (5b),
where C p = C m = [I n n 0] n 2 n are augmented output signal matrices.
The pair ( A m , B m ) is controllable, ( A m , C m ) is observable, and the transfer
183
i d = H 1[x p + td + h( td , q )]
(6)
x p = A m x p + B m ( d + td ) + B p H(i i d )
(7)
e m = A me m + B p H (i i d )
(8a)
e = C m e m
(8b)
u = Li d + Ri + K b q K c e i
(9)
Le i + K c e i = 0
(10)
At this stage, we have the output error dynamics in (3), the dynamics of the
torque tracking loop in (8) and the dynamics of the current tracking loop in (10).
We have to ensure that all of these dynamics be stable. To this end, let us
consider a Lyapunov-like function candidate
1
1
V (s, e m , e i ) = s T Ds + eTm Pt e m + eTi Le i
2
2
2n 2n
(11)
where Pt = Pt
is a positive definite matrix satisfying the Lyapunov
T
T
equation A m Pt + Pt A m = C mC m . Along the trajectories of (3), (8) and (10),
the time derivative of V can be computed as
T
184
1
2C)s + s T e
V = s T K d s + s T (D
2
eT e + eTm Pt B p He i eTi K c e i
(12)
V = [s
eTi
s
] Q e 0
e i
(13)
I nn
0
Kd
2
1
1
I nn
H is positive definite due to proper
where Q = I n n
2
2
1
0
H
Kc
2
K ds
+ Cv
td = g + Dv
= Y (q, q , v, v )p K d s
(14)
185
(15)
=CC
, g = g g , and p = p p . Therefore, if we
= DD
, C
where D
may find a control law to drive t td and an update law to have p p ,
then (15) implies convergence of the output error. To this end, we would like to
use the MRC rule with the reference model in (4) and the state space
representation in (5). The desired current trajectory is designed as the one in (6)
to have the dynamics for the torque tracking loop as in (8). Instead of the
controller in (9), we use
d + Ri
+K
bq K c e i
u = Li
= p Ti K c e i
(16)
T R
T K
Tb ]T 3n n , and we
where = [i Td i T q T ]T 3n , p i = [L
may have the dynamics for the current tracking loop as
Le i + K ce i = p Ti
(17)
1 T
(s Ds + 2eTm Pt e m + eTi Le i
2
+p T p ) + Tr (p Ti i p i )
V (s, e m , e i , p , p i ) =
where Pt = Pt
T
2n 2n
(18)
V = s T K d s + s T e eT e + eTm Pt B p He i e Ti K c e i
p T (p + Y T s) Tr[p T ( p + e T )]
i
(19)
186
p = 1Y T s
(20a)
p i = i1e Ti
(20b)
Thus, (19) becomes (13); therefore, we have proved that s, e and e i are
uniformly bounded, and their square integrability can also be proved from (13).
Furthermore, uniformly boundedness of s , e , and e i are also easy to be
proved, and hence, q q d , t td , and i i d follow by Barbalats lemma.
Remark 7: To implement the controller strategy, we do not need to have the
and calculate
knowledge of most system parameters, but we have to feedback q
the regressor matrix and their higher order derivatives. Therefore, the design
introduced in this section is not feasible for practical applications, either.
K ds
+ Cv
td = g + Dv
(21)
The dynamics for the output tracking loop can thus be written as
g + ( t td )
Cv
Ds + Cs + K d s = Dv
(22)
i d = H 1[x p + td + h ]
(23)
187
e m = A me m + B p H (i i d ) + B p (h h)
(24a)
e = C m e m
(24b)
u = f K c e i
(25)
Le i + K ce i = f f
(26)
D = WDT Z D + D
(27a)
C = WCT Z C + C
(27b)
g = WgT z g + g
(27c)
h = WhT z h + h
(27d)
f = WfT z f + f
(27e)
n n
n n
where WD n D n , WC n C n , Wg g , Wh h , and
n n
Wf 2 f are weighting
matrices for D, C, g, h, and f, respectively, while
2
n 1
n 1
n 1
n D n
ZD
, Z C n C n , z g g , z h h , and z f f
are basis function matrices. Likewise, we have the representations for the
estimates as
2
=W
DT Z D
D
(27f)
=W
CT Z C
C
(27g)
188
gT z g
g = W
(27h)
hT z h
h = W
(27i)
fT z f
f = W
(27j)
Thus the output error dynamics (22), torque tracking error dynamics (24a), and
current tracking error dynamics (26) can be rewritten as
DT Z D v W
CT Z C v W
gT z g + 1 (28a)
Ds + Cs + K d s = ( t td ) W
hT z h + B p He i + B p 2
e m = A me m B p W
(28b)
fT z f + 3
Le i + K ce i = W
(28c)
d ) , 2 = 2 ( h , e m ) , and 3 = 3 ( f , e i ) are
where 1 = 1 ( D , C , g , s, q
lumped approximation error vectors. Define the Lyapunov-like function
candidate as
D, W
C, W
g, W
h, W
f ) = 1 sT Ds + eTmPt e m
V (s, e m , ei , W
2
1
1
DT QDW
D +W
CT QCW
C
+ eTi Lei + Tr(W
2
2
gT Qg W
g +W
hT Qh W
h +W
fT Qf W
f)
+W
(29)
n n
g
where matrices Q D n D n D , Q C n C n C , Q g g
,
n
f
Q h n h n h , and Q f f
are positive definite. Along the trajectory
of (28), we may compute the time derivative of V as
V = s T K d s + s T e eT e + eTm Pt B p He i eTi K c e i + s T 1
DT (Z D vs
T + Q DW
+e Tm Pt B p 2 + e Ti 3 Tr[ W
D)
CT (Z C vs T + Q C W
C )] Tr[ W
gT (z g s T + Q g W
g )
+W
hT (z he Tm Pt B p + Q h W
h )] Tr[ W
fT (z f e Ti + Q f W
f )]
+W
(30)
189
1
D)
T + DW
W
D = Q D ( Z D vs
(31a)
1
T
W
C = Q C ( Z C vs + C WC )
(31b)
1
T
W
g = Q g ( z g s + g Wg )
(31c)
1
T
W
h = Q h ( z h e + h Wh )
(31d)
1
T
W
f = Q f ( z f e i + f Wf )
(31e)
V = [s
eTi
s
]Q e + [s T
e i
e Ti
1
] 2
3
DT W
D ) + CTr ( W
CT W
C ) + gTr ( W
gT W
g)
+ DTr ( W
hT W
h ) + f Tr ( W
fT W
f)
+ hTr ( W
Kd
1
where Q = I n n
2
(32)
1
I nn
H is positive definite due to proper
2
1
H
Kc
2
Remark 8: Realization of the control law (25) and update laws (31) does not
need the information of joint accelerations, regressor matrix, or their higher
order derivatives, which largely simplified its implementation.
Remark 9: Suppose a sufficient number of basis functions are used and the
approximation error can be ignored, then it is not necessary to include the
190
K d s + robust 1
+ Cv
td = g + Dv
i d = H 1[x p + td + h + robust 2 ]
u = f K c e i + robust 3
Consider the Lyapunov-like function candidate (29) again, and the update law
(31) without -modification; then the time derivative of V becomes
V = [s T
e Ti
s
]Q e + 1 s + 2 e + 3 e i
e i
s
1
V max ( A) e
2
e i
1
DT W
D ) + max (Q C )Tr ( W
CT W
C)
+ [ max (Q D )Tr ( W
2
gT W
g ) + max (Q h )Tr ( W
hT W
h ) + max (Q f )Tr ( W
fT W
f )]
+ max (Q g )Tr ( W
0
D
T
where A = 0 2C m Pt C m
0
0
191
s
1
1
1
V V + [max ( A ) min (Q )] e
+
2
2
2min (Q)
e i
3
1
DT W
D ) + [max (Q C ) C ]Tr ( W
CT W
C)
+ {[max (Q D ) D ]Tr ( W
2
gT W
g ) + [max (Q h ) h ]Tr ( W
hT W
h)
+[max (Q g ) g ]Tr ( W
fT W
f )} + 1 [ DTr ( WDT WD ) + CTr ( WCT WC )
+[max (Q f ) f ]Tr ( W
2
T
T
+ gTr ( Wg Wg ) + hTr ( Wh Wh ) + f Tr ( WfT Wf )]
where is selected to satisfy
min (Q)
min
1
1
+ 1 [ Tr ( W T W )
V V +
2
D
D
D
2 min (Q )
2
3
+ CTr ( WCT WC ) + gTr ( WgT Wg )
+ hTr ( WhT Wh ) + f Tr ( WfT Wf )]
(33)
1 ( )
1
1
V>
sup 2 ( ) +
[ DTr ( WDT WD )
2min (Q) t0
2
3 ( )
+ CTr ( WCT WC ) + gTr ( WgT Wg ) + hTr ( WhT Wh ) + f Tr ( WfT Wf )]
192
D, W
C, W
g, W
h , and W
f are uniformly ultimately
i.e., s, e , e i , W
bounded. In addition, (33) implies
1 ( )
1
( t t 0 )
sup 2 ( )
V (t ) e
V (t 0 ) +
1
[ DTr ( WDT WD ) + CTr ( WCT WC )
2
(34)
s
1
1
DT W
D ) + min (Q C )Tr ( W
CT W
C)
V min ( A ) e + [ min (Q D )Tr ( W
2
2
e i
gT W
g ) + min (Q h )Tr ( W
hT W
h ) + min (Q f )Tr ( W
fT W
f )]
+ min (Q g )Tr ( W
we may find the error bound
s
e
e i
+
1 ( )
2V (t 0 ) 2 (t t0 )
1
e
+
sup 2 ( )
min ( A)
min ( A) min (Q) t0 < < t
3 ( )
1
min ( A)
193
Ds + Cs + g + Dv + Cv = t
)
J t
t + B t t + t = Hi q (q , q
Li + Ri + K b q = u
Regressor-based
Controller
(6.5-1)
Regressor-free
K ds
+ Cv
td = g + Dv
= Y(q, q , v, v )p K d s
K ds
+ Cv
td = g + Dv
i d = H 1[x p + td + h ]
i d = H 1[x p + td + h ( td , q )]
d + Ri
+K
b q K c e i
u = Li
u = f K ce i
p Ti K c e i
(6.5-21), (6.5-23),
(6.5-25)
p = 1Y T s
p = 1e T
Adaptive
Law
(6.5-20)
1
D)
T + DW
W
D = Q D ( Z D vs
1
T
W
C = Q C ( Z C vs + C WC )
1
T
W
g = Q g ( z g s + g Wg )
1
T
W
h = Q h ( z h e + h Wh )
f)
Wf = Q f1 (z f e Ti + f W
(6.5-31)
Realization
Issue
Example 6.2:
Consider the flexible-joint robot in example 6.1 but with consideration of the
motor dynamics. Actual values of link parameters are selected as m1 =m2 =0.5(kg),
l1=l2=0.75(m), lc1=lc2 =0.375(m), I1=I2 =0.0234(kg-m2), and k1=k2=100(N-m/rad).
Parameters for the actuator part are chosen as j1 =0.02(kg-m2), j2 =0.01(kg-m2),
b1 =5(N-m-sec/rad), b2 =4(N-m-sec/rad), and h1 =h2 =10(N-m/A). Electrical
parameter for the actuator are L1 =L2 =0.025(H), r1 = r2 =1(), kb1 =kb2 =1
(Vol/rad/sec) (Chien and Huang 2007a). In order to observe the effect of the
actuator dynamics, the endpoint is required to track a 0.2m radius circle centered
at (0.8m, 1.0m) in 2 seconds which is much faster than the case in example 6.1.
The initial condition for the generalized coordinate vector is at
q(0) = (0) = [0.0022 1.5019 0 0]T , i.e., the endpoint is initially at
194
(0.8m, 0,75m). It is away from the desired initial endpoint position (0.8m, 0,8m)
for observation of the transient. The initial state for the reference model is
r (0) = [15.5 33.6 0 0]T , which is the same as the initial state for the
desired torque. The controller gain matrices are selected as
20 0
10 0
50 0
, =
, and K c =
Kd =
.
0 20
0 10
0 50
The initial value for the desired current can be found by calculation as
The gain matrices in the update law (31) are selected as Q D = 0.1I 44 ,
Q C1 = 0.1I 44 , Q g1 = 50I 22 , Q h1 = 1000I 22 , and Q f1 = 10000I 22 . The
approximation error is assumed to be neglected in this simulation, and the
-modification parameters are chosen as () = 0 .
195
The simulation results are shown in Figure 6.10 to 6.19. Figure 6.10 shows
the tracking performance of the robot endpoint and its desired trajectory in the
Cartesian space. It is observed that the endpoint trajectory converges nicely to
the desired trajectory, although the initial position error is quite large. After the
transient state, the tracking error is small regardless of the time-varying
uncertainties in D, C, g, h, and f. Computation of the complex regressor is
avoided in this strategy which greatly simplifies the design and implementation
of the control law. Figure 6.11 presents the time history of the joint space
tracking performance. The torque tracking performance is shown in Figure 6.12.
It can be seen that the torque errors for both joints are small. Figure 6.13
presents the current tracking performance. It is observed that the strategy can
give very small current error. The control voltages to the two motors are
reasonable that can be verified in Figure 6.14. Figure 6.15 to 6.19 are the
performance of function approximation. Although most parameters do not
converge to their actual values, they still remain bounded as desired.
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
Figure 6.10 Robot endpoint tracking performance in the Cartesian space. After some
transient, the endpoint converges to the desired trajectory nicely regardless of the
uncertainties in the robot model
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
Figure 6.11 Joint space tracking performance. It can be seen that the transient is fast,
and the tracking error is very small
20
15
10
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
20
196
10
0
10
20
30
40
Figure 6.12 Torque tracking performance. It can be seen that the torque errors for
both joints are small
80
60
40
20
0
20
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
40
20
0
20
40
60
80
100
Figure 6.13 Current tracking performance. It can be seen that the current errors for
both joints are small
60
40
20
0
20
0
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
40
20
0
20
40
60
80
Figure 6.14 The control voltages for both joints are reasonable
197
0.25
0.2
0.15
0.1
D(12)
D(11)
0.6
0.4
0.05
0
0.2
0.05
0
0.5
1
Time(sec)
1.5
0.1
0.3
0.2
0.25
0.1
0.2
D(22)
0.3
0
0.1
0.2
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.15
0.1
0.5
1
Time(sec)
1.5
0.05
0.3
0.3
0.2
0.2
0.1
C(12)
C(11)
0.1
0
0.1
0.1
0.2
0.3
0.5
1
Time(sec)
1.5
0.2
0.15
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.2
0.1
0.15
0.05
0
C(22)
D(21)
C(21)
198
0.05
0.1
0.1
0.05
0.15
0.2
0
0
0.5
1
Time(sec)
1.5
g(1)
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
2
1
g(2)
0
1
2
3
4
h(1)
4
2
0
2
4
6
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
10
5
h(2)
0
5
10
15
20
25
199
200
f(1)
30
20
10
0
10
20
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
60
40
f(2)
20
0
20
40
60
80
6.6 Conclusions
Adaptive controllers for flexible-joint robots in the free space are derived in
this chapter. The MRC rule is utilized in Section 6.2 for the control of a known
FJR. In Section 6.3, a regressor based adaptive controller is derived. However,
its implementation requires the knowledge of joint accelerations, the regressor
matrix, and their higher order derivatives. Therefore, the control strategy is not
practical. The regressor-free adaptive controller based on FAT is designed in
Section 6.4 whose realization do not need the joint accelerations, the regressor
matrix, or their higher order derivatives. The actuator dynamics is considered in
Section 6.5. A regreesor-based adaptive controller is developed for EDFJR in
Section 6.5.1. However, its realization still needs the joint accelerations,
regressor matrix, and their derivatives. A regreesor-free adaptive controller for
EDFJR is then introduced in Section 6.5.2 and it is free from the information for
joint accelerations or the regressor matrix.
Chapter 7
7.1 Introduction
Many practical operations of industrial robots such as grinding and
assembling involve contact problems between the end-effector and the
environment. To control the robot to interact compliantly with the environment,
several approaches have been proposed. Two major strategies are hybrid control
presented by Raibert and Craig (1981), and impedance control proposed by
Hogan (1985). These two approaches are based on the same assumption that the
robot is constructed with rigid links and joints. However, a lot of industrial robot
manipulators are designed with harmonic drives to gain high torque with
reduced motor speed. To achieve better output performance, the joint flexibility
due to the harmonic drives should be carefully considered.
Using the singular perturbation formulation and the concept of integral
manifold, Spong (1989) derived a control approach for force/impedance control
of flexible-joint robot. Jankowski and ElMaraghy (1991) proposed a nonlinear
decoupling and linearizing feedback control based on inverse dynamics. Ahmad
(1993) addressed the problem of hybrid force/position control utilizing the
constraint formulation developed by Yoshikawa (1986). Based on the solution
of the acceleration level inverse dynamic equations, Ider (2000) presented a
hybrid force and motion trajectory tracking control law. Ott et. al. (2003)
proposed an impedance controller based on an internal torque controller in a
cascade structure. Schaffer et. al. (2003) implemented the impedance controllers
based on three different kinds of nullspace projections for the realization of
nullspace stiffness.
Since the mathematical model is only an approximation of the real system,
the simplified representation of the system behavior will contain model
inaccuracies such as parametric uncertainties, and unmodeled dynamics.
201
202
D x
x + C x x + g x = J aT t Fext
(1a)
)
J t
t + B t t + t = a q(q , q
(1b)
) = Jq
+ Bq and t = K ( - q) .
where x n , J t = JK , B t = BK , q (q , q
Suppose all system parameters are known and a controller is to be designed so
that the closed-loop system behaves like the target impedance
M i (
x
x d ) + B i (x x d ) + K i (x x d ) = Fext
nn
(2)
nn
M i (
x i
x d ) + B i (x i x d ) + K i (x i x d ) = Fext
(3)
where x i is the state vector of (3). If we can design a controller such that
x converges asymptotically to xi then the two target impedances are equivalent.
Define e = x x i , s = e + e , and v = x i e , then we may rewrite (1a)
into
n
D x s + C x s + g x + D x v + C x v = J a T t Fext
(4)
We would like to regard t as the control torque to drive the system in (4) so
that the output tracking error will converge. This can be done by considering the
desired torque trajectory
td = J Ta (Fext + g x + D x v + C x v K d s)
(5)
Plugging (5) into (4), we have the dynamics for the output tracking error
D xs + C x s + K d s = J aT ( t td )
(6)
204
closed-loop system dynamics to the target impedance. To this end, we are going
to employ the MRC rule with the reference model
J r
r + B r r + K r r = J r td + B r td + K r td
(7)
nn
x p = A p x p + B p a B p q
(8a)
x m = A m x m + B m ( td + td )
(8b)
where x p = [ Tt Tt ]T 2n and x m = [ Tr
vectors, A p =
1
J t
are
augmented
I nn
I nn
0
2n 2n
2n 2n
and A m = 1
1
1
J t B t
J
K
J
B
r
r
r r
0
system matrices, and B p = 1 2 n n
and
J t
0
B m = 1 2 n n are augmented input gain matrices. The pair
J r K r
( A m , B m ) is controllable, and ( A m , C m ) is observable, where
C p = C m = [I n n 0] n 2 n are augmented output matrices characterizing
the output signals t = C p x p and r = C m x m , respectively. The transfer
function C m ( sI A m ) 1 B m is SPR due to proper selection of the matrices Jr,
Br and Kr. The MRC rule can thus be designed as
a = x p + td + h( td , q)
(9)
x p = A m x p + B m ( td + td )
(10)
e m = A me m
(11a)
e = C m e m
(11b)
1
V (s, e m ) = s T D x s + eTm Pt e m
2
(12)
2n 2n
where Pt = Pt
is a positive definite matrix satisfying the Lyapunov
T
T
equation A m Pt + Pt A m = C mC m . Along the trajectory of (6) and (11), we
may compute the time derivative of V as
T
V = [s T
s
eT ]Q 0
e
(13)
J a T
Kd
2
where Q =
is positive definite with proper selection of
1
J T
I nn
a
2
Kd and Kc. Therefore, s and em are uniformly bounded and square integrable. It
is also easy to prove that s and e are uniformly bounded. Hence, convergence
of s and em can be concluded by Barbalats lemma. This implies that the closed
loop system converges asymptotically to the target impedance.
Remark 1: This strategy is feasible only when all system parameters are
available. In addition, we need to feedback accelerations and external forces as
well as their higher order derivatives that largely restrict the practical
applications.
206
D x s + C x s + g x + D x v + C x v = J a T t Fext
(1a)
)
J t
t + B t t + t = a q(q , q
(1b)
We would like to design a control torque a such that the closed-loop system
converges to the target impedance (7.2-3)
M i (
x i
x d ) + B i (x i x d ) + K i (x i x d ) = Fext
(2)
x v K d s)
x v + C
td = J Ta (Fext + g x + D
= J Ta [Fext + Y ( x, x , v, v )p x K d s ]
(3)
(4)
x = Cx C
x , g x = g x g x , and p x = p x p x .
x = Dx D
x, C
where D
Hence, if a proper control torque a can be constructed such that t td ,
and an update law can be designed to have p x p x , then (4) implies
convergence of s, i.e., the closed-loop system behaves like the target impedance.
Here, the MRC rule is going to be used again to complete the design. Let us
consider the reference model (7.2-7) and the state space representation (7.2-8).
The control torque is selected as (7.2-9) to have the torque tracking loop
dynamics (7.2-11)
e m = A me m
(5a)
e = C m e m
(5b)
1
1
V (s, e m , p x ) = s T D x s + eTm Pt e m + p Tx p x
2
2
(6)
2n 2n
V = s T K d s + s T J a T e eT e p Tx (p x + Y T s)
(7)
p x = 1Y T s
(8)
D x s + C x s + g x + D x v + C x v = J a T t Fext
(1a)
)
J t
t + B t t + t = a q(q , q
(1b)
x v K d s)
x v + C
td = J Ta (Fext + g x + D
(2)
x v g x + J aT ( t td )
x v C
D xs + C x s + K d s = D
(3)
Along the same line as in the previous section, we would like to employ the MRC
rule to have t td , and the reference model in (7.2-7) is to be used again
J r
r + B r r + K r r = J r td + B r td + K r td
(4)
208
x p = A p x p + B p a B p q
(5a)
x m = A m x m + B m ( td + td )
(5b)
The control torque can thus be designed based on the MRC rule as
a = x p + td + h
(6)
x p = A m x p + B m ( td + td ) + B p (h h)
(7)
e m = A me m + B p (h h)
(8a)
e = C m e m
(8b)
Therefore, if we may find a proper update law to have h h, then the torque
tracking can be achieved. To proceed, let us consider the function approximation
representations of D x , C x , g x , and h as
D x = WDT x Z D x + D x
(9a)
C x = WCT x Z C x + C x
(9b)
g x = WgTx z g x + g x
(9c)
h = WhT z h + h
(9d)
x=W
DT Z D
D
x
x
(9e)
x=W
CT Z C
C
x
x
(9f)
gT z g
g x = W
x
x
(9g)
hT z h
h = W
(9h)
DT Z D v
D xs + C x s + K d s = J aT ( t td ) W
x
x
T
T
C ZC v W
g z g + 1
W
(10a)
hT z h + B p 2
e m = A me m B p W
(10b)
D ,W
C ,W
g ,W
h ) = 1 sT Dxs + eTmPt e m + 1 Tr (W
DT QD W
D
V (s, e m , W
x
x
x
x
x
x
2
2
CT QC W
C +W
gT Qg W
g +W
hT Qh W
h)
+W
x
where
Q h
Q D x n
n h n h
D n2 D
Q C x n
C n2 C
(11)
Q g x n g n g , and
T
Pt 2 n 2 n is a positive
A Tm Pt + Pt A m = CTmC m .
V = s T K d s + s T J aT e eT e + s T 1 + e Tm Pt B p 2
T
DT (Z D vs
T
T + Q Dx W
Tr[ W
D x ) + WC x ( Z C x vs + Q C x WC x )]
x
x
T
gT (z g s T + Q g W
T
Tr[ W
(12)
g x ) + Wh ( z h e m Pt B p + Q h Wh )]
x
x
x
By defining B m = B p to have eTm Pt B p = eT , and selecting the update laws as
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
(13a)
1
T
W
C x = Q C x ( Z C x vs + C x WC x )
(13b)
1
T
W
g x = Q g x ( z g x s + g x Wg x )
(13c)
1
T
W
h = Q h ( z h e + h Wh )
(13d)
210
s
1
DT W
D )
eT ]Q + [s T eT ] + D x Tr ( W
x
x
e
2
CT W
gT W
hT W
C ) + g Tr ( W
g ) + hTr ( W
h)
+ C x Tr ( W
x
x
x
x
x
V = [s T
Kd
where Q =
1 J T
2 a
(14)
J a T
2
is positive definite by proper selection of Kd.
I nn
s
eT ]Q 0 which
e
x v K d s + robust 1 )
x v + C
td = J Ta (Fext + g x + D
a = x p + td + h + robust 2
where robust 1 and robust 2 are robust terms to be designed. Let us consider the
Lyapunov function candidate (11) and the update law (13) again but with
() = 0 . The time derivative of V can be computed as
V = [s T
s
eT ]Q + 1 s + 2 e + s T robust 1 + eT robust 2
e
1
V max (A)
2
s
1
T
T
e + 2 [max (QDx )Tr(WDx WDx ) + max (QCx )Tr(WCx WCx )
gT W
g ) + max (Qh )Tr(W
hT W
h )]
+ max (Qg x )Tr(W
x
x
D
where A =
2CTm Pt C m
1
V V + [max ( A ) min (Q)]
2
s
1
e + 2 (Q )
min
1
2
1
DT W
D ) + [max (Q C )
+ {[max (Q D x ) D x ]Tr ( W
x
x
x
2
CT W
C ) + [max (Q g ) g ]Tr ( W
gT W
g )
C ]Tr ( W
x
hT W
h )} + 1 [ D Tr ( WDT WD )
+[ max (Q h ) h ]Tr ( W
x
x
x
2
+ C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x ) + hTr ( WhT Wh )]
where is selected to satisfy
min (Q)
.
max ( A ) max (Q D x ) max (Q C x ) max (Q g x ) max (Q h )
min
Dx
Cx
gx
V V +
1
1
+ [ D x Tr ( WDT x WD x )
2 min (Q) 2
2
1
(15)
212
1 ( )
1
V>
sup
[ D x Tr ( WDT x WD x )
+
2min (Q) t0 2 ( )
2
1
1 ( )
V (t 0 ) +
sup
2min (Q) t 0 < < t 2 ( )
1
( t t 0 )
1
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
2
+ g x Tr ( WgTx Wg x ) + hTr ( WhT Wh )]
+
1
V min (A)
2
s
1
T
T
e + 2 [min (QDx )Tr(WDx WDx ) + min (QCx )Tr(WCx WCx )
gT W
g ) + min (Qh )Tr(W
hT W
h )]
+min (Qg x )Tr(W
x
x
we may find the error bound
s(t )
e (t )
+
2V (t 0 ) 2 (t t 0 )
1
e
+
sup
min ( A)
min ( A) min (Q) t0 < < t
1
min ( A)
1 ( )
( )
2
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
1
Table 7.1 summarizes the adaptive impedance control designs for FJR in this
chapter in terms of the controller forms, adaptive laws and implementation issues.
Table 7.1 Summary of the adaptive impedance control for FJR
Flexible-joint robots interacting with environment
D x s + C x s + g x + D x v + C x v = J aT t Fext
)
J t
t + B t t + t = a q (q , q
Regressor-based
td =
Controller
J Ta
x v
(Fext + g x + D
x v K d s)
+C
= J Ta [Fext + Y ( x, x , v , v )p x
K d s]
a = x p + td + h( td , q)
(7.3-1)
Regressor-free
td =
J Ta
x v
(Fext + g x + D
x v K d s)
+C
a = x p + td + h
(7.4-2), (7.4-6)
(7.3-3), (7.2-9)
p x = 1Y T s
Adaptive Law
(7.3-8)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
1
T
WC x = Q C x (Z C x vs + C x WC x )
g = Q g1 (z g s T + g W
g )
W
x
x
x
x
x
1
T
Wh = Q h (z he + h Wh )
(7.4-13)
Realization
Issue
Example 7.1:
Consider the flexible-joint robot (3.6-3), and we would like to verify the
efficacy of the strategy developed in this section by computer simulation. Actual
values of the system parameters are m1 =m2 =0.5(kg), l1 =l2 =0.75(m),
lc1 =lc2 =0.375(m), I1 =I2 =0.0234(kg-m2), and k1 =k2 =100(N-m/rad). Parameters
at the motor side are j1 =0.02(kg-m2), j2 =0.01(kg-m2), b1 =5(N-m-sec/rad), and
b2=4(N-m-sec/rad). The endpoint and the motor angle start from the initial
value x(0) = [0.8m 0.75m 0 0]T and (0) = [0.0022 1.5019 0 0]T
respectively to track a 0.2m radius circle centered at (0.8m, 1.0m) in 10 seconds
without knowing its precise model. The initial state for the reference model is
r (0) = [9.8 3.2 0 0]T , which is the same as the initial value for the
214
20 0
10 0
, and =
Kd =
.
0 20
0 10
Parameter matrices in the target impedance are selected as
0
0.5 0
100 0
1000
, Bi =
, and K i =
.
Mi =
1000
0 0.5
0 100
0
The 11-term Fourier series is selected as the basis function for the
D and W
C are
approximation of entries in D x , C x , g x , and h. Therefore, W
44 2
22 2
in
, and Wg and Wh are in
. The initial weighting vectors for the
entries are assigned to be
We assume in the simulation that the approximation error can be neglected, and
hence the -modification parameters are chosen as () = 0 . The simulation
results are shown in Figure 7.1 to 7.9. Figure 7.1 shows the robot endpoint
tracking performance in the Cartesian space. It can be seen that after some
transient response the endpoint converges to the desired trajectory in the free
space nicely. Afterwards, the endpoint contacts with the constraint surface at
xw =0.95(m) compliantly. When entering the free space again, the endpoint
follows the desired trajectory with very small tracking error regardless of the
system uncertainties. Figure 7.2 presents the time history of the joint space
tracking performance. The transient states converge very fast without unwanted
oscillations. The joint space trajectory in the constraint motion phase is smooth.
Figure 7.3 gives the torque tracking performance. The control efforts to the two
joints are reasonable that can be verified in Figure 7.4. The external forces
exerted on the endpoint during the constraint motion phase are shown in Figure
7.5. Figure 7.6 to 7.9 are the performance of function approximation. Although
most parameters do not converge to their actual values, they still remain
bounded as desired.
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
Figure 7.1 Robot endpoint tracking performance in the Cartesian space. After some transient
the endpoint converges to the desired trajectory in the free space nicely. Afterwards, the
endpoint contacts with the constraint surface compliantly. When entering the free space again,
the endpoint follows the desired trajectory with very small tracking error regardless of the
system uncertainties
216
0.8
0.6
0.4
0.2
5
Time(sec)
10
10
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
Figure 7.2 The joint space tracking performance. The transient is very fast and the constraint
motion phase is smooth
20
10
0
10
20
30
40
50
5
Time(sec)
10
5
Time(sec)
10
10
0
10
20
30
40
60
40
20
0
20
40
0
5
Time(sec)
10
5
Time(sec)
10
10
0
10
20
30
40
Figure 7.4 The control efforts for both joints are all reasonable
50
40
30
20
10
0
5
Time(sec)
10
5
Time(sec)
10
0.5
0.5
Figure 7.5 Time histories of the external forces in the Cartesian space
1.5
0.6
Dx(12)
Dx(11)
0.8
0.4
0.5
0.2
0
4
6
Time(sec)
0.5
10
1.5
4
6
Time(sec)
10
4
6
Time(sec)
10
3.5
3
2.5
Dx(22)
Dx(21)
0.5
2
1.5
1
0.5
0.5
4
6
Time(sec)
10
0.8
0.8
0.6
0.6
Cx(12)
Cx(11)
0.4
0.2
0.4
0.2
0.2
0.2
0.4
4
6
Time(sec)
10
0.4
4
6
Time(sec)
10
4
6
Time(sec)
10
1.5
2
Cx(22)
1
Cx(21)
218
0.5
0
1
0
0.5
1
1
1.5
4
6
Time(sec)
10
gx(1)
4
3
2
1
0
1
5
Time(sec)
10
5
Time(sec)
10
14
12
gx(2)
10
8
6
4
2
0
h(1)
2
1
0
1
5
Time(sec)
10
5
Time(sec)
10
2
0
h(2)
2
4
6
8
220
D x s + C x s + g x + D x v + C x v = J a T t Fext
(1a)
)
J t
t + B t t + t = Hi q(q , q
(1b)
Li + Ri + K b q = u
(1c)
td = J Ta (Fext + g x + D x v + C x v K d s)
(2)
D xs + C x s + K d s = J a T ( t td )
(3)
To ensure torque tracking in (1b), the MRC rule is applied with the reference
model
J r
r + B r r + K r r = J r td + B r td + K r td
(4)
nn
x p = A p x p + B p Hi B p q
(5a)
x m = A m x m + B m ( td + td )
(5b)
221
vectors. A p =
1
J t
are
augmented
Inn
I nn
0
2 n 2n
2n 2 n
and A m = 1
1
1
J t Bt
J r K r J r B r
0
system matrices, and B p = 1 2 n n
and
J t
0
B m = 1 2 n n are augmented input gain matrices. Let t = C p x p
J r K r
and r = C m x m be respectively the output signal vector for (5a) and (5b),
where C p = C m = [I n n 0] n 2 n are augmented output signal matrices.
The pair ( A m , B m ) is controllable, ( A m , C m ) is observable, and the transfer
function C m ( sI A m ) 1 B m is SPR. According to the MRC design, the desired
current id is selected as
i d = H 1[x p + td + h( td , q )]
(6)
x p = A m x p + B m ( d + td ) + B p H(i i d )
(7)
e m = A me m + B p H (i i d )
(8a)
e = C m e m
(8b)
u = Li d + Ri + K b q K c e i
(9)
Le i + K c e i = 0
(10)
222
To prove the closed loop stability of the whole system, let us consider a
Lyapunov-like function candidate
1
1
V (s, e m , e i ) = s T Ds + eTm Pt e m + eTi Le i
2
2
(11)
2n 2n
where Pt = Pt
is a positive definite matrix satisfying the Lyapunov
T
T
equation A m Pt + Pt A m = C mC m . Along the trajectories of (3), (8) and (10),
the time derivative of V can be computed as
T
1
x 2C x )s + s T J aT e
V = s T K d s + s T (D
2
eT e + e Tm Pt B p He i e Ti K ce i
(12)
V = [s
eTi
s
] Q e 0
e i
(13)
J aT
0
Kd
2
1
1
T
where Q = J a
I nn
H is positive definite due to proper
2
2
1
0
H
Kc
2
selection of Kd and Kc. Therefore, we have proved that s, e and e i are
uniformly bounded, and their square integrability can also be proved from (13).
Furthermore, uniformly boundedness of s , e , and e i are also easy to be
proved, and q q d , t td , and i i d follow by Barbalats lemma. Hence,
the closed-loop system behaves like the target impedance.
Remark 6: To implement the control strategy, all system parameters are
and its higher order
required to be available, and we need to feedback q
derivatives. Therefore, the design introduced in this section is not feasible for
practical applications.
223
x v K d s)
x v + C
td = J Ta (Fext + g x + D
= J Ta [Fext + Y ( x, x , v, v )p x K d s ]
(14)
= Y (x, x , v, v )p x + J a T ( t td )
(15)
x = Cx C
x , g x = g x g x , and p x = p x p x .
x = Dx D
x, C
where D
Therefore, if we may find a controller so that t td and an update law to
have p p, then (15) implies asymptotic convergence of the output error. To
this end, we would like to use the MRC rule with the reference model in (4) and
the state space representation in (5). The desired current trajectory is designed as
the one in (6) to have the dynamics for the torque tracking loop as in (8). Instead
of (9), a new controller is designed as
d + Ri
+K
b q K c e i
u = Li
= p Ti K c e i
(16)
T R
T K
Tb ]T 3n n , and we
where = [i Td i T q T ]T 3n , p i = [L
may have the dynamics for the current tracking loop as
Le i + K ce i = p Ti
(17)
1
V (s, e m , e i , p x , p i ) = s T D xs + e Tm Pt e m
2
1
1
+ e Ti Le i + p Tx p x + Tr (p Ti i p i )
2
2
(18)
224
where Pt = Pt
T
2n 2n
A Tm Pt
CTmC m .
V = s T K d s + s T J a T e eT e + e Tm Pt B p He i e Ti K c e i
p T (p + Y T s) Tr[p T ( p + e T )]
x
(19)
p x = 1Y T s
(20a)
p i = i1e Ti
(20b)
Thus, (19) becomes (13); therefore, we have proved that s, e and e i are
uniformly bounded, and their square integrability can also be proved from (13).
Furthermore, uniformly boundedness of s , e , and e i are also easy to be
proved, and hence, q q d , t td , and i i d follow by Barbalats
lemma. Consequently, we may conclude that the closed-loop system will behave
like the target impedance regardless of the system uncertainties.
Remark 7: To implement the controller strategy, we do not need to have the
and calculate
knowledge of most system parameters, but we have to feedback q
the regressor matrix and their higher order derivatives. Therefore, the design
introduced in this section is not feasible for practical applications, either.
x v K d s)
x v + C
td = J Ta (Fext + g x + D
(21)
The dynamics for the output tracking loop can thus be written as
x v g x + J aT ( t td )
x v C
D xs + C x s + K d s = D
(22)
225
the output error. To this end, we would like to use the MRC rule again with the
reference model in (4) and the state space representation in (5). The desired
current trajectory is designed according to (6) to be
i d = H 1[x p + td + h ]
(23)
e m = A me m + B p H (i i d ) + B p (h h)
(24a)
e = C m e m
(24b)
u = f K c e i
(25)
Le i + K ce i = f f
(26)
D x = WDT x Z D x + D x
(27a)
C x = WCT x Z C x + C x
(27b)
g x = WgTx z g x + g x
(27c)
h = WhT z h + h
(27d)
f = WfT z f + f
(27e)
226
n n
where WD x n D n , WC x n C n , Wg x g , Wh h ,
n n
and Wf f 2 are weighting matrices
for Dx, Cx, gx, h, and f, respectively,
2
n 1
n 1
while ZDx n D n , Z C x n C n , z g x g , z h h , and
n 1
z f f are basis function matrices. Likewise, we have the representations
for the estimates as
2
x=W
DT Z D
D
x
x
(27f)
x=W
CT Z C
C
x
x
(27g)
gT z g
g x = W
x
x
(27h)
hT z h
h = W
(27i)
fT z f
f = W
(27j)
Thus, the output error dynamics (22), torque tracking error dynamics (24a), and
current tracking error dynamics (26) can be rewritten as
DT Z D v
D xs + C x s + K d s = J a T ( t td ) W
x
x
T
T
WC Z C v Wg z g + 1
(28a)
hT z h + B p He i + B p 2
e m = A me m B p W
(28b)
fT z f + 3
Le i + K ce i = W
(28c)
D ,W
C ,W
g ,W
h, W
f ) = 1 [sT D xs + 2eTmPt e m
V (s, e m , ei , W
x
x
x
2
T
T
T
C +W
gT Qg W
g )
+ ei Lei + Tr(WDx QDx WDx + WCx QCx W
x
x
x
x
T
T
+ Tr(Wh Qh Wh + Wf Qf Wf )]
(29)
n n
g
where matrices Q D x n D n D , Q C x n C n C , Q g x g
,
n f n f
n h n h
Q h
, and Q f
are positive definite. Along the trajectory
of (28), we may compute the time derivative of V as
227
V = s T K d s + s T J a T e eT e + e Tm Pt B p He i e Ti K ce i
DT (Z D vs
T + Q Dx W
+s T 1 + eTm Pt B p 2 + e Ti 3 Tr[ W
D x )]
x
x
T
CT (Z C vs T + Q C W
T
Tr[ W
C x ) + Wg x ( z g x s + Q g x Wg x )]
x
x
x
T
hT (z he Tm Pt B p + Q h W
T
Tr[ W
h ) + Wf ( z f e i + Q f Wf )]
(30)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
(31a)
1
T
W
C x = Q C x ( Z C x vs + C x WC x )
(31b)
1
T
W
g x = Q g x ( z g x s + g x Wg x )
(31c)
1
T
W
h = Q h ( z h e + h Wh )
(31d)
1
T
W
f = Q f ( z f e i + f Wf )
(31e)
V = [s
eTi
s
]Q e + [s T
e i
e Ti
1
] 2
3
DT W
D ) + C Tr ( W
CT W
C ) + g Tr ( W
gT W
g )
+ D x Tr ( W
x
x
x
x
x
x
x
x
hT W
h ) + f Tr ( W
fT W
f)
+ hTr ( W
1
J a T
Kd
2
1
T
where Q = J a
I nn
2
1
0
H
2
selection of Kd and Kc.
(32)
1
H is positive definite due to proper
2
Kc
228
Remark 8: Realization of the control law (25) and update laws (31) does not
need the information of joint accelerations, regressor matrix, or their higher
order derivatives, which largely simplified their implementation.
Remark 9: Suppose a sufficient number of basis functions are used and the
approximation error can be ignored, then it is not necessary to include the
-modification terms in (31). Hence, (32) can be reduced to (13), and
convergence of s, e and ei can be further proved by Barbalats lemma.
Therefore, the closed loop system behaves like the target impedance.
Remark 10: If the approximation error cannot be ignored, but we can find
positive numbers 1, 2 and 3 such that i i , i=1,2,3, then robust terms
robust 1, robust 2 and robust 3 can be included into (21), (23) and (25) to have
x v K d s + robust )
x v + C
td = J Ta (Fext + g x + D
i d = H 1[x p + td + h + robust 2 ]
u = f K c e i + robust 3
Consider the Lyapunov-like function candidate (29) again, and the update law
(31) without -modification; then the time derivative of V becomes
V = [s T
e Ti
s
]Q e + 1 s + 2 e + 3 e i
e i
229
s
1
1
DT W
D )
V max ( A ) e + [ max (Q D x )Tr ( W
x
x
2
2
e i
CT W
C ) + max (Q g )Tr ( W
gT W
g )
+ max (Q C x )Tr ( W
x
x
x
x
x
T
T
hW
h ) + max (Q f )Tr ( W
f W
f )]
+ max (Q h )Tr ( W
0
D
where A = 0 2CTm Pt C m
0
0
s
1
1
V V + [max ( A ) min (Q)] e +
2
2 min (Q)
e i
1
2
3
1
DT W
D ) + [max (Q C )
+ {[max (Q D x ) D x ]Tr ( W
x
x
x
2
T
T
C W
C ) + [max (Q g ) g ]Tr ( W
g W
g )
C x ]Tr ( W
x
x
x
x
x
x
T
min
Dx
Cx
gx
1
1
+ 1 [ Tr ( W T W )
V V +
2
Dx
Dx
Dx
2 min (Q)
2
3
+ C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )
+ hTr ( WhT Wh ) + f Tr ( WfT Wf )]
(33)
230
1 ( )
1
1
V>
sup 2 ( ) +
[ D x Tr ( WDT x WD x )
2min (Q) t0
2
3 ( )
+ C x Tr ( WCT x WC x ) + g x Tr ( WgTx Wg x )
+ hTr ( WhT Wh ) + f Tr ( WfT Wf )]
D , W
C , W
g , W
h , and W
f are uniformly ultimately
i.e., s, e , e i , W
x
x
x
bounded. In addition, (33) implies
2
1 ( )
1
( t t 0 )
V (t ) e
V (t 0 ) +
sup 2 ( )
s
1
1
DT W
D )
V min ( A ) e + [ min (Q D )Tr ( W
x
x
2
2
e i
CT W
C ) + min (Q g )Tr ( W
gT W
g )
+ min (Q C x )Tr ( W
x
x
x
x
x
T
T
hW
h ) + min (Q f )Tr ( W
f W
f )]
+ min (Q h )Tr ( W
we may find the error bound
1 ( )
2V (t 0 ) 2 (t t0 )
1
e
+
sup 2 ( )
min ( A)
min ( A) min (Q) t0 < < t
3 ( )
1
+
[ D x Tr ( WDT x WD x ) + C x Tr ( WCT x WC x )
min ( A)
s
e
e i
T
g x Tr ( Wg x Wg x
) +
T
hTr ( Wh Wh ) +
T
f Tr ( Wf Wf
1
)] 2
231
D x s + C x s + g x + D x v + C x v = J a T t Fext
(7.5-1)
)
J t
t + B t t + t = Hi q (q , q
Li + Ri + K bq = u
Regressor-based
x v
(Fext + g x + D
+C x v K d s)
= J Ta [Fext + Y ( x, x , v , v )p x
K d s]
td =
Controller
J Ta
Regressor-free
x v
(Fext + g x + D
+C x v K d s)
i d = H 1[x p + td + h ]
u = f K c e i
td =
J Ta
= p Ti K ce i
(7.5-14), (7.5-6), (7.5-16)
p x = 1Y T s
p = 1eT
Adaptive Law
(7.5-20)
1
D )
T + Dx W
W
D x = Q D x ( Z D x vs
x
1
T
C )
WC x = Q C x ( Z C x vs + C x W
x
g = Q g1 (z g s T + g W
g )
W
x
x
x
x
x
1
T
Wh = Q h ( z h e + h Wh )
f = Q f1 ( z f e Ti + f W
f)
W
(7.5-31)
Realization
Issue
Example 7.2:
Consider flexible-joint robot in example 7.1 but with consideration of the
motor dynamics. Actual values of link parameters are selected as m1= m2= 0.5(kg),
l1= l2= 0.75(m), lc1= lc2=0.375(m), I1= I2=0.0234(kg-m2), and k1= k2=100(N-m/rad).
Parameters for the actuator part are chosen as j1= 0.02(kg-m2), j2= 0.01(kg-m2),
b1= 5(N-m-sec/rad), b2= 4(N-m-sec/rad), and h1= h2= 10(N-m/A). Electrical
parameter for the actuator are L1= L2= 0.025(H), r1= r2= 1(), kb1= kb2= 1
(Vol/rad/sec) (Chien and Huang 2007a). The stiffness of the constrained surface
232
50 0
20 0
200 0
, =
, and K c =
Kd =
.
0 50
0 20
0 200
The initial value for the desired current can be found by calculation as
0
0.5 0
100 0
1500
, Bi =
, and K i =
.
Mi =
1500
0 0.5
0 100
0
The 11-term Fourier series is selected as the basis function for the approximation.
h , and W
f are in
D and W
C are in 44 2 , while W
g , W
Therefore, W
x
x
x
22 2
233
The gain matrices in the update law (31) are selected as Q D1x = 0.001I 44 ,
Q C1x = 0.001I 44 , Q g1x = 100I 22 , Q h1 = 10I 22 , and Q f1 = 10000I 22 . In this
simulation, the approximation error is assumed to be neglected, and the modification parameters are chosen as () = 0 .
The simulation results are shown in Figure 7.10 to 7.20. Figure 7.10 shows
the tracking performance of the robot endpoint and its desired trajectory in the
Cartesian space. It is observed that the endpoint trajectory converges smoothly
to the desired trajectory in the free space tracking and contacts compliantly in
the constrained motion phase. Computation of the complex regressor is avoided
in this strategy which greatly simplifies the design and implementation of the
control law. Although the initial error is quite large, the transient state takes only
about 0.2 seconds which can be justified from the joint space tracking history in
Figure 7.11. The torque tracking performance is shown in Figure 7.12. It can be
seen that the torque errors for both joints are small. Figure 7.13 presents the
current tracking performance. It is observed that the strategy can give very small
current error. The control voltages to the two motors are reasonable that can be
verified in Figure 7.14. Figure 7.15 to 7.20 are the performance of function
approximation. Although most parameters do not converge to their actual
values, they still remain bounded as desired.
1.2
1.15
1.1
1.05
0.95
0.9
0.85
0.8
0.75
0.6
0.65
0.7
0.75
0.8
X
0.85
0.9
0.95
0.8
0.6
0.4
0.2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
60
40
20
0
20
40
60
80
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
10
234
0
10
20
30
40
50
60
20
15
10
5
0
5
10
15
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
10
10
200
150
100
50
0
50
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
80
60
40
20
0
20
40
235
80
60
40
20
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.5
0.5
0.8
1.5
0.6
Dx(12)
Dx(11)
0.4
0.5
0.2
0.5
1
Time(sec)
1.5
0.5
1.5
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
3.5
3
2.5
Dx(22)
Dx(21)
236
0.5
2
1.5
1
0.5
0.5
0.5
1
Time(sec)
1.5
3
2
Cx(12)
Cx(11)
1
0
1
0
1
2
3
2
0
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
0.5
1
Time(sec)
1.5
10
4
5
Cx(22)
Cx(21)
2
0
2
5
4
6
0.5
1
Time(sec)
1.5
10
gx(1)
8
6
4
2
0
2
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
20
gx(2)
15
10
237
h(1)
4
2
0
2
4
6
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
10
5
h(2)
0
5
10
15
f(1)
100
50
0
50
100
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
Time(sec)
1.2
1.4
1.6
1.8
100
80
60
f(2)
238
40
20
0
20
40
7.6 Conclusions
239
7.6 Conclusions
In this chapter, we consider the case when the flexible-joint robot contacts
with the environment. The adaptive impedance controllers are derived to give
good performance in both free space tracking and constraint motion phase.
Firstly, the MRC rule is utilized in Section 7.2 for the impedance control of a
known FJR. To deal with the uncertainties, a regressor based adaptive controller
is derived in Section 7.3. However, its implementation requires the knowledge
of joint accelerations, the regressor matrix, and their higher order derivatives.
Therefore, the control strategy is not practical. The regressor-free adaptive
impedance controller is developed in Section 7.4 whose realization do not need
the joint accelerations, the regressor matrix, or their higher order derivatives.
The actuator dynamics is included in the system equation in Section 7.5. A
regreesor-based adaptive impedance controller is then developed for EDFJR in
Section 7.5.1. However, its realization still needs the joint accelerations,
regressor matrix, and their derivatives. A regreesor-free adaptive impedance
controller for EDFJR is then introduced in Section 7.5.2 and it is free from the
information for joint accelerations or the regressor matrix. Several simulation
results show that the regressor-free design can give good performance to an
EDFJR operating in a compliant motion environment.
Appendix
Lemma A1:
Let w Ti = [ wi1 wi 2 win ] 1 n , i=1,,m and W is a block
diagonal matrix defined as W = diag{w 1 , w 2 , , w m } mn m . Then,
m
Tr ( W W) =
T
2
i
i =1
w 1T
0
T
W W=
0 w1
0 0
w Tm 0
0
w T2
w 1T w 1
0
T
0
w2w2
=
0
0
w1
w2
w Tm w m
2
w m
0
2
Therefore, we have Tr ( W T W) =
0
w2
2
i
i =1
241
0
0
w m
242
Appendix
Lemma A2:
w Ti = [ wi1 wi 2 win ] 1 n
and
= [vi1 vi 2 vin ] 1 n , i=1,,m. Let W and V be block diagonal
matrices that are defined as W = diag{w 1 , w 2 , , w m } mn m and
Suppose
v Ti
wi .
i =1
v1T
0
T
V W =
0 w1
0 0
v Tm 0
0
v T2
v1T w 1
0
=
v T2 w 2
w2
0
0
wm
v Tm w m
Hence,
Tr (V T W ) = v1T w 1 + v T2 w 2 + ... + v Tm w m
v1 w 1 + v 2 w 2 + ... + v m w m
m
wi .
i =1
Lemma A3:
is a matrix defined as
Let W be defined as in lemma A1, and W
TW
) 1 Tr ( W T W) 1 Tr ( W
TW
)
Tr ( W
2
2
Appendix
243
Proof:
TW
) = Tr ( W
T W ) Tr ( W
TW
)
Tr ( W
m
( w
i )
wi w
i =1
1
2
[ w
i
w
i )
w
i wi )2 ]
( w
i =1
m
( w
i =1
1
1
TW
)
= Tr ( W T W) Tr ( W
2
2
w
i =1 j =1
Proof:
W W
= [ W1T
W pT
W1
]
W p
= W1T W1 + + W pT W p
2
ij
244
Appendix
Tr ( W T W ) = Tr ( W1T W1 ) + + Tr ( W pT W p )
m
w1 j
+ +
j =1
p
pj
j =1
2
ij
i =1 j =1
Lemma A5:
Let
and
p
Tr (V T W)
be
matrices
defined
in
lemma
A4,
Then,
ij
w ij .
i =1 j =1
Proof:
Tr (V T W) = Tr (V1T W1 ) + + Tr (V pT W p )
m
1j
w1 j + +
j =1
p
pj
w pj
j =1
ij
w ij
i =1 j =1
Lemma A6:
is a matrix defined as
Let W be defined as in lemma A4, and W
=WW
, where W
is a matrix with proper dimension. Then
W
TW
) 1 Tr ( W T W) 1 Tr ( W
TW
)
Tr ( W
2
2
Appendix
Proof:
TW
) = Tr ( W
T W ) Tr ( W
TW
)
Tr ( W
p
( w
ij )
w ij w
ij
i =1 j =1
1
2
[ w
ij
w
ij )
w
ij
ij w ij ) 2 ]
( w
i =1 j =1
p
( w
ij
i =1 j =1
1
1
TW
)
= Tr ( W T W) Tr ( W
2
2
245
References
248
References
References 249
250
References
References 251
252
References
C. Ott, A. A. Schaffer and G. Hirzinger, Comparison of Adaptive and Nonadaptive Tracking Control Laws for a Flexible Joint Manipulator, Proceedings
IEEE/RSJ International Conference on Intelligent Robots and Systems,
Lausanne, Switzerland, pp. 2018-2024, Oct. 2000.
A. Ott, A. A. Schaffer, A. Kugi and G. Hirzinger, Decoupling Based
Cartesian Impedance Control of Flexible-joint Robots, Proceedings IEEE
International Conference on Robotics and Automation, Taipei, Taiwan,
pp. 3101-3107, Sept. 2003.
S. Ozgoli and H. D. Taghirad, A Survey on the Control of Flexible Joint
Robots, Asian Journal of Control, vol. 8, no. 4, pp. 1-15, Dec. 2006.
P. R. Pagilla and M. Tomizuka, An Adaptive Output Feedback Controller
for Robot Arms: Stability and Experiments, Automatica, vol. 37, no. 7,
pp. 983-995, July 2001.
J. S. Park, Y. A. Jiang, T. Hesketh and D. J. Clements, Trajectory Control of
Manipulators Using Adaptive Sliding Mode Control, Proceedings of IEEE
SOUTHEASTCON, pp. 142-146, 1994.
Z. Qu and J. Dorsey, Robust Tracking Control of Robots by a Linear
Feedback Law, IEEE Transactions on Automatic Control, vol. 36, no. 9,
pp. 1081-1084, 1991.
M. H. Raibert and J. J. Craig, Hybrid Position/Force Control of
Manipulators, ASME Journal of Dynamic Systems, Measurements and Control,
vol. 102, pp. 126-133, 1981.
C. E. Rohrs, L. S. Valavani, M. Athans and G. Stein, Robustness of
Continuous-time Adaptive Control Algorithms in the Presence of Unmodeled
Dynamics, IEEE Transactions on Automatic Control, vol. 30, pp. 881-889,
1985.
N. Sadegh and R. Horowitz, Stability and Robustness Analysis of a Class of
Adaptive Controller for Robotic Manipulators, International Journal of
Robotics Research, vol. 9, no. 3, pp. 74-92, 1990.
References 253
254
References
References 255
Abbreviations
RR
RRE
EDRR
EDRRE
FJR
FJRE
EDFJR
EDFJRE
PE
FAT
MRC
MRAC
SPR
UUB
: Rigid Robot
: Rigid Robot interacting with Environment
: Electrically-Driven Rigid Robot
: Electrically-Driven Rigid Robot interacting with Environment
: Flexible-Joint Robot
: Flexible-Joint Robot interacting with Environment
: Electrically-Driven Flexible-Joint Robot
: Electrically-Driven FJR interacting with Environment
: Persistent Excitation
: Function Approximation Technique
: Model Reference Control
: Model Reference Adaptive Control
: Strictly Positive Real
: Uniformly Ultimately Bounded
258
Tr(A)
a
a
a
a
A
i ( A )
min ( A )
max ( A )
a
a
A
A
min{.}
sup(.)
diag{}
: trace of matrix A
: estimation of scalar a
: error between a and a
: estimation of vector a
: error between a and a
: estimation of matrix A
li
mi
Ii
B
Bi
C
Cx
D
Dx
Fext
g
gx
H
i
id
J
Ja
: length of link i
: mass of link i
: moment of inertia of link i
: actuator damping matrix
: desired apparent damping
: vector of centrifugal and Coriolis forces
: C in the Cartesian space
: inertia matrix
: D in the Cartesian space
: external force
: gravitational force vector
: g in the Cartesian space
: electro-mechanical conversion matrix
: armature current vector
: desired current trajectory
: actuator inertia matrix
: Jacobian matrix
K
Kb
Ki
L
Mi
p
px
q
qd
R
x
xd
Y
a
t
d
e
e
ei
em
e
kw
Kd
Kp
K
P, Q ,
s
s
sat(.)
sgn(.)
u
u
: disturbance
: error signal
: error vector
: current error vector
: output tracking error vector
: torque tracking error vector
: stiffness of the wall
: gain matrix for velocity error
: gain matrix for position error
: conversion matrix
: positive matrices
: sliding surface variable
: error vector
: saturation function
: signum function
: control input signal
: control input vector
260
v
V
w
W
z
Z
robust
Index
EDRRE, 2, 4, 76
Equilibrium point, 36, 37, 51, 52, 57
Harmonic drive, 5
Hilbert space, 11, 15, 16, 31, 117
Impedance control, 4, 129, 201
Inner product space, 14, 15
Invariant set theorem, 35, 37
Inversion-based controller, 40
262
Index
Matrix norms, 29
Model reference adaptive control, 11,
45
MRAC, 11, 45, 46, 48, 50, 58, 62
MRC, 46, 49, 58, 165, 204
Nonautonomous system, 36, 37, 39,
63
Normed function space, 30
Normed vector space, 14, 15
Orthogonal functions, 11, 17, 18
Orthonormal function, 18, 24, 31
Output tracking loop, 102, 106, 147,
223
PE, 47, 50, 87
Persistent excitation, 50, 51
Polynomials
Taylor, 19, 24
Chebyshev, 19, 24
Legendre, 20, 24
Hermite, 20, 24
Laguerre, 21, 24
Bessel, 22, 24
Radially unbounded, 37, 38, 50
Real linear space, 12
Regressor matrix, 1-8, 83, 85, 88, 89,
134, 186, 207
Rigid robots, 2, 83-85, 87, 129
Robust adaptive control, 54
RR, 2, 71, 140
RRE, 2, 4, 73