Beruflich Dokumente
Kultur Dokumente
Tu
Dynamical Systems
An Introduction with Applications
in Economics and Biology
Second Revised
and Enlarged Edition
Springer-Verlag
Berlin Heidelberg N ew York
London Paris Tokyo
Hong Kong Barcelona
Budapest
Professor Dr. Pierre N. V. Tu
Department of Economics
The University of Calgary
2500 University Drive N.W.
Calgary, Alberta T2N IN4
Canada
This work is subject to copyright. All rights are reserved, whether the whole or part of the mate-
rial is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recita-
tion, broadcasting, reproduction on microfilms or in other ways, and storage in data banks. Du-
plication of this publication or parts thereofis only permitted under the provisions of the Ger-
man Copyright Law of September 9, 1965, in its version of June 24,1985, and a copyright fee
must always be paid. Violations fall under the prosecution act of the German Copyright Law.
© Springer-Verlag Berlin· Heidelberg 1992, 1994
The use of registered names, trademarks, etc. in this publication does not imply, even in the ab-
sence of a specific statement, that such names are exempt from the relevant protective laws and
regulations and therefore free for general use.
42/2202-5432 1 0 - Printed on acid-free paper
Preface to the Second Edition
The favourable reception of the first edition and the encouragement received
from many readers have prompted the author to bring out this new edition. This
provides the opportunity for correcting a number of errors, typographical and others,
contained in the first edition and making further improvements.
This second edition has a new chapter on simplifying Dynamical Systems
covering Poincare map, Floquet theory, Centre Manifold Theorems, normal forms
of dynamical systems, elimination of passive coordinates and Liapunov-Schmidt
reduction theory. It would provide a gradual transition to the study of Bifurcation,
Chaos and Catastrophe in Chapter 10. Apart from this, most others - in fact all
except the first three and last chapters - have been revised and enlarged to bring
in some new materials, elaborate some others, especially those sections which many
readers felt were rather too concise in the first edition, by providing more explana-
tion, examples and applications. Chapter 11 provides some good examples of this.
Another example may be found in Chapter 4 where the review of Linear Algebra has
been enlarged to incorporate further materials needed in this edition, for example
the last section on idempotent matrices and projection would prove very useful to
follow Liapunov-Schmidt reduction theory presented in Chapter 9.
The purpose of this book is to equip students with the essential tools needed
for understanding the literature and doing research in the field of dynamic analysis,
not to teach economic or biological theory in various areas. Examples, illustrations
and applications have consequently been chosen among the simplest and best knmvn
ones, in order not to divert students' effort to increasingly sophisticated economic
and biological theory, and away from the main objective of acquiring the tools needed
for theory.
Students can start from anywhere depending on their background and interest.
The first six chapters could provide materials for the first half course: in fact, these
cover about half of Economics 304, and the remaining chapters are basically the
lecture materials for Economic 505, a half course in Dynamic Economics at the
University of Calgary.
In the preparation of this second edition, I have benefited from many readers'
feedback, comments and suggestions, for which I am thankful. J. Longworth deserves
highest commendation for her amazing skill and exemplary patience in correcting,
editing, typing and retyping the book. Last but not least, I am grateful to my wife
Elise, for her continuous encouragement and active participation in proof-reading
and diagram drawing. The remaining errors are, of course, my sole responsibility.
Pierre N.V. Tu
The University of Calgary
February 1994
Preface to the First Edition
Students can start from anywhere. Those who only need to know scalar differen-
tial and difference equations will find the first three chapters adequate and stop after
them. Those who do not need these elementary reviews can skip the first three or
four chapters. Those who are familiar with linear DS can skip the first six chapters
and start at Chapter 7.
Finally, only deterministic DS are studied in this book. Stochastic DS,
although increasingly used, is a field in itself and would require a separate book.
In the preparation of this book, I have benefitted from discussion with several
colleagues. Ngo Van Long in particular should be thanked for his patience in reading
the first draft and his encouraging comments. The students on which this book was
tested have all made their contribution, by their enthusiastic response and their
searching questions, to the improvement of the clarity and quality of my exposition.
Joanne Longworth deserves more than a casual commendation for her typing and
computing skill and her patience in dealing with several corrections. Last but not
least, I must here record my appreciation to Elise, my wife, for her encouragement
and especially for her active participation in the proof reading and diagrams drawing.
Needless to say that I alone am responsible for any remaining errors.
Pierre N.V. Th
The University of Calgary
January 1992
Contents
Preface v
1 Introduction 1
Bibliography 295
List of Frequently Used Abbreviations
and Symbols
1. Abbreviations
2. Symbols
II
;=1
= product of n terms
L
;=1
= summation of n terms
E = surface of a section
( ,) = inner product
sgn = sign of
II II = norm
r, tr(A) = trace A = sum of diagonal terms of A
a(A) = spectrum of A
>'(A), >. = eigenvalues of A, eigen value >.
Re(>.), Im(>') = Real part of eigenvalue >., Imaginary part of >.
N(A) = nullity of A
kerA = kernel of A
R(A) = range of A
IAI or detA = determinant of A
codf = co dimension of f
r(A) = rank of A
Chapter 1
Introduction
A dynamical system (DS) describes the evolution over time of all points in a
given subspace of an n-Euclidean space. For example consumption c = (Cl, ... , en)
is a point in the commodity space. Similarly, price p E R+ is a point in the price
space and Walras' law p = kE(p) describes the evolution over time of prices in
response to excess demand E{p), with speed k where k = diag{kb k2 , ••• , kn ).
The term "evolution" implies position and velocity: at any point xES eRn,
a velocity vector may be derived (by simply differentiating x). The set of all such
velocity vectors in S is called a velocity vector field. The vector field will be assumed
smooth, i.e. continuous and continuously differentiable as many times as required.
Thus, from a knowledge of the initial position (xo) and velocity (xo) of a state, the
DS tells us where x will be and has been at any time t E (-oo, 00) i.e. the DS gives
the whole past and future life of x.
The evolution of x over time in the state space S is a flow ¢(t,x) : R x S -+
Rn where S C Rn at the various times t E (a, b) C R, or written differently,
¢t{x) : S -+ Rn, taking x into Xt. Clearly Xt = ¢t{x) == ¢(t,x) = position at
t of a path starting at x, and ¢o (x) == ¢( 0, x) = Xo by definition. The flow ¢t
satisfies the group properties ¢o = id and ¢t 0 ¢8 = ¢t+". This means that the
state y = ¢s{x) into which x goes after time s will itself go to state z = ¢t{Y) after
time t, i.e. z = ¢d¢8{X)] == ¢t 0 ¢8{X) = ¢t+8{X). In particular, for s = -t we have
¢t+8 = ¢t 0 ¢-t = ¢o = eO = id (identity). Thus ¢t and ¢-t are inverse of each other.
By an equilibrium or fixed point of a flow ¢ is meant ¢t{x) = xVt E R, i.e. a phase
point is itself a phase curve.
Given the DS
(1.1)
f is said to generate a local flow ¢t : S -+ Rn where ¢t{x) == ¢(x, t) is a smooth
function defined for all xES and t E (a, b) E R. For example, x = Ax gives rise to a
flow map ¢(x, t) = eAtx : Rn -+ Rn, i.e. eAt defines on Rn a flow which is generated
by the vector field Ax. Thus, f{x) is the tangent vector to the curve ¢t{x) at t.
Similarly, it can also be said that a DS ¢t on S generates a differential equation
system x = f{x), i.e.
(1.2)
where f can be considered a vector field on Rn, since to each point x E Rn, a
vector f{x), representing the velocity of the flow ¢t{x), is assigned. In other words,
Xt = ¢t{x) is a curve in an n-Euclidean space whose tangent (x) is equal to f{x).
Solving (1.1) for x{O) = Xo means finding a curve or flow ¢t{xo) passing through
Xo at t = 0 and satisfying (1.1) at all times. It is an integral curve of (1.1). The
family of all such curves forms a phase portrait of (1.1). Do solutions exist and how
many are there? If f is locally Lipschitz, i.e. if If{y) - f{x)1 ~ K{y - x) for x '" y
2
and K is the Lipschitz constant for f, then the solution exists and is unique. More
precisely
Note that in some cases, flat Euclidean spaces will be inadequate and curved
spaces (manifolds) will be necessary. The state space is now curved, and the velocity
vector will "stick out" of the space. All tangent curves passing through a point will
lie in the same plane and form the tangent space at that point. For example the
surface of a table on which a tennis ball sits is the tangent space at that point of
contact. Note also that, although tangent vectors stick out of the curved surface,
the integral or trajectory of a vector field stays in it. To see this, roll the ball slightly
so that the table surface becomes the tangent space of another nearby point of the
ball. Now instead of rolling the ball, we let these tangent spaces through each point
of the ball wrap around the ball, then the integral curves thus obtained stay on the
curved surface of the ball.
Where do these trajectories of the solution curves go? In the end, some con-
verge, either monotonically or periodically to a limit which could be a point or a
set of points such as a circle: these are stable. Some others will move away from
equilibrium, either to come back to it (for example the saddle connexion or homo-
clinic orbit) or never to be seen again (unstable). Some do not converge to any
limit: they come very close to an equilibrium point only to veer away from it (for
example all trajectories of a saddle point except those on the stable and unstable
arms). Some are attractors, attracting to themselves neighbouring curves (for exam-
ple stable Limit Cycles), others are rep ellers, repelling neighbouring curves. Some
go around and around for ever (such as the neutral stability of harmonic motions).
Some are locally stable, converging to an equilibrium from a nearby initial point,
others are globally (Liapunov) stable, converging to equilibrium from any initial
position. Some bifurcate into several branches, leading to chaos, some others display
discontinuous catastrophic jumps in response to an infinitesimal change of some pa-
rameter. These are all subject matters of DS. This book is devoted to an elementary
discussion of these, starting from differential and difference equations, proceeding
to some nonlinear systems such as Gradient, Lagrangean and Hamiltonian Systems,
then Bifurcation, Chaos and Catastrophes as well as optimal DS, illustrating the
discussion with some applications in Economics and Biology.
Before proceeding, let us note the difference between flows and maps, associated
respectively with continuous and discrete DS. In the first case - which has been
discussed so far - the time variable t is continuous (t E R) and the DS is represented
as
~~ == x = f(x). (1.1)
3
Its solution can be represented as a flow whose velocity is given by the vector f(x),
as discussed above. In the second case, the variable time t is discrete (t E Z where
Z is the set of integers). In fact t is continuous but observations of Xt are made
and recorded at discrete points of time. For example, GNP is continuously evolving
over time but its measurements are taken and recorded once a year, or once every
quarter. This gives the discrete DS
(1.3)
When these derivatives are partial, (for example ~~, ~:) they are called partial
differential equations. Only ODE will be dealt with here.
Definition 2.2. The order of an ODE is given by the highest order derivative in-
volved. For example (2.1) is an nth-order ODE. The degree of an ODE is given by
the highest exponent of any derivative. For example, writing x == dx / dt, x == lfx / dt 2
etc ...
x+ax3 +bx+c=O (2.2)
is an ODE of order 2 and degree 3.
Definition 2.4. An ODE containing terms of a higher degree (than the first) is
called a non-linear ODE. For example
x + ax + bx2 = 0
x + ax3 + bxt = O.
6
Definition 2.5. By a solution of an ODE is meant any relation between the vari-
ables not involving derivatives and satisfying the ODE. The solution is also called
the integral curve of an ODE. For example the solution of x = ax for x(O} = xo
is x(t} = ceot = xoeot where c is an arbitrary constant to be determined by initial
conditions. Here c = xo since at t = 0, x(t} = ceo = c = xo. Note that the number
of initial conditions must be equal to the order of the ODE. Once these initial con-
ditions are given, there exists only one solution curve (of the solution family) that
goes through a specific point.
Proof. The proof makes use of the Contraction Mapping theorem, itself an ap-
plication of Banach's Fixed Point Theorem, which is beyond this level. See any
standard textbook on Ordinary Differential Equations such as Pontryagin (1962),
Kaplan (1958), Coddington & Levinson (1955) among others. This theorem guar-
antees both the existence and uniqueness of the solution x(t} = cp(t}, by virtue of
the Contraction Mapping. Thus if there exists any other solution t/J(t} to x = f(x, t}
satisfying x(to} = Xo, for a < t < b where I = [a, b], then cp(t} and t/J(t} are identical
curves.
is
x(t) = xoe- at . (2.7)
Proof. Writing (2.6) as dxjx = -adt and integrating, gives lnx = -at + c and
e1nx == x(t) = e-ate c where eC == A, some arbitrary constant to be determined by
initial conditions Xo. This gives eC = Xo = A.
If a = 0, integration gives x(t) = c, some constant which, with the given initial
conditions x(O) = xo, gives x(t) = xo, i.e. x(t) maintains its initial value Xo over
time.
X(t) X(t)
~a<o
Xel-------- b/a 1-------- b/a = Xe
a<O
L...-------t L...-------t
As can be seen (Figures 2.1 and 2.2) the deviation xc(t) = (xo - x.)e- at from its
equilibrium level b/a decreases over time when a > 0 and increases over time when
a < o.
Remark 2.3. In terms of Remark 2.2, we can solve (2.8) directly by finding its
equilibrium,for x = 0 i.e. ax = b giving xp = b/a = x •. As for the complemen-
tary function, theorem 2.1 (or 2.2) assures us that the solution takes some form
9
xc(t) = Ae>.t. We can therefore assume it is of this form and x(t) = Ae>.t, and hence
x = >.Ae>.t for some constant, yet undetermined>., then write the LHS of (2.8) as
x + ax == Ae>.t(>. + a) = o. (2.11)
Since Ae>.t =f: 0, >. = -a and xc(t) = Ae-at as required. The complete solution then
is
x(t) = xc(t) + Xe = Ae-at + b/a (2.12)
which is (2.9).
A is given by the initial condition x(O) = Xo, i.e. at t = 0, x(t) = Xo = Aeo+b/a =
A + b/ a giving A = Xo - b/ a == Xo - Xe which is the initial deviation of the trajectory
x(t) from its equilibrium at t = o.
Clearly limHoo x(t) = 5 + 0, i.e. the initial deviation Xo - Xe = 3 from its equilibrium
level b/a = 5 decreases over time and x(t) tends to its equilibrium value Xe = 5.
is
x(t) = Aef a(t) dt
where A is an arbitrary constant to be determined by the initial condition x(O) = Xo.
10
dx
- = -a(t)dt
x
and integrating gives
lnx + c = - Ja{t) dt
or e1nz == x{t) = e-Ce- I a(t) dt == Ae- I a(t) dt. (QED)
is
x(t) = e- I a(t) dt[A + Jg(t)eI a(t) dt dt] (2.16)
1t
Proof. Note that (xel a(t) dt) = el a(t) dt[x + a{t)x] which is just the LHS of (2.15)
multiplied by el a(t) dt. Multiplying both sides of (2.15) by el a(t) dt which is called
the integrating factor (Lf.) gives
Example 2.4.
x +2xft = 5t2.
In this example a(t) == 2ft, iJ. == el a(t) dt = e2 It dt = e1n t2 = t 2.
(2.17) gives xt 2 = I 5t 2t 2dt + c = t 5 + c Le. the solution is xt 2 = t 5 + c.
11
Example 2.5.
. 1
x+-x=t
t
a(t) = t. iJ. = el t dt = elnt (2.17) gives
x(t)elnt = Jelnttdt Jedt
=
x(t) = e- Int Jt dt 2
= e-In t (t; + c) t; + i .
=
The equation
i; + a(t)x = g(t)x n (n i- 0,1) (2.18)
is called the Bernouilli's equation, after James Bernouilli who solved it in 1695. It
is non linear, because of the presence of x n , but could be made linear by dividing
through by xn and defining the new variable y == x 1- n
x-ni; + a(t)x 1- n = g(t)
or, in the new variable y,
Example 2.7. x - f = _t 2 X 3 •
As in (2.19), multiplying through by - 2x- 3 (== (1 - n)x- n ) gives
dx 2 2
-2x- 3 - + _x- = 2t 2 •
dt t
Substituting y == x- 2 gives
dy +~y= 2t2
dt t
which is similar to Example 2.4 above.
-~+--~----x ------~~-------x
Clearly ± > 0 above the horizontal axis i.e. x increases over time as indicated
by arrows going from left to right (see figure 2.3 and 2.4). Similarly ± < 0 below
the horizontal axis, i.e. x decreases over time. If f(x) is an increasing function, the
arrows showing the direction of the velocity of x, rise from left to right above the
horizontal axis and fall from right to left everywhere below the horizontal axis. On
the horizontal axis itself, ± = 0 Le. x remains unchanged: it is an equilibrium or a
fixed point, (see figure 2.3 and 2.4) which is a stable equilibrium if f'(x) < 0 Le. if
f(x) is a decreasing function and an unstable equilibrium if f'(x) > 0 Le. if f(x) is
an increasing function. The case f(x) = ax is a particular case, the linear case, and
follows the same rule. The stable fixed point (where a < 0 or f'(x) < 0 at ± = 0)
is called an attractor in that the system is attracted to it from above and below
the horizontal axis. The unstable fixed point (where a > 0 or f'(x) > 0 at ± = 0,
Le. at the point of intersection of the trajectory with the horizontal axis) is called a
repeller in that the system moves away from it. The fixed point through which the
system goes without reversing its direction, is called a shunt (see figure 2.5 (e) and
U)). Some examples are
x = x(a-bx)
(a,b> 0)
o
--~~----nr----~~--x
p
o
o
o o
o o
°
EO° ~:E E °° JIr
(repellei) (attractor) (repeller)
X =ax 3 x = ax 3
(a < 0 (a >0)
--------~~~--------x --------~-rn~--------x
--------~JIr~o~<~-------x ------~<E--.o~->~------x
Z z=ax 2
(a > 0)
z =ax 2
(a < 0)
--------~~~--------x --------~~~--------x
E
o E
This is a first order constant coefficient linear differential equation (2.8), whose
solution is, by (2.9)
p(t) = , -
(3-8
a+ (po _,- a) e
(3-8
k (fJ- 6)t
(2.23)
which is stable if (3 - 8 < 0, and unstable otherwise. But (3 = slope of demand curve
and 8 = slope of supply curve, and when (3 - 8 < 0, (which is always fulfilled when
demand is downward sloping and supply is upward sloping), the market is stable
i.e. excess demand is reduced and eventually wiped out by rising prices. If (3 - 8 > 0,
it is unstable: continuous and indefinite inflation will take place.
15
Y = k(D - Y) = k(eo + cY + 1 + G - Y]
= k(c -I)Y + k(eo + 1 + G). (2.24)
This is a typical first order linear differential equation of the type (2.8) where
a == k(1 - c) and b == k(Co + 1 + G), whose solution, by (2.9), is
Y(t) = Co+1+G +
l-c
(1'0- Co+1+G)
l-c
ek(c-l)t.
Clearly the stability condition is c - 1 < 0 i.e. the marginal propensity to consume,
c, is less than unity.
For the non-linear increasing consumption function. C = C(Y) where C' > 0,
(2.2.4) is
Y = k(D - Y) = k[C(Y) - Y + (1 + G)] == kf(Y)
where f(Y) == C(Y)-Y +1+G. Clearly, the modelisstable if f'(Y) == C'(Y)-1 < 0
and unstable if f'(Y) > 0, i.e. stable if C' < 1 or the marginal propensity to consume
(C') is less than one. At Y = 0, f (Y) = 0 i.e. C (Y) + 1 + G = Y, aggregate demand
(D) = aggregate supply (Y).
I(Y)
-.orl-----------~~---------Y
Harrod (1939) and Domar's (1946) growth models are often presented together
in view of the similarity of their results, although they are different. They are
interesting applications of differential equations. Harrod assumes (i) Saving (8)
is a linear increasing function of income, 8 = sY(O < s < 1), (ii) capital K(t)
accumulates in response to increases in income, K == 1= vY where v is a positive
constant capital output ratio and (iii) 8 = I this gives
8= sY= vY = I
or
Y= (s/v)Y (2.25)
whose solution, by (2.7) is
Y(t) = Yoe(s/v)t
where s / v is the ''warranted rate of growth g.. " of the economy, the rate which keeps
producers happy with their investment decisions.
The model is "unstable" not in the sense that Y(t) -+ 00 as t -+ 00 but in the
sense that, should income rise less quickly than would be required to warrant the
level of investment, then there would be overproduction (i.e. production in excess of
what could be sold). If, on the other hand, income (production) rises more rapidly
than gw, there would be underproduction. Thus if production increases too fast, we
would have produced too little. The gap between the actual (gt) and warranted (gw)
rate of growth would be widened overtime and hence the economy is unstable.
Many attempts have been subsequently made to formalize this "paradoxical"
result, leading to different conclusions. Jorgensen (1960) confirms instability and
Rose (1959) proves stability. We shall only present Rose's argument (see also Hahn
Mathews 1964) which provides in itself an interesting application of first order dif-
ferential equations. Let the actual rate of growth of capital be gt == Kt/ K t and K;
be the desired capital stock. Suppose at date t, producers, being caught short of
capital, want to catch up in T periods from now and expect output to grow at the
warranted rate gw' Then
or
InKt + gT = InKt + gwT
gt - gw == ! (InKt -lnKn = -~ (InKt -lnKn
or
. 1
x=-f x (2.26)
t = -k Yoe rt + A == Do + -k Yo (rt
D () e - 1)
r r
and the debt ratio DIY is
D(t) = (Do _ ~) e- rt + ~.
Y(t) Yo r r
Clearly this is stable.
In an economy where profit (1r) is a decreasing function of the capital stock (K)
and investment (I == K) is an increasing function of profit, the behaviour of capital
stock could be described as
1r = -{3K
K = 0'1r + A = -O'{3K + A (2.28)
18
The instability of Harrod-Domar's economy partly stems from the rigidity of its
technology: its production function, of the type Y = min (~ , ~) does not allow
any factor substitutability. This is corrected by Swan (1956) and Solow (1956) in
their neo-classical model which is based on the following assumptions.
(i) Labour (L) grows at a constant rate n i.e. tiL = n
(ii) All saving S = sY are invested in capital (K) formation I = 1< + 8K (s,8
are constant positive fractions)
(iii) Production takes place under constant returns conditions:
k 1< t sY
k= K - L= K - (8 +n)k
(2.30)
which is the Bernouilli equation (2.18). Defining x == k1- 0 and substituting gives
Coming back to the general case of y = / (k), we can use a phase diagram to
solve the problem qualitatively, as follows
k = s/(k) - >'k
>'k
--~--------------~~----k
o
~4~O~~------------~~~k-·~~~--k
where g{t), ai{t) (i = 0,1,2, ... , n with ao{t) =F 0) are any differentiable functions of
time and xn = = =
{d n/dtn)x with i: {d/dt)x, x (rP /dt 2 )x etc ...
Using the operator Di = di/dti{i = 0,1, ... ,n) on x, for example
= = = =
DOx {~/dtO)x x; Dx {d/dt)x, D 2 x (rP/dt 2 )x, we can write (2.31) as
or
L{D)x(t) = g(t) (2.33)
where L(D) represents all the terms inside the square brackets of (2.32).
20
Xl
X, Xn
X' )
W(t) == det (
:1 n -::F O (2.34)
n-l X~n-l)
Xl
For example if the solutions of (2.31) are e~lt, e~lt, . .. ,e~ .. t where A; are all distinct,
then W(t) -::F 0, since
e~lt e~lt e~ .. t 1 1 1
Ale~lt A2e~lt Ane~ .. t
W(t) == = e(~l+"+~n)t Al A2 An
Example 2.8. The solution of x -6£ + ll.i - 6x = 0 is x(t) = (e t , e2t , e3t ) which
has the Wronksian W(t) (-::F 0)
et e2t 33t 1 1 1
W(t) = e 2e 3e = e 1 2 3 = 2e6t
t 2t 3t 6t -::F O.
et 4e 2t ge3t 149
will be reviewed in this chapter, with particular attention being devoted to the
constant coefficient case where a, b, c are constant, with a = 1 and also d(t) = d
constant, i.e.
x + bi; + ex = d. (2.36)
We shall examine the complementary function xc(t) and the particular integral
Xp separately, concentrating on the case of second order constant coefficient.
The complementary function xc(t) is obtained by finding the x(t) which satisfies
L(D)x = O. In the light of theorem 2.2 (see also Remark 2.3), we can try a solution
of the form x(t) = eAt. Substituting into (2.38) gives L(D)x = 0 on
eAt (A2 + bA + e) = O.
Since eAt =I 0, the characteristic equation A2 + bA + e = O. This gives the
t(
solution A = -b ± Jb 2 - 4c) == (AI, A2)' Thus eAit (i = 1,2) are solutions and so
is Ale A1t + A2e A2t where AI, A2 are arbitrary constants to be determined by initial
conditions. Clearly b2 - 4c ~ 0 and three cases must be examined separately.
>
Case (i) b2 - 4c > 0: The characteristic equation has two real and distinct roots
(i.e. Al =I A2, AI, A2 real).
(2.38)
t(
Case (ii) b2 - 4e = 0, A = -b ± 0) = -b/2 = Al = A2: the characteristic equation
has two identical roots A = -b/2. The solution to be tried is now x = teAt, not eAt
since this would lead to x(t) = Ae At where A = Al + A2 and A is determined by
two initial conditions Xo and x'(O). Differentiating i; = (1 + At)e At , x = (2A + A2t)e At
and substituting into L(D)x = 0 gives
(2.39)
where a == Re(A) == -b/2, the real part of A and f3 == ";4~-b2 , the imaginary part of
A and i 2 = -1.
The complementary function is
22
(2.41 )
Theorem 2.5. The second order linear differential equation with constant
coefficients of the form
(i) (2.44 )
if Al and A2, the roots of c(A) = 0, are real and simple (i.e. distinct);
(ii) (2.45 )
if Al = A2 == A al'e real and equal i.e. real and of multiplicity 2 (see Ch. 4).
(iii) X(t) = eat(Bl cos f3t + B2 sin f3t) (2.46)
== eat A cos{.f3t - f)
if the roots of C(A) = 0 is a pair of complex number i.e. Al = o:+if3, A2 == 'Xl = o:-if3
where of course, 0: == -b/2 = Re(A) and f3 == ~J4c - b2 == Im(A) and i = p.
( ~~ ) = ~ ( ~: ~ ~~~ ) . Thus Al and A2 are conjugate complex numbers, i.e. A2 = AI. But
then BI = Al + A2 and B2 = i(AI - A 2) will be real numbers. For example if Al = a + ib,
A2 == Al = a - ib then BI = a + ib + a - ib = 2a and B2 = i(a + ib) - i(a - ib) = -2b. Hence BI
and B2(== i(AI - A 2» are both real numbers.
2put BI = Acos!, B2 = A sin! we have tan! = sin, = BB, i.e. c = tan- I (~BB ). ~B< B2
cos 1
=
f
=
1 1
But BI cos /3i + B2 sin Oi A( cos c cos /3i + sin c sin /3i) A cos(/3i - c) which is a trigonometric
oscillation with period 27r / /3 and amplitude A. Note that had we defined B2 = -A sin c, we
would have, for (2.46), xCi) = eat A cos(/3i + c) (as in the first edition), which is the same, since
cos c = cos( -f).
23
Note that AI, A2 and B 1 , B2 are all real arbitrary constants, to be determined by
initial conditions and ACOSf == B I , Asinf = B 2, A ±.JB? + B?,
f == tan-I(Bd Bd. (See footnote 2). Note also that a and f3 are both real numbers.
Note that (2.42) is the case of homogeneous differential equation. If (2.42) is
L(D)x = d then the above results in (2.44), (2.45) and (2.46) are the solution to
the homogeneous part L(D)x = 0 or to the complementary function L(D)x = O. In
this case, to avoid confusion, the solution x(t) in (2.44), (2.45) and (2.46) should be
written as x c ( t) since the complete solution is
(2.48)
Example 2.11. x - 6± + 9x = O.
The characteristic equation A2 - 6A + 9 = 0 gives A = 3 ± v'o this is case (ii) of
repeated roots. The solution is
Example 2.12. x + 25x = 0; A = ±5i, the solution is, by (2.46) case (iii) is
x(t) = BI cos 5t + B2 sin 5t. This is a particular case of (iii) where a = 0,
x(t) = Acos(f3t - f), the solution x(t) fluctuates for ever: it is said to have neutral
stability.
xp == # 0).
Xe = die (e
If e = 0, try Xp = kt which gives xp = k, xp = O. Substitution into (2.38) gives
bk = d i.e. k = dlb and xp = kt = dtlb. If b = 0 = e, try xp = ke, differentiate
xp = 2kt; xp = 2k, and substitute into (2.38) gives 2k + 0 + 0 = d or k = d/2,
.I.e. xp = kt 2dt
2' = 2
xp = die if e #0
Xp = dtlb if e = 0 (2.49)
dt 2
Xp = -2- if e = 0 = b.
25
When the function g(t) on the RHS of L(D)x = g(t) is some function oftime, the
most commonly encountered being polynomial, exponential and trigonometric func-
tions and their combinations - several methods could be used to find the particular
integral. The most commonly used are the following.
xp = at 3 + bt2 + ct + d + pe 2t t 2
xp = 3at2 + 2bt + c + 2pe2tt2 + 2tpe2t
x = 6at + 2b + 4pee2t + 8pte2t + 2pe2t .
Substituting into the original equation gives
(D2 - 4D + 4)x = 4at3 + (-12a + 4b)t2+ (6a - 8b + 4c)t + (2b - 4c+ 4d) + 0 + 2pe2t.
Equating coefficients gives
4a = 1 ~ a = 1/4
4b -12a = 0 ~ b = 3/4
6a - 8b + 4c = 2 ~ c = 13/8
2b - 4c + 4d = 3 ~ d = 2
2p = 1 ~ p = 1/2.
t
Thus xp = ~ + ~ t 2 + 1:t + 2 + e2t . The complete solution is x(t) = xe(t) + xp(t)
Y
i.e. x(t) = (A1 + A2t)e2t + ~ + ~ t 2 + t + 2 + e2t . t
26
L(D)x = g(t).
± + ax = g(t)
(D + a)x == L(D)x = g(t)
1 1
xp = L(D) g(t) == D + a g(t)
i.e.
where AI, A2 are the roots of e(A) where e(A) == A2 + bA + e = (A - Ad(A - A2) = o.
Repeated integration gives
27
Example 2.18. Ii - 5x + 6x == et
or L(D) == (D2 - 5D + 6) = 1 - 5 + 6 == L(a) since a = 1 in this example
(g(t) == et = eat).
g(t) g(t) et
xp = L(D) = L(a) = 2'
This is example 2.15 using the inverse operator method. It is easy to check this
result.
Note that L(a) '" 0 i.e. a must not be equal to any root of the characteristic
equation c(>.) = 0, otherwise L(a) = 0 and this method would involve division by
zero. For example, if g(t) in this example is g(t) = e2t then L(D) = L(a) = 0, since
c(>.) = >.2 - 5>' + 6 = (>. - 2) (>. - 3) = 0 i.e. >'1 = 2 = a. In this case using repeated
integration gives
etc ....
Source W. Kaplan (1958, p. 165-166) (where the table gives 21 functions).
29
This consists in taking the Laplace transform of the differential equation, solving
it and using the inverse transform to get back to the original unit. This method
has the advantage of solving, not only the particular integral (xp(t)) but also the
complementary function (xc(t)) in the process.
Although a treatment of Laplace transform would carry us too far away from
the main theme, this can nevertheless be briefly, very briefly indeed, introduced and
its role in the solution of ODE be shown.
The Laplace transform of any function f(t) is defined as
where s > 0 to ensure the convergence of the integral. Some of the more commonly
encountered are f(t) = (1, t, tn, eat). Carrying out the calculation, frequently using
integration by parts and partial fraction, gives
(2.53)
es-a) 100 1
(iv) T(e at ) = F(s) == 10o 00
e-steat dt = - - - = - - (s > a).
s-a o s-a
i.e. the Laplace transform of a sum is a sum of the transforms of individual functions.
The inverse transform of F(s), designated by T-l[F(s)] is a function f(t) such
that L[J(t)] = F(s). For example T-l(l/s) = 1 since T(l) = l/s T-l(1/s 2 ) = t
since T(t) = 1/s2 etc ...
Our interest in Laplace transform in this context lies in its application to the
solution of O.D.E.
30
Note that the above steps are greatly facilitated in practice by use of Laplace
transform tables, available in most text books on O.D.E.
x(t} = T- 1 C: + 8! J
1 = 2et + 3e- t .
.
Example 2.21. x - Ax, Xo = ( 1) -1 ; A == [3 -2]
4 -1 .
[ 8 ~43 8 ! 1] [ :~ ] = [ _ ~]
or (81 - A)x = Xo whose solution for x is, by matrix inversion (see Ch. 4)
__ [ Xl ] _
X = X2
1
- 82 _ 28 + 5
[8-8 ++ 37 ] =_ (81 - A )-1 Xo
1.e.
8+38+3
Xl = == -:----:-::-~
82 - 28 + 5 (8 - 1)2 + 4
-8+7
X2 = .
82 - 28 + 5
Taking the inverse transform, by use of Table, gives
-1 ( 8 + 3 )
X1(t) = T 82 _ 28 + 5
= et cos2t + 2et sin 2t = et(cos 2t + 2 sin2t)
( -1 ( 8 +7 )
X2 t) = T 82 _ 28 + 5
= _e t cos2t + 3et sin 2t = et(-cos2t + 3sin2t)
Note that if the initial conditions are zero, i.e. Xo = ( ~) we have (81 -A)x = 0,
i.e. 8 is the eigenvalue of A, and X == (X1,X2)' is the corresponding eigenvector.
(See Ch. 4). Thus det(81 - A) = 0 => 8 = 1 ± 2i.
The treatment of the case of second order differential equations could be extended
to the nth order (n > 2) in a natural way.
Consider a typical nth order linear differential equation with constant coefficients
(2.54)
with ao :F 0 and Dr == d'" jdtr(r = 0,1, ... , n with DO == 1) and the associated
characteristic equation
(2.55)
n
C(A} = E (A - Ar) = o. (2.58)
I
If we associate a function Xr = Xr(A r} to each root Ar(r = 1,2, ... , n}, then the
solution is a linear combination of Xr, i.e. the solution is
(2.59)
where CI, C2, ... , Cn are arbitrary constants to be determined by initial conditions and
thexr(A r} are
(i) e~rt for each simple real root Ar of C(A} = 0
(ii) e~rt, te~rt, t 2e>'rt, . .. , tk-Ie~rt for each root Ar of multiplicity k > 1,
(iii) eOrt cos /3rt and e Ort sin /3rt for each simple (non-repeated) pair of complex roots
Ar = Or ± i/3r
Thus theorem 2.5 emerges as a special case of this theorem. For a formal proof,
see any text books on differential equations, for example, Coddington and Levinson
(1955), Maxwell, book 4 (1968) Kaplan (1958).
gives A = (±i, ±i, -1 ± i) i.e. one simple pair of complex roots -1 ± i and one pair
of complex roots of multiplicity 2. This is case (iv). The solution, by (iv), is
It can be seen that if All A2 < 0 in case (i) and A < 0 in case (ii), then as t ~ 00,
A1e'\lt + A 2e'\2 t ~ 0 and x(t) ~ dlc, i.e. limHoo x(t) = dlc: the system is by
definition, stable. Note that this is called asymptotically stable. For case (iii)
Bl cos fJt, B2 sin fJt and A cos(fJt + t") are all circular functions which fluctuate end-
lessly with period 271"1 fJ and amplitude B 1, B2 and A respectively. On multiplication
by eQt , these fluctuations are damped over time if a: < 0, magnified if a: > O. Hence,
for all cases the stability conditions are simply a: < O. But a: is the real part of the
eigenvalue, A = a: ± ifJ, this amounts to saying that Re(A) < O.
Of course, for the case of real A, fJ = 0 i.e. A = a: ± ifJ = a: + 0, and thus the
rule for all cases is simply Re(A) < 0 where Re(A) = a: in the case of complex root
A = a: ± ifJ and Re(A) = A, the root itself, in the case of real root A.
34
Consider a simple closed economy in which national income (Y) rises in response
to excess aggregate demand (D) and interest (r) rises in response to excess money
demand L(Y), i.e. Y = h(D - S) and r = m(L(Y) - M). Aggregate demand
(D) is consumption (G) and investment (I) both assumed to be linear functions i.e.
G = cY(O < c < 1) and I = -ar where c, a > o. Aggregate supply S is national
output, i.e. S = Y. Money demand is assumed to be a linear increasing function of
income, L(Y) = kY (k > 0), i.e. money is demanded for transactions purposes only,
in the spirit of Fisher's quantity theory of money which was subsequently developed
by the Cambridge School as represented by Marshall and Pigou. Money supply (M)
is assumed to be undertaken by the Central Bank. Defining the marginal property
to save (s) to be s == 1 - c, and setting the constant speeds of adjustment hand m
at 1, i.e. h = 1 = m for simplicity, we can describe the economy as
or
Y + sY + akY = aM
the solution of which, by, (2.44) is
differentiating,
j=Y+s¥. (2.63)
Equating (2.63) to (2.60) gives, on substitution for I from (2.62)
Y+s¥ =v¥-I
or
Y + (1 + s - v)Y + sY = Co. (2.64)
The solution of which, by (2.38) is
Y(t) = s-lCo + A1e.\lt + A 2 e.\2 t
3. Stabilization Policies.
The fluctuations of the economy can be brought under control by varying the
level of government expenditure (G) every time this falls short of some desired level
(G*), i.e.
G = fl(G· - G)
or
(D + fl)C = flC·
C = _fl_G*
D+fl
36
Y= h(X - Y)
or
h
Y= D+hX
where X = (1 - s)Y + G + eo(O < s < 1) = eo a constant) this gives
Differentiation gives
The Walrasian tatonnement model where price rises in response to excess demand
overlooks the role of stock in the equilibrium price movement, either because all
goods are assumed perishable or goods at different dates and locations are considered
to be different goods, a. la Debreu (1952). When stock plays an important role in
pricing, it must be modelled explicitly.
Consider the case where merchants raise price (P) as his stock (Q) decreases to
some critical level (Q), i.e.
where .\ = ±Jk((3 - 8) which is a saddle point .\1 < 0 < .\2 if 8 < (3 and a complex
root if (3 < 8. In this case
2.6. Conclusion
This chapter was devoted to a brief and yet fairly comprehensive review of the
theory of ODE of first and higher order. It is bound to be selective: only those parts
of ODE which will be needed in later chapters were treated. Examples and some
selective economic applications were provided. This selection is rather difficult since
ODE are commonly used in almost every field of Economics. It is hoped that this
review of ODE provides a useful background and a handy reference for use in later
chapters.
Chapter 3
Review of Difference Equations
3.1. Introduction
In Chapter 2, we discussed ODE which involve a variable x(t) and its derivatives X, X,
x which give continuous rates of change. In this chapter, we are dealing with Difference
Equations (d.e.) involving a variable Xt and its differences ~Xt, ~2Xt etc .... The variable
in question varies discretely, or more correctly, although it changes continuously, the
observations of these changes are made and recorded only at intervals. For example, if Xt
is gross national product at time t, it is measured only once a year, say on 31 December and
recorded on that day. They are called difference equations since they involve differences
in functions. For example, if Xt = f(t) the first difference is
~Xt = Xt+! - Xt = f(t + 1) - f(t)
~Xt+! = Xt+2 - Xt+! = f(t + 2) - f(t + 1)
the second difference is
Definition 3.2. By solution of a d.e. is meant all values of Xt not involving differences
and satisfying (3.2).
It could be proved that if xc(t) is a solution of (3.2), so is kxc(t) for any arbitrary
constant k; and also if xc(t) and xp(t) are solutions of (3.2), so are their linear combinations
kIXc(t)+k2Xp(t) for any constants kl and k2. Similarly, it could be proved that the solution
exists and is unique.
Finally, note that the solution of a d.e. of order n requires n initial conditions in order
to quantize the n arbitrary constants which appear in the solution.
We shall discuss first, second and higher order d.e. their stability, and provide some
economic applications.
(3.5)
Proof. By iteration,
Xl = axo
X2 = aXI = a(axo) = a2xo
Theorem 3.2. The solution of the first order non-homogeneous d.e. (3.4) is
b
xt=atc+- 1 --a ifa#l
Xt = Xo + bt if a = 1
tXt+! = aXt + b
o Xl = axo + b
1 X2 = aXl + b = a(axo + b) + b
2 Xa = aX2 + b = a3xo + (1 + a + a2)b
Xt = at (xo - _b_)
I-a
+ -1
b
-a
(a ~ 1).
If a = 1, Xn = anxo+bn 2:0- 1 ai = xo+bn for a = 1. Hence Xt = xo+bt if a = 1. (QED)
Remark 3.1. The homogeneous case (3.5) emerges as a special case of (3.6) where b = 0;
and if a = 1, b = 0, Xt = atxo = Xo.
Remark 3.2. Like the ODE, the solution (3.6) has two parts, xc(t) and Xe. The first one,
xc(t) == ate == at(xo - b/(l- a)) = at(xo - x e) is called the complementary function which
is the solution to the homogeneous part Xt+l - aXt = 0 and xp == Xe = b/(l - a), called
particular integral (xp) or equilibrium (xe), which is the particular value of Xt that fits
the given equation (3.4). Xe indicates the "equilibrium" value of Xt, since by definition,
Xt is the same for different t i.e. Xt+! = Xt = Xt-l = Xe in equilibrium. Using this in (3.4)
gives Xe = aXe + b i.e. Xe = b/(l - a) (a ~ 1). xc(t) indicates the deviation at time t, of
Xt from its equilibrium Xe. (See figure 3.1) for the case of 0 < a < 1).
Xc (0 < a < 1)
x. f - - - - - - - - - - - b / ( l - a) = x.
Remark 3.3. In the light of Remark 3.2 and of our knowledge of the solution (3.6), we
may just as well start the solution process by trying the solution xc(t) = c>..t where c is an
arbitrary constant and ). is as yet undetermined, for the homogeneous part. This gives
c).t+! - ca).t = 0 i.e. c>.t(). - a) = 0 giving). = a and thus xc(t) = c).t = cat. For a given
Xo, we have at t = 0,
i.e.
c = Xo - b/(l - a)
which indicates the initial deviation of Xo from its equilibrium value b/(l- a). (See figure
3.1).
=
Remark 3.4. If lal < 1 i.e. -1 < a < 1, limHoo at 0 giving liffit-too Xt 0 + b/(l- a): =
the system is stable in that Xt will converge to its equilibrium value Xe overtime (see figure
3.1). If lal > 1 i.e. a < -lor 1 < a, liffit-too at = ±oo: the system is unstable.
Example 3.1.
Xt+l = 0.5xt + 2, Xo = 10.
The solution, by (3.6) is
Xt = (0.5)t(xo - Xe) + Xe
= (0.5)t(10 - 4) + 4
o < a = .5 < 1 : the system is stable since Xt -t 4 over time.
(3.8
The exact form of f need not be known: so long as some properties concerning it:
slope, curvature (convex or concave), behaviour at Xt = 0 and Xt = 00 are given, th,
equation is solvable qualitatively by use of phase diagrams. This consists of drawinJ
a 45°-line (Xt+l = xd in the Xt+lXt plane and look for the fixed point at which f(xt
intersects this line, then examine its stability. (See figure 3.2).
XI+l
Xl
X2
f(xt}
.
45° :
XI XI
0 Xo Xl
Starting from an arbitrary x(O) = Xo, f(xt) allows us to read Xl = f(xo). The 45°-line
then translates Xl for t = 1 from the vertical to the horizontal axis where Xl is now
taken as the new initial condition for t = 1. X2 = f(Xl) then gives X2 for t = 2 and so
on. Equilibrium, or the graphical or qualitative solution is the fixed point, if this exists,
where f(xt) intersects the 45°-line i.e. where Xt+l = Xt. It is easy to see that, provided
the solution (i.e. the fixed point) exists, stability conditions are If'l < 1. In the linear
case, f' = a and hence lal < 1 is the stability condition (see Remark 3.4). Clearly the
existence of the solution depends on whether f(xt) intersects the 45°-line. In figure 3.2,
it is easy to see that if 1f'1 > 1, the system is unstable: Xt moves away from the 45°-line
as time goes on.
Dt = a + /3Pt
St = / + 8pt-l
where a, /3, /, 8 are constants. While demand D t is a function of current price Pt, supply
St is a function of price prevailing on the market at some previous period, Pt-b due to
production lag. Equilibrium takes place when D t = St i.e. when
P P
Po
PI
S
:~DI"I<l
Po
PI
.~S~I>l
.....
D
D,S D,S
0 0
44
p p
p
I--+--+---'p'----P. I---t-+-+-+--p.
lal < 1 lal > 1
o o
Stability conditions require lal < 1 which is always satisfied if S is flatter than D i.e. if
lal == 16/.81 < 1, where 6 and .8 are, of course, the slope of the supply and demand curves
respectively. It can be seen, from figure 3.3 that, starting from some Po at t = 0, price will
fluctuate around its equilibrium level P. = b/(1 - a) to which it will converge if lal < 1
and from which it will diverge if lal > 1. On the price-quantity diagrams, this looks like
a convergent and divergent cobweb, hence its name.
Ct = cYi-I + Co
It = 7
Yi = Ct + It = CYi-I + Co + 7.
The solution, by (3.6) is
where Ye == Y" = ~ .
Stability conditions require lei < 1. Since 0 < c =MPC we have 0 < c < 1. Thm
given the constant c and Co, even if we do not assume 0 < c < 1 to start with we CaIl
still use Samuelson's (1947) Correspondence Principle to conclude that an increase ir
investment will cause equilibrium national income (Ye) to increase by the static multipliel
effect 1/(1 - c) if the corresponding dynamic model is to be stable, i.e. if 0 < c < 1.
The overlapping generations model of Allais (1947) Samuelson (1958) and Diamoll(
(1965) has been widely applied. It will be presented here as an application of non-linea:
first order d.e. In this model, individuals live two periods, work in the first for an incoml
45
Wt and retire on their saving{s} in the second. Goods do not keep and thus generations
trade with one another. Compared with Ramsey's {1928} model, it arrived at different
results: competitive equilibrium is different from planners' equilibrium, and may not be
Pareto optimal in that people oversave.
Let there be Nt people born at time t, each working in period t, consuming Cit when
he works and C2,t+l when he retires, next period. Population grows at a constant rate n
i.e. Nt = N o{1 + n}t. .
A typical individual derives utility from his consumption in both periods i.e.
{3.9}
{3.11}
(3.13)
whose solution, by the implicit function theorem, gives the saving as a function of income
(Wt) and price: interest rate (rt+1) being the price at which present goods are exchanged
for future goods, i.e.
{3.14}
with
Sw(== as/aWt} > 0 and Sr ~ o.
Firms maximize profit by observing marginal conditions, i.e. by equating marginal
product of labour with wage (wd and marginal product of capital with interest rate (rt)
1.e.
f(k t ) - kd'(k t } = Wt (3.15)
!' (kt ) = rt·
Market equilibrium requires investment Kt+1 - K t to be equal to the saving
Nts( Wt, rt+1} of the young net of the dissaving of the old K t , i.e.
(3.16)
46
(3.17)
aXt + bXt-l + CXt-2 = g(t) given x(O) = Xo and x(1) = Xl. (3.20)
If g(t) = 0, (3.20) is called a homogeneous d.e. or the reduced form of d.e. whose solutio[
is called the complementary function xc(t). If g(t) =j:. 0, it is a non-homogeneous d.e. Th{
particular integral (x p ) or equilibrium solution (xe) is the particular value of Xt which fit~
g(t) in (3.20).
1. Consider the case where g(t) = d, some constant, in (3.20). The particular solutiO!
is obtained by trying a solution x = kti where k is a constant and i = 0,1,2: i = (
if a + b + C =j:. 0 in (3.20); i = 1 if a + b + C = 0 and i = 2 if b + 2c = O.
47
This is the solution of the homogeneous part, or reduced form, of (3.20) with g(t) = 0,
i.e. xc(t) is the solution ofaxt + bXt-1 + eXt_2 = 0 (or aXt+2 + bXt+l + eXt = 0). In the
light of first order d.e., we can try xc(t) = A~t, xc(t + 1) = A~t+l etc ... where A is some
arbitrary constant. Substitution into (3.20), with g(t) = 0, gives
A(a~t + b~t-l + cAt- 2) = A~t-2(a~2 + b~ + e) = O.
Since A~t-2 ¥- 0, the characteristic equation e(~) = a~2 + b~ + e must vanish. This has 2
roots i.e.
(3.21)
gives ~l' ~2 = f,; 2
(-b ± -/b - 4ae) and the complementary function we are searching is
(3.22)
where Al and A2 are two arbitrary constants to be determined by the 2 initial conditions
Xo and Xl. Three cases arise, just like the ODE of chapter 2, as follows:
(i) ~ == b2 - 4ae > 0 : ~l and ~2 are both real and distinct roots of e(~) = O. The solution
is (3.22).
(ii) ~ = b2 - 4ae = 0 : ~l = ~2 = ~ (say) i.e. e(~) = 0 has two identical roots, ~ = "2: .
The solution (3.22) obtained by trying xc(t) = At~t instead, substituting into (3.20) with
g(t) = 0, remembering that b2 = 4ae and ~ = -~, is
(3.23)
48
(iii) .6. < 0, C(A) = 0 gives A = -f,; ± iy'4~,;b2 == 0: ± if3 where 0: == Re(A) = -f,; and
13 == y'4~:-b2 , both real numbers, and if3 = Im(>'). Solution (3.22) is now
(3.24)
In polar coordinates, 0: = r cos 6 and 13 = r sin 6 where 6 is the angle between the real
axis 0: and the radius r (see figure 3.4).
..\ = ct + if3
---;<"iE-ir---+-------ct = Re..\
x= ct - if3
is
Xt = X. + Al~~ + A2~~ (3.27)
where ~1 and ~2 are the roots of the characteristic equation
{i} (3.27)
(ii) (3.28)
or
(3.30)
when ~1 and >'2 are the complex roots of c(A) = 0 .i.e. ~2 = XI, or if ~1 == >. then .\2 = X).
50
Example 3.2.
Xt + 4Xt-1 + 3Xt-2 = 8, Xo = 3; Xl = 4
c(,x) = ,X2 + 4'x + 3 = 0 ~ ,X = (-3, -1), case (i) Xe = 8/8 = 1. Solution (3.27) gives
Xt = 1 + AI(-3)t + A2(-lr
At t = 0, Xo = 1 + Al + A2 = 3 (given).
At t = 1, Xl = 1 - 3AI - A2 = 4 (given).
These 2 equations in Al and A2 give (AI, A 2) = (-3,5) and thus the complete solution is
Example 3.3.
Xt + 4Xt-1 + 4Xt-2 = 9, (xo, xd = (2,3)
c(,x) = ,X2 + 4,X + 4 = 0 ~ 'xl = 'x2 = -2 (case(ii)).
The solution, by (3.28) is
Example 3.4.
Xt+2 - 2Xt+1 + lOXt = 9 (xo, xd = (2,3)
c(,x) = ,X2 - 2,X + 10 = 0 ~ ,x = 1 ± 3i (case (iii))
Xe = 9/9 = 1.
Xt = 1 + Art cos(8t - f}
= 1 + (1.05)(3.16}t(cos 71.56t - 18.435)
with n given initial conditions. Without loss of generality, we can set ao = 1 (and of
course an ¥ 0) and g(t) = 0, and obtain an exact parallel of the treatment of ODE in
Chapter 2.
The characteristic equation c('x) is given by
,Xt + al,Xt-l + ... + an,Xt-n
= ,Xt-n(,Xn + al,Xn-l + ... + an) = 0
= ,Xt-nc('x) (3.32)
where c(,X) = ,Xn + al,Xn-l + ... + an, an nth order polynomial whose solution involves n
roots which may be all real and distinct, or complex, or repeated on lines (multiplicity
m, see Chapter 4). The solution for the n distinct real roots is
(3.33)
(3.34)
If roots are complex but distinct, they come in conjugate pairs and each pair has the form
(3.25)
rt(Bl cos8t + B2 sin8t). (3.35)
If some complex root j is repeated mj times, it is
It will be recalled that the solution of Xt = aXt-l + b, given xo, is Xt = (xo - x.)at + x •.
The model is stable if Xt -+ x. as t -+ 00. For this to happen, lal < 1 i.e. -1 < a < 1.
If 0 < a < 1, i.e. a is a positive fraction, at -+ 0 as t -+ 00 in positive descending steps.
For example if a = ~, at = 1, 1/2, 1/4, 1/16 for t = 0,1,2,3 i.e. 0.5 t tends to zero in
descending steps. (see figure 3.6). If - 1 < a < 0, at overshoots its equilibrium (being
alternatively positive and negative) while approaching it (see figure 3.6). Eg. if a = -1/2,
at = 1, -1/2, 1/4, -1/16 for t = 0,1,2,3. Outside this range (-1,1), i.e. if lal > 1 i.e. if
a < -1 or 1 < a, at will climb up the steps indefinitely, diverging from x. in larger
and larger steps and if a < -1, at will diverge from x. in larger and larger positive and
negative steps (see figure 3.6)
XI Xt x.
XI (xo > x.)
(I < a) (a < -I)
Xo Xo
x. x. x.
Xo
(xo - x. < 0)
XI
0 I<a 0 a< -I
XI XI
Xo (xo > x.) (-I<a<O)
(0 < a < I) Xo x.
x. XI x. x. x.
(3.38)
(3.39)
(3.40)
Stability conditions are IAil < 1 for all i in the real root case and Irl < 1 in the complex
root case.
Schur's theorem (see Chipman 1951) is often used to check these stability conditions.
To use the theorems, write determinant ~n as follows
1 0 0 an an-1 a1
a1 1 0 0
~n =
an-1
an 0
1
0
0
1
0
a1
0 an
a n-1 == I ~~ A2
A'1
I
an-1 0
a1 an 0 0 0 1
Let ~1 be ~n where only the first rows and column of each submatrix Ai are retained,
and ~2 is ~n where only the first two rows and columns of each submatrix Ai (i = 1,2)
are retained, and so on, i.e.
1 0 an a n-1
~1 = 11
an
an
1
I' ~2=
a1
an
1
0
0
1
an
a1
etc ...
a n-1 an 0 1
Schur's theorem says that lAd < 1 for all i iff ~1, ~2' .•• , ~n are all positive.
54
Second order d.e. have been widely used in Economics. Suffice to review only two
of them, the Samuelson-Hicks models in view of their importance and their pioneering
character.
where Yo == Yp = 1/(1- c) = multiplier = particular integral of Yi and AI, A2 are the roots
of C(A) where
1.e.
A = ~ [C(1 + v) ± Jc2(1 + v)2 - 4CV] . (3.43)
Clearly the two parameters c and v determine A and hence the stabili ty of Yi. Three cases
arise depending on whether the discriminant ~ == ~(1 +v)2 -4cv ~ 0, and in the complex
root case (~ < 0), whether Yi is periodically convergent or divergent. Only the limiting
case ~ = 0 need be examined. Since c > 0, ~ = 0 ::} c(v) = 4v/(1 + V)2, a concave
function over v E (0,2) which reaches a maximum at (c, v) = (1,1). At v = 2, c(v) has an
inflexion point and becomes convex thereafter, i.e. c'(I) = 0, c"( v) = 8( v - 2) /(1 + v)4 and
c"(v) ; 0 for v ; 2. Above this line, roots are real and distinct, indicating monotonically
increasing or decreasing Yi (i.e. no fluctuations) and below it, roots are complex indicating
fluctuations (see figure 3.7).
55
c CVgt\ dvgt
cv = 1
1=:==.,-:----=::0;:::::--------------- c = 1
monotonic dvg
· (1)
·
periodic: cvg l'
(3) : c(v) = 4v/(1 + v)2
nO~----------~1----------~2~-------------v
In the complex zone below c( v) = 4v / (1 = V)2 line, stability is given by Ir I = IFvl < 1,
in the solution
(3.44)
Since (c, v) > 0, Irl = 1 =? r = cv = 1 divides the parameter space into a divergence
zone above it and convergence zone below it (see figure 3.7). Since 0 < c < 1, only the
area below c = 1 line is of interest. The two curves cv = 1 and c = 4v/(1 +V)2 delineate
the parameter subspace into 4 zones of interest: monotonic divergence in (1), periodic
divergence (2), periodic convergence (3) and monotonic convergence (4). This completes
the analysis of Samuelson's cycle.
Hicks' Trade Cycle is, like Samuelson's, based on the interaction of the multiplier (1/ s)
and accelerator (v). It provides an independent and interesting application of the solution
techniques discussed above. Hicks' model is
Ct = (1 - S)Yi-l: Consumption function
It = Ao(l + g)t + V(Yi-l - Yi-2): Investment function
Yt = C t + It: National income.
Consumption C t is a lagged linear function of Yt-l (0 < S < 1, s being the marginal
propensity to save). Investment (It) has two components: autonomous investment
Ao(l + g)t growing exogenously at some constant rate 9 and induced Investment
V(Yt-l - Yt-2) which responds to changes in GNP, v > 0 being the accelerator. Sub-
stitution gives
Yi - (1 - s + V)Yt-l + VYt-2 = Ao(l + g)t (3.45)
The particular solution Yp can be found by trying a solution Y t = kmt, Y t - 1 = kmt-I,
Y t -2 = kmt - 2 (as discussed in 3.3.1, 2) and substituting into (3.45)
km t - (1 - s + v)km t- 1 + vkm t- 2 = Aomt
56
k = aOm
2
. (3.46)
m2-(-s+v)m+v
Y.,(t) == Yp(t) = km t
(Aom 2)m'
= m2 - (1 - s + v)m
~--~~~~----
+v
(3.47)
(3.48)
where Al and A2 are the roots of the characteristic equation C{A) = A2_ {1-s+v}A+v = 0
of (3.45), i.e.
(3.49)
Stability depends on the discriminant ~ == (l-s+v)2-4v. Two real distinct roots (Al, A2)
exist when ~ > 0 i.e. when 1 - s + v > 2.jV i.e. (1 - .jV)2 > s :::} (i) 1 - .jV > .jS and
(ii) -1 + .jV> .jS, since (1 - s) and v are both positive. Thus ~ > 0 :::} v < (1 - .jS)2
and (1 + JS)2 < v giving respectively monotonic decreasing and increasing GNP over
time. ~ < 0 for (1 - ..[i)2 < V < (1 + JS)2 where Y, is periodically convergent for
(1 - .jS)2 < V < 1 periodically divergent (unstable) for (1 + JS)2 > V > 1 and Y, exhibits
a constant cycle for v = 1 (see figure 3.8 and 3.9). Figure 3.9 identifies zones of periodic
convergence (p.c.), periodic divergence (p.d.) from monotonic convergence (m.c.) and
monotonic divergence (m.d.) The complete solution is
ir------.----------;--- s = 1
v = (1- VS)2
(p.d.)
(m.e.) (m.d.)
-L~~--~~~~~~~-------v
o
Figure 3.9. The parameter space
This chapter was devoted to the treatment of first, second and higher order d.e.,
both linear and non-linear, with some economic examples. It was brief, in view of the
similarities with ODE in Chapter 2. Yet it is comprehensive: it covers all important
points which are useful for the understanding of Dynamical Systems.
Finally, it will be useful to keep clear in mind the differences between the solutions of
ODE and d.e. Take the second order linear case, for definiteness, these are summarized
in the following table.
ODE d.e.
Form of ax + bi: + ex =0 aXt+2 + bXt+1 + eXt = 0
equation (or aXt + bXt_1 + eXt-2 = 0)
e(A) aA 2 + bA + e = 0 aA 2 + bA + e = 0
A = ~ a( -b ± y'b 2 - 4ae ) A = ~ a( -b ± y'b2 - 4ae)
Solution
where for case (iii) (~ < 0), A = a ± if3 = Re(A) ± Im(A), r = ;;;;',8 = tan- I (f3/a),
E = tan- 1 (B 2 / Bd. Obvious as these are, they often constitute a source of confusion for
some students.
Chapter 4
Review of Some Linear Algebra
This chapter provides a brief review of some elements of linear algebra which will be
used in later chapters, in the treatment of dynamical systems.
It will soon be clear that in the solution of the dynamic system x = !(x, t) we will
need matrix theory, especially eigenvalues and eigenvectors, both real and complex, to
deal with the linearized part. A brief review of vectors and matrices would provide a
useful reference. The review will be selective and kept to the minimum needed. For
any further materials and any in depth treatment, the readers should consult excellent
textbooks on the subject, some of which are Pedis (1958), Gantmacher (1959), Bellman
(1960), Hadley (1965), Lancaster (1969), Strang (1976), and Anton {1984}.
Definition 4.1. A vector x = (Xl, X2 , ... , Xn) of dimension n is an ordered set of n scalars
Xi (1 ~ i ~ n) called component of x.
Two vectors of the same dimension are added (or subtracted) by adding the corre-
sponding elements. For example:
X+ Y = (Xl + Yl,'" Xn + Yn).
A vector X is multiplied by a scalar k when each of its components is multiplied by k,
I.e.:
kx = {kXI' kX2,' .. , kXn}.
The multiplication of a column vector X by a row vector y of the same dimension is
called an outer product: it gives a square matrix C, i.e.:
xyT == [
Xl
: 1 [Yl ... Ynl =
[: Cll
...
Cl
n
1== C (4.1)
Xn Cnl Cnn
The multiplication of row X by column y of the same dimension gives a scalar. This
is called scalar product or inner product or dot product. I
n
For any position vectors u, v, w and any real numbers k, h, we have the following:
1. u+v=v+u
2. u+(v+w)=(u+v)+w
3. u + 0 = 0 + u =u
4. u+(-u)=O or u-u=O
5. k(h u) = (k h}u
6. k(u+v}=ku+kv
7. (k + h}u = ku + hu
8. 1.u = u. (4.2)
Definition 4.2. If V is an arbitrary set of objects on which the above operations are
defined, then V is said to be a vector space. If the above scalars k, h are real numbers, V
is a real vector space and if k, h are complex numbers, V is a complex space. V is closed
under addition and scalar multiplication: i.e.: if u and v are two vectors in V, so are u + v
and ku, hv.
Definition 4.5. The vectors vIt V2, ••. ,vn over the real or complex field 2 F are said to
be independent if there exists a set of n scalars (kIt k2' kn ) such that
(4.3)
implies ki = 0 for all i where ki is a real number for the real vector space and ki is a
complex number in the complex vector space.
2Roughly speaking, a field F is defined as a set of numbers which contains the sum, difference,
product and quotient (except by zero) of every two numbers in the set. Furthennore, the numbers must
be commutative and associative, and distributive and associative with respect to addition, subtraction
and multiplication, and distributive with respect to multiplication.
61
Definition 4.6. The set S = {VI, V2, ... , vr } in the vector space V is said to be a basis
)f V if S is linearly independent and S spans V.
Definition 4.7. The dimension oia finite vector space V is defined to be the number of
"ectors in a basis for V.
Definition 4.8. An inner product on a real vector space V is a function that associates
~ach vector x and y in V a number (x, y) which satisfies the following axioms
Definition 4.9. An inner product on a complex vector space V is a function that asso-
ciates to each complex vector x E en and y E en, a number (x,y) == Ei=l XiYi where xr
is the conjugate of x" i.e. if Xr = a r + ibr then xr = a r - ibr for all r = 1,2, ... , n where
i = A. Inner products on a complex vector space satisfy the following axioms:
Note the difference between (i), (iii) and (iv) in (4.4) and (4.5). If x is a complex vector
in (4.5) then (x,x) ~ 0 is no longer true if(x,x}is defined as Ex? Hence, the necessity
in (4.5) to replace (x, x) == E XiXi in the complex space. The order of x and y in (x, y)
is important in the complex space, but is immaterial in the real space. Note that by our
definition, (x,ky) = k(x,y} but (kx,y) = k(x,y} in (iii) of (4.5).
A complex vector space with the above inner product (4.5) is called a complex inner
product space or unitary space.
The length, or magnitude, of a vector x is
Any non-zero vector x in an inner product space can be normalized on division by its
length IIxll, Le. II~II. Such a normalized vector has norm 1, since by a property of norms,
(4.7)
A set of vectors in an inner product space is called an orthogonal set if each pair oj
distinct vectors in the set is orthogonal. An orthogonal set of vectors with norm 1 is
called orthonormal. These definitions carry over to the complex inner product space.
If V and Ware vector spaces and F associates with each vector v E V, a uniqUE
vector W E W, then F is said to map V into W i.e. F : V -+ W. F is a linear mappin~
or transformation if F(u, v) = F(u) + F(v) and F(ku) = kF(u) V u, v E V and k E R.
For example if A is an m x n matrix then a function T : Rn -+ Rm can be defined by
T(x) = Ax
Definition 4.10. If T : V -+ W where VeRn and W C Rm then the set of all vecton
v in V that T maps into 0 is called the kernel or null space of T, denoted kerT ane
N(T). The set of all vectors w in W that are images under T of V is called the range 0
T, denoted R(T). The dimension of R(T) is called the rank of T, denoted r(T) and tht
dimension of kerT is called the nullity of T.
In other words, the set of all nontrivial vectors x such that Ax = 0 is N(A) and tht
range of A, R(A) is the number of independent column vectors of A. Thus, ifT : Rn -+ Rn
i.e. if A is an (m x n) matrix and y E R(A) then y = Ax for some x ERn. Clearl)
y = Ax is a linear combination of all columns of A and R(A) is the set of all sud
linear combinations i.e. R(A) is the column space of A. This can be seen by writin!
A = (al, a2, ... , an) as n columns, and y = Ax as alxl + a2x2 + ... + anX n i.e. y is a linea
combination of the column vectors of A: y E R(A).
Proof. See any textbook in Linear Algebra, for example Hirsch and Smale (1974), Gant
macher (1959), Strang (1976).
Theorem 4.2. The dimension of the solution space of Ax = 0 is n - r(A) i.e. the
number of columns of A minus its mnk, where A is an m x n matrix. This theorem plays
an important part in eigenspace where m = n, as can be seen later.
4.2. Matrices
Definition 4.11. A matrix A is a rectangular array of entries aij (1 :::; i :::; m, 1 :::; j :::; n)
from a field F (field of real or complex numbers):
(4.8)
<
where m ;:; n. A is said to be of dimension or order m x n. Where m = n, A is said to
>
be a square matrix; when n = 1, A is a column vector of m elements (i.e. A is m xl) and
when m = 1, then 1 x n matrix is a row vector of n elements.
In terms of vectors, A may be looked at as a set of m row vectors of n elements each,
or a set of n columns of m component each, i.e.
al.]
[ a2'
= [al·,a2·"" ,an']' (4.9)
a~.
The sub-space of Rn spanned by the row vectors is called the row space of A and the
sub-space of Rm spanned by the column vectors is called the column space of A. It can be
shown that these spaces are not changed by elementary operations (multiplying a row by
a non-zero constant, interchanging two rows and adding a multiple of a row to another).
It could also be shown that for any matrix A, the row space and column space have the
same dimension, which is called the rank of A denoted as r(A). Thus r(A) is the number
of independent rows (or columns) of a matrix A.
Two matrices A and B are said to be equal if they have the same order and same
corresponding elements i.e. A = B means aij = bij for all i, j.
AT or A' is called A transposed when rows and column are interchanged. More pre-
cisely, for an m X n matrix A = [aji] (1 :::; i :::; m; 1 :::; j :::; n) we can define an m x n
matrix A' = [aij] called transposed A.
..
D efi mtlOn 4 .12. D'lagona1 matrIX. A = aiivii
I: h
were I:
Vij = {10 if ii
if = jj ..I.e.
:# .
a matrIX
with all its off-diagonal terms zero. A diagonal matrix with 1 on the diagonal is called a
unit matrix I = [~ij].
Definition 4.13. Symmetric matrix if aij = aji for all i,j, i.e. A = AT A' i.e. A
is its own transpose. A skew-symmetric if aij = -aji IrJ i,j i.e. A = _AT i.e. when
=
A is the negative of its own transpose. By definition, diagonal terms are zero, since
aii = =
-aii :} aii 0 IrJ i. Note that every square matrix A can be expressed as a sum of
a symmetric As and skew-symmetric Ar i.e.
A=As+Ar (4.10)
where As = !(A + AT) and Ar = !(A - AT).
Addition: Two matrices A and B of the same order can be added by addin~
corresponding elements i.e. A ± B = C = [eij] where Cij = aij + bij lrJi,j. AdditioI
is commutative and associative, i.e. A + B = B + A and A + (B + C) = (A + B) + C.
Ie=l
,m
... ,n
Matrix multiplication is associative and distributive but in general not commutative, i.e.
= A(BG)
(AB)G
(A + B)G = AG + Be
but AB # BA in general.
Note that the transpose of a matrix product is the product in reverse order i.e.
where il, h, ... ,jn is a permutation p of 1,2, ... , n, where p varies over all permutations
and t(p) is the number of permutations needed to restore natural order. Note that this
number is always even (0,2,4 ... ) or odd (1,3,5 ... ) and the sign of t(p) is respectively +
or -. For example when n = 2, detA = ana22 - a12a2b the latter has one permutation
in the second subscript, and hence is preceded by minus.
Thus det A is by definition a product alii' ... anj. in which one element is taken from
each row and from each column in each term of the expansion of det A, i.e. det A is an
additive homogeneous function of its columns or rows.
The properties of determinants follow from the above definition. We are simply listing
the main ones here, leaving their proofs (based on t(p) and homogeneity above) as an
exercise to interested readers.
P.1 IAI = IATI i.e. a matrix A and its transpose AT have the same determinant.
P.2 If A has 2 identical rows (or columns) or one row (column) as a multiple of another,
then det A = O.
66
P.3 If B is obtained from A by interchanging any two rows or columns, then det B =
-detA.
PA If A has a row or column of zeros, then det A = o.
P.5 If A is a diagonal matrix or a triangular matrix then det A = ITa;; = product of
diagonal terms.
P.6 If any row or column of A is mUltiplied by a scalar k, then its determinant is k det A.
Hence det(kA) = k n det A. In particular, det( -A) = (_l)n det A.
P.7 If B is obtained from A by adding a multiple of one row (or column) of A to another,
then det B = det A.
P.8 If the elements of the (n x n) matrix A are differentiable functions of x and
A == [at, a2, ... ,an] then by product rule, ~ (det A)
d d
= detLt ai, a2, ... ,an] +
det[al Tza2, ... , an] + ... + det[al, ... , Tzan].
Z
Expansion by Co-factors.
The (n - 1) determinant Mij obtained from det A of order n, by deleting row i and
column j, is called the minor of aij in A and Aij = (-l)i+jMij its cofactor. Then by
definition
det A == IAI = L aijAij for all i
j
The first, Ej aijAij is called the expansion of det A by the cofactors of row i and the
second, Ei aijAij is called the expansion by the cofactors of column j (any j, i).
Note that from the definition of determinants, Ej aijAkj = 0 if i :j:. k and Ei aijAik = 0
if j :j:. k. These are referred to as an expansion of det A by alien cofactors.
Laplace Expansion.
When aij above is, instead, an (r x r) block in an n x n matrix A (r < n), call it B r ,
then the (n - r) determinant obtained by deleting the above r rows and r columns, is
called the (n-r) complementary minor, Mn-r and C n- r == (_1)8 Mn-r its complementary
cofactor or algebraic complement, where s is the sum s = E i + E j. Then
det A = L IBrlICn-rl (4.14)
(j)
67
where the summation is taken over all possible permutations of the n second subscripts
taken r at a time. This is called a Laplace expansion of det A.
Example 4.1.
A- [1
The expansion by the first 2 rows gives
Example 4.2.
Lemma 4.1. Let M by any 2n x 2n matrix and N == [~ ~] then det M = det(M N).
Proof. Det(MN) is det M to the first n columns of which are added its last n
columns each multiplied by elements of C in N. Hence by property P.7 above,
detM = det(MN). (QED)
Theorem 4.3.
det(AB) = det A . det B. (4.15)
Proof.
[~;][~ ~]=[}B;]
By Lemma 4.1, det [~ -:] [~ ~] = det [~ -:]; and by Laplace expansion
det [~ -:] = det B· det A and equally det [:B -:] = det(AB). (QED)
68
Proof.
[C~-l ~] [~ ~ _CA-iB ] [~ A-;B] = [~ ~].
By Theorem 4.3 the product of the 3 determinants on the LHS is (I) det(A) det(D -
CA- 1B)(1) = det [~ ~] on the RHS. (QED)
Note that the term D - CA-1B is called the Schur complement of A in (~ ~).
Similarly if A is singular but D is not, det (~ ~) = det D . det(A - BD-1C) and
A - BD-IC is the Schur complement of D in (~ ~).
Definition 4.20. A square matrix A is said to be invertible if there exists a square matrix
B of the same order such that AB = BA = I where I is the unit matrix of the same
order. B is called the inverse of A and written as B = A-I and AA-I = A-1A = I.
The computation of A-I could be carried out by Gaussian elimination, using elemen-
tary operations, (see any textbook on linear Algebra), or by using adjoint matrix and
determinant, as follows.
Definition 4.21. The matrix whose elements are cofactors Aij in transposed order, is
called adjoint A, denoted (adj A) and defined as:
... Ani
An2
1
(4.17)
Ann
From these definitions.
(4.20)
The case where b = 0, we have the homogeneous equations system Ax = 0 whose non
trivial solution (i.e. x =F 0) requires A to be singular i.e. det A = O. If A is non singular,
i.e. detA =F 0, then the only solution of Ax =F 0 is the trivial one x = A-I(O) = O.
Note that, written out separately, each element Xi of x in (4.19) is the Cramer's rule
Xi = IAMIAI where IAil is IAI whose column i has been replaced by column b. Take i = 1
for definiteness: Xl = 'IW = I!I (Allb l +A 21 b2 + ... + Anlbn) which is exactly the first row
of A-Ib, multiplied by column b, term by term, i.e. Xl = (A-Ib)'el.
In general, Ax = b where A is m x n, has a solution iff r(A) = r(A:b): b being a
linear combination of the column vectors of A. If r(A) < r(Ab), no solutions exist and if
r(A) = r(Ab) < n, there exists an infinite number of solutions with (n-r) variables being
assigned arbitrary parametric values. For the homogeneous case of Ax = 0, if r(A) < n,
there exists an infinite number of solutions with (n - r) variables treated as parameters.
If s is a particular solution of Ax = b and tEN (A) then s + t is a solution of Ax = band
t E N(A) then s + t is a solution of Ax = b. To see this, examine A(s + t) = b + 0 = b,
hence s + t is a solution of Ax = b. We shall encounter this in Chapters 5 and 6.
Eigen,values and eigenvectors play a very important part in many theoretical and
applied problems in every field.
Ax = >.x (4.21)
{4.24}
3This can be seen by taking the transpose of (A - >.1)x = O. Clearly [(A - >.1)xl' = x'(A - >.1)' =
x' (A' - >.1) = O. If A = A', the right eigenvector x and the left eigenvector y are the same i.e. x' (A' - >.1) =
x'{A - >.1) == Y'{A - >.1) = O.
For the general case of a non-symmetric matrix, with distinct eigenvalues, the right and left eigenvectors
are the inverse of each other. This can be seen by writing y' A = >.y' for the left eigenvector y and
Ax = >.x for the right eigenvector x. Let Q = the matrix of left eigenvectors·and P = the matrix of
right eigenvectors then clearly QA = AQ where A == diag (>'1o>'2, ... ,>'n), AP = PA. The first gives
QAQ-l = A. The second gives P-1AP = A. Hence Q = p-l.
4Certain authors, such as Lancaster (1966 p. 12), make the distinction between eigenvalues and latent
roots.
In the case of constant matrix A, the two are the same but in the case A{/1) is a function of some
parameter /1, such as the case of lambda matrices, the two are different: eigenvalues are dependent on
/1 while latent roots and vectors are not. Inman (1989 p. 59) complains that this excellent distinction
does not catch on in the literature. We shall follow the crowd and use them interchangeably so long as
confusion does not arise.
71
For n = 3,
(4.25)
where
CI :; tr A :; T = an + a22 + a33
C2 = sum of principal minors of order 2
:; Iaua21 al21 +
a22
Ialla31 I
al31 + a22 a231
a33 a32 a33
C3 = detA.
Note that had we adopted the practice of writing (4.21) as ()..J -A)x = 0 (4.22) would
be (-1)nC(A) = det()..J-A) = An-CIAn-I+C2An-2+ ... +(-1)ncn = 0 = (-1) n det(A-)..J)
by property P.6 of determinants in 4.3.1 above. We shall adopt (4.22).
Example 4.4.
A = [~ ;] C(A) = A2 - TA +8=0
= A2 - 11A + 24 =0
!
this gives two eigenvalues A = (11 ± V(l1)2 - 24)) = (3,8).
To compute the eigenvectors x, in order to find the basis for the eigenspace of A, we
solve (A - A;I)xi = 0 for each i (i = 1,2, ... , n if A is n x n). In example 4.4, these are:
]
= 8, (A - A2I)x 2 = [~4 ~1] [=~ = [~] giving 4XI = 2X2 or X2 = t (any
For A2
parameter), Xl = !t i.e. the second eigenvector x 2 is x 2 = t [ 1{2 ] = [~] for t = 2.
Definition 4.23. Two matrices A and B are said to be similar to each other if there
exists a nonsingular matrix P such that B = p-l AP. If P is an orthogonal matrix,
p-l = pI, then B = p-l AP = PAP and B is said to be orthogonally similar to A.
Similar matrices have the same determinants, since B = p-l AP gives, upon taking
determinants and remembering that the determinant of a matrix product is the product
of determinants
IBI = Ip-IIIAIIPI = IAllp-IIIPI = IAI (4.27)
since p-1p = I, Ip-111PI = III = 1.
Similar matrices also have the same characteristic equations. Let B = p- l AP, i.e. B
and A are similar to each other, then their characteristic equations are
c().) = IB - = IF-lAP - .HI = IP-I(A -
),11 ).I)PI = Ip-ll IA - ),11 IFI
= IA - .Hllp-IIIFI = IA - ),11 = c().). (4.28)
Note however that although similar matrices have the same eigenvalues, they do not
have the same eigenvectors: these are transformations of one another. To see this, let
B = p-l AP as before and let). be an eigenvalue of B with y its corresponding eigenvector.
Then clearly
By = ).y
P-IAPy =).y
APy = ).Py
Ax =).x (4.29)
where x == Py i.e. eigenvectors x and y are multiples of each other: A and B have the
same eigenvalues). but not the same eigenvectors.
Similar matrices also have the same traces, trace being defined as the sum of diagonal
terms of a matrix. To see this first note that if A is an m X nand B an n X m matrix,
then
73
Theorem 4.7. The eigenvectors associated with distinct eigenvalues of a real symmetric
matrix are orthogonal.
Proof. Take any two distinct eigenvalues Ai and Aj (i =F j) of a real symmetric matrix A
md their corresponding eigenvectors Xi and xi. We have
AXi = Ai Xi
Axj = .xjxi .
Premultiplication of the first equation by xi and the second by xi gives
= Ai Xj . xi
x j . AXi
xi. Axi = AjXi . xi.
But (xi. Axi)' = xi. A'xi = xi. Axi , A being symmetric. Hence
(Ai - Aj)Xi . xi = o. (4.30)
Since Ai =F Aj by hypothesis, Xi ·xi = 0 i.e. Xi and x j , both non-zero vectors, are orthogonal.
Since i and j are arbitrary, the result is proved.
74
Theorem 4.8. If an eigenvalue >'i of a symmetric matrix A has multiplicity mi, then
will be mi orthogonal eigenvectors corresponding to this >'i. In other words, the algebrail
multiplicity, if any, of an eigenvalue is always equal to its geometric multiplicity, ani
hence A is always diagonalizable.
Quadratic forms involve symmetric matrices whose properties were investigated in th(
last section.
Definition 4.24. Given any square symmetric matrix A, the quadratic form Q(x) i:
defined as
Q(x) = L~::aijXiXj == x'Ax == X· Ax == (x, Ax}.
i j
Definition 4.25. A real symmetric matrix A is said to be positive definite if for all non
zero x vectors, x'Ax == (x,Ax) > 0; positive semi-definite if x'Ax 2: 0, and IAI = (
negative definite if x' Ax < 0, and negative semi-definite if x' Ax ~ 0 and IAI = o.
In the complex case, a real symmetric matrix A is said to be positive definite (negativi
definite) if for all non-zero complex vector X, (x,Ax) == x' Ax> 0 « 0) where x is thl
complex conjugate of x. i.e. if x == '11 + iv then x == '11 - iv where '11 and v are real vector:
and i 2 =-l.
Thus x' Ax == x· Ax == (x, Ax) = ('11 - iv)' A(u + iv) = ('11, Au) + (v, Av) > o.
Theorem 4.9. A real square symmetric matrix A is positive (negative) definite iff all it
eigenvalues are positive (negative).
Proof. Let >'i be any eigenvalue of A and xi its associated eigenvector, i.e. AXi = >'ixi
Premultiplying this by xi and omitting superscript and subscript i to alleviate notation
we have x' Ax = >.x' x, or
x'Ax
>.=-
x'x
Clearly x' x > O. It is then clear that>. > 0 => x' Ax > 0 and conversely x' Ax > 0 => >. >0
Similarly, >. < 0 {:} x' Ax < O. Since i is arbitrary, the theorem is proved. (QED).
The quadratic form Q(x) == x' Ax can be greatly simplified by orthogonally diagonal
izing A, using the transformation x = Poy where Po is the normalized modal matrix 0
A, i.e. Po = [xl/llx/Il,X2/I1x211 ... xn/llxnlil where PO- 1 = P~. This gives
This allows quadratic forms to be examined very simply by use of orthonormal trans-
formation x = poY which gives
Example 4.5.
is a polynomial equation of A of degree n. Its solution gives n roots (A1> A2," . ,An) which
(i) may be all distinct and non-zero (in which case the rank of A is n);
(ii) some Ai = 0 (if A is of rank r then only r eigenvalues are non-zero, the remainin~
n - r eigenvalues (Ar+l' Ar+2, ... ,An) = (0,0, ... ,0) are zero);
(iii) some Ai may be repeated mi times (mi is called the algebraic mUltiplicity of Ai); or
(iv) some Ar may be complex, in which case they come in pairs Ar = Or ± if3r where
i = A and Or, f3r both real numbers.
or
AP=PA
giving p-l AP = A where P == [xl, x 2 , ••• ,xn] = modal matrix whose columns are thE
eigenvectors of A and A = [diag (Ai)]. Thus P diagonalizes A. For this, p- l must exist
i.e. the columns of P which are the eigenvectors of A must be independent. Conversely
if Pis nonsingular, p- l exists and p- l AP = A i.e. P diagonalizes A. (QED)
Note that eigenvectors are not unique: if v is an eigenvector, so is kv for any nonzerc
scalar k. If k = II~II where IIvll = (vf + v~)1/2 = norm of v, then P is called a normalize<
modal matrix, written, say, as Po, then Po is unique. In this example, xA = ~ n)
x~ = ~ (-;1) and -l and P~APo = [~ ~2] = POl APo.
Po = PO
77
Note that if A has no repeated eigenvalues i.e. if ~t. ~2,"" ~n are all distinct, A is
said to be a simple matrix. Then P is non singular and A is diagonalizable.
Not all matrices are diagonalizable. Those which are not, are called defective. For
example, A = (~ ~) has a double zero eigenvalue ~l = 0 = ~2 and the unique corre-
sponding eigenvector x = k ( ~ ). Since ~ has multiplicity m = 2 but there is only one
independent eigenvector, there exist no modal matrices P such that p-l AP = A. The
number of independent eigenvectors corresponding to a repeated eigenvalue is called the
geometric multiplicity.
If ~i has multiplicity mi, i.e. ~i is repeated mi times, then all depends on N(A - ~;l),
the null space of (A - ~;l) i.e. the number of vectors Xi (i = 1,2, ... ) such that
Axi - ~ixi = O. If there exists a full set of independent eigenvectors, then A is diag-
onalizable. Otherwise it is not.
Theorem 4.11. A n-square constant matrix A having degeneracy a has a null eigenvalue
with multiplicity m ~ a.
Proof. Clearly the rank r(A) = n - a (i.e. there are only n - a non-zero eigenvalues),
and the coefficients cn - a +! = 0 = Cn - a +2 = ... = Cn-l = Cn = 0 in the characteristic
equation c(~) = IA - AIl = (_l)n~n + (_l)n-l~n-lcl + ... + Cn = 0, since all minors of
A of the order greater than r vanish. Thus
(4.35)
Since a is the geometric multiplicity, i.e. a is the dimension of the null space N(A-~;l),
and mi is the algebraic multiplicity, the above theorem is often stated in simpler form as
"the arithmetic multiplicity (m) is not less than the geometric multiplicity (a) i.e. m ~ a" .
Example 4.7.
A = [ 010 0
2 0
1; IA - AIl = (1 - A)(2 - ~)2 = O.
012
eigenvector is V2 =(~) where s,i are arbitrary constants taken as parameters. Thus
there exist only two independent eigenvectors VI and V2 and hence A is not diagonalizable.
Furthermore, the arithmetic multiplicity = m = 2 ~ a = 1 = geometric multiplicity.
78
Example 4.8.
A=[~023
~ ~l
IA - All = (1 - A)2(5 - A) = 0 i.e. Al = (1,1), A2 = 5. For Al = 1, the eigenvector is
VI = ( -:) =s ( -!) + ~) t ( . For A2 = 5, the eigenvector is V2 = t ( ~) . The
1
modal matrix is
P =[ 0 1 0
-1 0 1 ; p- l =! [~ -~ ~ 1
101 2 0 1 1
(4.36)
= =
Proof. Let w u+iv and w u-iv, (u, v E R2, i = A). Then we have u = ~(w+w)
and v = ~(w - w) i.e. u and v are independent real two-dimensional vectors and as such
[u, v] is a basis for the vector space. Details of the decomplexification of A are as follow~
(4.38)
where R has a on the diagonal and the skew-symmetric matrix j3J with ±j3 off the
diagonal.
We have seen that the eigenvectors corresponding to distinct eigenvalues are linearly
independent and hence the modal matrix P whose columns are the eigenvectors of A is
non-singular. This allows A to be diagonalized, i.e. p-l AP = A == diag(~i)' A is called
a simple matrix if it is similar to the diagonal matrix of its eigenvalues. Matrices which
are not simple are said to be defective.
For the case some eigenvalue is repeated m times, (i.e. of arithmetic multiplicity m), we
have seen that the arithmetic multiplicity cannot be less than the geometric multiplicity a
(which is the maximum number of independent eigenvectors corresponding to the repeated
eigenvalue) i.e. m 2: a. (See examples 4.7 and 4.8).
Defective matrices are not diagonalizable. However they can be. "block-
diagonalizable." This leads to the concept of the Jordan canonical form:
Definition 4.28. For any n-square matrix A, there exists a non-singular matrix T such
that
(4.39)
80
(4.40)
Clearly if ni = 1 i.e. all eigenvalues are distinct, In,(Ai) = Ai and T-l AT = A i.e. the
Jordan canonical form is reduced to a diagonal matrix. It can be shown that the maximum
number of linearly independent eigenvectors associated with an eigenvalue is exactly the
number of Jordan blocks in which the eigenvalue appears. This number is called the index
of an eigenvalue and the theorem m ~ a could be rephrased as "the multiplicity of an
eigenvalue is not less than its index" .
The proof of this requires the introduction of many additional concepts which are not
necessary for our purpose. The interested readers can consult Gantmacher (1959), for
example.
Note that the Jordan block J could be written as
where
0 1 0
S.,=
[
~Ol (4.41 )
is called the superdiagonal matrix, with 1 in the superdiagonal position and zero every-
where else. It is easy to verify that (Sn)m = 0 for all m ~ n and Sn has only one eigenvalue
which is zero, with multiplicity n and with the right eigenspace of dimension one.
(4.42)
Definition 4.29. Let the linear space 5 have 2 subspaces 51 and 52. Then 5 is said to
be the direct sum, written as 5 = 51 $ 52 if
(a) 5, c.;, 5 (i = 1,2);
(b) For each x E 5, there exists Xl E 51 and X2 E 52 such that there is a unique vector
x = Xl +X2;
(c) If X E 51 and X E 52 then x = 0 i.e. there exist no common nonzero vectors to both
51 and 52.
4.10. Conclusion
This chapter was devoted to a review of some Linear Algebra. The review is bound to
be selective: only those notions which are relevant to the study of systems of Differential
and Difference equations have been covered. Yet it provides a sufficiently comprehensive
background in Linear Algebra for the study of Dynamical Systems, at this level.
Chapter 5
First Order Differential Equations Systems
5.1. Introduction
when A(t) is in general an n X n time variant coefficient matrix and b(t) a time variant
n-vector. The constant coefficient case emerges as a particular one in which A and
b are constant. The system (5.1) is homogeneous if b = 0 and non-homogeneous
if b i:- o. The solution of the homogeneous part, x = Ax is called the general
solution of the complementary function, xc(t) and the solution that fits (5.1) is
called the particular integral (x p ) or equilibrium solution (x e ). The combination of
the two, x(t) = xc(t) +xe , gives the complete solution of (5.1). In general, if vectors
xl, x 2 , ••• ,xn are each a solution of (5.1), so is their linear combination
(5.2)
Definition 5.1. The Wronskian W(X) of the n solutions of (5.2) is the determinant
of the matrix X the columns of which are the solution vectors of (5.1), i.e.
Clearly the n solutions at a point are linearly independent iff W i:- O. In this
case, there is exactly one way to express (5.2). To see this, write (5.2) as Xc = x ::}
c = X-Ix where X-I = adj X/W(X). If W i:- 0, there is a unique non-zero vector
c.
The next four sections will be devoted to the discussion of the quantitative
solution of (5.1) and its stability conditions, followed by the qualitative solution in
the plane. Some economic applications will conclude the chapter.
x = Ax + b, x(O) = Xo (5.4)
84
where P == (VI, V2, ... , Vn ) = modal matrix whose columns are eigenvectors of A and
A == [diag Ai] = a diagonal matrix whose diagonal elements are eigenvalues of A
(see ch. 4).
85
Proof. Distinct real eigenvalues ensure linearly independent eigenvectors and hence
non-singularity of P (see Theorem 4.10) and A is diagonalizable. From the definition
of eigenvalues Av; = .\;v;, we have A(Vb ... , Vn) = (.\lVb ... , .\nVn) or AP = PA
and hence P- 1AP = A or A= PAP- 1
Hence
(5.7)
(QED)
Remark 5.1. In the light of first order ODE in Chapter 2, (5.7) can be obtained
by trying a solution x(t) = ve.\t for some constant vector v. Differentiation gives
j; = .\ve.\t = Ave.\t or (A - .\I)ve.\t = 0 ~ (A - .\I)v = 0 ~ (A - .\I) is singular
(since v =F 0). This gives c(.\) == det (A -.\I) = 0 from which .\ 1, '\2, ... , .\n are found
and substituted into x(t) = C1v1e.\lt +c2v2e~t+ ... +envne.\.. t which is exactly (5.7).
0)( -13)_p
eSt
At
= e c
(5.9)
86
iJ = ± = Ax + b = A(x + A-Ib) == Ay
the solution of which, by Theorem 5.1 is
or
(QED)
( :~ ) = (~ ;) ( ~~ ) + ( ~ ) , Xo = ( ; )
Xe = -A-Ib = -1 (-i -~) ( ; ) = - ( ~~: )
c(-\) = IA - All = -\2 -
4-\ + 3 = 0 :::} -\ = (3,1)
P= ( ) _ (1 -1) p-l _ ( 1/2 1/2)
1 l' - -1/2 1/2 '
)
- VI V2 -
=
( 1 -1)
1 1
(e0 3t
0) ( 1/2 1/2) ( 10/3 ) _ ( 7/3 )
et -1/2 1/2 7/3 1/3
= ¥(~ )e 3t _ ~ -~ ( ) et _ ( ~~~ )
87
i.e.
AVI = AVI or (A - U)VI = 0
AV2 = AV2 + VI or (A-U)V2=VI
Multiplying both sides by (A - U), remembering that by the definition of eigenvec-
tors, (A - AI)VI = 0, VI '" 0, we have
I.e.
(A- U)2V2 = 0
Each vector V2 is called a generalized eigenvector of A. For example if x= Ax
has an eigenvalue A repeated 3 times, its solution vector is
(5.10)
In general Vi satisfies (A - AI) Vi = Vi-l where i = 1,2, ... ,m and m is the arithmetic
multiplicity of A, while CI, C2, Ca are arbitrary constants to be determined by initial
conditions.
(A - M)Vl = 0 gives - Xl + X2 = 0 or Vl = ( ~)
(A - M)V2 = Vl gives - Xl + X2 = 1 or Va = ( ~) , say.
The solution is
where hl == CIVl + C2V2 and h2 == i(CIVl - C2V2), both real vectors (see Chapter 2,
footnote 1).
[ 2i -i]
2'
P-1AP -_ A -_ [ -1 0+ 2i -1-02i ] -= [~10 ~20] •
Th I· .
e so utlOn IS
=
-
k
1
( - sin 2t) -t k (cos 2t) -t
2C082t e + 2 sin2t e
where kl = Cl + C2 and k2 = i(Cl - ~).
(j) , (-~)
n"'.
values and eigenvectors >. = (-1, -1 ± i) and Vl = V2 = +
to
x = PiJ=APy
iJ = P-lAPy = Jy (5.13)
(5.14)
It is easy to see that the original form (5.12) and the Jordan canonical form (5.13)
are topologically equivalent: only a change in coordinates is involved, the same
analysis is valid for both.
The three above cases will be examined separately, according to whether the
roots A of C(A) = IA - All = 0 are real and distinct or multiple or complex.
This is the simplest case which has been dealt with in Theorem 5.1 and 5.2. In
its Jordan form, the solution is simply
(5.15)
where J = A = diag (Ai) : a diagonal matrix with eigenvalues At, A2 on the diagonal.
The system is completely decoupled and the solution can be written at sight, as
Yi(t) = Yme>.;t.
--+--"---I---Xl 1/1
Clearly the two systems are topologically equivalent: both have the same structure
of Saddle Point instability. If the solution in x(t) is required, a transformation back
to x gives
Y1 = 3et ¢} Xl = 6e t 2e5t
-
Y2 = _e5t ¢} x2 = ge t _ e5t •
The two systems, as we have seen, are topologically equivalent: both being unstable,
having A = (1,5).
-1
PAP=J= [AOA.
1] (5.16)
92
Note that in general, the non-zero off-diagonal element is not zero: it is some c.
However, it could be transformed into 1 by using Pi == P [~ C~l] instead, i.e.
(5.17)
whose solution is
Y( t ) = eJt Yo = e~t [1
0 1 t] Yo· (5.18)
i.e. the two series coincide if SN = NS, and then e J = eSe N = e S+ N and hence e Jt
can be written as
Example 5.8. Solve x = Ax where A = [_~ !] c('x) = 0 gives (,X - 2)2 = 0 and
whose solution is
whose solution is simply (remembering that e(a±i,B)t = eat (cos f3t ± i sinf3t)) .
Note that, had X= a-if3 been used, we would have J = (~ -~) but the analysis
is not affected. Note also that using the SN decomposition of J as above, and since
94
J = [~ -!] == [~ ~] + [g -~] == 01 + P [~ - ~ ]
=S+N
eN = I+N+N2/2!+N3/3!+N4/4!+ ...
= [cos P -sin p ]
- sinp cosp
which is (5.20) (using X= 0 - iP). The above analysis can be summarized in the
following theorem:
Theorem 5.4. For a dynamic system on the plane ± = Ax where A is any real
2 x 2 matrix, there exists a non-singular matrix P such that p-l AP = J which is
one of the following 3 types, corresponding to the cases of real distinct, repeated and
complex eigenvalues respectively:
• J --
(';) [O~l "
~2]a n d th e so Iutzon
' 0if'y = J y 's
.
••• J -- [_~,.,
(';';';) 0(3] an d th
e soiut~on
' 0 if y. = J y .
IS
As we have seen in sections 5.2 and 5.3, the solution of x = Ax, which is
x(t) = eAtxo = peAt p-1xo = c1V1eAlt + ... + c,.vne A• T , involves the computation of
eigenvectors (V1' ... , vn), modal matrices P = (V1' V2, ... , vn ), and their transforma-
tion p-1 AP = J, the Jordan canonical form in (5.21). This could be cumbersome,
especially when A is not diagonalizable. To bypass such computations, several al-
ternative methods have been suggested, some of which are the following.
For the case of n distinct eigenvalues A1 :/: A2 :/: ... :/: An of A in x = Ax,
Sylvester (see Barnett and Cameron 1985) suggested writing eAt = 1:;;=1 Zlre A• t
where
Zir == (A - A1I)(A - A2I) ... (A - AnI) == (A - AjIn) . II (5.22)
(Air - A1)(AIr - A2) ... (Air - An) j_l Air - Aj
j,. .
n
= E ZlreA'txo. (5.23)
1r=1
_
- e
-t [ 4 ] + e-2t [ -3
-4 6]·
Comparing this with the conventional method in 5.2 and 5.3, we find that P =
[Vb V2] = [!2 !2] ; p- 1xo = [~1 ~1] [~] = [:3] and the solution is
identical, with ZlXO == C1V1 and X 2xo == ~V2 i.e.
96
1
C1 C2 Cn -1 1
C2 C3 1 0 z(t)
C= Z(t) == [ i~t)
Cn -1 1 0 0 zn-1(t)
1 0 0
The solution of x= Ax, regardless of possible mutliplicities of eigenvalues, can be
written as
n-1
eAt = E qj(t)Aj (5.24)
j=l
where qo(t), q1 (t), ... , qn-1(t) are the elements of the column vector q = C Z defined
above.
where Po = I; Pj == rr{=l(A - AkI) (j = 1,2, ... ,n) and T1(t), ... ,Tn(t) is the
solution of the triangular system
1-1 = A1T1
Tj = Tj-1 + AjTj (j = 2,3, ... , n)
T1(0) = 1; Tj(O) = O.
Method 1 gives
i.e.
Method 2 gives
and
1
x(t) = eAtxo = '2e-Xt[2J + 2t(A - >.J) + t 2(A - AI)2]XO. (5.27)
Both methods completely bypass the computation of eigenvectors and the Jordan
canonical form.
We are suggesting the direct method of writing the solution x(t) = eAtxo explic-
itlyas
(5.28)
The beauty of this method is in its simplicity. It is particularly useful in the case
of repeated eigenvalues of the type of Putzer's example. It is left to the readers to
verify that (5.28) gives exactly Putzer's (5.26) and (5.27) and is obtained in a much
simpler way, using only the definitional property of eA in eq (4.11).
Which method to be used is a matter of taste and convenience, for a given prob-
lem. If interest is in a qualitative solution and stability analysis, the conventional
method of the last 2 sections, giving
is quite adequate since, given certain conditions discussed in the last 2 sections, Vi,
P = (Vl •.• V n ) seldom need to be computed explicitly: CiVi (i = 1,2, ... ,n) being
98
where Dy
order ODE
=y, D2y =ii, ... , Dny =d"y/dtn, could be reduced to a system of n first
x = Ax, x(O} = Xo
Xl =Y
X2 = Y = Xl
Xa =Y =X2 etc... and writing
Writing A, which is called the companion matrix, as
or, in full,
0 1 0 0 0 Xl (t)
0 0 1 X2(t}
A= and X =
0 0 0 0 1
-an -an-l -an-2 -a2 -al xn(t}
The solution of this, by Theorems (5.1) and (5.2), is
{5.31}
99
with
and
x(t) = ~(t)c = ~(t)~(O)-lxO. (5.34)
We have seen (Theorem 5.2) that the solution of x = Ax, x(O) = Xo is
In the case of real diagonalizable matrices (with distinct roots), Ai < 0 for all i
amounts to saying that A is negative definite (see Chapter 4) or that the principal
minors of A of order r have the sign (-lY (r = 1,2, ... , n). In Economics, the
matrix A having this property is called Hicksian matrix. It is in this sense that
a matrix A with Re(Ai) < 0 is called a stable matrix by some authors (Lancaster
1969), which is, of course, true only for differential equations systems: for difference
equation systems Xt = AXt-t. A is a stable matrix only if IAil < 1, not Ai < 0,
for all i. This fundamental condition Re(Ai) < 0 has been studied extensively by
economists such as Arrow and Hurwicz (1958, 1959, 1960), Hahn (1962), Negishi
(1962), Newman (1959-1960), Basset et al. (1967) Quirk and Ruppert (1965) and
many others and stated in a variety of ways in relation to economic problems (for
example gross substitutes, complementarity of commodities). Matrices fulfilling
some conditions are referred to, for example, as Metzlerian (if aij < 0 Vi = j and
aij > 0 Vi ¥ j), dominant diagonal or McKenzie (1960) matrix, and theorems such
as "a Metzlerian matrix is stable if it is Hicksian" etc... As these are all but too
familiar to economic students, we shall not go into them, except to mention the
widely referred to "Routh-Hurwitz" test. This says that given a system :i: = Ax
with c(A) = IA - AIl = An + a1An-l + a2An-2 + ... + an = 0, Re(Ai) < 0 for all i
101
iff the following principal minors ~b ~2' ••• ' ~n formed by a; (i = 0,1, ... , n with
ao = 1) are all positive, where
al 1 0 0
I
1
I,... '~n
a3 a2 al
1
~l = al, ~2 = al = as a4 a3 a2
a3 a2
a7 as as a4
The above asymptotic stability conditions only guarantee local stability of sys-
tems starting near equilibrium, unless, of course, these are linear with constant
coefficients, in which case, they are both locally and globally asymptotically stable
if ReA; < 0 Vi and neutrally stable if ReAi = 0 # 1m Ai Vi. For nonlinear systems,
this need not generally be true. A global stability test is then provided by Lia-
punov's second method which consists in showing, without explicit solution of the
ODE system, that the distance between x{t) and its equilibrium x shrinks overtime.
In practice, we can take W = 1 and solve A'V +VA = -I. For example
A= [~1 !4],A=(-I,-3),A'V+VA=-/givesV= [~~~ ~~n which is
positive definite. In detail,
-1 -1
[~ ] [Vl1
-4 o
0 -1 Vl2 ] [ -1
O ] . Vl1 ]
[ Vl2 [ 5/6]
1/2. 5/6 1/2
o -4 -1 V2l = 0 gIves V2l = 1/2 I.e. V = [1/2 1/2]
3 3 -8 V22 -1 V22 1/2
102
Consider the simple (non-singular) real linear ODE system in the plane
.i = Ax (5.36)
in its Jordan canonical form (see Section 5.3)
iJ = Jy (5.37)
where J takes one ofthe following forms (Theorem 5.4)
----;;;..~-----.. YI
The arrows will be reversed in (a) if 0 < Al < A2 and in (b) if 0 < A2 < AI. This is
an unstable mode. In the limit, if Cl '" 0,
Y2 = C2 e(>-2->-I)1
Yl Cl
103
/
f/2 f/2
~ ~ Yl
~ Yl
\ f '\ (
(a) .xl < 0 < .x2 (b).x2 <o<.xl
tends to 00 if/,> 0 (and to -00 if/, < 0), i.e. in the limit as t -t 00, and the
trajectory comes home, this will be tangent to the Yraxis if /' > 0 and A < O.
Similarly, the arrows will reverse direction if A > O. See figure (5.4). The origin is
an improper node.
104
112 112
C~e (iii) Complex roots A = a ± ifJ, J = (_p !) .If a", 0 '" fJ, the canonical
form gives
112
--~~~----~------------lIl
2 2 ()(). • •
or Yl sec = YIY2 - Y2Yl·
Substituting from (5.40) and remembering that (sec2 ())y~ = r2 = y~ + y~, we have
r 2iJ = -fl(Y~ + y~) = -flr2
iJ = -fl
()(t) = ()o - flt. (5.41 )
These two equations (5.40) and (5.41) give complete information on the trajectory.
Three cases should be distinguished.
(a) a < 0 < fl: r(t) shrinks by (5.40) and ()(t) by (5.41) i.e. as the radius r decreases
over time, the system moves closer to the origin in a clockwise fashion because angle
() decreases over time (see figure 5.8). The origin is thus a stable focus or spiral sink.
If a,fl < 0, the motion is anticlockwise (see (iii))
106
(b) a, {3 > 0: r increases over time i.e. the system is winding out of the origin while
() decreases, i.e. it is spiralling out of the origin in a clockwise fashion. The origin
is unstable (see figure 5.7 case (ii)). The opposite is true for a,{3 < 0 (see figure 5.7
case (iii)).
(c) a = 0 < {3 : r remains constant over time while () decreases: the trajectory
is moving on a closed circle in the clockwise direction. If {3 < 0 = a, r remains
constant and the trajectory is anticlockwise. The origin is a centre: it has a neutral
stability (see figure 5.8).
--+--+---+--1/1 --+--+--+--1/1
The above analysis could be summarized and presented in the parameter space
T·6 of c(,x) where c(,x~ ~ IA - .xII = ,x2 - T,x + 6 = 0 (where T == tr A, 6 ==
detA), ,x(T,6) = i(T± T - 46) == i{T±J~). The 6 = 0 locus where 6 = T2/4 is
a patabola above which 6 < 0, c(.~) has complex roots causing periodic fluctuations
107
and below which A > 0, C(A) has real roots, causing monotonic movements. (see
figure 5.9). The system is stable on the left of the vertical axis (where T < 0) and
unstable on the right (where T > 0). Below the horizontal axis, 6 < 0 (i.e. 6 =
AIA2 < 0 indicating that eigenvalues are of opposite signs): the system exhibits a
saddle point instability.
* *
Stable Unstable
~
Nodes
~
Nodes
r' _ tr A
j~(
P~
+./ ~ -f.
»«
' . ,~"
\ /
,'~" S.P . ,~~
•• ODLf
Y= hi (I - 5)
r = h2 [L(Y, r) - M] (5.42)
The solution is
demand function has the same slope as the Investment function (in absolute value),
A = -8 ± io: and the system is a stable focus again (see figure 5.7 and 5.9).
A Leontief economy has n sectors each producing only one commodity, with a
non-negative, non-singular constant input-output coefficient matrix A == [aij] where
aij = output i used up per unit of commodity j, 0 ~ aij < 1 and a constant non-
negative, non-singular matrix B == [bij ] where bij is the output i required to build up
one unit of capacity j, 0 ~ bij < 1 (i, j = 1,2, ... , n). The economy must produce
enough to satisfy intermediate demand (Ax), investment demand (Bx) and a final
consumption target (e), i.e.
(5.45a)
110
[: ] = [~ -D ~ rIn ] [ ; ].
Note that C and D have the same eigenvalues since C == B-1(I -A) and (I -A)B-l
have the same eigenvalues as already noticed and D == (B-l),(I - A') being the
transpose of (I - A)B-l also has the same eigenvalues. In Jordan canonical form
(see eq. 5.13), using the transformation x =
Py and p =
Pq, we have p-1CP =
p- 1DP = A = diag(Al, ... , An), and
[~]=[~ -A~rIn][~]
== Mr [ :]
where
111
we can see that u(Mr) = u(Mr) + r/2 i.e. the spectrum of Mr is the spectrum
of Mr shifted to the right by r/2. Writing a typical eigenvalue of Mr as Aj{r) =
±Ctj(r) ± i{3j(r) with {3j(r) = 0 for real Aj and Ctj(r) = 0 for pure imaginary Aj
(ignoring possible multiplicity of Aj for simplicity), we can see that the symmetry
property of Mr is preserved but Mr's symmetry is lost through the perturbation
term r I, and several interes~~nf problems for investigation arise: if for some r = r*,
Ctj(r*) = 0 :/: {3j{r*) with da~:' :/:
0 we have a Hopf bifurcation (to be discussed in
Chapter 9). If Ctj(r*) = 0 = {3j(r*), then one real eigenvalue Aj(r*) becomes zero,
and the system could be analysed more simply by the Centre Manifold theorem.
In both cases, there is a loss of structural stabiiity (see Ch. 9). All this required
further analysis in vastly different directions from Jorganson's 1961 paper.
For the open system (5.45), the corresponding price system is
p = pA + rpB - pB + wao
which says that output price vector p includes current costs pA, interest cost rpB
(r is interest rate), labour costs wao less capital gains (or losses) pB, i.e.
p= -p(I - A - rB)B- 1 + waoB-1. (5.47)
It can be shown (see Jorgenson (1960) that this has negative eigenvalues and the
solution tends to
p = wao(I - A - rB)-l.
Consider an economy with n commodities x = (Xl. ... , xn) and their price vector
p = (Pl,P2, ... ,Pn)' In each market, there is a demand function Di(P) and supply
function Si(P) (i = 1, ... , n). Multimarket equilibrium is said to prevail if each
market i is cleared i.e. the excess demand Ei(P) = Di(P) - Si(P) for commodity i is
zero in each market i (1 :5 i :5 n), and thus, for the whole economy,
Take k = I i.e. ki = 1 for all i, for simplicity, this is just P= Ap, p(O) = Po, a system
of linear first order O.D.E. whose solution, by Theorems 5.1 and 5.2, is
(5.49)
The linear model of general equilibrium, cast in the framework of the static
Leontief model by Dorfman, Solow and Samuelson (1958), has been analyzed by
Morishima (1960) in a dynamic setting. We shall briefly present Morishima's model
to illustrate an important economic application of first order ODE systems.
Consider an economy with n goods x = (Xl, X2, ... , xn) with their prices p =
(Pl,P2, ... ,Pn) and m + 1 primary factors r = (rl' r2, ... , r m, ro) with their rental
v = (Vl,V2, ... ,Vm ,vo). Let A = [aij] be an n x n constant input coefficient matrix,
assumed non-negative and indecomposable and B = [b ij ] be an m x n constant non-
negative factor coefficient matrix. The (consumption) demand function c(p, v) and
factor supply function r(p, v) are both assumed to be homogeneous of degree zero
in p and v, as is usual in general equilibrium models, and to obey the weak axiom
of Revealed Preference (WARP), i.e. if (Po, vo) =f (PI, VI) then
Given the "budget constraint" which says that the outlays on goods, ric, are
identically equal to the income from factor services, v' r, i.e.
p'c(P,v) == v'r(p,v) (5.51)
general equilibrium is obtained when the goods market and factor market are both
cleared and no abnormal profit is made, i.e.
x = Ax + c(p, v) (5.52)
Bx = r(p,v) (5.53)
p = A'p+ B'v. (5.54)
The general equilibrium flavour is thus introduced to the conventional Leontief model
by way of the demand c(p, v) and supply r{p, v) functions which, instead of being
exogenously given, are now function of product and factor prices (p, v).
Morishima then introduced the dynamics into the model by requiring that
(i) output prices rise in response to excess demand in the goods market;
(ii) factor prices rise in response to excess demand in the factor market;
(iii) production rises in response to the excess of price over cost, i.e. to the emergence
of abnormal profit, i.e.
Hp = Ax+c(p,v)-x
Kv = Bx - r(p,v)
x'M =p' -p'A-v'B-bo
where H, K and M are diagonal matrices whose diagonal elements hi, ki and mi are
all constant positive speeds of response.
As an illustration of an economic application, let us analyze, following Mor-
ishima, a simpler model where (i) price adjusts to maintain eqUilibrium in the goods
market i.e.
p = Ax + c(p, v) - x = 0
giving
x = (l- A)-lc(p, v) (5.55)
and (ii) output adjusts to maintain equality between price and costs i.e. to ensure
the absence of abnormal profits i.e.
x'M=p'-p'A-v'B-b~=O
giving
p' = v' B(/ - A)-l + b~(/ - A)-l (5.56)
where
b~ == (bOl' b02 ,' •. ,bon).
With these two markets in equilibrium, we can eliminate x and p from the above
two equations and write
Kv = Bx - r(p, v)
= B(/ - A)-lc(v) - r(v) (5.57)
114
if
v* I B(/ - A}-l ~c + b~(I - A)-l ~c - v* I ~r - ~ro ~ o. (5.64)
Since (5.64) contradicts (5.63), the LHS of (5.65) must be positive. Subtracting this
from (5.63), we get
But (5.66) is the RHS of (5.62). Hence V < 0 and the Walras-Cassel-Leontiefsystem
is globally stable.
Chapter 6
First Order Difference Equations Systems
(6.3)
Xl = Axo
X2 = AXl = A2xo
(QED)
(6.6)
where k ji == CiVji, where Vji is element i of eigenvector Vj (i,j = 1,2, ... , n), is a
constant to be determined by initial conditions.
Theorem 6.3. The solution of the non-homogeneous system Xt = AXt-1 +b, x(O) =
Xo in (6.2), is
Proof. By iteration,
Xl = Axo +b
X2 = AXI + b = A{Axo + b) + b
A2xo + Ab + b
X3 = A3xo + (I + A + A2)b, and so, for any t,
Xt = Atxo + {I + A + ... + At-l)b.
Xt = At(xo - xe) + Xe
= PAt p-l(xO - xe) + Xe
which is (6.8). (QED)
117
Yt = AtYo.
The above theorems are valid for all cases. The case in which A has n distinct
(real or complex) eigenvalues is obvious: the n corresponding eigenvectors are inde-
pendent and A is completely diagonalizable. They are valid for the case of repeated
eigenvalues, so long as the arithmetic multiplicity is equal to the geometric multiplic-
ity: A is completely diagonalizable since the n eigenvectors are linearly independent.
118
They are equally valid for the case of defective matrix A (see Chapter 4) where for
some Ai repeated k times, the geometric multiplicity is less than k: a non-singular
matrix P can always be found to block-diagonalize A i.e. to bring A to its Jordan
canonical form J where p-l AP = J and J is a block diagonal matrix
Al
Ai 1 0
Ai 1
Ai
Clearly the case of complete diagonalizability emerges as a particular one where
J = A == diag (Ai).
Canonical forms are simpler to analyze especially when stability and qualita-
tive solutions are involved: the original and canonical systems being topologically
equivalent, a mere change of coordinates is often all that is involved.
We shall examine the cases of real distinct roots, multiple roots and complex
roots of c(A) = IA - Ail = 0 separately.
Yt = At Yo = [30t 0] [-1]3 ·
2t
where Xt = PYt and P == [VI, V2] chosen such that (A - >.J)VI = 0, (A - AI)v2 = VI (in
general, (A - >.J)Vi = Vi-I (i = 1,2, ... ,mi) where mi is the arithmetic multiplicity
of A;). Thus (6.9) has been reduced to a first order d.e. system (6.10) whose solution
IS
(6.10)'
where Yo = P-Ixo. In fact, (6.10) being a triangular matrix system, the last equation
could be solved at sight, Y2(t) = AtY20, and substituted to the first (in general, to
the second last, third last etc. recursively).
In fact, the second equation could be solved at sight, as Y2(t) = 2tY20 = 21(3) and
substituted into the first equation to give YI(t) = 2YI(t - 1) + Y2(t - 1) or
+ 1) = 2YI(t) + 2t(3)
YI(t
whose solution is YI(t) = (YlO + tY2o/2)2t = (-1 + 3t/2)2t where Yo == p-I XO =
[-~ ~] [ ~ ] = [ -; ] == ( ~~: ) .
120
When A in Xt = AXt-l has some complex eigenvalues Aj, these come in pairs
Aj = Ctj ±i{3j where Ctj and (3j are real and i 2 = -1. Take the 2 x 2 case for simplicity:
C(A) = IA->.II = 0 gives A = Ct±i{3 where Ct == r/2 and {3 == ';48 - r 2 )/2, both real
numbers. A change to polar coordinates Ct = r cos (), {3 = r sin () gives the standard
basis of no
as
t t -1
Xt=Axo=PAP Xo=
[1010] [ (1 +0 i)t 0 ] [24 ]
(l-i)t
where () = tan- 1 (1) = 45° = (7r/4) and r = 1 in eq. (6.13) which is exactly the same
as in the scalar case in Chapter 3.
The above discussion can be summarized into a theorem, using the system in
the plane for simplicity.
Theorem 6.4. The system Xt = AXt_l can be transformed into its canonical form
Yt = JYt-l (6.15)
whose solution is
(6.16)
121
where (i) J = [~l ~2] for the case of real distinct eigenvalues
(ii) J = [~ !] for the case of multiple eigenvalues
(iii) J = r [_::: ; : : ] for the complex roots case and Jt = (~ ~~) for
(,0); Jt = [.\'
0
t.\'-l] l
.\'
(00) d Jt =r t [ -sinOt
Jor n an
cosOt SinOt] l
cosOt
(000)
Jor m 0
(6.17)
given n initial conditions, can be reduced to a first order d.e. system by redefining
variables as follows
010
o 0
1 0 (6.18)
o 0 1
of the form
Yt = AYt-l
whose solution, as in (6.3) and (6.4), is
Yt = At Yo = PNp-1yo.
The matrix A in (608) is called the companion matrix of (6.17).
122
[ X1{t) ] _
X1{t) -
[0-cv c{1 + v)
1] [ -1) ]+ [ 0]
X1{t
X2{t - 1) G
Xt = At{xo - xe) + Xe
= PAtp-1{xO - xe) + Xe
where c = P-1{XO -xe), >'1, >'2 = eigenvalue of the companion matrix A and V1, V2 =
corresponding eigenvectors.
In canonical form, where Yt = JYt-b the solution is Yt = Jtyo, in Section 6.2
where J is the Jordan matrix form.
If c = 0.8, v = 1, >. = 0.8 ± O.4i = 0: ± i(3, r = {a2 + (32)1/2 = 0.89. This economy
exhibits convergent (damped) oscillations since r = 0.89 < 1 and c(1 + v)2 < 4v.
The solution, in canonical form, is
Yt
= (0.89)t [ C?S Ot sin Ot ] [ C2C1 ] + Ye
_ smOt cosOt
It is easy to convert into the original variable Yi and compare the results with (3.44)
in Chapter 3, to verify that they are identical.
123
From {6.4}, {6.5} and {6.6}, it has been seen that the solution of Xt = AXt-l is
{6.4}, {6.6}
or, in canonical form
{6.9}'
The asymptotic stability conditions can be seen to be simply
=
But>. = 0: ± i(3 in the complex case, and 1>'12 >.X = {o: + i(3)(o: - i(3} = 0:2+ (32 =
r2 = 8{= det A) i.e. 1>'1 = Irl = ../0:2 + /3 2 {only + needs be considered, radius r
being positive. See figure 6.1}.
a+ ifj
-""""""""lE---corr---+------a = Re ~
a - ifj
Thus 1>'1 < 1 ~~ Irl < 1, i.e. for stability, the modulus of each eigenvalue must lie
inside the unit circle {of radius r = I}.
124
Note that when Ai > 0, the convergence to (O < Ai < 1), or divergence from
(I < Ai), equilibrium is orientation preserving since A: in (6.6) is positive for all t.
However, when Ai < 0, the trajectory is orientation reversing, whether converging
to equilibrium (-I < Ai < 0) or diverging from it (Ai < -1) since A: in (6.6) is
alternatively positive (for even t) and negative (for odd t).
This fundamental condition IAil < 1 'Vi is stated in a variety of ways and numer-
ous theorems examine the conditions for its occurrence. Some of them are
(i) Schur's Theorem: The roots of c{A) = An + CIAn-l + ... +cn = 0 will be less than
unity in absolute value iff the following determinants are all positive
1 0 Cn Cn-l
Cl 1 0 Cn
~2= >0 ...
Cn 0 1 Cl
Cn-l Cn 0 1
1
Cl 1
Cn-l 1 Cn o > O.
~n= --'c:....:;-----+--:-l-c-l---c~-1
C n n-l
(6.20)
Cn-l o 1
Cl en 0 1
(ii) In practice, a more easily verifiable condition (although stronger than necessary)
is IIAII < 1 where IIAII is any norm of A. In input-output analysis, IIAII is the largest
column sum of A. This can easily be seen by noting that, by definition
Av = Av.
Taking norm on both sides gives
and hence
IAI < IIAII Ilvll = IIAII < 1 (6.21)
- II vII .
This provides what Conlisk (1973) called a "quick check" of stability conditions.
(iii) 161 < 1 and ITI < n.
(6.22)
This can easily be seen by recalling that
6 == det A = det A = II Ai and IAil < 1 ~ 161 < 1
i
n n
which imply
181 < 1 and Irl < 1 + 8. (6.24)
These conditions can easily be seen by noting that c(>.) = 0 is a convex quadratic
equation (c"(>') = 2 > 0) and if >'1 and >'2 are its two real roots, then c(>'d = 0 =
C(>'2) by definition. These are the points at which c(>.) = 0 intersects the >.-axis (see
figure 6.2)
C(A)
C(A) = 0
'\r----;-:-:1 c( -1)
c( 1)1------1..
----~~~~---T~----A
-1
Theorem 6.5. The system Xt = AXt-l is globally stable iff there is a symmetric
positive definite matrix B such that -C = A'BA - B is negative definite.
where B is positive definite. It is easy to check that V(xd qualifies for a Lyapunov
function (V ~ 0, ~V < 0).
126
For a d.e. system Xt = AXt-l on the plane, phase diagrams are constructed in
the same way as for the ODE systems in Chapter 5, the only difference being that
points are discrete and connected as continuous curves only as a visual aid. We
shall analyze the system in its Jordan canonical form rather than its original form
for simplicity: the two being topologically equivalent.
The characteristic equation c(A) = IA - AIl = A2 - TA + 8 = 0 has the roots
A = T /2 ± ..(is./2 where ~ == T2 - 48. Three cases will be analyzed separately
according to the sign of ~.
(i) ~ > 0 : two real distinct roots Al, A2. The system is a stable node (SN) if
IAil < 1 , unstable node (UN) if IAil > 1 and a saddle point (SP) if lAd < 1 < IAjl
for i,j = 1,2 and i i' j.
"-
Y2 Y2 Y2
~/ ~/ ,/
/ ~ '\ I
Yl Yl
/ ~
Yl
Jf2 Jf2
\ ) \ )
(
1/1 1/1
( '- ~
(a) Stable improper node (II) Unstable improper node
---H-H--+----yl
If Q = 0 :F {J, r = ';0: 2 + {J2 = {J < 1 for a stable focus and r = (J => 1 for an
unstable focus.
The various cases analyzed above can be summarized in the 1'·6 parameter space
in the same way as in the ODE system in Chapter 5. Consider the characteristic
equation c(~) = ~2 - 1'~ + 6 = 0 with the solution ~ = 1'/2 ± V"K/2. Referring
to the analysis in 6.4.1 (iv) above, we see that the three stability conditions are
Ic(O)1 = 161 < 1, c(l) > 0 and c(-I) > O. The limiting cases of f1 = 0 (~ 1'2 = 46)
128
and c(l) = 0 = c(-l) delineate stable zones from unstable ones (see figure 6.6).
6 = 0 is represented by the parabola 7 2 = 48 and c(l) = 0 and c( -1) = 0 by the
two straight lines: c( 1) = 1 - 7 + 8 = 0 (8 = 7 - 1) and c( -1) = 1 + 7 + 8 = 0
(8 = - 7 -1).
--------------~--~~--r_~~~--~------------T
Stability conditions require 7 and 8 to lie inside the triangle ABC where -1 <
8 < 1 on the vertical axis and above the c( 1) = 0 and c( -1) = 0 lines in figure 6.6.
[ Yl (t)]
Y2(t) =
[0-B I] [ Yl (t -
A+B Y2(t - 1)
1) ] .
129
Consider the Keynesian model of income (Yi) determination in which the imme-
diate as well as distant past continue to influence current economic activities. More
specifically let induced investment (It) have a two-period lag and consumption (Ct )
have a three-period lag, i.e.
It = VI(Yi-1 - Yi-2) + V2(Yi-2 - Yi-3) , (VI + V2 = V)
Ct = clYi-1 + c2Yi-2 + c3Yi-3, (CI + C2 + C3 = c)
Yi = It + Ct = (VI + CI)Yi-1 + (C2 + V2 - VI)Yi-2 + (C3 - V2)Yi-3
== alYi-1 + a2Yi-2 + a3Yi-3
or, as a first order d.e. system,
YI (t)
[ Y2(t)
1[ 00
Y3(t) a3
130
or
Yt = AYt-1
where A is the companion matrix of the above system. The solution, by Theorem
1, is
Yt = AtYo = PNP- 1yo
where A1, A2, A3 are the solution to C(A) = _A 3 + a1A2 + a2A + a3 = O.
which Solow (1958) interprets in the light of Capital theory as the equilibrium
condition of an investor facing the choice between using his money to set up business
and lending it out for interest (r) income. In the first alternative, he would receive
his sale revenue, at the end of the period, Pt+1 less his current costs Pt+1A and
still owns his equipment worth Pt+1B to enter the second period. In the second
alternative, he lends ptB at interest rate rt and will get (1 + rt)ptB. In equilibrium,
he would be indifferent between the two, and we have the dynamic system
Pt+1[(I - A)B- 1 + Ij = Pt
or
r
PH1 = pt[I + (I - A)B- 1 1 '= PtN
where N '= [I + (I - A)B- 1j-1. The output system Xt (in column vector) and its
price dual Pt (in row vector, to avoid transposition notations), are
Pt+1 = PtN.
Jorgenson's dual instability can then be shown very simply by noting that M
and N- 1 have the same eigenvalues, say A, and hence M and N have eigenvalues
which are the inverse of each other, i.e. A and t, where A = 1 + J.L and J.L is an
eigenvalue of B- 1(I - A) or of (I - A)B-1 as has been noted in section 5.9.2 of
Chapter 5 (see also Chapter 4). This establishes the Saddle Point property of the
system's solution i.e. x(t) = Mtxo = C1V1A1 + ... + CnVnA~ and p(t) = (N')lpo =
t
C1W1Alt+ ... +CnWnA;;-t: in other words, if A is inside the unit circle, then is outside
it. Using the nonnegativity of the matrices M and N- 1 and the Perron-Frobenius'
theorem concerning the existence of a largest nonnegative eigenvalue, say A1 such
that A1 > IAi 1 (i = 2,3, ... , n) and a corresponding nonnegative eigenvector say v 1.
Jorgenson has also established the dual relative instability, defined as
.I
hm - Xi - -
t--+oo Xl V1
I
Vi <€ (€>O)
where Xi(t) is the ith component of the solution x(t) = eMtxo and Vi the ith compo-
nent of the characteristic solution vector v. Thus if the output system x(t) is stable,
its dual price system p(t) is unstable and vice versa.
Chapter 7
Nonlinear Systems
7.1. Introduction
In Chapter 5, we have dealt with linear dynamic systems, the nature of their
equilibrium, and their analytical as well as qualitative solutions. In this chapter,
we are discussing nonlinear systems. These are more important since the world is
more nonlinear than linear in general, and also linear systems can be considered
a local linearization of nonlinear systems about an equilibrium point. Nonlinear
systems can be approximated by such linearization in some cases and not in some
others. This will be discussed in the Linearization theory in the general context
of solution spaces, together with their stability and qualitative solutions. A brief
introduction to Limit Cycles will be presented. The discussion will be illustrated
with some applications in Economics and Biology.
Note that Dynamical Systems refer to both the continuous system represented
by differential equations
(7.1)
whose solution gives rise to a flow, and the discrete systems represented by difference
equations
Xt+! = g(Xt) (7.2)
written as a C r (i.e. a smooth r-times (r ~ 1) differentiable) map
where x(t) ERn is a vector valued function of time (t) and f : U -t Rn is a smooth
function defined on some subset U of Rn. Then f is a vector field which generates
134
a flow <Pt : U -+ Rn where <Pt(x) =<p(x, t), defined for all x E U and t E (a, b)
satisfying
d
dt <p(x, t)t=6 = f[<p(x, s)]
for all x E U and s E (a,b).
Given some initial condition Xo E U, the solution of (7.1) consists in finding a
flow <p(xo, t) satisfying <p(xo,O) = Xo. Does such a solution exist and is it unique?
Assuming the Lipschitz constant K exists where
Proof. Coddington and Levinson [1955], or Hirsch and Smale [1974] or any standard
text book on differential equations. The proof of existence and uniqueness can also
be carried out by use of the Contraction Mapping Theorem discussed in any text
book on Functional Analysis, such as Taylor [1958] or Kolmogorov and Fomin [1957],
for example. It will not be carried out here.
(7.1)
Definition 7.2. A fixed point is said to be simple if its linearized system Ax has
no zero eigenvalues, i.e. if det A -I 0.
Consider the nonlinear system x = f(x) in (7.1). Using Taylor expansion about
some critical point x*, set at the origin for simplicity, we have
where
and cp(x) is such that lim.,-to cp(x) = 0. Ax in (7.4) is called the linearization of
(7.4) or the linearized system i.e.
x = f(x) = Ax + cp(x) as
Xl = II (x) = aXI + bX2 + CPI (Xl, X2)
X2 = h(x) = cXI + dX2 + CP2(XI,X2)
°
where a == afI/axl == au; b == alI/aX2 == a12; C == ah/aXI == a21, d == ah/aX2 ==
a22 and liIIlr-to 'f,(":,"2) = = liIIlr-to 'f2(":,"2) where r == Jx~ + x~, i.e. CPI and CP2
being so small, can be neglected, provided the critical point is hyperbolic.
Example 7.1.
X = X + 2y+x 2
if = 2x + y + xy2.
Clearly the critical point is the origin i.e. (x*, y*) = (0,0), where
A = [ ac db] = [12]
2 1 ; CPI = X2 , CP2 = xy2
°
with lim."lI-to CPI = = lim."lI-to CP2. The origin is a simple critical point since the
eigenvalues ,X = (3, -1) are both nonzero, det A = -3 ¥- 0. It is also hyperbolic
since Re (,X) = (3,-1) ¥- (0,0).
In general, for any critical point (x, y) ¥- (0,0), we can define their variations as
new variables ~ == (x - x), Tf == (y - y) and X = f(x) is
( ~Y ) == ( ~)
Tf
= [ac db] [ xy -- ~Y ] + [ CP2(~'
CPI(~' Tf) ]
Tf)
.
This only represents a shift of the critical point from (0,0) to some nonzero (x, y).
Proof. Hartman (1964). Although the proof is involved and not presented here,
the Theorem is easy to understand and apply. It simply says that provided no
eigenvalues of A lie on the imaginary axis (including the origin) in the complex plane
(fig. 7.1), in other words, provided Re (Ai) '" 0 Vi, then L and NL are qualitatively
equivalent: nodes, saddle points (SP) or spirals in L remain nodes, SP or spirals in
NL. Thus, it can be seen that near a hyperbolic critical point, the dynamical system
is structurally stable: it is robust to small perturbations. Hyperbolic critical points
are also called generic in that their occurrence is the rule rather than exceptions.
Thus, provided the critical point is generic, it is sufficient to study the lineariza-
tion Ax, as if it was a linear system x = Ax. But this has been discussed at length
in Ch. 5 where the solution of x = Ax, for a given x(O) = Xo, has been shown (in
equation 5.14), to be
(7.6)
where x ERn, y == p-lx and J is the Jordan form of A. For the case of distinct
eigenvalues Ai of A, J = diag (At, A2, ... ,An) == A and (7.6) is simply
or
(7.7)
(7.8)
ffiR +
hnA I
• represented or
simply
ReA as windows as
we can spell out the above analysis, separating the hyperbolic from nonhyperbolic
cases as follows. (See Table 7.1).
(ii) Al = 0 = A2 ~ 8= 0= T
+ DG
Case(5): .6. = 0 Al = A2 = T /2 = 0 ~ T = 0 + DG
The above eigenstructure of Dynamical Systems in the plane can also be shown
in the parameter space as in fig 7.2.
*"
*
Stable Unstable
~
Nodes
~
Nodes
,.. _tr A
j~(
P',
+./ ~ -f.
}~"
X(
\ (
,';"4,-, ,~'\
IADDLE S.P.
1m .\
--------*--------Re.\ -----E~~~------Re.\
Thus, the Linearization Theorem says that so long as the critical point is hy-
perbolic i.e. Re (Ai) =f:. 0 'Vi, Land NL are qualitatively equivalent and hence it is
sufficient to solve the linearized problem. However, when Re Ai = 0 the system is
structurally unstable and linearisation is inappropriate. Take case (6) for example,
eigenvalues lie on the imaginary axis: the critical point is a centre. A slight per-
turbation would cause them to leave the axis to move to the right thus becoming a
139
repelling spiral or to the left thus becoming an attracting spiral. The system loses
its structural stability. If eigenvalues are functions of some parameter, a change in
the latter may cause them to move on the imaginary axis to come together until they
collide at the origin then split on the real axis: a centre thus becomes a stationary
point then split into a SP equilibrium. See fig. 7.3 and 7.4.
Example 7.2.
Xl = -X2 + fr2xl
X2 = Xl + fr2x2
where r2 == x~ + x~ and f = 1 in system I and f = -1 in System II. Both systems
have the linearized part X = Ax where A = (~ -~) with eigenvalues A = ±i and
the critical point is a centre. In polar coordinates, Xl == r cos 0, X2 = r sin 0, we have
x~ + x~ = r2 (cos 2 0 + sin2 0) = r2 and tan 0 = X2/ Xl, which give, on differentiating
and substituting, as in Ch. 5 (see eqs. 5.41, 5.42),
r
20· = •
XlX2 -
.
X2Xl
rr = XlXl + X2X2 + X2 X 2 = r3
r = r2 > 0 for system I (f = 1)
r = _r2 < 0 for system II (f = -1).
It can be seen that although both systems have the same linearized part, which
is a centre, both moving anticlockwise, but radius r expands (r > 0) in system I,
causing an unstable (repelling) spiral and r shrinks (r < 0) in system II, causing
an attracting or stable spiral. Thus the Land NL systems are not qualitatively
equivalent and Linearization is not allowed. See fig. 7.5.
0 (a) Linearised
a (b) NL in I
®
(c) NL in II
Systems I and II
------~~~------Re~
± = ax - cxy
if = -by+ dxy (7.9)
where a,b,c,d are positive constants. The Jacobian DJ(x) == A evaluated at equi-
librium z (== x, y), is
Clearly there are 2 critical points z = (0,0) called extinction equilibrium, and
Z = (b/d,a/c) called coexistence equilibrium and A at these points are
A(z) = [ 0 -cb/d]
ad/c 0
with eigenvalues '\(A(z)) = (a, -b) and '\(A(z)) = ±iVab. The first critical point
is thus a S.P. at the origin and the second critical point is a centre with neutral
stability (see fig. 7.7).
' \ (bid
----------__~~-----L-------x
:d
Note that at Z, we have x = -~ y and iJ = x which could be written as y+ab y =0
or y + w2 y = 0 where w2 == ab, whose solution is
y(t) = a cos(wt - 8)
where a is the amplitude of the oscillation with period 27r/w, as shown in fig. 7.7.
where m == M - Jf, w == W - liT and Jf, liT are such that Ai = 0 = l,tl. AE a
numerical example, let (al,bl.el) = (1,-1,4) and (a2,~,c2) = (1/2,1/2,1). It is
easy to see that there exist 4 equilibrium point.s
Xl = (0,0) with Al = (~ ~), >. = (4,1)
X2 = (4,0) with A2 = (~ -:) , >. = (-4,3)
Xl = (0,2) with Al = (~ _~) , >. = (2,-1)
w
4
3
2
---,~----~L------------------4~~~~--------M
4
x= (a - by)x
y = (ex - d)y
liI.
w
------~~----~~--~~-------X
o
-e
tlmax ••••••••••••
..
• ••••• :
alb
____ ~---L--~~--~-----------X
Xmin die Xmax
The Linearization Theory holds for dynamical systems of any dimension n. How-
ever, for n > 2, phase diagrams are getting complicated. For n = 3, all 3 eigenvalues
can be either all real (including zero and multiple) or one real and one complex pair,
and the canonical form oftke linearized part Ax == D/(x}x (x E R 3, A E R3 X R3)
is
or
[~ o
Q
-f3
f30
Q
1 or
where A = Q ± if3 with Q = 0 for pure imaginary A, f3 = 0 for real A and Q "I 0 "I f3
for complex A. This exhausts all possibilities. Their phase diagram is presented
in fig. 7.11 where on1y hyperbolic critical points are presented. These are cases
where linearization is useful. Other non hyperbolic critical points such as the cases
where one or more eigenvalues have zero real parts, for example where Al = 0,
A2, A3(= A2) = Q ± i,B or Al = 0, A2, A3 = ±if3, all on the imaginary axis, are not
considered: linearization would give misleading results.
Attractors
(stable) ~4. ~+
~* ~+
Saddle
Point
Saddle
Point ~~ ~+
Repellors
(unstable) ~+ ~+
Figure 7.11. Hyperbolic flows in 3 dimensions
145
Example 7.7.
o 0 -1
1
x = Ax where A = Df(x) = [~ ~ -~ i.e. Ax is the linearization
0) , U) 0)
of x= f(x). c(>.) = det(A - >.J) = 0 gives >. = (0,1, -1) with corresponding
x. = -xs, x. ERn.
(7.10)
Xu = xu, Xu ERn•.
p = kl[Y(P) - X(w/p)]
tV = k2[N(w/p) - N]
r = k3[I(r) - S(X, r)]
where ki (i = 1,2,3) = constant positive speeds of response, and p = price. The
properties of the model are assumed to be as follows.
A.l Y'(p) < 0, X'(w/p) < 0(' indicates derivative)
A.2 N'(w/p) < 0
A.3 I'{r) < 0 < Sz and Sr > o.
The linearized system is x = Ax where
2. The model is globally stable. Take the Liapunov function v(x) == x' Bx, B
positive definite v{x) = x' Bx + x' Bx = x'{A + A')x < 0 where A is a stable
matrix by (1), and B = I, the equal weights case.
147
3. A sufficient condition for 6 > 0, i.e. for the existence of two distinct roots '>'1
and '>'2 ('>'3 = k(I' - Sr) < 0) is that both the aggregate output supply and
labour demand be decreasing functions of real wage (w/p) i.e. N' < 0 and
X' < O. It is easy to see that it is sufficient for 6 > 0 that a12 and a21 be of
the same sign, which implies N' < 0, X' < O.
4. The model has a stable node equilibrium. This can be seen by noting that
'>'3 < 0, ~(r ± ..[is.) < 0, ..[is. == vr2 - 46 < Irl, hence there are no complex
roots and no saddle points.
5. If the aggregate demand is vertical, i.e. Y' = 0, then.>. = [0, r, k3 (I' - Sr)], and
if I' = Sr, .>. = [~(r ±..[is., 0]: in either case there exist two negative and one
zero eigenvalues. The critical point is a stable node as can be seen by reference
to the Centre Manifold Theorem. Thus, so long as the aggregate Demand is not
verticle and/or the I and S have the same slopes, ns + nu + nc = 3 + 0 + 0, and
the flow is hyperbolic and the NL and L systems are topologically equivalent.
This equivalence is lost when Y' = 0 and/or I' = Sr.
A well known technique for finding a qualitative solution to the system x = f(x)
is the method of isoclines. This is particularly useful when the dimension is low and
f(x) is autonomous. In the plane, for example, we
x= f(x,y) (7.11)
iJ = g(x,y). (7.12)
we seek curves y = hI (x) or x = h2(y) such that the slope of the vector field
dy/ dx = c is a constant, i.e. g(x, y) = cf(x, y), the solution of which, for the various
c, gives the isoclines, i.e. curves on which the trajectories have the same slope c.
The main steps are as follows:
(i) Draw horizontal (H) and Vertical (V) manifolds H == {x, ylg = O}, V == {x, ylf =
O}. Trajectories cross H horizontally and V vertically. The points where H meets
V are the critical points or equilibria, and if (x*, y*) is one such, f(x*, y*) = 0 =
g(x*, y*).
(ii) Then evaluate the Jacobian [I"g" gIlI,,] at (x*, y*) and find its determinant 6,
trace r and discriminant 6 == r2 - 46, i.e . .>. = ~ (r ± vr2 - 46). The linearization
148
theory then says that the critical point is a source (sink) if fJ > 0, r > 0 (r < 0), a
SP if fJ < O. If fJ > 0 and a < 0, we have a spiral.
(iii) We can then draw the trajectories near the critical point, taking special account
of the 8Bymptotes ofthe hyperbolae when (x*,y*) is a S.P. Now dy/dx cannot change
sign on each of the region R of R2 - (H 0 V). Place arrows in on each side of H and
V. For example:i; > 0 * * *
f > 0, y > 0 9 > 0 and:i; < 0 f < 0, y < 0 9 < o. *
Then draw trajectories in the direction of these arrows, subject to the no-crossing
rule: trajectories cannot cross except at critical points.
(iv) If (j, g) is a gradient system (see Ch. 8), then no spirals or limit cycles can exist:
the critical points must be maxima, minima or S.P. This is because the Jacobian
matrix is symmetric and hence can only have real eigenvalues.
Example 7.9. Let the linearization of x = f(x) be AI where A = (-~ _~) i.e.
separatrix (inset)
The lines whose slope is the same as that of the phase space, i.e. X2 = CXl, cannot
be crossed by any phase curve: they either move to the origin (if c < 0) or away from
the origin (if c > 0), and are sometimes called the inset and outset, respectively.
They are the boundaries separating phase curves and are called separatrices. In our
example, their slope c is equal to the slope (2 + c) / (2c + 1), i.e.
2+c
- - = c::} c= ±1
2c+ 1
Y2
--------~~OM--E~-------Yl
\(
Figure 7.13. Y= Ay, A = (-~ ~)
When the system :i; = f(x), x E R2, admits periodic solutions which are rep-
resentable in the phase space as closed curves, we have a limit cycle (LC). These
are closed isolated trajectories exhibiting repetitive patterns of a stationary motion,
in contrast with those critical points which represent equilibrium states. A L.C. is
stable if it attracts, and unstable if it repels, neighbouring curves and semi-stable if
it is an attractor on one side and a repeller on the other (see fig. 7.14).
150
Definition 7.4. Given the vector field f(x) of x = f(x), x E R2, a point y is said
to be an w-limit point of x EWE R2 if liml-+oo (Pt(x) = y and an a-limit point if
liml-+_ oo (Pt(x) = y. (See Hirsch & Smale 1974).
The set of all w-limit points of x is called the w-limit set, L",(x). Similarly
La(x). In general, the limit set L(x) is the set of all limit points x E W. If x is
asymptotically stable, L(x) consists of a single point. A closed orbit is the limit
set of every point on it, i.e. if the closed orbit is a limit cycle, then 'Y C La(x) and
'Y C L",(x). Thus the limit set L(x) of'Y is non-empty, closed and connected. If
L(x) contains a regular point P (i.e. a point at which f(x) # 0) then the trajectory
'Y through P is a full trajectory and 'Y lies entirely in L(x). A limit cycle crosses a
transversal f in only one point. (See fig. 7.15).
151
Figure 7.15.
Theorem 7.5. Given the system x = P(x,y), i; = Q(x,y) there are no L.C. in a
region where (P", + Qy) is of the same sign.
{T ( dY dX) dx dy
= 10 P dt - Q dt dt, but P = dt ' Q = dt
Proof. Again, by contradiction, using Green's theorem (see Hirsch & Smale 1974).
o oFigure 7.16
152
Proof. Hirsch & Smale (1974) or Coddington & Levinson (1955). We shall not
reproduce this proof but rather illustrate it with an example.
Example 7.11. Consider the system (7.13) in the last example. The origin (0,0) is
clearly a critical points. Take R = {rl~ ~ r ~ n. By Theorem 7.6, it must enclose
a critical point. But this critical point must be excluded by Theorem 7.7. For r = ~,
I,
r > 0, the trajectory is winding out and for r = r < 0, it spirals inward, i.e. the
circle with r = 1 is the attracting L.C. R must thus be closed and bounded: any
curve 'Y starting in R is trapped in this region for ever: it could either be a closed
orbit (if'Y starts on the circle with r = 1) or approaches a closed orbit (r = 1 in our
example), as t -? 00.
Kaldor's (1940) Trade Cycle model, elaborated by Ichimura (1954) and analyzed
rigorously by Chang and Smyth (1971) provides an excellent application of limit
cycle.
Kaldor assumes that national income (Y) rises in response to the excess of in-
=
vestment I(Y, K) over saving S(Y, K) where both I(Y, K) and S(Y, K) are assumed
to be analytical functions of Y and capital (K), and I k by definition. Thus the
model is
€Y = I(Y,K) - S(Y,K)
k = I(Y,K) (7.15)
= a = a constant positive speed of response and 1,11/(= {)I j{)Y etc.), S,
° °
where €-l
S1/ > with S1/ - 11/ 0« 0) for Y < Y1 (Y > Y2) and 1- S = at Yo, Y*, Y3 with
>
Yo < Yi < Y* < Y < Y 2 3 (see fig. 7.17).
I,S S(Y,K)
I(Y,K)
At (Y*, K*), tr A = 0(11/ - 5,,) + J,. > 0, det A> 0: the critical point is unstable.
Consider the subset U of R2
which is the rectangle oyeR in fig. 7.18. Clearly the vector field on U points
inwards and (Y*, K*) enclosed in U is unstable. Any curve starting outside U must
eventually enter U, as shown by the arrows, and once in U, cannot leave U. Since
this critical point is simple (no zero eigenvalues since det A > 0), the linearized
154
system is either an unstable focus or node. Furthermore, any point starting in the
neighbourhood N2 E (K*, Y*) must cross the ellipse from inside to outside, and no
point would cross from outside to inside the ellipse. Hence the limit set is closed and
non-e,mpty. Furthermore, this limit set consists of regular points only (i.e. points
where Y '" 0 '" K). Thus, by Poincare-Bendixson's Theorem, either the trajectory
or its limit set is a periodic orbit. (QED)
There are large numbers of economic applications of L.C. such as Rose (1967),
Torre (1977), Schinasi (1982) Wolfstetter (1982) among others. Torre, for example,
examines the IS-LM Keynesian model where
where national income (Y) rises in response to the excess of Investment I(Y, R) over
saving S(Y, R) and interest rate (R) rises in response to the excess of money demand
L(Y,R) over money supply Ls. The I,S,L functions are assumed to have the usual
properties Ir < 0 < Iy ; Sy, Sr > 0, Lr < 0 < Ly and I, S are I, S are of a sigmoid
form (S-shape, like Kaldor's). Under these conditions, Torre showed that all the
Poincare-Bendixson's conditions are fulfilled and hence the above economy exhibits
a limit cycle.
L.C. theory is also used as a tool of investigation in many biological models but
space limitations do not allow us to go into them here.
Two best known examples of L.C. are the Lienard-Van der Pol's equations.
We shall briefly discuss them and show how they are applied in Economics. The
Lienard's equation is one of the form
± = y - g(x) (7.17)
iJ = -h(x)
where g(x) == J; g'(x) du (i.e. g'(x) = dgjdx) is an even function (i.e. g( -x) = g(x))
with
155
g(z)
----~------~r_--~--+_---z
g'(z) == dg/dz
-----+-\-+--+----1--I--+-++----z·
Proof. Levinson & Smith (1942), Coddington & Levinson (1955) Hirsch & Smale
(1974). The proof is involved, but we shall sketch it very briefly, leaving details
available in the above sources, among others, to interested readers.
First note that g(x) and h(x) are both odd functions i.e. if (x, y) is a solution, so
is (-x,-y), i.e. the phase diagram is symmetric about the origin. Take h(x) = x
for simplicity.
The slope of the trajectory
dy -x
=
dx y-g(x)
x=o
Figure 7.20. The Lienard's unique L.C.
Above x = 0 isocline, x > 0 and below it, x < o. Also iJ < 0 for x > 0, iJ > 0 for
x < 0, and iJ = 0 for x = 0, on the y-axis. Let us start from point P on the positive
portion of the y-axis where y > 0, x = O. To the right of it, in zone A, iJ < 0 < x
and hence the trajectory 1 moves in the south East direction, until it reaches the
x = 0(= y - g(x)) isocline say at Q, which it must cut vertically. Below the x = 0
isocline, in zone B, x < 0, iJ < 0, so 1 must move in the South West direction until
it meets the y-axis (y < O). which it must cut horizontally, say at R, to move into
zone C. By virtue of the skew symmetry of the vector field (x, y) = (y - x 3 + x, -x)
the curve 1 on RSPI is the mirror of the portion PQR, where PI is the first time
1 returns to the y-axis, considered as a transversal (see fig. 7.15). Thus, curve 1,
starting at P, can be considered a mapping O'(P} and on first return PI = O'(P}. It
remains to show that PI = P i.e. O'(P) = P, a fixed point, O'(P} mapping into itself,
iff P is on the L.C. In this case, the point of second, third, ... , return, P2 , P3 , .••• will
be the same point i.e. PI = P2 = P3 = ... = P iff we have a limit cycle. It can also
be shown that this L.C. is unique.
Kaldor's model in (7.14) could be turned into a single equation of second order
in the usual way, by differentiation Y and substitution in (7.14). This gives
(7.18)
which is a Lienard equation of the forms (7.15) except that h(Y, K} _ -0:(/,. -
S,.)/(Y, K) involves K. To apply Lienard theorem, K must somehow be eliminated.
One way to do this is to assume the production to be of Harrod-Domar fixed co-
efficient type, K = vY where v is the constant capital-output ratio. Another way,
suggested by Gabisch and Lorenz (1987), is either to assume that Investment is in-
dependent of K i.e. 1 = I(Y) or k = S(Y). In this case, (7.17) will be, in deviations
157
which is a standard Lienard equation, with g'(x) == a(z'-s') and h(x) == -az's(y). It
is easy to check that g'(y) < 0, s(y) ~ 0 depending on whether y ~ 0 and -z'(y) > 0
for all y, by assumption and hence g'(y)y == az's(y)y > 0 'fIz '" O. g(y) = J~ a(s'-
z') du = a(s - z) and lim!l~oo g(y) = 00 and J~ h( -a')s dy = a( -z')J~ s du -+ 00
as y -+ 00. Hence there exists a unique L.C.
A parallel analysis to the continuous DS above can be carried out for diffeomor-
phisms. The major differences between the two can be seen by referring to those
between differential and difference equations in Chapters 2 & 3 and 5 & 6 above.
Consider the map x 1-4 g(x), x ERn, 9 : Rn -+ Rn in (7.2)' above, whose
associated linear map, obtained by the linearization method (discussed in 7.2) about
X, is
Y 1-4 Ay, y ERn, A = Dg(x).
The eigenspace of A is composed of the stable EB, unstable EU and central EC
eigensubspaces where
Note that (i) W·, WU, E·, EU in the theorem are all local. Global counterparts
are unions of these. (ii) orbits of flows are continuous curves in Rn whereas orbits of
maps are discrete successions of points. E.g. p ~ g(p) ~ g2(P) etc. (see fig. 7.21).
The comparative features of flows and maps, the location of their eigenvalues
and their resulting orbits are summarized in Fig. 7.22.
+.
Orbits Diffeomorphism x ~ g(x)
If -$-
ImA
(a)
(b)
+.
*~ +.
ImA
ImA
-$-
(c)
-$-
~ +. -$
ImA
(d)
,( +. 4r
ImA
(e)
~
159
(iv) the non-wandering set consists only of fixed points and periodic orbits where,
for a given point p, a non-wandering set n is defined as a set of points in its
neighbourhood N such that the flow ¢h (N) n N =F </J or for map g, gn (N) n N =F
</J, for large t or n.
It will be recalled (sections 5.3 and 5.6 of Ch. 5) that if x(t) is any solution of
i: = f(x) i.e. f(x) = 0 or of any map x 1-+ g(x) i.e. gn(x) = x for n -+ 00, then
x(t) is said to be stable if solutions starting close to x(t} remains closed to x(t} at
all future times. If in addition, these solutions converge to x(t} as t -+ 00, then x(t}
is said to be locally asymptotically stable. If this convergence takes place from any
initial position, in other words, if the distance between x(t} and any other solution
"shrinks" over time, global stability obtains. This distance function is referred to
as the Liapunov function (see Definition 5.4 and Theorem 5.5).
When solutions are closed orbits, asymptotic stability refers to periodic stability
i.e. the property the orbit has to attract neighbouring curves. Thus, we have
160
Definition 1.5. If <Pt is the flow of ± = f(x) and the solution curve, C W (where
W is an open subset of RR) is a closed orbit of the flow, then, is said to be
asymptotically stable if
lim d[<pt(x),,] = 0
t-+oo
i.e. if the distance (d) between the flow <Pt(x) and the closed orbit, decreases to
zero when t -+ 00. The orbit, is then said to be a periodic attractor: , attracts
neighbouring solution curves to itself. Note that this definition also holds for maps.
It can be shown (see Hirsch & Smale (1974) p. 277) that if, is an asymptot-
ically stable closed orbit of period T, i.e. limt-+oo I<PT+t (x) -<pt(x)1 = 0 then, has
neighbouring .curves with asymptotic period T. Also if p E , and the linear map
D<PT(p) has n -1 eigenvalues of modulus < 1, then, is asymptotically stable. This
condition IAil < 1 also holds for maps.
Roughly speaking, two nearby flows or maps are said to be topologically equiva-
lent if they exhibit the same qualitative properties, for example Saddle points, sink
or source at some critical point x, and structural stability refers to the capacity of
the system to preserve its qualitative features under perturbation. More precisely
When does a dynamical system possess structural stability? For the two-
dimensional case, Peixoto has provided the answer which summarizes much of im-
portant earlier work.
(iv) the non-wandering set consists only if fixed points and periodic orbits.
Proof. (Peixoto 1962). The theorem should be clear from the above discussion
except (iii) which would require some elaboration. Suppose a flow f,.(x} or map
g,.(x}: Rn ~ Rn depend continuously on some parameter J.l E R. Then for (iii),
referred to as homoclinic, it is easy to show that.a slightest perturbation caused by
a change in J.l past some critical level J.lo, could break the saddle connection into
a focus or periodic orbit (see fig. 7.23) and thus change the system's topological
structure.
7.8. Conclusion
8.1. Introduction
In this chapter, we shall examine some important dynamic systems (DS) which
are widely used in many fields such as Economics and Biology. These are the Gradi-
ent Dynamic Systems (GDS), Lagrangean Dynamic Systems (LDS) and Hamiltonian
Dynamic Systems (HDS). We shall briefly discuss the major characteristics of each
system in turn and present some applications.
Let U be a subset of R" and V(x) : U - R be a function from the state space of
n dimensions to the real line, called a potential function (in reference to the potential
energy in Physics). Assuming V(x) is twice differentiable, then the gradient of V(x)
Theorem 8.2. At regular points (where f(x) '" 0), the trajectories cross level
curves V(x) = c; (i = 1,2, ... ) orthogonally. Critical points x* at which f(x*) =
are equilibria. Isolated minima are attracting basins and are asymptotically stable.
°
Proof. Hirsch & Smale (1974).
Theorem 8.3. At a critical point of a GDS, all eigenvalues are real, which excludes
spiralling nodes and limit cycles.
Proof. This follows from the fact that Df(x) = H(V) = Hessian of V(x), which
is a symmetric matrix. Symmetric matrices can only have real eigenvalues (see Ch.
4). Hence no periodic solutions are possible. (QED)
This is a very useful result: in a DOS, we only need to look for maxima, minima
and saddle points. No limit cycles, no fluctuations are possible since the trajectories
always seek minimum points which they reach as fast as possible.
Df(x) =_[
-
{)2V ] =
{)Xi{)Xj -
[-V1l
- ~l
-V12]
- ~2
= [ -3x~ + 6Xl -
° 2 °]
-1
where V;j == {)2V/{)Xi{)Xj (i,j = 1,2). At the three critical points x l ,x2,x3, Df(xi )
are Df(O, O) = [-~ _~] j Df(l,O) = [~ _~] j Df{2,0) = [-~ _~]. Clearly
V(x) has a minimum at xl = (0,0) and x3 = (2,0) separated by the saddle point
° °
x2 = (1,0) which has a maximum in the xl-direction at Xl = 1 and a minimum in
the x2-direction at X2 = i.e. X2 = everywhere on the x - V plane (see fig. 8.1.(a)
and 8.1.(b».
°
It can be seen that at these 3 critical points, the level curves V(x) = c; (i = 1,2,3)
are V(O,O) = Cl = 0, V(2,0) = C3 = and V(l, 0) = C2 = 1/4. In this example, the
phase space has two attracting basins at (0,0) and (2,0) which are separated by a
saddle point which is a ridge at (1,0). The insets (sets of all initial points which
eventually end up at an equilibrium point) are the stable arms of the SP. They act
as separatrices to separate the two attracting basins.
Note also that the gradient vector field, -grad V(x), is orthogonal to the level
curves V(x) = c; where c; (i = 1,2, ... ) are the constant heights of V(x) at the
various places (five such heights, are shown in fig. 8.1.{a). Fig. 8.1.{b) is the
projection of fig. 8.1.(a) on the Xl-X2 plane.
v(x) 165
Example 8.2. Consider the typical problem of minimizing production cost (w,x)
subject to a given output level q = J(z) : If' -. Rj (w,z) E If' where z is an
n-vector of inputs and w its constant rental (or wage) vector. The problem is one
of constrained cost C (z) minimization, i.e.
C(z) = w'x + ~[q - J(z)]
where ~ is the usual Lagrange multiplier. The dynamic law is
:i: = -grad C(x) = -(w - ~f')
i.e. factor hiring moves in the direction of decreasing costs. Theorem 8.1 gives
6(x) = DC(x):i:
= (w - ~f'), -(w --: V'))
= -(w - ~J')2 :5 O.
The movement of x stops when 6(x) = 0 i.e. when w = ~J', the usual equilibrium
conditions. It is easy to verify that second order conditions are fulfilled.
166
Example 8.3. Consider the problem of profit maximization II(x) = pf(x) - w'x
where P = constant output price, x E R" is the input vector and w E R" its rental
vector assumed constant, f( x) is the usual concave production function f" < 0 < f'.
The dynamic law is
x = grad II(x) = pf' - w
i.e. factor hiring moves in the direction of profit increase. Theorem 8.1 gives, for
this maximization problem,
i.e. factor hiring continues so long as iI( x) > 0 and stops when iI( x) = 0 => PI' = w
i.e. when the value of marginal factor product (PI') is equal to factor rental (w) :
II(x) then reaches its maximum. This is easily verified by examining the Hessian of
II(x) i.e. H(II) = Pi" is negative definite for any concave production function f(x).
p = - Yp = -(I - A)q
q= Vq = (J - A')p.
167
d 8L 8L .
-d -8' - -8
t qi q;
=0 (z = 1,2, ... ,n) (8.2)
(8.3)
which is
by integration by parts. Since there are no variations at the end points i.e. at t =0
and t = T, h(O) = 0 = h(T), substitution of (8.6) into (8.5) gives
This is the necessary condition for an extremum of (8.4). More formally, we have
Proof. Suppose g(t) -:F 0, say g(t) > 0 in [0, T]. Then by continuity, g(t) > 0 for
some interval [a, b] in [0, T]. Let h(t) == (t - a)(b - t) \:It E [a, b] and h(t) = 0 "It ¢
[a, b]. (See fig. 8.2.) Clearly h(t) satisfies all the conditions of the Lemma. But then
If g(t)(t - a)(b - t) dt -:F O. This contradiction proves the Lemma. (QED)
h t)
--~O--~a--~--~--t
Theorem 8.4. The curve "y : q = q(t) is an extremal of J in (8.4) on the space of
curves passing through two fixed points q(O) = qo and q(T) = qT if it satisfies the
Euler equation
d
-L·-L
dt q q
=0 (8.8)
Proof. The necessary condition is oj = 0, which by (8.5) and (8.6), is (8.7). Iden-
tifying the expression in the square brackets of (8.7) with the function g(t) of the
Lemma completes the proof. Thus, we must have for (8.7), the Euler equation
d
dtLq - Lq = 0 (8.8)
Theorem 8.S. The LDS in (8.2) and the HDS in (8.9) are equivalent to each other.
q= H"
Hq = -Lq =-p (8.10)
Ht = -Lt == -oL/at
where Hq == (:~, ... , ::~.) etc ... , and Hq = -Lq = -p follows from the Euler
it
Lagrange equation (8.8) where p == Lq = Lq • (8.10) is called the canonical form of
the Euler equations. (For further details, see Tu 1984 or 1991 p. 72). Hence if q(t)
satisfies the LDS then (p(t), q(t» satisfies the HDS. Similarly the converse could be
proved.
Example 8.5. Optimal Economic Growth model. Consider the neo-classical eco-
nomic growth of Swan (1956) and Solow (1956), optimized by Cass (1966) and
others. It consists of optimizing the functional J(k) where
where
e(t) = f(k) - >.k - k, k(O) = ko (8.12)
u(e) == u[f - .U - k] is the consumption utility function, increasing
and concave, i.e. u"(e) < 0 < u'(e)
f(k) = per capita output, f" < 0 < f'
A = constant depreciation (p) and population growth (n) rates,
i.e. A = p + n, constant
k == dk/dt = per capita investment.
Defining u(e)e- 6t == u(f - >.k - k)e- 6t == F(k, k, t).
(8.13)
Euler-Lagrange equation (8.1) or (8.8) gives
d d
0= FIc - dt Fi; = e- 6t [u'(e)(f' - A) + dt u'(e)]
where p(t) == qe- 6t = the co-state variable and k(t) is the state variable, representing
the stock of capital per worker. Note that if utility is a linear function, i.e. u(e) = e,
then (8.15) is simply
170
which is the discounted per capita GNP, composed of the value of consumption
(c) and of investment (k) measured at price qe- 6t == p(t) in terms of the price
of consumption (c) taken as a numeraire. Thus p(t) == q(t)e- 6t is the generalized
momentum, i.e. p(t) = 8L/8k where k is the generalized coordinate (with k == dk/dt)
of this economic system.
The HDS given by (8.15) is
which is exactly (8.14). Thus the LDS and HDS are equivalent to each other: they
give the same results.
Proof.
I.e.
if == dH(~~q,t) = 88~ == Ht • (QED) (8.19)
A CHDS is a vector field: at each point (p, q) of the phase space, there is a
2n-dimensional vector X h == (-Hq , Hp) which is called the Hamiltonian flow, i.e.
putting x == (q,p) and grad H == (Hq, Hp),
where
Note that (8.20) is just a compact way of writing (8.3). Note also that J' = J-1 =
-J where J' is the transpose of J, and JJ = J2 = -I2n • Finally, note that (8.20)
looks a bit like the gradient system (8.1) but it is very different. While (8.1) is an
inner product, (8.20) is a symplectic form. Without going into Symplectic Geometry
and Lie algebra, (see Arnold 1978, Abraham and Marsden 1978 for example), we
note simply that the sympletic area of the parallelogram defined by two vectors u
and v is given by the skew product
ul\v=(Ju).v (8.22)
and
~(Ul\v) = (Ju)·v+(Ju)·v
= (JJHu).v+(Ju)·(Jv)=O (8.23)
Proof. Linearizing the Hamiltonian flow X h about its equilibrium point where
Hq = 0 = Hp set at the origin for simplicity, which amounts to approximating
the Hamiltonian function H(q,p) by its quadratic form ~ (x'fix) where x == (q,p),
we have
x=Xh=O+JHx==Mx (8.24)
172
where JH is called the Hamiltonian_matrix M. (In our notation, H(x) is called the
Hamiltonian function and M(= J H) is called the Hamiltonian matrix). Written
out in full, x = Mx in (8.24) is
(8.24')
-X. • ,\ =0 + ifJ
--------+--------0
--------+--------0
It is clear, from this double symmetry, that simple HDS (i.e. detM ¥ 0) with
one degree of freedom must fall into of the first two categories. This fact could be
formulated as a Theorem (which is just a Corollary of Theorem 8.6), as follows:
Proof. Linearizing the HDS about the equilibrium point (q*, p*) at which Hr. = 0 =
H q , we have, as in (8.24)
(8.24)
or, in full
'""
[ pIi] ,.., [a b ] [ q - q* ] _ [ Hpq Hpp ] [ q - q* ]
-c -a p-p* ...., -Hqq -Hqp p-p*
174
Note that this corollary is a typical case in the Optimal Growth literature where
H(q,p) is concave in capital (q) and convex in shadow price (p). This is a key
assumption of Hamiltonian Economics (see Cass & Shell 1976).
subject to
k= f(k) - H(t) - e(t).
This gives rise to the Hamiltonian function H(k, q, t)
k = Hq (8.25)
q = -Hi< + Sq(t).
175
or
(8.26)
where A == Hqlc(== 8 2 H/8q8k), B == Hqq , C = -H"". A,B,C are (n x n) real
matrices, with Band C both positive definite by the usual assumption that H(k, q, t)
is concave in k and convex in q, In is an (n x n) identity matrix and 0 a real discount
parameter.
Writing the matrix in (8.26) as
(8.27)
!
where A == A - In, M6 == [~ -i] , Mo == [~ _.!] .
Clearly M6 and Mo
(0 = 01. the undiscounted case) are Hamiltonian matrices having a symplectic form
like JH in (8.24) above, and hence possess the double symmetry property proved
in Theorem (8.6) and analyzed in fig. 8.3 above. Thus, the spectrum (or the set
of eigenvalues) of M6, are simply the spectrum of M6 shifted through 0/2, (see fig.
(8.4)), i.e. from (8.27),
1m I'
SP SP
tI tI
ReI'
-6/2 6/2
I instability
I
corridor
let A and II. be an eigenvalue of M6 and M6 respectively, then A = II. + 0/2. But
M6 being a Hamiltonian matrix, has the double symmetry properties i.e if II. is an
176
eigenvalue of £16 , so are -1', p. and -p.. Writing I' == ±a ± i{3 in general, we have
>. = I' + 6/2 == ±a ± i{3 + 6/2
= ±a + 6/2 ± i{3 (8.28)
= Re (>.) + 1m (>.).
By shifting to the right through 6/2, it is easy to see, from fig. 8.4, that Re (>.) ==
±a+6/2 > 0 for 10'1 < 6/2 and Re (>.) have opposite signs for 10'1 > 6/2, as has been
pointed out by Kurz (1968). Thus, the perturbed Hamiltonian matrix M6 no longer
possesses the double symmetry of Mo. The distance 6/2 from the imaginary axis in
fig. 8.4 can thus be called the "instability corridor" and Kurz's (1968) theorem may
be formulated, by summarizing the discussion above, as follows.
Theorem 8.9. The PHDS x = M6X in (8.28) has the saddle point properties (SPP)
iff £16 has no pairs of eigenvalues inside the instability corridor. If it has, Re (>.) > 0,
the PHDS is unstable and disintegrates. Furthermore, it is not possible for (8.26)
to have all eigenvalues lying on the imaginary axis.
Proof. As above except for the last sentence. For all eigenvalues of M6 to be on
the imaginary axis, Re (>.) must be zero, i.e. -a + 6/2 = 0 = 0'+6/2 which is
impossible so long as 6 > 0, as has been pointed out by Kurz (1968).
HDS made its appearance in Economics literature in the 1960's when dynamic
optimization was formulated as a standard optimal control problem and the appli-
cation of Pontriagin's Maximum Principle (1962) naturally leads to the formulation
and solution of some Hamiltonian function H(q,p) with p = -Hq and q = Hp. By
1970's, the Hamiltonian approach to dynamic Economics became commonplace (see
Cass and Shell (eds.) 1976). Applications range from economic growth theory, fluc-
tuations, capital theory, dynamic profit, intertemporal production and consumption
plans, foreign investment, resource allocation, pollution, natural resources, portfolio
allocation, optimal financing and advertising, to name only a few. This approach
has attracted the attention not only of economists but also mathematicians like
Rockafellar (1976).
As an illustration of the Hamiltonian representation of the economy, take a typ-
ical optimal economic growth problem of maximizing intertemporal discounted (at
rate r) consumption (c) utility u(c) (u" < 0 < u'(c)) subject to some capital accu-
mulation law k = f( k) - nk - c where k is capital and f( k) is per capita production
function with f"(k) < 0 < f'(k). Application of Pont riagin's (1962) Maximum Prin-
ciple leads to the formulation of the Hamiltonian function H (k, q, r) = sup c { C + qk}
177
where u(c) = c, which is precisely net national product (NNP), consisting of con-
sumption (c) and investment values (qk) where the discounted price (q) of investment
is in terms of consumption good (c) taken as a numeraire. Consumption (c) must be
chosen such as to maximize NNP over time. The HDS gives k = H" = f( k) - nk - c
which is the economic definition of net investment which is per capita production
f(k) net of population growth requirement (nk) (n is a constant population growth
rate) left over after consumption needs (c) are met. q = -Hie = -[f'(k) - n - 6]q
states that the returns to capital f'(k), net of population growth rate allowance
(n) and depreciation (6) are equal across capital goods. The existence of (k, q) is
guaranteed by Pontriagin's (1962) theorem and"its stability has been thoroughly
investigated, as has been seen in section 8.4.2 above. The properties of technol-
ogy as characterized by the Hamiltonian function H (q, p, r) reflect the competitive
process of individual pursuit of self economic interest in capitalist societies (descrip-
tive HDS) as well as planners' objective in command economies (optimal planning
models). The double axi-symmetry of HDS ensures that, in the absence of discount
(r), so long as no eigenvalues lie on the imaginary axis, the economy possesses a
Saddle Point equilibrium with the stable and unstable manifolds having exactly the
same dimensions. The introduction of a future discount rate (r), treated as the only
parameter, in the "perturbed" Hamiltonian system (PHDS) destroys its double axi-
symmetry and the conditions for the Saddle Point property to be maintained arise as
natural research questions. If at certain critical discount rate ro, a pair of eigenval-
ues hits the imaginary axis with nonzero speed, orbits are born: the economy starts
spinning and generates business cycles: the HDS undergoes a Hopf bifurcation (see
Chapter 9). Thus, not only local and global stability but also structural stability
can be fruitfully investigated in the framework of HDS. Other applications also have
similar economic meanings and interpretations specific to each problem have been
formulated.
The role of GDS in Economics is more subtle, as has been noted above (see
examples 8.2, 8.3 and 8.4): it is seldom spelled out but it implicitly pervades eco-
nomic modelling. Both the Hamiltonian function which gives rise to the HDS and
the potential function which gives rise to GDS are economists' objective functions
and yet the HDS and GDS are vastly different mathematically, as has been noted:
one is of a symplectic form, the other is an inner product, a natural queStion is
"what is the relationship between them?" Perko (1991) has provided the answer: a
dynamic system is a HDS with n degrees of freedom iff the dynamic system which
is orthogonal to it is a GDS in R2n.
As an illustration, consider Goodwin's (1987) Leontiefinput-output model where
the net profit function II(q,p) = P'(I - A)q where q E R:' is the n-output vector and
pER:', its price vector, A = [aij] = input-output coefficient matrix with aij == 'if
=
178
minimum quantity of output i required to produce one unit of output j, 0 ~ a'i < 1,
a'i constant, (i,j = 1,2, ... , n). Thus profit 7r == p'q - p' Aq = total revenue less
total cost.
Writing grad 7r(p,q) as grad 7r(q,p) == (7rq ,7rp ) and spelling out market laws in
models I and II separately, we have, as in example 8.4 above,
Model I. P = -7r q = -(I - A')p: if price exceeds costs i.e. p > A'p, p will be
driven down by competition.
q = -7rp = -(I - A)q: if demand Aq exceeds supply q production q will be
increased.
Model II. P = -7rp = -(I - A)q: if demand Aq exceeds supply q, i.e. Aq> q price
will be driven up by competition.
q = 7rq = (I - A')p: if price exceeds costs i.e. p > A'p, profit maximizing firms
will step up production q.
The PHDS with two degrees offreedom (n = 2) has been applied in Economics
under the headings of two-state-variable, or two-sector models, by Uzawa (1968)
in the context of optimal economic growth, Brock and Sheinkman (1977) in the
context of Stability, by Pitchford (1977) who raised some theoretical problems and
lately by Dockner (1985) whose important work has been applied to various eco-
nomic problems by Wirl (1991), Dockner, Feichtinger and Novak (1991) Dockner
and Feichtinger (1991) among others. In this section, we shall show how the double
symmetry and spectrum shifting analyzed in the last section could provide a simpler
179
approach to the problem and bring about Dockner's results in a simpler way. For
further details, see Tu (1992).
Dockner's problem of minimizing
10'>0 e- 6t F(x,y,u)dt
subject to x= f(x,y,u), iJ = g(x,y,u) leads to the current valued Hamiltonian
function
H(x,y,)..,p) = max{F(·)
u
+ )..f(·) + pg(.)}
where (x,y) = state variables, ()..,p) = co-state variables, u = control variable and
S = discount rate, the only parameter under consideration. Pontryagin's Maximum
Principle (see Ch. 10) gives the HDS
x =H>.
iJ = HI-' (8.29)
~ = S)" - Hz
The linear HDS, obtained by linearizing the above system about the critical point
at which x = 0 = iJ = ~ = p. gives, as in (8.26)
z=M6z (8.30)
= rCA ~, ] + ~ [ ~ J ] 2
=M6 + -14
•
2
S
h M• -
were 6 = [ CA" -.k
B ] =- [6
A - 2"
C
[2 B
_ (A' _ ~ [2) ] and = [ H#~
A _ H).~ H).~
H#~ ] .
,
where K == C3 - 83 • This gives the explicit value of ~ in (8.28) i.e. the eigenvalue
quartet
8
~=-+",
-
2
where ",3 == (~r ~ ± ';K2 - 4c•.
We are now going to show how the double symmetry and spectrum shifting
(through 8/2) above can facilitate the solution and provide Dockner's results.
First, by shifting the spectrum of M6 through 8/2 to obtain the spectrum of M6,
it can be seen, from fig. 8.4, that the axis of symmetry (which was the original
imaginary axis) is shifted to the right by 8/2 and hence 8/2 is the new axis of
symmetry. This is Dockner's Corollary 1.
Also by spectrum shifting and bearing in mind at all times the double axi-
symmetry property of M6 , we have Dockner's Theorem 2, which says: if 8> 0, we
have either (i) "all Re (~) > 0", corresponding to our case of both eigenvalues pairs
lying inside the instability corridor;
or (ii) "three eigenvalues of M6 have a positive, and one has a negative, real part",
corresponding to our case where M6 has a pair of eigenvalues inside, and the other
pair, outside, the instability corridor, all four lying on the real axis, symmetrical
with respect to the imaginary axis;
or (iii) "two eigenvalues of M6 have positive, and the remaining two have negative,
real parts," corresponding to our case where the complex eigenvalues quartet lies
outside the instability corridor, and symmetric with respect to both axes.
and (iv) "no eigenvalue of M6 has real part zero". This follows from his proof in
which the equality signs (Dockner 1985 p. 101) are overlooked. It is easy to see
that if Re ",(== ±a) lies on the boundary of the instability corridor, i.e. ±a = 8/2,
then on shifting to the right through 8/2, two eigenvalues will land on the imaginary
axis, each on one side of the real axis. Thus is the onset of Hopf bifurcation. In fact
it is not difficult to show this by explicit computation. The double symmetry and
spectrum shifting approach saves us from such computations.
Finally, Dockner's results that" det M6 < 0 iff one eigenvalue is negative and the
remaining three are positive" can be seen by noting that det M6 = n~ ~; and the
alternative of one positive and three negative eigenvalues, which also gives det M6 <
0, is ruled out by the rightward, not leftward, shifting of the spectrum of M6 .
Thus the double symmetry and spectrum shifting approach, simple as it is, is
insightful and useful for this type of PHDS in Economics.
Dockner's model above has been used to investigate various economic prob-
lems involving PHDS with two degrees of freedom, such as the theory of rational
addiction (Wirl1991), complementarity over time (Dockner and Feichtinger 1991),
optimal saving and externalities (Wirl1991), population growth and Easterlin Cycles
(Feichtinger and Dockner 1990), production and marketing (Dockner, Feichtinger
and Novak (1991) and others. Space limitations do not allow us to treat readers
with the results obtained.
181
8.6. Conclusion
In this chapter, we have discussed the CDS, LDS and HDS, their meaning,
derivation and some applications, in a simple manner. The CDS has not been
applied in Economics explicitly under this name, but economic behaviour clearly
indicates its presence underneath economic calculations and behaviour. The LDS
and especially the HDS are more familiar features in Economics, especially in the
context of Optimal Economic Control which will be briefly examined in Ch. 10.
The dynamics of these systems could be quite complex when some key parameter
reaches some critical level which causes the system to lose its stability. This is the
subject matter of the next Chapter where Bifurcation Theory, Catastrophe Theory
and Chaos will be introduced.
Chapter 9
9.1. Introduction
(9.1 )
Suppose the flow CPt generated by (9.1) is T-periodic, i.e. cp(t + T, xo) = cp(t, xo) and
the cross-sectional map ~ of dimension n - 1, transversal to the vector field, the
Poincare map P(x) : V C ~ - t ~ associates point x in V with its point P(x) of first
184
(a) (b)
The point Xo which P(x) maps to itself, i.e. P(xo) = Xo is the fixed point. The global
stable W;lb) and unstable W~('Y) manifolds as well as their local stable W-b) and
unstable W"b) can be visualized geometrically as well as their periodic orbit 'Y (see
fig.9.1b).
=
W;I('Y) U<p,(W-(xo))
t
Example 9.1.
x = -y + x(1 - x 2 _ y2)
(9.2)
iJ = x + y(1 - x 2 _ y2).
In polar coordinates (putting x = r cosO, y = r sinB x 2+y2 = r2(cos2 B+sin2 B) = r2
and differentiating 2rr = 2xx + 2yiJ and substituting, etc. (see Chapter 5, section
5.7), (9.2) is
r = r(1 - r2 ), r(O) = ro
(9.3)
(B) = 1, B(O) = Bo.
The solution, obtained by separable variables or Bernouilli equation (see Ch. 2,
section 2.1.3), is
Let T = 211" be the time of first return to E of the orbit t/>t(x) based at x, then the
Poincare map P(x) = t/>T(X) in our example is
(9.5)
185
Thus the point x with ro = 1 is a stable fixed point and 'Y is a stable closed orbit.
Poincare map has also been extended to homo clinic orbits to study structural
stability, Melnikov function, suspension of diffeomorphism, and discrete systems in
general. These, however, will not be discussed. For details, see Arrowsmith and
Place (1990) or Wiggins (1990):
Theorem 9.1. (Floquet.) Each fundamental matrix <I>{t) of the T-periodic sys-
tem
:i; = A{t)x, x{O) = xo (9.6)
where A{t) = A{t + T), T fixed, and x E Rn can be written as
<I>{t) = P{t)e Bt (9.7)
where P{t) is T -periodic, i.e. P{t) = P{t + T) and B is a constant n X n matrix.
(9.9)
It is easy to show that the fundamental matrix is T-periodic, i.e. <I>(t) = <I>(t + T).
Let <I>{t)e- Bt == P{t).
Definition 9.1. C in (9.9) is called the monodromy matrix of (9.6). Its eigenvalues
p are called characteristic multipliers and A such that
P= e>.T (9.10)
is called characteristic or Floquet exponent. Let A be chosen such that they coincide
with the eigenvalues of B, i.e. (B - M)u = 0 and (C - pI) v = 0, u ::J 0 ::J v. C = eBT
in (9.9) gives (in canonical form V-I BV = diag(Pi) and T-ICT = diag(Ai) assuming
Band C are both simple matrices)
(9.10)'
Note that the fundamental matrix (9.7) is a product of a periodic matrix P(t) =
P( t + T) and eBt . This leads to the following Corollary.
Corollary 9.2. The periodic system:i; = A(t)x in {9.6} is equivalent to the con-
stant coefficient system
i; = By (9.11)
Proof. Using the transformation x = P(t)y, we have
:i; = Py + Pi; = A(t)P(t)y
i; = P-I(AP - p)y. (9.12)
Differentiating P(t) = 4.>(t)e- Bt in (9.7) gives
P= <Pe- Bt - 4.>e- Bt B
= AP-PB (9.13)
since <P = A(t)4.> by definition and
<Pe- Bt == 'A(t) 4.> (t)e- Bt = AP.
Substitution gives
i; = P-I(AP - AP + PB)y
= By (QED) (9.11)
Thus Floquet theory brings about an important simplification. Solving system (9.6)
amounts to solving (9.11) which is much easier. It can be seen that a necessary and
sufficient condition for asymptotic stability is that characteristic exponents Ai have
negative real part, or equivalently Pi have modulus less than 1. '
Theorem 9.3.
det C = IT Pi = ef tr A dt (9.14)
i
n
tr B = ~ Ai =
1 (T
T 10
(21Ti)
tr A dt mod T (9.15)
187
(9.16)
and the results follow from the definitions of characteristic multiplier and
exponents. (QED).
Remark 9.1. w = ef;tr AdT is well known (see for example, Grimshaw (1990), p.
28). This could be derived very simply by differentiating ~~ == ft(det ~(t)) using
property P.8 of determinants in Chapter 4 and substituting <i> = A~. This gives tV =
r(t)w whose solution is w(t) = woef;T(S)ds where r(t) = tr A and w(t) = det~(t).
E'
----===--::+........::;:::::---_.,
~;:....- _ _ w·
The Centre Manifold Theorem (CMT) extends this to the non-hyperbolic cases
in which some eigenvalues are zero (or have zero real parts) i.e. the spectrum of A
where A = D f(x) is split into u., Uu and U c (s + u +c = n) with their corresponding
eigenspaces E·, EU and EC (Rn = E6 $ EU $ EC), where uC is the number of "central"
eigenvalues i.e. those which lie on the imaginary axis and EC the central eigenspace
spanned by U c (see fig. 9.3). The CMT reduces the investigation of the dynamic
189
system to the dynamics of the centre manifold WC(x) at its point of tangency with
EC.
Proof. Carr (1981). Marsden and McCracken (1976), Guckenheimer and Holmes
(1986).
--~~--~~~----~ __----x
I
Figure 9.4. The Central Manifold
x = Ax + I(x,y)
(9.18)
iI = By + g(x, y)
where x ERn, y E Rm, 1(0) = 0 = 1'(0); g(O) = 0 = g'(0) where 1'(0) == DI(O)
g'(0) == Dg(O), A is an n x n "central" matrix (i.e. with eigenvalues lying on the
imaginary axis), B is an m x m stable matrix (i.e. with eigenvalues having negative
real part). Assume for simplicity, the absence of Wu. Since the centre manifold we
goes through the origin (i.e. h(O) = 0) where it is tangent (i.e. h'(O) = 0) to ~ (i.e.
the x-axis where y = 0, see fig 9.3 and 9.4), it can be represented as
With h(x) substituted into (9.18), the flow on we is governed by the reduced
n-dimensional system
x = Ax + J[x, h(x)] (9.20)
Carr's (1981) first theorem asserts that the full system (9.18) and the reduced system
(9.20) are equivalent: if one is stable, so is the other, and vice versa. Since (9.20)
contains all the informtion needed, it alone need be investigated. This constitutes a
considerable reduction of dimensions.
To compute h(x), note that h(O) = 0 = h'(O) implies that h(x) must have neither
a constant nor a linear term i.e. its Taylor expansion must start with quadratic and
higher order terms, h(x) = o:x 2 + f3x 3 + .... Substituting h(x) into (9.18) gives
(9.21) with the "initial conditions" h(O) = 0 = h'(O) must be solved for the centre
manifold. But this is a formidable task, so that we must be approximated by study-
ing some neighbouring functions ¢ : Rn -t Rm which are C l in the neighbourhood
of the origin.
and Carr's (1981) Theorem 3 asserts that h(x) and ¢(x) are "close" to each other.
More precisely,
Proof. Carr (1981), Theorem 3. Note that we have written /,(0) and ¢'(O) for
D f(O) and D¢(O) to indicate the Jacobian of f(O) and ¢(O) at x = 0, to simplify
notations unless confusion arises.
x = xy + ax2 + by2 x
(9.23)
iJ = -y + cx 2 + dx 2y
the center manifold h(x) can be approximated by
°
Equating the coefficients of equal powers of x on both sides, will give, after some
calculations, a = c, (J = so that h(x) = cx 2 + 0(lxI 4 ). Substitution into (9.23)
gives
± = cx3 + ax 3 + bc2x 5
= (a + c)x 3 + 0(x 5 )
°
a+c = 0, higher order terms must be computed, which give ± = (cd+bc2)x 5 +O(x7 )
which is stable if cd + bc2 < and unstable if cd +" bc2 > 0. Carr also showed that the
CMT carry over to maps (Carr (1981), p. 35). Finally note that Carr has warned
us that while WS and WU are unique, we is not.
Thus, it can be seen that the CMT provides an important simplification by
reducing dimensions.
± = f(x)
= Ax + fr(x) (r 2: 2)(Taylor expansion) (9.24)
where A = D f (x), f m (x) = monomials of the form xm == X;"l x2'2 ... x;:''' with
Em; = m, and fr(x) == h(x) + fa(x) + .... Clearly fm(>'x) = .Am f(x) i.e. fm(x) is
a homogeneous function of degree m.
If (9.24) could be brought to the form iJ = Jy where J is the Jordan form
(see Chapter 5), we are done. Unfortunatley this is almost too much to expect:
the reduction of (9.24) almost always involves monomials (such as xlx~, a,(J 2: 1)
which could not be removed by coordinate changes, and the Taylor expansion will
be of the form
(9.25)
where hr(y) E Hr, the space of homogeneous polynomials of degree r (r ~ 2), for
example hm(y) = y;"lY2'2 ",y;:''', with Em; = m, m 2: 2.
The procedure is to use the near-identity transformation
In practice, the number of active variables is usually very small. Writing active
variables x a as y to avoid superscripts, i.e. x a == (Xn-m+h . .. ,xn) == (Yh Y2, ... ,Ym)
and taking m = 1 for simplicity of exposition, i.e. x a = y, Taylor expansion of W
gives
W(xa,J.t) == W(y,J.t) = 0 + 0 + 0 + 0 + ay4 + by2 + cy + H.O.T.
h
were - I B2 W
a = 4f By. etc ....
The gradient dynamic system (see Chapter 8) is now
:i; = -grad W(xa, J.t).
R n = ker A E9 X 2 == Xl E9 X 2
R n = R(A) E9 Y2 == YI E9 Y2 •
Note that dimXI = 1 = dimY2 ¢:}rankA = n -1 and dimYI = n -1 = dimX2
(see Chapter 4). Let P be the projection of Rn onto R(A) i.e. P: Rn ~ YI with
ker P = Y2 and Range R(I -A) = Y2 and ker(I -A) = YI . Thus Df(x) is Fredholm
i.e. of dim Xl < 00, dimYI < 00. Suppose (xo,J.to) = (0,0). Clearly for any u ERn,
U = 0 iff Pu = 0 and (I - P)u = o. Thus
F(XI,X2,/1-) == Pf(XI,X2,/1-).
Solving (9.29a) by the Implicit Function Theorem gives a unique X2(XI,/1-) with
X2(0,0) = O. Substituting into (9.29) gives
which is obtained by substituting X2(XI, /1-) into (9.29b). Thus f(x, /1-) = 0 iff f[Xl +
X2(Xl, /1-), /1-] = o.
The reduced equation f(x, /1-) : R x R -+ R has all the information needed
and thus the n-dimensional DS has been reduced to the equivalent one-dimensional
equation which represents the utmost simplicity. This is the result of a clever use
of the Implicit Function Theorem in a situation where it appears, at first sight, to
fail i.e. where the critical point of the DS is non-hyperbolic.
It can be seen that the various simplification techniques discussed in this chapter
are very powerful indeed. However, their application would require the equations
of the DS to be given in explicit form, which would seldom be the case, especially
in pure Economics, and hence the discussion of these various approaches may seem
to have little economic relevance. It is true that, except in advanced theoretical
Economics papers, these tools are not widely known among Economics students
and their application is not widespread, or at least not yet. However, it is precisely
for this reason that Economics students should be exposed to them and have them
in their tool box, in view of their importance and usefulness.
On further reflection, however, the simplification methods discussed in this chap-
ter have actually been implicit in Economics reasoning and model building. They
provide a logical foundation and justification for reducing economic systems in-
volving thousands and thousands of variables to just one, two or three dimensions
where figures a small number of essential economic variables. The ceteris paribus
assumptions made to justify these reductions can now be implicitly based on these
simplification theories. Indeed, there is no need to know the precise and explicit
forms of each function in the DS: so long as we are aware that, theoretically at least,
these complicated models can be simplified, and furthermore their reduced forms
are topologically equivalent to the full models and could be brought to the various
standard reduced forms provided by each one of the above approaches, then valid
conclusions can be drawn from qualitative economic analysis where equations are
brought into the standard forms of the various approaches discussed in this chapter.
Chapter 10
Bifurcation, Chaos and Catastrophes
in Dynamical Systems
10.1. Introduction
B.T. refers to the branching of solutions at some critical value J-lo of parameter
J-l, causing a loss of structural stability.
B.T. goes back to Euler's buckling of columns in the 1870's but the most
explicit results are perhaps presented by Poincare (1885). Poincare investigating
the equilibrium of a system f(x,J-l) : Rn+l -+ Rn obtained by solving 8f!8.7:i = 0
giving xi = xi(J-l) (i = 1,2, ... , n) which he called series lineaires des racines (1952
p. 43) has shown that the necessary and sufficient conditions for two or more roots
to coincide are that Hf(x*), the Hessian of f at x = x*, vanishes (1952 p. 43-44).
196
As JL varies, if the DS is stable for JL < JLo, unstable for JL > JLo, then at JLo, there
exists an exchange of stability and bifurcation is said to take place. For periodic
solutions of (10.1) where x(t) = x(t + T) of period T, Poincare has shown that if
two periodic solutions approach each other and merge, at some JLo, then there exists
an exchange of stability. In the case of complex roots, he noted that when a pair of
eigenvalues )"(JL) crosses the imaginary axis at JL = JLo, i.e. Re )..(JLo) = 0 then there
is an exchange of stability. These findings, amazing for the 1880's, have inspired
Andronov et al. in the 1930's, Hopf in the 1940's and a large number of researchers
in the last 20 years.
We shall briefly outline B.T. restricting ourselves to the simplest codimension
one B.T. where only one parameter JL E R is involved, in the one, two and higher
dimensional DS. Consider System (10.1). If II: == [*]= D",f(x,JL) is nonsingular,
then by the 1FT, x* = x*(JL) is an equilibrium of (10.1) which varies continuously
with JL, i.e.
f[x*(JL),JL] = 0 (10.2)
and we have, by the Linearization theory (see Ch. 7),
(10.3)
ImA 1m A
--.-~~~--------ReA --------~--------ReA
o
l[x*(JL),JLl = O. (10.5)
Thus the origin where I(x, JL) = 0 = Ix i.e. (x, JL) = (0,0) is the bifurcation point
where the stable (8) and unstable (u) arms coalesce, resulting in an exchange of
stability (see fig. 10.2). This is called Saddle Node Bifurcation.
The critical point is x* = (0, JL) obtained by solving I = O. The solution consists
of two branches xi = 0 i.e. the horizontal axis and x 2 = JL, the 45°-line (see fig.
10.2) '/(x,JL) = 0 and A = Ix = JL - 2x = 0 imply that the origin (x,JL) = (0,0)
is the bifurcation point where 2 branches intersect and exchange stability xi = 0
for all JL with Al = JL - 2x = JL - 0 is stable on the left and unstable on the right,
of the vertical axis and xi = JL branch with A2 (JL) = JL - 2x = -JL (for x = JL) is
unstable on the left and stable on the right of the vertical axis. This is a transcritical
bifurcation.
198
The critical points are x = (0, ±,fii) and stability depends on Ix i.e.
This is the same as the supercritical pitchfork with _x3 instead of +x 3• The result
is the fork turns into the opposite direction and is unstable (see fig. 10.2).
x x
(8)
(8)
. 11
...
--------~-------Il
-" -(u)
.
•• (u)
x x
(u)
'-,
9
(u)'
Proof. This is a simplified version of a well known bifurcation theorem whose proof
can be found, at various levels of difficulty, in Marsden (1978), Abraham and Mars-
den (1978), Arnold (1972), Crandall and Rabinowitz (1971) or Nirenberg (1974).
The Theorem is easy to understand and apply:
(i) is the equilibrium condition, whether f is a vector field, or a first order condition
of some maximization or minimization problem, i.e. f = 8F(u, x)8x = 0 for some
potential function F. Note that (i) implies fjJ(O, J-L) = 0 = fjJjJ(O, J-L) for all J-L.
(ii) indicates the failure of the 1FT i.e. fx(O,O) = >"(J-L) = 0 and
(iii) means that the eigenvalue must cross the origin at nonzero speed i.e. fXjJ =
::k = d~~) #- O. It is easy to verify that all these conditions are satisfied by the
transcritical x = J-LX - x2 and pitchfork x = J-LX ± x3 bifurcations, but not for the
saddle node x = J-L - x2 (where fXjJ = 0) which does not have two crossing curves.
satisfying (i), being of rank 1 I(O,p.) == 0 for all p., satisfying (ii)
Ix(O, 0, O)Xl = [~ ~] [ ~ ] = [ ~ ]
1;(0, 0, 0)Y2 = [~ ~] [ ~ ] = [ ~]
(i.e. Xl = ker Ix, Yi = Range Ix, X = Xl EB X 2 , Y = Yi EB 12 where I: is the
conjugate transpose of Ix)'
8 21 )
I/1x ( == 8p.8x =
[0-1 0]
0
(x, y)
Hopf Bifurcation arises when one pair of complex eigenvalues crosses the imag-
inary axis at nonzero speed. Suppose initially all eigenvalues of the linearized part
A(p.) = DxI(x,p.) of·x = I(x,p.) : Rn+1 --+ ~ lie in the open half left plane
and as p. varies, one and only one pair A(P.) = a(p.) ± if3(p.) (dropping subscripts
to alleviate notation) crosses the imaginary axis at p. = p'o, at nonzero speed, i.e.
a(p.o) = 0 :I da(p.o)/dp. and f3 :I 0, then near P.o, the equilibrium bifurcates into a
201
Limit cycle. By convention, the crossing is from the left i.e. da(p,o)Jdp, > 0 (See fig.
10.1.)
Intuitively, this can be seen by writing this subsystem in Jordan normal form
(see Ch. 5, eqs 5.19 and 5.20) as
X= J(p,)x (10.8)
or in full
where /3 = Jdet J(p'o) and a, b E R, involving third order derivatives in the Taylor
expansion. In polar coordinates, this is
whose Taylor expansion about p, = P,o (= 0 for convenience, i.e. a(O) = 0 # /3(0))
gives
where 0:'(0) == dlJt:=o.Similarly /3'(0). Neglecting higher order terms in (10.12) and
writing a(O) = a, /3(0) = /3, b(O) = b, /3'(0) == c and a'(O) == d for simplicity, (10.12)
is
r = dp,r + ar3
iJ = /3 + cp, + br2. (10.13)
The behaviour of the system could be seen by examining the values of rand () in
(10.13) for which r = 0 (i.e. the closed orbit) but B # 0 (i.e. the orbit is alive).
Thus, for the periodic orbit of (10.13), for -00 < ~ < 0 and p, sufficiently small,
we have
(10.14)
and it is easy to see that this periodic orbit is stable (attracting to itself neighbouring
curves from inside and outside) for a < 0 and unstable (repelling) for a > 0, whether
202
the crossing of eigenvalues is from the left, as is usually assumed, i.e. 0'(0) == d> 0
or from the right i.e. d < O. In applied work, therefore, it is sufficient to find d and
a. \Vhile d is very easy to find (d == da/dl-' at 1-'0 = 0), the computation of a could
be messy. We give here Guckenheimer and Holmes' (1986 pp. 154-156) calculations
which we find simpler than most, as follows.
1[1 1 2 2] 1[1(1 1)
a == 16 Illl + 1122 + 1112 + 1222 + 16;1 112 III + 122
- IMlfl + li2) - Itdfl + li2li2] . . . (10.15)
where s~erscripts indicate functions and subscripts, derivative orders, for example
ft12 == Jt'Jl etc.
~
Two cases can be distinguished:
Case 1. The crossing is from the left i.e. d > 0
(i) If a < 0 (d> 0); the origin is a stable focus for I-' < 0 but as I-' > 0, this fixed
point becomes unstable and a stable orbit is born, attracting to itself neighbouring
curves both on the inside and outside: this is a supercritical case. (See fig. 10.4 (i).)
(ii) If a > 0 (d > 0), the origin is a fixed point which is unstable for it > 0, stable
for 1-',0, grows into an unstable periodic orbit as I-' grows more and more negative.
(See fig. 10.4 (ii)). This is the subcritical case.
Case 2. \\Then the crossing is from the right, i.e. d < 0,
(i) If a < 0 (d < 0), the origin is a fixed point which is asymptotically stable for
I-' < 0, unstable for I-' > 0 with an asymptotically stable orbit for I-' < O. This is
exactly fig. 10.4 (i) turned around by 180°.
(ii) Case a > 0 (d < 0) is exactly fig. 10.4 (ii) turned around by 180°.
Figure 10.4. Hopf Bifurcation (i) Supercritical (a < 0) (ii) Subcritical (a> 0)
The above analysis could be summarized as follows.
203
1]j == -:r
1 1211"
J. 0
fx 1" ...1"(0, /-lo)( cpl )1/;2 dt
(IO.IOa)
204
Theorem 10.4. If O} = 0 and c is the smallest integer such that Oe i- 0, then (i)
for c even such that aO e(7 < 0 where (7 = ±1, the point (0,1'0) is a bifurcation point
of system (10.1); (ii) for c odd, (0,1'0) is a bifurcation point of (10.1).
Proof. (Tan & Tu 1992). These theorems allow results to be computed explicitly,
as will be illustrated by the dynamic Demand-Supply model in the next section.
Note that our theorems do not need the conditions a'(l'o) = d i- 0 of eigenvalues
crossing the imaginary axis at nonzero speed.
Note that although in high dimensional D.S., the Hopf bifurcation involves heavy
algebra (pages and pages of calculations in Marsden and McCracken 1976 where it
is claimed to take a month to check), in two and three dimensions (n = 2,3), it
is fortunately fairly simple. In the plane (n = 2) for example C(A) = det[(A(I') -
A(I')I] = 0 = A2 - TA + 8 = 0 where T(I') = tr A(I')" 8(1') = det A(I'), the Hopf
bifurcation conditions are T(I'O) = 0 < 8(1'0) where 8(1'0) = A}A2 = A}>:} = f32 > 0
and dT(l'o) / dl' > o. It suffices to find 1'0 fulfilling these conditions.
In three dimensional DS (n = 3) i.e. x E R3, I' E R, the characteristic equation
IS
(10.16)
where c} = -tr A(I') = - 'E~ Ai; C2 = sum of principal minors of A(I') of order 2;
C3 = -detA(I') = -A}A2A3. Hopf conditions are satisfied if Ci > 0 (i = 1,2,3) and
C}C2 = C3. The first conditions, Ci > 0, are just the Routh Hurwitz conditions for
all eigenvalues of A(I') to be in the open left half plane and the second property
C}C2 = C3, together with Ci > 0 are sufficient conditions for the existence of one
negative real eigenvalue and a pair of pure imaginary eigenvalues (see Gantmacher
1954, esp. p. 197). For example A3 + A2 + 2A + 2 = 0 '* A = (-1, ±IAi) and
A3 + 2A2 + 3A + 6 = 0 '* A = (-2, ±1.732i). In practice, to establish the existence
of Hopf bifurcation, it is sufficient to find a value 1'0 fulfilling these conditions.
Thus at f-lo = ,8Lr/(Sy - Iy), if Iy - Sy > 0 and Ir - Sr > 0 then Hopf Bifurcation
takes place.
Advertising models have been more and more rigourously investigated since the
1950's. We shall only discuss Feichtinger's (1992) work in view of our interest in the
application of Hopf bifurcation. Inspired by Baily's (1957) Mathematical theory of
epidemics, advertising is likened to spreading germs. Potential buyers (Xt) catch
these germs through advertisement and contact with brand name users (X2 ):
[u
v] = 1 -,1/J]
[-' 1 [ uv ] + g(u, v) ( -1, ) (10.18)
-1
A(,o) = [ 1
-1j;]
1
( ; ) = [~ -Ow] [ ~ ] + [ ~~:: ~~ ]
where h(x, y) == 2( -x+wy)x+x2 + (-x+wy)x 2 and k(x, y) = O. This is the normal
form of (10.10) where Guckenheimer's a in eq. (10.15), evaluated at x = 0 = y, is
Thus a < 0 and by Theorem 10.2, Hopf bifurcation gives rise to a stable periodic
orbit with 4 phases
(i) prosperity where both Xl and X2 increase
(ii) saturation where Xl declines but X2 still increases
(iii) downturn where both Xl and X2 decrease
(iv) recovery where Xl increases while X2 decreases.
(See fig. 10.5.)
a -1]
A(J.l) = [ J.l -J.lC (10.20)
with r(J.l) = a - J.lC and r(J.lo) = a - J.loC = 0 for J.lo = a/c, dr(J.lo)/dJ.l = -c # 0
and det A(J.lo) = (1 - ac)a/c. The conclusions are (i) if a < 0, r < 0: The model
is stable; (ii) If a > 0, r > 0 for "low" J.l, r < 0 for "high" J.l and r(J.lo) = 0 for
J.lo = a/e at which det A(J.lo) > 0, and 0 < a < l/c with dr(J.lo)/dJ.l # 0: bifurcation
takes place.
We shall now generalize this model slightly by writing D(p, q) - S(p, q) == E(p, q)
and C (p, q), all assumed to be C 3 , i.e. three times differentiable and apply our
approach. Putting x == (p, q), (10.12) and (10.13) become
and /1l"(0,J.l) = [0 0]
gl g2
Let 60 == 6(J.lo) == det /.,(0, J.lo) = J.lo(elg2 - e2gd where J.lo == -~, we have, at
bifurcation value J.lo,
which possesses a pair of pure imaginary simple eigenvalue ±i{3o where {30 == ~.
It is easy to see that A has an eigenvector vI corresponding to i{3o
208
and
-1.1 -_
'I' . t [ 0 ] -_ [
cos t [ e 2 ] - sm e2 cos t . ]
-el /30 el cos t - /30 sm t
-1.2
'I' = cos t [ /300] + sm
. t [ e2 ] =
el
[ e2 sin t .
/30 cos t - el sm t
]
./,1
'I'
. t [ /30 ] = [ el cos t
= cos t [ e 2 ] - sIn - /30 sin t ]
-el 0 e2 cos t
./,2
'I'
. t [ et ] = [ /30 cos t +
= cos t [ /300 ] + sm . ett sin t ]
e2 e2 sm
Tobin's (1965) model of the role of money on economic growth has been extended
by Benhabib and Miyao (1981) among other, to incorporate the role of expecta-
tions treated as a parameter which varies gradually from irrational through adaptive
expectations, to perfect foresight and ill so varying, causes a Hopf bifurcation. We
shall present Benhabib and Miyao's (1981) model to illustrate the way Hopf bifur-
cation arises in a three-sector economy. The model is
where
sf' - n -(I - s)n
A{J.t) == [ EmLl -Em
j.tCLl J.tE
whose characteristic equation gives
The analogy between discrete and continuous dynamical systems has been noted
in Chapters 5 and 6: the stability conditions for continuous systems are Re Ai < 0
for i, and for discrete systems, IAil < 1 for all i, which is best represented as a unit
circle. Some parallel cases are
1. The Fold of Saddle Node Bifurcation
2. Transcritical Bifurcation
4. Logistic System
This has very complicated bifurcation behaviour (see fig. 10.5d): it will be
discussed in the next section, on Chaos.
x x
------~o~\--------~
---
(a) Fold (SN) Bifurcation (b) Flip Bifurcation
(xn+! = J.l + Xn - x~) (Xn+l = -(1 + Il)x n + x!)
x
C.
/(
x
....
•• u
(8)
. •• (u)
.
------~~-------~ --~O~/~--~----~
I
or in polar coordinates
Proof. Arrowsmith and Place (1990 pp. 260-262) or Guckenheimer and Holmes
(1986 pp. 162-165).
(10.21 )
(10.22)
=
where I,.. : [0,1] -+ [0,1], 1 ~ /.l ~ 4, or defining /.l 40', (O',x) E [0,1]' I : [0,1]-+
°
° ° ° °
[0,1]. It is easy to see that (10.22) fulfills all 4 conditions above: 1,..(0) = 1,..(1) =
for all/.l, I' = at Xc = 1/2, f' > on [0,1/2]' f' < on (1/2, 1] and 1~{1/2) = for
all/.l i.e. I is unimodal with a maximum at Xc = 1/2. Furthermore, f" = -2/.l < 0,
and S(f) = -(3/2)(-2/{1- 2x)2 < 0: I,..{x) is concave with a single maximum at
Xc = 1/2. S(f) was discovered by Singer (1978) but it turned out to be the well
known Schwarzian derivative used in complex analysis for over a century now. If
S(f) < 0, 1f'1 has no positive local minimum. Note that I,..{x) : [0,1] -+ [0,1] is a
continuous mapping of a closed convex set into itself and hence, by Brouwer's Fixed
Point Theorem, there exists at least one fixed point x* = I(x*), where I intersects
the 45°-line. The trivial one is the origin, which would be the only one if II < 1
(for then I~{O) = /.l < 1, I,.. lies below the 45°-line everywhere). If /.l > 1, there is
another one, the interior or non-trivial one x* = 1 - l/p. (See fig. 10.7)
213
(10.23)
i.e. x* is an attractor iff 1 < Jl < 3, a repeller iff Jl > 3 with 1f'1 = 1 for Jl = 3 as
the boundary. At the origin, f'(0) = Jl and if Jl < 1, the origin is the only stable
point, an equilibrium of extinction, attracting to itself the whole interval [0,1]. If
f'(0) = Jl > 1, the origin becomes a repeller. For Jl = 2, 1~(1/2) = 0 : h(x)
intersects the 45°-line at its maximum and X* = 1/2 is the only attractor.
It can be proved that
(i) If S(J) < 0 then S(r) < 0 for all n ~ 1
(ii) If S(J) < 0 then 1f'1 has no positive local minima
(iii) If S(J) < 0 then for every periodic attracting orbit, there exists a critical point
of 1 or an end point [0,1] which is attracted by this orbit.
(iv) If all 4 unimodal mapping conditions are satisfied, then IjJ(x) has at most one
attractor in [0,1].
The second generation map is given by
by IL: increasing IL increases the maximum and reduces the minimum. Stability
depends on the slope dP / dXt i.e.
{1O.24}
Thus if f' < 1, 1(J'}21 < 1. For example if IL = 3, f' = -1, (J'}2 = {2 - IL}2 = 1
i.e. the slope of the first and second generation are of opposite signs: (J'}2 is tangent
to the 45°-line. For IL < 3, the hump is mild and f2 intersects the 45°-line only once,
but for IL > 3, it intersects 3 times giving 2 stable fixed points xi and xi where
1(J'}21 < 1 and one unstable xi (xi < xi < xi) at which (J'}2 > 1 (see fig. 10.7). As
IL changes and reaches some critical levels, pitchfork or period doubling bifurcations
occur in which a stable cycle of period n becomes unstable and a new stable cycle
of period 2n is generated as IL varies further. More precisely, as IL increases through
the range 3 < IL < 3.57, stable cycles of lengths 1,2,4,8,16 ... are generated. As IL
increases further, 3.57 < IL < 4, an infinite number of bifurcations arise, leading to
chaos (see fig. 10.8).
O / / /-----=-3--~::-----+~
3.5 4
/J
Thus, it can be seen from fig. 10.8, the passage to chaos is via a sequence of
period doubling at the points where the pitchfork bifurcations occur. This is a route
to chaos. An interesting aspect of period doubling is that the ratio of the spacings
between the consecutive values of Il giving rise to the bifurcation approaches a
universal constant 8, called Feigenbaum (1978) number where
Finally, note that one of the characteristics of chaos is its sensitivity to initial
conditions: two systems starting at two closely initial points can diverge widely
from each other after a time. This can be illustrated by fig. 10.9 where Il = 3.94 in
all 3 cases but Xo = 0.98, 0.99 and 0.995 respectively. The trajectories are widely
different after about 21 periods.
215
w~m
0.0 0 5 10 15 20 n SO 15 40 45 10 1
~Mtj
o.o~o~5~'~0~U~20~25-S~0~S~I"'40"""45""'10 ,
',O~
0.'
0.6
O.C
0.2
o.o~o-5~10""""''''''''''20-2''''...
SO,........U-4....
0 .....
4 ....
' ....
50 ,
Theorem 10.6. (Li & Yorke 1975). Let J be an interval and let I : J -.. J be
continuous. Assume there exists a E J such that b = I(a), c = p(a) and d = p(a)
where d ~ a < b < c (or d ~ a > b > c), then
(i) For all k = 1,2, ... , there exists a periodic point in J having period k.
(ii) There exists an uncountable set S 01 J containing no periodic points, such that
lor all p, q E S, p", q,
(a) limn --+oo inf IJ"(P)IJ"(P) - J"(q)1 = 0 < liIDn--+oo sup IJ"(P) - J"(q)1
(b) lor every pES and periodic point q E J, limn --+ oo IJ"(P) - J"(q) > O.
The Theorem says, grosso modo, that if Xt+! = f,..(xt) rises (falls) continuously
for the first 2 periods then falls below (rises above) its original level in the third
period, then
(i) there exists a cycle of period k,
(ii) there exists an uncountable set S of initial points between a and b such that if
Xo E S, the two paths move close to each other at first then diverge after some time.
Moreover, no such paths will converge to any path originating outside S. Thus if f
has a periodic point of period 3, then f is chaotic.
Diamond (1976) shows that Li-Yorke's Theorem carries over to the n-dimensional
case. Marotto (1978) extended Li-Yorke's work to provide another route to chaos:
the snap-back repeller. A fixed point x is a repeller if all eigenvalues of D f(x)
exceed 1 in norm for all x E Br(x) where Br(x) is an n-dimensional ball of radius
r and centre x. A fixed point x is called a snap-back repeller if there is some point
Xo E Br(x), Xo =I x, and an integer M such that fM (xo) = z and ID fM (xo)1 =I O. In
non-technical terms, a fixed point x is a snap-back repeller ifit repels a neighbouring
point Xo at first, then eventually (after M periods), attracts it to itselflike a magnet,
i.e. fM (xo) = x, while the Jacobian of fM (xo) remains nonsingular.
Theorem 10.7. (Marotto 1978). Snapback repellers imply chaos in Rn. More
precisely, in the vein of Li and Yorke's theorem, suppose x is a snap- back repeller
for f,.. (x), then
(i) there is an integer N such that for all k > N, there exists a periodic point having
period k;
(ii) there exists an uncountable set S of Br(x) satisfying (ii) of Li- Yorke's Theorem.
Chaos can also arise in n-dimensional continuous dynamic systems where n 2:: 3
(for n = 2, the most complex dynamic is the Limit Cycle). Chaotic dynamical
systems are highly sensitive to initial conditions and characterized by the presence
of a strange attractor, as follows
Theorem 10.8. (Ruelle 1979). Given:i; = f(x, It) : Rn+! -t Rn, a bounded set
A of Rn is a strange attractor if
217
(ii) There exists an open neighbourhood U of A such that all points Xl E U tend to
A as t --t 00,
This theorem means, roughly, that starting from a point near A, the dynamic
path is attracted to the strange attractor set A and another path starting nearly,
follows a widely different trajectory after a few periods. The presence of a strange
attractor denotes the presence of chaos. An application will be given in Chapter 10.
Finally, it may be appropriate to mention that chaos is only part of the behaviour
of complex systems. Current research (for example Kauffman 1991) is now being
carried out on antichaos, especially in Biology where order is seen to emerge behind
chaos in cellular differentiation during entogeny: systems are poised between order
and chaos. Research in this area is still at an early stage.
A few main routes to chaos are period doubling, intermittency, horseshoes and
homoclinic orbits. We shall briefly discuss these here.
We have seen, in the discussion of the logistic map (eq. 10.13) that at the
critical value J-lc = 3, fHx) = 1: the system loses stability through flip bifurcation
and period doubling and is on its way to chaos. For an in-depth analysis of this, see
Arrowsmith and Place (1990 p. 226-234) or Guckenheimer and Holmes (1986 pp.
157-160, 346-349). Intermittency, referring to infrequent and irregular variations, is
another route to chaos. With reference to the logistic map (eq. 10.13),1/0;:::: 3.8284
gives rise to a 3-cycle. A further increase of J-l causes subsequent flip bifurcations
leading to period 3n -cycles: for J-l > J-lo, j3(x) == f(f(f(x))) has one stable and one
unstable fixed point which coalesce for J-l = J-lo and completely disappear for J-l < J-lo
(see fig. 10.10). The system then moves inside a channel and after a few regular
iterations, leaves it in an irregular fashion, in search of a new attractor. This is the
onset of chaos.
218
"--------%,
P < Po
Figure 10.10. Intermittency
Take a square. Compress it horizontally (by a factor a < 1), stretch it vertically
(by a factor f3 > 1) and foldl it back in the form of a horseshoe, and we shall
have a horseshoe map. This was designed by Smale (1963, 1967) to study complex
dynamical systems. We cannot go into it in any detail, for space limitations (see
any textbook on chaos), rather let us note the important role it plays in generating
chaos, its presence behind transversal homoclinic orbits. A homoclinic point (x) is a
point at which the stable W 6 (x) and unstable WU(x) manifolds intersect each other
transversally (see fig. 10.11). Smale and Birkhoff's (see for example Guckenheimer
and Holmes Ch. 4 or Arrowsmith and Place p. 165) important theorem, which
was used in the development of Melkinov's theory, proves essentially that if one
homo clinic fixed point x exists, then there are infinitely many other homoclinic
points x since if x lies on W8(X), so do all its iterates, i.e., since WU(x) and W8(X)
are invariant, {pk(x)}f:_oo C WU(x) and {pk(X)}f:_oo C W·(x). One particular
case is the homo clinic or saddle connection where W·(x) = WU(x) (see fig. 10.12).
Intuitively, we can see that, like centres, homoclinic connections are structurally
unstable: a slight perturbation, whether causing a displacement of x to say x or
not, would break up the saddle loop causing a portion of WU(x) to lie either outside
or inside the stable manifold W8(X) (see fig. 10.12(b) and (c)). This is a route to
chaos: stretching and foldings of manifolds are indications of chaotic dynamics. To
illustrate this, let us examine two examples.
:i;= Hy = 2y
iJ = -Hz: + y(h - H)
where H = x 3 - 3x + y2, with a damping which drives it to the energy level H = h.
°
The level curves of H are shown in fig. 10.12. Clearly:i; = = iJ ~ (x, y) = (±1,0)
and H(1,0) = -2 is at a minimum and H(-1,0) has a saddle point at (-1,0).
The minimum is an attractor when h $ -2 and a repellor when -2 < h. When
H(-1,0) = h = 2 there is a saddle connection as in fig. 1O.12a.
search of some other attractor (see fig 10.13). This is called Blue Sky Catastrophe by
Abrahams and Marsden (1978) to refer to the sudden disappearance of an attractor
into blue sky.
Homoclinic orbits may also arise in higher dimensions. Some of these are illus-
trated in the 3-dimensional DS in fig. 10.14, for system x = f(x,l') : R3 X R -+ R3
with a homoclinic orbit connecting the origin to itself for I' = 0, but breaking up
under parameter change from J.Lo = 0 to J.L > 0 or J.L < 0, and Al, A < 0 < A3, all
three eigenvalues real. If Al, A2 = (} ± ij3, (} < 0 < A3 the orbit is homoclinic to a
saddle focus (see fig. 10.15).
p.>.
p.-.
p. ••
Proof. Silkinov (1970), Guckenheimer and Holmes (1986 p. 319), Wiggins (1990
p. 602).
In applied work, LCE and attractors' dimension are important for diagnosing the
presence of sensitivity to initial conditions and hence of chaos. LCE is the Floquet
theory discussed in Chapter 9. It will be recalled that if Ai is an eigenvalue of D f( x)
and cI>t = eD/Ix)! and mi = e>.·t and LCE is
Ai = lim
t-oo
~t In Ie>.·tl
= lim
t-oo
~t Re(Ai)t = Re(Ai).
Thus LCE = Re(Ai): it indicates the rate of contraction (Ai < 0) or expansion
(Ai> 0) near X. Since LCE are defined in the limit as t -+ 00, any finite transients
may be neglected and LCE of Xo and x are identical: the basis of attraction has the
same LCE as the attractor. Note also, as in Chapter 9, one LCE, say At, is always
equal to 0 and hence its corresponding ml = 1. The theory of Liapunov exponents
has been further developed and applied to stochastic trajectories, Kolmogorov en-
tropy but we cannot discuss these here. In addition to the LCE, the dimension
of attractors is also important, fractal dimension, capacity dimension, information
dimension, correlation dimension, Liapunov dimension. We shall not go into these
here. Interested readers should consult Wiggins (1988), Orcut (1993), Medio (1992)
or Lorenz (1993). Finally it should be mentioned that complexity, the "science" on
the edge of chaos, is currently an active research area (see Kauffman 1992, Waldrop
(1992), Lewin (1992) among others). It is still in its infancy.
222
There have been large numbers of economic applications of chaos, for example
Benhabib and Day (1981, 1982), Benhabib and Nishimura (1985), Grandmont (1985,
1986) Day (1982, 1983), Stutzer (1980), Deneckere and Pelican (1986), Boldrin and
Montrocchio (1982), to name a few among deterministic models. We shall discuss
only one or two, just to illustrate the role of chaos in economic analysis.
unstable for 4 < a < 5.i5; it becomes chaotic at a c = rd. l For a = 5.75, see
fig. 10.18.
1.1
I.'
1.4
1.2
I.II-¥----,-+-~-+_...,;.-H__r+_t-_r__;
!
•
1.1 1.2 1.4 I .• :1.1
'" i~ ;u, i!
e.•
e.
x = am/p
y = (1 - a)m/q.
The dependence of these demand functions on past choices is modeled by
The role of Inventory Cycles in economic fluctuations was first noted by Metzler
(1941) whose pioneering work has been further developed by many, among which
Gandolfo (1983), Lorenz (1993) and Medio (1993). We shall outline the last three's
approach to show the possible emergence of bifurcation and chaos. The model, with
individual variations and emphasis, has essentially the following features.
(i) Y = 1"(bye_B) = 1"[G 1(D)Y -B] where G1(D)Y == (a2D2+alD+1)Y; D ==-it
operator (see Ch. 2, section 2.3) output (Y) increases in response to the discrepancy
between desired output (ye) and existing stock (B), 1" > 0, b> 0;
(ii) 13 = S(Y) - I(Y) = F(Y) == mY(l - Y) i.e. stock (B) increases when savings
S(Y) exceeds investment I(Y), the difference function F(Y) is assumed unimodal,
m > 0 and Y E [0,1], i.e. F(O) = 0 = F(l);
225
An early economic application of chaos theory was made by Day (1982) who
showed that in the presence of pollution, irregular growth cycles can arise. His
model is as follows
(i) (1 + n)kt+l = sf(kt ) = sBkf(m - kt}'Y where kt = capital per head of population
growing at rate nj s, m are positive constant, 0 < s < 1 being the savings propensity
and (m - k)'Y accounts for the effects of pollution on f(k), B is a constant which
accounts for technical progress. Thus f(k)t = Bkf(m - kt)'Y is the production
function with f(O) = 0 = f(k m ) where k m = m i.e. if pollution m reaches some
saturation level, production is completely choked off (when kt -+ m). This is a
variety of the logistic growth curve, since (i) gives, on simple rearrangement,
(ii) kt+l = w(kt,p,) == p,kf(m - kt)'Y where p, == l":n. (3
If = 1 = '"( = m, (ii) is
kt+l = p,kt (1- kt) and it can be seen that Li-Yorke's theorem applies. For small p,
and ko, growth will exhibit a monotonic convergence to a stable equilibrium. For p,
in the range
irregular cycles occur and chaos emerges. It can be shown that chaos will persist
for quite a range of B and hence of J.l.
C.T. was introduced by Thom (1972) and popularized by Zeeman (1976) and
others. It studies the graph M of all critical points (c.p.) of a real valued function
!(x,J.l) : Rn X Rr -+ R (or !Jl(x) : Rn -+ R for fixed J.l) where M is normally an
r-manifold in Rn+r, involving n equations in n variables and r parameters. It anal-
yses not only nondegenerate c.p. x*(J.l) of a regular maximum or minimum, where
variations of x* in response to J.l changes are predictable, but especially degenerate
c.p. where the failure of the Implicit Function Theorem invalidates predictions of a
Comparative Statics type. In the first case, the projection 1f of M into II-space Rr,
1f : M -+ Rr : [x* (J.l), J.ll -+ J.l covers the parameter space with one sheet, and in the
latter, 1f is singular (i.e. rank aJ.l/ax < r), the number of equilibria changes and a
portion of the parameter space is covered by several equilibrium sheets correspond-
ing to multiple extrema. At the points where one equilibrium bifurcates into several
(or where several equilibria coalesce into one), there is an abrupt, discontinuous
change in the state variables (x) concerned. This is the best known aspect of C.T.
C.T. is much too technical to be treated as part of a chapter. We are forced,
by severe space limitations, to by-pass the important technical details such as the
the germs, jets, Jacobian ideals, determinacy of functions and Mather's Theorems,
227
In other words, two functions are equivalent if one could be obtained from the other
by a smooth change of coordinates. For example xy '" x 2 - y2, obtained by defining
228
where A == f"(xo} = Hf(O} = Hessian of f(x) of rank n. This is just like eq. 4.3G,
for the linear case, treated in Ch. 4 above.
g(xd ± L2 Xf. In general, if H/(xo) is of corank r, (i.e. of rank n - r), there are
some smooth coordinate changes such that near Xo (set at the origin for simplicity),
1 takes the form.
n
I(Xb X2, ... ,Xn) ~ g(Xb ... ,Xr ) ± LX; (10.28)
r+l
i.e. 1 is split into 2 pieces, the "good" Morse piece L~+1 xf which is well behaved and
the "bad" or degenerate piece g(XI, ... ,xr ) which gives trouble. The Morse piece
may be neglected (i.e. X r+1, . •. ,Xn can be treated as "inessential variables" which
could be left out) and effort is concentrated on the degenerate piece g(XI' ... ,Xr ), a
function of r "essential variables". To see this, let r = 1, the critical points obtained
by grad/(x) = 0 = g'(XI) + Li=22x; = (g'(XI),O,O, ... ,0), i.e. if (XI,X2, ... ,Xn) is a
critical point, set at the origin, of I(x) then (Xl, X2, ... ,Xn) = (X?, 0, 0, ... ,0) where
x? is a critical point of I(x) = g(xI) + L xf at the origin. Hence, only g(xd need
be considered.
This is an extremely useful Lemma since it implies that the behaviour of I(x) :
Rn -+ R (n very large) of thousands of variables near a degenerate c.p. can be
analyzed by studying only a much smaller {usually one or two) number of "essential
variables" which is equal to the corank of 1(x) or of H/(xo) : the remaining variables,
being in the good Morse piece, can be ignored. This "reduction of dimension" is
extremely important in applied work. Compare this with Thompson and Hunt's
method discussed in Chapter 9.
We are finally coming to the Classification problem. It will be recalled that given
a function f E En,
(i) If df(O) =I 0, then f(x) is right equivalent to Xl, by the Implicit Function Theorem
(1FT);
(ii) If df(O) = 0 =I det[8 2f(O)/(8x;8xj))' then by the Morse Lemma, f '" L\ ±xr
(iii) If df(O) = 0 = det[82 f(0)/(8x j 8xj)] of corank r < n, then by the Splitting
Lemma, f '" g(XI, ... ,xr) + L~+l ±xr.
(ii) L\ ±x;
(iii) g(XI) + L~ ±xr where g(xt} = xt xi,xf or xy
(iv) g(XI' X2) + L3 ±x; where g(XI' X2) = XIX2 ± x~ or XIX2 + x~.
Proof. (Thorn 1972 or Trotman & Zeeman 1976 or Brocker & Lander
1975). Note that the first two cases of Thorn's Theorem, being non-degenerate and
well behaved, we only need to focus on the last two, with corank r = 1 (case (iii))
and r = 2 (case iv) with cod (I) ~ 4. It is well known that r(r + 1)/2 ~ cod (I).
C.T. deals with cod (I) ~ 4. This implies that no matter how many variables
f has, so long as its codimension is ~ 4, we can always find a coordinate system
such that no more than 2 essential variables are involved. In other words, Thorn's
Classification Theorem says that if f is degenerate and with cod (I) ~ 4, then
there is a diffeomorphism h of the neighbourhood of the origin with h(O) = 0 such
that fOh-I(XI, ... ,Xn) =g(Xt}+L~ ±x; (case (iii)) or foh- l =g(XI,X2)+L3 ±x;
(case iv) and all these singularities can be classified into 7 types of elementary
catastrophes with cod (I) ~ 4 whose miniversal unfolding, writing Xl == X and
X2 == Y and parameters /-L as /-L = /-LI, /-L2, /-L3, /-L4, are given in Thorn's list in Table
10.1. (Note that subsequently, 4 additional types have been found making for the
11 elementary catastrophes, cod (I) ~ 5, but there is no need to enter this here).
We shall limit our discussion to the first two types because of their wide
applicability, and their simplicity, but also because the remaining types are simply
combinations of these two. A catastrophe "organizes" lower orders: a cusp organizes
2 folds, a swallowtail organizes 2 cusps, a butterfly organizes 2 swallowtails etc ...
Its critical points, for the primal fold xl + /-lX, are given by F;(x) == 3x2 + /-l = 0,
i.e. x· = ±J-/-l/3 which are defined only for /-l < O. F;(x) = 6x ~ 0 for ~ 0
i.e. on the negative portion of the /-l-axis, F is a minimum for X > 0, a maximum for
X < O. The maximum and minimum coalesce into an inflexion point at the origin
where (x, /-l) = (0,0) and disappears for /-l > 0 (see fig. 10.20). The catastrophe set
consists of one point: the origin. For the dual fold _xl + /-lx, the axes are reversed,
i.e. F' = -3x2 + /-l = 0; F" - 6x ~ 0 for X ~ 0 (see fig. 10.20). Thus, in the dual
233
case, for example a decrease in J.l causes the equilibrium x to decrease until J.l = 0
when it disappears altogether. An economic example of this is the shutdown of a
firm, which will be discussed later.
A Cusp Catastrophe is a function of the form ±x! + 'E 2 ±x~ whose unfolding,
(see Thorn's list in Table 10.1), neglecting the Morse piece and writing Xl as X, gives
(10.31 )
The critical points obtained by F;(x) = 4x 3 + J.lI + 2J.l2X = 0 (for the primal Cusp)
form the equilibrium manifold M : J.lI = -4x 3- 2J.l2X which, for the various values of
J.l2, are shown in fig. 10.21(a). Stacking these curves together gives the equilibrium
manifold M : F;(x) = 0 in fig. 1O.21(b) whose projection on the parameter space
J.l E R2 gives the cusp in fig. (d) and on the X - J.l2 plane (fig. c), gives a supercritical
pitchfork bifurcation. Fig. (b) shown that M has one equilibrium sheet where F' =
o ¥ F" and 3 equilibrium sheets, two minima (attractors) separated by a repeller
maximum sheet in the middle, when F' = 0 = F". The parameter space (fig. d) is
covered by one equilibrium sheet almost everywhere i.e. everywhere except the area
under the degenerate critical points within the cuspidal curve "y : 27 J.lr + 8J.l2 = 0
obtained by solving F' = 0 = F" (i.e. F" = 0 ~ J.l2 = -6x2, substituting into
F' = 0 gives J.lI = -4x3 - 2J.l2X = -4x3 + 12x3 = 8x 3. Thus J.lr = (8x 3)2 = 64x 6 =
64(-J.l2/6)3 = -8J.lV27 giving "y as 27J.lr + 8J.l2 = 0). This triple equilibrium zone
can be seen by partially unfolding x4 by F(x, 0, J.l2) = x4+J.l2X2 (fig. (e) which shows
one critical point for J.l2 > 0, 3 for J.l2 < 0 which all fuse together at J.l2 = 0, and
give the triple equilibrium zone in fig. (f) or (d). "y is also referred to as Bifurcation
set or Catastrophic set defined as the set of values of J.l such that F~ = 0 = F:
giving "y : 27J.lr + 8J.l~ = O. This indicates the appearance or disappearance of
the attracting equilibrium sheet. In fig. (b), for example, starting from (d), as J.lI
increases, the equilibrium path moves to (e) where M folds over, F" = 0 and the
minimum disappears. The path cannot follow the middle repelling sheet, and must
drop to (f) on the lower attractor sheet. This explains how an infinitesimal increase
in some parameter J.lI could bring the system to the edge and cause a catastrophic
drop of X to the lower sheet. Once there, if J.lI subsides, the equilibrium path moves
back to (g) where again the minimum disappears and x jumps to (d), the attracting
sheet, by-passing the repelling middle sheet. This shows how a gradual change in
J.lI only causes gradual changes in some cases (J.l2 > 0) such as a - b - c path and
abrupt changes in x such as path d - e - f - 9 - d in some other cases (J.l2 < 0).
234
(c) (b)
(e) (d)
Finally, note that the above discussion refers to the primal Cusp for definiteness.
For the dual Cusp, _X4 + JlIX + Jl2X2, the same analysis applied with axes reversed.
For example.
)11
235
Assuming unchanged technical production and cost conditions, i.e. constant a and
/3, the only relevant parameter is the market price p which goes up and down in
response to demand and supply conditions at the industry level, over which the firm
has no control. Thus f is our potential function which has been brought to the
canonical form of a Fold Catastrophe whose critical points are given by
f'(x) = -3x 2 + p - a =0
< 0 (i.e. x > 0) for a maximum
f"(x) = -6x { > 0 (i.e. x < 0) for a minimum
= 0 (at x = 0) for a degenerate c.p.
Now 3x 2 = p- a is a parabola with (p- a) as axis of symmetry (see fig. 10.22). The
critical output is x = ±J(p - a)/3 which has no real root for p - a < 0, 2 roots
corresponding to a maximum and minimum for p - a> o. The profit maximization
path follows the maximum branch as p varies. A price fall brings about a decrease
in p - a and hence in output x. The firm will eventually incur losses but would
continue producing until the maximum completely disappears, at the origin where
the maximum and minimum coalesce, with p - a = 0 and x = 0 and disappear for
a slightest further drop in prices. Thus, profit maximization conditions are simply
x* = J(p - a)/3 and p - a> 0 i.e. MC = M Rand p > AVC(y). This can be seen
by writing AVC(y) = y2+a = (p-a)/3+a, and p > AlI"C(y) ~ P > (p+2a)/3 ~
236
f(x.p-a)
p-a
Finally, note that although the above problem is formulated as a static optimiza-
tion problem as is customary in the literature, it can be made explicitly dynamic by
spelling out the gradient function as
i.e. output moves in the direction of profit maximization. The analysis remains
unaffected.
Kaldor's (1940) Trade Cycle model has been re-examined and formulated as a
Limit Cycle by Chang and Smyth (1971) and as a Catastrophe model by Varian
(1979), George (1981) and Tu (1981). A potential function V(x) can be defined
such that
x = -dV/dx = k[I(x,a) - S(x,b)]
where x is GNP and I(x, a), S(x, b) are the aggregate Investment and Saving func-
tions respectively, I, S E C 2 , a and b are shift parameters, k is a positive constant
speed of response (k = 1 for simplicity). Kaldor's I(x, a) is a member of the one-
parameter sigmoid function (for example I(x,a) = a + tanhax where a is a shift
parameter, causing I(x, a) to move up and down, to reflect Keynes' volatile busi-
ness expectations): it is flatter at low and high incomes and steeper in the medium
income range. S(x, b) is steeper at low and high income levels and flatter in the
medium income range for example S(x, b) = (x - b)3 i.e. 0 < aI/ax < as/ax at low
and high x, 0 < as/ax> aI/ax at medium income range (see fig. 10.23).
237
s s
S
B 1 BJIj
,,~
/~
,(t, C B y-I
r
7.' C ~
L.
.~?
I
~ '.!:.,
"/ ---A C ;'
_A .,...,'
.I"J A
0 x 0 x 0 x 0 x
coalesce and disappear, leaving the equilibrium path under the influence of a single
attractor point B. At B, investment saturation will cause a to fall, eventually,
causing I(x, a) to shift downward, C to emerge and BC to coalesce into an unstable
inflexion and abruptly fall to A. Note that a change in b causing the saving function
S(x, b) to move leftward or rightward, could corroborate these effects.
In terms of Catastrophe Theory, a degeneracy occurs when a stable (I - S =
o < S' - I') and unstable (I - S = 0 < I' - S/) equilibrium points coalesce,
i.e. V' = 0 = V" which is the case when I and S are both equal (i.e. V' = 0 = 1- S)
and tangent (i.e. V" = 0 = I' - S/) to each other, an abrupt change in x will be
observed. Furthermore, it can be seen from fig. 10.23 that if b is high, i.e. society's
savings propensity is low, a rise in business expectation a, causing an upward shift
in I will simply result in a gradual increase in income x : A and C come closer to
each other but do not fuse (see path c.d.e): no catastrophic changes are recorded.
But if b is low, i.e. savings propensity is high, a change in business expections a will
easily cause a coalescence of A and C or Band C and a consequent catastrophic
change in x, as illustrated by the equilibrium path 1-2-3-4-5-6 in fig. 10.24.
Varian (1979) and George (1981) introduced the wealth (w) effect into Kaldor's
238
Thorn's list in Table 10.1: g(x) = -"'44 + e;x2 + J.lIX whose equilibrium manifold,
obtained by g'(x) = 0 is
_x3 + J.l2X + J.lI = 0
which reflects the hypothesis as can be seen in fig. 10.26(a). Stacked together, these
curves form a cusp catastrophe whose bifurcation set or cuspidal curve -y, satisfying
g'(x) = 0 = g"(x), is obtained by a projection of the equlibrium manifold on the
parameter space, is 27J.l~ = 8J.l~ (see fig. 10.26(b)).
x
~2 <0 ~2 =0 #-12> 0
(low cost) (high cost)
III
This model explains and predicts defence expenditure: at low costs (J.l2 < 0),
defence spending is a smooth increasing function of threat: it moves, say, on curve
a - b - c in fig. 10.26, but at high costs (J.l2 > 0), opinion is split and defence
budget may follow path d - e - f - 9 where at the edge e, it suddenly jumps from its
lower to its upper equilibrium sheet, at f: defence is suddenly stepped up at high
240
costs and threat. When threat subsides to some critic.allevel g, at the edge of the
fold, defence is suddenly dismantled, military bases are closed and defence budget
is abruptly cut to its bare minimum at d. Note the hysteresis phenomenon inherent
in every cusp catastrophe: the jump from the lower to the upper equilibrium levels
at e and from the upper to the lower level at g, occur at two different levels of
threat (ILl). This explains the irregular (non-smooth) movement in defence budget:
it changes smoothly in some cases and abruptly in some others. Isnard and Zeeman
also developed this model into a butterfly catastrophe with 2 additional cusps where
the two additional unfolding parameters are identified as bias (IL3), biasing opinion
into stepping up or winding down defence spending and butterfly (IL4) splitting one
cusp into three.
Analysing the data on aggregate investment places in \Vest Germany for the
1956-1978 period, Mensch et al. (1980) found clear evidence of the thresholds of a
typical cusp catastrophe. They identified two types of investment: the traditional
expansionary (e) (wooden ploughs accumulated over wooden ploughs) and rational-
ized revolutionary (r) investment such as automation. Their various assumptions,
based on their observations, their statistical analysis and their various coordinate
changes led to a standard cusp catastrophe whose equilibrium manifold is given in
fig. 10.27. It summarizes all their findings: with low r, output x increases only
slowly and smoothly 'with traditional investment but at high levels of innovation
(r), output takes off into its higher equiligrium level. This model inspired Balasko
and Boyer (1981) to formulate a cusp catastrophe model of technical progress and
employment, using the same concepts of x, e and r but arriving at the cusp model
by different routes.
x x
s s
---"o+----JJ ----ri----T
s s
But will it buckle upward or downward? Similarly, an increase in tax (r) past some
critical level rc (set at 0) could cause an increase or decrease in equilibrium national
income. Mathematically, this is :i; = J.lX - x3 where x is the displacement of the
column and J.l an axial load in the Euler problem, and x is the displacement of
equilibrium income and J.l = r = tax parameter. The C.S. problem is to predict the
direction of the equilibrium path when J.l increases past J.lc (located at the origin),
i.e. ox* /OJ.l ~ O. Without further information, C.s. predictions are impossible. This
242
is where unfolding comes to the rescue. We know from the above analysis that a
miniversal unfolding of 9 = X4 (x = f = g' = 4x 3 ) requires 2 parameters, say J.l and
J.Lo i.e. _x 3 + J.LX + J.Lo where J.Lo is a "side load" , an "imperfection parameter" in the
Euler buckling problem, and J.Lo is a measure of income distribution in the economic
problem. With such a miniversal unfolding, the C.S. predictive power is restored:
the direction of the buckling depends on the side load J.Lo (see fig. 10.29).
x x
o/·c
-----+-----p
In the economic problem, an increase in tax rate past the critical level Tc in a country
where income is very unequally distributed, with the majority of people being poor,
and having to work harder to make ends meet, will lead to an increase in income. In
the opposite case where income distribution is more equal, most people are well-off,
the disincentive effect of taxes may cause equilibrium national income to fall.
This is the miniversal unfolding problem. In the Cusp Catastrophe, for example,
it is easy to see from fig. 10.21 that at J.Ll = 0 = J.L2, it is impossible to predict
the equilibrium path of x resulting from a decrease in J.L2 (from J.L2 = 0 to J.L2 < 0
while keeping J.Ll = 0) : the equilibrium path can either end up in the upper or
lower equilibrium sheet. But with a slightest change in J.Ll (which is Euler's "side
load" or the income distribution index in the economic problem), the equilibrium
path becomes completely predictable: from J.L2 = 0 = J.Ll, a decrease in J.L2 past zero
will cause the equilibrium path of the Cusp Catastrophe of fig. 10.21 to land on the
upper equilibrium sheet if J.Ll < 0 and on the lower sheet if J.Ll > o.
Thus we can see that the supercritical pitchfork bifurcation x = J.lX - x 3 (see
fig. 10.2(c)) is structurally unstable in that a small perturbation will give rise to
a topologically different equilibrium set: it has only one parameter J.L where two
are needed. This brings together Bifurcation and Catastrophe theories: the stable
cusp catastrophe is the unfolding of the pitchfork: it stabilizes it by adding another
parameter to meet the requirement of a miniversal unfolding of a degeneracy of
codimension 2 (see section 10.4.3).
243
This chapter brings B.T., Chaos and C.T. together under the more general head-
ing of Singularity Theory. It is long, although it did no more than scratching the
surface of the problem. It is hoped, however, that it has provided a plain, nontech-
nical, introductory, but nevertheless fairly comprehensive discussion of the various
aspects of these theories and issues involved. For example, much controversy has
been aroused by C.T. especially in many applications where there is hardly any
mathematics involved. The presentation of C.T. in this chapter clearly shows that
it is indeed a respectable mathematical theory, which many users may not fully grasp
in their applications. On the other hand, some critics seem to overlook the impor-
tant fact that there is no need to know exactly the form of the underlying potential
function. With the use of the various coordinate changes, the Splitting Lemma,
we only need to identify those variables in the degenerate set, its co dimension and
miniversal unfolding with the unfolding parameters and apply Thorn's theorem to
obtain useful results which are not available by using other conventional tools of
analysis. Similarly, B.T. has drawn our attention to the possibility of optimal eco-
nomic fluctuations which were considered a contradiction in terms until recently.
Furthermore, it has been shown that, contrary to the popular belief, exogenous
stochastic shocks are by no means necessary for the emergence of Business Cycles: a
completely deterministic economic system, not subjected to any exogenous shocks,
can display chaos and fluctuations. These and many other applications show the
usefulness of a thorough understanding of the mechanism of B.T., Chaos and C.T.
Chapter 11
11.1. Introduction
x.*( t ) = 8H
8p (* * * )
x,p,u,t (11.3)
P.*( t ) = - 8H
8x (*
x, p *, u * , t ) (11.4)
S[x(T), T] == S[xo, 0] + 10
rt dtd S[x(t), t] dt. (11. 7)
Setting S[xo,O] = 0 and to = 0 for simplicity, and substituting it into (11.2) gives
the augmented functional J a (u)
where
(11.12)
The first term on the RHS of (11.12) is zero, being the Euler equation, (see eq. 8.7
and 8.8 of Ch. 8), i.e.
If both the initial state x(O) and time to are also arbitrary and unspecified, (11.16)
becomes
(Sx - p)ox It=T
t=O + [ H () t=to = o.
t + St ] Ot It=T (11.17)
These are called the Transversality Conditions (c) of Theorem 11.1. Thus (11.13),
(11.14) and (11.15) are exactly Pontryagin's Theorem 11.1: (11.13) is the Euler
equations, (11.14) is (b) for the interior extremum case, and (11.17) covers all the
cases of (c).
The transversality conditions of equation (11.16), for given Xo and to, are summa-
rized in the following Table 11.1 to provide a handy reference for the determination
of the 2n boundary constants for the n-state vector x and n-costate vector p.
248
Table 11.1.
Transversality Conditions (S., = p(T))8x(T) + [H(T) + St]8T = 0, eq (11.16)
Case Substitution Boundary Condition Determination of
in eq. (11.6) equations Constants
A. Fixed Terminal Time T (8T = 0)
1. Fixed x(T) XT= =
8x(T) 0 x(O) = =
Xu; x(T) XT 2n equations to
8T=0 (No restrictions on p(T)) determine 2n constants
2. Free x(T) 8x(T) '" 0 x(O) Xu= 2n equations to
i.e. 8x(T) ::F 0 8T=0 p(T) = S., determine 2n constants
B. Free Tenrunal Time T (8T '" 0)
4. Fixed x(T) = XT 8x(T) = 0 x(O) = Xu 2n + 1 equations to
8T '" 0 x(T) = XT determine 2n constants
H(T) + St = 0 at t =T and terminal time T
5. Free x(T) 8x(T) '" 0 x(O) = Xu 2n + 1 equations to
8T '" 0 p(T) = S.,[x(T),T) determine 2n constants
H(T) + St = 0 at t =T andT
The first variations giving the necessary conditions are provided in Theorem
11.1 the derivation of which has just been presented. We shall now comment on the
meaning of these conditions one by one and their uses in practice.
1. x,p, E Rn are called state and co-state variables vectors respectively, and
U E U, a control vector.
3. Condition (b) of Theorem 11.1 covers all possible cases of interior as well as
boundary extrema, of linear and nonlinear control. In the nonlinear interior
optimization case, Hu = 0 gives optimal u*. In the linear case, extremum
occurs at the boundary when U is bounded. Writing H(x,p, u, t) = 'I/J(x,p, t) +
O'(x,p, t)u where O'(x,p, t)u groups all the terms of H linear in U and 'I/J(x,p)
the remaining terms, we can see that aH/au = O'(x,p,t) and' if 0' > 0« 0),
H is linearly increasing (decreasing) in u and hence the choice of U max = b
(Umin = a) provides the highest H. When O'(t, x,p) changes sign, the choice
of optimal U switches from u = a to u = b or vice versa: thus O'(x,p, t) is
called the switching function and the linear control is called bang bang. (See
fig. 11.1). When 0' == 0 for some non-zero time interval, sgn 0' provides no
help: this is called Singular Control. The optimal control u* is then found by
249
dk
dt k u(x,p, t) =0 (k = 0,1,2, ... ) (11.18)
H H
a a
u~
I ~.
u
b r--___ u=b
a u=a
•••••••••••••••• !-o_ _ _ _ _-!
L -_ _ _ _ ~ _____ L-_~ _____t
5. Once u* has been found and substituted into (11.3) and (11.4) we have a HDS
discussed in Ch. 8 with one exception: it is an optimal HDS. It requires 2n
boundary conditions to determine 2n constants of integration. These are the
transversality conditions required in Theorem 11.1 (c) and shown in (11.17).
Various combinations are possible. For example if Xo and T are fixed, (11.17)
gives p = S" which, together with xo, gives 2n conditions. When T = 00, this
is limHoo p(t)x(t) = 0 (See Arrow & Kurz 1970).
250
Xl = X2(t), XI(O) = 0
X2 = u(t), X2(0) = 0
This gives X2(t) = -3t2 + 6t, XI(t) = -t3 + 3t2, u(t) = -P2(t) = 6(1- t).
Case (ii):
ih = -H~l = 0 =? PI(t) = CI
'h = -H~l = 0 = -PI(t) = -CI =? P2(t) = -CIt + C2
H(x,p, u) being linear in u, and lui ~ 1, the switching function is aCt) == P2(t) =
-cIt + C2 which cannot vanish identically in any nonzero time interval, for this
would imply CI = 0 = C2 i.e. PI = 0 = P2 and H = -1 which would contradict the
transversality requirement that H(T) = 0, T being unspecified, and H(x,p, u) being
autonomous, H = 0 'Vt E [0,1]. Singular control, for which a(t) == 0 for some tEl
(J nonzero interval), is thus ruled out leaving bang bang control, where u = +1 or
-1, as the only possibilities. \Vith u = ±1, we have
l.e.
1
Xl(t) = 2x~ + C5 for u= 1 and
Zl
By the switching theorem (see Tu 1984 p. 197) there is at most one switching
from u = 1 to u = -1 as (T(t) == P2(t) changes sign. It can be seen that if x(O) is on
the curve AOD (u = 1 on OA, u = -1 on OD), no switching takes place. Starting
from anywhere else not on AOD, the system needs one s\\itching to go to the origin.
As a physical example, we can think of x(t) as the distance at t of a spacecraft
from the origin, x as its speed and !i = u as its acceleration, treated as a control
variable to drive the spacecraft home as quickly as possible. As an economic example,
x(t) may represent foreign indebtedness, x, debt accumulation or discharge rate and
!i = u = speeding up or slowing down of this rate, the capacity of this acceleration
or deceleration being bounded by the country's international credibility, economic
viability and political feasibility, i.e. lui ~ 1. The ultimate destination is obviously
Xl (T) = 0 = x2(T) i.e. x(T) = 0 = x(T) where the country is free from all
indebtedness.
The second variations can be studied by examining the total variations of Ja(u)
in (11.8) around u* :
(11.21)
It is easy to see that the matrix in (11.22) must be negative semidefinite for a
maximum and positive semidefinite for a minimum. Note also that Huu is also
negative semidefinite for a maximum and positive semidefinite for a minimum, which
is the Legendre-Clebsch condition. (Fbr further details and proofs, see I'll 1984
p. 136).
In the particular case in which lo(x, u, t) and I(x, u, t) in (11.1) .and (11.2) and
hence H(x,p, u) == 100 + p. 1(·) are concave (convex) in X and u for given p, the
Necessary conditions (11.13), (11.14) and (11.15) of Theorem 11.1 are also sufficient
conditions. This assurance is very useful in practice since it will dispense us of the
search for sufficient conditions. Fbrtunately (for economists), most applied Eco-
nomics problems fall into this category. In Optimal economic growth, for example,
100 is the utility function which is concave by nature, 10 = !p(k(t)) - Ak(t) where
!p(k) is the production function, concave by nature and Ak(t) is a linear function
253
(A = constant) and thus f(-} is concave in k(t) and hence H(k,p, u) == lo{-} + pl(·)
is concave.
The proof, provided by Seierstad and Sydsaeter (1977), consists of a string of
inequalities
= l[(H*-H-P(x*-X)]dt
= l[(H* - H + p(x* - x)] dt
Example 11.3. Consider the optimal economic growth model examined in Exam-
ple 8.2 where H == u(c) + q[J(k) - Ak - c] and H* == u* + q(f* - Ak* - CO) where
u* == u(c*), f* == f(k*). Application of (11.23) gives
~J4 == l [u* + q(f* - Ak* - c*) - u - q(f - Ak - c) - q(f' - A)(k* - k)] elt
Thus necessary condi tions also ensure sufficient condi tions since H (.) is concave in
c and k.
1 r
'21h (T),Sh(T) + '210
T
J = (h'Qh + u'Ru) dt (11.26)
for the Linear Tracking problem (see Athans and Falb 1966, or Tu 1984), where CJ
and R are positive definite matrices, h(t) == x(t)-x(t) = deviation of x(t) ERn from
some desired level x(t). A translation of x to the origin, x = 0, \vould reduce the
tracking to the Regulator problem, hence we shall only discuss the Linear Regulator.
The Hamiltonian His
x= Hp = Ax + Bu* = Ax - BR-1B'p
-Hz = P = -Qx - A'p with p(T) = Sx(T) by (11.16) i.e.
(11.29)
whose solution, taking the Transversality condition p(T) = Sx(T) into account, (for
details, see Tu 1984 ch. 8) is
p(t) = K(t)x(t) (11.30)
where K(t) is part of the inverse of the transition matrix of (11.29). Substitution
into (11.28) gives the feedback control law
where - R- 1 B' K is often referred to as the Kalman matrix. Substitution into (11.24)
gives
x = (A - BR- 1B' K)x. (11.32)
Differentiating p = Kx and substituting from (11.32) yield
Since (11.35) holds for all arbitrary choice of xo, and K(t) does not depend upon
xo, the matrix in (11.35) must vanish i.e. K must satisfy the Riccati equation
K = -KA-A'K +KBR-1B'K-Q (11.36)
255
K(T) = S. (11.37)
[ Hxx Hxu] _
Hux Huu -
[Q0 0] .
R (11.38)
Theorem 11.2. Given system {11.24} and the functional {11.25} where u(t) is un-
bounded, T specified and S, Q are positive semi-definite, R positive definite, there
exists a unique optimal feedback control u* = _R- 1B' K x(t) where K(t) is the unique
solution of the Riccati equation {11.36} satisfying the boundary conditions {11.37}.
k2 - 2rak - rq = 0
whose solution is
Given a fixed stock (8) of some non-renewable resource such as oil and gas,
assumed known with certainty, the intergenerational distribution problem is what
quantity q(t) of 8 should be consumed by the current generation and how much
should be saved for the future, knowing that future generations having no voice,
no vote, have to depend on us for their fair share. Given the usual assumption of
concave increasing utility u( q) where u" (q) < 0 < u' (q), the objective is to maximize
loT u( q)e- ot dt
subject to the isoperimetric constraint
loT q(t) dt = 8.
Defining x(t) == 8 - fci q(r) dr as the remaining stock at time t, with x(O) = 8 and
x(T) = 0, we have
± = -q(t)
ot
H = u(q)e- - p(t)q(t) where fJ is the usual rate of future discount. Theorem 11.1
gives
Hq= e-otu'(q) - p(t) = 0
p = -Hz = 0 ::} p(t) = p, some constant.
This implies p(t) = P = e-otu'(q) or u'(q) = peot i.e. marginal utility u'(q) increases
at an exponential rate. In view of the law of diminishing marginal utility u" < 0 < u'
with liIDq--+o u'(q) = 00, this means the optimal quantity q*(t) of non-renewable
resource consumed at t must decrease exponentially over time. In other words, if
fJ > 0, current generations should consume more than future generations. If fJ = 0,
p = u'(q) i.e. marginal utility is constant: all generations receive equal treatment:
each should consume the same quantity q.
It is reasonable to expect that f(O) = 0 = f(K), f(x) > 0 \::Ix (0, K) and f(x) <
o \::Ix > K, and also f"(x) < 0 i.e. f(x) is concave with some saturation level K
257
above which overcrowding will cause a negative growth rate. The logistic curve is
an illustration of this. The objective is to maximize the profit functional of a typical
fisherman who considers fish as common property, i.e. subject to :i; = f(x) - h
MaxJ = 1000
e- 6t [p - c(x)]h(t) dt
where 8 is the usual future discount rate, p = fish price, c(x) = unit cost of harvest-
ing, d(x) < o.
The Hamiltonian is
H = e- 6t [p - c(x)]h(t) + A(t)[f(x) - h(t)]
== a(t)h(t) + e- 6t A(t)f(x)
where a(t) == e- 6t [p - c(x) - A(t)], the switching function. Theorem 11.1 gives
:i; = f(x) = h*(t) = H>.
.x = e- 6t hd(x) - Af'(x) = -Hz
where h* = hmax if a(t) > 0, h* = 0 if a(t) < 0 and if a(t) == 0 for some nonzero
time interval, singular control takes place. In this case
a(t) == 0 ~ A(t) = p - c(x)
a(t) == 0 = _e- 6t [(p - c)8 + d(x):i;] = .x = O.
Substituting:i; and .x in the above, and dividing by e-6t (p - c) gives the well known
equilibrium relation in Capital theory (see Clark 1976)
J'(x)- df(x) =8
p - c(x)
which says that the marginal productivity f'(x) of the fish population net of the
stock effect df(x)/(P - c(x)) must be equal to the social discount rate 8. If cost
is stock invariant, i.e. d(x) = 0, this gives the well known rule that the marginal
product of capital is equal to the rate of future discount which is brought into
equality with the rate of interest in equilibrium. If a(t) > 0, u*(t) = U max and if
a(t) < 0, u* = Umin = O. This is the bang bang control solution.
J = -11
2 0
00
(qy2 + rl)dt
where I(t) = total investment demand, composed of private induced investment vY
(v = constant accelerator) and government investment G. S = total saving, Y =
GNP, y(t) == Y(t) - Y*, g(t) == G(t) - G* where Y*, G* are optimal full employment
GNP and government expenditure, a == -8/(1 - v), b == 1/(1 - v), q, r = positive
constant weights.
The Hamiltonian is H = t(qy2 + rg2) + p(ay + bg), Hg = 0 =} 9 = -(~)p =
-~(ky + v) where limt-+oo k(t) = Te, some constant. Equation (11.36) gives
or
k = (r/b 2)(a + ja 2 + qb 2/r).
This is almost exactly like Example 11.4. The final solution is if = ay + bg* =
(a - b2k/r)y == my where m == a - b2k/r = -(8 + q/r)1/2/(1 - V)2 < 0, and
y*(t) = yoe mt , i.e. Y(t) --+ y* as t --+ 00 as required. For further details, see Tu
(1984 ch. 8).
Ak
f (k)
~--~~~_________________k
: f(k)-Ak
~--~--~--------~~-----k
o k*
Figure 11.3. Phase plane of the OEG path
The solution is summarized in fig. 11.3 in the c - k space where it is easy to see
that c ; 0 if f'(k) ; A + 8 and k ; 0 if c ~ f(k) - H and that equilibrium (c*, k*)
< < < >
is a saddle point since the Jacobian J(c*, k*) = (~1 ~) where 0: == c.!;:~.) < 0
with eigenvalues !(8 ± .../8 2 - 40:) both real and of opposite signs.
In the interesting particular case of linear utility functions u( c) = c, H = e- c5t { c+
q[J(k) - H - cn is linear in the control c and 0 ~ c ~ Crnax where Crnax = f(k*),
the switching function u(t) == e- c5t (1 - q) gives the solution of the bang bang type:
c* = 0 if q > 1, c* = f(k*) if q < 1 and the singular type if u(t) == 0 i.e. q(t) == 1 for
some nonzero time interval and c* E (O,j(k*)). The Hamiltonian flow is
k=f(k)-H-c
q = -[f'(k) - (A + 8)]q
which gives the saddle point equilibrium at (c*, q*) = [f(k*) - H*, 1] (see fig. 11.4).
It is easy to see that q = 0 separates the region offalling q, to the left of k*, from the
region of rising q, to the right of k*. Similarly, the line q = 1 delineates the region
of falling k, below it from the region of rising k, above it (see fig. 11.4). Again,
the equilibrium (k*, 1) is a saddle point: paths on the stable arms approach (k*, 1)
asymptotically, any other paths not starting on the stable arms diverge from (k*, 1)
as can be seen in fig. 11.4.
It should be noted that the above analysis of the neoclassical growth model,
formulated as an optimal control problem for the first time by Cass (1965), has
been exhaustively developed and extensively applied to many areas of economic
theory.
Ak
260
f(k)
f(k)-Ak
k
k* k:
L
q
~
k=D ~
1
~
~
~~-----------L-- ________ ~k
o k* k
Figure 11.4. Bang bang-singular solution of OEG model
The stability of ODS is the stability of the HDS which was examined in some
details in Ch. 8 and also in Tu (1991 Ch. 12). Since stability varies from one problem
to another, we shall confine our analysis to the Optimal Economic Growth (OEG)
problem encountered in Ch. 8.
It will be recalled that application of Pontryagin's Maximum Principle to the
OEG problem leads to the following HDS
k= Hq
q= -Hk + tSq(t) (11.39)
where H :: e- ot { u(c) + q[f(k) - '\k(t) - c(t)}.
Linearizing about the equilibrium (k*, q*) at which Hq = 0 = -Hk + tSq(t) and
setting it at the origin for simplicity, leads to :i; = Mox in (8.26), i.e. in full, with
X:: (k,q),
(11.40)
where
and
A
Mo
B]
= [AC -A' ,Mo
[AC -A'
B] = Mo ·A
(11.41)
A ::A-~In' A:: Hqk , B::Hqq, C::-Hkk
261
Proof. Lancaster (1991) first shows the equivalence of >.J - Mo and 1$ K(>') by
Schur decomposition as follows
where
K(>.) == (>.J + A')B- 1 (>'I - A) - C. (11.43)
Being equivalent, (>.J - Mo) and 1$ K(>') have the same eigenvalues with the same
multiplicity structure. The eigenvalues of K(>.) are the zeros of det K(>'). Now
define
L(>') == (5.1 - iA)* B- 1 (>.J - iA) + C (11.44)
where * denotes conjugate transposition, i.e. the eigenparameter of L(>') is obtained
from that of K(>.) (and hence of Mo) by rotation through a right angle. Furthermore,
for>. E R, L(>'*) = L(>') and B > 0, C > 0 imply L(>.) > 0 for all >. E R
and all matrices A. Hence L(>') has no real eigenvalues, in other words, Mo has
no pure imaginary eigenvalues. Furthermore, the eigenvalues of Mo and hence of
K(>.) and L(>.) have the double symmetry discussed in Theorem 8.6. Hence Mo has
SPP. (QED)
The next question is whether with the introduction of a positive discount rate
8, considered as a perturbation, this SPP will still be maintained? This has been
answered by Kurz (1968) in Theorem 8.8: it will, so long as the real parts of the
eigenvalues of M6 do not lie within the distance 8/2 from the imaginary axis. We
shall only add one latest result by Lancaster.
(11.45)
has no eigenvalues within the distance fJ /2 of the real axis. A straightforward cal-
culation with the discriminant functional for £(A) gives the results. (QED)
Note that Lancaster's condition fJ2 < 4')' in (11.45) is less restrictive than the
Rockafellar's curvature condition" fJ2 < 4 aj3 where a and j3 are respectively the
minimum eigenvalues of C and B, i.e. fJ2 < 4aj3 ~ 4')'. For example, if B =
(~ 1~2) and C = (1&2 ~) , then aj3 = 1/4 < 1 = ')'. Will the system :i; =
M6X be globally asymptotically stable (GAS)? We shall first introduce Lancaster's
Lemma, before proving GAS (Global asymptotic stability).
Q=
-
[B
§.I
~CIn ] (11.46)
2 n
then fJ2 < 4')' iff Q is positive definite (written as Q > 0).
Proof.
i.e. Q and I EB (C - ~ B- 1 ) are congruent and thus if either one is positive definite,
so is the other, i.e. Q > 0 ¢} S > 0 where S == I EB (C - ~B-I) :
fJ2
S> 0 ¢} C - "4B-l >0
¢} Bl/2(C _ fJ2 B- l )B I / 2 > 0
4
¢} Bl/2CBI/2 _ fJ2 I> 0
4
¢} Am(BI/2CBI/2) > fJ2 (since Bl/2(BI/2CBI/2)BI/2 = BC)
4
Note that Brock and Sheinkman (1976 pp. 169-170) proved that fJ2 < 4aj3 => Q > O.
Lancaster's Lemma is stronger and proves also the converse.
We can now prove GAS.
Theorem 11.5. The Perturbed Hamiltonian Dynamic System (PHDS) :i; = Max
in (11.40) is GAS provided fJ2 < 4')'.
263
Proof. Following Brock (1977), let us choose the Liapunov function V == -ilk > O.
(For details on the positivity of V, see Brock (1977). Differentiation gives
V = -ij'k - i/k.
Substituting, from the OSS (11.39), k = Bq + Ak and ij = ck + (JI - A}q, gives ('
denotes transposition)
In the last section, and also in Chapter 8, we have discussed the SPP of optimal
HDS. We have shown that the conservative HDS have the double symmetry prop-
erties which the PHDS - typical in Economics - do not possess, and conditions
sufficient for PHDS to preserve SPP have been investigated. This might have left
us with the impression that a solution path which fails to converge to the optimal
stationary state (OSS) should be rejected as non-optimal. However, it has been
shown recently that an optimal path need not converge to OSS nor have the SPP: it
may bifurcate into periodic orbits and yet remains optimal in that it fulfills all the
optimality requirements including transversality conditions. More recently, another
type of stability loss causing chaotic dynamical paths in Optimal Economic Control
(OEC) models has also been investigated. Since these are advanced areas which,
furthermore, are not yet fully developed, we shall briefly discuss the Hopf bifurcation
in OEC models, first in the PHDS with two degrees of freedom (n = 2) which is the
simplest case in which Hopf bifurcation can arise, then in the multisectional models
(n > 2). Finally we shall briefly mention the possibility of chaotic solutions in OEe
models.
It will be recalled, from Chapter 9, that Hopf bifurcation occurs when a pair of
complex eigenvalues .\(6) = 0(6} ± ijJ(6} depending continuously on some parameter
6, crosses the imaginary axis, for some value 6*, at nonzero speed, i.e. 0(6*} = 0 ¥
80(6*}/86 and jJ # o. By convention, 80(6*}/86 > 0 i.e. the crossing is from the
264
left. Then for 8 < 8*, the origin (which is chosen as a critical point) is a stable focus
and for 8* < 8, it is an unstable focus surrounded by a stable limit cycle whose size
increases with 8.
Two-State-Variable Models.
Let us first consider a PHDS with two degrees offreedom investigated by Dockner
(1985) and discussed in Chapter 8, equation (8.29) to (8.31). The model has 2 state
variables (x, y) and two co-state variables (A, /l). Putting z == (x, y, A, /l), we have, as
in equations (8.29), (8.30) or (11.40) and (11.41), a PHDS depending continuously
on one parameter 8, the rate of future discount, as follows
(11.47)
(11.48)
8
A=-+/l (11.49)
2
where
8)2 K 1
V
/l2 = ( 2 - '2 ± 2 K2 - 4C4 (11.50)
k = Ak+Bq (11.52)
q = Ck - A'q
where A == A = ~ In; A, B, C are real (n x n) matrices and B, C are both symmetric
positive definite matrices, with notations as in (11.41) above.
Taking Laplace transforms (with zero initial conditions, see Chapter 4) gives
Mk+Gk-Nk = 0 (11.55)
L()..)v = 0 (11.56)
where ).., v are respectively eigenvalues and eigenvectors of the Lambda matrix L()..),
(See Lancaster 1966 for )..-matrices) in (11.56) where L()") == M)..2 + G)" - N as
defined in (11.54).
Premultiplying (11.56) by v (conjugate of v) gives
whose solution is
).. = _1 (-ig ± }_g2 + 4mn) (11.58)
2m
where m == v'Mv, n == v' Nv and ig == v'Gv. Further analysis (for details see Medio
1987b pp. 417-419) shows that the relevant discriminant (11.58) for stability study
IS
(11.59)
where nl == n - mJ2 / 4. Medio showed that the necessary and sufficient conditions
for system :i; = MdX to have local SPP is that ~(J) > o. If ~(J) < 0, no SPP exist
and if there exists some value J* such that ~W) = 0 and ~(J) ~ 0 if J ~ J* and
d~(J*)/8J < 0 (i.e. ~(J) is a decreasing function of J, vanishing at J = J*) then as
J is increased past J*, the system will undergo a bifurcation and lose its SPP. Two
subcases arise
(i) g(J) = 0 for J E N.(J*), some eigenvalue lying on the real axis, crosses the
imaginary axis from left to right, causing "total instability";
(ii) g(J*) i 0 : the loss of stability is of the "flutter" type: a pair of complex conju-
gate eigenvalues crosses the imaginary axis from the left, causing a Hopf bifurcation
and giving birth to closed orbits around the equilibrium point.
Note that this limit cycle is optimal in that it fulfills all optimality requirements,
including the transversality limHoo q(t)e-dlk(t) = 0 conditions. Thus an economy
267
satisfying all the standard neo-classical competitive conditions such as perfect fore-
sight, zero profit, market clearing, can exhibit permanent oscillations in prices and
capital stocks. This is the concept of optimal economic fluctuations.
Recent research interest in nonlinear dynamical systems has been shifted to HDS
and shown the possiblity of chaos in this area.
We have been introduced to chaos in Chapter 9 especially in discrete ODS.
In continuous ODS, the emergence of chaos - which necessitates a torus of at
least three dimensions - is due to the presence of strange attractors and thus,
the investigation of chaotic motion in continuous ODS amounts to establishing the
existence of a strange attractor. It will be recalled that an attractor is a closed
invariant set A which attracts all orbits with initial states in its neighbourhood U
i.e. the flow gtu ~ A as t ~ 00, and a strange (or chaotic) attractor is the one which
contains a transversal homoclinic orbit (see for example, Guckenheimer and Holmes
1983). Strange attractors are highly sensitive to initial conditions: two neighbouring
starting points can lead to exponentially divergent paths.
The existence of chaos in discrete OEC (Optimal economic control) models has
been investigated by Boldrin and Montrucchio (1986), Deneckere and Pelican (1986),
Montrucchio (1986) among others and in continuous OEC models by Benhabib and
Nishimura (1979), Benhabib and Day (1981) and Lorenz (1988). The lack of space
on the one hand, and the advanced nature of work in this area, on the other, do
not allow us to go into these here. Rather we shall show the possibility in ODS
with reference to Lorenz's (1988) work on a decentralized OEC model, because it
is carried out in the framework of continuous HDS, in the spirit of Benhabib and
Day (1981), Benhabib and Nishimura (1979) and Medio's (1987) models which were
analyzed in the last section, in connection with the Hopf bifurcation and the ensuing
optimal limit cycles.
Lorenz shows that if some agent h's(h = 1,2, ... ,r)H DS, having a periodic orbit
caused by Hopf bifurcation, is perturbed by the action of other agents, then chaos
can arise.
Starting with the conventional OEG model, we have
where H = max u H(x, A, u, t) where u is a control vector and (x, A) are n-state and
co-state vectors. Suppose (11.45) could be separated into r subsystems for each of
the 1" agents, we have for each h
Now if these are also functions of other agents' actions as well, (11.60) becomes
±h = jh(Xh,Ah,X, X) (11.62)
~h = gh(Xh,Ah,X,X)
h
were x- -= (x 1 , ... , x h-1 , x h+1 , ... , x r)., /,I =
- (\1 \h-1 , /I\h+1 , . . . , /I\r)'1.e. (-x, /I') are
/I , . . . , /I
(x, A) without elements (xh, Ah) i.e. (x, X) are the state and co-state vectors of other
agents.
The current valued Hamiltonian H(u, k, >.) above can be separated for each agent
h, into two parts
Hh(uh, kh, >.h) = H hO + Hh1
where Hho is the "unperturbed" Hamiltonian of household hand Hh1 _
H(u\ kh, >.\ ii, k, X) is the "perturbed" Hamiltonian reflecting the influence of other
sectors, as shown in (11.62). (Note that "perturbed" here does not refer to the per-
turbation caused by introducing the discount parameter 8 as before, but to the per-
turbation caused by the action of other agents). Assuming that the "unperturbed"
HDS experiences a Hopf bifurcation at 8 = 8* causing a periodic orbit, Lorenz shows
that, under the action of other agents causing the perturbed H h1 , application of the
Newhouse, Ruelle and Takens (1978) theorem points to the emergence of chaotic
motion in the HDS.
Although the investigation of chaos in OEC models is still at an exploratory
stage and has an ad hoc character, it leads to the important conclusion that chaos
in OEC models is a distinct possibility, in which case, it is impossible for agents to
calculate the results of their optimal programmes except in a very short run: strange
attractors cause optimal paths starting close to one another, to diverge widely from
one another after a few periods.
11.7. Conclusion
In this chapter, we have discussed ODS, optimal in that they are optimally con-
trolled in order to achieve some specific objectives given in the objective functionals.
We have shown how Pontryagin's Maximum Principle is derived from the Variational
Calculus and how solving an optimal Control problem amounts to solving the asso-
ciated HDS, referred to as ODS. The necessary and sufficient conditions have been
discussed as well as the stability of ODS, both the asymptotic and structural stabil-
ity. The concept of optimal economic fluctuations caused by Hopf bifurcation has
been analyzed as well as the emergence of chaotic motions which make any longrun
predictions and planning unreliable.
No doubt Optimal Control theory cannot be presented in one chapter: a book
would be required (see, for example Tu 1984, 1991). Omissions are inevitable. For
example, discrete ODS and their stability have not been discussed, although they
could be derived very simply (see Tu, 1984, 1991), and their stability could be shown
with reference to the "unit circle" , instead of the "open left half complex plane".
269
Nevertheless it can be seen that considerable grounds have been covered in one
chapter.
Chapter 12
Some Applications in Economics and Biology
12.1. Introduction
Dynamical Systems (DS) have been applied in almost every field. In this chapter,
we shall present some applications in Economics and Biology.
The selection is difficult: as a tool, DS has been widely used in so many areas,
and besides, the various applications have been presented in the various chapters
throughout the book, as illustrations of the various dynamical tools of analysis. In
Economics, we shall limit our presentation to some major areas such as Business
Cycles, General Equilibrium and Economic Growth and in Biology, to ecology since
population dynamics, arguably, is a field in which DS find a natural application.
To this, will also be added the dynamics of a heartbeat since this provides a good
example of modelling a complicated biological phenomenon, and also as an applica-
tion of Folds and Cusps, analyzed in Chapter 10, it is a respectable example which
stands up to the severest critics of Catastrophe Theory applications.
Clearly, in view of the space constraint, and a selective presentation, this chapter
does justice neither to the work reviewed, because of the various oversimplifications
required, nor to the work omitted which is even more numerous. The choice is made
on the basis of applicability of the various dynamical tools treated in this book,
rather than an extensive review of economic or biological theory in any area.
Economic fluctuations are natural candidates for D.S. applications, as can be seen
from the reviews undertaken by Zarnowitz (1985), Gabish & Lorenz (1987), Lorenz
(1993) among others. Although earlier economists have advanced various theories,
from money and banking (Hawtrey), innovation (Schumpeter), expectations (Pigou,
Bagehot) underconsumption (Hobson, Foster), overinvestment (Hayek, Mises), to
sunspot (Jevons, More) theories (for a detailed survey, see Haberler (1958)), it was
not until the late 1930's that Business Cycle Theories were rigorously formulated,
using difference and differential equations. The main ones are the linear multiplier-
accelerator models (Samuelson, Hicks), nonlinear models (Goodwin, Kaldor) optimal
fluctuations model (Medio) and Chaotic model (Grandmont). We shall briefly review
these, concentrating on the dynamic tools applied.
272
where ~ == Cl(k + 2)2 - 4c(k + 1). The results depend on sgn ~ i.e. ~ ~ 0 if
b ~ 4(1 + v)j(2 + v)2 : if 0 < c < l~v , the solution path is periodic convergent if
c = l~v the solution is a constant cycle if l~V < c < tJ~~)J , it is periodic divergent
and if tJ~~)l ~ c, it is monotonic divergent.
2. Nonlinear Models.
Under this heading, we shall examine Goodwin (1951) and Kaldor's (1940) model
as reformulated by Chang & Smyth (1971) and also Goodwin's (1967) class struggle
model.
Samuelson's model was highly successful at first but its limitations have been
quickly recognized: the economy cannot blindly follow the dictates of a mechanistic
multiplier accelerator: there is an upper limit imposed by full employment and a
lower limit set by the depreciation rate. Thus Hicks (1950) and Goodwin (1951)
quickly imposed a ceiling and a floor to investment k as follows
. {b0 > 0
K=
if K < K*
if K = K* (12.6)
-d if K > K*
1 . 1 .
Y = -(a+K) == -(a+K). (12.8)
1-c s
At the start of the cycle, let J{ < J{~, i.e. there is a shortage of capital. Investment
then proceeds at the maximum rate J{ = b while Y remains unchanged until J{ = 1<;
when J{ = J{* and hence J{* = J{o. Now J{ = J{i > J{* = J{o so J{* switches to
J{;. Thus J{* changes from 1<; to 1q
i{
B C
K
Iq K·0 K·2
K
o~~-----+--~~-------
0
A D
(a) (b)
y+A(y)y+B(y)=O (12.10)
where B(y) is an odd function with B(O) = 0, A(y) an even function with A'(O) <
o < A(O). This gives a unique limit cycle (see Ch. 7). If the autonomous investment
function is periodic, of period T, i.e. A(t + T) = A(t), instead of being a constant
A, the above becomes a forced oscillator of a Van der Pol type and can be shown
to possess two stable limit cycles containing an infinity of unstable limit cycles.
Lorenz (1987) has shown that this gives rise to a period-three cycle which, by Li
& Yorke's Theorem (see Ch. 10) implies chaos. As Lorenz pointed out, actually
by introducing lags, Goodwin (1947) has shown the emergence of chaos in Business
Cycles long before the name "chaos" was invented.
275
Other efforts at nonlinearity have been made by Kaldor (1940) who introduced
nonlinear investment (/) and Saving (S) functions: Whereas both are increasing
functions of income (Y), I is flatter than S at very low Y because of excess capacity
and at very high Y level because of saturation of investment opportunities, and
steeper than S at "normal" or medium income range where investors are more
sensitive to rising Y than savers are. Assuming these intersect at 3 points (see
fig. 7.17 in Ch. 7), business cycles are generated by an upward shift of I, causing a
stable and unstable equilibrium point to coalesce and end at the high stable point.
Kaldor left his insightful model thus loosely formulated at that. It was Chang
and Smyth (1971) who applied the rigorous dynamic tools to bring out the right
properties therein. Kaldor's model can thus be formulated more explicitly as a
planar D.S.
Chang & Smyth have detected that with these sigmoid I and S functions, all
Poincare-Bendixson's conditions are satisfied and hence a limit cycle emerges (see
Ch. 7). Furthermore, it could be shown, on time differentiation and grouping terms,
that
(12.12)
which is a typical Lienard function provided I and S behave as assumed (/y == aI laY
etc ... ), and hence this limit cycle is unique. Furthermore, it could also be modelled
as a Cusp Catastrophe, as has been shown by Varian (1979) and Tu (1982) (see
Ch. 10).
Another nonlinear model, away from the Keynesian stream of the previous mod-
els, is formulated by Goodwin (1967). Inspired by Lotka-Volterra model, Goodwin
visualized economic fluctuations as an outcome of the struggle between capitalists
and workers in order to secure a larger share of income. This is a system of two
nonlinear differential equation in x (employment rate) and y (worker's income share)
with 0 < x, y < 1, as follows
i; = x f(y)
iJ = y g(x) (12.13)
where f(y) = a - by and g(x) = ex - d (see Ch. 7 for more details). The result is a
harmonic motion causing x and y to fluctuate perpetually between a maximum and
minimum, averaging at the "coexistence" equilibrium which is never reached. This
276
model has been extensively studied and enriched by Vilupillai (1979), Wolfstetters
(1982), van der Ploeg (1983) and Flaschel (1984) among others (see Ch. 7).
k = Hq
q=-Hk+O(t) (12.14)
where H = u(c) + q(J(k) - Ak(t) - c(t)] is the current valued Hamiltonian function
(see Ch. 8 and 10) where f(k) = output, k(t) = capital, c(t) = consumption all per
capita and q(t) = costate of k(t) and 0 is the constant discount rate, considered as
a parameter whose variations cause a Hopf bifurcation. Medio (1987) has noticed
that such periodic motions generated by the stable limit cycle to which the Hopf
bifurcation gives rise, satisfy all Pontryagin's optimality conditions, including the
transversality requirements and hence are optimal.
Finally, business cycles could be chaotic: they can exhibit very complex dynam-
ics. Contrary to the claim (by Lucas, Prescott, Sargent and others. See Zarnowitz
1985 for details) that for economic fluctuations to emerge, stochastic shocks must
be present, Grandmont (1985) and others have shown that a totally deterministic
economy can give rise to fluctuations and chaos, and the complex DS thus obtained
exhibits all the properties of a stochastic model although there are no stochastic
elements in it. For a recent comprehensive review of complexities in deterministic
nonlinear Business Cycle models, see Lorenz (1993).
In view of Walras law that L:i" pjEj == 0, only m - 1 equations are independent,
hence putting Pi = 1 as a numeraire, we have
(12.16)
subject to
(12.17)
j j
where Xij (p) and Yij (p) are respectively, agent if s demand and supply of good j,
i.e. E j == L:i (Xij - Yij) total excess demand for good j in the market by all agents.
The main areas of investigation in G.E. are the existence, uniqueness and stabil-
ity of the equilibrium price vector. The existence theorem has been proved by use
of Brouwer and Kakutani's Fixed point Theorems, the uniqueness has been estab-
lished for some cases such as Gross substitutes, weak axiom of Revealed Preference.
We shall briefly examine the stability problem, both local and global, mainly in
the context of the Tatonnement model, and briefly mention the non-tatonnement
dynamics.
p = kE(p) (12.18)
Theorem 12.1. (Local Stability). G.E. is locally stable under any speed of ad-
justment if anyone of the following conditions holds
278
Theorem 12.2. (Global stability). G.E. is globally stable if anyone of the fol-
lowing conditions holds:
(i) there is no trade at equilibrium, in the pure exchange economy;
(ii) the Weak axiom of Revealed Preference hold (i.e. pOx l ::; pOxo implies pI xO >
pI Xl where xO, Xl are two distinct baskets and pO, pI their respective price vectors)
and Walras law (Lj pjEj = 0) holds;
(iii) A is quasi-negative definite everywhere;
(iv) A is quasi-dominant diagonal, with aii < 0 for all ij
(v) all goods are gross substitutes.
Proof. Most proof make use of the Liapunov function 2V(p) == (p - p*), I(p - p*)
(i.e. B = I : all weights are equal. See Theorem 5.5 above), which is the (equal)
distance function of p from its equilibrium level p*, and show that, under the above
cases, V < 0 i.e. W == A'B + BA = (A' + A) is negative definite (see eq. 5.27).
For details, see Hahn (1982), Negishi (1989) and also the references listed in section
5.8.3 in Ch. 5.
Non-Tatonnement Models
Naturally D.S. has been widely used in economic growth theories. We shall briefly
present their development and their use of more and more sophisticated dynamic
tools.
1. Harrod-Domar's Models.
2. Neo-Classical Models.
The Neo-Classical model was formulated by Solow (1956) and Swan (1956) who
recognized that it is the rigidity of technology which condemns the Harrod-Domar
280
economy to eternal instability: if capital and labour are substitutable to each other,
i.e. K / L == k is a variable, then a capital surplus would lead to capital deepening
and maintain full employment growth. Thus, K and L no longer have to grow at
the same rate. With labour growing at a constant rate n, production taking place
at constant returns to scale i.e. Y = F(K, L) = LF(K/ L, 1) == Lf(k), the Saving
(S)-investment (1) equality gives the fundamental neo-classical growth law
where f" < 0 < f' and 1'(00) = 0, 1'(0) = 00 (Inada's conditions) guarantee the
existence and uniqueness of equilibrium growth sf = (8 + n)k, as has been seen in
Ch.2.
The neo-classical growth model has been extended to two sectors, multisectors,
with and without money, as well as optimal growth.
The aggregate model above has been extended to two sectors, producing capital
goods Yl = it(kd and consumption goods Y2 = h(k2) by Uzawa (1961) who con-
cluded that, with the usual concavity assumption fI' < 0 < ff (i = 1,2), stability
requires the capital goods producing sector it (kl) to be less capital intensive than
the consumption goods sector h(k2), i.e. kl < k 2. Solow (1961) finds it paradoxical
that "such an important characteristic of the equilibrium paths should depend on
such a casual property of the technology" (Solow 1961 p. 48). Solow's remark has
led to further research work which introduced the elasticity of factor substitution
a == (dk/dw)w/k where w == w/r = wage/rental ratio, the separate saving rate of
capitalists Sr and workers sw. The model has six variables x == (Yl, Y2, k 1 ,k2,w,p)
where p == L1/ L = ratio of labour employed in the capital goods sector to total
labour force L. This leads to the fundamental growth equation
(12.23)
Theorem 12.4. Local stability and uniqueness obtain i.e. h'(k) < 0 Vk E (a, b) and
k* such that h(k*) = 0 is unique, under regular (non vanishing Jacobian) conditions
if anyone of the following conditions is satisfied
(i) Sr ~ Sw and kl ::; k2
(ii) a ~ 1
281
(iii) rT2 ~ 1
(iv) Sr = 1
(v) Sw = o.
Proof. Burmeister & Dobell (1970) and references given therein.
Money has been incorporated into Swan-Solow's model by Tobin (1965). Money
matters: equilibrium k* is influenced by the rate () == if 1M of increase in nominal
money supply (M).
Let x == mlp == (MIL)lp = per capita real money balances; p = output price
in terms of money as a numeraire; r(k,x) = f'(k) - 8 + pip where PiP = E(plp)
i.e. the expected rate of inflation is assumed equal to the actual rate, i.e. with the
constant depreciation rate 8,
x
:; =
(mm - p) =
p
if
M -
t p
L - p.
Substituting, and putting .x == () - 8 - n, gives
These two equations (12.25) and (12.26) form a planar DS whose linearization about
equilibrium point (k*, x*) gives
i = Ax + H.O.T. (12.27)
A -= [ifk if"'] .
1/;k 1/;",
Under the economic assumptions of the model, ifk < 0 < 1/;." if",,1/;k < 0, det A < 0
which gives a SP equilibrium (see fig. 12.2) with positive (x*, k*) In the absence of
money (12.26) becomes
A = [:: ~x]
x x=o
k=o
k
0
It is easy to see that 'ljJ(k,O) being independent of x, tr A = 'Pk + 'ljJx < 0 and
det A = 'ljJx'Pk > 0 and hence the model is stable.
This model has also been extended to incorporate inflation p/ p as a third equa-
tion by Benhabib and Miyao (1981) who have shown the emergence of Hopf bifur-
cation, as has been seen in Ch. 10.
The neo-classical growth model has also been optimized as one-sector, two-sector
and multisector models where Pontryagin's Maximum Principle is used to obtain
optimality. These have been discussed in detail in Chs. 8 and 11 where the emergence
of Hopf Bifurcation, limit Cycles and Chaos has been shown, and also above under
Optimal Business Cycles.
Finally, before concluding this section, a few words should be said about the
latest development: endogenous growth with increasing returns by Romer (1986,
1990) Rebelo (1988), Lucas (1988) among others.
With a few exceptions such as Ben Porath (1967) Shell (1966), Uzawa (1965),
Tu (1966, 1969, 1970) and Denison (1962) under Schultz's (1961) inspiration, tradi-
tional growth models overemphasized the role of physical capital (K) to the point
283
of neglecting human capital (He). This sin of omission is responsible for the failure
of Growth theory to account for many phenomena such as the high rate of returns
on capital in capital rich countries, which goes against the law of diminishing re-
turns. Human capital in the form of educational training and research, apart from
its cultural value, is a crucial factor of production and an engine of growth. The
"german economic miracle" in the postwar period, under the Marshall Plan, would
not have occurred without the HC stock there: the same foreign aid spent on Cam-
puchia would not produce any noticeable economic results because the modicum
of HC stock was virtually wiped out in the Killing field. Physical capital without
HC would never produce economic growth. But technical progress (A), the fruit of
education and research, is financed from resources within the economy: it does not
come free like manna from heaven. Thus, production exhibits increasing returns:
F()"K, )"L, )"A) > F()"K, )"L, A) = )"F(K, L, A) i.e. F is linearly homogeneous in K
and L. But under increasing returns, factors payment according to their marginal
product would bankrupt the economy. To rescue equilibrium, external economies
are brought in: technical progress is a form of external economies firms do not take
into account in their maximization decision although it benefits society as a whole.
Thus, there are constant returns at firms' level but increasing returns at industry and
society level, as has been taught by Marshall (1890) and Meade (1952) among oth-
ers. With this increasing returns feature, endogenous growth models are formulated
like others, and the usual dynamic tools are applied. For example, with an increas-
ing returns production function F(K, L, H, A) described above, a usual increasing
concave utility function U(C), Romer (1990) casts his model in the framework of
an Optimal Control model of
subject to
[(=y-C
A= aHaA
Ha+Hy = H
where Y = F(A, H, K, L) as above, a is average product H, Ha, Hy are respectively
the total HC stock H, allocated to R&D and production. The current valued
Hamiltonian is
H = U(C) + )"(Y - C) + JlaHaA
where the necessary conditions of Pontryagin's Maximum Principle give, for interior
maximization,
He = 0; .x = p).. - H k ; it = PJl- Ha.
This is a typical PHDS with two degrees of freedom, discussed in detail in Chs. 8
and 11.
284
which means that population grows (r > 0) so long as x < K, until it reaches the
ceiling K when x I K = 1 and :i; = 0 : it stops growing. The solution, by separation
of variables
J(~ + k ~ x) dx = Jrdt
IS
k
x(t) = -l-+-c-e---
rt (12.31 )
(12.32)
285
When one species encounters another, some outcomes are possible among which
two best known ones are: prey-predator and competition for the same food supply.
1. Predation Models.
The prey (x)-predator (y) model was originally developed by Lotka (1925) and
Volterra (1931) from observations of the various fish populations in the upper Adri-
atic in the 1920's. It describes the nature of population fluctuations. The model
IS
where a, b, c, d are positive constants. Without predators, the prey population (x)
grows at the Malthusian rate ax which is decreased by cxy as a result of encoun-
ters with predators. Similarly, without preys as food, predators decrease at rate
- by, but the appearance of preys helps slow down this decrease by dxy. This
model has been analyzed in some detail in Ch. 7 where it was shown that the co-
existence (x', y' > 0) equilibrium is a centre, which shows perpetual fluctuations of
the harmonic motion type.
This model is unsatisfactory as it stands because of unending fluctuations and
also because it is structurally unstable: a slightest perturbation would turn it into
a stable or unstable focus.
Samuelson (1967) noticed that and introduced decreasing returns caused byover-
crowding and increasing returns brought about by increased density. The Lotka-
Volterra model is thus modified to
x=x(a-cy+ax)
iJ = y(-b+dx+f3y) (12.34)
where a, b, c, d are positive as before and a, f3 > 0 for the case of increasing returns
and a, f3 < 0 for decreasing returns. The results are that the neutrally stable
equilibrium now becomes a stable focus for decreasing returns and unstable focus for
increasing returns (see fig. 12.3) A generalized Lotka-Volterra model was formulated
286
± = xr(x) - yp(x)
iJ = y[-b + q(x)] (12.35)
0'-------- Z o
where r(x) is the rate of growth of x and p(x) is the predator's influence function
unfavourably affecting the prey population and q(x) is the prey's influence function
favourably affecting the predator population. Thus, even in the absence of predators,
the prey population does not grow at a constant rate a as in Lotka-Volterra model,
but by r(x) which is much more general. Similarly the effect on preys of encounters
with predators is not cxy but yp(x). With the assumptions that (i) r'(O) < 0 < r(O),
(ii) p(O) = 0 < p'(x), p'(O) > 0, p(oo) = p; (iii) q(O) = 0 < q'(0), q'(x) > 0,
q(oo) = ij it has been shown that the model has 3 equilibrium points (0,0), (K,O)
and (x*, y* > 0) where K is the carrying capacity and (x*, y*) is the coexistence
equilibrium which is a w-limit set which is either an equilibrium or a limit cycle.
(see fig. 12.4)
II
oL--~---L--Z
x = x/{x,y}
if = yg{x,y} {12.36}
o,L..---"""'"=:------..l----z (prey)
Under these conditions, Kolmogorov {1936} and also Rescigno & Richardson
{1965} conclude that there exists either an interior {coexistence} equilibrium point
E, or a stable limit cycle or both. For a proof, see Kolmogorov {1936} Rescigno
& Richardson {1965}, also May {1972} who noticed some inconsistencies which he
corrected.
Waltman {1964} introduced a parameter J.L to Kolmogorov's model
x = J.L x/{x,y}
if = yg{x, y} {12.37}
288
and showed that there exists some critical value J.lo of J.l such that r(J.lo) = 0 < r(J.lo)
and <5(J.lo) > 0 where r(J.lo) = trace A, <5(J.lo) = detA, and A is the Jacobian of the
linearization of (12.37) i.e. J.lo gives rise to a Hopf bifurcation (see Ch. 10).
Finally, note that the Lotka-Volterra type model has been extended in many
directions. For example Takeuchi & Adachi (1983) have extended it to 2 tropic
levels with two-prey one-predator, two-prey two-predator systems and by the use of
perturbation methods and Hopf bifurcation, have shown that adding more species
increases diversity. They have also shown that (i) the coexistence equilibrium is
globally stable; (ii) the existenced of Hopf bifurcation into stable limit cycle, and
(iii) the emergence of chaotic motions. Gardini, Lupini, Mammana & Messia (1987)
have generalized the model to n species and found Hopf bifurcation of the three-
population equilibrium point, stable periodic orbits and the transitions to chaotic
attractors via sequences of Hopf bifurcations and period-doublings. Moreira (1990)
has also shown the uniqueness of limit cycles in predation models.
2. Competition Models.
Consider two populations, for example two species of fish, which do not prey on
each other but compete with each other for a common food supply. Each species
in isolation, grows according to the logistic law :i; = x(ao - a1x), iJ = y(bo - b1y).
The presence of another species will reduce the available food supply and hence slow
down the growth rate of the other species, by a2xy and b2xy and respectively, as
follows
with ai, bi > 0 (i = 0,1,2). It can be shown that this model has 4 critical points:
total extinction E1 (0,0), partial extinction E 2(0, bo/bt} , E3(aO/a1' 0) and coexistence
E4(X*, y*), with x*, y* > 0, determined by the intersection, assumed to exist, of
!(x, y) = 0 and g(x, y) = 0 curves. The nature of these equilibrium points depends
on the eigenvalues of the linearization of (12.38) about these points which in turn
depend on the assumptions made about the relative magnitude of these coefficients.
For example, at (0,0), A(O,O) = diag (ao, bo) with both eigenvalues >'1 = ao, >'2 =
bo positive, the origin is a source. This model has also been extended in several
directions. We cannot go into these here, but will rather report one more application
in another area of Biology: the heartbeat.
The heart is in one of the two states: the relaxed state (diastole) and contracted
state (systole). When it stops beating, it is in the diastole state which is a stable
289
equilibrium. What makes the heart contract is the electrochemical wave which
reaches each individual fibre and triggers the action. Each fibre remains contracted
then rapidly relaxes, causing a jump return to equilibrium. Zeeman (1972, 1973)
models these phenomena with a Van der Pol and Lienard equations (see Ch. 7). The
simplest D.S. having these properties is
EX = (x 3 - X + b)
b= x - Xo (12.39)
where x is the length of muscle fibre, (with Xo > 1/../3), E > 0 and b is some
electrochemical control. The equilibrium manifold M : x 3 - x + b = 0 is a double
fold curve, consisting of two stable equilibrium portions, separated by an unstable
one in between (see fig. 12.6)
The equilibrium is stable at E(xo, bo) on the upper fold of M where the lineariza-
tion is
(12.40)
/' b
\~ ......-""=1---.-....1:--
..... _
•••
Figure 12.6. Zeeman's heartbeat: Fast Returns
An increase of b moves the heart from E to T (see fig. 12.6), a threshold which
causes x to jump down to A from T. The muscle then contracts rapidly, sending the
heart from A to T ' , another threshold at which the heart rapidly relaxes along TA'
and slowly returns to the original relaxed state E, where the cycle TAT A' repeats
itself. This models the fast return to E along the x-direction.
To model a slow return, a third dimension is needed: a parameter representing
the tension in the fibre, caused by blood pressure, must be added. The simplest
model is
EX = -(x3 +ax+b)
a = -2x - 2a (12.41)
b= -a-l.
The fixed point (x*, a* ,b*) = (1, -1, 0) at E on M where the linearization is
[t] ~ [-~{E -~f -t] [:] (12.42)
290
Given the knowledge of population dynamics reviewed in the last section, a nat-
ural problem arises as how best to control them, to achieve some specific objectives.
This is the field of Management of Natural Resources and Bioeconomics, a meeting
ground of Economics and Biology. The various populations examined in the last
section emerge under the headings of renewable resources, the best known of which
are fisheries and forestry. The economic problem ensure the best numbers and pro-
portions of the various species. But the Economics of Resource management also
deals with non- renewable resources such as mining and optimal extraction, with the
knowledge that more now means less later and unborn future generations must also
be taken into account. In this section, we shall present some application of Optimal
DS (discussed in Ch. 11) in the field of Resource Management.
Thus, given the laws of population dynamics, the problem consists of maximizing
or minimizing some functional, which could be the social utility of having some
species, or the profit or benefit from harvesting renewable resources such as fishing,
forest cutting or from extracting exhaustible mineral resources. In the latter case,
the user costs, reflecting the cost to future generations of depleting non-renewable
resources by the current generation must enter the calculation of intergenerational
equity.
More specifically, the problem is to maximize
subject to
x= -h(t) + g(x, h) (12.44)
where x(t) is the resource in question, the state variable
h(t) = harvesting rate, the control variable, h E S
g( x, h) = growth rate of x (= 0 in the case of non renewable resources)
f(x, h) : objective function e.g. f = utility function, or profit function or cost
function etc ...
8 = constant discount rate.
This is a standard Optimal Control problem to which application of Pontryagin's
Maximum Principle (see Ch. 11) gives rise to the PHDS
x= Hp
p=-Hx+8p (12.45)
marginal net productivity is equal to the social discount rate~. If bang bang control
(see Ch. 11) is used, only h(max) or hemin) = 0 should be used and if initially,
Xo > x*, harvesting should be carried out to the maximum, i.e. h = h(max) and
if Xo < x*, h = O. In the exhaustible resource case, where g(x, h) = 0, we have
a typical isoperimetric problem (see Ch. 11, also Tu 1991 Ch. 7). Results vary
with specific problems. For example, in the problem of extracting non- renewable
resources with the aggregate social utility function u(h) with u" < 0 < u', as the
objective function I(x, h) above, the Maximum Principle gives u'(h) = peot where
p is constant 'Vt E [0, Tj, i.e. optimal harvesting policy is such that marginal utility
u'(h) increases exponentially at the social discount rate ~, which in view of the
concavity of u(h) and constancy of p, implies that later generations consume less
than current generations (see Ch. 11, section 11.4). Using other objective functions
such as profit II(x, h) = ph - c(x, h) = TR - TC variety, Clarke et al (1982),
Wilen & Brown (1986), Clark (1973), Clark, Clarke & Munro (1979) Cropper, Lee
& Pannu (1979) among others, have obtained a variety of results ranging from
selective harvesting to optimal extinction. We cannot go into these here. Rather,
we shall conclude this section with an example of prey-predator control problem by
Goh, Leitmann & Vincent (1974) to illustrate an explicit application of Optimal
Control dynamics.
An ideal pesticide is a chemical, applied at rate u(t), that kills pests (preys) (Nd
or predators (N2 ) or both, and leaves no residue. The model is
(12.47)
~l = -8Hj8NI
~2 = -8Hj8N2 (12.48)
Chemical control is very effective: it gives speedy results but causes pollution.
An alternative is biological control in the form of release of predators, at rate v, to
control pests (preys), such as the control of cottony-cushion scale by the lady-bird
beetle in California (see DeBach 1964) or release of pests, at rate u, in order to
prevent the extinction of predators. An example of this is the control of red spider
mite (see Hussey & Bravenboer (1971)} or of cabbage worm (see Parker 1971). The
294
Nl = (al - fJIN2)N1 + U
N2 = (fJ2Nl - (2)N2 + V (12.50)
where Ni (0) = N;o, Nl (T) = ad fJ2; N2 (T) = ad fJl as before T is unspecified and
o ~ U ~ U rnal" 0 ~ V ~ Vrnax •
The objective is cost minimization i.e.
(12.51 )
where Nl and N2 are given in (12.50). This is a L.O.C. model and optimal control
u*, given by the switching function cr(t) == Cl + Al (t), as before: cr(t) > 0 =} u* =
o == Urnin, cr(t) < 0 =} u* = U max and cr(t) == 0 =} singular control. Again, the
latter is eliminated by the test cr == 0 = if = if, leaving the bang bang as the only
admissible candidate. Similarly for the case U = 0 =j: v.
Thus, it can be seen that, with a selective application of pesticide and release of
species at appropriate rates, the harmonic motions are brought under control and
the two populations, driven to a desired target. There are many other prey-predator
optimal control models such as Ragozin & Brown (1985), Wilen & Brown (1986),
but we cannot go into them here.
12.5. Conclusion
This chapter provides some applications of D.S. in the various fields of Economics,
Ecology and Bioeconomics. It can be seen, however, even from the small sample of
the work under review, that D.S. is a powerful research tool. No doubt every tool
has its limitations, and nothing can replace the researcher's ingenuity in modelling
Economics and Biology with D.S. It is nevertheless true that D.S. theory provides
valuable insight into many problems, facilitates their solutions and brings about
interesting results.
Bibliography
[5] Anton, H. Elementary Linear Algebra. John Wiley & Sons, New York, 1984.
[6] Anton, H. & C. Rorres. Elementary Linear Algebra with Applications. J. Wiley
& Sons, New York, 1987.
[9J Arrow, K.J. & L. Hurwicz. On the Stability of the Competitive Equilibrium,
Part I - II. Econometrica 26: 522-552, October, 1958. 27: 82-109, January,
1959.
[10] Arrow, K.J. & M. Kurz. Public Investment, The Rate of Return and Optimal
Fiscal Policy. John Hopkins Press, Baltimore, Md, 1970.
[12] Arrow, K.J. & M. McManus. A Note on Dynamic Stability. Econometrica 26:
297-305, 1958.
[13] Arrow, K.J., H. Block & L. Hurwicz. The Stability of Competitive Equilibrium
II. Econometrica 27 (I): 82-109, January 1959.
[14] Arrowsmith, D.K. & C.M. Place. An Introduction to Dynamical Stytems. Cam-
bridge University Presss, Cambridge, 1990.
[15] Athans, M. & P.L. Falb. Optimal Control. McGraw-Hill, New York, 1966.
296
[16] Bailey, N.T.J. The Mathematical Theory of Epidemics. Hafner, New York,
1957.
[17] Balasko, Y. & R. Boyer. Une Analyse de l'effet du Progres Technique sur
l'emploi. CEPREMAP, January 1981.
[24] Benhabib, J. & R.H. Day. Rational Choice and Erratic Behaviour. Review of
Economic Studies 48: 459-471,1981.
[25] Benhabib, J. & R.H. Day. Characterization of Erratic Dynamics in the Over-
lapping Generation Model. Journal of Economic Dynamics and Control 4: 37-
55, 1982.
[26] Benhabib, J. & T. Miyao. Some New Results on the Dynamics of the Gener-
alised Tobin Model. International Economic Review 22(3): 589- 596, October,
1981.
[27] Benhabib, J. & K. Nishimura. The Hopf Bifurcation and the Existence of
Closed Orbits in Multisection Models of Optimal Economic Growth. Journal
of Economic Theory 21: 421-444, December, 1979.
[28] Benhabib, J. & K, Nishimura. Stability of Equilibrium in Dynamic Models of
Capital Theory. International Economic Review 22: 275- 293, June, 1981.
[32] Brock, W.A. "The Global Asymptotic Stability of Optimal Control: A Sur-
vey of Recent Results" in Intrilligator, M.D. (ed): Frontiers of Quantitative
Economics. VoI3A, North Holland Co., Amsterdam, 1977.
[33] Brock, W.A. & J.A. Scheinkman. Global Asymptotic Stability of Optimal Con-
trol Systems with Applications to the Theory of Economic Growth. Journal of
Economic Theory 12: 164-190, February, 1976.
[34] Brocker, Th. & L. Lander. Differentiable Germs and Catastrophes. Cambridge
University Press, 1975.
[38] Cass, D. & K. Shell (eds). The Hamiltonian Approach to Dynamic Economics.
Academic Press, New York, 1976.
[39] Chang, W.W. & D.J. Smyth. The existence and Persistence of Cycles in a
Nonlinear Model: Kaldor's 1940 Model Re-examined. Review of Economic
Studies 38: 37-44, January, 1971.
[41] Chipman, J.S. The Theory of Intersectoral Money Flows and Income Forma-
tion. John Hopkins Press, Baltimore, 1951.
[42] Chow, S.N. & J.K. Hale. Methods of Bifurcation Theory. Springer-Verlag,
Heidelberg, 1982.
[43] Clark, C. Profit Maximization and the Extinction of Animal Species. Journal
of Political Economy 81: 950-960, July-August, 1973.
[44] Clark, Colin W. Mathematical Bioeconomics. John Wiley & Sons, New York,
1976.
[45] Clark, C.W., F.H. Clarke & G.R. Munro. The Optimal Exploitation of Re-
newable Resource Stocks: Problems of Irreversible Investment. Econometrica
47(1): 25-27, January, 1979.
[46] Clarke, F.H., M.N. Darrough & J.M. Heineke. Optimal Pricing Policy in the
Presence of Experience Effects. Journal of Business 55: 517-530, October,
1982.
298
[49] Conlisk, J. Quick Stability Checks and Matrix Norms. Economica (NS) XL:
402-409, November, 1973.
[50] Conrad, J.M. & C.W. Clark. Natural Resource Economics. Cambridge Uni-
versity Press, 1987. .
[53] Day, R.H. Irregular Growth Cycles. American Economic Review 72: 406-414,
June, 1982.
[54] Day, R.H. The Emergence of Chaos from Classical Economic Growth. Quar-
terly Journal of Economics 98: 20-213, May, 1983.
[55] Day, R.H. & W.J. Shafer. Keynesian Chaos. Working Paper, Department of
Economics, University of Southern California, Los Angeles, 1983.
[56] DeBach, P. Biological Control of Insect Pests and Weeds. Chapman & Hall,
London, 1964.
[57] Debreu, G. Theory of Value. Yale University Press, New Haven, 1959.
[59] Denison, E.G. The Sources of Economic Growth in the U.S. and the Alterna-
tives Before Us. Suppl. Paper No 13, C.E.D., New York, 1962.
[60] Diamond, P.A. National Debt in a Neo Classical Growth Model. American
Economic Review 55 (5): 1126-1150, December, 1965.
[61] Diamond, P. Chaotic Behaviour of Systems of Difference Equations. Interna-
tional Journal of Systems Science 7: 953-956, 1976.
[62] Dockner, E. "Local Stability-Analysis in Optimal Control Problems with Two
State Variables" in Feichtinger, G. (ed): Optimal Control and Economic Anal-
ysis 2. Elsevier Science, North Holland, Amsterdam, 1985.
[63] Dockner, E.J. & G. Feichtinger. Cyclical Consumption Patterns and Rational
Addiction. Vienna, undated.
299
[64] Dockner, E.J. & G. Feichtinger. On the Optimality of Limit Cycles in Dynamic
Economic Systems. Journal of Economics 53(.): 31-50, 1991.
[65] Dockner, E.J., G. Feichtinger & A. Novak. Cyclical Production and Marketing
Decisions: Application of Hopf Bifurcation Theory. International Journal of
Systems Science, Vol 00, No 00, 000-000, 1991.
[66] Domar, E.D. The Burden of Debt and the National Income. American Eco-
nomic Review 34: 798-827, December, 1944.
[67] Domar, E.D. Capital Expansion, Rate of Growth and Employment. Economet-
rica 14: 137-47, April, 1946.
[68] Domar, E.D. Essays in the Theory of Growth. Oxford University Press, Lon-
don, 1957.
[69] Dorfman, R. An Economic Interpretation of Optimal Control Theory. Ameri-
can Economic Review 59: 817-831, December, 1969.
[70] Dorfman, R., P.A. Samuelson & R.M. Solow. Linear Programming and Eco-
nomic Analysis. McGraw-Hill, New York, 1958.
[71] Duesenberry, J. S. Selected Problems in Economic Theory: Discussion. Amer-
ican Economic Review /S 49: 528-30, May, 1959.
[72] Euler, L. Methodus inveniendi lineas curvas ... in Opera Omnia I, V 24: 231-
297, Fissli, Zurich, 1960.
[78] Fiedler, M. & V. Ptak. On Matrices with Nonpositive Off- Diagonal Elements
and Positive Principal Minors. Czechoslov. Mathematics Journal 12: 382-400,
1962.
[81] Gabisch, G. & H.W. Lorenz. Business Cycle Theory. Lecture Notes in Eco-
nomics and Mathematical Systems No 283, Springer-Verlag, 1987.
[82] Gandolfo, G. Economic Dynamics: Methods of Models, second edition. North
Holland, Amsterdam, 1983.
[83] Gantmacher, F.R. The Theory of Matrices. Vol 1, 2, InterScience Publishers,
Chelsea, New York, 1959.
[84] Gardini, L., R. Lupini, C. Mammana, & M.G. Messia. Bifurcations and tran-
sitions to Chaos in the three-dimensional Lotka- Volterra Map. SIAM Journal
Appl. Math. Vol. 47(3): 455-582, June, 1987.
[85] Gause, G.F. The Struggle for Existence. Williams & Wilkins, Baltimore, 1934.
[98] Grandmont, J.M. (ed). Nonlinear Economic Dynamics. Academic Press, Or-
lando, 1986.
[103] Hahn, F. Gross Substitutes and the Dynamic Stability of General Equilibrium.
Econometrica 26: 169-170, January, 1958.
[104] Hahn, F. "Stability" in Arrow, K.J. & M.D. Intrilligator (eds): Handbook of
Mathematical Economics. Vol II, North Holland Publishing Co, Amsterdam,
1982.
[106] Hahn, F.H. & R.C.O. Matthews. The Theory of Economic Growth: A SUnJey.
Economic Journal 74: 779-902, December, 1964.
[107] Halfon, E. Theoretical Systems Ecology. Academic Press, New York, 1979.
[108] Harrod, R.F. An Essay in Dynamic Theory. Economic Journal 49: 14-33,
March, 1939.
[111] Hassard, B.D., N.D. Kazarinoff & U Y.H. Wan. Theory and Applications of
Hopf Bifurcation. Cambridge University Press, 1981.
[112] Hawkins, D. & H.A. Simon. Note: Some Conditions of Macroeconomic Stabil-
ity. Econometrica 17: 245-248, July-October, 1949.
[113] Hicks, J.R. A Contribution to the Theory of the Trade Cycle. Oxford University
Press, 1950.
[114] Hilton, P.J. (ed). Structural Stability, The Theory of Catastrophes and Appli-
cations in the Sciences. Springer, Berlin, 1976.
[115] Hirsch, M.W. & S. Smale. Differential Equations, Dynamical Systems and
Linear Algebra. Academic Press, New York, 1974.
302
[134] Lancaster, P. Quadratic Eigenvalue Problems. Linear Algebra and its Appli-
cations 150: 499-506, 1991.
[136] Levinson, N. & O.K. Smith. A General Equation for Relaxation Oscillations.
Duke Mathematical Journal 9: 382-403, 1942.
[138] Li, T. & J. Yorke. Period Three Implies Chaos. American Mathematical
Monthly 8: 985-992, 1975.
[139] Lorenz, H.W. Goodwin's Nonlinear Accelerator and Chaotic Motion. Journal
of Economics 47: 413-418,1987. .
[140] Lorenz, H.W. "Optimal Economic Control and Chaotic Dynamics" in Fe-
ichtinger (ed): Optimal Control Theory and Economic Analysis 9. Elsevier
Science Pub., Amsterdam, 1988.
[144] Lotka, A.J. Elements of Physical Biology. Williams & Wilkins, Baltimore,
1925.
[148] Marotto, F.R. Snap-Back Repellers Imply Chaos in R:'. Journal of Mathemat-
ical Analysis and Applications 63: 199-223, 1978.
[150] Marsden, J.E. & M. McCracken. The Hopf Bifurcation and its Applications.
Springer-Verlag, Heidelberg, 1976.
[152] May, R.M. Limit Cycles in Predator-Prey Communities. Science 177: 900-902,
1972.
[153] May, R.M. Stability and Complexity in Model Ecosystems. Princeton Univer-
sity Press, 1973.
[154] May, R.M. Biological Populations with Non Overlapping Generations: Stable
points, Stable Cycles and Chaos. Science 186: 645- 647, 1974.
[155] May, R.M. Simple Mathematical Models with Very Complicated Dynamics.
Nature 261: 459-467, 1976.
[156] McKenzie. Stability of Equilibrium and the Value of Positive Excess Demand.
Econometrica 28: 606-617, July, 1960.
[157] McKenzie, L.W. The Matrix with Dominant Diagonal and Economic Theory.
Proceedings of a Symposium on Mathematical Methods in the Social Sciences,
Stanford University Press, Palo Alto, 1960.
[163] Mensch, G., Kaasch, K., Kleinknecht, A. & R. Schnopp. Innovation Trends
and Switching Between Full and Under-Employment Equilibrium 1950-1978.
International Institute of Management Discussion Papers 80-5, January, 1980.
[164] Metzler, L.A. The Nature and Stability of Inventory Cycles. Review of Eco-
nomic Studies 23: 113-129, August, 1941.
[167] Moreira, H.N. On Lienard's equation and the uniqueness of Limit Cycles in
predator-prey systems. Journal of Mathematical Biology 28: 341-354, 1990.
[168] Morishima, M. Prices, Interest and Profits in a dynamic Leontief System.
Econometrica 26: 358-80, July, 1958.
[169] Morishima, M. On the three Hicksian laws of Comparative Statics. Review of
Economic Studies 27: 195-201, June, 1960.
[170] Morishima, M. A Reconsideration of the Walras-Cassel-Leontief Model of
General Equilibrium. In Arrow, K.J., S. Karlin & P. Suppes (eds.): Mathe-
matical Methods in the Social Science, 1959 Proceedings of the First Stanford
Symposium, Stanford, 1960.
[171] Negishi, T. The Stability of the Competitive Equilibrium: A Survey Article.
Econometrica 30: 635-670,1962.
[172] Negishi, T. "Tatonnement and Recontracting" in The New Palgrave: General
Equilibrium. W.W. Norton, New York, 1989.
[173] Newhouse, S., D. Ruelle & F. Takens. Occurrence of Strange Axiom Attractors
near Quasi-Periodic Flows on Tm (m > 3). Communications in Mathematical
Physics 64: 35-40, 1978.
[174] Newman, P.K. Some Notes on Stability Conditions. Review of Economic Stud-
ies 27: 1-9, September, 1959.
[175] Nikaido, H. Convex Structure and Economic Theory, Academic Press, New
York, 1968.
[176] Nirenberg, L. Topics in nonlinear Functional Analysis. Courant Institute of
Mathematical Sciences, New York University, 1973-1974.
[177] Nitecki, Z. Differentiable Dynamics. MIT Press, Cambridge, MA, 1971.
[178] Ott, Edward. Chaos in Dynamical Systems. Cambridge University Press, Cam-
bridge, 1993.
[179] Parker, F.D. "Management of Pest Populations by Manipulating Densities of
both Hosts and Parasites through Periodic Releases" in Huffaker, C.B. (ed):
Biological Control. Plenum Press, New York, 1971.
[180] Peixoto, M.M. Structural Stability on Two-Dimensional Manifolds. Topology
1: 101-120, 1962.
[181] Perko, L. Differential Equations and Dynamical Systems. Springer-Verlag,
Heidelberg, 1991.
[182] Perlis, S. Theory of Matrices, Vol. 1. Chelsea, New York, 1958.
[183] Phillips, A.W. Stabilisation Policy in a Closed Economy. Economic Journal
64: 290-324, June, 1954.
306
[185] Pitchford, J.D. "Two state variable problems" in Pitchford J.D. & S.J.
Turnovsky (eds): Applications of Control Theory to Economic Analysis. North
Holland Co, Amsterdam, 1977.
[186] Poincare, H. Sur l'equilibre d'une masse fluide animee d'un mouvement de
rotation. Acta Mathematica (16 September, 1885) 7: 259- 380.
[187] Poincare, H. Oeuvres de Henri Poincare. Tome VIII, pp. 40-140. Published
by the Academie des Sciences Gauthier-Villars, Paris, 1952.
[190] Pontryagin, L.S., V.G. Boltyanskii, R.V. Gamkrelidze & E.R. Mishchenko.
The Mathematical Theory of Optimal Processes. Interscience Publishers, New
York,1962.
[191] Poston, T. & LN. Stewart. Taylor Expansions and Catastrophes. Pitman, Lon-
don, 1976.
[192] Poston, T. & LN. Stewart. Catastrophe Theory and its Applications. Pitman,
London, 1978.
[193] Putzer, E.J. Avoiding the Jordan Canonical Form in the Discussion of Linear
Systems with Constant Coefficients. American Mathematical Monthly, Vol. 73:
2-7, Januar~ 1966.
[194] Puu, T6nu. Nonlinear Economic Dynamics. Lecture Notes in Economics and
Mathematical Systems No 336, Springer-Verlag, Heidelberg, 1989.
[195] Quirk, J. & R. Rupert. Qualitative Economics and the Stability of Equilibrium.
Review of Economic Studies 32: 311-326, October, 1965.
[196] Ragozin, D.L. & G. Brown. Harvest Policies and Nonmarket Valuation in a
Predator-Prey System. Journal of Environmental Economics and Management
12: 155-168, June, 1985.
[197] Ramsey, F.P. A Mathematical Theory of Saving. Economic Journal 38: 543-
59, December, 1928.
[198] Rebelo, S. Long Run Policy Analysis and Long Run Growth. Working Paper,
University of Rochester, Rochester, New York, 1988.
[199] Rescigno, A. The Struggle for Life I: Two Species. Bull. Math. Biophysics 29:
377-388, 1967.
307
[201] Rescigno, A. & I.W. Richardson. The Struggle for Life I: Two Species. Bull.
Math. Biophysics 29: 377-388, 1967.
[202] Romer, P.M. Increasing Returns and Long Run Growth. Journal of Political
Economy 94(5): 1002-1037, October, 1986.
[204] Rose, H. The Possibility of Warranted Growth. Economic Journal 69: 313-332,
June, 1959.
[206] Rosen, R.R. (ed). Foundations of Mathematical Biology. Vol III, Academic
Press, New York, 1973.
[210] Samuelson, P.A. Interaction between the Multiplier Analysis and Principle of
Acceleration. Review of Economic Statistics 21: 75-78, May, 1939.
[211] Samuelson, P.A. The Stability of Equilibrium: Comparative Statics and dy-
namics. Econometric 9: 97-120, April, 1941.
[218] Schultz, T.W. Investment in Human Capital. American Economic Review 51:
1-17, March, 1961.
[222] Singer, D. Stable Orbits and Bifurcation of Maps of the Interval. SIAM Journal
of Applied Mathematics 35: 260-267, 1978.
[225] Smith, J.M. Models in Ecology. Cambridge University Press, Cambridge, 1974.
[226] Smithies, A. Economic Fluctuations and Growth. Econometrica 25: 1-52, Jan-
uary, 1957.
[227] Smithin, J.N. & Tu, P.N.V. Disequilibrium Adjustment in a Classical Macroe-
conomic Model: A Note. Journal of Economics 47(2): 207- 213, 1987.
[230] Solow, R.M. Note on Uzawa's Two-Sector Model of Economic Growth. Review
of Economic Studies 29: 48-50, October, 1961.
[233] Strang. Linear Algebra and its Applications. Academic Press, New York, 1976.
309
[234] Struble, R.A. Nonlinear Differential Equations. McGraw- Hill, New York,
1962.
[235] Swan, T.W. Economic Growth and Capital Accumulation. Economic Record
32: 334-361, November, 1956.
[237] Tan, N.x. & P.N.V. Tu. Some Hopf Bifurcation Theorems at Simple Eigen-
values and Economic Applications. Unpublished paper, 1992, now retitled as
Some New Hopf Bifurcation Theorems at Simple Eigenvalues, to appear in
the forthcoming issue of Applicable Analysis, an International Journal, Vol
53/ 3-4, June 1994.
[238] Taylor, A.E. Introduction to Functional Analysis. John Wiley, New York, 1958.
[240] Thompson, J.M.T. & G.W. Hunt. A General Theory of Elastic Stability. Wiley,
London, 1973.
[241] Thompson, J.M.T. & G.W. Hunt. A Bifurcation of Theoryfor the Instabilities
of Optimization and Design. Synthese, Vol. 36: 315-351, 1977.
[242] Thompson, J.M.T. & H.B. Stewart. Nonlinear Dynamics and Chaos. Wiley &
Sons, New York, 1986.
[243] Tobin, J. Money and Economic Growth. Econometrica 33: 671-684, October,
1965.
[244] Torre, V. Existence of Limit Cycles and Control in Complete Keynesian Sys-
tems by Theory of Bifurcations. Econometrica 45: 1457-1466, 1977.
[245] Trotman, D.J.A. & E.C. Zeeman. "The Classification of Elementary Catastro-
phes of Co dimension ~ 5," in Hilton, P.J. (ed): Structural Stability, the Theory
of catastrophes and Applications in the Science. Springer, Berlin, 1976.
[249] Tu, P.N.V. Comparative Statics and Catastrophe Theory in Economics. Dis-
cussion Papers Series No 74, University of Calgary, July, 1982.
310
[252] Tu, P.N.V. Towards an Optimal Wildlife Management. Discussion Papers Se-
ries No 111, University of Calgary, June, 1988.
[253] Tu, P.N.V. Introductory Optimization Dynamics. Second revised and enlarged
edition, Springer-Verlag, Heidelberg, 1991.
[254] Tu, P.N.V. Double Axisymmetry and Perturbed Hamiltonian Dynamical Sys-
tems in Optimal Economic Growth Models. Discussion Paper #135, Sept. 1992,
Department of Economics, University of Calgary.
[258] Uzawa, H. 1968. Market Allocation and Optimal Growth. Australian Economic
Papers 7: 17-27, June, 1968.
[259] Van der Ploeg, F. Economic Growth and Conflict over the Distribution of
Income. Journal of Economic Dynamics and Control 6: 253- 279, 1983.
[260] Van der Ploeg, F. Classical Growth Cycles. Metroeconomica 37(2): 221-230,
1985.
[261] Varian, H.R. Catastrophe Theory and the Business Cycle. Economic Inquiry
17: 14-28, January, 1979.
[263] Verhulst, P.F. Notice sur la loi que la population suit dans son accroissement.
Correspondance Mathematique et Physique 10: 113-121, 1838.
[266] Waldrop, M.M. Complexity: the Emerging Science at the Edge of Chaos. Si-
mon & Schuster, 1992.
311
[267] Waltman, P.E. The Equation of Growth. Bulletin Math. Biophys. 26: 39-43,
1964.
[272] Wirl, F. The Ramsey Model Revisited: The Optimality of Cyclical Consump-
tion Paths. Institute of Energy Economics, University of Vienna, January,
1991.
[273] Wolfstetters. Fiscal Policy and the Classical Growth Cycle. Journal of Eco-
nomics 42(4): 375-393, 1982.
[274] Zarnowitz, V. Recent Work on Business Cycles in Historical Perspective: A
Review of Theories and Evidence. Journal of Economic Literature 23: 523-580,
June, 1985.
[275] Zeeman, E.C. "Differential Equations for the Heartbeat and Nerve Impulse"
in Waddington, C.H. (ed): Towards a Theoretical Biology. Aldine-Atherton,
Chicago, 1972.
[276] Zeeman, E.C. "Differential Equations for the Hearth and Nerve Impulse," in
Peixoto (ed): Dynamical Systems. Academic Press, New York, 1973.
[277] Zeeman, E.C. Catastrophe Theory. Scientific American 234 (PL 4): 65-83.
[278] Zeeman, E.C. Catastrophe Theory. Reading, Mass, Addison- Wesley, 1977.
INDEX