Beruflich Dokumente
Kultur Dokumente
Chad R. Galley
*
Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California 91109 and Theoretical Astrophysics,
California Institute of Technology, Pasadena, California 91125, USA
(Received 1 September 2012; published 22 April 2013)
Hamiltons principle of stationary action lies at the foundation of theoretical physics and is applied in
many other disciplines from pure mathematics to economics. Despite its utility, Hamiltons principle has a
subtle pitfall that often goes unnoticed in physics: it is formulated as a boundary value problem in time but
is used to derive equations of motion that are solved with initial data. This subtlety can have undesirable
effects. I present a formulation of Hamiltons principle that is compatible with initial value problems.
Remarkably, this leads to a natural formulation for the Lagrangian and Hamiltonian dynamics of generic
nonconservative systems, thereby lling a long-standing gap in classical mechanics. Thus, dissipative
effects, for example, can be studied with new tools that may have applications in a variety of disciplines.
The new formalism is demonstrated by two examples of nonconservative systems: an object moving in a
uid with viscous drag forces and a harmonic oscillator coupled to a dissipative environment.
DOI: 10.1103/PhysRevLett.110.174301 PACS numbers: 45.20.d, 02.30.Xx, 05.20.y, 47.10.g
Hamiltons principle of stationary action [1] is a corner-
stone of physics and is the primary, formulaic way to derive
equations of motion for many systems of varying degrees
of complexityfrom the simple harmonic oscillator to
supersymmetric gauge quantum eld theories. Hamiltons
principle relies on a Lagrangian or Hamiltonian formula-
tion of a system, which account for conservative dynamics
but cannot describe generic nonconservative interactions.
For simple dissipation forces local in time and linear in the
velocities, one may use Rayleighs dissipation function [1].
However, this function is not sufciently comprehensive
to describe systems with more general dissipative features
like history dependence, nonlocality, and nonlinearity that
can arise in open systems.
The dynamical evolution and nal conguration of
nonconservative systems must be determined from initial
conditions. However, it seems under appreciated that while
initial data may be used to solve equations of motion
derived from Hamiltons principle, the latter is formulated
with boundary conditions in time, not initial conditions.
This observation may seem innocuous, and it usually is,
except that this subtlety may manifest undesirable features.
Remarkably, resolving this subtlety opens the door to
proper Lagrangian and Hamiltonian formulations of
generic nonconservative systems.
An illustrative example.To demonstrate the shortcom-
ing of Hamiltons principle, consider a harmonic oscillator
with amplitude q(i), mass m, and frequency w coupled with
strength to another harmonic oscillator with amplitude
Q(i), mass M, and frequency . The action for this systemis
Sjq, Q|
Z
i
]
i
|
Ji
m
2
( _ q
2
w
2
q
2
) +qQ
+
M
2
(
_
Q
2
2
Q
2
)
. (1)
The total system conserves energy and is Hamiltonian but
q(i) itself is open to exchange energy with Q and should
thus be nonconservative. For a large number of Qoscillators
the open (sub)systemdynamics for q ought to be dissipative.
Let us account for the effect of the Q oscillator on q(i)
by nding solutions only to the equations of motion for Q
and inserting them back into (1), which is called integrat-
ing out. The resulting action,
S
eff
jq|
Z
i
]
i
|
Ji
m
2
( _ q
2
w
2
q
2
) +qQ
(h)
(i)
+
2
2M
Z
i
]
i
|
Ji
/
q(i)G
ret
(i i
/
)q(i
/
)
, (2)
is the effective action for q(i), though it is sometimes called
a Fokker action [2]. Q
(h)
(i) is a homogeneous solution
(from initial data) and G
ret
(i i
/
) is the retarded Green
function for the Q oscillator.
The last term in (2) involves two time integrals and the
product q(i)q(i
/
). The latter is symmetric in i -i
/
and
couples only to the time-symmetric part of the retarded
Green function. Hence, the last term in (2) equals
2
2M
Z
i
]
i
|
JiJi
/
q(i)
G
ret
(i i
/
) +G
adv
(i i
/
)
2
q(i
/
) (3)
when using the identity G
ret
(i
/
i) G
adv
(i i
/
).
Applying Hamiltons principle to the effective action (2)
yields the equation of motion for q(i),
m q +mw
2
q Q
(h)
(i) +
2
2M
Z
i
]
i
|
Ji
/
jG
ret
(i i
/
)
+G
adv
(i i
/
)|q(i
/
). (4)
There are a couple of key points regarding (4). First, the
second term on the right side depends on the advanced
PRL 110, 174301 (2013)
P HYS I CAL RE VI E W L E T T E RS
week ending
26 APRIL 2013
0031-9007,13,110(17),174301(5) 174301-1 2013 American Physical Society
Green function implying that solutions to (4) do not evolve
causally nor are specied by initial data alone. Second, the
kernel of the integral in (4) is symmetric in time, which
means that the integral describes conservative interactions
between q and Q. Consequently, (4) does not account
for dissipation, a time-asymmetric process, that should be
present when there are N >1 of the Q oscillators.
These undesirable features can be traced back to the very
formulation of Hamiltons principle, which solves the
problem: Find the path q(i) passing through the given
values q
|
at i i
|
and q
]
at i i
]
that makes the action
stationary (see the left cartoon in Fig. 1). Stated in this
way, it is clear that Hamiltons principle is appropriate for
systems satisfying boundary conditions in time, not initial
conditions. According to Sturm-Liouville theory [3], the
time-symmetric integration kernel in (4), which is a Green
function itself, satises boundary conditions in time.
Likewise, boundary conditions in time imply that the
corresponding Green function is time symmetric. This
example indicates an intimate connection in the variational
calculus between boundary (initial) conditions and conser-
vative (nonconservative) dynamics.
In the remainder, I formulate Hamiltons principle
with initial conditions for general systems, report some
consequences, and present some examples.
Hamiltons principle with initial data.A hint for
how to proceed comes from the previous example. The
advanced Green function in (3) and (4) appears because the
factor q(i)q(i
/
) couples only to the time-symmetric part
of the retarded Green function. Breaking the symmetry
by introducing two sets of variables, say q
1
and q
2
, implies
that q
1
(i)q
2
(i
/
) will couple to the full retarded Green
function, not just its time-symmetric part. Varying with
respect to only q
1
gives the correct force, provided one
sets q
2
q
1
after the variation [4]. This procedure is
formalized and developed for general systems below.
Let q q
|
N
|1
and
_
q _ q
|
N
|1
be a set of N general-
ized coordinates and velocities of a general dynamical
system. Formally, double both sets of quantities, q -
( q
1
, q
2
) and
_
q -(
_
q
1
,
_
q
2
). Parametrize both coordinate
paths as q
1,2
(i, e) q
1,2
(i, 0) +e ,
1,2
(i), where q
1,2
(i, 0)
are the coordinates of the two stationary paths, e <1,
and ,
1,2
(i) are arbitrary virtual displacements. To ensure
that enough conditions are given for varying the action, we
require that (1) ,
1,2
(i
|
) 0 and (2) q
1
(i
]
, e) q
2
(i
]
, e)
and
_
q
1
(i
]
, e)
_
q
2
(i
]
, e) for all e (the equality condition).
The equality condition does not x either value at the nal
time since the values they equal are not specied. After all
variations are performed, both paths are set equal to each
other and identied with the physical one, q(i) (the physi-
cal limit). See the right cartoon in Fig. 1.
The action functional of q
1
and q
2
is dened here as the
total line integral of the Lagrangian along both paths plus
the line integral of a function K (discussed below) that
depends on both paths q
o
2
o1
and cannot generally be
written as the difference of two potentials,
Sj q
o
|
Z
i
]
i
|
JiL( q
1
,
_
q
1
) +
Z
i
|
i
]
JiL( q
2
,
_
q
2
)
+
Z
i
]
i
|
JiK( q
o
,
_
q
o
, i),
Z
i
]
i
|
JijL( q
1
,
_
q
1
) L( q
2
,
_
q
2
) +K( q
o
,
_
q
o
, i)|. (5)
This action denes a new Lagrangian
( q
o
,
_
q
o
) L( q
1
,
_
q
1
) L( q
2
,
_
q
2
) +K( q
o
,
_
q
o
, i). (6)
If K could be written as the difference of two potentials,
V( q
1
) V( q
2
), then it could be absorbed into the differ-
ence of the Lagrangians in (5), leaving K zero [8]. Thus, a
nonzero K describes generalized forces that are not deriv-
able from a potential (i.e., nonconservative forces) and
couples the two paths with each other.
It is convenient, but not necessary, to make a change of
variables to q
+
( q
1
+ q
2
),2 and q
q
1
q
2
because
q
-0 and q
+
- q in the physical limit. The conjugate
momenta in the variables, regarded as functions of the
coordinates and velocities, are found to be r
,
_
q
,
and the paths are parametrized as q
(i, e) q
(i, 0) +
e ,
|,Je)
e0
for all ,
, or
0
Z
i
]
i
|
Ji
,
+
q
+
J r
Ji
0
+ ,
J r
+
Ji
+j ,
+
(i) r
(i) + ,
(i) r
+
(i)|
i
]
ii
|
, (7)
where the subscript 0 denotes evaluation at e 0 and
,
+
r
P
N
|1
,
+|
r
|
, etc.
The equality condition requires q
1
(i
]
, e) q
2
(i
]
, e) and
_
q
1
(i
]
, e)
_
q
2
(i
]
, e) so that ,
(i
]
) 0 and r
(i
]
) 0.
With ,
(i
|
) 0 it follows that the boundary terms in (7)
all vanish. Thus, (7) is satised for any ,
(i) solve
J r
Ji
q
. (8)
FIG. 1 (color online). Left: A cartoon of Hamiltons principle.
Dashed lines denote the virtual displacements and the solid line
denotes the stationary path. Right: A cartoon of Hamiltons
principle compatible with initial data (i.e., the nal state is not
xed). In both cartoons, the arrows on the paths indicate the
integration direction for the line integral of the Lagrangian.
PRL 110, 174301 (2013)
P HYS I CAL RE VI E W L E T T E RS
week ending
26 APRIL 2013
174301-2
Of course, one could have used the q
1,2
coordinates instead
to nd J r
1,2
,Ji, q
1,2
with r
1,2
(1)
1,2
,
_
q
1,2
regarded as functions of q
1,2
and
_
q
1,2
.
In the physical limit (PL), only the J r
+
,Ji , q
PL
L
q
+
K
q
PL
, (9)
where the conjugate momenta are
r( q,
_
q)
_
q
PL
_
q
+
_
q
PL
. (10)
When K 0 the generalized forces are derived from
potentials and one recovers the usual Euler-Lagrange
equations. A nonzero K can be regarded as a nonconser-
vative potential.
In the physical limit, only the Euler-Lagrange equation
for the +variable survives. Hence, expanding the action in
powers of q
uSj q
|
u q
(i)
PL
. (11)
Only terms in the new action (5) that are perturbatively
linear in q
_
q
1
2
_
q
2
( q
1,2
,
_
q
1,2
)
H( q
1
,
1
)H( q
2
,
2
)K( q
1,2
,
_
q
1,2
,i),
(12)
where
_
q
1
and
_
q
2
are now functions of their respective
coordinates and momenta. Writing (12) in the variables
gives
A( q
)
+
_
q
_
q
+
( q
,
_
q
). (13)
Both (12) and (13) can be written as
A( q
o
,
o
)
o
_
q
o
( q
o
,
_
q
o
), (14)
where a metric c
ol
is introduced to raise and lower the
indices labeling the doubled variables: (1, 2) in (12) and
(+, ) in (13). For the former c
ol
diag(1, 1) and for
the latter c
ol
offdiag(1, 1) so that
o
_
q
o
c
ol
o
_
q
l
(repeated indices are summed), where c
ol
is the inverse
of c
ol
. Dene new Poisson brackets by
f], gg
]
q
o
g
o
]
o
g
q
o
, (15)
which can be shown to satisfy Jacobis identity.
Then, Hamiltons equations follow by extremizing the
action (5), giving
_
q
o
A
o
f q
o
, Ag,
_
o
A
q
o
f
o
, Ag. (16)
Note the index positions since they are raised and lowered
by the metric c
ol
. In the physical limit, (16) becomes
Hamiltons equations for a nonconservative system,
_
q
H
PL
q, H jf q
, Kg|
PL
,
_
H
q
+
K
q
PL
, H jf
, Kg|
PL
.
(17)
The total time derivative of the energy function [1],
h( q,
_
q)
_
q
L
_
q
L, (18)
follows from the usual manipulations [1], which here give
Jh
Ji
L
i
_
q
J
Ji
K
_
q
K
q
PL
. (19)
The amount of energy entering or leaving the system is
determined by K when L,i 0 and can be found
directly from the new Lagrangian.
Example: viscous drag forces.This new formalism can
be used like the standard theory. Consider the following
new Lagrangian, given in the variables:
( x
,
_
x
) m
_
x
_
x
+
n x
_
x
+
}
_
x
+
}
n1
, (20)
where n 1 (linear) or 2 (nonlinear). The rst term is the
difference of the two kinetic energies ( m
_
x
2
1
,2 m
_
x
2
2
,2),
and the second term is K. The new Lagrangian (20) is
unique up to terms nonlinear in x
n _ x
|
. The physical
limit implies that x
+
is determined by the physical initial
data, x
+
(i
|
) x
|
and
_
x
+
(i
|
)
|
, while x
is specied by
nal data, x
(i
]
) 0
_
x
(i
]
), according to the equality
condition. Because x
(i) 0 and x
+
(i) x
|
+m
|
,nj1 e
n(ii
|
),m
|.
The former automatically imposes the physical limit so
PRL 110, 174301 (2013)
P HYS I CAL RE VI E W L E T T E RS
week ending
26 APRIL 2013
174301-3
that x
+
(i) is the physically correct solution. The new action
is stationary for these solutions, as can be shown by direct
substitution into (7).
With K given by the second term of (20) it follows from
(18) and (19) that h m
_
x
2
,2 and Jh,Ji n}
_
x}
n+1
,
which is precisely the energy lost per unit time by the
object through frictional forces from viscous drag.
Example: coupled harmonic oscillators.Return to the
rst example of a harmonic oscillator q coupled to another
oscillator Q to show that the new framework gives the
correct physical description for the open dynamics of q
itself. Assume initial conditions q(i
|
) q
|
, _ q(i
|
)
|
,
Q(i
|
) Q
|
, and
_
Q(i
|
) V
|
. The total system is closed
implying that K 0 and the usual action is given by (1).
Doubling the degrees of freedom, the new action is con-
structed as in (5) but with K 0. The effective action for
the open dynamics of the q oscillator subsystem itself
is obtained by integrating out the Q
variables, which
satisfy (8), M
Q
+M
2
Q
(i)
M
Z
i
]
i
|
Ji
/
G
adv
(i i
/
)q
(i
/
), (22)
where Q
(h)
(i) Q
|
cos(i i
|
) +V
|
,sin(i i
|
) is
the homogeneous solution. The +variable evolves forward
in time and satises the initial conditions while the
variable evolves backward in time because of the equal-
ity condition at the nal time. This is a general feature of
the variables.
Substituting these solutions into the action yields the
effective action
S
eff
jq
|
Z
i
]
i
|
Ji
m( _ q
+
_ q
w
2
q
+
q
) +q
Q
(h)
+
2
M
Z
i
]
i
|
Ji
/
q
(i)G
ret
(i i
/
)q
+
(i
/
)
. (23)
The factor q
(i)q
+
(i
/
) in the last term is not symmetric in
i -i
/
and couples to the full retarded Green function as
opposed to just its time-symmetric piece as in (4).
Applying (11) to (23) gives the equation of motion
m q+mw
2
q
2
M
Z
i
]
i
|
Ji
/
G
ret
(i i
/
)q(i
/
) +Q
(h)
(i) (24)
in the physical limit. Now, the Green function in (24) is the
retarded one, G
ret
(i i
/
) 0(i i
/
),sin(i i
/
), and
solutions to (24) evolve causally from initial data.
Generalizing to N oscillators, Q -Q
n
N
n1
, it is
straightforward to show that the effective Lagrangian,
from S
eff
R
Ji
eff
, is
eff
(q
, _ q
) m( _ q
_ q
+
w
2
q
q
+
) +q
E(i)
+
Z
i
i
|
Ji
/
q
(i)(i i
/
)q
+
(i
/
). (25)
Here, E(i)
P
N
n1
n
Q
(h)
n
(i) acts like an external force
and (i i
/
)
P
N
n1
2
n
,(M
n
n
) sin
n
(i i
/
), where a
quantity with a subscript n is associated with Q
n
. The
last two terms in (25) constitute an effective nonconserva-
tive potential K
eff
for the open subsystem that is nonlocal
in time and history dependent.
From (19), the energy function evolves as
Jh
Ji
_ qE(i) + _ q
Z
i
i
|
Ji
/
(i i
/
)q(i
/
), (26)
where h m( _ q
2
+w
2
q
2
),2 is the energy of the oscillator
from (18). To see a familiar dissipation, choose trivial
initial data for the Q
n
so that E(i) 0 and take each
M
n
to be a constant, M. The coupling strengths
n