Beruflich Dokumente
Kultur Dokumente
Editor
G. J. Toomer
Advisory Board
Herman H. Goldstine
A History of the
Calculus of Variations
from the 17th through
the 19th Century
Springer-Verlag
New York Heidelberg Berlin
HERMAN H. GOLDSTINE
With 66 illustrations
9 8 7 6 5 432 I
ISBN -13:978-1-4613-8108-2
e-ISBN -13 :978-1-4613-8106-8
DOl: 10.1007/978-1-4613-8106-8
Preface
The calculus of variations is a subject whose beginning can be precisely
dated. It might be said to begin at the moment that Euler coined the name
calculus of variations but this is, of course, not the true moment of
inception of the subject. It would not have been unreasonable if I had gone
back to the set of isoperimetric problems considered by Greek mathematicians such as Zenodorus (c. 200 B.C.) and preserved by Pappus (c. 300
A.D.). I have not done this since these problems were solved by geometric
means. Instead I have arbitrarily chosen to begin with Fermat's elegant
principle of least time. He used this principle in 1662 to show how a light
ray was refracted at the interface between two optical media of different
densities. This analysis of Fermat seems to me especially appropriate as a
starting point: He used the methods of the calculus to minimize the time of
passage cif a light ray through the two media, and his method was adapted
by John Bernoulli to solve the brachystochrone problem.
There have been several other histories of the subject, but they are now
hopelessly archaic. One by Robert Woodhouse appeared in 1810 and
another by Isaac Todhunter in 1861. In addition to these there are good
historical accounts by Lecat in the Encyclopedie des sciences mathematiques,
II 31, 1913 and 1916, by Ernesto Pascal in his 1897 book on the calculus of
variations as well as accounts by Kneser, and Zermelo and Hahn in the
Encyclopiidie der mathematischen Wissenschaften, II A8, 1900 and II A8a,
1904. The reader might also wish to look at Lecat;s excellent bibliography
from the beginnings of the subject through 1913.
In light of the fact that these previous accounts are either out-of-date or
inadequate I have in this volume attempted to select those papers and
authors whose works have played key roles in the classical calculus of
variations as we understand the subject today. In doing this I have
excluded some otherwise excellent papers and very many that seemed to
me to be pedestrian. Other readers may not agree with this selection
process. I must bear sole responsibility.
It is with pleasure that I acknowledge my deep appreciation to the
International Business Machines Corporation, of which I have the honor
of being a Fellow, and especially to Dr. Ralph E. Gomory, Vice-President
for Research, for the encouragement and support that have made this book
possible. I would like to express my gratitude for the patience and
helpfulness of Professors Marshall Clagett and Harold Cherniss of the
vii
viii
Preface
Institute for Advanced Study in discussing with me many points that arose
in the course of writing this book. It would be remiss of me if I did not also
acknowledge the helpfulness of the Institute for Advanced Study in sustaining me intellectually and for providing me with its facilities. My
colleagues both at IBM and at the Institute for Advanced Study have
served me as examplars of the highest standards of science and scholarship. In particular I wish to single out Professor Otto Neugebauer who has
been a great inspiration to me.
In closing I extend my thanks to Mrs. Irene Gaskill, my secretary, for
her tireless, good-natured and patient help in preparing the manuscript and
in reading proofs, and to Springer-Verlag for its excellent help and for the
high quality of its work on this volume.
Fall 1980
HERMAN H. GOLDSTINE
Table of Contents
Introduction
1.
xiii
2. Euler
2.1.
2.2.
2.3.
2.4.
2.5.
2.6.
2.7.
2.8.
3.2.
3.3.
3.4.
3.5.
3.6.
38
44
47
50
58
67
Introduction . . . .
The Simplest Problems .
More General Problems
Invariance Questions
Isoperimetric Problems .
Isoperimetric Problems, Continuation .
The Principle of Least Action
Maupertuis on Least Action . . . .
07
30
32
34
35
67
68
73
84
92
99
101
.108
110
110
115
.129
.139
145
.148
Table of Contents
5.
Excursus
Jacobi's Paper of 1836
Excursus on Planetary Motion
V.-A. Lebesgue's Proof .
Hamilton-Jacobi Theory
Hesse's Commentary.
Weierstrass
5.1.
5.2.
5.3.
5.4.
5.5.
5.6.
5.7.
5.8.
5.9.
5.10.
5.11.
5.12.
5.13.
Weierstrass's Lectures
The Formulation of the Parametric Problem
The Second Variation
Conjugate Points .
Necessary Conditions and Sufficient Conditions
Geometrical Considerations of Conjugate Points
The Weierstrass Condition
Sufficiency Arguments .
The Isoperimetric Problem
Sufficient Conditions
Scheeffer's Results
Schwarz's Proof of the Jacobi Condition.
,
Osgood's Summary
151
. 151
.156
.164
.168
.176
.186
190
.190
. 191
.193
.197
.201
.204
.210
.214
.219
.223
.237
.245
.246
250
.250
.250
.257
.269
.282
.287
.300
314
.314
.317
.322
.330
.338
.346
.357
.362
.371
Table of Contents
7.10. Bolza and the Problem of Bolza .
7.11. Caratheodory's Method
7.12. Hahn on Abnormality
xi
.373
.383
.387
Bibliography
391
Index
401
Introduction
The ancients were certainly well aware of isoperimetric problems, and
their results were preserved for us by Pappus (c. 300 A.D.). Their methods
were, of course, geometrical, and I have accordingly ignored this work
preferring to limit the scope of this book to problems solved analytically. I
have also given almost no account of Galileo's analysis of the brachystochrone and the heavy chain problems for more or less the same reason.
Instead this volume begins with Fermat's analysis of the passage of a light
ray from one optical medium to another. He postulated that light moves in
such a way that it will traverse the media in the least possible time.
Fermat's analysis does make use of the calculus and hence seemed to
me to be a reasonable starting point for this book (see Section 1.1). It is
also important because John Bernoulli took Fermat's ideas and used them
to solve the brachystochrone problem by an adroit use of Fermat's
principle (see Section 1.7): He divided the space between the two given
points into narrow horizontal strata and supposed that the descending
particle moved uniformly in each stratum and was refracted at each
interface as if it were a quantum of light so that the total time of descent
was a minimum.
After Fermat's papers of 1662 there was a lull in the subject until 1685
when Newton solved what can be viewed as the first really deep problem
of the calculus of variations. In the course of his researches he investigated
the motions of bodies moving through a fluid of "a rare medium of very
small quiescent particles. . . ." He led up to the general problem of the
shape of the body encountering the least resistance upon its nose by first
calculating the resistances encountered by several special ones. He thus
discussed a hemisphere, a cylinder, and the frustum of a cone. After this he
was able to proceed to the general case (see Section 1.2). This result of
Newton appeared in his Principia without any details being given. In 1691
Huygens studied this problem of Newton without, however, getting all the
way through. He was, however, more successful than anyone else.
David Gregory persuaded Newton in 1694 to write an analysis of the
problem, which made Newton's ideas available to the mathematical community. Newton's technique is very interesting and was duplicated by
subsequent authors until Lagrange developed an elegant and superior
analytical apparatus. This study did not exhaust Newton's interest in the
problem: He proceeded to study a similar problem in which one end-point
xiii
xiv
Introduction
Introduction
xv
of this he put Legendre's transformation on a rigorous basis and discovered the fundamental concept of conjugate points (see Section 4.2). None
of Jacobi's results was proved in his paper. As a result a large number of
commentaries were published, mainly to establish an elegant result of his
on exact differentials. All that work on exact differentials was rendered
obsolete by Clebsch and then Mayer. Jacobi viewed the problem based on
the integrandf(x, y, y', y", ... ,y(n as a more general problem than that
based on f(x, y, y'), which in a sense it is. He did not, however, see how
the former problem could be reduced to the latter by the addition of very
simple side-conditions.
At about the same time that Jacobi wrote his classic paper just mentioned, Hamilton published two remarkable papers on mechanics. In these
papers he showed that the motion of a particle in space, acted on by
various forces, could be comprehended in a single function which satisfies
two partial differential equations of the first order. In 1838 Jacobi criticised
this result, pointing out that only one partial differential equation, the
Hamilton-Jacobi equation, was needed. Jacobi then showed, in effect, the
converse of Hamilton's result (see Section 4.5). This work of Hamilton and
Jacobi underlies some of the most profound and elegant results not only of
the calculus of variations but also of mechanics, both classical and modern. Indeed it is really here that we see the deep significance of the calculus
of variations and the reason why it is basic to the physical as well as the
mathematical sciences.
At this point there was a lull in the theory for about twenty years. The
calculus of variations as a subject was now at a fork in its intellectual path:
The entire subject was in need of a rigorous re-analysis; the theory of weak
and strong extrema was not yet understood or indeed even articulated; and
general problems had yet to be enunciated. Two quite different directions
were now taken by the nineteenth century analysts: Weierstrass went back
to first principles and not only placed the subject on a rigorous basis using
the techniques of complex-variable theory but, perhaps more importantly,
discovered the Weierstrass condition, fields of extremals, and sufficient
conditions for weak and strong minima. Clebsch tentatively and A. Mayer
decisively moved on quite another route: They succeeded in establishing
the usual conditions for ever more general classes of problems.
In Chapter 5 we consider in some detail Weierstrass's researches on the
calculus of variations. This work was elegantly organized by Rothe in a
volume of Weierstrass's collected works and constituted a real milestone in
the subject. Here we find yet another example of Weierstrass's rigor,
clarity, and depth of understanding. He mainly studied only the simplest
pro.blem in parametric form; but for it he found his necessary condition,
discovered fields of extremals, and made sufficiency proofs. He made
several of these sufficiency pro.ofs using not only series expansions but also.
fields. In about this same period-1886-Scheeffer also gave a sufficiency
proof for a weak minimum in ignorance of Weierstrass's results.
xvi
Introduction
Introduction
xvii
Hilbert also gave the first completely rigorous proof of the multiplier
rule for the problem of Mayer in 1906, and showed the relation between
his independence theorem and the Hamilton-Jacobi theory. In the course
of this discussion he formulated the modern definition of a field (see
Section 7.3).
A new approach to the calculus of variations was provided by Kneser in
a paper in 1898 and in two editions of his Lehrbuch. He started from an
elegant result of Gauss on geodesics: Given a curve on a surface draw
through each point of the curve the geodesic through it and mark off a
fixed length on each. The end-points of these arcs form a curve that cuts
the geodesics orthogonally and conversely. Kneser generalized this result
by his discovery of the more general concept, transversality, which includes orthogonality as a special case. Out of this notion he was able to
establish the envelope theorem, first discovered by Zermelo in 1894. Using
the result, Kneser was able to give a geometrical interpretation of conjugate points. This interpretation is not different ftom Jacobi's, but his
analysis deepens and enriches our understanding of what is meant.
Kneser proceeded in his Lehrbuch to examine variable end-point problems, and as a result he discovered focal points. His work was followed by
Bliss and Mason who continued the discussion in depth. He also introduced what he termed normal coordinates of a field in an elegant way
which permitted both himself and Bolza to give sufficiency proofs by
showing that the Weierstrass &-function is invariant under coordinate
transformation (see Section 7.6).
In the course of extending results by Bliss and Mason in 1908 and 1910,
Hahn in 1911 established a theorem of great importance for the treatment
of sufficiency theorems for variable end-point problems (see Section 7.7).
At about the same time W. Cairns, Mason and Richardson, carrying out
Hilbert's ideas, introduced boundary-value methods into studies of the
second variation viewed as a quadratic functional (see Section 7.8). In the
course of this work Richardson gave several interesting oscillation theorems. As mentioned earlier Hilbert investigated under what conditions a
nonnegative integral of the calculus of variations attains its lowest bound
over a given class of arcs (see Section 7.9).
In 1913 Bolza formulated what Bliss called the problem of Bolza as a
generalization of the problems of Lagrange and Mayer. (Bliss showed that
in fact all three problems are equivalent.) He proceeded to give a
seminal and penetrating first discussion of the problem (see Section 7.10).
In 1904 Caratheodory took up John Bernoulli's second method for
handling the brachystochrone problem. This enabled him to define at each
point what he called the direction of steepest descent and the notion that a
family of curves are geodesically equivalent. With the help of these
concepts Caratheodory developed the usual results of the calculus of
variations with considerable elegance.
Upon this note the present volume ends. It does not contain any
xviii
Introduction
Bernoullis
assumed that light moved faster in a denser medium (cf. Sabra, TaL, p.
105n). It was therefore a source of astonishment to Fermat that he reached
by precise means apparently the same conclusion about refraction that
Descartes had.
The main reason I have started a history of the calculus of variations at
this point is that Fermat's work seemed to be clearly the first real
contribution to the field and certainly served as the inspiration for the
imaginative proposal and solution of the brachystochrone problem by
John Bernoulli in 1696/97. 4 Indeed, Bernoulli's solution depends directly
on Fermat's principle. As we shall see in Section 1.4, Bernoulli replaced his
material particle moving under the action of gravity by a ray of light
moving through a series of optically transparent media of properly varying
densities.
It might be argued with some reason that Galileo's treatment of the
brachystochrone problem should be taken as the proper starting point, but
I felt that since Galileo is incorrect in his argument and does not make use
of the calculus, it is more fair to choose Fermat as the legitimate sire of the
field.
In Figure 1.1 Fermat considers a circle ACBI with center at D and
made up of two media ACB and AlB of different optical densities. He
supposes that a ray of light starts from the point C in the upper and rarer
medium and moves along the broken ray cal across the interface ADB to
the point I in the denser medium. As the reader can see, there is also an
external line segment M in the figure. Fermat assumes, in effect, that the
length of M is a measure of the resistance of light in the rarer medium and
that the length of DF is a measure of the resistance in the denser one. At
least he assumes that they are proportional to these with a common factor
of proportionality.5 Moreover, he uses the term "resistance" to mean the
Figure 1.1
4John Bernoulli, 00, Vol. I, pp. 166ft and 187-193.
sIn his synthesis he sharpens up this assumption by locating M in the figure itself and
specifying more precisely how M is related to the ray in the denser medium.
+ 10
= M~N2 + E2 -
2BE
+ B~N2 + E2 + 2AE
+ E) -
(A
+ E)2=
BA - A2
+ BE -
2AE - E2
(1.1)
and so
BE-2AE+ E2
or
B-2A
+ E.
(1.2)
Thus
B=2A
(1.3)
Figure 1.2
Now in the case of the time for the light ray, Fermat sets
CO . M
+ 10 . B = N
+N
.M
. B,
does the algebraic manipulation, and finds the elegant result A = M; that
is, the segment DR is in fact the segment M. By his original assumption
that the denser the medium, the more slowly light proceeds, he then has
the fundamental relation
DF = const. > I
DR
'
DF
DH
= const. > 1.
(1.4)
(1.4')
and goes on to show that a ray moving from point M in the upper medium
through the center N of the circle to H in the lower one will arrive in the
least time. The situation is as shown in Figure 1.2. Fermat explicitly
assumes that the velocity in the upper (rarer) medium is the faster; thus the
ratio c in equation (1.4') above is greater than one. In Figure 1.2, D is the
foot of the perpendicular from M onto the interface ANB between the
media, and S is the foot of the perpendicular from H onto that line. The
point R is an arbitrary point on the interface; and I, P are points on
MN, MR, respectively, so chosen that the relations
DN
MR
MN
c = NS = RP = NI
(1.5)
obtain.
Fermat next chooses two more points 0 and Von the line RH so that
(1.6)
He then wishes to show that the time for a ray to move on the broken line
MNH is less than on the line MRH. His proof is quite simple and direct.
For notational purposes, let tXY be the time required by light to move
along a line segment XY in a homogeneous medium. Then since velocity in
such a medium varies directly with distance and inversely with time and
since the ratio of the velocity in the upper to that in the lower medium is c
[by (1.5)], Fermat has
tMN
tNH
MN I
NH . ~
NI
NH '
since the velocities along NH and RH are equal, and now he needs to
show that RP + RH > NI + NH. To see why this is so, recall the relations
(1.6). We know that DN < MN and NS < DN by (1.4') and thus
NO<RN,
NV< NO.
(1.7)
> MN+
NO.
MN
NI
NO
NV
--=--=--=
MN+NO
MR
NI+ NV = RP ,
I believe that I have often said both to M. de la Chambre and to you that
I do not pretend, nor have I ever pretended to be in the inner confidence
of Nature. She has obscure and hidden ways which I have never undertaken to penetrate. I would have only offered her a little geometrical aid
on the subject of refraction, should she have been in need of it. But since
you assure me, Sir, that she can manage her affairs without it, and that
she is content to follow tpe way that has been prescribed to her by M.
Descartes, I willingly hand over to you my alleged conquest of physics;
and I am satisfied that you allow me to keep my geometrical problempure and in abstracto, by means of which one can find the path of a thing
moving through two different media and seeking to complete its movement as soon as it can.
II This follows by expressing the side MR in triangle MNR in terms of the other sides. Thus
he has, by the first of the ratios (1.6) and of the inequalities (1.7),
Figure 1.3
(= CB cos2 0).
(1.8)
10
He then remarks that "bH will be to bE as the effect of the particle upon
the globe to the effect of the particle upon the cylinder. And therefore the
solid which is formed by all the right lines bH will be to the solid formed
by all the right lines bE as the effect of all the particles upon the globe to
the effect of all the particles upon the cylinder."
Newton then needs to calculate the volumes of these two solids. To this
end he notes that the solid formed by "the right lines bH" is "a paraboloid
whose vertex is C, its axis CA, and latus rectum CA.,,21 The other solid is
"a cylinder circumscribing the paraboloid." That the former solid is as
described can be seen from equation (1.8) by noting that
1o J.'2cos
277dcp
'I
3 (J.
Y ds
= 2."
J cos2
.~'2
(J. Y
dy.
(1.9)
)'1
and if we choose units properly. we may regard this as the value of the
resistance.
21 Newton, PRIN, p. 332.
22Newton, PAPERS, pp. 469n-47On; Heath, ARCH, pp. 129-131.
11
It is clear that this expression for the resistance is valid for surfaces of
revolution in general and not just for a sphere of radius c. In that case,
however, we have y, = 0, Y2 = c, and
2
y2
cos2 9 = = I - c2
c2 '
and hence R is 'TTc 2 /2; in the case of the cylinder cos2 9 = I and R is 'TTC 2
In his Scholium on pp. 333-334 Newton goes forward with his analysis.
He has just considered the bow resistance of a ship or a projectile when it
is blunt or hemispherical. He now goes to the case where it is a frustum of
a right-circular cone and finds, for a given altitude and base, the best
shape. From this he proceeds to the general question of the best-shaped
bow, as we shall see.
In Figure 1.4 the circle CEBH with radius b is the base of the cone, S is
its vertex, and the altitude of the frustum is OD = a. The radius of the
circular cap FD = c, and Q is the midpoint of the altitude OD. For
convenience, let z = DS. Then by the value of R in (1.9), the resistance of
the curved surface of the frustum is
R. = 2'TT
cos2 9 = - - - - b 2 + (a + Z)2
The total resistance is, of course, the sum of R. and R2 = 'TTC 2, the
resistance of the cap of the frustum, since the frustum moves in the
direction from 0 to D; that is,
R = RI
'TT'
R2 = 'TT(b 2cos2 9
b 4 +c 2(a+zi
b 2 +(a+zi
+ c 2sin2 6)
'TTb 2
Figure 1.4
b +z
b 2 +(a+z)2
12
since b/e = (a + z)/z. From this it is clear that the minimum value with
respect to z occurs at
z = - -a + ~a2
- +b 2
2
(1.10)
'
+ FD 2 . CF 2
CF 2
OC 2 . (OC - FD)2
+ FD 2 . OD 2
CF 2
y=
a 2b 2 - 2a 2bx + a 2x 2 + b 2x 2
a2 + x 2
+ a 2x 2 + b 2x 2 = ay + xy
13
E
Figure 1.5
-aa
+ a~aa + 4bb
2b
Whence, by the way, since the angle CSB (in Figure 1.4) is always
acute, the consequence is: if the solid ADBE (in Figure 1.5) be generated
by rotating the elliptical or oval figure ADBE round its axis AB, and the
generating figure be touched at the points F, B and I by three straight
lines FG, GH, HI, where GH is restricted to be perpendicular to the axis
/'..
/'..
at the contact point B, ",hile FG, HI contain with GH angles FGB, BHI
of 135, then the solid generated by the rotation of the figure ADFGHIE
round the axis will be resisted less than the previous solid, provided that
each moves forward along the line of its axis AB, the end-point B leading
in each case. This proposition will, I reckon, be not without application in
the building of ships.
14
mM
Figure 1.6
he showed this, but Bolza and Whiteside have each given possible reconstructions which are almost surely quite similar to Newton's method (see
Newton, PAPERS, Vol. VI, pp. 462nff).
In Figure 1.6 (which is Whiteside's), let FG be inclined at 45 to the
x-axis which extends in the direction from B toward C; MN is an ordinate
of the curve FB at an arbitrary point M, mn is the corresponding ordinate
at m, a point near to M, and no is drawn parallel to FG, that is, inclined at
135 to the axis CB. Since, as we just remarked, the cont; frustum having
this slope for its generator experiences minimal resistance among cone
frustums with common bases, the resistance felt by the frustum noMm is
less than that felt by nNMm, provided that m is near enough to M, as we
can see by continuity considerations. Thus we see that the resistance felt
by the line Nn rotated about CB is greater than that felt by the figure
generated by rotating the broken line segment No + on about CB. Since
oN is dy - dx, this latter resistance is expressible as 2'ITy(dy - dx) + 'lTy dx
= 2'ITydy - 'lTydx, since x + dx = Bm, y + dy = mn, and dx = Mm = ow
= wn. It follows directly by integration that the resistance R experienced
by rotating the arc BF is greater than
(This integral above, the area under the curve BF, is less than the area
under the line FG.) But since FG is inclined at 135 to CB, Ff - BG = Bf,
and hence
t (Ff2 -
BG 2)
IS
which is the area of the trapezoid BG Ff encompassing the area under Bf.
Thus the total resistance R experienced by the solid of revolution generated by BF is greater than that of the frustum generated by FGBf, which is
what Newton asserted.
Bolza summarizes the situation in an exercise in his Vorlesungen uber
Variationsrechnung: "If PQ is an arc whose slope is always [numerically]
> I, then the surface generated by rotating it about the x-axis will
experience greater resistance than will the surface generated by rotating
the broken line segment PRQ, where PR is parallel to the y-axi:> and where
RQ has slope [numerically] = 1.,,27 Notice that the broken segment does
not necessarily generate the surface of least resistance. In fact, we show
below that it does not.
Newton then proceeded to the general theorem. Whiteside's translation
of the text refers to Figure 1.5 and is this: 28
But should the figure DNFG be so shaped that, if from any point N in it
the perpendicular NM be let fall to the axis AB, and the straight line GP
be drawn parallel to the tangent to the figure at N, cutting the axis
(extended) in P, there shall be MN to GP as Gp3 to 4BP X GB 2 , then
will the solid described by this figure's rotation round the axis A B be
resisted less than all others of the same length and breadth.
CD y
JBG
cos 0. dy
CD y(
JBG
dy
ds
)2 dy= r
CD
y dy
JBG 1+ (dxjdy)2
= JI2
yy3 dt ,
II.e + j2
(1.11)
where 0 is the angle the normal makes at the point (x, y) with the x-axis, s
measures arc-length, and X, yare derivatives with respect to a parameter t.
In Newton's Figure 1.8 N is an arbitrary point (x, y) on the curve so that
MN = y, CM = x, and GR has been drawn parallel to the tangent to the
curve at N. Since GR is parallel to the tangent at N,
~! = sin GRB =
cos 0;
27Bolza, VOR, p. 455, exercise 43. He correctly attributes the result to Newton. In his next
exercise he gives the interesting datum that the surface of least resistance of given base and
height experiences a resistance of 0.3748 times that of a cylinder of the same dimensions.
28Newton, SOL, pp. 463-465. This is the heart of Newton's contribution. It is his determination of a first necessary condition for a minimum in this problem. As we shall see, it is a first
integral of the Euler differential equation. (Newton's Figure 1.5 is not quite right, since it
portrays the minimizing arc as made of DNF and the line segments FG and GB. What he
clearly intended is to have the configuration in Figure 1.7, depending upon the slope of the
minimizing curve at G. (See Bolza, VOR, p. 414.)
16
E
Figure 1.7
GR
= Yo'
to
yq
(I
+ q2i
Yo
(1.12)
4'
(]
Figure 1.8
17
eD
BG
fey, x, x') dy
a minimum. (Note that the integrand function does not contain x explicitly.) The parametric form of the integrand function F(x, X, y, j) is
yj3/(X 2 + j2), and the problem in that form is to find in the class of
admissible arcs, as represented parametrically above, one which renders
12
F(x, X, y, j)dt
1\
and that the Euler equations for the non parametric and parametric forms
of the problem are, respectively,
!i(~)=!i[
dy
ax'
dy
-2x'y
(I + X'2)2
1=0
J
(1.12')
and
( 1.12")
(Note in (1.12') that the variable y is the independent one, and not x.) If q
is taken to be - x' = - dx / dy, then relation (1.12') becomes, on integration,
yq
( 1.12'")
where (xo' Yo) is some fixed point on the minimizing arc. (Notice that this
q is the slope of the normal. It is nonnegative for oval curves such as those
Newton discusses.) If the slope of the curve is - I at this point, then
qo = + I, and (1.12'") becomes Newton's relation (1.12).
We show below that there is always such a point (xo' Yo) on a minimizing arc. Before doing this, let us first look at some properties of minimizing
arcs. We see that curves satisfying Newton's relation (1.12) include both
horizontal and vertical line segments. It is, however, clear that lines with
finite and nonzero slope cannot satisfy the relation (1.12') or (1.12"). Thus
frustums of cones such as that generated in Figure 1.5 by rotating FGB
18
Y=
a(1
+ q2)2
q
and
~ =-q
tlT/(q),
Z= -
(1.13)
a( -
~ + 2q + 3q3)
-a(q2 +
~q4 -Iogq)
+ b ==
-a~(q)
+ b.
(1.13')
(3q2 - 1)(q2 + I)
dq
p=+-------
q2
VOR. p. 411.
19
- dxl dy = - x',
(3qt + l)y (3qi + l)y
where qL and qR are the values of q on the left and right sides of a corner,
respectively. From this we see that at a corner such as the point G' in
Figure 1.7 where we have qR = 0,
(3qt + l)y
(qt + 1)2
=y;
-- =
BG X Bb
--=....:::.....:....:....=MN X Mm
(1.14)
20
may see by recalling that the resistance is proportional to y cos 2 (), where y
is the altitude BG or MN and () is the angle made with the line CB by the
normal to the curve. Hence the resistances are proportional to BG cos 2 (}
= BG sin 2 gGh = BG hg 2/ Gg 2 and to MN sin 2 nNo = MN on 2/ Nn 2
= MN hg 2/ Nn 2 since on = hg. The resistance of the two figures together
is then proportional to
BG
MN
q'
+ x)
BG X Bb
MN X Mrn'
21
Newton holds at each point N on the minimizing curve. This relation gives
us the condition
--::....!-.-- =
(1 + q2)2
(1.15)
22
l;
,.t
t.'._.";:
~ ~
/c"","-c'
-
A.
I'~.:-,<..,
~',.'
-: . '.-
~~::~:::.:.;; .~: -:::.~ .'L -1-"",- of c'"- '"~ c'.,.. ". .. ;S"'~_
I.':~:\.!fo .,~
_.
:,;
i"~.l.
-- ..... J.AO--:.;.~1ri-:.~~
-t
Figure 1.9
B
A
1------==""---
Figure 1.10
23
D
A 1----.:::=sC"""'--;-=-_- __
a
cr - - -
- -- --i
I
I
I
I
E~------J-----------------_iF
C'
B'
Figure 1.11
aa
+ xx
aa
+ xx
I
- 20x
+ 00
) X BE=
=
2xo - 00
a4
+ 2aaxx + X4
2,0
d alo = -,
pp
xBE
(1.16)
where p is a constant. 33 (We will explain why Newton took this difference
to be a constant in a moment.) From this it follows by properly neglecting
"infinitesimals" that
ppxy = a 4
+ 2aaxx + X4,
(a 2 + x 2 )
and
----=py.
X
33Note that the resistance on the surface formed by rotating BD, for example, is apart from a
proportionality factor,
2'17
BE
AE
'17(BE + AE)a 3
a2 + x 2
AB2
2
2
AD +AB
'17a 3(2BE - a)
a2 + x 2
2'17a 3 BE
a2 + x 2 .
A similar expression obtains for the resistance on the surface formed by rotating Be, except
that x is now replaced by x - o. Also note that 2, 0 means twice o.
24
a 2 + (x - 0)2
'11
a 2 + (~+ 0)2
where AD = x, BE = y, or
[_
1
a 2 +x 2
1
a 2 +(x-o)2
]y _[
a2+(~+0)2
i.e.,
2xy
aax~
aa , [
x
(1
+ x'
= const.,
which is relation (1.12). Notice that Newton does not yet show how he
determined the value of the constant above as he later did in the Principia.
So far Newton has confined himself to the simplest problem of the
calculus of variations: namely, to find among all admissible curves passing
through two fixed points the one that renders a given functional a
minimum. But he did more. In a fragment that was never published, but
which Whiteside has fortunately brought to light, Newton considers a
more complex problem, whose introduction into the literature is usually
ascriped to later mathematicians. The significance of this work of Newton
25
,
'-
'-
"
T'I
""
'-
'-
, "
, "
f-'H-'------l I
"-
"
M,
p
Figure 1.12
Y; + 2
r:'2 I:d~'2
( 1.17)
NTqq
BG
4NS X ST cub
26
Very possibly the theorem above represented Newton's final and precise
solution of the problem.]
First let us look at Newton's variable end-point problem as formulated
in (1.17). The transversality condition in the present case then says that at
the point the relation
(1.18)
+ X,2); i.e.,
(1.18')
qi = X~2 = 1.
The Euler equation for this problem is (1.12'), the same as for the earlier
one. Since Y = a(l + q2 I q, YI = 0 would imply that a = 0 and the minimizing curve would be the line CB, but this does not pass through point D;
x~ = 0 would again imply a = O. Thus X~2 must be 1; i.e., the curve
x = x(y) intersects the line x = XI at an angle of 45 or 135, as Newton
asserted.
In his next paragraph Newton takes up the question of finding a
parametric representation for the extremals in terms of the parameter
q = BEl BG. He considers that the problem is to locate the point N in
Figure 1.12 in terms of this quantity q. To do this, he draws GE parallel to
NT and notes that BSIGE = GE 3/(4BE X BG 2) since BS = MN, the
triangles NST and EGB are similar, and MN I BG = GE 4 /(4BE X BG 3);
he then lays off the point F on BS so that
It follows from this that either YI = 0, ql = x~ = 0, or
GF
BG+BE
(1.19)
27
that
= MN
= BS =
GE 4
4BE X BG 2
BG X (q2 + 1)2
4
q
= 0:
(q2
+ 1)2
q
(1.21 )
where 0: = BG/4. This is, of course, Newton's first integral of the Euler
equation (1.12'). Next, following Whiteside, we see how Newton found the
representation for x using the hyperbola described above.
With the help of the relation (1.20), it follows that
1
SN = GS + BG X (BG
+ FS) X AE - BG
BG 2
BE
-4- X log BA '
B~2 .
(BG
+ BE)(5BG 2 + BE2)
16BG 2
BG
16 (q + 1)(q2 + 5)
~ ( q + 1)( q2 + 5)
so that
BG + FS = BS - GF = 0:
(q2
+ 1)2
q
0:
'4 (q +
1)( q2 + 5).
- x = SN = (BS - BG) +
B~
_ BG X log BE
4
BG
=
0:(q2+1)2
q
- 40: + 0: (q - I)
[(q2+ 1)2
q
4" (q +
1)( q2 + 5)
-o:logq
(1.21')
(It should be noted that this representation puts the origin at B.)
It is not difficult to show that through any point (X2' Yl) with Yl > 0
there is one curve of the family of extremals (1.13), (1.13') which cuts the
28
has a unique solution q = if given the value (X2 - xl)lh < 0 and the value
of 1j in (1.13). To this end, note that 1j = 1j'q(q2 + 1)/(3q2 - 1) and hence
that
(3q2> 1).
and hence
29
~-~----~---------r-------.
L.
Figure 1.13
when rotated about the axis of motion through the fluid, experiences the
least resistance. His figure is the present Figure 1.13. Note that AE = BD
= a, EB = x, and DC = b. He allows B to move to K along the line EB.
He asserts that the resistance on the figure made up of A Band BC is
proportional to
(1.23)
and is to be a minimum; this he does by replacing x by x + e and b by
b - e and neglecting quadratic terms in e.
This seems not too different from Newton's 1685 solution if we think of
a, b, x as differentials. However, if this is the case, his formula (1.23) is not
correct. The resistance on the figure formed by the two segments is, apart
from a factor,
where y is the height of the point B above the axis of rotation, which is
presumably not shown in Huygens's drawing. Note that Huygens has a4 in
his numerator in (1.23); this may mean that he has chosen y = a, in which
case his analysis will not lead him to the correct Euler equation found by
Newton (see p. 23).
However, Huygens goes on in his 25 April 1691 notes to attempt to
establish the theorem of Bolza mentioned in footnote 27 above. His proof
is incomplete but shows clearly how he was proceeding. In fact, he
remarked in a marginal comment that no ship's prow of least resistance
can be formed out of a rotund or rounded curve. He clearly went quite far
but, I think, perhaps not as far as Whiteside feels he did.
30
Figure 1.14
31
Figure 1.15
32
could also have solved the problem. In fact, Newton did in an anonymous
paper in the January 1696 issue of the Philosophical Transactions, which
was reprinted in the Acta Eruditorum (see Section 1.5, below). The issue
also includes discussions of the problem by L'Hopital and Tschirnhaus.
In a letter to a M. Basnage, doctor of law and author of the History of
Works of SCientists,41 John Bernoulli discussed a number of aspects of the
problem and interestingly notes that "I proposed the problem of swiftest
descent in the Leipsic Acts, as being completely novel, not knowing that it
had been attempted previously by Galileo" (p. 194). Again (p. 199) he says
"M. Leibniz noted two remarkable things about Galileo: it is that this man
who was, without contradiction, the most clairvoyant person of his
times ... , lacking, however our new analysis, could conjecture that the
curve of the catenary is a parabola and that the curve of swiftest descent is
a circle,,42
Perhaps we should now turn to the actual solutions themselves to see
the state of the mathematical techniques of the time and then study the
wealth of problems that grew out of these beginnings.
XI)'
where g is the constant of gravity and the particle starts from the point A:
= (xI> YI) with the initial velocity VI' (We assume that the positive
x-axis points downward.) If the velocity is expressed as ds / dt with s
arc-length, the time T to fall from the point A: (xl,y\) to B: (x 2,Yz) is
given in parametric form by the functional relation
(x, y)
J2g T=
12
I)
~X'2 + y'2
~x -
XI
+k
dt,
( 1.24)
33
where x' = dx/dt,y' = dy/dt, and k = v;/2g. The problem is then to find
among all admissible curves joining the points A and B in a vertical plane
the one which renders this integral a minimum. Notice that the integrand
function
'X,2
+ y,2
--;V==':::::::::=:::-
VX -
XI
( 1.25)
+k
in this case does not contain y explicitly. In this sense it is quite analogous
to the one in Newton's problem. (Note that there the roles of x and yare
interchanged, but this is mere notation.)
One of the Euler equations for this problem is therefore
o=
h' =
J(x _
y'
XI
+ k)(X,2 + y,2)
1;
(1.26)
--;::::====~x'2 + y,2
= cos 7";
(1.26')
XI
(1.28)
that is, x' = - 2a sin 27". It then follows from (1.26') that y' = 4a cos 2 7"
or, equivalently,
y - YI +
( 1.28')
Relations (1.28) and (1.28') are thus a parametric form for the extremals,
with a and c as arbitrary parameters. (Note that 7" is not time but an angle.)
It is convenient to replace 7" by a related angle cp = 27" + 17, and these
equations reduce to the form
+ k = r(l- coscp),
y - YI + b = r(cp - sincp),
X -
XI
(1.28")
( 1.28"')
where rand b are arbitrary constants. These curves are the well-known
cycloids. Such a curve is generated by a point fixed on a wheel of radius r
that rolls along the line X = XI - k = a. This curve has been known and of
34
interest since about 1500. One of its most remarkable properties, that of
tautochronism, was discovered in 1673 by Huygens. It is known that for
very small oscillations the period of a simple pendulum moving on an arc
of a circle is nearly independent of its amplitude; that is, it is almost in
simple harmonic motion, but for a finite-sized oscillation this is no longer
so. However, Huygens showed that when the pendulum bob moves along
an arc of a cycloid, its period is precisely independent of the amplitude of
its excursion (i.e., the motion is exactly simple harmonic). In John Bernoulli's paper on his solution he aptly remarks that "one will be astounded
when I say that the Cycloid, the tautochrone of Huygens, is the sought
Brachystochrone."
To be found is the curve ADB along which a heavy particle will fall
under the action of gravity from any given point A to any given point B.
CPA
~
D
Solution
Through the given point A draw the horizontal line APCZ and on it
first describe any cycloid AQP cutting the line AB (produced if need be)
43 Andrade
(1958), p. 100.
PT, pp. 384-389. The paper appeared anonymously, presumably as Newton
desired. (It is not relevant but of note that Montagu secured for him the two posts he later
44 Newton,
35
in the point Q and second another cycloid ABC whose base and altitude
are, respectively, as AB is to AQ. This last cycloid passes through point B
and is the curve along which a heavy particle will descend most quickly
from the point A to the point B. QEI.
This concludes Newton's demonstration. However, it is reasonable that
he must have seen at once the decisive similarity between the integrand
functions for his surface of revolution problem and this one. That is, he
certainly must have been aware of the fact that both lack the derivative of
the dependent variable, and hence he certainly knew very quickly a
necessary condition for a minimum, namely, that afjay' = const. or,
equivalently, that (1.27) holds in this situation. As we saw earlier, once this
first-order differential equation (1.27) is obtained, it is easy to see that the
solution is a cycloid. It is, however, interesting to speculate on why
Newton gave no clue in his paper as to how he knew the curve was
cycloidal and why he bothered to write down only the rather simple
existence proof. It is also fascinating to read in Bernoulli's account to
Basnage that even though Newton sent in his paper anonymously he,
Bernoulli, knew ex ungue leonem-knew the lion from his touch. 45
~------""'B
Figure 1.16
45 John
36
= ~dy.
(1.29)
IX
(1.30)
dy : dx =~" x : ,2b - x
and then introduces a variable v so that v: b = dy : dx. From this he
reasons that y = Jv dx : b.)
It is remarkable that nowhere in his letter or its addendum does he
mention that the curve is a cycloid. He presumably knew in 1686 that his
differential equation defines the cycloid but makes no mention of it until
after Bernoulli sent him his solution on 21 July 1696.
Be that as it may, let us see how Leibniz deduced his differential
equation (1.29) and hence also (1.30).46 His analysis, unlike Newton's
rambles over unnecessary paths before getting to its goal. Let us, therefore,
extract merely the essence of his proof. In effect, he makes use of Galileo's
hypothesis that the time tAB of fall down the incline AB in Figure 1.16 is
given by the relation
1
2
1 2
AC
AB
= 2"
gtAB smABC =
2"
gtAB AB '
'21
gtAc"
Thus
(1.31)
Next in Figure 1.17 Leibniz has the points A, B, C, and E fixed bat wishes
to allow D to move on the line through E parallel to the base so that the
time of descent along the path made up of AD and DB will be a minimum.
To this end he first finds that
37
Figure 1.17
AD
= AE 'r,
IDB
DB
= EC n.
+ ED2,
DB2 = EC 2 + FB2
= EC 2 + (CB - ED)2.
FB
AD
ED
AD2
'--=1
DB
FB
DB2
--
( 1.32)
Given this result, Leibniz makes use of Figure 1.18, in which he has
plotted a parabola AE with vertex at A and axis AB so that a particle
which falls from A to B vertically arrives there in the time BE. The curve
A C is Leibniz's tachystoptote, and B l' B 2, and B3 are near to each other
and equally spaced. By relation (1.32), he has
I'
C'C 2
D(C2
(C(C 2)2
=t
C 2C 3
Figure 1.18
D 2C 3
.--(C 2C 3)2'
38
D I C2
D 2 C3
C C = IC2D2 C C
I
that is,
D I C2
FI E 2 C C
I
= F2 E 3
D2C3
( 1.32')
C2 C 3
since FIE2 = B2E2 - B\EI is the time for the particle to fall from BI to B 2,
i.e., from C\ to D I
Recall that the distances BIB2 = B2B3 = ... are "small" and hence that
ds = gl dt; thus, e.g., B\B2 = glAB, I B ,B2 ' From this it follows that
BIB2
BIEI
F\E2
~ABI
FIE2
1= - - = - - . - - = - - . - B2 B 3
B2E2
F2 E 3
~AB2
F2 E 3
ex AB;
or
dy =
+ dy2.
k'/x de
1.7. John Bernoulli's First Published Solution and Some Related Work
39
action, as we shall see later. 47 It is also clear that Bernoulli's method bears
no resemblance to Newton's for the least resistance problem and is
certainly not derived from Newton's.
As we shall see in Section 1.11, by 1697 John had already given a direct
analysis of the brachystochrone problem. The method he used, but with a
significant modification, provided Caratheodory with a general principle
for solving problems of the calculus of variations. This elegant work of
John's is never referred to in the modern literature apart from Caratheodory's works. It can be seen from John Bernoulli's two solutions of the
brachystochrone problem how remarkable a mathematician he was.
As we saw in Fermat's work, a ray of light moves in a medium of given
index of refraction or optical density on a path so determined that it
reaches the boundary in the shortest possible time; when it enters a
contiguous medium with a different index, it is refracted so that it again
moves in the shortest time. First Bernoulli divides up the space between
the points A and B into horizontal slices, in each of which he assigns an
index of refraction or density inversely proportional to the heavy particle's
velocity in that layer. Then at each interface he is able to make use of
Snell's law, which tells him exactly how the particle-ray bends as it crosses
that interface.
In Figure 1.19 he plots a curve AHE whose ordinate (notice that some
of the Continental mathematicians of this era use the term "ordinate" to
mean the horizontal distance from the vertical axis to a curve and usually
call ity; they generally use x for their vertical distances. James uses exactly
the opposite conventions) CH = t measures the velocity or the inverse of
Figure 1.19
47 It
is fortunate that most of the text of both John's and James's papers appears in English
translation by D. J. Struik, Source, pp. 391-399. (Struik has an omission on p. 395 that makes
his footnote 5 meaningless. I give the material correctly below.) In German translation they
appear fully in Ostwald, Klassiker, No. 46, pp. 1-20. The originals are in John Bernoulli, CR,
and James Bernoulli, JB. There are two enjoyable and excellent essays on the period by
Caratheodory [1937], [1945].
40
the density at the vertical distance A C = x from the starting point A of the
particle-ray. He takes his minimizing curve to be AMB and lets the
ordinate CM be y. Then he takes e to be a point on the vertical AC near to
C and sets Ce = dx; the horizontal through e cuts the curve AMB in m,
and he takes n to be the point directly below M on em. Then Mn = Ce
= dx, and he sets the arc-length Mm = dz.
He then notes that "the sines of the angles of inclination with respect to
the vertical are proportional to the velocities." It is clear that at the point C
the sine of the angle of refraction is dy / dz = mn / M m and that this must
be proportional to the velocity of the light at this point, i.e., to CH = I.
Hence
(1.33)
dY=dxJ a-x
x
{1.34}
Bernoulli now goes to show that this differential equation actually leads
48 John
1.7. John Bernoulli's First Published Solution and Some Related Work
41
1 adx - 2xdx
..fax - x 2
Bernoulli next goes on to show, as did Newton, that giv~n two points A
and B there always is a cycloid through those points. Bernoulli exclaimed
that "Nature tends always to proceed in the simplest way" when he noted
that his brachystochrone and Huygens's tautochrone are the same curve. 50
He continues and notes that when the velocity of the body is not proportional to the square root of the distance fallen but to the cube root, then
the brachystochrone is algebraic and the tautochrone transcendental;
"nevertheless if the velocity is directly proportional to the distance, both
will be algebraic: the one a circle, the other a straight line.,,51
In the first case Bernoulli has chosen t = ax 1/3, and his differential
relation (1.33) for the brachystochrone becomes
dy
dx
X I/ 3
---,=:::.::::;:;:;..fl _ X 2 / 3
y - c = -(2 + x 2/ 3)(1 _
x2/3)1/2
42
dy
dx
ci
t =
= _ _x__
Jl - x 2
acp =
~2a . ~ ~ g
. T,
/gT=/;icp
which, as we shall see shortly, is precisely the correct condition. He lays off
the arc GL on the circle and draws a line through the point L parallel to
G
Figure 1.20
1.7. John Bernoulli's First Published Solution and Some Related Work
43
AG; the point B where it cuts ABK is the desired point on the locus.
Stackel in a note on pp. 137ff of his translation gives what he terms "The
proof in the manner of Johann Bernoulli." In a given fixed time-interval
dT it is clear that a particle falling down AP will move so that
dX
IX
c:;-
=V2g dT,
where X = AP, by Galileo's hypothesis. Moreover, in the same timeinterval we know by (1.24) that the particle at B : (x, y) will move along
anyone of the cycloids (1.28"), (1.28"') with Xl - k = 0 and 2r = a so that
dT= _1_
Iii
A!...
li
dX
fx
or, equivalently, if u
= aX,
d =
-I --1!...
2
fu
li
fE
V ax -
_x_ dx
a- X
x2 dx,
+ ta-x dx.
~ax _ x2
He points out that this means the differential of U I / 2 is the sum of the
differentials of CM and LO in Figure 1.19. (In fact, CM = Y and his
statement about dy follows at once from the differential equation (1.30)
and the value of LO we calculated earlier (on p. 41). Thus it follows that
(AP . GK)I/2 = CM + LO = arc GL, as Bernoulli asserted. 52
To determine the analytical condition that a curve C : ~ = ~(a), ." = .,,(a)
cuts the family of cycloids through the point A : (Xl' Yl) at the points for
which the time of fall is a constant T, let us suppose that the points of
intersection are given by c:P = c:p(a). Then with the help of relations (1.28")
and (1.28"') with r = 2a, we can find the slope fir!/ d~ = (a1J/aa)/(a~/aa).
It is easy to see that
Cc:p - sinc:p) + aCI - cosC:P)C:Pa
d~= (l-cosc:p)+asinc:pc:pa
fir!
This must be the negative reciprocal of dy / dx, the slope of the cycloid.
This implies directly that
S2Curiously, Stroik says that Bernoulli noted that his synchrone is a cycloid. I cannot find this
assertion in the original text, nor is it true.
44
li
T = lEi fP,
and hence we see that the orthogonality of the curve C to the family of
cycloids is equivalent to the fact that the points of intersection correspond
to a constant time of descent.
It is easy to see that Bernoulli's synchrones are transversal to his
cycloids. In this same connection it is important to recall that John
Bernoulli discovered geodesics on surfaces and discussed this subject in a
1698 letter to Leibniz. He remarked there that "geodesics have always the
property of possessing osculating planes which cut the surface at right
angles, and that this property leads to the invention of a differential
equation for these curves" (see Caratheodory [1937], p. 96). Bernoulli
proposed to Euler in 1728 that he study this problem, whose solution Euler
presented to the Academy of St. Petersburg in 1732-it is dated November
1728, but Enestr6m showed it must have been done after April 1729.
8
Figure 1.21
53 James
Bernoulli, lB, pp. 768-778 or Stiicke1, KLASSIKER, No. 46, pp. 14-20.
45
Figure 1.22
tGD = tCL
tLD,
where, for instance, tCG means the time to descend from C to G along the
arc CG (this equality is good through terms that are to be retained in the
limit), and so
tCG - tCL = tLD - tGD.
(1.35)
Now to evaluate these times, Bernoulli states that
CE
CL
~v; + 2g
tCE
tCL
HC
--;::::=::::::C=G====-
~vT + 2g
( 1.36)
HC
tCE
tCG - tCL
46
he also notes that the triangles MLG and CEG are similar. This enables
him to conclude that MG / GL = EG / CG (s" / y' = y" / s') and hence that
CE
EG tCE
GL = CG (tCG - tCL) .
GD (tLD - tGD) .
Combining these ratios (EF = CE), he finds with the help of (1.35) that
EG tCE
Gr tEF
CG(tCG - tCL)
CG
GD(tLD - tGD) = GD .
~~~~
[To see this, recall relations (1.36), set VI = 0, and calculate by the same
means tEF= EF/(2gHE)I/2.] Bernoulli now concludes that
EG/fiiC
GJ/fHE
CG
GD .
= x, and
(1.37)
EG
ds= ~dv
IX
;n
I_x_
V
dy __
dx k2
'
which is our differential equation (1.27) for the cycloid with k 2 = 2a. the
diameter of the generating circle.
Bernoulli now says "one seeks to determine the curve whose elements of
arc-length are directly proportional to the elements of abscissa and inversely as the square root of the ordinates. I determined that this property
of Huygens's Isochrone ... is the same one that belongs to the cycloid,
well known to geometers. I go to show that this is so." He does this
geometrically with the help of Figure 1.23. The proof is not complex, nor
does it deepen our understanding of the calculus of variations, so I shall
omit it. He also gives a proof that there always is a cycloid passing through
the points A and B.
47
Figure 1.23
48
B~--;--------------iN
Figure 1.24
49
_ A __ .
Figure 1.25
=x
and BP = GF = Y and
is the solution curve. He goes on to discuss the cases n = 1/3, 1/2, 1,2 and
says that when n is a fraction with numerator 1, then for odd denominators
the solution is algebraic and for even ones the solution "can always be
constructed by the quadrature of a circle." He then generalizes to the case
where PZ = GH is an arbitrary function of PF. To do this, he sets
b=
J G~dX
y-
bdx
~a2 - b2
From this he attempts largely by rhetoric to cover the case where PZ is a
function of the corresponding arc BF. He gives no proof or hint of a proof
but merely asserts that this case is covered.
He also gives his solution to his brother's problem of finding among all
the cycloids through a given point and with the same horizontal base the
one down which a heavy particle will fall most quickly to a given vertical
line. lie repeats his assertion that he should receive the prize for the
solution of this problem. He remarks that the solution follows as a simple
corollary to the brachystochrone problem he solved in May 1697. The
solution curve, he says, is the cycloid generated by the circle whose
circumference is equal to twice the distance between the origin and the
given vertical line. But he indicates that he can go further. "If in place of
the vertical one supposes any line whatever or any curve, the problem does
not become more difficult; since it is clear by the nature of my Synchrone
58 A few months later in a subsequent paper John Bernoulli corrected this to read .. f call b the
ordinate GH" instead of the definition above. See John Bernoulli, Sf, p. 210 and PD, p. 217.
50
curve that the desired cycloid will always be that one which interesects the
given curve at right angles."59
James Bernoulli replied to this paper in a notice in the same journal,
Journal des Savans, where he wrote that he would undertake to examine his
brother's solution, indicate its contradictions, and give the correct solution. 6o There was then a considerable correspondence on this problem
which finally culminated in the publication of a note by James Bernoulli in
the Acta Eruditorum of June 1700 entitled "The solution of isoperimetrical
problems". This note was followed in May 170 I by a longer and fuller
paper, "Analysis of the large isoperimetrical problem," in the same journal. 61 Perhaps a remark by James about his brother's reasoning might be
worth repeating. He comments that his brother argues in this way: "Every
man is a stone; every pebble is a man; hence every pebble is a stone."62 It
does not seem to be relevant to discuss this series of papers in detail.
Suffice it to say that John's counter to James's papers of 1700 and 1701,
just mentioned, appeared in 1706. However, it had been sent sealed to the
French Academy on I February 1701 by Varignon with the request that it
not be opened until after James Bernoulli's solution should appear. For
whatever reasons, it did not appear in print until after James's death on 16
August 1705. There is also a long paper by John Bernoulli on the subject
which appeared in 1718.63 This latter paper by John is in part clearly
dependent on James's work in 1701. It is discussed in part in Section l.ll.
SI
____~~~~+-________~T
D
Figure 1.26
S2
and G to vary up or down along the vertical lines KF and LG. He sets
BX= I
FX=p
FY=m
GY=q
GZ=n
CZ=r
and also
RB=b
KF=f=b+p
LG=g= b+p+ q
BF=s
FG= t
GC= u
and thus
df=dp
dg= dp+ dq.
--=
rst - qsu
qsu - ptu
To derive this result, James Bernoulli notes that his hypotheses require
that
BX2 + FX2
= BF2, i.e., II + pp = ss
Fy2 + Gy2 = FG 2 ... mm + qq = It
GZ 2 +CZ 2 =GC 2 ...
BX + FY + GZ = const.
FX + GY + CZ = const.
BF + FG + GC = const.
nn+rr
=uu
I + m + n = const.
p + q + r = const.
s + t + u = const.
II.
III.
pdp = sds
qdq = tdt
rdr = udu
V.
VI.
dp + dq + dr = 0
ds + dt + du = 0,
( 1.38)
+ dq
rst - qsu
ptu - qsu
(1.38')
53
+ dz 2d 3x
dz 2d 2x
- dx(d 2x)2
+ 2dx(d 2x)2
1= dy
FY
= dx
BF=
s = dz
2
dx + dy2 = dz 2.
FX=
= m = dy
GY = q = dx
FB
= dz
GZ
+ d 2x
+ d 2z
= n = dy
GZ = r = dx
GC
= u = dz
+ 2d 2x + d 3x
+ 2d 2z + d 3z
+ dz 2 d 3x
qsu - ptu = dz 2 d 2x
+ 2dz d 2x d 2z
- dx dz d 2z - dx dz d 3z,
- dx dz d 2z - 2dx (d 2z)2
dx d 3x
+ (d 2x)2 =
dz d 3z
2
rst - qsu = dy2 [d 2X + d 3x _ dX(d X)2]
dz 2
and similarly
+ dz 2d 3x
dz 2d 2x
- dx(d 2x)2
+ 2dx(d 2x)
54
(1.39)
(1.40)
=0
(1.41 )
and
df
dg
--=
F'(f)
+ F"(f)df
F'(f)
as we see from (1.40) and (1.41). By his Theorem IV, this gives him the
relation
df
- -- =
dg
+ dz 2d 3x - dx(d 2x)2
dz2d2x + 2dx(d2x)2
dz 2d 2x
+ dh
= -----
hdz 3
a 2 dy
(He remarks that dy is constant and that the result follows by "the law of
homogeneity," i.e., by dimensional analysis.) If the functions h, x, and z
are viewed as functions of y, then his relation above states that
d 2x/dy2
h( dz / dy)3
= --;ii" .
55
'a
dz = yo
+ t2
dy;
a 2(dt/ dy)
at dt/ dx
t dt/ dx
h (a 2 + t 2)3/2
h (a 2 + t 2)3/2
(a 2 + t 2)3/2 dF/ dx
+ - - = ----------
- a2
a dx, he has
or
p
== F =
b (2
a
+t
2)1/2
and thus
dx-_- --I-
b-p
dy
~a 2 - (b -p )2
(Bernoulli considers in some detail the case b = a.) This is his defining
differential equation for x as a function of y. Recall that p = F is a given
function of x.
James Bernoulli's Problem II is to find among all curves of the same
length on the base A T passing through the fixed points A and D that one
ABD such that at the arbitrary point B the ordinate HP of the corresponding curve APV is a given function of the arc AB. To solve this problem, he
iqs
hpt
+ ipt
or
df
-- =
dg
dp
----=--dp
+ dq
iqs
hpt
+ ipt -
iqs
56
qdz
= -;::::==-=
fa2
+ q2
or dy =
(a-q)dz
-;::::.========fb 2 - 2aq
+ q2
III.
1dl + P dp = 0
m dm + q dq = 0
n dn + r dr = 0
IV.
V.
dl + dm + dn
dp + dq + dr
=0
= O.
(1.42)
or, equivalently,
dz 2[ dy 2 d 2x
Imr - Inq
+ dy 2d 3x + dx(d 2x)2]
= --=-------------=dy2
dz 2[ dy 2 d 2x - 2dx(d 2x)2]
Inq - mnp
65 James
= --=---------=-
dy2
(1.42')
57
(John Bernoulli also derives this result, as we shall see later, and calls this
his other fundamental equation.)
James now returns to his Figure 1.26 and, as before, lets HB = b,
KF = f, LG = g, and the arc AB = z; he further supposes that the weight
of z is q-recall that the chain is heavy-and notes that the weight of BF is
then dq, of FG is dq + d 2q, and of GC is dq + 2d 2q (he says that he omits
d 3q "since it is superfluous"). Then the moment of this mass with respect
to the A T axis is
(1.43)
He now allows F and G to vary on arcs of circles, as above, and observes
that under this assumption quantity (1.43) is a minimum, i.e.,
df(dq
+ dy 2 d 3x + dx(d 2x)
dy 2 d 2x
df
dq
--=
dy 2 d 2x - 2dx(d 2x)
This reduces to
d 2q
-=
+ 3dx(d 2x)
dy 2 d 3x
dy 2 d 2x
dq
d 2x
dqdy 3
I
adz 2 .
--=+--
t dt dz
= -adxd 2x = .::....:.:.::....::=-
adydy
so that
d 2x = -
tdldz2
dq y3
13 dq dz
= + - - = + -....::..-a4
a 2 dx
- adz 2
dq
_ a2
dt
12
- =+
dz
. -
dx
=+
a3
-=-~==::::;-
12~a2 _ 12
dy
dx
-=
dy
dz
-=-
58
(b-q)=+ aa
-1
1
-dz = -a = ---;:::::::::::::::====_I
Va2 + (b _ q)2
+ be + ee =
ag
+ gi + ie.
ke )
( ke
fb
( ab
- be
bg =
Ie )
be - ee
XCI.
(1.44)
He uses this relation in his discussion of his brother's Problems I and II.
Bernoulli's proof is very simple. He draws the "little lines" bm, gn, io,
and eh from the points b, g, i, and e perpendicular to the lines ag, be, ie,
66Bernou11i considers in particular the case b - O.
Bernoulli, RE. His argument on pp. 267-269, however, breaks new ground and in part
gave Caratheodory ([1904], pp. 71-79) the insight for his elegant approach to the calculus of
variations.
67 John
S9
Figure 1.27
and ge and bk and c/ parallel to aq. Then there are four pairs of similar
triangles: gmb and bfa, bng and ekb, eoi and ekb, ihe and ele; hence
gm = (fb X bg)/ ab, bn = (ke X bg)/ be, co = (ke X ei)/ be, and ih = (Ie X
ci)/ ee. But by hypothesis, ag - ab + gi - be + ie - ee = 0 and therefore
gm - bn - co + ih = O. (Notice that this last equality is correct up to terms
of the order of bm 2 , ) It then follows easily that Bernoulli's conclusion
(1.44) is correct. Bernoulli says that "without any calculation" he has the
same preliminary theorem "as my brother has demonstrated by a very long
calculation." He calls (1.44) a fundamental equation, as we noted in Section
1.lO, and he carefully notes that this relation is uniform in the sense that its
two members contain corresponding data displaced by one ordinate. Thus
the relation holds for any pair of ordinates in the right-hand member;
hence the left-hand member is a constant. (Recall how Newton used this
same device on p. 24, above.)
John Bernoulli next establishes his Lemma II. In terms of Figure 1.28,
he draws arcs of circles about the centers a and e and chooses g and i on
these arcs so that be = gi. He then draws "little lines" gn and io parallel to
e
Figure 1.28
60
(l.4S)
+ gn + 0;)2+ (kc
- co - bn)2
and so
bk X (gn
+ 0;) =
+ bn).
kc X (co
Since the triangles afb and bng are similar as are cle and co;, he finds
af
fb
bn
gn
cl
Ie
-=-
co
-=-
0;
lex co = kcxco
cl
bk
kcXbn
bk'
bk )
(bk
kc X gn = - kc
cl )
.
Ie X 01.
(1.45')
John Bernoulli now proceeds using his Lemma I to solve his brother's
Problem I. To solve this problem, he refers to Figure 1.27 and uses relation
(1.44) because of the isoperimetric condition. To find the condition that
arises from the fact that the area BMLET in Figure 1.29 is the greatest
possible, he asserts that
f(Pb)
+ f(Rc)
= f(Pg)
+ f(R;),
.~l.-.-1hr-1=----
Figure 1.29
(1.46)
61
NP
+ f(Rc)
PR = f(Pg)
NP
+ f(Ri)
PR
(1.47)
expresses the fact that the area under the segment of the maximizing arc
from N to R is an extremum and hence is not changed appreciably by a
small change of the ordinates. Therefore, he can write condition (1.47) in
the form f(Pg) - f(Pb) = f(Rc) - f(Ri), which establishes (1.46); and he
notes that through terms of the first order, this implies that
f'(Pb)
bg = f'(Pb)
X d.
(1.47')
fb _
( ab
kc )
bc
X _1_
f'(Pb)
= ( kc
bc
_ Ie )
ce
X _l_
f'(Rc) .
(1.48)
dy .
a'
-"----'-----'-'-- X - - = -
f'(x)
(XC)dx
Jaa - (X C)2
68 John
Bernoulli, RE, p. 244. Problems I and II are on pp. 102 and 106.
62
= f'(Bab)
mg
= f'(Babe)
ih.
(1.47")
fb _
( ab
ke ) X ab X
1
= ( ke _ Ie ) X ee X
1
be
fb f'(Bab)
be
ee
Ie
f'(Babe)'
(1.49)
fb _
( ab
ke ) X
be
fb
ab X be
= ( ke _ Ie ) X
ee X be
ke X f'(Bab)
be
ee
Ie X ke X f'(Babe) .
(1.49')
( fb _
ab
ke ) X ab 2 X
1
= ( ke _ Ie ) X be 2 X
1
be
fb2
f'(Bab)
be
ee
ke 2
f'(Babe) '
(1.50)
_ d( dx )
dz
X (
dz
dx
)2 X
_ 1 = dy
f'(z)
d( dy ) = _ dyd 2x = f'(z)dz ,
dx
dx 2
a
63
since
(Ze)
dx
-=
af + !P Babe X bk
= !PBag X (af -
+ !P Babee X cl
gn) + !PBagi X (bk + gn + oi) + !PBagie
(cl- oi),
(1.51 )
where !P is the function that determines the ordinates of the desired curve;
that is, !P Bab is the ordinate in Figure 1.28 at the point P standing above
the axis BS. Thus equation (1.51) expresses the fact that the area under the
desired curve between Band S, not shown in Figure 1.29, is a maximum.
Bernoulli points out that ab = ag, be = gi, and ee = ei and arcs Bab
= Bag, Babe = Bagi, and Babee = Bagie up to first order terms. He can
then write (1.51) in the form
(- !PBab
+ !PBabe)
X gn
= (- !PBabe + !PBabee) X
oi,
(1.52)
which gives him a second relation involving gn and oi. He therefore has, by
(1.45'),
- bk / ke + cl / Ie
- af/ fb + bk / ke
!P Babe + !P Babee '
- !P Bab + !P Babe
which has the uniformity he desires. He points out that bk / ke - af/ fb
= d(dy / dx) and !PBabe - !PBab = d!Pz and hence that
d( dy ) X _1_ = I
dx
d!Pz
a
64
Figure 1.30
65
= cK : Ce = MK
: Mm
n(x + a)
rm rx
..[Cii =
MD
CH
MK
CK
Mm
Ce ;
66
and thus the ratio of the times of fall along Mm and Ce becomes
. ~CG = ~CG < I.
~CF
~MD
ICF
The time of fall along the arc Mm of the cycloid is then less than that
along the circular arc Ceo Bernoulli next remarks that the time of fall along
Ce is greater than the time along Ce. To demonstrate this, he remarks that
Ce is the hypotenuse of the right triangle Cee. (It follows from this that
Ce> Ce, and hence with the same velocity a particle will descend Ce more
rapidly than it will Ce.) The time of descent along Mm is then less than the
time along Ce, and consequently the total time of descent along the cycloid
is less than the time on "any other curve ACB passing through the same
points A, B" (p. 269). This very elegant sufficiency proof is probably the
first in history.
Mm . ~CG
Ce
~MD
= ~MD
2. Euler
2.1. Introduction
It is not quite certain when Euler first became seriously interested in the
calculus of variations. Caratheodory, who edited Euler's magnificent 1744
opus, The Method of Finding Plane Curves that Show Some Property of
Maximum or Minimum . .. , believed it unlikely that it occurred during his
period in Basel with John Bernoulli.) However, we should note that Euler
considered in 1732 and 1736 problems more or less arising out of James
Bernoulli's isoperimetric problems; and even as early as the end of 1728 or
early in 1729, as Enestrom showed, he wrote "On finding the equation of
geodesic curves." In effect, Euler in 1744, following John Bernoulli, examined the question of end-curVes that cut a family of geodesics so that they
have equal length. He showed that the end-curves must be orthogonal to
the geodesics. This is the precursor of the so-called envelope theorem of
Chapter 7 below and plays a key part in Caratheodory's work.
The introduction to Euler's great work of 1744 (Euler, I, XXIV, pp.
Ivi-lxii) contains a listing of the 100 special problems taken up by Euler to
illustrate his methods; these were arranged in 11 convenient categories by
Caratheodory, the editor of Volumes XXIV and XXV in Series I. No
interested reader should overlook this "Complete listing of Euler's examples in the calculus of variations."
Perhaps the greatest things that Euler did in his 1744 book were to set
up a general apparatus or procedure for writing down the so-called Euler
differential equation or first necessary condition and to enunciate and
discuss the principle of least action, which he first discovered prior to 15
April 1743. In a letter of that date Daniel Bernoulli congratulated him on
this work, and Euler wrote it down in the latter part of 1743. (The book
itself, Methodus Inveniendi, was sent to the publisher Bousquet in May 1743
and its two appendices, in December of that same year.) Caratheodory
pointed out that Maupertuis's paper on least action was presented to the
1 Euler,
I, XXIV, p. viii and Euler I, XXV, p. vii and Carathcodory [1937], [1945].
68
2. Euler
Academy of Paris on 25 April 1744 and "had the greatest similarity to the
above mentioned ideas of Euler's." Caratheodory was of the opinion that
fundamentally Euler made no effort to establish his claim to this great
discovery because "the true reason for this notion of his lay above all in his
philosophical attitude.,,2 (We shall discuss the principle in Section 2.7.)
Euler's little (pp. 298-308 of Euler, I, XXIV) paper is the first presentation
by anyone of this great discovery and is a truly remarkable historical
document. (It appears as Appendix Ii to I, XXIV.) Here we see in nascent
form the first deep insight-apart from Fermat's principle of least timeinto the role of the calculus of variations in physics. This role has since
been of great importance to both subjects in a number of ways, and it has
become of paramount significance in quantum mechanics.
Another important contribution Euler made to the calculus of variations
was to change the subject from a discussion of essentially special cases to a
discussion of very general classes of problems, even by modem standards.
In fact, he took the fairly special methods of James and John Bernoulli
and transformed these into a whole new brand of mathematics. It is
interesting also that he had not yet completely understood the significance
of allowing the end-points of his admissible arcs to vary, and he repeatedly
is vague on this point. This is not too surprising since he was investigating
how to find the family of extremals for each general type of problem,
irrespective of the end-conditions. He clearly did not look for so-called
transversality conditions arising from variability of the end-points.
Euler's methods were themselves displaced, as we shall see later, in 1755
by Lagrange's elegant technique of "variations." In Lagrange's honor
Euler then renamed the subject the calculus of variations and dropped his
methods in favor of Lagrange's.
I, XXIV, pp. lv, x. CarathOOdory has done a great deal of investigation of the
Euler-Maupertuis correspondence on this general subject, and the interested reader would do
well to read Caratheodory's discussion of this and may also wish to consult Brunet, ACT, p.
8, who attributes priority to Maupertuis. I believe that Caratheodory's attribution to Euler is
sound, for reasons that we shall examine later (pp. 10 Iff).
3 Euler I, XXIV, p. 18. This is "Proposition II. Theorem." (Throughout the entire text Euler is
lax in specifying end-conditions.) The integral here is understood to be evaluated between the
limits of x = 0 and x = Q.
69
. .k
HIKLMNOPQRS
Figure 2.1
Mm=y
Mm=y
Nn=y'
LI= y,
Oo=y"
Kk=y"
Pp
y"'
Ii
YII,
Qq=/v
Hh = Yiv
etc.
etc.
He next approximates to his derivative with the help of finite differences
and writes
y'- y
p=-dx
y"- y'
dx
y"' _ y"
p" =
dx
yiv _ y"'
pili =
dx
p'=
etc.
q=
q'=
q" =
y" - 2y' + Y
dx 2
y"' _ 2y" + y'
dx 2
/v _ 2y"' + y"
dx 2
etc.
y'- y
p=-dx
y- y,
P,= d;-
p,,=
PII'=
etc.
y,-y"
dx
y" - YII,
dx
y" - 2y' + Y
dx 2
y'- 2y + y,
q,=
dx 2
y-2y,+y"
q,,=
dx 2
q=
etc.
2. Euler
70
J(X,
o Z dx+ Z dx + Z' dx + Z" dx + Z'" dx + ... ,
(2.1)
where he has set Z = Z(x, y, p, q, r, ... ), Z' = Z(x', y', p', q', r', ... ),
Z" = Z(x", y", p", q", r", ... ), etc., and supposes that anz is an extremal.
Then the derivative of expression (2.1) with respect to the ordinate y',
viewed as a variable, must vanish at the value y' = Nn. To evaluate this
derivative, Euler first begins Chapter II with a calculation of the effects on
y, p, q, r, s, t, etc., of altering y'. He shows these straightforward consequences in the table below, where he has set dy' = np in Figure 2.1.4
Quant. Increm.
y'
p
+ nv
+ nv
dx
p' -
nv
dx
nv
q,
+ dx
q'
rn
2nv
dx 2
nv
+ dx
nv
+ dx
3nv
dx 3
r,
+ 3nv
r'
dx
nv
dx
3
Quant. Increm.
8",
8"
nv
dx'
4nv
dx'
8'
6nv
dx'
4nv
dx'
nv
dx'
tlV
nv
dx"
5n"
dx"
8,
till
til
+ IOn"
t,
dx"
t'
IOn"
dx 5
5n"
dx"
nv
dx'
4Euler, I, XXIV, pp. 3Off. In the succeeding pages he works through a number of cases for his
integrand Z. One of these is dZ = M dx + N dy.
71
Euler then uses this in "Proposition III, Problem" (p. 39) to find what
happens in the case when Z is of the form
dZ = M dx
+ N dy + Pdp,
(2.2)
Euler now views expression (2.2) as a function of the variable y' and asks
that this expression be an extremum. He then knows that its derivative
with respect to y' vanishes. To calculate this derivative, he notes that the
change in Z dx caused by varying y' is p. nv(nv = y' - y in Figure 2.1)
and in Z'dx is N' . nv dx - P' . nv. This implies that for an extremum
P + N' dx - P' = 0, i.e., N' - (P' - P)/ dx = 0 or in the limit6
dP
(2.3)
N- dx =0.
aq
2
dx 2
-=--
ay'
ar,
3
ar
ay' = - dx 3 = - ay' ,
ar'
I
-=-ay'
dx 3
Sit is worth noting that Taylor in his MET introduced this method of representing the
dependence of Z on its variables. He also discussed the isoperimetric problem "on principles
not different from those of the Bemoullis, but with some alteration of symbolic notation." See
Woodhouse, COV, p. 29. This book is quite archaic and unsatisfactory in many respects.
6Euler I, XXIV, p. 39. Had Euler varied some ordinate other thany', he clearly would have
found again his differential equation (2.3).
72
2. Euler
az"
ay'
az = p.
ay'
ay:
_I _ 2Q . _1_
dx
dx 2
ay'
az'
az
_l_
dx 3 '
=N'-P,.-I +Q,_I_-R,_I_.
dx
dx 2
dx 3
0= dJ =.!L
dy'
dy'
(Z +z +Z+ Z')dx =
"
where f).P, e.g., stands for P' - P, etc. (Euler views these differences as
differentials.) Then by a passage to the limit, the Euler differential equation
results.
To show how to use his differential equation, Euler considers a number
of examples. Let us look at his Example III, which is, in his notation, to
minimize the integral
dxvl+Pi
Jx
=-
dxVI + pp
2x/x
pdp
+ --;:::::.:::::::====vx(1 + pp)
Vi +pp
M=----=~
2x/x
N=O,
P = --.==p===
vx(1 + pp)
His differential equation (2.3) can be easily integrated, and he finds that
73
Later in this chapter Euler sets out a mnemonic rule for writing down
his differential equation, which was to have a profound effect on Lagrange,
as we shall see later (p. Ill). His rule is that for the case when dZ
= M dx + N dy + Pdp, in the differential set M dx equal to zero, leave
N dy unchanged, and write -pdP instead of Pdp. The resulting expression should be set equal to zero. Clearly this rule gives for Euler's
differential equation the result
N dy - pdP = 0;
(2.4)
dx
+~=O.
dx 2
(2.5)
JZdx,
(2.6)
(2.6')
where this integral is evaluated between 0 and x and the integrands are
given by the differential equations
+ M dx + N dy + Pdp + ... ,
[M] dx + [N] dy + [p] dp + .. .
dZ = L dII
d[ Z] =
SEuler, I, XXIV, pp. 51-52.
2. Euler
74
II =
s= fox ~l + p2 dx.
Let us look at how Euler handled the general case described in (2.6),
(2.6') and was thereby able to "solve" quite complex problems of the
calculus of variations in the sense of finding their families of extremals. To
this end, again partition the interval from A to Z into finitely-Euler says
infinitely-many equal subintervals by the points ... Xiv' XII" X"' X" X, X',
X " ,x"' ,x iv ... , and ca11 t h e correspond'mg ord'mates ... Yiv' YII" Y ", y" y,
y', y", y"', /v .... Also for definiteness, write [Z] = f(x, y, p, q) and note
that
[Z,] = f( x"
y"
p" q,),
= f(x, y, p, q),
75
an
ay" = 0,
an'
ay,,=[Q]'dx'
~"
1
-an'" = -an = ... =[N"]dx-Il[P'] +1l2 [Q].iv
ay"
and
ay"
dx
(v)
an(v)
ay(v)
a'P(v)
aq(v)
az_
_
= L(v) _
_ + N(v) __ + p(v) _ _ + Q(v) _ _
ay"
ay"
ay"
ay"
ay"
= 0, 1,2, ... ).
(v
It remains now only to evaluate these latter expressions with the help of
our prior ones. Note that
az
ay"
az"
ay"
Q _1_
dx 2
az'
ay"
'
'
(v = 3,4,5, ... ).
y
X
dx
Since the sum of all these derivatives must be zero along a minimizing arc,
Euler deduces the condition
0= N - dP
~
+ !fQ + L[ p] _ dL [Q] _ 2L d[ Q]
~2
+ ([N] _ d[P]
dx
+ d 2 [Q] ) raLdx.
Jx
dx 2
(2.7)
This is, in effect, the conclusion Euler reaches on p. 95 .to his Problem,
Proposition III, on p. 94. He prefers to write
d'[P](H- fLdx)
dd'[Q](H- JLdx)
O=[N](H-fLdx)- - - - d x - - +
dP
ddQ
+ etc. + N - - +
- etc.,
dx
dx 2
(2.7')
76
2. Euler
J J
yx dx
Q
y dx = fo yx dx fox y dx
an extremum among all admissible curves y = y(x) (0 <; x <; a). Here he
has
IT = fox ydx,
[N]
[Z] = y,
[p] = [Q] = O.
= I,
+ xfox ydx,
where H is a constant, namely, zero. Euler notes that this equation has as
its only solution the curve y = 0, the x-axis.
Notice that Euler's Proposition III above was to find in the class of arcs
defined for 0 <; x <; a and satisfying the differential equations
dIT
dx = [Z](x, y, p, q),
dy
dx =p,
dp
-=q
dx
one which minimizes or maximizes the integral
If we write
A(X) = iQL(~)d~
in Euler's equation (2.7) or (2.7'), it is not difficult to see that it assumes
the equivalent form
2
dA
-=-L
dx
'
A(a) =
o.
(2.8)
77
d[
with
17
ZJ = [ L ] d17 + [ MJ dx + [ N] dy + [ p] dp + [ Q] dq
= f[z]dx
and
d[ zJ = [ m] dx + [ n ] dy + [p ] dp + [ q] dq.
He again wishes to find his condition that f Z dx an extremum. He arrives
without undue effort at his equation (p. 107). The details are very similar
to what we have seen above, only they are somewhat more complicated:
0= ( N - -dP
dx
ddn - etc. )
+~
dx 2
[ [N]( H - J L dX) -
d.[P](H- JLdx)
dx
+ _dd_.[_Q_](H_-J_L_d_X) _ etc.]
dx 2
+ [[ n J( G
+ etc.].
(2.9)
dx
(The interval of integration is from 0 to x.) In his Corollary 2 on p. 108
Euler simplifies this equation notationally by introducing the new variables
-the interval of integration is again from 0 and x-
T=H- JLdx,
V= G- J[L]dx(H- JLdx).
O=N+[N]T+[n]V- d(P+[P]T+[p]V)
dx
dd(Q+[Q]T+[q]V)
dx 2
- etc.
78
2. Euler
~~
d[ Z] = [ M] dx + [ N] dy + [ p] dp + [ Q] dq
[Z],
~~
= [Z],
d[ Z]
= [ L] dII + [ M] dx + [ N] dy + [ p] dp + [ Q] dq.1O
d(P+[P]V)
dx
where
v = e- flL] dX( H
fe
dd(Q+[Q]V)
dx 2
flL ] dxL dX)'
-etc.,
(2.10)
(2.10')
= g _ R(v)Vl + p2 = g _ aIInVI + p2
(2.11)
dx
where g is the acceleration due to gravity which acts along the AZ axis in
Figure 2.2.12 Equation (2.11) is equivalent to a statement of the conservation of energy with the mass = 1.
dII
79
c
A k::::-----
p~---\
Figure 2.2
Jl + pp
[L] = - anIIn- 1
[p]
= _
aIInp
Jl + pp
[M] =O=[N],
[Q] = 0,
or
C=[P]e-f[L]dx,
and thus [L]dx = d[P]/[P] since L = [L] in (2.10'). He now replaces [L]
and [P) by their values as given in the relation - f[L]dx = IC - I[P],
where he uses I for logarithm. He finds
JanIIn-Idx~l+pp = +IC-I(-a)-IIIn-lp+I~I+pp.
This gives, on differentiation and simplification, the differential equation
--
IIdp
(1
) .
P +pp
80
2. Euler
and so he finds
O=ngdx+
IIdp
(I
P +pp )
or II=-
=-
ngpdx(1 + pp)
dp
.
(2.12)
t dp / ng and has
+ pp).
He then calculates dII, which after some manipulation yields the equation
g = (a + {Jp)t"pn(1 + ppr- 1/ 2,
where {J is a constant of integration. From this, the definition of t, and the
fact that dy = p dx, he finds the differential equations
dx
-;:::=.=======_
= _ _ _ _ _----=dp
np(1 + pp)I-I/2n
Vgn-l(a + {Jp)
-dp
II =.
n/ gll+ii
V a+{Jp
dy = ----~==========
n(1 + pp)I-I/2n ~gn-l(a + {Jp)
He concludes that
I
x = - ng
I
y = - ng
dp
+ pp
V
_n /
gll+ii
a
+ {Jp
Euler notes that II = v 2 /2 can never vanish and hence, by (2.12), that dp is
always negative so that the curve is concave toward the axis as in the
figure. The vertical axis is AP and the horizontal one is Aa. At the start of
the motion p = 00 and, by definition, II = b. Then {J = g/ b n , and he sets
a = 1/ kn. This gives
bk
= -
ng
bk
y = - ng
f I +dppp Vbgll+ii
n + gk"p
_n /
,
n + gknp
Vbgll+ii
II = bk-n /
81
v2
2II
F=-=-
where
-dp
dx(l + pp)3/2
The normal force on the particle due to the action of gravity aloneignoring the reaction of the curve-is clearly
gdy
-- =
ds
gp
--=-~l + p2
= G.
It is perhaps relevant to point out that in the general case the reaction of
the curve to the heavy particle is given by!3
2n - 1 v 2
N=--_
2n
p'
and thus Euler's centrifugal force F-the total normal force-is then
exactly equal to the normal force due to gravity G plus the reaction N of
the curve, as it should be. Euler then shows by direct calculation that
F = 2nG and takes up in a little more detail the cases n = 1 and n = t.
Note that in the latter case F = G.
In the former case, n = 1, he has
x = -bkJ
dp
p(b + gkp).jI
+ pp
y= -bk
dp
(b + gkp)l + pp
s=AM= -bk
dp
b+gkp
J p(b+gkp)
=C+kl
p
p = - -b- - - = dx
gk(e s / k - 1)
13 BoIza, VOR, p. 580. (Note that the n in BoIza's and Euler's accounts differ by a factor 2.)
The solutions to Examples I and II below (n == I, n - 1/2) are very similar.
82
2. Euler
and that
= - gy
_
2
pUb +gp/k)
'
-2gbk
Yk
2b/k
- + 2gk log
,
b
2b/k -y/b
dp
_
_ 2
Ub + gp~k)
2bif
= ---'-'-"'-'--
iii +gp/k
Euler remarks correctly that in this case the minimizing curve is the same
as that for a free but heavy particle falling under the action of gravity
through the resisting medium.
In his Example II on p. 121 Euler takes up the problem of finding that
plane curve down which a heavy particle will fall in the shortest possible
time through a resisting medium; he assumes, as before, that the law of
resistance is given by the 2nth power of the velocity. The integral to be
minimized is now
radxq,
Jo
v'II
VI
+ p2
= - --'-------''---anIIn-1Vl
and [Z] =
M=N=O,
2IIv'II
[L] =
= (1 + p2)1/2/III/2
+ p2 ,
[M] = [N] = 0,
[p] = _
all)
Jl + p2
[f
exp (
-H]
(2.13)
to a.
83
2II,JIT
(2.13')
also his equation (2.10) above in this case has the form
d( P
+ [ p] V)
= 0
or
V=
C-P
[Pf
.
C-/1 + p2 - p/{II
-all)
c{t+fi
all)
V = ---'---=--=-::-..:...--
(2.14)
or
dV =
-(n + t)dII
aIIn+ 1{II
dX-/l + p2
--'-~
2II {II
nCdII-/l + p2
Cdp
+ -----'--aII n+
aII)2-/1 + p2
n dX-/l + p2
II ,JIT
nC(l + p2) dx
IIp
(2.15)
The latter equation follows easily from (2.13') and (2.14). But, by hypothesis, dii = g dx - all n dx(l + p2)1/2, and so Euler finds with the help of
relations (2.15) that
Cdp
(n + t)gdx
p2-/1 + p2
II,JIT
nCgdx{t+fi
IIp
or equivalently that
Cdp
gp2
(n + t )dx{t+fi
II {II
Cdp
dV=-
gp 2
V=D--C- = - - -
gp
'
aIIn{II
Cdp
(n
+ t )dx{t+fi
gp2
nC dx(l + p2)
IIp
II {II
aC
aD- -
gp
c{t+fi
1
= --IIn{II
II]J
84
2. Euler
dII
g - aIIn)1
+ pp
y=
pdII
__ .
g - aIIn)1 + pp
Mm
141 am indebted to Professor A. Weil for the observation that Euler's interest in this topic
probably stemmed from Leibniz's inquiries into the behavior of Jy dx under coordinate
transformations.
85
c
Figure 2.3
since p = dy / dx. (He now has shown the well-known formula for arclength in polar coordinates, namely: cis / d<p = [r2 + (dr / d<Pi] 1/2.) Thus the
length of the curve AM, which is to be minimum, is given as
foa -Iy2 + p2 dx
and Z
= (y2 + p2)1/2.
O=Q=R= ....
M=O,
-Iy2 + p2 = --:=====p2
-Iy2
+ p2
i.e.,
y2
--;::.====- = const. = b.
-Iy2
+ p2
(2.16)
From this last relation Euler infers that the minimizing curve is a straight
line. To do this, he notes in effect that his relation (2.16) states that the
length of the perpendicular CP dropped from the pole C upon the tangent
MP to the curve AM at the point M is a constant and thus that the curve
must be the straight line joining the end-points. 15
ISWilliamson, DC, pp. 223-224.
86
2. Euler
:P C
Figure 2.4
His next example is derived from the famous one of the Greeks: among
all admissible curves ABM in Figure 2.4, enclosing a given area ABMP, to
find the one of least length. This time he sets the area ABMP = x, the
ordinate PM = y, and the abscissa AP = t. Then
x
ydt
and the arc-length BM will be f(dy2 + dx 2/y2)1/2, since dt = dx/y and the
lines AP and PM form a cartesian coordinate system, ds 2 = dt 2 + dy2, in
that system.
Now if p = dy / dx, Euler has
or
Q=o,
M=O,
yVI + y]J2
r----
1/ 2
i.e.,
P=
Vb 2 - y2
y2
dy
dx
dy
y dt .
87
Figure 2.5
Tdx
Vdy,
= ~dX2 +
dy2 + dz 2
= ~dx2 +
dy2 + (T dx + V dy)2
~I + p2 + T2 + 2 TVp + V~2
p+ TV+ V~
=d~============~
~l + p2 + T2 + 2 TVp + V~2
VI + p2 + (T + Vp)2
+ G dy
= d. -;::=P=+==T=V=+=V=~===-
VI + p2 + (T + Vp)2
when the differentiation is carried out and the results simplified, Euler
88
2. Euler
finds that
(Tp - V)(dT+ pdV)
dp-------1 + T2 + V2
Consequently, since dp / dx = dy / dx 2, he has
dxd
Y=
+ V2
(217)
.
Tdy - Vdx
dx + Tdz .
(2.18)
-=-~--
Tdy(dx2 + 4Y 2 + dz 2) = (dz d 2z
+ 4Ydy)(Tdy - V dx).
dy(dx2
T dy
T dy - V dx
Td 2z
dx + T dz
dy
dy
+ Vd 2z
+ V dz . (2.19)
At this point Euler makes the simplifying assumption that the surface is
a figure of revolution given in the form y2 + Z2 = X2(X) so that dz
=XdX/z-ydy/z. Since he has T=XdX/zdx, V=-y/z, the first
and last members of (2.19) can be expressed as
dy dy + dz d 2z
dx 2 + dy2 + dz 2
Z dy - Y d 2z
Z 4Y - Y dz '
89
he then has an expression for (dx 2 + dy2 + dZ 2)1/2 in terms of dx, dy,
which can be solved for dy. This yields after considerable manipulation the
relation
yv dx
b dx~( 1 + V 2)(y2 - X 2)
dy = - - -----'---:=~==~-X
x.jb2 _ X2
Finally he sets y
= Xt
= b dx../f+"OO
Jtt-l
XJbb-XX
In closing his discussion he remarks that both X and v are functions of x
alone and hence that the variables are separated. 16
In the second part of this chapter Euler takes up another topic. He
considers now three indefinite integrals
(2.20)
A
B
---=-
+ dA) (B + dB) =
A .B
loadx~l + p2
16Euler, I, XXIV, p. 136.
17Euler, I, XXIV, pp. 140-163. His general result is stated in Proposition IV. Problem on p.
155.
90
2. Euler
A
Figure 2.6
+ x , d x F 1 +xdxVl + p2 + ...
and notes that the ordinate y enters precisely into p, and p and no others.
It then follows that
[
dA =
x,p,
~l + P~
dB = -nv' d--=P-+ p2
VI
xp
VI + p2
= cd
~l + p2
y=f
bdx
~(c -
X)2 -
+ p2)1/2 = (c -
x)p
,
b2
91
~l + p2(a)
Euler sets I
= y(a), g = (1 + p2(a1/2,
A
= faa y dx,
+ p2 .
and
= faa dxVl + p2
dA = nv' dx,
Jl
or if c
~l + pp
= II g,
x +b=
_-,CP,--_
Jl + pp
~C2 -
(x
+ b)2 .
To find the value of the constant c, the radius of this circle, he notes by
definition that
=c
and that
y (a)
= h
~C2 - (a + b l
It follows that
c2 = h~C2 -
(a
+ b)2
(c 2 -
(a
+ b)2),
y =J2cx - x 2
92
2. Euler
It clearly has the axis for a diameter and passes through the origin. In the
equation above relating a and c Euler takes the upper sign and concludes
that c = a. He then considers yet another case, but this is not really
relevant here.
'p
If'
dyV
np,
np
= dx '
d 'q 1/ =np
-
dx 2
iv
= ow,
np
'P=-dx
d . q'If = _ 2np
'
dx 2
ow
dx'
ow
dx 2
'
d 'q v =ow
-.
dx 2
He observes that I is the expression that would be present if only Nn had
been varied, and it is therefore the same value that has been already
~--------I~K~L~M~N~O~P~Q~R~S~T~--------~Z
Figure 2.7
93
+ ow' dA'.
(2.21)
+ ow' dB'.
(2.21')
Euler further remarks in Proposition IV. Problem (p. 174) that along an
extremal both (2.21) and (2.21') above must vanish and hence for arbitrary
quantities a, {3
np adA
+ ow . adA' =
0,
np' {3dB
+ ow {3dB' = O.
(2.22)
and equally
adA'
+ {3dB' = O.
(2.22')
(2.23)
{3'
-;;=73
is a constant. 20
In the Scholium 1 that Euler appends to this proposition, he notes that
there are two closely associated problems which are essentially equivalent:
the first is to maximize or minimize a function V in the class of arcs for
19Euler, I, XXIV, pp. 174-175.
I, XXIV, p. xxiii. Euler essentially points out the result on p. 175. We shall see in the
next section how Euler tried to generalize this result.
20 Euler,
94
2. Euler
Figure 2.8
ydx= const.,
Jydx~l + p2 = min.
n dx = dx~l
in which a /
+ p2 - d .
~l
yp
+ p2
dy
P=dx=
~y2 -
(ny + b)2
ny+b
For b = 0, the solution of this equation is the straight line joining the
points A and Z. For the case when the multiplier n is zero, the differential
21Boiza, VOR, p. 488, Mayer [18T1], p. 60; Kneser, LV, pp. 131, 136.
95
equation becomes
dx
= -;::::.b=d=!y==-
~y2 _ b2
COSh( X! k
) = : .
dx=
(b-y)dy
--;::::::;:===~2by _ y2
= c
2b-Y ~2by - b2
+3b
or equivalently as
II = f [Z] dx,
d[ Z]
d(P+(a+H- fLdx)[P])
O=N+ (a+ H- fLdx)[N] - - - - - d - , - x - - - -
etc.,
(2.24)
where H = JoL dx. His proof is not difficult. He notes that the differential
of II is
2. Euler
96
and of Z is
nJl . dx ( N + [ N]. V -
d(P+[P]V)
d 2(Q+[Q]V)
dx
+
dx 2
etc. ,
d( P+ ( C - f Ldx ) [ P])
+ - - - - - - - - - - - etc.
(2.24')
dx 2
dZ=Ltis+Mdx+Ndy+Pdp+ ....
In this case Euler finds, in his notation, that
_I d. (C - f L dx )p = N _ dP
";1 + pp
dx
+ d dQ _ etc.
dx
dx 2
(2.25)
since [P] = pl(l + p2)1/2. The result follows then immediately from (2.24').
In the case when N = 0, Euler sees that the differential equation (2.25) can
be integrated and finds
A
(C- fLdx)p
';1 + p2
= - P
+ 5!Q - . . . .
(2.26)
dx
(C- fLdx)p
';1 + p2
= N dy - pdP
dd
+ p!!.!!!2 dx
etc.
d
d2
0- (N + [NIA) - dx (p+ [pIA) + dx 2 (Q + [QI'A) - " ' ,
dA --L
dx
'
which is equivalent to (2.8) above (see p. 76) except that 'A(a) == 0 is no longer valid.
97
finding
J [L
dx~l + p2 + pd. (c - J L dx )p 1
~l + p2
= - A + Z - Pp - Qq + p!ff! + ....
But the left-hand member of this equation is integrable in the form
c- JLdx
~l + p2
'
c- JLdx
the relation
pdQ
= A - Z + Pp + Qq - ----;J;- .
~l + pp
A dp
(c - J Ldx)pdp
~l + pp
+A
= -Pdp
+ qdQ.
(2.27)
dp - L dxb
+ pp +
(c- JLdx)pdp
~l + pp
= q dQ + Q dq
+ B + Ap + ( C -
J L dx )~l
+ pp =
Qq.
= Z dy - P dx - Pp dy + dQ + pp dQ - Qp dp;
(2.28)
Z dy
= sn dy;
(2.29)
98
2. Euler
dx =
+ sn)ds
-;::::======={A2
+ (B + sn)2
(2.30)
+ pp
A dp(A - Bp)(I-n)/n
= - ----:---
np(l+n)/n
= -
-;;
dp(A - Bp)(I-n)/n
J p(l+n)/n~l +
pp
dp(A - Bpil-n)/n
A
y=--J---n
pl/n~l + pp
(2.31)
In the case n = 1 he finds from the second of the equations (2.30) that
+ B2 + 2Bs + ss -~A2 + B2
or by setting B = band (A2 + B2)1/2 = C,
x =~A2
+ c = ~C2 + 2bs + ss
A~l
+ pp
p
+b
c
=t.
f(x - b)2 - c2
cdx
f(x - b)2 - c 2
which is a catenary.
In Proposition VI. Problem on p. 207 Euler turns to a further generalization. He considers the variational problem that arises when A and B are
two given definite integrals such that A is to have a given value and a
function 4(A, B) is to be an extremum. To handle this situation, Euler first
observes that d4 can be expressed as d4 = adA + {3 dB; and that since 4
is to be an extremum subject to the condition A = const., there must exist a
constant y such that
(a+y)dA+{3dB=O
99
cx.r-----..:,:-:;~-----1
Figure 2.9
for the desired curve. He notes that a, f3 have fixed values, and he writes
(a + y)8, 11 = f38; from this he concludes that there are values ~,1I such
that along the extremal arc
~=
~ dA
+ 11 dB = O.
(2.32)
f yydx
f ydx .
Thus he seeks the curve aMb having the lowest center of gravity for the
figure AabB. He lets AP = x and PM = y. His denominator is now the
integral A and his numerator B. It then follows, in his terms, that their
differentials are np' dx' 1 and np' dx 2y, and his relation (2.32) above
becomes ~ + 2TV' = 0 or y = const. He concludes that the line af3 in the
figure is the solution curve.
100
2. Euler
To prove this result, Euler considers the arc Q that maximizes aA + f3B
and any other curve R on the same base from 0 to a which gives A the
same value as Q does. Then aA + f3B cannot have a greater value on R
than on Q, i.e.,
aA(Q)
;> (aA
But A(Q) = A(R) and hence f3B(Q) ;> f3B(R). Thus Q is the desired arc
for the problem of maximizing B subject to the condition that A is a
constant. The converse is clearly trivial to establish. [Euler has tacitly
assumed that f3B(Q) ;> f3B(R) implies B(Q);> B(R); this clearly depends
on the sign of f3 and is not discussed by Euler. The converse is noted in his
Corollary 1.]
In Proposition II. Theorem he generalizes this result to the case of three
integrals A, B, and C. He now shows that the arc which maximizes
aA + f3B + yC is the same as that for the problem of maximizing C subject
to the conditions that both A and B have fixed values. The proof is similar
to the one above: let Q be the maximizing arc for aA + f3B + yC and R
any other arc such that A(Q) = A(R) and B(Q) = B(R); then since
aA(Q) + f3B(Q) + yC(Q) ;> aA(R) + f3B(R) + yC(R), it follows directly
that yC(Q) ;> yC(R). He also notes the converse.
He unfortunately gets into difficulties when he attempts to take up the
major result of the chapter, Proposition III. Problem (p. 221). He seeks to
find a necessary condition that a functional C be an extremum among all
arcs on a common base that give the functionals A and B fixed and
preassigned values. To do this, he attempts to reason by analogy with his
results on multipliers in his Chapter V (see Section 2.5 above). To carry the
analysis thro'ugh, he chooses (Figure 2.7) to vary the ordinates Nn, 00, Pp
by "small" amounts n.", ow, and fYlT' respectively. This enables him to
calculate the variations of the functionals A, B, and C. He finds directly
that
(2.33)
101
pity ... the work, which contains so many novel ideas, should end in this
fashion on a discordant note."
24 James Bernoulli, CL. Example VI in Euler I, XXIV, p. 185 takes up the problem of finding
among all curves az of the same length the one which, when rotated about the axis AZ (in
Figure 2.8), contains the greatest volume. Euler shows that the solution is expressible as
x-
(yy
~b4 -
+ be) try
+ bd .
(yy
He remarks that the radius of curvature of this curve "is inversely proportional to the
ordinate y; whence it is clear that the curve in question is an Elastic one." (p. 186).
2S Euler, I, XXIV, pp. 298-308. The interested reader should read the very good account by
Caratheodory on p. Ii.
26The interested reader may also wish to read Brunet, ACT. This is a small historical study of
the principle of least action and in particular the works of Euler and Maupertuis on the
subject.
102
2. Euler
c
A
pt-------->."..M
Figure 2.10
He first observes that if there are no forces acting on the particle and its
initial velocity is a constant b, then his principle implies that s is a
minimum, and so the motion must be uniform and along the straight line
joining the end-points; this is, of course, one of Newton's laws of motion.
In case the only force acting is gravity (Figure 2.10) Euler sets AP = x,
PM = y, and Mm = ds. He therefore has dv = gdx-remember that v is
half the square of the velocity-and so v = a + gx. The path of the particle
is then determined by minimizing the integral
dx C I / 2 /(a - C
y=
1:.g ~C(a -
+ gx)
rl:i
x,
103
dy
-dx
=P =
or
y=
JC
---;.==:::::::::::=====-
~A -
c+ JXdx
dx/C
~A -
C+ JXdx
Y~l + p2
p= - - - - - - - - - - -
= - ~:============-
2~A -
JXdx - jYdy
dP~A -
JXdx- JYdy
Xdy- Ydx
~(l + p 2)3
or
2dp
Xdy- Ydx
---- = ---=-"'------=--1 + p2
A - J X dx - J Y dy
Xdy- Ydx
v
He rewrites this as
2vdp
Xdy- Ydx
~l + p2
(2.34)
104
2. Euler
-/(1 + p2)3
r = _
dx
dp
(1 + p2)
This gives him the new relation
- p2 J X dx + J Y dy - A
----------------=C
1+ p2
or equivalently
p=
YB+ JYdy
yc+
JXdx
dy
YB+ Jydy
dx
JC+ JXdx
Txdx
Tydy
dv= - - - - - - = -Tdt
t
v=A - fTdt.
'
105
p~---~
c"-----~-
Figure 2.11
dPVA - JTdt
Tydx,fl+P2
pTdt
J(1 + p2)3
2tVA - JTdx
T(xdy - ydx)
dp
=--
2t(A - J Tdt)
To eliminate variables, Euler now expresses dx and dy in terms of dt and
has
dp(py + x)
Tdt
xdp
pdp
--1+ p2 .
px- Y
px- Y
VA - JTdt
VI + p2
xdy - ydx
ds
= const.
106
2. Euler
dO
r2 dT
= const.
Let us now denote by A the area swept out by the radius vector starting
from the axis CB. Then the last relation above states that
2 dA = r2 dO = const.
dT
dT
The quantity dA / dT is called the areal velocity, and our relation above says
the areal velocity is constant when the force acting is a central one and
conversely. In essence this is Kepler's law that equal areas are swept out by
the radius vector in equal intervals of time.
Euler considers two other problems which need not concern us. He also
observes that his principle is applicable to the case of many bodies or
particles. Instead of discussing any of this further, let us quote Euler's
statements at the beginning of his paper (p. 298):
Since all the effects of Nature follow a certain law of maxima or
minima, there is no doubt that, on the curved paths, which the bodies
describe under the action of certain forces, some maximum or minimum
property ought to obtain. What this property is, nevertheless, does not
appear easy to define a priori by proceeding from the principles of
metaphysics; but since it may be possible to determine these same curved
paths by means of a direct method, that very thing which is a maximum
or minimum along these curves can be obtained with due attention being
exhibited. But above all the effect arising from the disturbing forces oUght
especially to be regarded; since this [effect] consists of the motion produced in the body, it is consonant with the truth that this same motion or
rather the aggregate of all motions, which are present in the body ought to
be a minimum. Although this conclusion does not seem sufficiently
confirmed, nevertheless if I show that it agrees with a truth known a priori
so much weight will result that all doubts which could originate on this
subject will completely vanish. Even better when its truth will have been
shown, it will be very easy to undertake studies in the profound laws of
Nature and their final causes, and to corroborate this with the firmest
arguments.
Again at the end of the appendix (p. 308) Euler muses on the generality
of his principle of least action. He remarks there that the principle seems to
run into a difficulty when one considers motion in a resisting medium. He
has some difficulty in trying to explicate the reason for this; but Lagrange
in his Mecanique Analytique shows in 1788 that the principle is, in general,
valid for conservative forces provided that all constraints are independent
of time. In the contrary case the action integral may, in fact, not be an
extremum. 28
27W. D. MacMillan, SAD, pp. 268-269. Note in the relations above that
28Lagrange, MA, Part 1, Section 3, Paragraph 13.
denotes time.
107
I)
In
V ds.
108
2. Euler
He observes that he can reduce all other laws of statics to this principle,
but he does not here discuss any relation it may have to dynamics. In fact,
he does not take up the principle of least action until his 1744 paper,
entitled "Harmony between different laws of nature which have, up to
now, appeared incompatible.,,35 What he says is that "Nature, in the
production of its effects, acts always by the simplest means." He amplifies
this by remarking that "the path ... is that along which the quantity of
action is the least." (He means by "quantity of action" the action integral
32 Maupertuis [1740], pp. 170-176.
33 In the Acta Eruditorum of 1695 Leibniz introduced the concept of vis viva, or living force. It
is, by definition, twice the kinetic energy, i.e., mv 2, where m is the mass and v the velocity of a
particle. There is an allied concept of vis mortua or dead force which is essentially potential
energy.
34Maupertuis [1740].
35Maupertuis [1744], pp. 417-426.
109
III
In his August 1755 letter Lagrange says to Euler that his departure point
was the remark in Euler I, XXIV, p. 52, where Euler said "A method is
therefore desired, free of geometric and linear solutions by which it is
evident that, in such investigations of maxima and minima, -pdP ought
to be written in place of Pdp." (I do not know what Euler meant by the
word "linear"; perhaps it referred to his use of polygons for comparison
purposes.) To do this, Lagrange produced what he thought of as a new
form of differential which came to be known as a variation. He envisaged
it as acting on functions of a real variable such as F(y) as well as on
operators such as JZ dx. To distinguish his new process, which may be
viewed as an operator, from the usual differential he wrote it as 8 instead
of d. In his letter he thought of it in connection with a function of x, y, y',
etc. in this way: the variable x is kept fixed and the ordinary differential is
calculated, but with the differential arguments 8y, 8y', etc. (Later Euler,
Lagrange, and Legendre extended the process to the case where x may
vary as well; see p. 115 below.)
He then asserts that the commutation rules
8dF(y)
= d8F(y),
(m
and
(3.1')
hold, where he writes the differential of the integrand Z as did Euler, i.e.,
in the form
dZ
= M dx + N dy + P dy + Q dy + R dy + ...
112
+J
Ud 2z,
+ Ud 2z -
J Ud 3z,
(3.2)
J u J z = J u X J z - J z J u.
J~u
+ N dy + P dy + Q dy + R dy + ... ,
he writes
8Z = N 8y + P8 dy + Q8 dy + R8 dy + ...
+ J d 2Q8y+ ...
(3.3)
113
=f
(N - dP
+ (Q -
+ d 2Q - ... ) 8y + (P - dQ + ... ) 8y
... ) d8y
+ ... ;
(3.4)
in these relations the integrals are evaluated between the limits x = 0 and
x = a, as are the other terms. Then if the comparison curves are such that
o = 8y = d8y = . .. at x = 0 and x = a, Lagrange has
8fZ= f(N-dP+d 2Q-d 3R ... )8y.
(3.5)
+ d 2Q - d 3R ...
O.
(3.6)
1\Z),
(3.7)
+f
(P) My
+ ... ;
hence
8fZ= f N8y+ f Pd8y+ fQdY+'"
+ fLf(N)8Y
(3.8)
114
f~L
+ 1oQ[Q+(Q)V]d 28y+
....
8'IT = 8 (Z) =
f 8(Z) f [
(L) 8'IT
+ (N) 8y + . . . ].
f f
8 Z=
f
f f
N 8y+
Pd8y+ ...
Ve - J(L) ,
J J
eJ(L)L e- J(L)(N)8y
+ (N)e-J(L)V -
d[ P
+ (P)e-J(L)V] + ...
= 0;
or if e-J(L)V = S,
N+(N)S-d[P+(P)S] +d 2 [Q+(Q)S] =0.
(Note that this V is not the same quantity he called V just above.)
115
l)Z = n8x
5 Lagrange [1760]. In the Memoirs of the Turin Academy he published not only the 1760/61
paper but a second one that appeared in Vol. IV, 1766/69. The reader may also wish to
consult his paper on pp. 4-20 of Vol. I, which he wrote in 1759 (Lagrange [1759D on maxima
and minima of real functions, especially his discussion of the second differential.
6 Euler [1764], [1764'], and [1771]. There are also some other interesting papers by him on the
subject in the same volume.
116
+ (p + (r -
dq
+ d 2r -
... ) d 26x
... ) 8x
+ (q -
dr
+ ... ) d8x
+ .. .
+ d 2R - ... )8y
dR + ... ) d8y + (R - ... ) d 28y + ...
+ (w - dX + d 2p - ... ) 6z + (X - dp + ... ) d8z
+ (p - ... ) d 28z + ... .
+ (P + (Q -
dQ
(3.9)
+ (v -
dw
+ d 2X -
d 3p +
... ) 8z = 0
(3.10)
+ (q - dr + ... ) d8x
+ (r - ... )d 26x + ... + (P - dQ + d 2R - ... )8y
+ (Q - dR + ... ) d8y + (R - ... ) d 28y
+ (w - dX + d 2 p - ... ) 8z + (X - dp - ... ) d8z
+ (p - ... ) d 26z + . . . = 0,
(3.10')
(p - dq
+ d 2r -
... ) 8x
where these last relations hold when the expression is understood to mean
its value at the terminal point less its value at the initial point. He calls the
value at the latter point' M and at the former M'. Then the condition is
that M' - ' M = 0. When there are no constraints on the variation 8x, 8y,
8z at the end-points, Lagrange observes that relation (3.10) becomes the
three equations
n - dp + d 2q - d 3r + ...
+ d 2Q dw + d 2 X -
N - dP
v-
+ ...
d 3 p + ...
d 3R
= 0,
= 0,
(3.11)
= 0.
[As we recall in the parametric case, the integrand I(x, y, x', y') is such
that for" > 0, I(x, y, "X', ley', KZ') = lCj(x, y, x', y', z'); hence Ix'x' + /yo y' +
h'z' = I, where x' = dx/dt,y' = dy/dt, z' = dz/dt, and t is the parameter.
We have, therefore, Ix'x' + /yo y' + h'z' = f.]
117
Suppose now that the arc in question is assumed at one end-point to lie
upon a given surface. In that case Lagrange's condition (3.1O'r-I take the
case where f contains only x, y, z, x', y', z' since this does not restrict the
generality of the discussion-becomes 0 = fx' 8x + /y, 8y + /Z, 8z where 8x,
8y, 8z are the direction cosines of the normal to the bounding surface at
the point where the extremal intersects it. But we can write this equation as
tion.
Lagrange does not enlighten us on why he felt that it was necessary to go
to curves expressed in parametric form to find the correct transversality
conditions. Certainly, later writers seemed to comprehend the matter less
than he, and the situation was really not completely clarified until much
later, as we shall see.
Having set out his relations, Lagrange now considers the brachystochrone problem to illustrate the power of his process ([ 1760], p. 339). He
takes x as the vertical axis and y, z as horizontal. Then the integral to be
minimized is
dy
dz
- d - - =0,
- d - - = 0. (3.12)
2xrx
rx ds
rxds
rxds
If the second of these is multiplied by 2 dy / X 1/2 ds and the third by
2 dz / X 1/2 ds, and if the results are added, there results the equation
ds
dx
- - - -d--=O,
d(
! X
dx 2
xds 2
= 0,
ra '
118
z=t
~a
/i
+b
y=t
,fa
/ii
+b
dy = dt
,fa
,fb
+b
dt =
/-; dx
~c - x '
ab
c=-a+b'
dx
dy
dz
rx ds
rx ds
(3.13)
,f--; ds
dy
--=0
l-; ds
'
00,
~x
= 0, ~y, ~z arbitrary.
b
00,
"which will transform the cycloid into a vertical line." However, if the
plane is not horizontal but vertical and orthogonal to the y- or z-axis, then
~y = 0 or ~z = 0: in the former case
dy
--=0
,f--; ds
dy
--=0.
,f--; ds
In both cases a, b can be found and the cycloid cuts the plane at right
angles.
He next supposes that the second end-point is constrained to lie on an
arbitrary surface. Let the surface be given by the relation dz = T dx + U dy
so that ~z = T~x + U~y. Now the condition M' = 0 becomes
dx
dy
dz
--=-~x + --=-- ~y + --=-- (T ~x + U ~y) = 0;
,fx ds
,fx ds
,fx ds
119
~x + T ~z
~x ds
~x ds
) 8x
+(
~ + U ~z
~x ds
~x ds
) dy = 0,
dx
+ Tdz
dy
= 0,
+ Udz
= O.
But this means that the minimizing curve cuts the surface in a right angle.
If now the brachystochrone is to pass from one surface to another, then
Lagrange remarks that both' M = 0 and M' = 0 for the first and last points
on the curve. He says it follows simply that the minimizing curve cuts each
surface orthogonally. His analysis of the case when the first end-point lies
on a curve and the second one is fixed, was correctly criticized by Borda
and Lagrange later amended his reasoning to conform to Borda's.7
We see here how Lagrange has begun to master the transversality
condition and also has no fear of proceeding in three-space. We can also
observe how formalistically he proceeds, but also how adroitly and carefully. In what follows he continues to exploit his ability to handle problems
with variable end-points. Inasmuch as his procedure is so formalistic it is
quite remarkable that it leads to the correct result. The whole procedure
shows, however, how immature was the real understanding of some aspects
of analysis in the eighteenth century and yet how bold and how inspired
and inspiring were its best practitioners.
Let us see how Lagrange copes with the problem of finding the solution
to the brachystochrone problem when all the admissible curves are constrained to be on a given surface dz = p dx + q dy. This gives him the
constraint 8z = p 8x + q 8y on the variations 8x, 8y, 8z, which must hold
along each curve. He goes back to relation (3.10) and substitutes for 8z its
value above; he now has 8x and 8y arbitrary so he can obtain the results
dz
ds
dx
- p d - - - - - - d - - =0
rx ds
rx2x
l; ds '
dz
dy
- q d - - - d - - =0,
rx ds
rx ds
for the brachystochrone problem. Lagrange notes with the help of the
relation dz = p dx + q dy that these two equations are equivalent; thus the
7Borda [1767), p. 558 and Lagrange [1766/69), p. 63. The criticism was based on Lagrange's
failure to notice that when a particle descends from a fixed curve, the line x = (XI - vr)/2g
on which the cycloid-producing circle rolls is not any longer fixed. The difficulty arose largely
because Lagrange failed to specify the initial velocity VI' We discuss the point later in more
detail. The reader may also wish to read Bliss, COV, pp. 78-79. The essential point is that the
initial curve at its intersection point with the minimizing curve must have its direction parallel
to that of the final curve at the second point which the minimizing curve cuts orthogonally.
120
rx
+ -dx-
rx ds
=0
'
dz
q-ds
rx
+ -dy~~ ds
=0
'
6Z' =
+ p6d 2x + ...
+ N 6y + P 6dy + Q 6dy + ...
+."dz + w6dz + X6d 2z + "',
n' 6x + p' 6dx + q' 6d 2x + ...
+ N' 6y + P' 6dy + Q' 6dy + .. .
+ .,,' 6z + w' 6dz + X' 6d 2z + ... .
He therefore has
f f 6Z f (n 6x + p 6dx + q 6d x + ... )
+ f L J(n' 6x + p' 6dx + q' 6d x + ... )
6 Z=
121
and remarks that the first integral is, just as before, transformable into the
form
fL
= J [ n'(
+ [ q'( H
- J L ) - ... ] d8x
+ ... .
H- f L) = (n),
p + p'( H- f L) = (p),
n + n'(
q + q'( H -
JL )
(q),
with corresponding definitions for (N), (P), (Q), (p), (w), (x). Then
... ] 8z
]8y+[(Q)-'"
= O.
]8dy+ ...
(3.14)
This form for 8fZ is quite similar to that for Problem I above (p. 116).
122
L'dIl'
,~Z" =
n" ~x
+ p"~dx + ... ,
then equation (3.14) is still valid provided that the quantity (n) there is
augmented by n"[H' - I(H - I L)L'], (p) by p"[H' - I(H - I L)L'], etc.
Clearly, this process can be generalized.
In Section XI Lagrange poses his Problem III: "To find the equation for
a maximum or minimum of the formula IZ, when Z is simply given by means
of a differential equation which contains no differentials of Z beyond the
first."
To solve the problem, Lagrange imagines Z is defined by an equation
whose variation is
~
dZ
J J J
e- fT) = (n),
Je- fT)
(p),
123
He starts with the example of finding the surface of least area among all
those bounded by a given curve. (This is the so-called problem of Plateau.)
To analyze the situation, Lagrange assumes the surface is represented by
the equation (in his notation)
dz
dz
dz=pdx+qdy= -dx+ -dy
dx
dy
and writes the integral to be minimized in the nonparametric form
"where the two integral signs indicate two successive integrations, the one
with respect to x and the other with respect to y, or inversely." (Although
Lagrange does not make clear the character of the region of x, y-space
over which the integration takes place, let us assume that Xl";; X ..;; x 2 ;
Yl ..;; Y ..;; h) He then writes
.b
+ I I dxdy
P
+ p2 + q2
q
~l + p2 + q2
lidz
dx
~:
:;
Lagrange observes that lip = lidz / dx = dliz / dx, and hence he can write
d8z
dx
liz.
(I have inserted limits of integration into Lagrange's notation.) He assumes
that the surface is such that liz = 0 at x = x 1 and x = x 2; i.e., the surface
passes through a fixed boundary curve. Then he writes
~===p===- = P
~l + p2 + q2
and the double integral being considered (in his notation) becomes
I I dxdy
~+~+i
dP
dP
-dliz = - I dy I dx -liz=
- I I dxdy -liz.
124
(Notice that he does not have a notation for partial derivatives.) Similarly,
he lets
and finds
thus
- J J dx dY [
~: +
!! ]
8z = 0,
ap + EQ =0
ax ay
which says that P dy - Qdx is an exact differential; hence he says "the
problem is therefore reduced to seeking p and q by the conditions that
pdy - qdx
"';1 + p2 + q2
are each exact differentials." He notes that the choice of p and q each
constant satisfies these conditions but that this is only a very special case
"since the general solution must be such that the boundary curve of the
surface can be determined at will."
Now he turns to the case where the desired surface is to be a minimum
among all those of given volume. In this case
J J dxdy"';l + p2 + q2
min.,
J J z dx dy= const.,
and
8(J J zdxdy) = 0,
This is equivalent to the conditions
J J dxdy8z= 0,
J J dx dY [
~: + ~ ] 8z = O.
k+:+~=O.
This is expressible by saying that (P
125
~l + p2 + q2
+ (y
- b)2
(y - b)dy
+ (x
- c)dx
dz
~,2
(3.15)
pdx+qdy
~====- +kxdy,
+ (x - ci =
,2
and ob-
(y _ b)2 _ (x _ C)2
so that
p=
x-c
~,2
y-b
-;=============-
q=
_ (y _ b)2 _ (x _ C)2
~,2
_ (y _ b)2 _ (x _ C)2
(1)
1
bdx-cdy
- +k xdy - - ydx +
,
,
.!.+k=-.!.
In his Appendix II (pp. 357ff) Lagrange gives a quite different but also
elegant application of his methodology. Specifically, he seeks "to find that
polygon, whose area is greatest, among all those which have a fixed
number of given sides." He points out that "the method of this memoir is
also applicable to these sorts of questions ...." Since his analysis is
somewhat sketchy and depends on finite difference techniques, perhaps we
should give a little discussion to expand his remarks.
Let the coordinates of the vertices of the given polygon be (xo' y~, (x I'
YI)' ... , (xI - i' YI-I)' (XI' YI) = (xo' Yo), and let (xo' Yo) and (XI-I' YI-I)
lie on the x-axis. To evaluate the area, Lagrange decomposes it into
trapezoids with altitudes Xi+1 - Xi and basesYi+1 + Yi. (Note that some of
these may have negative areas.) The area of such a trapezoid is
Yi + I + Yi
[ Y i + I - Yi ]
2
(Xi + 1 - Xi) = Yi +
2
(Xi+1 - Xi)'
J( 1) dx=.~
y+ "2dy
I - I [
.=0
Yi+
Yi + I
Yi ]
(Xi+I-Xi ),
(3.16)
126
o=
J[(
8y) dx
(3.17)
but the length of each side, (dx 2 + dy2)1/2, is fixed and consequently
8~dx2 + dy2 =
+ dy 8dy = o.
~dX2 + dy2
dx 8dx
In other words,
dyMy
(3.18)
8dx=-~.
J[
dx 8y
+ z My ]
(3.19)
= 0,
1
ydy
1 dy2
z=-dx----- - .
2
dx
2dx
+ udv + du
dv = vdu
+ (u + du)dv =
vdu
+ u'dv,
Uj +
I(Vi+ 1 -
Vj)'
= UIVI
~ d(ujv;)
j=O
- UOVo
=
=
where u;
that
[-I
[-I
j-O
j-O
[-\
[-I
i=O
i=O
~ vjduj + ~
Uj
+ 1 dVj
~ vjduj + ~ u; dvj,
(3.20)
[-I
[-I
[-I
i=O
i=O
i-O
j-I
[-\
[-I
Uj
j=O
[-I
[-I
i=O
i-O
i=O
(3.20')
127
J z d8y= z 8y - J d'z 8y
(3.21)
=x
- 'z
= x' -
= x + dx -
= x + dx -
- dx
2
ydy
dx
1 dy2
2 dx
+ -- + - -
i.e., that
adx = (x
+ r2 = x 2 + y2,
-J dyd8y
dx '
8x =
(3.22)
as we see from relation (3.18), and so the sum above must vanish as well as
J [ dx 8y
+ ( -I
y2
Y dy
]
dx - - - -1 -d ) d8y.
dx
2dx
(3.23)
+ (k
-dy
dx
+ -1 dx 2
y2
-1 -d ) d8y = O.
2 dx
+ td(X2) + td(y2),
128
He sums up in the theorem (p. 360) that "the largest polygon that can be
formed with given sides is that which can be inscribed in a circle." He goes
on to say that Cramer demonstrated this result synthetically in the 1752
Memoirs of the Berlin Academy.9
He goes on to consider the case when the individual sides are not given
but only their sum; i.e., he fixes the perimeter. Then the variation of
f(dx 2 + dy2)1/2 must vanish and
f[
dX13dX+dy13dy=O,
~dX2 + dy2
dx 13y
+ 2. dx 13dy +
(
)Mx
] = 0,
Y +I
2. dy
+ (y
tdy +
) My
)8dx]- O.
kdx
~dX2 + dy2
+ u13x -
From this he concludes that dx - d' z = and d' u = 0, which gives the
relations x - 'z = a or x + dx - z = a, 'u = b or u = b. These mean that
y +
dy +
k dx
,jdx 2 + dy2
= b.
(3.24)
Lagrange multiplies the first by dx and the second by dy and adds, finding
(x
+ t dx) dx + (y + t dy) dy =
a dx
+ b dy
1)2.
-k-dx
- = ( b-y--dy
2
dx 2 + dy2
9Cramer [1752].
129
k 2 = (a - x -
t dX)2 + (b -
Y-
t dy)2
= a 2 + b 2 - 2ax - 2by + x 2 + y2
- (a - x)dx - (b - y)dy
+ idx 2 + idy2
= a 2 + b 2 + r2 - i dx 2 - i dy2,
since
x 2 + y2 - 2ax - 2by
= r2,
k2 ,
"Which shows that all sides of the polygon must be equal to each other
and consequently the polygon must be regular." He goes on in his last
paragraph to clean up a few details. If the first and last x and yare fixed,
then z Sy and u Sx vanish. But, he continues, if the base is fixed and of
length c, then the corresponding ordinate y will not be fixed and it is
necessary to make u = 0 and z = 0 when x = c; this gives b = 0 and c = a,
and the base becomes the diameter of the circumscribing circle.
In the same issue of the Memoirs of the Turin Academy Lagrange
proceeded to apply his variational method to the solution of a variety of
dynamical problems. In this paper he generalized Euler's principle of least
action and used his own techniques to solve the resulting variational
problems. \0 Since this work would carry us too far afield, I leave the topic
and turn to more relevant work of Lagrange.
II Lagrange
130
done in other places in the same volume ... but as, by the citation of the
memoirs of M. Euler ... they appear to wish to attribute to him this
method, I believe I must remark that I am the first author and that I do
not share the credit with any person."
Lagrange goes on to correct another misstatement by these men (Vol. II,
p. 531) regarding Euler's principle of least action. They said Euler had
shown that on the trajectories described by any number of bodies whatsoever, the action integral is always' a maximum or a minimum. Lagrange
says that "he [Euler] has seen that the trajectory of a body being acted
upon by any central forces is the same as the curve which one would find
by supposing the action integral to be a maximum or minimum . . . . The
application of this beautiful theorem to any system of bodies and above all
the way it serves to solve with the greatest ease and generality all the
problems of dynamics is completely due to me ... ,,12
Lagrange then turns to a calculation he will use later: he supposes that
I/> is a function given by "a differential equation of arbitrary degree
between I/> and z, y, z, ... and the differentials of these quantities." He calls
this differential equation <I = and proceeds to find S<I. (His procedure is
quite formalistic.) In what follows the reader should realize that x, y, z, . ..
are functions defining a curve
x(t),
y = y(t),
z = z(t), ...
0,
where p, p', p", ... , q, q', q", ... are given functions of 1/>, x, y, ... , dl/>, dx,
dy, . ... Since S dl/> = d SI/>, etc., he writes this equation as
(3.25)
I3l
= p~ -
d(p'~)
+ d2(p"~) -
+ d2(p"'~) -
d(p"~)
d(q'~)
+ d2(q"~) -
d(q"~)
d3(p"'~)
... , ...
d3(q"'~)
+ d\q"'~) -
+ ... ,
+ ... ,
... ,
with similar definitions for R, R', ... , S, S', .... This gives the relation
(P 8</>
+ Q 8x + R 8y + S 8z + ... )
+P'8</>
(3.26)
where the expression outside of the integral sign is tlJe value at the upper
limit less its value at the lower limit. Lagrange next calls the integrand 'I'
and the part outside the integral, II. This gives the relation
II
J'I'
= const.;
next he lets r be the value of II at the lower limit of integration and ~ its
value at the upper limit. Then clearly ~ = r - f'l'; and he proceeds to
choose ~ so that
0= P
= p~ -
d(p'~)
+ d2(p"~) -
d3(p"'~)
+ ....
+ R'" d 28y
-J
d 28z
(Q 8x
d 28y
+ ...
+ ... ]
d 28x
d 28z
+ ... )
(3.27)
where Lagrange has used square brackets ou~side the integral sign to
indicate that the quantities are evaluated at the first end-point of the
132
o= p~ -
d(p'~)
0= (P"),
+ d2(p"~) -
d3(p"'~)
+ ... ,
0 = (P"'), ....
o=
0= [P'8cp
J(Q 6x + R 6y +
S 6z
+ ... ),
(3.28)
(3.29)
must hold separately in the variations, 8x, 8y, 8z, ... ; and hence that for
a ..; x ..; /, b ..; Y ..; m, c ..; Z ..; n, ... ,
Q 8x + R 8y + S 8z
+ ...
= O.
(3.28')
133
Y 8y
+ Z 8z + . . . = 0)
RX - QY = 0, SX - QZ = 0, . .. .
+A'" 8d 2a
8d 2b
+ ...
- ... =0.
(3.29')
In closing the section he remarks that in order to use this relation, one
must examine the problem at hand and find out what conditions, if any,
exist between j, a, b, c, ... , I, m, n, ... and their differentials.
In Section IV ([1766/1769], p. 47) Lagrange notes that
8j= '1T8a
and that these should be substituted into relation (3.29') and then terms
rearranged so that a linear equation in 8a, 8b, 8c, ... , 81, 8m, 8n, .. . results.
If, moreover, the function q, depends not only on x, y, z, ... and their
differentials, but also on the parameters a, b, c, ... , I, m, n, ... and their
differentials, then the value of 8w itself will have a more complex form. In
that case expression (3.25) for 8 W must be altered by the addition of an
expression of the form
a8a
134
J
+ 8cJy~ + Me Jy'~ + M cJy"~ + .. .
+8b J
+ 8nJ P~ + 8dn J
8cI>
= 8Z -
8d</>.
(The integral is understood to vanish at the point (a, b, c, ... ).) He then
expresses 8Z as
8Z = q 8x + q' 8dx + q" M 2x + .. .
(3.30)
= 0,
p'
= -1,
p"
= 0,
p'"
=-
d~,
P'
= ~,
P"
= 0,
= 0, ....
~
P'"
= 0, ....
135
= 6(Z),
6V
+ d'IT
6dcf>
6V
6dcf>
- d - =6(Z)+d- - d - ,
'IT
'IT
'IT
(3.31)
+ (q')6dx + (q")6d 2x + .. .
+ (r)6y
+ (r')6dy + (r")6dy + .. .
+ (s)6z
+ (s')6dz + (s")6d 2z + . . . .
(3.32)
'IT
'IT
'IT
= const.
After substituting the values of 6V and 6(Z) into this equation, he finds, in
the notation he used earlier, that
-!
P'= d~,
p"=
,
'IT
'IT
'IT
and, as before, he sets P = 0 to determine ~, finding
d~
P= - d - ,
P"' = 0, ...
~= h + gf'IT'
where g and h are arbitrary constants. As he did before, he riow proceeds
to set (P = 0 and notes that this requires ~ to be zero at x = t, Y
= m, .... To do this, he lets II be the value of f'IT at the point (t, m, ... )
and observes that h + gIl = 0 is the condition in question. He sets g = - I
and has
II
~= II P'= -1,
J'IT'
II- J'IT
P"=----
'IT
These conclusions agree also with those of Section IX of Lagrange's first
memoir (see p. 120ff above).
Since cf> again vanishes at the first end-point, f = cf>(a, b, c, ... ) = 0, and
hence 6f = O. This gives the result that 6df = 6dcf> = 6Z at the initial point;
it is also clear that at that point (cf = f(Z) = 0, so 6(cf = O. Thus the value
136
of I3df is expressible as
q I3x
As a case in point, Lagrange recalculates the problem of the brachystochrone whose end-points are free to move along two arbitrary curves in
the same plane. He chooses the coordinates of his first curve to be a, b; of
his second one, I, m; and of his minimizing arc, x, y. He further lets u be
the velocity of the heavy particle so that u = (2(y - k1/2, where k is an
arbitrary constant and the gravity constant g = 1. At the beginning of the
motiony = band u = (2h)I/2, where k = b - h. Then Jds/u is the integral
to be minimized. He therefore has in his previous notations
~= J ~,
and since ds
ds
u
Z=-
kI/2,
dx I3dx
dy My
+- +uds'
-uds
q=o,
ds
r= - -
U3 '
,
dx
q=uds'
dy
r'= - uds '
(~
= 1)
dx
Q=-d uds '
Q'= dx
uds'
Q"=o, ... ,
ds
dy
R= - - - d - - ,
3
u
uds
R'= dy
uds '
R"=O, ....
He concludes with the help of (3.28') from these relations that for I3x,l3y
arbitrary,
so that Q
Q I3x + R l3y =
and R = 0, either of which will serve to find the extremals.
137
dx = ju = j~2(y - k)
ds
for j a constant of integration; this gives him
dx=
j dy";2(y - k)
~l
- 2j2(y - k)
+ B' I>b -
(3.33)
where A', B' are the values of Q', R' at the first point (a, b) of the curve
and L', M' at the last point (/, m). But in the case with k viewed as a
parameter, Lagrange returns to his analysis on p. 133 above, where he
showed how to alter the condition (3.29').
In the present case he has (recall k = b - h)
~
= Z - dq, =
ds
~2(y-k)
- dq,
I>~
the term
= ds I>k'
u3
2..f2(y - k)3
'
then because ~ = lone has to adjoin the quantity -13k Jds / u 3 t(' ...
[(3.33)], which becomes consequently
A' I>a
+ B' I>b -
=0
'
u3
ds
dy
- - - d - =0
u3
uds
or
J u~ + udYds = const.,
which implies that
J ~ + R'
= const.
By evaluating this integral at the first point on the curve, he finds R' = B'
and at the last point, R' = M'. Thus Jds / u 3 calculated over this interval is
138
+ [M'
and since the end-curves are independent, Lagrange has the relations
[j- (M' -
B')G] 8a
He then defines the two end-curves with the help of the differential
equations da = e db, dl = 1/ dm and as a result has 8a = e 8b, 81 = 1/ 8m. It
follows from these that his conditions are now expressible as
[j- (M' -
B')G] da
0,
fdl+ M'dm = O.
+ B' db = 0,
fdl
+ M'dm = 0;
i.e.,
dx
dm
dy = - di
at the final point. These imply that the minimizing arc, in this case, cuts
each bounding curve orthogonally.
In the case that the initial velocity is zero, then h = 0 and therefore
G = H = 0 so that
fda
+ M'db =
0,
Since the latter relation is exactly as it was above, it follows that the
minimizing curve must cut the second curve orthogonally. The first one,
however, implies that
da
M'
dl
dm;
db
=-1=
this means that the slope of the first curve at the point the minimizing arc
cuts it is equal to the slope of the second one at the point where the
minimizing curve cuts it. This is the conclusion that Borda pointed out in
correcting Lagrange's first paper. 13
13Borda [1767].
139
SJ vdx= J dxSv=
Jx d(
~;)
where the superscripts 0 and I are used to signify that the quantity
indicated is evaluated at the beginning and end of the interval integration.
Along an extremal he concludes it is necessary that
av
ay
-d( avap ) = 0,
(3.34)
(Notice that Legendre seems to have invented the symbolism for a partial
derivative. He says "In order to avoid ambiguity I designate the coefficient
of dx in the differential of v by av lax and the coefficient of dx in the
exact differential by dxl dx.") Next he expands v(x, y + Sy, p + Sp) using
a Taylor expansion. The total variation of v is
av
av
2 ayap
2 ay2
2 ap2
a2v
2 ay2
. Sy2
+ 1.
2 ayap
. Sp2),
which he writes in the form f dx(P Sy2 + 2Q Sy Sp + R Sp2). (He does not
use l but S, which makes for some confusion.)
Legendre now employs a very neat trick to find his necessary condition.
He arbitrarily writes the last expression as
l J v dx = J dx (P Sy2
+ Jdx(
14Legendre [1786).
+ 2 Q Sy Sp + R Sp2)
~: Sy2+2aSySp)-(aSy2)/+(aSy2)O,
140
f v dx
(a 8y2) 0 - (a 8y2) I
+
dx [ ( p
~~ ) 8y2 + 2( Q + a) 8y 8p + R 8p 2J.
and he asks that the integrand in this expression be a perfect square, i.e.,
that a be chosen so that
(p+: )R=(Q+al.
(3.35)
-=R
Op2
must be positive (presumably he means nonnegative) and for a maximum
negative (nonpositive). He also asserts, incorrectly, that this condition is
sufficient. There is a theorem that says this is true for regular problems,
i.e., for problems for which vpp > 0 for all admissible sets (x, y, p) and for
which every (x, y, p) with PI < P < P2 is admissible whenever the sets
(x, y, PI) and (x, y, P2) have this property. Many applications are indeed
regular. (He disregards the case where R == 0 since it is so special.) He
concludes that the Euler condition plus R > 0 are sufficient for a minimum.
[As we shall see below, Legendre's argument is not flawless, but to him
goes the credit for focusing attention on the second variation. First
Lagrange showed, in general, that R > 0 is not sufficient, as asserted by
Legendre; and then Jacobi made a beautiful analysis of the problem and
uncovered yet another necessary condition in so doing (see Sections 3.5
and 4.2 below).] Legendre proceeds next to the case where v depends not
only on x, y and P = dy / dx, but also on q = dp / dx. He gives the same
sort of argument as before and concludes that now the relevant quantity is
02V /
oq 2.
141
for k > O. Legendre then calculates the second-order terms that enter into
!J.I V dt and finds
f { 2I
dx
a2v
2
-6x
ax 2
+.!2
+
+ -I -ay2 ax ay
a2v
ax ap . 26x 6p
+ "2I
I a2v
2
26x6y+ - 6y
2 ay2
a2v
ayap . 26y . 6p
+"2I
a2v 6
ap2 If'
v)
f( axav 6xd6x + a
ay 6yd6x
2}
(3.36)
as the value. From his point of view the important thing about this
expression is that it does not contain a term of the form (av/ap)6pd6x
since his method would not then be applicable.
To simplify notations, Legendre writes the integrand of the first integral
as
F 6x 2 + 2 G 6x 6y + J 6y2 + 2H 6x 6p + 2K 6y 6p + L 6p2.
a 6x 2 + 2{36x 6y
da 6x 2 + 2d{3 6x 6y
+ dy 6y2
In accord with what he did in the previous case, in effect, he notices that
y 6y2) - (a 6x2 + 2{3 6x 6y + y 6y2)1~ = O. Then he
adds this expression to (3.36). He now wishes to choose a, {3, y so that the
modified quadratic functional (3.36) becomes
I d(a 6x 2 + 2{36x 6y +
Ldx(6p +,roy
+ A8xi
where A, p. are to be determined along with a, {3, y. This will be the case
provided that
Lp.
= K + y,
dy
2
Ln
,.. =J+ -dx'
LA = H + {3,
d{3
L"l = G+,.."
dx '
LA 2 = F + da
dx
(3.37)
together with
av
ax
+ 2{3p + 2a =
0,
av
ay + 2yp + 2{3 = O.
(3.37')
142
d(
~~
d(
~; ) =
d(
~; )
+ 2J dy + 2K dp,
= 2H dx
+ 2K dy + 2L dp.
dp
da
d{3
dp
F + Gp + H dx + dx + P dx + {3 dx = 0,
dp
d{3
dy
dp
G + Jp + K dx + dx + P dx + y dx = O.
With the help of equations (3.37), these become
dp
LA2 + Lp,"}t.p + LA dx
= 0,
dp
Lp.lI. + Lp,'jJ + Lp, dx
= 0,
dp
dx +pp +A= O.
Last he notes with the help of (3.37) that the Euler differential equation is
expressible as
av ) av
a2v
a2v
a2v
ap - ay = ax ap + p ayap + ap2
dp
av
= 2H + 2Kp + L - - d~
ay
o=
d (
dx
= 2H + 2Kp + L :
dp
dx -
av
ay
+ 2yp + 2{3,
O=LA+Lpp+Ldx .
In other words, instead of seven independent equations (3.37), (3.37') to
determine the quantities a, {3, y, A, p" there are only five, which may be
written as
(J+
{3
~:
)L=(K+y)2,
av
= - yp - 2 ay ,
K+y
p,= - L - '
I
a=-{3p-2
H+{3
A=-L- .
av
143
+ 2f38x8y +
y8y( - (a8x2
+ 2f38x8y + y8y2)1
ydy 3
dx 2 + dy2
py
1 + p2 dx,
(3.38)
subject to the condition that the admissible arcs pass through two given
points. As we know, this leads to the Euler equation
py
-..!.....:'------=a,
(1 + p2)2
and if
x= a( ~
+ _1_ + lp) + b,
4p4
p2
y=
a2v
2py(3 - p2)
ap2
(1 + p2)3
-=-----
Figure 3.1
144
Figure 3.2
Figure 3.3
3.5. Excursus
145
3.5. Excursus
After Lagrange wrote his two papers in the Memoirs of the Turin
Academy, he brought out two more works on our subject, which should be
mentioned. The first of these appears in his Theory of Analytic Functions,
the first edition of which came out in 1797 and the second in 1813; it is the
latter edition which appears in his collected works. The second paper
appears in his Lectures on the Calculus of Functions, specifically in lectures
21 and 22 of 1806. This volume of lectures were intended "to serve as a
commentary and supplement to the first Part of the Theory of Analytic
Functions." IS
In his Theory Lagrange criticizes Legendre's method (TAF, p. 305):
In this way one would achieve the same result as is given by the method
proposed in the Memoirs of the Academy of Sciences of 1786 for
distinguishing maxima from minima in the Calculus of Variations. But
after what we have said above it would be necessary for the correctness of
this result that one be able to make sure that the value of v [this is the
negative of Legendre'S function a in equation (3.34)] could not become
infinite for a value of x lying between the given values a and b, which will
most often be impossible to do because of the difficulty of finding the
function v(x). Without this requirement, although the quantity
p(w' + ~~ f.
146
b(nw2
(3.40)
.2( -2-x
1 + n + -4-sm
1 - n . 2x -
m
2)
TCOS
x + c,
(3.41)
f(x, y, y')dx
147
3.5. Excursus
-dx-
=-
(Q + a)2
-'---R--'-- ,
f, C + f]
elsewhere.
=a
JC -
Q + A )2
"'i' + -R-- 1] dx.
But R(x) < 0 on the interval [c - f, C + f], and thus this integral" 0, i.e.,
the second variation, would be nonpositive. The equality sign above,
moreover, could hold only if
1]
'+ -Q+A
-0
R-1]=
lim
x=c-e
(x -
'+ Q+A
-1]
+ f)( X
C -
f)
-4f*'0;
thus the integrand is not zero near x = C - f. It then follows at once that
< 0 is impossible, and hence it is necessary that R(x) ~ 0 on [a, b].
R(c)
148
149
+ tq2q,II(y) + ...
t p2q,II(X) + pqq,"(X, y)
= 0,
(3.42)
where his notation is somewhat peculiar. By q,'(x), q,'(y), q,'(z), ... he means
aq,jax, aq,jay, aq,jaz, ... , and by q,"(X), q,"(X, y), q,"(y), ... he means
a2q,jax2, a2q,jaxay, a2q,jay2, .... He now says that one could solve (3.42)
for p and substitute that value in the expansion for !(x + p, y + q, z +
r, ... ) or, better yet, multiply (3.42) by an undetermined quantity a and
find as a necessary condition that, in his notation,
f'(x)
+ aq,'(x) = 0,
f'(y)
+ aq,'(y) = 0,
.r
(3.43)
(3.44)
He says that from the discussion above, one simply adjoins the q" multiplied by a variable .:l, to! and regards the variables as being independent.
He then finds, in his notation, the two equations
= 0,
+ .:lq,'(z) -
[.:lq,'(z')]'
+ [.:lq,'(ZI)]"
- ... = 0.
150
He also takes up the isoperimetric case and considers the integral (3.43)
being a maximum or minimum subject now to the condition that
,Z, z',
(3.44')
Here he says that f + Ilq, is again considered, but that now Il is a constant,
which can be determined from the value of the constant in (3.44').
In his Lecture 22 Lagrange again discusses the so-called Lagrange
problem. 18 He takes as before an integral (3.43) and the side-condition
(3.44). He says "it is considerably simpler to employ the multipliers in the
way that they are used in the Mecanique Analytique, which is totally based
upon the calculus of variations." He then goes on to another most
important case. He considers now that his integral (3.43) has adjoined to it
one or more end-conditions which he writes in the form
ues of x, Y, y', ... ,z,z', ... at the ends of the arc being examined. He
remarks that the variation of <P = 0 is
xo<P'(xo) + Yo<P'(yo)
+ zo<P'(zo) + .. .
+XI<P'(X I ) + yl<P'(YI) + ZI<P'(ZI) + ...
O.
18Lagrange, TeF, Vol. X, pp. 414-421; BoIza, VOR, pp. 543-546, and 566-569.
X2
/(x, y, y')dx
(4.1)
XI
y(x l ) = YI'
(4.2)
152
y~(x,ao,bo)
is different from zero, where Y' means dy I dx and Ya' Yb mean ay laa,
ay lab, respectively.
XI
(f;,' 1/
k.
1/ ) dx =
X2
f;" . 1/ I +
XI
11
X2(
f;, -
d)
f;" 1/ dx,
dx
(4.4)
where
2Q( x, 1/, 1/') = f;,Y 1/ 2 + 2f;,Y'1/1/'
(4.6)
'1
dx
'1
= O.
(4.7)
Consider now Euler's differential equation (4.3) above, whose coefficients are evaluated along the family of extremals y = y(x, a, b). Then (4.3)
is an identity not only in x but also a and b. If this relation is differentiated
with respect to a or b and if 1/ == Ya or Yb' then clearly
f;,y 1/
+ f;,y'1/'
:x
(f;,y'1/
+ f;,'y'1/')
or equivalently
d
(P - Q')1/ - dx (R1/') = 0;
i.e., relation (4.7), which is called the Jacobi differential equation, is valid for
ay laa or ay lab and hence for a linear combination
1/ =
u = aYa
+ /3Yb
153
4.1. Excursus
= kYb(xl,ao,b o),
{3
= -kYa(xl,ao,bo)'
XI) =
IYa(xl,ao,b
Ya(x, ao' bo)
)
o
(4.8)
={
d(X'X I)
[XI,X3]'
[X3' xz].
(4.9)
We see at once that TJ(x I) = 0 = TJ(x z), and except at the possible corner
= x 3 ' TJ satisfies Jacobi's differential equation (4.7) since bothYa andYb
do. Then with the help of (4.7) and (4.5) we have it(TJ) = 0 on [XI' xz], and
the second variation vanishes for this choice of TJ. Moreover, this TJ is not
identically 0 as can be seen directly from the existence theorem for
second-order differential equations. In fact, for Euler's equation, it is well
known that the family of extremals can be so chosen that at X = X I the
determinant
y;(x,a,b)
D(x,a,b) =
(
b)
Ya x,a,
X
It is clear, moreover, that dd(x, x I)/ dxlx, = d'(x, xI)IX, = D(xI' a o' bO>, and
so d' is different from zero at xI" It is then evident that d is not identically
zero. Any points X3 -=1= XI at which the determinant d of (4.8) vanish are
called conjugate points to the point XI' Thus the theorem above guarantees
that if there is a point X3 conjugate to XI on [xl'X Z]' then the second
variation cannot be positive for all TJ '# 0 vanishing at X I and x z . I In fact,
we can show that there are such variations 1/ which give it negative values,
as we shall now see with the help of the Weierstrass-Erdmann corner
condition applied to the accessory minimum problem. To do this, express
the Jacobi differential equation in the integral form
{2", =
where d is a constant.
i {2"dx+ d,
X
XI
I Since for a minimum the second variation must be nonnegative for all variations vanishing
at XI and x2' the problem of finding an 'II which makes the second variation a minimum is
often called the accessory minimum problem.
154
Thus
R . 1]'(x - 0) = Q'l'(x, 1], 1]'(x - 0 = Q'l'(x, 1], 1]'(x + 0 = R . 1]'(x + 0),
= y(x)
X3
:x
u'lt(v) - v'lt(u) = -
[R(uv' - u'v)],
fx
[R(uv' - u'v)].
-p
fx
(Rp'u 2) =
fx
e"
(Rpp'u 2) + R(p'u)2.
XI
(4.10)
155
4.1. Excursus
as we see with the help of relations (4.5). (This expression, as will be seen,
is related to Legendre's form for the second variation. It is often called
Jacobi's form. If there is no point X3 between XI and X2 conjugate to XI'
then u(x) = ~(X,XI) does not vanish on (x l ,x0, and hence (4.10) implies
that the second variation is positive-recall the assumption that f;,y = R
*0 on [x l ,x2]-for all variations 1/ vanishing at XI and x 2
Theorem. Let Y = y(x) define an extremal arc on the interval [x l ,X2 ]
embedded in the two-parameter family Y = y(x, a, b) described earlier and
suppose that there is no pOint X3 between XI and x 2 conjugate to XI. Then the
second variation defined along y(x) is positive for all variations 1/ vanishing at
XI and x 2
= y(x,ao,bo)
and havingYa(x l , a o) =1= O. Let us see how this family is then related to the
two-parameter family y = y(x, a, b) when Yb(X I' ao' bO> =1= O. Consider the
equation YI - y(x l ' a, b) = O. It can be solved for b as a function of
a, b = B(a), so that Y = y(x, a, B(a with bo = B(aO> is a one-parameter
family of extremals all passing through (X., YI). It can, moreover, be shown
that near a = a o the two families Y = y(x, a, B(a and y = y(x, a) can be
related; in fact, there is a function A (a) such that
y(x,A(a
= y[x,a,B(a)],
/C
= - kyAx.),
Yb(x,ao,bo)
Yb(X I, ao' bo)
then
since y(x .. a):; YI. From this it follows that the points X3 conjugate to XI
are equally given by the zeros =1= X I of ~ or of Ya. Then consider the curve
determined by the parametric equations y = y(x, a), Ya(x, a) = O. This
curve is well known to be the envelope of the family Y = y(x, a). It is
patent that along this envelope Ya vanishes, and so the point X where the
minimizing arc meets the envelope is conjugate to XI since ~(X,XI)
= KYa(X, a o) = O. If the envelope degenerates into a single point (X3' YJ),
then the curves of the family all pass through the points (XI' YI) and
156
(X3' Y3)' and no minimum can exist over any interval containing [X I,X3]?
This result, which we discuss below, has great elegance in that it gives a
remarkable geometrical significance to what perhaps might have otherwise
seemed like an analytical shortcoming.
It is also possible to establish a converse of this result. Suppose that X3 is
conjugate to XI in the sense that Y",(x 3 , 0: 0) = 0, and that y",Ax 3 , 0: 0) =1= o.
Then there is a unique solution X = ~(o:) of y",(x, 0:) = 0 through (x 3 ,0:0)
and no:) = - Y",a/Y",x. We now can see that the curve X = ~(o:), Y = 1](0:)
= y[~(o:), 0:] is the envelope of the family of extremals Y = y(x, 0:) near
(x 3 ,0:0 ). To do this, notice that along this curve y", == 0, and the conclusion
then follows directly.
a'i
a'ia ' ww'+ --w'w'
a'i) dx
J( -ww+2-a
ay2
Y Y
ay,2
2These results on envelopes were apparently first considered for geodesics on a surface by
Darboux, (TDS, Vol. II, p. 88), by Zermelo ([1894], p. 96), and later by Kneser ([1898], p. 27,
LV, p. 93, or LV', p. 116). Note that great circles on a sphere are instances of extremals for a
problem in which the envelope is a fixed point.
3 Jacobi [1838]. It is noteworthy, perhaps that as early as 1770 Laplace interested himself in
the second variation but got nowhere.
4Bertrand [1841], Clebsch [1858], [1858'], Delaunay [1841], Hesse [1857], V.-A. Lebesgue
[1841], Mainardi [1852], and Spitzer [1854]. There were also a considerable number of others
who worked on consequences of these papers, but the next major figure was clearly
Weierstrass, and the men in between Jacobi and him only prepared the way.
157
and says that it is necessary for an extremum that aj/ay,2 always have the
same sign. He then turns attention to the function v of Lagrange-v = - a, the function in (3.35) above Legendre used--<lefined by
the differential equation
= y(x,a,b)
a'i
I a'i
= - ( ayay' + -;; ay'2
dU)
dx
(4.11)
and that this expression for v contains just one arbitrary constant, !3 / a.
Notice that the function v becomes infinite precisely when u vanishes,
since u' =1= 0 at such points. [Otherwise, u, a solution of (4.7), would be
identically zero.] This is precisely Lagrange's objection to Legendre's
result: v becomes infinite just at the values X3 conjugate to x I.
To see that the function a = - v in (4.11) satisfies Legendre's equation
(3.35) above, let P = a'i/ay2, Q = a'i/ay ay', R = a'i/ay'2. Then since
u satisfies Jacobi's equation (4.7) above and since u'/u = (v + Q)/ R by
(4.11),
a
=-P+ R(v+ Q ),
(4.12)
158
t- -ddX t,+
d2
-2
dx
k=O
(4.13)
and, in general, a complete family of extremals will contain four parameters a, a p a 2 , a3 He sets <5y = wand has <5y' = w' and <5y" = w". He then
expresses the second variation in the usual form
a'i, + 2
a'i
J( aya'i ww+2~ww
a a-2
"
"ww +
Y Y
a'i"
ww
--2
Y yay'
,,, + -a'i
",,) dx.
+ 2 a a'i
'a "ww
-2 W W
y yay"
Now for a maximum or a minimum, Jacobi notes that t"y" must have the
same sign on the interval [X I ,X2 ). To simplify notations, let us write his
integrand as
2fl(w, w', w") = Pw 2 + 2Qww' + Rw'2 + 2Sww" + 2 Tw'w" + Uw,,2.
)
dv )(
dV 2
( P + dx
R + dx + 2vI
dV) = (S + VI) 2 ,
U ( P + dx
dV I )2
dx
'
= (Q + v +
dV2
U ( R + dx + 2vI
(T + V2)2.
ay
ay
aa + 0: 1 -aa-l
ay
ay
ul = /3 -a
a + /31-aa-l
U
0:
ay
ay
+ 0:2 -a
a + 0:3 -a
a
2
ay
ay
+ /32-aa- + /33 -a
a
2
'
'
(4.14)
or
159
= (P -
Q'
dx
(S'IJ
=
+ T'IJ' + U'IJ")
0,
(4.15)
the generalized form of Jacobi's equation (4.7) above. The second variation
is then representable as
(4.16)
Since (4.15) is a linear differential equation of the fourth order having both
" and as solutions, it is evident that there must be relationships between
the parameters a, aI' a l , a 3 , p, PI' Pl ' P3 In fact, Jacobi remarks that
even the six quantities aP I - a l p, aP2 - a2 P, aP 3 - a3 P, a2 P3 - a3 P2 ,
a 3 PI - a l P3 ' a l P2 - a l PI are not independent but that he is not going to
set out the relationship.
By analogy with what he did before, Jacobi relates Legendre's functions
v, VI' V2 to " and "I with the help of the relations
"I
VI
,
v= -v l
= -S+ U
Q - U
(uu~ - u IU,)2
"'u~
uu~
- u~u"
- ulu' ,
Let us see how he transforms the second variation with the help of these
functions. To do this, he first states a very elegant result on self-adjoint
differential operators. The way he establishes this theorem is not indicated.
What he says is, however, quite interesting and revealing. He considers the
differential operator
Ay
d(Aly')
d
X
d l (A 2 y")
~
d 3 (A 3 y"')
~
+ ... +
dn(A,J'(n
d n
= Y,
d(Alu')
dx
d 2(A 2u")
dx2
+ ... +
dn(Anu(II
fixn
1= yU
5 Jacobi [1838J, p. 44. As we mentioned earlier, numerous papers were written giving proofs of
his results. In addition, a very elaborate literature grew up about the second variation, whose
purpose was to show how, by suitable transformations, it could be changed into a form that
was surely positive. Most of this literature is now quite obsolete and can be substantially
simplified by newer understandings. In fact most of -this modem approach grew out of
Weierstrass's concepts and of a better way to formulate general problems in the calculus of
variations, as we shall see below.
160
will be integrable, i.e., one can specify its integral without knowing t, and
this integral has again the form of Y, except that n becomes I smaller;
one has, namely:
yu dx= Bt'
d(B1t")
dx
d 2(B 2 t"')
dx 2
+ ... +
d n- 1( Bn_1t(n
dx n- 1
With the help of these remarks Jacobi now illustrates the use of his
theorem in the case where the integrand is j(x, y, y', y"). He begins by
representing the second variation in the form J8V 8y dx, where he has
8V=A8y+
d(A 18y')
d 2(A 28y")
d
+
,
X
dx 2
and 8V = 0 for 8y = U, one of the solutions given in (4.14). This form for
8V follows'directly from the expression (4.16) for the second variation with
A = P - Q' + S", AI = -(R - 2S - T'), A2 = U. He now uses his result
above on self-adjoint expressions to write
8y
8'y=- ,
U
161
~xy
]8'Y- J[B8'Y'+
~xY
]8'Y'dX,
where he has used the expressions 8'y',8'y" to stand for the derivatives
d(8'y)/ dx, d(8'y')/ dx = d 2(8'y)/ dx 2 In the right-hand member of this
equation the integral, as well as the expression outside the integral sign, are
understood to be evaluated between the given limits.
In the equations above he calls the integral f VI 8'y' dx and says that
VI = 0 if 8'y = uI/u, i.e., if 8'y' = (uu; - u I U')/U 2 ; here U and u l are given
by equations (4.14). In order to use his theorem again he defines a new
variation 8"y = u28'y'/(uu; - u'u l ). Notice that
VI = B8'y'
+ d(B I 8'y")/dx
UU' - U u' )
J
I u2 I
8"ydx=C8"y'.8"y- C(8'y,)2dx,
( UU 'I -
u2
UIU
, )2
BI =
('
UU I -
UIU
u2
,)2
A2
and that A2 = aJ/ay,,2 (see Section 4.2 below). He is now able to conclude
that A2 must always be positive [nonnegative] for a minimum and negative
[nonpositive] for a maximum. He then observes that one must examine
whether 8"y' remains finite on the interval between the end-points. This,
he says, can be done by an examination of U and u l as soon as y, the
complete solution of V = 0, has been found.
Jacobi now proceeds to formulate a geometrical interpretation for his
condition. He also states a sufficiency theorem which is only partially
correct as he has formulated it. (His result is, in fact, only true for a
"weak" minimum, i.e., for the case when the comparison arcs are close to
the given arc in both position and direction. Weierstrass gave an example
in 1879 to show this; but it was not until the turn of the century that
Kneser in his Lehrbuch, L V, explicitly distinguished between weak and
strong minima.) Jacobi's words are these 6 :
6 Jacobi [1838], pp. 46-47. Stickel in Ostwald's Klassiker, No. 47, p. 108 tells us Bertrand tried
to argue that even if a conjugate point is passed, the property of a curve to furnish an
extremum may still hold. This is false, as Weierstrass proved in his 1879 lectures. See Section
5.5 below.
162
and consequently
= y[ x2(h), a o + h]
163
Figure 4.1
from which it follows by the definition of the envelope that x 2(O) = ~(ao)'
Y2(O) = 1J(ao), i.e., lim xi h) = x 2 , lim Yih) = Y2, as asserted. 7
To illustrate how his condition operates, Jacobi discusses an example
from dynamics involving the principle of least action. The problem involves the motion of a planet about the sun. Jacobi says in the first place
that Lagrange was wrong in believing the action integral could ever be a
maximum; he further says that it is not even always a minimum. He
proceeds to observe that if the end-points are not properly chosen with
respect to each other, the integral will be neither a minimum or maximum.
In Figure 4.1 he imagines a planet is at a point a somewhere between
perihelion, the point nearest to the sun f, and aphelion, the point furthest
from f. It starts at a and proceeds to b along the ellipse whose major axis
has length 2A and whose focus f is at the sun's location. The other focus is
then to be determined by the property of the ellipse that af + aI' = 2A
= bf + bI'. Jacobi uses this fact to draw circles with centers at a and band
with radii 2A - af and 2A - bf, respectively. In general, these circles
intersect in two points and in one point when they are tangent. This latter
case occurs only when the line ab passes through the focus I'. Thus in
general there are two ellipses and in the latter case, just one. (In the next
section we discuss Jacobi's point in detail.)
He draws the chord aa' through the focus I' and says "in accordance
with the general rule the other epd-point b must lie between a and a', if the
ellipse is to make the integral appearing in the principle of least action a
minimum. If b falls on a' the second variation cannot become negative, but
zero, so that the variation of the integral is of the third order and so may
be positive or negative. [In Section 7.5 we mention Erdmann's analysis of
the third variation for such cases.] If b falls outside of a', the second
variation itself can [will] become negative."
Jacobi then turns to the case when the initial point a lies between
aphelion and perihelion (see Figure 4.2). He now draws the chord aa'
through the sun f and rotates the ellipse about afa', finding "infinitely
'There is a nice discussion of the geometrical significance of conjugate points in Bliss, LEe,
pp. 34-36. See also Section 4.1 above.
164
Figure 4.2
many solutions to the problem. If, therefore, the second end-point in the
latter case lies beyond a', there will be a space-curve joining the two
end-points for which f v ds has a smaller value then for the ellipse."
f
a(ro,D)
Figure 4.3
(2A, Q)
165
Figure 4.4
t mv 2 = T = (U + H),
(4.17)
I.
r/
= 0,
\r,2 +
(The first relation implies that the areal velocity is constant and is a
statement of Kepler's law that the radius vector sweeps out equal areas in
equal intervals of time.) It is now convenient to change both the independent and the dependent variables from t, r to cp, u = 1/ r. After a little
166
u +u=-=CJl'P
h2
p'
where, e.g., uCJl'P = a~/aq? The solution of the second equation is clearly
the conic
U
1. [1 p
e cos( cp -
X)]
(4.18)
t mh2( u; + u2) =
k 2mu
+ H.
m r~'2 _ k 2m
2
r
k2
= A (I
+ e)
h 2m _ k 2m
2r2
r
1= -
[ A (1 - e 2)
2A (1 + e) - 1
k2 ( L
r 2r
_ 1)
k 2m
2A
).
With these facts at hand we can now proceed. The first thing is to show
in Figure 4.3 that f3 = a. To do this, note that
Vx
and that at cp = 0
vx
vy=vosma= - ,
ro
2e.
2
2 2a = 2h-k (2A - ro).
SIn 2 a = vosm
smX;
A~
moreover, by the so-called law of sines for the triangle aff' in Figure 4.3
(2A - ro)
2Ae
sin(a + (3) = sinX
167
Putting these together, we see that sin(a + f3) = sin 2a, and therefore we
have f3 = a. Thus we have the relations
h2
k2
= r5
Ie
=2-
ro
e cos X = 1 -
1..
v5
k 2 sin2 a
e sm X =
A'
ro
= ro(
2A - ro ) 2
A
sin a
= role sin2 a,
"2Ie.sm 2 a,
1 - re sin2 a.
With the help of these relations the one-parameter family (4.18) of ellipses
passing through the point a may be expressed in terms of the parameter a
as
rore sin 2 a = O.
(4.19)
:~ = fa = rle(sin 2a cos cp -
= O.
(4.20)
[It is not difficult to show that these are the straight lines through the
points a and f'; the locl,ls of intersections of these lines (4.20) wi~h the
ellipses (4.19) is clearly the locus of points conjugate to XI. This locus is
also the envelope.]
To find this envelope, let us substitute the value of r from (4.20) into
(4.19). After some manipulation, we find that 2 tan cp/2 = re tan a. We can
use this relation to eliminate a from either (4.19) or (4.20). One way to do
this is to set
cp
.
2' cp
SIn a = 1\ SIn "2 '
cos a = reA cos "2 '
in (4.20). This gives us the partial result for cp =F 0, 7T that
r=
but
1 = (4 sin2
I+
and consequently
2rero A2
(1 - re 2A2)
re 2 cos2
+ re (2 - re)A 2 cos cp
I )A = [
2
(2 +
~2 ) -
(2 -
~ ) cos cp ]A 2
168
which implies directly that the equation of the envelope is the ellipse
T=
41(TO
(4 - 1(2) - (2 - 1()2COSq>
4A (2A - TO)
(4A - TO) - TOCOSq>
(4.21)
It is clear that this ellipse passes through the point T = 2A, q> = 0, that
the sun is at one focus, and that the point a is at the other since the major
axis has length = (4A - TO> and eccentricity = To/(4A - TO>. Consider now
Jacobi's point b in Figure 4.3; if it lies inside ellipse (4.21), then the circles
about a and b with radii 2A - af= 2A - TO and 2A - bf= 2A - TI intersect in two points f' and f". Thus there are, in general, two members of the
family (4.19) of ellipses through a that also pass through b; if b lies on the
envelope, there is only one; and if b lies outside, there are none since the
circles of radii TO' TI about a and b do not meet in any point f'.
As Figure 4.3 is drawn, point a lies between perihelion and aphelion.
However, if a lies between aphelion and perihelion instead, the situation is
quite different. This is the situation as depicted in Figure 4.2. In this case
point a' conjugate to a (Figure 4.2) is on the line segment afa' through the
sun at f. If b coincides with a', then not only is the elliptical arc through a
and a' (Figure 4.2) an extremal, but so are all rotations of this arc about
the line afa'. The envelope has here degenerated into the a'.
Jacobi uses these conclusions to state the rule at the end of Section 4.2
above: in the case when a lies between perihelion and aphelion, draw the
chord aa' from a through the focus f'; then point b must lie between points
a, a' if the action integral Jv ds is to be a minimum (see Figure 4.1). If b
lies at a', then the second variation of the integral can be made zero, and
decision must be made on the basis of the third variation. "If b lies beyond
a', then the second variation itself can become negative." He next takes up
the case where the first end-point a lies between aphelion and perihelion.
Now point a' is determined by the chord drawn from a through the focus f
as in Figure 4.2 since there are infinitely many solutions obtainable by
rotating the ellipse about the segment afa'. If the second end-point lies
beyond a', there will be a space curve through the given end-points for
which Jv ds is smaller than for the ellipse.
169
+ n'~,;l
[xr-Z[y]z+ ...
+ [y)",
+ 1) and
Lebesgue divides both members of this relation by 1.2.3 ... n, and writes
(x,m)
x(x - I) ... (x - m
,
m.
+ I)
,(x,O) = 1.
(x - y,n) = ~ (-It(x,x - n
z-o
+ z)(y - I + z,z);
(4.22)
To prove this result and to determine the form of the Bj' Lebesgue first
chooses I to be a constant and has
Bo = Y {AoY
Aj
daA j
dx a
9Delaunay [1841]. Delaunay later [1843] wrote a prize paper entitled "Memoire sur Ie calcul
des variations." It is concerned with the problem of finding among all curves with continuously varying tangents and constant curvature, which join two fixed points, the one with
greatest or least arc-length. There is a very nice discussion of this problem in Caratheodory
(1930).
170
+ ... + (Bnt(n)t),
in which the
Dj = CjJ'
Dot
U=
Dlt'
= (BIt')'
D~
+ ... - D(2n-3)]t"
+ [D 4 - D'S + ... + D(2n-4)]t"'
+ ...
2n
2n
(B2t")' + (B3 t"')" + ... ,
[ D 3 - D'4
=
+ 3D~' -
...
+ (2n
- 3)D4:n- 4).
+ 1)D(2n-2a+l)
2n
(4.26)
171
and
Ba = D 2a - (a,
I)D~a+1
+ (a +
1,2)D~~+2
+ ...
+ (2n - a - I, 2n - 2a)Di;n-2a>.
(4.26')
(D,b,a) = Db - (a,
I)D~+I
+ (a + 1,2)D;'+2 - .. "
(4.27)
(D,b,a)
= [Cb - (a,
I)C~+I
+ (a + I, 2) C;'+ 2 - ...
+ I, I)C~+2
+(a+2,2)C~'+3+'"
+ (a + 1,2)[ Cb +2
Jy
(a
+ (a + 3,2)C;'+4 + ...
Jy'
+ 2, I) C;+3
Jy" + "',
11 V.-A.
(d, I)C;+I
172
... + (i, a +
= ... + (i , a +
e - i)(a + e'
e +I
2)A2i-c-aya-2
MA}.,-2. a +
LA}.,-Iya
I
KAJ'ya = CC'
+ L'A}..,,a-l=
Cc+l'
if
+ M"A!'-a-2=
C
if
c+2'
NA~-3ya + N'A}.,-2. a - 1 + N"A~-lya-2 + N"'A}..,,a-3 = C
.Y
if
c+3'
M'A~-Iya-I
1.Y'
(4.29)
where the coefficients K, L, L', ... are given in the equations above and
A = 2i - e - a. (Lebesgue's array is different from this but is incorrect. His
reasoning about the array is also flawed; however, his results are correct.)
The diagonals in this array determined by a fixed value of the superscript
on y may be used to evaluate fa' The terms in y(a) in this array contribute
to fa the value
(i,a + e - i)[(a + e,e) - (d, l)(a + e,e + I) + (d+ 1,2)(a + e,e + 2)
+ ...
JA
i}.,
(4.30)
as can be seen with the help of Vandermonde's relation (4.22) above with
n = a, x = a + e, y = d. The next term in fa comes from the second
diagonal of the array in (4.29) given by the exponent a - I on y, provided
that the relevant coefficents are multiplied by the factors I, (2, I), (3, I),
... , which enter when the appropriate quantities have been formed.
Thus in the formation of fa the coefficients -(d, I), (d + 1,2), -(d + 2,3),
... , which enter into (4.30), are altered for the next term. They are now
multiplied by I, (2, I), (3, I), ... and become -(d, I), +(d, I)(d + I, I),
173
- (d, 1)(d + 1, 2), + .... It is also clear that in (4.30) c needs to be increased to c + l. It follows, consequently, that the second diagonal contributes to fa the term -(d, 1)(i, a + c + 1 - i)(a + c - d, a)Ai2i-C-a; by
analogous reasoning, the third diagonal contributes
A?i-C-a(a+c-d,a)
2i-c-u
2:
z=o
(-I)'(i,a+c-i+z)(d-l+z,z)
A/i-b-a-Py<a>y( P).
This gives Lebesgue the term in ya . yp. To find in (D, b, a) the term in
y Pya, he appeals to symmetry and finds
( - I)a (a
X
Ai2i-b-a-PyPy a
a - a, 2i - b - a - {3)
174
q'
=p
- h
.
p(p-I) ... q'
(p,h)= ----:--~
12 ... h
= (_I)h( -
q', h)
12 ... h
= (- -l)h(q, h),
and thus (- I)"'(p, h) + (- l)fJ(q, h) = (q, h)[( - I)",+h + (- l)fJ]. But 2(ia) + I = (0: + h) + {3 from the definition of h, and therefore the pair 0: + h
and {3 must consist of an even and an odd number, which means that the
expression above is zero. It follows directly that
Q = (a + 0: - I,o:)(a + {3 - I; {3)[(-I)"'(p,h) + (-I)fJ(q,h)] =
o.
(0: = {3).
Q=(-I)"'(a+o:-I,o:)(a+o:,o:)(th,h)
(4.32)
(q,h)=(-q',h)=(-I)h
= (_l)h
=(-I)h(p-I,h)
-q
(p,h).
p
Moreover,
o:+a
(a + 0:,0:) = - - (a + 0: - 1,0:),
a
(a
+ {3,
{3+a
{3) = - - (a
a
+ {3 -
1, {3);
and, as a consequence,
Q = (a
+ 0: -
X [( _
I,o:)(a
+ {3 -
I, {3)(p,h)
I)'" {3 + a _ ( _ l)fJ + h 0: + a .
a
!!.. ].
p
175
+a
0:
+a
- - - - --=
a
p
a
i( f3 ap
0:)
and hence
Q = (-It
i(f3 -
ap
0:)
(a +
0: -
= Bol +
+ 3A;yy'.
; = 2, he has (h,o:, f3) = (0,0,2), (0, I, 1), and (1,0, I), which correspond to
Q = 4, -2, and 2 and the terms 4A 2yy" - 2A 2y'y' + 2A 2yy'. For i = 3,
there are six solutions: (0,0,4), (0, 1,3), (0,2,2), (1,0,3), (I, 1,2), and
(2,0,2), corresponding to the values of Q 6, - 6, 3, 9, - 3, and 3,
respectively; thus he has the terms
6A 3yy'''' - 6A 3y'y'" + 3A 3y"y" + 9A;yy'" - 3A;y'y" + 3A;yy".
+ 3A 3 y"y" + 9A;y'y'"
- 3A;y'y"
+ 3A 3 yy".
Finally, since
Bo = Y[AoY + (A.y,), + (A 2y")" + (A 3 y"')"'],
176
A3yyVi
This completes Lebesgue's paper. As we see, his method is tedious, and the
various other authors who have attempted the solution have run into more
or less the same difficulties in their papers. Since Jacobi's result is no
longer central to the calculus of variations, I do not discuss the other
methods that were used.
12 Jacobi [1838), p. 50. The paper is a letter to Encke, who was then secretary of the
mathematics-physics section of the Berlin Academy of Sciences. Recall that the principle of
living force is what we now call the principle of conservation of energy. (We discuss
Hamilton's results below in this section.)
I3W. Hamilton (1834), (1835).
177
the present essay, this problem is reduced to the search and differentiation
of a single function, which satisfies two partial differential equations of
the first order and of the second degree.
He then proceeds to find these partial differential equations. (As we see
below, Jacobi shows that only one of these equations is needed.) He
considers a system of n heavy particles, each of mass m; and coordinates
(Xi' Yi' Zi) (i = 1,2, ... ,n). To carry out his analysis, he supposes that
there exists a force function U and that T = 1L . m(x,2 + y,2 + Z,2) is the
kinetic energy so that the "celebrated law of living forces"
(4.33)
T= U+ H,
J~ .
m (dx . 8x'
=J
+ dy . 8y' + dz
. 8z')
(4.34)
When V, the action integral which Hamilton calls the accumulated living
force, is defined as a function of X, y, z, a, b, c and H, by
V= J~'m(x'dx+Y'dy+z'dz)= fo'2Tdt,
(4.35)
+ y'8y + z'8z) -
~ 'm(a'8a
-;- = mixi ,
uXi
and that
8V
,
-;- = -miai ,
uai
8V
b'
8b., = -mi i'
,
-8V
aCi = -mc
' ,
(i = 1,2, ... , n)
(4.36')
(4.36")
178
t ~ .! {(~~ f+ ( ~~ r+ ( ~~ f}
= U+ H
(4.37)
t ~ .! {(~f + ( ~r f + ( ~f}
= Uo + H
(4.37')
(4.38)
(4.38')
uXj
8S
uaj
-;- = -mjaj
He then says that the "limits of the present essay do not permit us here to
develope the consequences of these new expressions. We can only observe,
that the auxiliary function S must satisfy the two following equations ... ,,15
8S
8t
+~._l
8S +
8t
2m
~ . _1
2m
{(8S)2+(8S)2 +(8S)2}.=U,
8x
8y
8z
{( 8S
8a
)2 + ( 8S
)2 + ( 8S
)2} = U. .
8b
8c
0
(4.39)
(4.39')
179
It seems clear from its position in the last section of the paper that he only
came on S at the last moment and in the 1835 paper made it the chief
function of his work. In that latter paper he calls S the principal function of
motion of a system. 16
To get at the paramount role of S, in his second paper Hamilton makes
uses of an elegant result first deduced by Lagrange in his Mecanique
Analytique,17 where Lagrange changes from the variables (x;, y;' z;), (i = I,
2, ... , n) to new ones 1/" 1/2' ... , 1/3n and shows, without any reference to
the calculus of variations, that the differential equations of motion now
assume the form (in Hamilton's notation)
(4.40)
~~
= wa
(a
(4.41)
~
UWa
= 1/a'
(4.43)
"T being here considered as a function of the 6n quantities of the form 1/'
and 1/, ...." To derive the equations due to Lagrange (4.40) above,
Hamilton notes with the help of
= 1,2, ... , n)
(i
~ .m(x"
8x +y" 8y +Z" 8z )
81/;
~
(x, -8x
= -d 4.Jm
dt
~;
81/;
+y , -8y +z , -8z )
81/;~;
81/;
- ~
L.Jm (x, -d -8x +y , -d -8y +Z , -d -8Z) .
dt 81/;
dt~;
dt 81/;
(4.44)
Since
,
, 8x
x = 1/1 81/1
+ '
8x
1'/2 81/2
...
' 8x
+ 1/3n ~3n
'
16Hamilton (1834), pp. 162-211. Notice that the integrand function T + U is the so-called
Lagrangian function and T - U the famous hamiltonian function.
17Lagrange, Mecanique Analylique, 3rd ed., Vol. I, pp. 290-292.
180
~
( , Bx +y ' -By + z-' Bz )
~'m x -BTJ;
=~'m
BTJ;
BTJ;
( , Bx'
, By'
, BZ')
x ~+y ~+z ~
uTJ;
uTJ;
uTJ;
BT
= u'l;
JI.,,'
( ' d Bx
~ . m x dt BTJ;
+ ,d By + ,d BZ)
y dt BTJ;
,Bx'
BTJ;
, By'
BTJ;
Z dt BTJ;
, BZ')
BT
=--.
BTJ;
BTJ;
(
=~'mx-+y-+z-
= F - U = F(w w2' ... , w3n ' TJ .. TJ2' ... , TJ3n) - U(TJI' TJ2' ... , TJ3n)
BH
BTJa
dWa
dt
-=--
(a
(4.45)
(These are often called the canonical equations of motion and TJa , Wa the
canonical variables. Lagrange had introduced these equations to discuss
perturbation theory; see Mecanique Analytique, 3rd ed., Vol. I, p. 310.) This
system is then precisely equivalent to that of Lagrange (4.40). To find its
solutions, Hamilton considers his function S in the form
(4.46)
and forms the variation of S "without varying t or dt" (see p. 181 below or
Hamilton [1835], p. 166). This gives him
BS
= ('BS'dt,
BS'
Jo
= ~ (wB BH - BH BTJ).
Bw
BTJ
BS'
dt
dt
dt
and hence
(4.47)
where Hamilton uses e and p as the initial values of TJ and w. This relation
may then be decomposed into the equations
BS
wa=-g-,
TJa
BS
Pa=-S;a
(4.47')
181
~F
~H
~1I'
~w
~w
2T=2F(w,1I)=~1I-=~-W=~w-.
Then
S'=~'w ~:
= {(T+ U)dt
(4.48)
has the property that the Euler equations found by minimizing S "are
precisely the differential equations of motion ... [(4.40)] under the forms
assigned by Lagrange." (See also Section 2.4 above.)
He next proceeds to find the famous Hamilton-Jacobi partial differential equation. To do this, he notes that the variation of S in (4.47) omitted
the variation of t, the time. To find ~S / ~t, he observes that
dS = ~S
dt
&
+ ~ ~S m,
dt
~1I
so that
~S
&
S' -
L.
~H
~w
U - F.
dt
~w
dt
dt
dt
dt
dt
= 0
~S
~S
~S
-;+ F (~S
~,~, ... , -1:>ut
u1l1
u1l2
u1l3n
(4.50)
18Hamilton [1835]. p. 167.
182
These last relations are what we need from Hamilton's paper. The former
is the Hamilton-Jacobi partial differential equation; the latter was thrown
out by Jacobi, as we see next. The problem of rendering the integral (4.48)
for S an extremum is Hamilton's principle.
In 1838 Jacobi wrote a most important paper on this same subject. 19 In
this paper he levels two criticisms at Hamilton's work as we shall see. He
says: 20
It appears to me that Hamilton has presented his beautiful discovery in
a false light, which both complicates and limits its usefulness unnecessarily. His theorem as he has stated it, has also the disadvantage of being
obscure when one does not have his proof in front of one, since one can
not define one function by two partial differential equations without first
showing that such a function really exists. By the choice he has made of
the special function S the arbitrary constants become the initial values of
the coordinates and of the components of the velocities with respect to the
coordinate axes; but this is not an advantage since the introduction of
these constants ordinarily makes the integral equations more complicated,
and since one can transform the integral equations to this form from any
other form. It is perhaps because he has always to consider at the same
time two partial differential equations that Hamilton has not applied to
his theorem the general rules that Lagrange gives in his lectures on the
calculus of functions for integrating a non-linear partial differential equation of the first order in three variables; and for this reason, as I shall
show in ar.other memoir, results of the greatest interest for mechanics
have escaped him. Finally the requirement that the function S after
having to satisfy the first partial differential equation satisfies also a
second one leads to a restriction in that it excludes the case where the
force-function U contains the time explicitly: for this case, in fact, the
second partial differential equation is not valid.
Jacobi's criticisms are then first, that Hamilton does not know that his
equations necessarily have a solution and second, that his second equation
is unnecessary; it suffices to show that a function S exists which satisfies
the first one, as we shall see.
Jacobi considers more or less the same dynamical problem as does
Hamilton: he has n heavy particles of masses mi , located at the points
(Xi' Yi' Zi) with initial values (ai' bi , c;) and initial components of velocity
(a;, b;, C;) (i = 1,2, ... , n). However, he permits the force function U to
contain t explicitly, as well as the coordinates Xi' Yi' Zi and the initial values
ai' bi , ci . He is then able to introduce the time into U since he does not
make use, as does Hamilton, of the principle of living force--conservation
of energy.
Jacobi's main result is contained in the theorem: 21
19 Jacobi [1838'].
2OJacobi [1838'], pp. 73-74. The translation above is from the French version, pp. 76-77.
21 Jacobi [1838'], pp. 71-72.
183
where U Signifies a given junction of the 3n coordinates xI' YI' ZI' x2' Y2,
Z2' ... , X n, Yn' zn' and the time t, and i takes on all values 1,2, ... , n.
Further let S be some complete integral of the partial differential equation:
(4.52)
as
-a
=131'
al
(4.53)
where the quantities PI' 132, ... 'P3n are 3n new arbitrary constants. Further the components of the velocity relative to the coordinate axes are:
x~ = ~ as,
,
y~ = ~ as,
m; ax;
m; ay;
Z~ = ~ as .
I
m;
az;
(4.54)
Notice that Jacobi, in essence, has shown the converse of what Hamilton did. His proof is quite straightforward. He starts by differentiating
relations (4.53) with respect to t and finds
He remarks that these can be used to find the quantities x;, y;, z;. To do
this, he differentiates the Hamilton-Jacobi relation (4.52) above with
respect to f7, and finds the equivalent relations
o = ~ + ~ -.!.. [ a2s . as + ~ as + ~ as ]
af7, at
m;
aap ax; ax; aap ay; ay; aap az; az;
(p = 1,2, ... , 3n).
Then a comparison of these two systems shows that
x~ = dx; = -.!..
I
dt
m;
as
ax; ,
y;
dy
dt
m;
as
ay; ,
as
az;'
184
new relations
+ ~ .~ + ~ .~ ]+
+ ~ .~ + ~ .
dt2
m. d 2z; =
I
_1 [ o2S . oS
k mk ox; oXk oXk
dt2
m dy; =
dt2
m. d 2x; =
I
_1 [ 02S . oS
k mk 0Yk oXk oXk
_1
k mk
[~.
oZ; oXk
oS
oXk
oY; 0Yk
+~.
oZ; 0Yk
0Yk
ox; OZk
0Yk
oS
0Yk
oY; OZk
+ ~.
oZ; OZk
OZk
02S ,
ox; ot
oS ]
oZk
02S
oY; ot '
oS ]
OZk
02S .
oZ; ot
_1 [ oS. 02S
mk oXk ox; oXk
oS ~
0Yk ox; 0Yk
oS . ~ ]
oZk ox; OZk
02S
ox; ot .
185
as + .!. 2: ~ [( as
at 2 mi aXi
)2 + ( aYi
as )2 + ( as )2] = u,
aZi
as + .!. 2: ~ [( as
at 2 mi aXi
)2 + ( aYi
as )2 + ( as )2] = U
aZi
+ HI = S + (T -
BV = I BH
since
as/at = -
where
12 ~
~
2 +y: 2 + Z:2)
m.(x:
I
I
I
I
U = H
r ;~ r ~~ r] -
t 2: ~i
[ (
~!
+(
+(
U= H
so that both S and V become functions of H, Xi' Yi' Zi' and ai' hi' Ci
(i = 1,2, ... , n). Thus
av
aH
= t,
av
-a
Zj
= mjzj'
With the help of these values he finds that V satisfies the partial differential equation
(4.55)
186
av
-aa- l
av
-aa2
/3 p
/32"'"
where /31' /32' ... , /33n are new arbitrary constants. Thus the solution of
these 3n equations for the 3n quantities x;, Y;' z; contains 6n constants of
integration a p a 2 , , a 3n ; /3 p /32" .. , /33n' Moreover, Jacobi notes that
3n intermediate integrals containing 3n constants arise from the relations
av
a-=m;x;,
x;
av
a-=m;z;,
Z;
187
'It(z)
ch{I'(z')
= I/I'(z) -
and supposes that the coefficients in 1/1 are defined along some extremal of
his family. He also supposes that along that extremal a l1 =1= O. [The expression for 'It would now be written as a1/1 /az - d(al/l /az')/ dx.] He can then
write the second variation in the familiar form (see Section 4.1)
112
= LbZ'lt(Z)dX.
He expresses 'It as
da l1 z'
'It(z) = m:oZ - - dx
and finds
d {a l1 (uz' - zu')}
u'lt(z) - z'lt(u) = - - - - d - x - - so that
b
r
Ja
a
ay /aa l ,
and r 2 =
(uz' - zu'f
l1
ay /aa 2 and
discovers
dx.
~ = ,,(
'2
Q2
"
-m)
188
never vanishes between the limits for x mentioned above .... If, on the
other hand, that ratio assumes all values from - 00 to + 00 between the
same limits for x, then that expression must necessarily vanish one time
whatever value the arbitrary constant m takes on. For that reason we
investigate the character of the ratio '2/ ' .
He now notes that both '. and '2 satisfy the Jacobi differential equation
q,(z) = 0 and thus - a l1 (,.r; - r2r;) = C, a constant, i.e.,
The right-hand member of this equation does not change its sign since
does not, if the given extremal truly furnishes a maximum or minimum.
It follows from this that r2 / r. is a monotone function which takes on all
values between - 00 and + 00 "provided it takes on the same value for two
different values of x. From this one perceives how necessary it is to restrict
the limits of integration .... The limits of the integral must therefore only
be extended to the point at which for the first time the equation
all
ay
ay aa 2
t:,
for 1t:1 small. Hesse remarks that in general there is such an intersection
point with x =1= a, which he calls (b, Yb). Then he has
ay
( aa
l
ay
( aa
l
(a y
aa2
aa2
aa l = 0
and also
(aaay ) aa
aa2
2a l
= 0,
where the subscripts a and b mean that the relevant expressions are
evaluated for x = a and b. Eliminating aa 2 /aa l , he finds
(r 2 )a
(rl)a
(r2 )b
(rl)b '
189
which is his condition above. He discusses this and concludes that this
second point is on the envelope of the family of extremals through the
given first point.
He then proceeds to say:
If the integral expression
second variation
~2 = Lbz'l'(z)dx
must keep the same sign for all functions z. It is however still permissible
that it vanish for a designated function z. The vanishing of the second
variation for a designated function z is nevertheless the limit of the
extension of the limits for the integral ~. For if one extends the limits of
the integral further, then the principle of continuity offers at least the
possibility that the second variation also will change its sign.
Hesse goes on to observe that the second variation vanishes along any
solution of i'(z) = 0, and the most general such z is given by z = alr l +
a 2 r2 He concludes by observing that if there is such a z, then
5. Weierstrass
191
to
tt
(5.1 )
where x' = dx / dt and y' = dy / dt, and asked that it be a minimum (maximum) usually for fixed end-points. He assumes throughout that there is a
region R of xy-space in which F is single-valued and regular-expansible in
a convergent power series-for all x', y' =1= 0,0. 2 He further needs the
requirement on F of positive homogeneity, i.e., for all K > 0
F( x, y, KX', Ky')
+ hx',y' + hy') = F+
hF
+ hx',y' + hy') =
(5.2)
g;
(5.3)
x' a2F
ax,2
+y'~ =0
ax' ay"
ay,2
'
a2F
ax' ay' = -
F' ,
IX Y ,
(5.4)
192
5. Weierstrass
ax
_.4. aF
dt ax"
_.4. aF
G2 = aF
ay
dt ay'
(5.5)
and
G=
a2F
ayax'
a2F
axay'
(5.5')
r G (x''II - y'~)dt,
)to
tl
where ~,'II are the variations, and that G1 = -y'G, G2 = x'G.3 He concludes from this that G = 0 is the first necessary condition.
In Chapter 11 Weierstrass works out his corner condition.4 He states his
result in this way: "If the tangent direction of a given [minimizing] curve
changes suddenly [discontinuously] in one or more places, then the quantities
aFlax' and aFlay' do not undergo any break in their continuity, provided
that there are only a finite number of such places in the interval (to ... t)."
To prove this, Weierstrass considers a value t' for which to < t' < t)
where the curve changes direction discontinuously. He further knows that
there are values t", t'" such that on the intervals I" < I < t', t' < t < t'"
there are no other discontinuities in the curve's tangent. Then he chooses
variations ~ = /Cu, 'II = /cv with /C any constant and u and v arbitrary
continuous [differentiable] functions of I which vanish at t" and t"'. [They
vanish outside the interval (t",t"').] Then the first variation, which must
vanish, has the form
(I{ aF
u+ aF 1'+ aF du + aF dV}dt.
ax
ay
ax' dt
ay' dt
)to
l G(xv-yu)dl+
tl
to
aF
aF
uX
liy
~u+~v
]t'
t"
aF ]t"'_
+ [aF
~u+-a,v
-0.
uX
t'
It follows then that aFlax' and aFlay' are continuous at I = I' since u and
v are independent of each other. The symbols + and - are used to
indicate right- and left-hand limits, respectively.
VOR, Chap. X. His analysis of the fundamental lemma is based on ideas of
Heine and du Bois-Raymond, which we discuss in Chapter 6.
4See also Erdmann [1877']. His work is independent of that of Weierstrass. Weierstrass worked
out his condition during his summer semester lecture in 1865.
3 Weierstrass,
193
dt
where
a2F
axax' -y
,dy'
di F -
L,
2F
2F
, dy'
_
, dx'
_
axay' +x di F - ayax' +y di F - M,
2F
dx'
ayay' -xdi F = N.
+ N1J2 =
x'~
= w and
R.
8 2F = F.(
where
194
5. Weierstrass
+ a2p ,
axay' y,
ap = a2p x'
ay
ayax'
+ a2p
'.
ayay' y,
but since G = 0
ap _ a2p
, + a2p
,
"p dy'
ax - ax ax' x
ayax' y - x Y I dt
ap = a2p y'
ayay'
ay
+ ,2p dx'
y
dt '
dt
dt
and hence
ap = Lx' + My'
ax
'
~; =
Mx'
+ Ny'.
He now differentiates these equations with respect to t and finds after some
calculation that
Llx' + Mly' = 0,
MIx'
+ Nly' = O.
M 1-- -xy
"F2'
N I --
X ,2F2'
(54')
.
and since R = Le + 2M ~T1 + NTl 2, the expression outside the integral sign
vanishes at to and t I for the fixed end-point case. Weierstrass notes that the
total variation of the integral 1 can be represented as
J: PI( tt; f
1
+ P2 w2 } dt
195
~ { F[P)( Wp + IT-
Wp )
+ F1 >W; } T,
p
where the vp are linear functions of the WP ' and all the H(p) possess the
same sign; in the case of a minimum, they are all positive.
"This deliberation Lagrange carried out for the purpose of bringing the
second variation into the form
8 2] = (tIHv 2 dt,
)to
J:
FI(
vw 2];;,
f+
2vw ~
+ ( F2 +
~~ )w2.
~~ ) = o.
and for fixed end-points the expression outside the integral sign vanishes.
(In what Weierstrass has done it is evident that he has tacitly assumed
FI1= 0.)
Weierstrass now wishes to discuss the character of Legendre's differential equation above to see whether its solution v is everywhere finite and
196
5. Weierstrass
c;;: ) -
o.
+ F. du
dt
du*
F2u + dt == 0,
= 0,
or equivalently
Fu - ~ (F. c;;: ) - o.
(5.7)
+ flG.
G2 + flG 2 - (x'
when he varies x, y by
~,.".
+ ~~ )(G + flG),
8G. - -y'8G -
~ G,
8G2 == x'8G +
~~ G,
dq 8G2.
dt
Weierstrass then makes use of the definitions (5.5), (5.5') of G., G2 and G to
calculate 8G., 8G2, and 8G. With the help of the definition w - x'." - y'~
and of (5.4) of F., he is able to show after considerable calculation that
8G. - - y,( F2w -
~ (F. ~ )),
~ (F. ~ ))
197
~ (FI tt;)
198
5. Weierstrass
o=
d ( FI dt
dUI )
U2 dt
UI
I
d ( FI dt
dU2 ) = dt
d ( FI (dU
dt
U2 dt
dU 2 ))
U. dt
or
(5.9)
FIP=C
where P = (u2du l / dt) - (Uldu2/ dt). If now u l =1= 0, this implies that
and consequently
U2 = - CUI (I dt 2
)'0 Flul
Now recall that 8(t, t') is a solution of (5.7) containing a parameter t'.
Choose ul = 8(t, t'), U2 = 8(t, t' + T) and note that when to is taken to be
t' + T, the relation above for U2 becomes
8(t,t'
+ T) =
- C8(t,t') (t
dt
),,+ .. F I8(t,t,)2
By hypothesis, Weierstrass has 8(t, t') =1= 0 inside the interval (t' ... t")
and FI =1= 0 on that interval; hence 8(t, t' + T) cannot vanish for t between
t' + T and t". Next he proceeds to show Utat 8( t", t' + T) =1= O. To do this,
he notes that 8(t", I') = fJ l (t")fJ2(/') - fJ l (/')fJ2(t") = 0; if now
8(1",1' + T) = fJ l (t")fJ2(t' + T) - fJl(/' + T)fJ2(t")
were 0, then
(5.10)
199
where 8 1,82 are the partial derivatives of 8 with respect to I and I' and the
third term in this relation is at least quadratic in T and T'. It is not difficult
to show that 8 1(/",1') and 8 2(/",1') are not both zero; in fact, since
8(1",1') = -8(/',1"), it is evident that 8 1(/",1') = -8 2(/',1"). The quantity
T' can then be expanded for sufficiently small T into a power series of the
form
'T'
aT
+ T 2$( T),
= _ 9 2(t",t') = [ 91(t")
a
91(t')
8 1(t",t')
]2 FI(t")
FI(t') '
(IFI { dw _
)'0
dt
w du}2 dt.
u dl
I:I {
FI(
~ + F2W2 }dt
I:I{ - ~(FI';)+F2W}wdt
VOR, p. 161.
200
5. Weierstrass
2[= i:1
{(FI+ e)( 7t
Weierstrass now chooses w so that the first integral in this relation will
vanish. To do this, he applies the usual integration by parts technique and
has
13
2[ = {tl{ _ ~((FI+e)7t)+(F2+e)w}wdl
+ [W(FI + e) 7t
r> ei:l((7t f
2)dl.
+W
(5.11)
To make the first integral in this expression vanish, he wishes to choose for
W a solution of the differential equation
(5.7')
which vanishes for I = 10 and I = II' [Clearly, for e = 0 this reduces to (5.7).]
If I" is conjugate to I', then (5.7) has, by definition, a solution 8(/, I')
which vanishes at I' and I" and is continuous. Weierstrass now shows by a
continuity argument for linear differential equations that there is a solution
u = 8(1, I', e) of (5.7') with a root I'" near to I" for lei sufficiently small and
e of the same sign as Fl' If there is a point 10 < I I conjugate to 10 , then for
such e there is a point 12 < I I for which W = 8(/, 10 , e) vanishes at 1 = 10 and
12, When this W is introduced into (5.11) with W = 0 on [t2,/d, there results
for the second variation the expression
For this variation w it is evident that 13 2[ and thereby also the complete
variation of [ have the opposite sign to FI' In this connection Weierstrass
proves in his 18th chapter that 13 2[ has the same sign as the complete
variation
iJ.[ =
201
In the course of this analysis Weierstrass has, in effect, shown that if the
Jacobi differential equation (5.7) has no solution u vanishing at any point
of the closed interval from 10 to II' then for e sufficiently small the same is
true of the perturbed equation (5.7'). This point is needed in what follows.
202
5. Weierstrass
arbitrarily small and where the directions of the tangents to the two curves in
these points deviate only a little from each other. Then the three conditions ... [above] ... are not only necessary but also sufficient for the
exhibited integral to have a maximum provided the function FI is negative
and, on the contrary, a minimum provided that FI is positive.
i:l(
F( x
~~ ,Y' + ~) -
F(X,Y,X',Y')dl.
e)(~Fa~(X
+ e~I'Y + e~2'x' + e~3'Y' + e~4)~~~dedl,
Jto Jo
a,p
where a, f3 run over the integers 1, 2, 3, 4,
dl = (tl (1(1 -
~I
= ~,
~2
= 1/,
~3
d~
= dl '
and the Fa~ are the second partial derivatives of F with respect to ~ and ~~.
He then expands the Fa~ in powers of e and notices that the first term in
the right-hand side of the resulting expansion of dl is expressible in terms
of the second variation of I. Thus
dl =
(a,
f3 = 1,2,3,4),
where the coefficients ea~ are themselves functions of the ~, which can be
made arbitrarily small, uniformly in I. By this means Weierstrass has
brought the problem of deciding on the sign of dl to that of deciding on
the sign of the second variation 8 21.
He chooses for ~, 1/ the values
I
~= -
Y'w
x'w
1/ = --.:;::.....:..:....-
8 21= .(I(FI( ~
8 As
f+
F2w 2)dl;
we shall soon see, Weierstrass later developed a more elegant way to handle sufficiency
proofs, which was further refined by his successors.
203
and the quadratic form in the ~ appearing in fl.! goes over into such a form
in wand dwldt
jw 2 + 2gw dw
+h(
dw)2
dt
dt'
where f, g, h are functions of wand dw I dt. Weierstrass now transforms this
form by a rotation of axes into fl u? + f2Ui with
u2 + u2 =
I
2+ ( dw
)2.
dt'
here the functions fl' f2 have the same property as the falJ; i.e., as functions
of w they can be made small, uniformly in t. (Notice thatfl,!2 are the roots
of the quadratic equation S2 - (j + h)s + (fit - g2) = 0.)
He now chooses 1 as a value between fl and f2 so that
tt; f)
t1:1(1 +
k)(
(tt; f + w2)dt,
~ ( FI -
k)
'!: )- (F2 -
k)u = 0
has the same property. This follows from an examination of equation (5.7')
with k = - f and the remarks at the close of Section 5.4.
Knowing this, Weierstrass can then transform the first integral in the
expression for fl.! above into
i ll (FI 10
au)2 dt
k) (dw
- - w ~
dt
u dt
"and one sees that it has the same sign as FI - k, and therefore also as Fl'
This is however what was to be proved."
204
5. Weierstrass
(5.13)
with O'i= A = deta,,1' and with the X"I' continuous functions of XI'
x 2 , ,xn ' which vanish along with their arguments and which have
finite-valued partial derivatives. It is now always possible to find constants
gl' g2' ... , gn and hI' h2' ... , hn so that for IYI'I < hI' there is one and only
one set xI' satisfying system (5.13) with IXI'I < gl" This solution, moreover, is
of the form
205
He seeks to determine a', f3', 'T so that this new curve goes through the
point A given by t = to in equations (5.12) and makes with the given curve
defined by a, f3 as small an angle as desired. If x = x + ~, ji = y + 1/, then
he makes certain assumptions: first, that if t = to, then ~ = 0, 1/ = 0; and
second, that if
dv
,
.,
-d x
= x, = pCOSI\,
= y = pSlfll\,
dt
ax = PCOSA
- -
dt
'
:it
dji
dt
'
= p-sinA
'
x , -dji - y , -ax
dt
dl
' ('1\ - 1\
') ="
= x , -ar, - y , -d~ = pp-SIn
dl
dt
(5.14)
1/12 = a1/1 /af3 and where P, q,', PI' ql' 'I' P2' q2,'2 are quantities that vanish
with a', f3', 'T.
By Weierstrass's implicit-function theorem, to solve this system for
a', f3','T, it suffices to examine the case when these nine last quantities are
set to zero. Before doing this, he first seeks to simplify the relation for" in
(5.15) above. In the expression (5.14)
~,1/
(5.16)
1(1)
02( t)
Then the expression 1/X' -
~'
(5.17)
206
5. Weierstrass
/C
where 0, ai' 02 are quantities that vanish with 7', a', P' since ~ = 11 = 0 at
t = to'
Using this relation, Weierstrass returns to his system (5.15) above with
p,q,r, PI,ql,r l , P2,q2,r2 set to zero and replaces the third equation there
with the equivalent one
/C = O,(to)a'
+ 02(tO) P'
obtained from the one above by setting 0,01,02 to zero. The new system has
the determinant
"'I
We proved earlier with the help of equation (5.9) that this expression is not
null, sihce it is equal to - C / Fl' This shows that the simplified system has
the solutions
and hence the original system (5.15) has, for I/CI sufficiently small, the
solutions
7'
= ( -
~~ ::~ + k)
0l( to)
P' = ( D(to) + k2
/c,
/C,
(5.18)
where k, kl' k2 are quantities vanishing with /c. Weierstrass then summarizes
as follows: "To any given curve satisfying the differential equation G = 0
there always corresponds a second one which intersects the first in a given
point A at a given, sufficiently small angle; and that the second curve extends
as far as the first."
Weierstrass next asks whether the second curve (5.12') has any other
point in common with the first one, (5.12). To see this he considers the
distance between them measured along the normal to the original curve. If
X, Yare the coordinates of a point on the normal, the distance from the
foot of the normal to the original curve is given by
x'(Y - y) - y'(X - x)
~X'2 + y,2
(If there are no singular points on the first curve, the denominator is not
207
q,'(t)( 1/1'( t)'T + I/Il(t)a' + 1/12(t){J') - I/I'(t) (q,'(t)'T + q,l(t)a' + q,2( t){J')
= 8\(t)a' + 82(t){J',
where 8\,82 are given by definitions (5.17). In terms of the 'T,a', {J' of (5.18)
and 8 of (5.8), this becomes
(
-8\(t)82(tO) + 82(t)8\(to)
D(to)
+ (K)
) _ ( 8(to,t)
KD(to)
+ (K)
)
K,
where (K) is a quantity vanishing with K. Thus the distance along the
normal to the first curve from its base to where it cuts the second curve
goes to zero with the angle of intersection at the initial point A. If this
distance vanishes, then
8( to, t)
D(to) +(K)=O.
(5.19)
This is only possible if 8(to, t)/ D(to) vanishes with K. But this means that
"the intersection point of the two curves must, with vanishing K, become
infinitely near to a point conjugate to to." If in (5.19) the value t is chosen
not conjugate to to, then 8(to, t) 0 and (5.13) cannot be satisfied for K
very small. Near such a point the curves cannot intersect.
On the other hand, if t = to + 'T is a value very near to a point to
conjugate to to, then (5.19) can be satisfied. To see this, recall that 8
changes sign near a root (see p. 198 above.) Thus there are positive
constants 'Tl,'T2 such that the left-hand member of (5.19) has for t = to - 'Tl
the opposite sign from the one for t = to + 'T2' Since it is a continuous
function, it consequently must vanish for some intermediate value, such as
= to + 'T.
208
5. Weierstrass
the parameters so that the curves go through the initial point A. "The
second constant thereupon determines the initial direction, and it is now to
be so chosen that the curve goes through the given second point, if this is
possible." He supposes this is so and remarks that it can then be shown that
there is a neighborhood of B such that each point B' in the neighborhood
can be joined to A by a solution of G = 0, Le., by an extremal. He asserts,
moreover, that this solution viewed as a function of B' is continuous and
that it converges to the original curve when B' approaches B. It is only
necessary to assume that B is not conjugate to A.
To show these things Weierstrass returns to his curves (5.12) and (5.12')
and first requires that they pass through the point A for I = 10 This means
that
= 0,
= O.
= 1(11)2(/0 ) -
01(10)02(11)
= 8(/ 1,/2).
If then the points A and B are not conjugate to each other, this
determinant D =P 0, and Weierstrass can find values for 'T, 'T I, a', p'. In fact,
the system of equations above has the solution
'T
GI~I
+ HllIl
D
'TI
G2~1
+ H2111
D
where the quantities GI , HI' ... , G4, H4 are all continuous functions of
~I' 111 when I~II, 11111 are within determined bounds. This solution completely
determines the location of the point B' in the theorem above.
However, when the point B approaches the point A, the argument above
becomes questionable since the denominator D is approaching zero-recall
209
x = a + tCOSA + usinA,
y = b + tSinA + UCOSA.
The equation G = 0 for the extremals is transformed by this rotation into
a~- du'
a~,
a~- - -af-_
+-u +0
au'2 dt
au au'
at au'
au
'
and
au'
He now seeks a solution of the differential equation above with the initial
conditions u = 0, u' = 0 and says that this is possible only when FI =1= O.
Since the point A is fixed this means the angle A must be such that FI is
different from zero in that direction. By the continuity of FI it must remain
different from zero near to this direction. Weierstrass now has a pencil of
curves emanating from the point A, which are all close together in direction.
The solutions of G = 0 in these directions are expressible as convergent
power series
= ~ t + c 2 t 2 + c3 t 3 + ...
....
u=
p+'"
is a convergent series for all sufficiently small p. He has then shown that for
all permissible values of c and p, the series above give the points t, u of
intersection with the circle; he has also shown that no two coincide. This
completes his analysis of the situation near the point A :
210
5. Weierstrass
It is not asserted that only one extremal can be drawn through the point
B ' , nor that it must lie completely inside the strip. It is clear, however, that
except possibly for point A the original curve AB does lie wholly inside the
strip. Point A may lie on the boundary, but if the function F1(xo, Yo,x', y')
never vanishes nor becomes infinite for any set x', y', then the strip also
contains the point A as an interior point. As he conceives of a strip, it is a
set of points (x, y) at each of which he has a given direction x': y'. The
x', y' are proportional to the direction cosines of the extremal-solution of
G = O-from A through the point (x, y).
o
Figure 5.1
211
ing to 3, t2 + 'T.
If the family of extremals (5.12) is again x = </>(t,a, /3), Y = I/;(t,a, /3)
with </>' = a</>/at, 1/;' = al/;/at, </>1 = a</>/aa, 1/;1 = al/;/aa, </>2 = a</>/a/3, and
1/;2 = a I/; / a /3, it follows that the equations
</>(10' a, /3)
(5.20)
must be satisfied by the values 'T, 'T', a', /3', where a, /3 are the parameter
values corresponding to (02) and a + a', /3 + /3' to (03). Moreover, since
x 2 + ~2' Y2 + '1/2 are the coordinates of point 3, the equations
~2 =
(5.20')
must also hold for 'T, 'T', a', /3'. To solve these four equations, Weierstrass
expands the indicated functions in power series in the variables and finds
that through first-order terms, the equations for 'T,'T', a', /3' are
(5.21)
The last two of these equations give him the variations of the coordinates
X2' Y2 of point 2, so that the coordinates X2 + ~2' Y2 + '1/2 signify the
coordinates of point 3.
Next Weierstrass introduces quantitie~2' q2 proportional to the direction cosines of the tangent to the curve (23) at point 2 and chooses a to be
arc-length along (23) measured from 2 toward 3. Then he has
~2
= a(p2 + a l ),
(5.22)
where ai' a2 are quantities that vanish with a. He has shown in his 20th
chapter with the help of his implicit-function theorem how equations (5.21),
(5.22) can be solved for 'T, 'T', a', /3' provided that ~2' '1/2' and a are small.
With the help of these solutions, he can then express the variations of the
coordinates in the form
~=
(X + a')a,
'1/ = (Y + a")a,
(5.23)
evaluated along the arcs (02), (03), and (32). With the help of (5.23), he
212
5. Weierstrass
/03 -
i '2
10
/02
G'(x'TI-y'~dt+
[ -,f,+-,TI
aF
aF ]'2 +"',
ax
ay
10
where S is a quantity vanishing with a and the subscript (2) on aF jax' and
aF jay' means that these quantities are evaluated for t = t2
To estimate the remaining integrali32 , he again makes use of (5.22) and
writes the coordinates of an arbitrary point on (23) as
y = Y2 + (q2 + a)')i7,
where i7 varies over the closed interval [0, a] as (x, y) goes from the point
x 2 , Y2 to x 3 , Y3' This gives him the value
i32 = - LOF(x,y,X',y')di7=
LaFdi7 = aF,
+ i32 -
/02
= {(
aa~
)(2) '"h + ( aa~)
.q2
X
Y (2)
VOR, p. 213.
213
this he writes
aF = F(l)
ax'
,
aF = F(2)
a,'
-h=p,
- q2 = q
(5.25)
and in place of x', y', he writes the direction cosines p, q of the curve (02).
His condition is then expressible in terms of his function
&(x, y;p, q;p, q) = F(x, y, p,q) - F(I)(x, y, p,q)p - F(2)(X, y, p,q)q;
(5.26)
and it states that for all x, y, p, q belonging to a curve C, satisfying G = 0
and p, q arbitrary, the function & ~ 0 if the curve C renders the integral a
minimum and & " 0 if a maximum.
He goes on to show that if /C, iC are positive constants, then
&( x, y;/CP, /Cq;iCp, iCq) = iC&(x, y;p, q;p, q),
and
&( x, y;p, q; Kp, /Cq) = O.
LX ~ F(l)(x,y,cosq"sinq,)dq,
with a corresponding formula for F(2). Weierstrass expresses these integrands in the form
d F(I)(
. )_
.
a2F
a2F
T
x,y,cosq"smq, - -smq,--2 +cosq,-a
'a I'
uq,
ax'
x y
(5.27)
d F(2)(
. )_
.
a2F
a2F
dq,
x,y,cosq"smq, - -smq, ax'ay' + cosq, ay'2 .
214
5. Weierstrass
a2F
ax' ay'
.
-coscpsmcp' F I,
cp) dcp.
He comments that sin(x - cp) takes on all values between sin(X - X) and 0
and that X and X are known up to multiples of 27T; hence sin(X - cp) cannot
change sign inside the interval of integration. By means of the mean-value
theorem of the integral calculus, he can then assert that
t9(x, y; cos x, sin X; cosX, sin X)
where X = X + fJ(X - x) with 0 < fJ < 1 (Weierstrass gave this in his 1882
lectures). He concludes that "if the function FI(x, y, p, q) maintains the
same sign not only for such values of the arguments p, q which are
proportional to direction cosines along the original curve but also for
arbitrary arguments p, q, then the same is true for the function t9, and it is
clear that under this hypothesis the above formulated fourth necessary
condition is itself fulfilled."
'*
= (1 -
pp -qq)FI'
215
+ /21 -
102
101 =
</>(s).
r
J
II
+ (Po + (0)\)0,
y = Yo + (qo
+ (0)2)0,
where (0)\,(0)2 are functions of 0 that vanish with it, and the function F as
F(x,y,x',y')
= F(xo,yo,Po,qo) + (0)3;
z
0-----Figure 5.2
216
5. Weierstrass
with (0)3 a function of a vanishing with it. He can then assert that for
= II - 10 , a sufficiently smaIl value of a,
a)
10
0101'
where again D) is a small quantity vanishing with a). But the expression
F(x), YI' PI' ql) - F(xo, Yo, Po, qo) is also a quantity that vanishes with a,
and so there is a quantity ii) such that the integral can also be expressed in
the form
cp(s - a) =
103
+ 131 -
1 0 ),
and
i.e.,
cp(s - 0) - cp(s) = -
102
+ 132 + 1 03
Now when point 3 is so near to point 2 that the earlier considerations apply
to the integral 132 , then
102
+ S }o.
Putting his various results together, Weierstrass finds with the help of his
f9 -function that
cp(s + 0) - cp(s)
= 104 + 141 -
101 -
102 -
121
+ 101 = 104 -
124 -
1 02 ,
and by (5.24)
104 -
as well as
102
217
by analogy with what we saw above. Then the limit of (</>(8 + a) - 4>(8/ a
is also - S (X2' Y2; Pl' q2; Pl' til) so that this value is in fact d4>(8)/ tis, where
X2' Y2 are the coordinates of an arbitrary point on (Ol)-any curve through
points 0 and 1 and inside the plane strip; Pl' til are the direction cosines of
this curve at the point 2; and P2' q2 are the direction cosines at 2 of the
given curve (01) satisfying the equation G = O. (Weierstrass remarks that
his proof makes use of the fact that the curve (01) has a continuously
varying tangent.) Integrating the quantity </>'(8), we find Weierstrass's
famous theorem:
!J.I =
tl
to
S dt.
(5.29)
(This result also follows easily by the use of Hilbert's invariant integral, as
we shall see later.The importance of this result lies in the fact that the total
variation of the integral I is now related directly to the S -function, and this
makes sufficiency theorems easy to state and prove.)
If it is assumed that S(X2' 12; Pl,q2; Pl,q0 is positive for all values of its
arguments under consideration, then the derivative </>'(8) < 0 and </> is a
monotonic decreasing function with the final value of 0; hence it is always
positive and
102
+ i 21 > lOi;
i OI > lOi
Weierstrass (VOR, p. 224) summarizes in the theorem
If the junction fij is not positive at any point of an arbitrary curve (01)
inside the plane strip and joining the points 0 and 1 and if it does not vanish
at every point on the curve, then the integral in hand, evaluated along the
curve (01), which satisfies the differential equation G = 0, has a greater value
than it does evaluated along the curve (01); and if the junction fij is never
negative along the curve (01) and also not zero at each point, then the
integral along the original curve (01) is smaller than along the arbitrary curve
(01).
He goes on to remark that the condition on the arbitrary curve (01) that
the tangent be continuously changing can be weakened to permit the curve
to consist of a finite number of pieces, each of which has the property. This
follows by a simple reconsideration of 4>(8).
He then asks what happens if the function S vanishes along the curve
(01). Under the hypothesis, made earlier, that FI not only is different from
zero along the curve (01) but also for x', y' arbitrary arguments in the plane
strip, it is clear that S can only vanish when 1 - pp - qq does. But this is
only possible when the directions determined by p, q and.h q coincide, i.e.,
when pq - qp = o. This means that the arbitrary curve (01) must coincide
with the extremal (02) at their intersection point 2 in both position and
direction; but 2is an arbitrary point, and hence they must coincide.
218
5. Weierstrass
= q,(t, IC),
Then if t, IC and t + -r', IC + IC' define the original curve x, y and a neighboring one x + ~,y + 11 at corresponding points, then for -r';IC' sufficiently
small
11 = \{I'( t)-r'
If the two curves have another point in common, then at that point ~ = 0,
11 = 0 and the determinant of these equations must vanish, i.e.,
q,'( t)
~!
\{I'( t)
~~ = o.
Let tl = tl(/C) be the smallest root after to of this equation as a function of /C.
Then the locus of points conjugate to the initial point 0 is given by the
equations x = q,(tl' /C) and y = \{I(t l , /C), where x, yare the coordinates of the
conjugate points. It follows readily that
dx
dIC
aq,(tl' /C) dt l
atl
d/C
dy
d/C
a\{l(tI'IC) dt l
at I
d/C
aq,(tl' /C)
a/C
= q, (tl)
a\{l(tI'IC)
= \{I (tl)
aIC
dt l
d/C
dt l
dIC
+
+
aq,(tl' /C)
a/C
a\{l(tI'IC)
aIC
ep'(t I ) d/C
dy - '/(t ) ax = o.
't'
I d/C
Weierstrass notes that since q,'(tI),\{I'(t l ) are proportional to the direction
cosines of the tangent to the extremal arc through points 0 and 1 at point 1,
this equation implies that the locus of conjugate points is tangent to the
extremal at their intersection. He concludes that this proves the envelope of
the one-parameter family of extremals through 0 is the locus of points
conjugate to O.
He now considers a curve x = f(u), y = g(u) through 0 lying totally
inside the region determined by the envelope and not coinciding with any
extremal through O. For this curve to be tangent to such an extremal, it is
necessary that
q,( t, /C)
f( u),
\{I ( t, /C)
= g( u),
aq, dg _ a\{l df
at du
at du
Weierstrass remarks that the first two of these relations give t and u as
functions of /C for IICI sufficiently small. It then results that
aq, dt
aq,
-+=df- du
at d/C
aIC
du dIC '
a\{l dt
dg du
- + -a\{I= _.
at dIC
aIC
du dIC '
219
220
5. Weierstrass
ll
(FO - "A.FI)dl.
10
This was first shown to be false by Lundstrom. 13 A conjugate point for the
isoperimetric problem can exist, beyond which no extremal can be found
but which is not a conjugate point for the unrestricted problem above.
The problem at hand is to render the integral
1=
II
FO(x,y,xl,y')dl
10
has a prescribed value and the end-points are fixed. 14 Both integrals are to
be understood as being evaluated along the same curve. To do this,
Weierstrass notes that he must ensure that the variations given to the
original curve will be such as to leave the integral 1 I unchanged in value.
He considers variations of the form ~ = KU + KIU I , 1] = KV + KIV I, where K
and KI are constants and u, u l , V, VI are differentiable functions of I, that
vanish for I = 10 and II' He now goes to show that the parameters K and KI
can be so chosen that 1 I remains unchanged in value when I~I, 11]1 are
sufficiently small. As we saw earlier (p. 192),
M
(II
(II[
d~ d1]]2
~,1], dl 'dl
dl.
WI
IIIGwdl+K IIIIGWldl+(K,KI)IK+(K,KI)2KI'
III I =K
10
10
where
221
If
KI
as
= -(
~I:
+ (K))K.
~ = K( U -
U
I~I: )+
71 = K( V -
(K)K,
VI ~I: )+
(K)K.
('Go'(X'1j-y'~)dt+
Jlo
=K(IGO'(W-W I W:)dt+(K)K.
Jlo
WI
Jlo
W~ ~I:) + (K)K.
always negative or always positive; but this is only possible when the
coefficient of the first power of K vanishes, i.e., when
WO : WI
W~
: wi.
The left-hand member of this relation contains only the functions u, V and
the right-hand only UI'V I . There must therefore exist a constant lI. such that
WO=lI.WI,
W~=lI.WII
= 0,
(5.30)
and Weierstrass sums up in the theorem: "In general if there is a curve for
which the integral 1 assumes a greatest or least value while the integral I I
maintains a prescribed value, then there must be a constant lI. given so that the
coordinates of an arbitrary point of the curve satisfy the differential equation ... [Go - lI.G I = 0]."
Since this differential equation is in general of the second order, it must
possess a two-parameter family x = c[>(t,a, /3,lI.), y = 1/;(t,a, /3,lI.) of solu-
5. Weierstrass
222
tions. Thus there are three constants a, {3,"A to be determined so that the
integral/I has its preassigned value and the curve passes through both
end-points. Weierstrass argues, as did Mayer ([1877], p. 65n), that even if
the minimizing arc contains "restricted" or discontinuous segments or
points, the "free" or continuous segments-those whose variation is not
restricted-must satisfy equation (5.30) above with the same value for the
constant "A.
If F = FO - "AFI, then Weierstrass forms out of F the function G
= GO -"AG I with the help of definition (5.5'). The equation G = 0 is then
equivalent to the equations
0= MI +
= 8/0 -
MIl
t {I {
FI(
~~ + F2W2} dt+
1H0 =
t:1 { ~~ f +
FI(
F2W2} dt+
As before, the first term in the right-hand member of this relation can be
transformed into the expression
1
2
16 Weierstrass,
Jto
223
d1u
dt 2
+ -dF I -du
dt
dt
-F2u=0.
224
5. Weierstrass
M02
f.1 2 a
[ aFa
G wdt+ p~2
~
x
where p, q signify the direction cosines of the curve (32) at point 2 and ~2' 112
are the variations of the coordinates of point 2; they are given, as we saw
earlier in (5.22), by the relations
~2
= ( - p + al)a,
112
= ( - q + (2)a.
Thus
Y m
(s = 0, I).
Weierstrass now multiplies tH<12
finds
f9 being formed from F = Fa - .\F 1, where P2' q2 are the direction cosines
of the extremal (02) at point 2 and the terms not shown in the relation
above are regular functions of a which contain a factor a 2
It follows that if
2 does not change sign, then neither can the function
f9 (x, y; P, q; p, q) along the given curve for any two pairs p, q and p, q.
Consider now in Figure 5.3 the arc (01) through points 0 and I and
satisfying G = 0 and another curve (01) through those points and having on
it a point 2. Weierstrass now wishes to examine 2 2 as a function of
point 2, and he denotes by a arc-length along (02). By the hypotheses made
above, there is a solution of G = 0 through points 0 and 2 so that 1<12 = 1<12
He then examines the continuous function j(s) = ig2 - 2 under the hy-
AIg
Ig Ig
Ig
225
Figure 5.3
pothesis that (01) is a minimizing arc. Let 3 be a point between 0 and 2 and
4 a point between 2 and 1 so chosen that the distances along the arcs (32)
and (24) have the same value a, which will be understood to be small. Then
and
f( s - a) - f( s)
= -
i 302 -
Ig + Ig
i2~
I~ + I~ = FD(x 2 ,Y2;
= - FD( x 2 , Y2;
Similarly,
f(s + a) - f(s)
P2,Q2;
p,ij)a +
and hence
provided only that the curve (02) has a continuously turning tangent at th..e
point 2.
Suppose now that f9 is positive for all values of its arguments; then J(s)
is an increasing function of the point 2, starting at f(O) = O. Thus when 2
coincides with 1, f(s) is positive. Weierstrass summarizes in the theorem:
"If the function FD (x, y; P, q; p, ij), evaluated along the curve and for arbitrary
p,ij, is always positive then 1 > 1, i.e., a minimum is surely exhibited."
In case dJ/ ds == 0, then f is a constant equal to 0 since f(O) = 0, and
hence 2 = 2 for every point 2 on (01), in particular 1 = 1 But the
curve (01) does not satisfy the differential equation G = 0 since by hypothesis only one solution (01) of G = 0 can pass through points 0 and 1. Since
(01) is not a solution of G = 0, a variation can always be found so that the
corresponding integral will be smaller or larger than 1 = 1, and there
can be no maximum or minimum.
Weierstrass remarks that the investigations above require continuity
assumptions on the arbitrary curve (01). Along the curve the quantities
x', y' must vary continuously, and it must consist of a finite number of
pieces along each of which the second derivatives are continuous in order
that the complete variations III can be expanded into power series.
ig Ig
ig Ig
ig Ig
ig Ig
226
5. Weierstrass
Ig
Iri2
Ig
ig
Ig
= 100I + 1-I02 ,
i.e., that I I has the same value along the arc (02) as along (002). If in
particular point 2 coincides with 1, then the value of the integral is 16.)1. As
before, he can write the total variation in the form
~lg21
where now P2' q2 are the direction cosines of the arc (02) at point 2 and
same point. Again he finds that
o
0
-0
Iril
Ig ig
Ig
and hence that 1 < l , which means under his assumption that 1 is a
minimum.
(Since he discovered the idea of moving the initial point slightly to the
227
z =itFI(x,y,x',Y')df
to
evaluated along the curve (01). Thus for 1 = fI' he has II = z(t). He now
considers a neighboring curve defined at corresponding points by the
values I,a, /3 and 1 + T,a + a', /3 + /3', 'A + A' with end-points given by
fo,t I and 10 + To,/I + T I He denotes by ~,'I),r the differences of corresponding coordinates x, y,z and finds that
~=
'I) =
1/;'(t)T + I/;'(t)a' +
=itG I . (1/>'(1)'1)
~
~~,I ~o -
-I/;'(y)~)dl + ~F,I ~+
uX
~~,I )0'1)0 +
0/,1
'I)
... ,
where ~0,'I)0 are the values of ~,'I) at 1 = to and 1/>3 = ol/>/o'A, 1/;3 = ol/;/o'A with
corresponding definitions for 1/>1,1/;],1/>2,1/;2' He gives the formula for
another form by setting
(v
1,2,3)
and
(v
1,2,3).
He finds that
r -
of]
ax' ~ -
of]
oy'
'I)
( of] )
ox'
)
}o + ( of]
oy'
0 '1)0
228
5. Weierstrass
(5.31)
9. ( t)
9.(10)
8.(1, to)
92(t)
92(to)
93(t)
93(to)
(5.32)
does not vanish for t = t . He now says that "disregarding the case when
the function 8(t, to) vanishes identically for all values of t, the first root of
the equation being considered after to as t moves across the interval (to ... t.)
determines the point coryugate to to" (p. 275). (This is his generalization of
conjugate points to the isoperimetric problem.)
Weierstrass now asserts that if there is no point conjugate to the point 0
on (01), it can be embedded in a plane strip which bas the property that
between 0 and each of its points passes one and only one curve satisfying
the equation G = GO - AG = 0; this curve lies arbitrarily near to the
original curve, and their points can be so ordered that the integral
(IF(x, y,x', y')dt
)'0
always has the same value on each of them; moreover, the value of A on the
new curve deviates only a little from its value on the original one. His proof
is given later in the same chapter and will appear in due course.
He then states the fourth necessary condition: "A maximum or minimum
for the integral /0 can not occur under the condition that the integral/ takes
on a prescribed value if the arc contains in its interior a point conjugate to to."
229
y'~,
(s
0, 1),
where the suppressed terms are at least second order. (The functions FI and
F2 appear in (5.4) and (5.4') above.) He next expands w in powers of
T,a', /3',1..'; to do this he aggregates all terms of first degree and calls the
resulting expression w. Then w has the form
w=
and consequently
~G =
NG I(X, y)
+ (T,a', 13',1..')2.
Weierstrass now argues that since the varied curve must also satisfy G = 0
for sufficiently small ITI, la'l, 1131, 11..'1, the coefficients of a', 13',1..' in the
relation above must vanish. This implies that
FI
FI
d 20s
dF I dOs
2 + -d -d - F 20s
dt
t
t
0,
(s
1,2),
d 20
dF I d0
I
+ - - 3 -F203 + G =0.
dt 2
dt dt
3
--
(5.33)
d0 3 ))
- Os Tt
Os G I
(s= 1,2),
2
d ( FI (d0
dOl )) =0,
dt
OITt -02Tt
or in the form
(s= 1,2),
(5.33')
230
5. Weierstrass
then
(}I =
8 1(t, to)
this would imply with the help of the second determinantal form for 8(t, to)
in (5.32) that 8 vanished identically, which is contrary to Weierstrass's
assumption.
He now goes on to show that 8(t, to) changes sign as t passes through a
zero. To do this he has from the definition of 8 above that
(}I(t)
d8(t,to)
dt
(}l t)
(}{(t)
(}3(tO) +
d8 3
(}1(tO)
(}2( t)
(}1(tO) (}2(tO)
d8 1
d8 2
dt
dt
dt
8 1(t, to)
( V
8 2(t, to)
= 123
, , ).
8 3(t,to)
To simplify notations, Weierstrass now defines 11,12,13 with the help of the
relations
(}2(t)(}1 (to)
= 13(t)
8( t, to) = 11 (t)8 1(t, to) + 12(t)82(t, to) + 13( t)8 3(t, to),
; 8( t, to) = 1{(t)8 1(t, to) + 12(t)82( t, to) + 13( t)8 3(t, to),
since
d 8(t, to)
dt 13( t)
(J3(1)f{(I) - II (t)13(1) )8 1(t, to) + (13(t)12(t) - 12( t)13(t8 2(t, to)
13( t)2
.
The numerator of the right-hand member of this equation is expressible as
231
[ F1( 0l( 1)0~(1) - O2(1)0;( I J:003(10) + FI (1) 03(1)( 0l( (0)0~(1) - O2(10)0;(1))
- F1(1)03(10)( 01(1)0~(1) - O2(1)0;(1)).
The third relation in (5.33') shows that the expression in brackets above is
zero. When these facts are assembled, it results that
d 8(1, (0)
FIE2
dl f3( I) = f3( 1)2 .
Weierstrass now supposes that I' is a root of 8(t, (0) = 0 of order K-the
series expansion of 8 about I' begins with the term (I - I')"-and that
fil') *- 0. 19 Accordingly, the expansion of the expression
f3(1)
d8(1,lo)
dl
- f3(1)8(1, (0)
begins with the term (I - t't- I This expression is, as we just saw, equal to
FI E 2; since, by hypothesis, FI neither vanishes along the given curve nor is
infinite, the expansion of FIE2 must begin with an even power of I - t'-it
cannot change sign-and K must be odd. This implies immediaiely that 8
changes sign at each root.
If fi I') = 0, then f3(t') *- 0, because otherwise
=0
and
= 0,
o.
and thus that 1 is a point conjugate to to for the unrestricted problem, i.e., the problem without
the isoperimetric condition (see Bolza. VOR pp. 478ff). We examine this case directly.
232
5. Weierstrass
But by the third equation (5.33'), these equations are impossible since
C =F 0, F( =F 0, =F 00 and hence not both 13( I') and 13( I') vanish at the same
point. From this Weierstrass infers that the expansions of 8(t, (0 ) and 13(/)
are of the forms
13( I)
d8(/, (0 )
dl
-
13(1)8( I, (0 )
begins with the term ab(IC - 1)(1 - I't. When IC = 1 his proof is still valid.
However this is not necessary. See his example on pp. 301-302 or see
BoIza, VOR, p. 478 for a discussion of the exceptional case. The condition
13 = 0 determines the conjugate points for the unrestricted problem as
mentioned in footnote 19.
Weierstrass turns now to a generalization of his notion of a strip for
isoperimetrical problems. He again considers the family of solutions of
G = 0, x = cp(t,a, p,'A), y = ",,(/,a, p,'A), which contains the given arc (01)
for the parameter values a, 13, 'A. He now appends to the coordinates x, y
the third one
evaluated along (01) and draws, as in Figure 5.4, the space curve (01'). Let
the curve (01) vary in the family, and set up a correspondence between
points on the original and the varied curve so that at corresponding points
the values of z are the same.2 Suppose that 2 is a point on (01) such that
there is no point conjugate to 0 between 0 and 2, and suppose that 2 is the
point corresponding to 2 on (02), a neighboring curve. He then constructs
the space curve (02') and notes that the space curves embedding (02') fill a
2'
Figure 5.4
20The problem implicitly posed by Weierstrass is that of finding among all space curve passing
through the fixed points xo, Yo, 0 and XI' YI' ZI = J:~FI dt one which minimizes the integral
f:~Fo dt subject to the side-condition z' = FI(x, y, x', y') and the end-conditions that x, y, Z
are fixed for t = to and t I. This is, in effect, the reason for his recourse to space curves. What
he produces is the first example of a field in three-space.
233
o=
234
5. Weierstrass
+ (e,t),
where (e, I) vanishes with e for every value of t. If t' is conjugate to to,
8(/'-7",/0 ) and 8(t'+7",to) have opposite signs and so 8(t, to + e) does
also for t = t' - 7", t = t' + 7". It must therefore vanish for some value t
between t' - 7" and t' + 7". The change in the point conjugate to to caused
by a small variation in to is then arbitrarily small. These remarks apply
when fl' f2' f3 do not all vanish together.
In his 29th chapter he continues the discussion and returns to the form
for the complete variation of / I
that he established on p. 220 above and to the form for the complete
variation of /0
235
WI
ll
Glwdt= 0,
(S.34)
10
(S.34')
After this has been proved, the complete variation Al o for such a w reduces
to
J/o
'
and thus this variation has the same sign as e, i.e., it can be made either
positive or negative and hence there can be no extremum.
To make his proof, Weierstrass writes the relation (S.34') with the help of
(S.34) as
.(1 {_~(FI~~)+(F2-e)W-e3GI}wdl=0,
I.
d ( dOs(t, e) )
dl F2
dt
-(F2 -e)Os(t,e)=0,
(s= 1,2),
(S.3S)
t" ... t l
21The basic reason why the unrestricted problem of the calculus of variations and the
isoperimetric one differ is that for the former one
{' {FI( ~;
F2W2} dt~ 0
for every w vanishing at t = to and t l , whereas for the latter one w must satisfy the additional
condition
It is a priori evident that the isoperimetric problem is, in general, more restrictive. The proper
generalization for the isoperimetric problem of the notion of conjugate points was first
discovered by Lundstrom [1869].
236
5. Weierstrass
This function w is not identically zero since the solutions of the differential
equations (5.35) could be linearly dependent only if the equation G 1 = 0
were satisfied. Wei~rstrass explicitly bars this case. Now the function w
must clearly satisfy the differential equation
G 1= 0
on
= 10 and
1o ", I";
I" and is such that
He has
~o
J~
~o
J~
(I"G10.(t,e)dl= 8.(I",lo,e)
Jto
(v = 1,2,3).
e101(tO,e)
+ e202(tO,e) + e303(tO' e) = 0,
e101(t",e)
+ e202(1",e) + e303(t", e) = 0,
e I 8 1(1",l o,e)
(5.36)
are nontrivially satisfied. The determinant of this system differs from the
determinant 8(1",10) of (5.32) by an amount which vanishes with e.
It was proved above that 8(1",10) has opposite signs for I" = I' - T and
I" = I' + T; and consequently for lei very small, the determinant above
does also. This implies that the determinant must vanish for some value I"
between t' - T and t' + T. For this value, system (5.36) must have a solution
e 1, e2 , e3 =!= 0,0,0; and Weierstrass has shown that w has the properties he
asserted.
In his 27th chapter Weierstrass mentioned the special case when the
t9 -function vanishes along a curve. Here he discusses the case and after a
lengthy calculation concludes that this cannot occur in his spatial strip or
field. It is not pointful to include it here.
In his 30th chapter (pp. 269ff) Weierstrass gives examples to illustrate his
procedures. The first of these involves finding the surface of revolution
containing a fixed volume which, when moving through the air, will
encounter the least resistance. This is an isoperimetric version of Newton's
famous problem discussed above. In his 31st and final chapter Weierstrass
takes up briefly and tentatively two new classes of problems: those with the
variable end-points and those with restricted variations, i.e., those where
points or segments of the minimizing arc lie on the boundary of the region.
237
After an example to show that an arc may make the first variation
vanish and the second variation positive and yet not furnish a strong
minimum, he goes on to discuss what he learned from a "generous oral
22Scheeffer [1885], [1885'], [1886].
23 Scheeffer [1885'], pp. 594-595.
238
5. Weierstrass
and assumes that the values of YI' h, ... ,Yn are given at
turns at once to a consideration of the second variation
xO
and
Xl.
He
fx
Q2(1]),
t.L
x,
QI(w)dx.
239
Ull
U2l
U2 2
ul,2n
u2,2n
Unl
Un2
un,2n
U l2
Ll(x,xo) = Ull
U2l
U l2
u? 2n
U?2
u2,2n
Un2
Unl
U~2n
Here the uij are partial derivatives with respect to the parameters of a
2n-parameter family of solutions of the Euler equations.
Scheeffer sums up in the theorem:
Let the determinant ~(x, xo) not vanish for any value of the variable x
between Xo and x I; then the maximum or minimum of the integral J depends
upon whether the quadratic form
~I(W) =
a'a2a
F
L:ik ' WiWk
Yi Y k
J F(x, y, y')dx
X 1
xO
and asks that it be a minimum among the arcs in a given set joining two
fixed points. He assumes that "I. All partial derivatives through the third
order of the function F(x, y, l') are finite and continuous on the interval
xOx land '(JlF /ay,2 [= S'y'] is moreover positive throughout. II. A certain
determinant Ll(x, xo) is different from zero for all values of x between XO and
x 1 and also for x = x 1 and is finite and continuous together with its first
derivative.,,24
He goes on to define the function Ll in terms of two solutions u l , U 2 of the
accessory differential equation, (5.38) below, as
Ll(x,xo)
F(x,y
240
5. Weierstrass
where he has
and a corresponding third-order expression for '3 where y, y' have been
evaluated at some values y + Ol:ly, y' + Ol:ly' (0 < 0 < I). It follows from
this that since the integral of gl' the first variation, vanishes along his
minimizing curve, the total variation of his integral J is
Jxo
Jxo
G = 1 rx'
2
2Jx o
Jxo
that
= J~~F(x, y, y')dx
M=
l xl
xO
F(x,y+~y,1'+~y')dx-
lxl F(x,y,1')dx
xO
25 In his 1885 paper, Scheeffer has used u = ~(x, xo). This, or any other, choice is valid so long
as there is no point on the closed interval [xo, x '] conjugate to xo.
241
is positive for every function ~y whose absolute value remains less than a
given bound g while the absolute value of its derivative ~y' throughout
remains less than a second bound. If, on the contrary, a2F ja/ 2 is negative at
any place on the interval xOx I or if ~(x, x~ vanishes inside the interval xOx I,
then the second variation is also capable of taking on a negative sign and for
this reason no minimum can occur for the integral J.
Scheeffer remarks in his penultimate paragraph that the proof for the
second part of his theorem is carried out in his 1885 paper. His proof that
the second variation can take on either positive or negative values when
there is a conjugate point inside the interval xOx I is very similar to one due
to Erdmann ([ 1878], pp. 365-368). Let us look at Scheeffer's proof in
[1885], pp. 547ff.
He writes the integrand of the second variation in the usual form
u2uD = C,
(5.37)
= 0,
(5.38)
and he assumes that ~,y' =1= 0 along the arc being considered. He points out
that the constant C above cannot be zero when these solutions U t , U 2 are
taken to be the partial derivatives ay faCt, ay /aC2 of a two-parameter
family of solutions y(x, c t , c2 ) of the Euler equation. His proof that C =1= 0
depends on writing an expansion for y about x = xo. He has y = c t +
c2(x - x~ + ... and thus ay/ac I = UI = 1 + .. " ay/ac 2 = U2 = (xxo) + .... From this he has [u I U2- u2u;]x=xo = I and C = [aF/y/2]x=xo'
Furthermore, since
it follows that U2/UI is not a constant; and hence ~(x,xo) = U~UI - u?u2 is
not identically zero. (The expressions u~ (s = 1,2) mean Us evaluated at
x = xo. In what follows I have slightly modified Scheeffer's notation to
simplify matters a little.)
Scheeffer now considers the arc A A I in Figure 5.5 and fixes attention
on two arbitrary points A 2: (x 2, /) and A 3: (x 3, y3) on this arc and a point
A : (~, y(~) + 1/) between these points near to the given arc. He then forms
Figure 5.5
242
5. Weierstrass
two arcs e', e" neighboring to the arc A 2A 3 with the help of the variations
(5.39)
u=
U on [x2,~]
V on [~,X3]
0 on [X 3,X 1 ]
is then such that along it A?J = (A 2J)c'+c".Clearly, this variation is continuous since u(~) = v(~) but does not have a continuous derivative at x = ~.
Scheeffer then expresses the second variation evaluated along the arcs e'
and e" as
I E
I x3
(A 2J)c'+c" = ( A2J t, + (A 2Jt" = '2 g (U,u')dx+ '2 ~ g(v, v') dx;
i2
(this is clear since, e.g., U(X2) = 0, u(~) = 1)(~), v(~) = 1)(~), v(x 3) = 0) and
consequently he has, with the help of the accessory equation,
(A Jt,+c"-
'2 S'y"
(u 1U 2 - u2u;)A(X 3,x 2) 2
A(~,X2) . A(~,X3) 1) (~)
A(X 3 ,x 2 )
2
'2 C A(~,X2) . A(~,X3) 1) (~).
I
(5.40)
He next shows that each of the expressions A(x 3, x 3), A(~, x 2), and A(t x 3)
has the same sign as u1U2 - u2u;. To do this he notes, e.g., that
U 1U
23 -
X3 _
U U1
X
23 - U 2
3
X -x
3
U1 -
U1
U2
U1
-x
U2/
~(U2)=_C
dx
u1
u?F.v'y' '
243
U2/ u l
at the value ~ on
(This result is very close to one due to Sturm [1900], who studied the
roots of second-order linear differential equations. He showed, in effect,
that if U and v are independent solutions of such an equation, then between
two successive roots of U there is one and only one root of v.)
Scheeffer (p. 551) is hence able to conclude that "In order that (d2J)c'+c"
never be negative for arbitrary choices of the points A 2AA 3, it is necessary that
(I) ~'J( be positive except for certain places where it can vanish; and that (II)
d(x,x~ never vanish for XO < x < Xl. If d(x,x~ also does not vanish for
x = xl, then (!:l.2J)c'+c" is always positive; if, on the contrary, d(xI,x'1 = 0,
then (d 2J)c'+c" certainly can not take on both signs but can be made to
vanish."
From this he goes on to show how the conclusions about the sign of the
second variation, evaluated along the special arc c' + c", imply the same
conclusions for general comparison arcs (pp. 551-555). To see how he
handles this, consider the arcs in Figure 5.6. He examines a comparison arc
b joining AO and A I and defined by a function 'I] which is of class C', i.e.,
continuous together with a continuous derivative on XO ~ x ~ X I. The
segment between A and A he calls b' and between A and AI, b". The
segments c' and c" in Figure 5.6 are defined by the functions ii, i3 above
withA 2 =Ao,A 3 =A I.
b"
b~=-=-~
AD~
c---A'
Figure 5.6
244
5. Weierstrass
Scheefer now calculates the difference between the value of the second
variation along b' and along c' and finds for it
2
(t::.J)b'- (!::d)c'=
2F (
'2I i~ -a'aa
' 1/' - u;u~xO
Y Y
U.U 2 -
U2 U
?01/ )2 dx.
u2 u.
(5.41)
= '2 PQ ,
where P is the first and Q the second factor above and where
in (5.39). He next writes
(1:l 2J)
,=
ii
is defined
rx .iL(1:l
J) ,dx= 1 rx( dP Q + dQ P)dX
dx
2
dx
dx
Jxo
U
Jxo
= I:l(x,x~,
dP
dx Q = Fyy 1/ + ~y'
1/,
I:l(x,x )
U2U. -
U.U 2
dx
P - - Fyy'
,2
,
1/ + 2~y'1/1/
I:l(x,x )
U2 U I -
U 1U 2
0, ,)21/
(
yy
I:l(x,xo)
- F, ,
U2U 1 -
U 1U 2
+ 2F,
yy
U 2U 1 -
U.U 2
I:l(x,xo)
1/1/
Combining these, Scheeffer has his desired expression (5.41) above. He has
then shown that I:l(x, x~ *- 0 and ~'y' ;> 0 on [xo,~] imply that the difference (1:l 2J)b' - (1:l 2J)c' cannot be negative. In the same way
245
r = { ~(x,xo) + eU = u + eU
on
on
eU
[xo,x']
[x',xd,
= 0, and
fx~AUU'-u'U).
= 2e~'y'u'(x')U(x') + e2
XI
Xo
Uq,( U)dx.
246
5. Weierstrass
X!
F(x, y, y')dx
Xo
A 2 <y<B 2 ,
A3<P<B3'
with AI';;; x o, XI';;; BI and "with some or all the quantities AI,B I ,., B3
... infinite." He further supposes fixed end-points and assumes that the
functions y(x) to be considered satisfy the
Conditions A
(I) y(x) is a single-valued continuous function of x in the interval
x o ';;; x.;;; XI;
(2) y has a derivative y' at each point of the open interval X o < x
< XI'
247
M = J( a) =
is a minimum for a
X\
Xo
0 and
1'(0)
X\
Xo
{TJ0
(5.42)
Following Kneser, Osgood says that any function y satisfying (5.42) at all
points inside an interval (x',x") is called an extremal provided that all
points (x, y(x), y'(x lie in R.26 He also assumes that 0'y'(x, y, p) =1= 0 at
each point (x, y, p) of R.
Osgood then gives Schwarz's generalization of Weierstrass's notion of a
strip or a Jield about the extremal cY He assumes that there is a
one-parameter family of extremals y = </>(x, y), which contains C for y = Yo
and "which sweeps out the neighborhood of C just once; ...." More
precisely, it is assumed that
(I) a Junction </>(x, y) exists which, together with its partial derivatives </>x
= </>', </>1" </>x1' = </>~, is a continuous Junction oj the two independent variables (x, y) in the domain
Yo -
K ,;;;
Y ,;;; Yo
+ K,
248
5. Weierstrass
/02 =
L;2F (X,cp(X'Y2),cpl(X,
Y2))
dx,
i 21
I X,
X2
and denotes their sum by f(s). Thus f(s) = /02 + i 21 and f{l) = I' + J,
where I' is the value of / taken alon~ C from (x o,1b) to (xo, Yo) and J fro~
(x o, Yo) to (xI' YI); also f(O) = I' + J, where now J is the value of / along C
from (xo,Yo) to (XI'YI)' It follows thatf(l)- 1(0)= -(J-J).30 Osgood
now proceeds to establish the lemma:
The function f(s) is a continuous function of s in the interval 0 .;; s .;; I
and it has a derivative that is never positive and is sometimes negative,
provided that the function
f;,'y'( x, y, p)
is positive at every point (x, y) of C when to the third argument p an
arbitrary value is assigned.
28 A
definition of a field used by Bliss, LEe, p. 44 and due, in essence, to Hilbert is this: A
region F of xy space with a slope function p(x, y) having the properties (I) p is single-valued
and has continuous first partial derivatives in F; (2) the elements (x, y, p(x, y are in R; and
(3) the Hilbert integral
1*
is independent of the path for all curves in F with the same end-points.
29 Recall that Weierstrass actually used both A and A '. This device considerably simplifies the
analysis. As was mentioned above, it was Zermelo who, following Weierstrass, first published
this ([1894), pp. 87-88), and Kneser who gave currency to the notion (LV, p. 59 or LV', p. 76).
Mayer [I903) then applied these ideas to more general problems.
30Recall how Weierstrass calculated the derivative /'(s) on p. 216 above. Osgood asserts on p.
l16n of [190\ ") that his proof is superior to Weierstrass's. Actually both their methods are
difficult and were replaced by Hilbert's method, which is simpler and more elegant.
249
m)2~Ax2'J2,m
+ 9(M -
X\
Xo
F(x,y,y')dx
a minimum is
(I) that the points (xo, Yo), (XI> YI) can be joined by an extremal C satisfying the Conditions A ;
(2) that there exists a field about C ... ;
(3) that Fy,/x, y, p) be positive at all points of C, p being arbitrary in the
case oJ a strong minimum, and equal to y' in the case of a weak
minimum.
Osgood then takes up both Hilbert's and Kneser's methods. They are
not germane here, so we close the discussion.
6.1. Introduction
Much work went on in the calculus of variations without reference to
that of Weierstrass's for quite a while. One reason for this was that
Weierstrass's great accomplishments were generally made known to the
mathematical community through the dissertations of his students. The
other reason was the development of a school interested in generalizing the
scope of the calculus of variations to more general problems than Weierstrass was considering.
In this chapter we start as an introduction with Clebsch's work in 1857
which, of course, preceded Weierstrass's; and we continue with an analysis
of Mayer's papers which follow naturally out of Clebsch's. Another writer
of this period is von Escherich, who followed closely in Clebsch's and
Mayer's footsteps. However, his papers are so verbose and repititious of
earlier writers that it seemed to me unreasonable to attempt a systematic
study of them, but he did emphasize clearly the notion of abnormality (von
Escherich [1899], p. 129) and gave an important form for the second
variation on p. 1278 of that paper.
251
higher than the first. 1 It is unfortunate that BoIza in VOR attributes much
of this work of Clebsch to von Escherich who wrote 40 years later. It is true
that a good deal of Clebsch's analysis lacked the rigor one seeks today but
no:p.etheless the basic ideas in these two papers, Clebsch [1858], [1858'],
constitute a watershed in the calculus of variations. The work of all later
writers is profoundly influenced by his ideas, and it is a pity that his
contribution is so little regarded today, many of his ideas being attributed
to Mayer or von Escherich.
Specifically, he considers a function f of x, Y = (YI' Y2, ... ,Yn), Y'
= (Y1' y~, ... ,y~) on the interval a ..;; x ..;; b and asks that the integral
Lbf(x,y,y')dX
(6.1)
= 0,
</>2
= 0, ... ,
</>"
(6.2)
= 0,
F= f+ L" Ap</>p,
p=1
where the Ap are Lagrange's multipliers. Notice that Clebsch, like all his
contemporaries, assumes the coefficient of f to be unity, i.e., that the
extremal under consideration is normal; it was not until A. Mayer in 1886
and von Escherich in 1899 that the possibility of abnormal arcs was really
considered.2
Be that as it may, Clebsch writes out the Euler-Lagrange equations
which, he says, must be satisfied by an extremal
aF=F"=.!L
aF=.!LF,
12
dX a'
d
" (.l = "
a
'Yi
'Yi
X
ori
ori
...
,n,)
(6.3)
(In what follows I shall partially follow Clebsch's notations, but to simplify
matters I use repeated indices for summation. The indices i, j range over
1,2, ... , n; p,o, over 1,2, ... , Ie; and r,s over 1,2, ... , 2n.) He now
IClebsch [1858], pp. 254-273. In another paper ([1858'], pp. 335-355) in the same volume he
goes beyond this, as we shall see below.
2Mayer [1886], pp. 79; von Escherich [1899], pp. 1290.
3Clebsch does not make clear a condition on his f/>p that is really needed. It is that the matrix
of the </>pYi is of rank Ie < n everywhere in the region being considered.
252
v = LbF(X, y, y',A)dx
and replaces his minimizing arc y by Y + W, where the ware "arbitrary
functions of x, however signifies a very small number." Then he notes
that V can be expanded into V + VI + 2V2 and that VI vanishes along
the extremal y. He concludes that for a minimum, V2 must be positive
(nonnegative) and for a maximum, negative (nonpositive). He does not in
this place recognize that the variations w must satisfy the conditions
4>PYiWj + 4>py;W; = O.
The integrand of V2 is now the object of Clebsch's study. It is, as he
says, a second-order homogeneous function and is expressible as
(6.4)
0", =
fx 0..;,
O"p = 0,
(6.5)
where
= 4>PYiUj + 4>py;u;.
= YrYje,'
/Lp = Y~pe,
and notes that they must satisfy the accessory equations (6.5) for all Yr' This
is Clebsch's generalization of Jacobi's result, which we noted above.
In what follows he writes the integrand 20 as
where
write
O~
then can
253
equations (6.5) (this is not his notation) and proceeds to show that
Ui~v: - Vi~U:
= const.
(6.6)
(This is often called Clebsch's relation.) To do this he notes with the help of
(6.5) that
= dx (Ui~v:)
and with the help of the homogeneity of
that
(6.7)
He now desires to consider only those u, J-L for which the constant in
(6.6) is zero. (In more modern parlance, he chooses a conjugate set of
solutions of the accessory equations; such sets exist, as we shall see later.)
The maximal number of linearly independent conjugate sets can be shown
to be n, and later we shall see how they can be found. Clebsch implicitly
assumes he has such a set. (These matters received more thorough discussion by Clebsch [1858'], Mayer [1886], and von Escherich [1898], as we
shall see below.) In his notation such a set is u(r), J-L(r), but for notational
simplicity I prefer to write Uik ' JLpk. He asserts that there is a symmetric
matrix Pij such that
(i,k
It is easy to see that for such a matrix P
conjugate set since
U'k~
, - u'/~
,
,
UiJ
,
Uik
and a matrix M
JLpk
ajiujk
(6.8)
uik form a
JI
lj
(i,k = 1,2,.,., n)
(6.9)
(6.9')
Later he exhibits the form of the aji by inference as quotients of determinants, but not the form of the Mpj" (He writes the matrices aji,Mpj as ap)
and M p ' respectively, with no other index on M.) In terms of these matrices
Clebsch now has some relations he needs shortly. For uj = ujk ' JL p = J-Lpk'
~";k
~U;k
(6.10)
(6.11)
254
relations (6.9) and (6.9') for a and M that equations (6.8) become
P;k
(i,k
(6.12)
P(k
since the equations 0,. = 0 imply that Pp;u;k + qp;U;k = 0 and hence that
PplUlk = - qp;a/iu1k , He ~sserts that the n 2 + me equations (6.12) and (6.12')
suffice to find the matrices M and a. When these values are substituted into
the conditions above for P(,., he finds the differential equations
P(k = a;k - ai/cljakj
+ Pp;Mpk + PpkMp;'
(6.13)
+ wiaijcjkWk + MpiWiqpkWk;
where
+ 2Mp;w;(PpkWk + qpkWk) =
w; -
fx w;P;kwk'
wjaj;.
(6.15)
(6.16)
(6.17)
where
2F = W;cij JJj,
-i'p
= Mp;w;,
2B= w;Pij~'
cPp = Pp;w; + qp;w;.
(6.17')
Moreover, by the relation just after (6.14) and (6.12'), the cP can be
expressed as linear functions of the W
(6.18)
255
since
Pp = - qpj(Xijwi + qpjW; = qpAw; - wi(Xij)'
(This elegant result is due to Clebsch.) Clebsch now has his transformation
of the second variation in relations (6.17), (6.17'), (6.18). This is a generalization of Legendre's form and also of Jacobi's. From it his necessary
condition follows directly, as we see from the fact that for variations W
vanishing at the end-points and satisfying (6.18), Clebsch has shown that
V2 = J Fdx= J'Fdx.
He then investigates the form of the W. He has with the help of (6.9)
and (6.16), the relation
w2
U;I u ll U21
1
W.=I
R U;2 u\2 U22
un2
ufn u ln u2n
unn
W~I
wn
WI
unl
where R, the determinant of Uik ' must be different from 0 for each x.
Clebsch now proceeds to show how his method is applic~ble to the
problem discussed by Jacobi in which the integrand function f is of the
form
f( x,Y,Y ,Y , ... ,Y (n
I
II
and which caused so many commentaries to be written. Clebsch is concerned with minimizing or maximizing the integral
-Lbf
V-
(Y,dY
d"y)
-d , ... , -dn-y
- - - I 'd n dx.
X
dxn
X
dy
4>1 = dx - YI = 0,
dYI
4>2 = dx - Y2 = 0, ... ,
F = f + Ap4>p,
/y = A;,
4>n-1
K
dYn-2
= ~
-Yn-I = O.
J = 1,2, ... , n -
2),
/y._,-~-I= dx/Y~-"
When he eliminates the Ap from these equations, he finds the well-known
result
d it + -d 2 r - + ...
itY - -dx
y,
dx2 JY2
O.
256
a'1
a(dYn_l/dx)a(dYn_l/ dx )
;;1 ;
(du)2
where
U(I)
R=
U(2)
u(n)
uP)
u(2)
I
u(n)
I
u(l)
U11)
n-I
U12)
U(2)
u1 n)
u(n)
n-I
n-I
U(I)
U(2)
u(n)
du(l)
dn-IU(I)
dx
dU(2)
dx n- I
d n- IU(2)
dx
dx n- I
du(n)
dn-Iu(n)
dx
dx n- I
and
W=
WI
W2
u(l)
U(I)
I
U11)
u(n)
u~n)
u1 n)
dwn _ 1
dw
dx
dX
dU~I)
fiX
du~n)
dx
d~
d~
dx 2
dxn
dnU(I)
dx n
u(l)
du(l)
d'lu(l)
dx
dx 2
u(n)
du(n)
dVn)
dx
dx 2
dnu(n)
dxn
In these results the u(i) are a set of n conjugate solutions. The paper then
closes with a discussion of multiple integral problems.
In closing our discussion of the first of Clebsch's papers in the 1858
257
volume of Crelle's Journal, it is worth stating formally the Clebsch condition for the problem of Lagrange. As stated by Bolza,4 it is
For a minimum of the integral J with side-conditions tPp = 0 it is ... necessary that at each point of an extremal arc
a2F
~
-'aa' ~i~k>O
i,k 'Yi 'Yk
for all systems of quantities ~ \' ~2'
~
i
, ~n
atPp
-a
' ~i=O.
'Yi
Notice that Clebsch did not quite prove this condition explicitly in that he
did not show how, given the W, he could find the corresponding w which
satisfy the differential equations (6.16) as well as the relations (6.18).
Nonetheless, he did show the elegant way to handle Jacobi's general
problem.
258
4>2 = 0, ... ,
4>1 = 0,
(6.20)
subject to the conditions (6.20) can be transformed into the problem just
mentioned by setting
J =i x1 dV dx.
xO
dx
This is a version of the problem now called the problem of Mayer, and
Clebsch's general problem is now called the problem of Lagrange.
He reverts to the problem of Lagrange, as he formulated it, and sets
Q= F
(6.21)
aQ
a(!)Ii
d /d
x)
aQ
-a
Yi
(i
(p
(6.22)
,y2) with
(6.23')
xO
6Clebsch's discussion in his Sections 2 and 3 is flawed. However, the germs of several
important ideas are in there, and I have therefore felt it reasonable to replace his discussion by
a correctly formulated one. I have made this discussion conform as closely as possible to
Clebsch's despite the fact that there are much more elegant ways to handle the problem in
terms of Hilbert's invariant integral.
259
~~
au
axo
F(x, Y, Y'),
_FlxO + (XL { an
Jxo
_
IXO
--F
aYj + an ay/} dx
aYj axo aYl axO
an aY IX '
+La-r-o
Yj ax
j
XO
au _ '" an aYj IX
aaj - f aYl aaj ,
since an/aAp = 0 and
(6.23') that
aYJaaj
au = FIX
ax
'
Yj[~;xO,yO,A(xO,yO;~,1/)J.
In terms of this family, define another function V(xo, yO;~, 1/) by the
relation
We then have
av
a~
aU) + L
= (
ax
( au) aA j
av _ L ( au) aA j
a"!; -
aaj
aaj
a"!;'
a~'
260
av
= (aU)I~ _ ~ an Y!I~'
a~
ax
fay;
av; an I~
I
a'llj =
ay} .
av +~
a~
aVY'=F
i a1Ji I
,
av = Vi'
-a
Yi
dVi
aH
dx = - aYi
(6.25)
av = ai
-a
ai
(6.26)
By solving the second set and substituting the solutions into the first set, he
has
V = vi(x,a,a)
y = Yi(x,a,a),
with a = (a"a 2 , ,an)' a = (a"a 2 , ,an)' They form the general solutions for equations (6.25). The proof, which Clebsch does not give, is
simple.
Suppose that the solutions Yi(X, a, a), vi(x, a, a) are substituted into the
Hamilton-Jacobi equation above and the resulting identities differentiated
with respect to aj and Y)' There results the identities
(6.26')
It also follows from the relations (6.26) by differentiating with respect to
x that
Vx~ + ~ y;~,~ = 0,
I
V} =
VXJ'i
+ ~ Y;~,y/
I
261
When we compare these sets of equations with (6.26'), we see that the
canonical equations (6.25) result. This is, in essence, what Clebsch remarks
at the end of Section 2.
In Section 4 Clebsch turns to a discussion of the second variation. To
this end he considers the two functions (in his notation)
He remarks that if 6Yi> 6>-'; are substituted for ui' ILi' these become the first
and second variations of O. (Notice his unhappy use of the index i to range
over both the sets 1,2, ... , nand 1,2, ... , Ie.) In his earlier paper [1858]
he had shown that
2
6J
= "2 Jxo
"7 f
a20
RiRk
I
a(dyJ dx) a(dYkl dx) /i2dx+ B - B ,
where the Ri are linear functions of 6Yi and MyJ dx and B 0, B I are the
values at x = Xo and x I of a homogeneous function B of second order. The
Ri are related by the equations
262
where the
:t
Pk;U{=
an )r
( a(du)dx)
an2
anI
a( duJ dx)
a( dyJ dx)
an2
a( duJ dx)
a(dyJdx)
an
or equivalently
(6.29')
as can be seen with the help of (6.25). (In these and in the following
relations k and m range from I through 2n.) When these are substituted
into equations (6.28), there results
7In what follows Clebsch uses symbols such as (a v jay;) or (a v jaaj) to mean the appropriate
partial derivatives of V(x, y, a). However, when he writes [a(a V jay;)jaad he means that
(a V jay;) is formed, evaluated along the family of extremals y = y(x, aI' ... , a2n) and then
the resulting expression is differentiated with respect to ak' In this connection note that
[a V jaakl = ~j(a V jay;)[ayJaakl + (a V jaak)'
263
(The A ',A S are the constants in (6.27) associated with u' and us, respectively.) Clebsch thus has
O=~~((AA:'-A;'Ak)~([
k
ayi ][ _a
aak
aam (av)]
aYi
(6.30)
He proceeds to assert that this relation (6.30) can represent at most one
equation between the A, and thus that the quantity inside the sum as to i
must be a constant. To see this note that
~([aYi][
a (av)]
a (av)])
aak aam
aYi - [ayi][
aam aak
aYi
i
since the additional terms introduced cancel each other. Clebsch further
notes that
a(v)_[av] (av)
aYi - dak - aak - aak - aak .
~[aYi](av)_d(V)
aak
a!k (
(6.31')
He now chooses the constants aI' a2, ... , a2n so that they are the sets
al,a2' ... , a ,al,a2' ... , a" in (6.26). He then has V", = (V)", = a (i = 1,
n
2, ... , n). Notice also that (V)o,C>; = 0 since V does not contain a explicitly.
Equation (6.30) then reduces to the relation
n
k=l
(AA:+k - A:'+kAk) = O.
Thus expression (6.31) has the value I for k = 1,2, ... , nand m = n + k
and the value 0 otherwise.
Clebsch now divides the constants AI,A 2, ... , A2n into two sets: for the
first n he writes 4" 42, ... , A,n, but for the second n he writes A I'
A 2 , , An. In this notation he rewrites (6.30) in the form
o=
~ (bAt
k
- AlcAk),
(6.32)
264
where "the sum now only extends from 1 to n."g He goes on to say these
are the equations that the constants 4,A must satisfy and that he will show
they are actually identities. To do this, he says that, in general, one can
always find n 2 quantitites C so that the equations
.
A':' =
L.J cjm 4;
~
(i,m
(6.33)
~~
0= L.J
k
From these n(n - 1)/2 relations he concludes that relation (6.32) can be
replaced by the condition that C = (Chk ) is a symmetric matrix; or, as he
expresses the matter,
"The arbitrary constants 4, A are related to each other by an arbitrary
system of linear equations with a symmetric determinant." He concludes the
section with the remark that the 2n 2 interdependent constants 4'ic,AI can
be replaced by the n 2 + n(n + 1)/2 linearly independent constants &C, Chk. 9
In Section 6 Clebsch proceeds to his first form of the second variation.
To this end, he inquires as to how the choice of the constants influences the
quantities Rand R;. He substitutes the values for A~ in (6.33) into relations
(6.27), finding, in his notation,
k=n
{[ ay. ]
ay' ] + C2k [ aa:
ay' ] + ... + Cnk [ aa:
ay' ]} '
u/ = k~l
AI
aa~ + Clk [ aa:
= A 1'v.I 1 + A 2'v~I
+ . . . + A n'vn
I
where
(6.34)
u:
v:.
265
a~
a~
a~'
This he remarks "is a determinant of the same nature as R itself, only that
in place of the Vk the functions dvJ dx appear." He notes, moreover, that
this expression transforms into
A (dV/
as + dvl . as + ... + dv jn as )
~ av~
dx
avi
dx aVh
'
B2J=lixl~~
2 xO
a20
. SjS2kdx+Bl_BO.
k a(dyJdx)a(dyddx)
S
(6.37)
aq,p
~Sj a(dyJdx)
(p
(6.38)
This is easy to see since equations (6.35), together with the equations
a02
( a(duJ~)
)r = A,Oj
r, + 420j
r 2 + ... + 4"Oj,
rn
266
where
[a
[a
[a
+Cin [
a~n ~;)].
(
He concludes from this with the help of the relation following (6.29) that
0=
2: ~ { f3\i V f + f32i V {
k
an )' = n~.
( a(dvjdx)
(6.40)
Relations (6.34), (6.36), (6.37), (6.38), (6.39), and (6.40) completely define
the new form of the second variationY
In Section 7 Clebsch proceeds to his second and final form for the
second variation, calculating both the parts under the integral sign and
outside. To do this, he proceeds to examine the expressions [ayJ ak],
[ayJaad. Recall that
( av)=a.
aai
I
(6.41 )
[1858'], p. 348.
267
i,k. It then follows that equations (6.41) can be solved in the form
_ [ aYi ] _ ) ( a2v ) +
P aak - Pi aa) aak
Pi
(6.42)
and equations (6.41') in the form
aYi ]
k
P [ aak = Pi'
With the help of these relations, Clebsch finds a new form for the
functions v. By (6.34), he has
(6.43)
as
as
as
(6.44)
~(W)
P
aw
aW)k
aw + ... +w
2aW
2k
+w
aw).
aWnk
268
1.(
oW c5a l + aw c5a 2 + ... + aw c5an).
P aWlk
aW2k
aWnk
A...
W1kPiI + w2kPi2 +
dx
. ..
Clebsch now notes that the coefficient of d(p: Ip)1 dx in this relation is,
apart from the factor 1I p,
aw
Wc5ar - "'(
4.J -a-c5al
k
W1k
aw
aw
) Wrk'
+ -a-c5a2
+ ... + -a-c5an
W2k
wnk
Si =
where
From this it is not hard to see with Clebsch that Th is expressible as the
determinant
dc5ah
dx
dw 1h
dx
Th = dw 2h
dx
dwnh
dx
c5al
c5a2
c5an
wlI
WI2
Win
W21
W22
W2n
Wnl
Wn2
Wnn
Moreover, when these are introduced into the second variation (6.37), the
expression under the integral sign is a homogeneous function in Thl W of
269
the second order, and the coefficients themselves are the expressions
~t
a2~
p/'p(
a(dyJdx)a(dYk/ dx )
Jxo
(6.46)
4>1 = 0,
4>2 = 0, ... ,
4>m = 0,
(6.47)
where the 4>k are differential equations of the first order and m < n. He also
states that the values xo, x 1 as well as the end-values of yare specifiedP He
then proceeds to determine the y and A so that the integral
(XI
J= JJ..o ~dx
12Mayer [1868].
(6.48)
270
(6.49)
he also points out that inasmuch as the end-values of the yare fixed, the z
must vanish at Xo and Xl' He then, as usual, finds the Euler-Lagrange
equations
(6.50)
He now assumes explicitly that the determinant
a<Pl
ay~aYl
aYl
a<Pm
aYl
aYl ay~
a20
a<Pl
a<Pm
ay~ ay~
ay~
ay~
a<Pl
a<Pl
a20
a20
aylaYl
a20
R=
aYl
ay~
a<Pm
aYl
a<Pm
ay~
(6.51)
ao
ay;'
= Vh'
(6.52)
aH
aVh '
(6.53)
~hYhVh- f
1
(6.54)
13See, e.g. Bliss, LEe, pp. 65ff. This is the hamiltonian, which Hamilton called his principal
junction, as we saw above on p. 179. See also Mayer [1886'].
271
and hence
2n2 =
+
2::2:;
{[ aYha2~.
]Zh Z;+2[ aYha~y,~ ]ZhZ:+[ aYha,2~y,, ]ZhZ:}'
1
y,
(6.55)
(6.56)
a(a[Yh]/aa;)
= dx
=0.
272
where the Yj are 2n arbitrary constants. Then the accessory equations above
become
a0 2( U, r)
d a02( U, r)
aUh
- dx a( dUh/ dx) ,
(6.58)
With the help of these equations, Mayer now simplifies the form of 202
and finds by its homogeneity that
202(u,r)
= dx
n
a0 2(u,r)
i>h a(duh/ dx) ,
and for two different sets u, rand w, 'T of solutions of the accessory
equations (6.58) that
~ {
LJh
I
wh
:t
a02( U, r)
ark
a0 2 (u,r)
= dx
~
+LJk'Tk
I
:t {a0
n
a02(w,'T)
2(u,r)
h Wh a( dUh/ dx) - Uh a( dwh/ dx)
= const.
(6.59)
= ~v.o a[YhJ
I
2n
a[lI. J
= ~v.o
_ _k_
rko
aa '
LJII
aa .
LJII
I
(6.60)
(6.61)
Zh=2:ogoU,,",
I
dZh
dx
= rh + 'T/h'
rh = 2:ogo
I
duo
dh ,
x
'T/h
dg
o
= 2:odx U,,".
I
(6.62)
For notational purposes, he calls O~ the part of O2 formed when the values
in (6.60) for u, r are substituted into O2 but the value is put in place of
dz/dx = + 'T/.
Mayer now remarks that if the go were constants, then the quantities z, JL
of (6.61) would be solutions of the accessory differential equation and thus
o_ d
202 - dx
gouh
a0 2( uP, r P) }.
pgp a( duJ:! dx) ,
He goes on to observe that since 20~ does not contain the derivatives of the
g, this is indeed the form of 20~ whether the g are functions of x or not,
273
provided that the terms containing the dg / dx are removed. He thus has
a20g
aog
202 + 2t h a~h
d n
a02( uP, rP)
'l/h= dx thZhLpgp a(duh/dx)
n {n
th t"
dg
n ao (uP rP) }
d: uh't pgp a(;uJ:;dX)
L~Li
dg"
gp dx
th
n
Pa0 2(u",r") }
"a0 2(u P,rP)
uh a(duh/dx) -uh a(duh'/dx) .
a20g
a202
[a2o]
202 =
d n
n
a0 2 (u P,rP)
n
dx thZhtPgp a(duh/dx) + LhLi
274
where
Zh
= ~ aU;
1
g",
P.k
= ~ ark
1
g",
and Mayer notes that the coefficient of g" is a02(u",r")/a'k and thus is
zero, as is seen by equations (6.58). This gives the form
n [aq,k]
h aYh lIh'
8q,k =
He now goes to find expressions for the quantities that enter into the
second variation. To do this, he eliminates the ga between the equations
Zj
dzh dul
du;
dUhn
-lIh + dx = dx gl + dx g2 + ... + dx gn
dul
dut
dx
dx
ZI
uf
un1
Zn
unl
unn
U 'lIh =
t "gIlt
n
0
= -
a02(u l,rl)
th Zh a(duUdx)
= Uh,
(6.63)
a02(u",r")
h Zh a( duC/ dx)
n
th
a02(u n,r n)
Zh a(dut/dx)
ZI
u 1l
u1
Zn
unl
unn
= -2B.
(6.64)
275
o
U ILk = -
(6.65)
Z\
(6.66)
with
(6.67)
In Section 3 Mayer points out that Clebsch's relations (6.66) and (6.67)
hold identically for all 2n 2 constants 'I/' which satisfy the n(n - 1)/2
conjugacy conditions
:th
n
a O~2(uP,rP) _ P O~2(Ua,ra) } _
Uh o(dut:/dx) Uh o(du:/dx) -0
(6.68)
(6.70)
OVh
OVa
~i o[ Vh] [ o(ai )
1
oai
OVa
J,
276
an2(u",r")
a( du;: / dx)
_
-:t
Xw
-:t
Y;" = [
by the relations
a~:: L;
u;: = 0 and
Yi"
Xw
(6.70')
he has
an2(u",r")
a( du;: / dx)
= 0 or I,
a(x,x w) =
a[yJ]
a[yJ]
a[Yd
aa 1
---aa;-
a[Yn]
a[Yn]
aa 2
a[Yn]
aa 2n
a[Ydw
aa 1
a[Ydw
aa2
a[Ydw
aan
a[YnL
a[YnL
a[YnL
aan
aa 1
aa2
aa 2n
(6.71)
in which the subscript w attached to an expression means that the expression is to be evaluated at x = xw' He also defines a determinant A, which
he writes, as was common in his era,
and A w' which is A at x = xw' He then concludes with the help of identities
(6.70) that
277
_L
V-
a[ VIJ
~ ...
a[ vnJ a[YIJ
--aa:aan
+ 1
a[YnJ
aa2n .
h~'{
I
av
a[aCVh)/aajJ
A... a[ VhJ +
dx
aa j
av
A... a[YhJ }
aca[YhJ/aa j) dx aa j
To simplify this, he substitutes the functions [y],[A] from (6.69) into the
canonical equations (6.53). By differentiating the resulting identities, he
finds at once that
A... a[VhJ
dx
aa j
= _
+[ aYha2aVk
a[VkJ }
aa j .
dV
dx
=Vh{[.1!L]-[.1!L]}
1
aVh aYh
aYh aVh =0.
y/, =
a~:j) ]
a
,
x-x w
U = C I1Cx, X.,)
where C is a non-null constant independent of x.,. (Mayer [1868], p. 252)
278
Xo
and
XI'
[a
Q]
if'"if'
'Yh 'Y,
2
UhU;,
= PI vf + P2 vi +
n
2.h
I
Uh2 =
vf + vi + ... + vn2_ m ,
(6.74)
(6.74')
279
[ ayia2~aYi ] [ a2~ ]
aYi aj~ ,
p,
[ a2~ ]
ay~ aYi '
[ ay~ay~
a2~ ] -
p,
[ a'
a~l ],
Yl
[ a~m ]
ayi
[ a'
a~l ],
Yn
[ a~m ]
ay~
[ a'
a~l ],
Yl
[ a~l
]
a'
Yn ,
0,
[ a~m
]
a'
Yl ,
[ a~m
a' ],
0,
Yn
=0.
The part of this equation which is free of p is exactly the determinant [R],
which is, by hypothesis, not identically zero. Thus none of the p can vanish
identically. Since 2 W cannot change sign for X o ~ x ~ Xl' the p must
therefore be always positive, or always negative; and 2 W can, for a given x,
vanish only when the corresponding n - m quantities V; all vanish for that
x, and this can only happen when all the Uh do also, as we see from (6.74').
The second variation can then only vanish for variations Zh which, for
some x, satisfy the n linear differential equations of the first order
U2 = 0, ... , Un = O.
(6.75)
and vanish at the end-points Xo and Xl. It is clear from (6.63) that equations
(6.75) are linear differential equations of the first order and have as
solutions the ut, so that their general solution is expressible as Zh = clu1 +
c2ui + ... + cnuhn with the C arbitrary constants. The only way that such a
solution Z can avoid being zero at a given X is for the determinant U to
vanish for that x. But by hypothesis, U is different from zero on the closed
interval [xo,xd, and so there is no set of variations Z on (XO,Xl) which
makes the second variation vanish. This completes the proof.
Mayer now remarks that by the hypotheses of Theorem II, i.e., that
U =1= 0 on the closed interval, the condition that 2 W be of one sign viewed
as a function of U for each X on [xo,xd is both necessary and sufficient for
a minimum. (Note here and elsewhere that Mayer considers only weak
minima.) He has just proved it is sufficient. To prove it necessary, suppose
that 2 W could change sign for a particular X on [xo,xd; then clearly one of
17Mayer [1868], p. 254 or BoIza, VOR, pp. 608-609.
280
the p would have to change sign, and in this case the second variation
would also, contrary to hypothesis. IS
He concludes the section with a result originally stated by Richelot:
III. So long as the arbitrary constants y;" can be so determined that the
determinant U is never zero on the interval [xo, x d, then also the second
variation can not vanish.
on2
OZh
on2
(6.76)
= dx OZ;' ,
is of the form
O[YhJ
O[YhJ
Zh = Y I - - +Y2-oa l
oa2
ILk
O[AkJ
= YI--
oa l
O[YhJ
+ ... + Y2n--'
oa
2n
+Y2-oa 2
2n
(6.77)
(6.77')
since
213
n = 2n
d ~
on2
L.JhZh-,
dx I
oZh
= -
for all Z which satisfy the accessory equations (6.76). Mayer concludes from
the hypothesis of Theorem III, U =1= 0 on [xo,xd, that the second variation
cannot vanish and hence necessarily the value x' > x I'
In Section 7 he considers the converse problem. If the upper end-value
XI lies between Xo and x', then the y/, can always be so determined that the
determinant U =1= 0 on [xo, XI]' Moreover, it follows by the methods described in Section 6 that if a(xo, xl) = 0 for two different values x o, xi on
ISIt is shown in Bliss, LEe, p. 254 that the second variation is positive definite for all
variations Z satisfying l3</>k = 0, zh(xO) = Zh(XI) = 0, if and only if there is conjugate system Uk
with U 0 on the interval [xo, xd. Bolza curiously does not credit Mayer with this result.
281
[xo,xd, then the second variation can be made to vanish on [xo,xd. This is
perfectly clear. He therefore concludes that
IV. If it is possible to choose the constants yt so that the determinant U
is different from zero on [xo, xd, then the determinant d(x,xI) can not vanish
for any value of x which is smaller than XI and greater than or equal to Xo
u = c a(x, X.,)
with C a constant independent of x.,; and he chooses a value X O so that
Xo < XO< XI < x', where x' is the first root of a(x,xo) after xo. Then for
x., = xo, he has U = C a(x,xo) =1= 0 for x.,;; x < XI; moreover, a(xl'xo)
=1= 0 since a(xo, XI) = (-l)na(xl'xo)
He now remarks that so long as Xo < XI < XI + f < x', then a(x,xl + f)
=1= 0 on the interval [xo,xd. From this he concludes that
V. So long as the larger end-point XI lies between Xo and the next root x'
after Xo of the equation d(x,xo) = 0, the given integral will be a [weak]
maximum or minimum for those functions y, which make the first variation
vanish provided that the homogeneous function
n
2W= LhL;
--1-
[a
g]
~ UhU;,
2
'Yh 'Y.
In Mayer's final section he considers the curves [Yh] passing through the
fixed end-points (xo, Yo) and (XI' YI)' i.e., having
YhO
= [Yh]O'
Yhl
= [Yh]l;
and he supposes that these equations hold for a l ,a2 , ,a2n and for
a l + Ba l , a2 + Ba2' ... , a2n + Ba2n with Ba l , Ba2' ... , Ba2n restricted by the
conditions
... + a[Yh]O
--Ba
aa2n 2n'
aa I
aa2
aa2n
2n
282
Mayer's paper closes with a few remarks on how to treat the variable
end-point case.
< n pre-
(6.78)
(k
which have given values at the end-points Xo and x .. and which are continuous along with their derivatives, to find that one which furnishes the given
integral
283
(6.81 )
where f
= F.
1, 2, . . . , m)
(6.83)
"*
Ully
== iX1dx
Xo
m+!
s(
~F
-:.
Ys
g\) 8ys= O.
Ys
(6.84)
With him, let v: (K, p = 1,2, ... , m) be "the linearly independent system of
solutions VI' V 2 ' , Vm of the m abridged linear differential equations of
the first order:
(r = 1,2, ... , m)."
(6.85)
20There is an essential gap here in Mayer's reasoning. He has not shown that the given arc
y = y(x) can be embedded in a family of arcs y; = Yi(X, b) satisfying the side-conditions (6.78)
and having
y;(x,O) = Yi(X),
ay;(x, 0)
-a-b-=8y;,
for any preassigned continuous variation 8y; such that 8y;(xo) = 0 = y;(Xl). This gap was first
filled by Kneser, LV, pp. 16lff, and by Hilbert [1905], [1905']. See also Bolza, VOR, pp.
566-569 and Bolza [1907], pp. 370.
284
Tx2:iI8Yi2:K
2: S8YS2:K
V: a- TV:a'
= o.
I V:a'
Yi + m+1
1
Ys
X
Ys
d ( n
O</>K )
m (
O</>K
O</>K )
m+1
v:
(6.86)
He now views these as m linear equations in the m unknowns 8YI' ... , 8Ym'
The determinant of the system has been chosen by Mayer so that it does
not vanish at Xo or x I' It is
""
12
m""
o</> 1 0</>2
O</>m
L.J VIV2 ... Vm L.J if' ~ ... if"
YI liYz
Ym
Mayer then observes that since all variations 8y must vanish at
WJ;
==1
XI
Xo
dx
m ( O</>K
d
O</>K)
2:n 8YS2:K
V: a- TV:a'
= o.
1
Ys
X
Ys
m+ I
XI'
(6.87)
These are then m conditions that the 8ys must fulfill. To exhibit the 8ys' he
writes
m
8ys = Zs + 2: "a"us"
(s = m
+ 1, ... , n),
(6.88)
where the us" are arbitrary, as are the zS' except that the us" and Zs must
vanish at Xo and XI' Then the Win (6.87) must be such that
m
Wi + 2: "a" W:.= o.
(6.89)
a" =
13;
2:I p 13;W:
with
dependent only on the us".
Mayer next substitutes the values (6.88) for 8ys into formula (6.84) for
USy and finds that
285
or equivalently that
m
Uz
+ 2:p Yp W1'= 0,
1
with
m
= 2: af3;Uu
Yp
where the functions z are arbitrary, except that they must vanish at Xo and
Xl' But this implies that the coefficients of the Zs must vanish.21 That is,
since by (6.85)
~
(acf>"
L.J" PICa
1
Yr
d
acf>,,)
- TP"if'
Yr
for
=0
(6.91 )
PIC
= 2:p YpP:,
1
it follows from (6.83) that not only must the minimizing arc satisfy
aF _ A.. aF
aYr
dx ay;
=0
= 1, 2, ... , m ) ,
(6.92)
aF d a F ~ (acf>" d
ays - dx ay; + f PIC ays - dx
PIC
acf>"
)
ay; = 0
(s = m + 1, ... , n),
(6.93)
af + 2:"
m
acf>" -a
A"-a
Yi
1
Yi
d (
-d
X
af + 2:m "A"-a
acf>"' ) = 0
-a'
Yi
1
Yi
(6.94)
286
(6.90) to be not zero for the set usa vanishing at Xo and X\, However, if there
is a solution "1' ... , "m of the m differential equations (6.85) which also
satisfies the n - m equations
~ (a</>"
d
a</>,,)
~"
""a
I
Ys - (i""a'
X
Ys
(s = m
+ 1, ... , n),
(6.95)
then clearly the W in (6.87) will vanish. In this case there are constants
10 = 0, A.c such that for
F = lof + AI</>I
the equations
(i = 1,2, ... , n)
are satisfied. Not all the A.c can be identically zero. It was Hahn who called
this the abnormal case.22
In Section 2 Mayer turns to the case where his new side-conditions are
l/Ia(x'YI"" ,Yn) = 0
(6.96)
(6.97)
al/l l al/l2
- aYI aY2
al/lp
ayp
(6.98)
(6.99)
He substitutes these into the integrand f and into the </>p, obtaining the new
functions
</>p
By this means he has reduced the new problem to that of solving the
problem
= 0,
Xo
a<I> _ A. a<I> = 0
( s=p + 1, ... ,n,
)
ays dx ay;
<I> = X + AIXI + ... + AqXq.
22Hahn [1904), p. 152.
(6.100)
287
aUh
dx au;'
aYi
dx oY;
OUh
ays
dx oY;
ay~
(s = p + 1. ... , n).
Mayer also has, as a consequence of the side-conditions l/;",(x, YI' ... , Yp)
= 0, that
0Ys
acp =
dx ay;
f (0Y.aF _ ~dx aF + f
M al/;a) aYK
+ aF _ ~ aF + ~
ol/;",
oy~
I'
ov
/s
dx av'
/s
I '" '"
LJ aILa
I
aYK
ays
all/s '
where ILl' ... , J.Lp are arbitrary multipliers. But by the assumption that the
determinant (6.98) 'i= 0 "these p multipliers can be so determined that in the
last formula the coefficient of each ayJays vanishes." This reduces the
situation to equations (6.96), (6.97) together with
air
_ ~
air
~
d a'
UYi
x 'Yi
=0
( l. = "
1 2 ... ,n.
)
lLpl/;p'
288
(6.101)
X2
XI
M=O
Recall that for the case where the integrand is j(x, y, y/), the first
variation is expressible as
6I =
(XI[
Jxo
'dj 1J
'dy
Now if d/y,f dx exists and is integrable, then the first variation is expressible
as
6I =
L~I[ /y -
e! /y,
]1J dx ,
for all 1J such that 1J(xo) = 1J(x l ) = 0 and for which 1J' has some additional
property such as continuity on [xo,x l ]. The form of d/y,jdx is such that
e! /y'
and thus it was usual to assume that y" existed and was continuous.
However, it is perfectly possible to express the first variation in the form
(6.102)
provided that /y' and /y are integrable. This form, due to du Bois-Raymond,
is applicable even wheny" does not necessarily exist.24
Let us examine the proofs due to Stegmann and du Bois-Raymond. The
former's argument is not sound but is easily alterable into a correct form.
He chooses 1J so that
1J(x) =
p" P.
(x - XO)"(Xl - x)"
M(x)
289
(6.101) reduces to
(6.103)
in which "P represents a linear function of c5y and its p. - I derivatives with
respect to x." He chooses for c5y the function
where he has assumed that f is analytic and that its sign changes at the
places a l ,a2' .... Then the expression outside the integral sign in (6.103)
vanishes, and he argues that the function f must now vanish also. His
method is not free of objections and requires more differentiability on the
minimizing arc than is needed.
The standard proofs of the fundamental lemma are in a paper by P. du
Bois-Raymond. 29 This paper is of some interest, and I review parts of it
below. However, he does devote a considerable amount of space to results
on (Riemann) integrability and rectifiability, which now hardly seem germane. I have omitted these and start with Section 8 of his paper.
Here du Bois-Raymond establishes the fundamental lemma in several
25To correct Stegmann's argument, suppose that M;l.O and that M(x')
Then in a small interval x' - 8, x' + 8, the function M O. Choose
* 0 for
Xo
290
L dXA(X)F'(x)
XI
Xo
= 0
(6.104)
for all A(X) of class c(n) on [xo,xt1 which vanish at Xo and xI' i.e., all such
functions which, along with their first n derivatives, are continuous. (In
what follows he assumes F' is bounded and Riemann integrable.) He
chooses for A the function defined in terms of two arbitrary values a, p
such that Xo < a < p < XI; he sets
A(X)=O forxo";;x<aandp<x";;xl'
F'm J:A(x)dx= 0
with ~ some value between a and p. Thus F'(O = 0 and by simple
continuity arguments F'(x) = 0 at every point x at which F' is continuous
since a and p are arbitrary.
In Section 10 du Bois-Raymond considers condition (6.104) as holding
for all A which vanish at a and b and possess continuous derivatives of all
orders. To do this, he rewrites his condition (6.104) as
LbdXA(X)f(x) = O.
He divides up the interval a ... b into five parts a ... a ... a' ...
p' ... p ... b, and he seeks to determine A so that on a";; x ..;; a,
p..;; x..;; b it vanishes and is unity on a''';; x ..;; p'. On the remaining
intervals a < x < a', p' < x < p the function A varies smoothly between 0
and 1. He sets
A(X) = ~ {arctanp(x) +
I },
a + a' )I/(X-axa'-x)
p(x)= ( x- - 2 -
(The reader can satisfy himself that this function A has the desired differentiability properties; in Figures 6.1 and 6.2 he can see the function and its
limiting position as a', p' approach a, p.)
With the help of this function A, he finds that
0= L bdxA(X)f(x)
a ti,'
{f ~
Figure 6.1
291
Figure 6.2
becomes
Ja
Ja
J P'
and notes the monotonic property of 1 - A(X). Then in the limit as a' ~ a,
P'~P
LPf(x)dx= 0
for a, p arbitrary. He then concludes that ffdx vanishes on every subinterval of a,b.
On pp. 307-309 and 312-313 du Bois-Raymond, in effect, replaces his
original form of the fundamental lemma by the one that appears in (6.102).
He considers the lemma as an isoperimetric problem in the form
rXldxA'(x)f(x) = 0
Jxo
for all By such that
LA'(x)dx= O.
XI
Xo
I U" + i dx U' By = 0,
b
I V" + i dx V'By= O.
b
292
but small subintervals ao ... f3o, a, ... f3, in each of which 8y is a constant, 8yo and 8y,. Then the first integral becomes
8yo
Jao
al
8yo Uo + 8y, U;
since f30 - ao = f3, - a" where Uo, U; are values of U' somewhere on the
intervals ao, f30 and a" f3,. In the same way
8yo Vo + 8y, V; = O.
From these it follows that UoV; - U; Vo = O. Now du Bois-Raymond fixes
ao = f3o, a, = f3, and allows x = a, = f3, to be variable. (This is, in effect,
John Bernoulli's uniformity principle.) Then
Uo/ Yo'
U'+(1V'=O,
8J dx V = 0,
8J dx W = 0
= 0,
"where the (1, and (12 depend only on the fixed points Xo and x" ... .',3'
du Bois-Raymond next takes up his own proof. To do this, he considers
the relations
o=J8Udx =Jdx { au
8y + au 8y' + ... },
ay
ay'
o= J 8V dx = J dx { ~; 8y + ~; 8y' + ... },
where again the second equation arises from the isoperimetric condition.
He now needs what he calls unbounded variations. (These are variations
which need not be of class c(n) as before.) Let 8,y,82y be two such
variations and consider 8y = 8,y + C8 2 y, where C is to be determined so
31
Recall the problem Euler had with this case; see p. 100 above.
293
0=
J
- J
0=
=
L fK(x,YI>'"
XI
'Y~I"" 'Y~)dx= IK
Xo
(Ie
= 1,2, ... , m)
and moreover which take on given values at Xo and XI, so that at the same
time the integral
V=
L
XI
Xo
32 Mayer
problem.
= .f.c dx,
<PK
== .f.c - u~
(1877). In his discussion Mayer calls attention to a paper by Lundstrom (1869) on this
294
dAK
dx '
0=--
= c + J.dx.
K
u~
h=n
afK
2: -a' Uh = V
h=! Yh
(6.105)
The VK are, along with the U;, variable quantities, but unlike the latter, they
do not appear in the expression for 2 W. They therefore can be chosen as
one wishes, and conditions (6.105) are thus always satisfied. 33 If the
determinant of a2F lay;' ay; =1= 0, the equations
aF =
aYh
aF
dx ay;'
33See, e.g., Pascal VR, pp. 95-99 and also Scheefer [1885], p. 584.
295
where
He concludes that
IV. The Problem II is solvable with the help of the n differential
equations
in which
and the A signify undetermined constants. The complete solution of these
equations yields YI' ... ,Yn as functions of x, the m isoperimetric constants
AI, ... ,Am and 2n integration constants al, ... , a2n provided that the
second derivatives y\" ... ,Y:; cannot be eliminated. If these 2n + m constants are determined by the m isoperimetric and the 2n end-conditions, it
follows . .. that the functions YI' ... ,Yn so determined afford the integral V
a true relative maximum or minimum provided that the homogeneous function
of the second order
h=n;=n
2W= ~ ~
h=1 ;=1
a2F
"If'lf' UhU
~h ~I
always has the same sign on the interval of integration and provided moreover
that the upper limit [of this interval] x I lies between Xo and the next root
after Xo of the equation
~(xxo) = ~
aYI
aYn aylO
aYno aVI
... - - ... ...
aal
aan aan+ I
aa2n aAI
VIC
VIC
lX kdx.
Xo
296
V=
L fdx=
XI
Xo
M~,
(a)
and
XI
Xo
fdx= I,
(In
In problem (a) it is V that is to be minimized, and in (P) it is VI' He
assumes the given end-points, end-conditions, and isoperimetric conditions
to be the same, i.e., the Xo, XI' I, Ik' and sets
ILk
Ak = - ,
ILF =. ILf + ILlfl + ... + p.",fm = ell.
IL
Note that the determinant A(xxo) only changes by a constant factor if the
ai' ... , a2n,AI' ... ,~ are replaced by 2n + m independent functions of
these constants.
The solution of the problem (a) of Mayer can now be found by
integrating the differential equations
oell = A- oell
0Yh
dx 0Yh'
(6.106)
The extremals are functions of 2n constants of integration ai' ... , a2n and
the ratios of the m + 1 isoperimetric constants IL, ILl' , p.",. These are to
be fixed by the 2n end-conditions and the m isoperimetric conditions
The resulting arc is then a true maximum or minimum for problem (a) so
long as the upper end-point XI is less than the next root after Xo of the
equation
0YI
0Yn 0YIO
0Yno OVI OV2
oVm
A(xx)=L+- ... - - - ... - - - ... - =
0
a
0
- oal
oan oan+ 1
oa 2n op. OP.I
op.",
provided that, moreover, the second-order homogeneous function
1 h=n ;=n
o2ell
L L ~UhUi
P. h=1 ;=1 Yn Yi
2Wa=-
is replaced by
LXI fdx=
Xo
297
I.
Mayer now supposes that the maximal or the minimal value in problem
/C. Then problem ((3) with 1= /C has the same
Euler-Lagrange equations (6.106) as (a) and has
XI
XI
XI
fdx= /C,
f, dx= /C" ,
fm dx = /Cm
L
Xo
Xo
Xo
aa,
+,
where YK and vK are functions of the same ai' JLK as in (a). The difference is
that v, has been replaced by v, i.e., by
v= (X fdx .
Jxo
Jxo
Now let anyone of the constants a" ... , a2n , JL, JL" ... , JLm be designated
by c; then
av
av,
aVm LXI h=n( a<I> aYh
a<I> ay ;,)
JL-+JL,-++JLm-=
dx~ - - + - , ac
ac
ac
Xo
h=' aYh ac
aYh ac
and hence
av =
ac
_..!!:.! ~
JL ac
_ 1- {k~m JL
JL
~2W
= ~2W
JL,
a
A,
a'
298
V= lX'fdx=
Xo
M~,
lX'fldx= 1(0""
(X'fmdx= 1m
Jxo
Xo
aF
aF
aYh - dx ay;'
in which
F==-f+AI/1 +A2h+'"
+Amfm
M~,
lX'fdx= I,
lX'hdx= 12 ,
Xo
Xo
lX'fmdx= 1m,
Xo
provided that the end-conditions are the same for both problems.
If, further,
V=I
in problem a) is a true maximum or minimum of the integral V, then at the
same time in problem f3)
VI
II
Mayer closes the paper with the classical example: "To find the curve of
given length and given end-points, whose center of gravity is the lowest."
Let the z axis be vertical, and let the xy-plane pass through the initial
point on the curve. Then the problem is this:
(Xlz_.jI
Jxo
Thus F
He sets
and has
Jxo
299
The five constants al,a2,a 3 ,a4,A are now to be determined by the endconditions and the given arc-length Ip Mayer fixes the sign of a 2 to be the
same as that of (a~ + ai)I/2 so that dVI/ dx is positive.
It is easy to see that
2W =
Z+AI
( ~I
+ y,2 + z,2 )
{U I2 + U22 + (z'u I
y 'u2) 2} ,
and therefore it has the same sign as (a~ + a~)1/2. For a minimum, he
therefore has to choose the positive value for this square root. This
corresponds to choosing for the extremal arc in question the one whose
convex side hangs down. Then since Xo = 0, it follows that AI > 0.
Now Mayer's determinant is easy to calculate, and his determinantal
equation becomes
O(y - Yo)
o(y - Yo)
oa l
o(z - zo)
oa l
oV I
oa l
oa2
o(z - zo)
o(z - zo)
oa2
oa4
OVI
OVI
oa2
oa4
=0.
___
4 =~,
a2
and has
Z -
VI
fa~ + ai (q -
Zo =
yar + ai (p - Po),
qo)'
where
This equation has only one real root 8 = 0, i.e., x = xo, and therefore
imposes no restriction on the second end-point x I' He thus has the result
that "Among all curves of given length and end-points the catenary has always
the lowest center of gravity." Moreover, since AI > 0, he also has the
reciprocal result that "Among all curves with given end-points, whose centers
of gravity lie in one and the same horizontal plane, the catenary always has
the shortest length."
300
<PI
(m
< n)
,y~)dx
(m
<
n).
(6.107)
Mayer points out that Problem A is a special case of I. He does this by,
in effect, defining Yl = f and fixing YI at X = Xo' Then the problem
becomes one of finding an arc YI'Y2,'" 'Yn for which YI(X 1) is an
extremum. Conversely, however, not every problem I is reducible to the
form A because of the fixed end-conditions. 34
Let us look at Mayer's 1878 treatment of the problem. He finds for a
minimum that 8YI(x 1) = 8YlI must be 0 and that 8)11 must have one sign.
He also notes that the usual accessory side-conditions
ocp
oCP}
.L
0 k 8Yi + 0 ~ 8y; = 0
1=1
YI
YI
i=n {
(6.108)
34However, the problems are equivalent if one does not insist on fixed end-points because in
this case
V=
L y;dx
XI
Xo
can be chosen as the integral to be minimized, and the problem of Mayer becomes one of
Lagrange.
301
must be satisfied by the variations BYI' BY2, ... ,BYn' and he defines a
function n:
(6.109)
and he imposes the condition on the A that
lR-.!LlR=O
aYI
dx
ay;
Since BY2, ... , BYm vanish for x = Xo and x = XI and BYI also vanishes for
x = xo, relation (6.109) becomes, on integration with respect to x,
an]
[ at
YI
BYII = -
LXI dXL
i=n( an
~xo, = 2
y,
an) BYi'
ddX at
y,
This gives him an expression for BYII in terms of the quantities BY2, ... , BYn
provided that [anjay;] =1= 0 at x = XI' He concludes, with the help of this
expression and (6.109), that the equations
_.!L
an = 0
aan
dx aYi'
Yi
( 1=
. I" 2
...
)
,n,
(6.110)
a2n
ay; ay;
R=
a2n
ay;
a<P1
ay;
ay~
a<Pm
ay;
a2n
ay; ay~
a<P1
ay;
a<Pm
ay;
a2n
a<P1
a<Pm
ay~ ay~
ay~
ay~
a<P1
ay~
a<Pm
ay~
is not identically zero. Note that equations (6.108) do not contain the A, but
that equation (6.109) contains the functions A, A' linearly and homogeneously. Hence one integration constant-Mayer takes it to be a2n -is an
302
acp
acp)
.~ ~8)i+ a ~8); +
,=1
Y.
Y.
28 2q,k =0,
where the 28 2q,k are homogeneous functions of the seond order in Zi = 8Yi
and z; = 8y;. Multiplying these equations by the quantitites Ak and adding,
he finds the relation
i=1 a'i
a'i
With the help of the Euler equations (6.110) this relation becomes
(6.111)
since 8)2' ... ,8)n all vanish at x = Xo and XI and 8)1 vanishes at
= Xo sinceh, ... ,y,; have fixed end-values.
For an extremum, it is necessary that 8Yl1 = 0 and that 8)11 must be
negative or positive for all ZI,Z2' .. , Zn which satisfy the accessory sideconditions
X
(6.112)
which are continuous on the interval from Xo to XI' and for which
ZI = 8YI must satisfy Zl1 = O. As a
consequence of these conditions (6.112), the expression (6.111) for 8)11
becomes
1
(XI
8)11 = - [(an/ay;) Jxo 2n 2 dx,
i-n {
2n2 = 28 2n + 2 ~ ILk ~
k=1
i=1
aq,
aq,}
a k Zi + a k z;
Yi
Yi
aILk
(6.113)
303
c5Yll
= -
rX1dx h~n(
[(an/ayn]1 Jxo
aZh
h=1
dx
aZh
Mayer now writes the complete solution of equations (6.108) and (6.110)
as functions of 2n constants a l ,a2' . , a2n and sets
Zi
ay'
h=2n-1
2:
h= I
Ilk
Yh-',
aah
Zi'
h=2n
ax
2: Yh _k
.
h= I
aah
the system
This implies that the second variation c5Yll will vanish if the 2n - 1
constants YI'Y2"'" Y2n-1 can be so determined that the Zi defined above
vanish at any two values xo,x; on the interval [xo,xd without vanishing
identically.
Mayer observes that the Zi satisfy the equation
i-n
~ { an Zi + a~ Z;} = 0
i= I
aYi
aYi
an
2:
PZi= O.
i= I ~i
d
d
i=n
(6.114)
x
He concludes this with a result relating the second variations c5fi to his
determinant A(x,xo) (in his form of A notice that YI does not appear for
x = xI):
~(X,XI)
11
aYI
aal
all
aa2
aYI
aa2n-1
all
all
aa2
aa2n-1
= aYn
aYn
aa2
aYn
aa2n-1
aYI
alll
aal
all I
all I
aa2
aa2n-1
aYnl
aYnl
aa2
aYnl
aa2n-1
aal
XI,
XI'
304
ao ay;
L
-,
;=1 ay; aa
;=n
= const.= bh ,
aay,~ A(x,x)
= [ aa~
] A(x),x).
y, )
h=n ;=n
a20
[(ao/aYi')J ) h=1
L ;=1
L -Yh'aa
Y;,UhU;,
2W= -
;-n
L
;=1
aq,k
pUi=O.
Y;
i~)
a02(u PrP)
u; a( duf / dx)
a0 2(u ar a )
} _
u; a( dut / dx)
- O.
... =2n-1
""
L.J
... =1
"i...
Yi
a'
a...
in which
g = A)c/
Under the assumption that the problem is in general solvable the complete
integration of these equations produces Yl'
,YnoA\o ... ,Am as functions
of x and of 2n arbitrary constants. Of these constants, however, those that
enter into the solutions Y always reduce to 2n - 1 constants aJ>
a2, ... , a2n-); the 2n-th constant a2n enters only as a common factor of all
0
3S Recall
305
the solutions A. The constants al,a2, ... , a2n-1 are then to be so determined
that YI' Yl, ... ,Yn for x = Xo and Yl' ... ,Yn for x = XI have pre-assigned
values. If such a system, satisfying the end-conditions, has been found and if
for XI such a value has been chosen that the expression ilD/ilYI for x = XI is
not null, then for the corresponding solutions Yb Yl, ... ,Yn a true maximum
or minimum for the problem is obtained provided that Xo lies between XI and
the next smaller root of the end equation
l1(xxl) = 0;
assuming, moreover, that the homogeneous function of the second order
2W= -
between these limits [i.e., for all x on [xo, x 111 is always negative or always
positive for all n arbitrary arguments that satisfy the conditions
i-n ilD
~ pU;=O.
'Y;
On the contrary neither a maximum nor a minimum will occur if the last
hypothesis is not fulfilled, and the same is valid in general also if Xo is less
then or equal to the given root [the first root of l1(xXI) = 0 below xI1.
i-I
In his 1895 paper Mayer takes up the task of establishing the multiplier
rule for his general problem of Mayer. 36 The procedure is not very different
from that discussed above, but perhaps it is worthwhile to give a quick
sketch of the method.
The problem posed here is to find among all continuous functions
Yo, YI' ... ,Yn of the independent variable x, which satisfy identically r + I
given differential equations
k = 0, 1, ... , r < n,
and which are such that Yo, ... 'Yn take on given values for x = Xo and
YI"" ,Yn given values for x = XI' one which renders Yo at XI a maximum
or minimum. Mayer assumes that the determinant
~ cf>OYocf>IYl cf>ry;
"* O.
(6.115)
o
and be such that 8yo, ... , 8Yn vanish at X = Xo and 8YI' ... , 8Yn vanish at
X = X I 37 Mayer then rewrites these equations with the help of multipliers Ak
36Mayer (1895). There is also a very different proof in a paper by Turksma (1896), which is a
part of his doctoral thesis.
37Mayer's notation in this paper is not very good. For instance, he writes the equations above
in the form ~i ('I>Icy; 8y; + 'l>lcyj 8yj) = 0, which makes it difficult to understand precisely what
is meant, and he writes determinant (6.115) as ~ 'l>oyO</>'\y; .. </>;y;.
306
in the form
(6.116)
dAk4>ky~ )
~ k Ak4>kyp -
dx
=0
(p
= 0, 1, ... , r)
(6.117)
+ 1 linear,
(J = 0, 1, ... , r).
~p8Yp~kAk'4>ky'= - ~ T8YT~kAk'4>ky;
o
0
p
r+ I
0
= Xo'
[ r
~p8ypl ~kAk'4>ky~
]
x=x,
= -
LXI
Xo
r+ I
dx ~ T8YT~k Ak'4>kYT -
= XI'
. . . 1''''+''''
", L.J -
""
't"oy(,'f'ly',
... 't"ry;
"" =1=0,
it follows that those equations can be solved for the 8y pI in the form
8Ypl
= -
LXI
Xo
n
r
(
dp-f4>ky' )
dx ~ T8YT~k P-f4>kYT - ~ ,
r+ l O X
where
(p,k=O,I, ... ,r)
307
Waj
==
XI
Xo
dx
r+l
Xo
and
XI
dJLkcJ>ky; )
-d- = 0
X
(p
He sets
8YT = ZT + L"a"uT"
1
U:
W zo + L "a" W~.= 0,
(6.118')
Wi
+ L "a" W:.= O.
1
fl,
UT"
(6.119')
In this case equation (6.118') is a consequence of (6.119') and the Z are not
restricted. For each set Z,+ I' . . . , Zn' a system of constants ai' .. , an can
be found satisfying relations (6.119'). Equations
Wo==
o1W2+
...
z
JJ
z
+ Po'Wzr
- JL2 + LpfiPJLk=="k
1
Since the
ZT
t k AkcJ>kyp r
dAkcJ>kY~ )
dx
= 0
(p = 0, 1, ... , r),
(6.120)
u:
308
Ao,AI' ... , A,. of the r + 1 equations (6.l20) which also satisfy the n - r
equations
('1' = r + 1, ... , n)."
If ar does vanish for such
this requires that
tk
r
then the
UTa,
(6.l20')
dP.&cI>ky' )
~ =0.
P.Nky, -
Mayer ([1895], p. 136) next considers the case when ar = 0 and there is a
p > 1 and ..;; r such that all subdeterminants of the pth degree vanish but
that one subdeterminant of (p - l)st degree, say,
I
2
p 1
ap-I ="'+W
-LJ u Wu W 2
uTa.
Up-I
I
2
UT,UT ,
p-I
U.,.
'1'
Xo
= r + 1, ... , n
and
XI
for which
ap _ 1 "* 0, chooses
aP ='"
+ WIIW~' .. WP
LJu
up
a = 2:
p
pCp
W';p,
p _ I'
_ aap
"* 0)
cp = awpuP
are independent of the W';p (p
then sets
2: p cp p.&= 7Tk
and notes that
ap
is expressible as
ap =
XI
Xo
dx
d7Tkcl>ky' )
2: uf2: k 7Tkcl>ky, - ~
r+ l O X
T
Ao = 7To,
AI
Uf,ZT('T
= 7Ti ,
Ar = 7Tr
309
tk Ak<PkYi r
dAk<Pky; )
dx
=
(i = 0, I, ... , n)."
(6.121)
(i = 0, 1, ... , n).
(6.121')
dd aan, = aan
x Yi
Yi
He notes 38 that the system of differential equations (6.107) and (6.121') "is
then and only then of order 2n + 2 when the n + r + 2 equations
an
ay; = Vi'
determine the n
+ r + 2 unknowns
"
(6.122)
'\
}.
'\
"
= (Y;),
Ak
= (Ak)
= (Yi)'
dVi
dx
(an )
= aYi
(6.123)
between the Yo, YI' ... 'Yn' VI' ... , Vn and x and by the r + 1 finite equations
(6.123')
Clearly, the complete solution of system (6.123) furnishes the solutions Y of
(6.107) and (6.121') and then by proper substitution the A are found from
(6.123').
Mayer proceeds in Section 2 to the Hamilton-Jacobi theory for the
problem at hand. He defines H as a function of x, Yo, YI' ... 'Yn' va'
VI' ... , Vn through the equation
H
38Mayer [1895], p. 138. Note that the y' and A are now functions of x, y and v which he
expresses as (yj), (Ak).
310
(ay;an) _
=
Vi'
or equivalently
(an)
aH _
aYi =
- aYi '
aVi
(6.124')
'
Va, VI' ,
vn Mayer then
Va
..l ( dVh +
Va
dx
dVa )
Ph dx '
The quantity
Va
Thus the solutions Y of the differential equations do not contain more than
2n + 1 constants. However, these equations can be replaced by the Hamil-
311
0Yo 0Yo
OYo)
'Yn' OYI 'OY2 , ... , 0Yn .
Mayer now says "Then the system (6.125) is equivalent to the partial
differential equation
?o = F(X' Yo,
uX
YI' ...
'Yn'
~Yo , lIY2
~Yo , ... , llYn
~Yo).
lIYI
(6.126)
If
(6.127)
is a complete solution of it, then the 2n
+ I equations
(6.128)
containing the 2n + I arbitrary constants a, ai' ... , an' /31' ... , /3n' form
the complete integral of the system (6.125)."
What he means by this is very elegant and is derived from Jacobi's work
on the Hamilton-Jacobi equation. 39 Let us follow Jacobi's analysis to
derive Mayer's result, modifying notations as needed. Suppose that the Yo
in (6.127) satisfies (6.126). Jacobi notes that
dYo
dx
= 0 Yo +
0 Yo . dYk
k=! 0Yk dx
of
of
ax
=F+
k=!
Pk dYk
dx
~-
dx
k=! 0Yk
n
'~Yk+ -
oYo
(dYk
x
'~Yo
aF )
lIPk
n
k=!
dYk
X
+ ~ -d +~ ~Pk+ ~ Pk~-d .
k=!
~Yo
8Yo=
the value
ayo
ayo
k=! lIYk
2:n
ua
ar: (
T
(6.129)
ay
o
2:
-~-8ak
k=! uak
2:n
Pk8Yk+
8a +
/3k 13ak ,
k=!
a
k=!
as can be seen with the help of the equations (6.128). It follows inter alia
=
39Mayer refers to Jacobi, NACH, p. 291; this is a profound paper on the Hamilton-Jacobi
theory in Jacobi's Nachlass and is well worth examining.
312
that '"
d~Yo
-d
x
dPk
k= I
= ~ -d ~Yk+
L.J
k= I
d~Yk
Pk
dX
da yo/aa (
d
~a
X
+ ~ f3k~ak
k= I
(6.129')
By equating the two expressions (6.129) and (6.129'), he finds the resulting
relation
+ ~
aF ayo
-a
-aYo
a
k=1
dayo/aa)(
d
~a
x
+ ~
k=1
f3k ~ak .
(6.129")
Jacobi now argues that the ~Ph' ~Yh and Ll == ~a + ~ f3k~ak are 2n + 1
quantities determined by the 2n + 1 "arbitrary and independent parameters" ~a,~al"'" ~an,~f31' ... ,~f3n.40 He asserts that since the ~a,
~al' ... , ~an' ~f3I' ... , ~f3n are independent of each other, so are ~Ph' ~Yh' Ll.
Their coefficients in (6.129") then vanish, and he has relations (6.125):
dYh = _ aF
dx
aPh '
aF .
ayo
aYo
aa
dPh = aF
dx
aYh
+ Ph
aF
ayo '
_ d(a yo/aa) = o.
dx
'
n
aF
L
Pk-
k=1
aPk
(In closing this discussion Jacobi remarks that the equation in ayo/aa also
holds with a replaced by ah')
Mayer now returns to the form of F and shows, with the help of (6.122),
(6.124'), the equations vhf Vo = -Ph' and H = voF, that F can be directly
expressed as
n
Yo - ~hPhY;"
provided that the n + r + 1 unknowns Yo, y;, ... ,y~; Ao: AI : ... : \
F=
determined by the n
+ r + 1 equations
8Ph
k=l
aPh
aPh
n
aPh
-a-dak+ -a da + L -a
fJ dPk'
ak
a
k=1 I-'k
are
313
and their values substituted into F. He then closes with the following rule
which also occurs in an earlier paper (Mayer [1878], p. 20):
Solve the n
+ r + 1 equations
ao + ao ayo = 0
ay;'
aYa aYh
'
<Pk
=0
(h
,y~;
ayo _, n ayo ,
- -yo- ~h-Yh
ax
1 aYh
so that it becomes a partial differential equation of first order between the
unknown function Yo and the n + 1 independent variables x, YI' ... ,Yn'
Given some complete solution
an)
of this partial differential equation, the n + 1 unknowns Yo, YI' ... ,Yn found
by solving the n
. ,
+ 1 equations
with 2n + 1 arbitrary constants a, aI, ... , an> /31> , /3n' immediately furnish the complete solutions Y of the differential equations [(6.107), (6.121')]
where the ratios of the multipliers A can be determined by a simple quadrature of some of the equations [(6.121')].
IOsgood [190 I "], Hedrick [1902]. There are also interesting results stemming from Hilbert's
ideas in theses of his students. See, e.g. Bliss, LEe, p. 287 for several references.
21t is amusing to read Hedrick's not wholly unjustified criticism of Kneser's work. (Hedrick
[1902], p. 24). He says, "In conclusion the author desires to enter protest against the extreme
complication recently introduced in some quarters into the essentially simple subject of the
315
where
[ F= F(p,y;x), Fp
aF(p, y:x) ]
ap
.
L
b
{Ayx - B } dx,
where A and B are independent of Yx. If this integral is to be pathindependent, i.e., have the same value for all curves through the same
end-points, then its first variation must vanish and hence
ax + aB
ay = 0.
aA
aFp
ax +
a(p~
- F)
ay
= O.
(7.1)
calculus of variations. In the case of the only modern text-book [Kneser, Variationsrechnung] ... , this condition is so exaggerated as to essentially mar the usefulness of the book, in
that many who would otherwise interest themselves in the subject are repelled by style and
treatment." Osgood ([1901"], p. J05n) says of Kneser's book, ''This is a work of high scientific
merit; in point of style, however it is not all that could be desired."
316
must be satisfied. Hilbert now remarks that this last equation is closely
related to the Euler differential equation
d~
dx' - ~ = O.
(7.2)
Yx
= p(x,y).
(7.3)
317
318
hypotheses. This was followed by Bliss's and Bolza's proofs. Osgood and
Bolza first show that under certain conditions a field can be constructed in
the neighborhood of each point on the extremal in question and then go on
to demonstrate the result globally. Osgood thus says "the lower limit ... ,
as (x', y') describes C [the given extremal], will be a positive quantity, as
can be shown at once by a well-known method of reasoning in higher
analysis." Bliss's proof is quite different, as we shall see below. 6
Osgood uses the definition of a field given above in Section 5.13
involving a family tp(x, y) of extremals. He wishes to show that there is a
neighborhood S of the given curve C such that through each point of it
there passes one and only one of the family of extremals, tp(x, y), and that
tpix, y) =1= 0 throughout S. From this he can conclude that the extremal
family tp(x, y) exactly fills a region in which C is embedded.
He first shows the result "not for the neighborhood S about C but for
the neighborhood of an arbitrary point (x,', y') of C." To do this, he notes
that since tpy(x', Yo) =1= 0, there is a neighborhood Ix - x'i < h', Iy - Yol < Ie'
about (x', Yo) such that tpix, y) =1= 0 in that neighborhood by continuity. He
then can solve the equation y = tp(x, y) for y as a function of x, y in the
neighborhood Ix - x'i < h', Iy - y'l < h'. Osgood's proof of this follows
from an implicit-function theorem of Oini's which was given in lectures at
Pisa and appears in the first of Oini's two volumes, Analisi Infinitesimale,
1877-1878, p. 163. (The printed version is in Peano-Genocchi [1884],
Section 110; Jordan, COURS, 2nd ed., Vol. 1, Paris, 1893, Sections 91, 92;
Osgood, LEHR, pp. 47-57; or Bliss, PRINCE, pp. 7-21. An historical
survey of implicit-function theory can be found in Osgood, Encyclopiidie
der Mathematischen Wissenschaft, II.B.1, Section 44 and footnote 30.)
Oini's theorem as stated by Osgood [1901"] is this:
Let ~(x, y, y) be a continuous function of the three independent variables
(x, y, y) throughout the neighborhood D of the point (x', y', Yo) and let
~(x',y',Yo) =
0;
then for every point (x, y) of the neighborhood d of (x', y') the equation
~(x,y,y)=O
has one and only one root y that lies in the neighborhood of the point y
and thus defines y as a single valued function of (x, y):
= Yo,
y = I/J(x, y).
319
implicit functions:
By the use of this theorem with <P = cp - y, Osgood now finds his desired
neighborhood. He then makes the remark mentioned earlier about a
"well-known method of reasoning in higher analysis" to extend his result.
The extension of Dini's theorem to arcs from p,oints was first carried out
in detail by Bolza [1906], p. 247 or VOR, pp. 160-163. His theorem
concerns a transformationYi = j;(XI' ... , XII) (i = 1,2, ... ,n) between two
regions of n-space and is this:
(A) The functions j;(Xb ... , XII) are of class C' in some region A.
(B) C is a bounded and closed point set in the interior of A.
(C) The transformation [above] always carries two different points (x'),
(x") of C into two different points (y'), (y").
(D) The functional determinant
a(ft, ,f,,)
Xl,, Xn
Notice that (P)c means the neighborhood of the curve C given by the
relation IY(x) - y(x)1 < p for Xo < X < X\,7
Bliss's proof of the existence of a field is quite different, depending on a
theorem of Picard's on solutions of second-order differential equations, and
not on solutions of implicit equations. He does not presuppose the existence
of a minimizing arc joining the given end-points and then show that it can
be embedded in a family of extremals. Instead, he shows that there is a
minimizing arc joining the end-points, if they are close enough together.
Picard, in his Traite d'Ana/yse, ([1891], Vol. III, pp. 94-100), has shown that
the equation
320
equation 1/ p
"
G(x,y,x ,y)
Fl
- x"y')/(x,2
+ y,2)3/2 and
Fx'y - Fxy'
(X,2
+ y,2)
3/2'
Fl
xy
x'
In Section 7.9 below we shall see, by way of contrast, how Hilbert and
his student Noble attacked frontally the problem of showing under quite
general hypotheses that a minimizing arc actually exists. 9
9In a remarkable paper Hilbert (1899), as in his lectures in 1900, outlined his method for
showing the existence of a minimizing arc joining two given points; and Noble ([1901),
Sections 5-14) in his dissertation carried Hilbert's results somewhat further. (However,
Noble's paper lacks rigor in some places. Bolza says inter alia, "In particular the developments
in 9, IO and 13 are imperfect.") As Hedrick ([1902), p. 240) says: "It was the original
purpose of Hilbert's proof (merely) to show that, in this most favorable case of a regular
problem, a curve must exist which renders our integral an unlimited strong minimum,
compared with all continuous comparison curves; and that the minimizing curve is composed
of a finite number of pieces of extremals. Hilbert's existence theorem may therefore properly
be called a theorem in differential equations."
Hilbert ([1899), p. 184) said "Each problem of the calculus of variations has a solution as
soon as suitable assumptions regarding the nature of the given end-conditions are fulfilled,
and, if need be, the notion of a solution has undergone an obvious generalization." We say
more on Hilbert's existence theory below.
321
XI
< XI'
X2
< X2
CPa(X,aO)+O on [XI,X2]'
one can choose a positive constant k so small that the family of arcs
Y = cp(x,a),
XI " X " X2'
la - aol " k
forms a field embedding the arc Eo."
Bolza says, in accord with Schwarz and Kneser, that a family (7.4) of
extremals joining two curves xl(a),x2(a), a l " a " a2 [with x l,X2 continuous, with xl(a) < x2(a), and with cP,cp' of class C' for a l " a " a2' xl(a)
" X " x2(a)] forms a field of extremals about Eo when Eo is contained in
the family for some value a = ao between a l and a2 and when no two
curves of the family have any point in common. Bliss's definition of a field,
based directly on Hilbert's ideas-as we shall see in the next section-is, as
mentioned earlier, different in formulation. For him a field F is a region of
x, YI' ... 'Yn space with a set of slope-functions Pi(X, Y) (i = 1,2, ... , n)
such that
(1) the Pi are single-valued and have continuous partial derivatives in F;
(2) the points x, y, Pi(X, y) all lie in the domain of definition of the
integrand f(x, y, p);
(3) the Hilbert integral
/* = J[f(x,y,p)dx
+ (dYi - Pidx)h,(x,y,p)]
The proof for the existence of a field embedding a given arc in Bolza
VOR, pp. 100-103 is interesting. Since CPa is continuous and + 0, it cannot
change sign on [XIX2], and Bolza fixes the sign to be > 0; thus CPa (X, ao) > 0
on [Xlx2]' This implies that <Pa(X, a) > 0 for (x,a) on [x l,x2]' la - aol " k
provided that k is chosen sufficiently small. Bolza now designates by Sk the
map of this closed set from the xa-plane onto the xy-plane under transformation (7.4), and he chooses Pix3' Y3) to be any point in Sk' Then
Xl " X3 " X2' and there is a value a = a3 in la - aol " k such that Y3
= CP(X3' a3)' He goes on to show that there cannot be another value a3 + a3
in la - aol " k such that Y3 = CP(X3' a) = CP(X3' a3)' This is impossible since
CPa is a properly monotonic function of a. As a consequence, the curve
a = a3 is the unique member of family (7.4) passing through P3 in Sk'
322
of the line x = X3 lies in Sk and that Sk is the set of the xy-plane bounded
by the lines x = Xl and x = X2 and the two nonintersecting curvesy = <p(x,
ao - k), Y = <p(x, ao + k). He can then solve equation (7.4) for a(x, y) as a
function of class C'. (The continuity and the differentiability follow easily
from implicit-function theory.) He finally wishes to choose a neighborhood
(p) of Eo-the set of all points (x, Y) such that Iy(x) - YI < p for Xl ..; X
..; x2-which lies entirely in the interior of Sk; to do this, he remarks that
the two continuous functions
<p(x,ao
+ k) -
is cut by a curve defined on the family by a function x = ~(a) (a\ <; a <; a~
and if the family and the intersecting curve have continuity properties ... ,
then every region F of the xy-plane which is simply covered by the extremals
is a field with the slope function p(x, y) of the family, provided that the
derivative Ya(x,a) is different from zero at each set of values (x,a) corresponding to a point (x, y) in F.
323
for the problem; second, he showed the relation between his independence
theorem and the Hamilton-Jacobi theory and in the course of this gave the
modern definition for a field; third, he carried out his ideas on double
integral problems in the calculus of variations; and last, he gave a "general
rule for the handling of variational problems and the formation of a new
criterion." We shall examine briefly some of these points.
Hilbert in this paper and Kneser in LV, Sections 56 and 59, bridged a
fundamental gap in earlier proofs of the multiplier rule. They show, in
effect, that given a set of admissible variations l1i there is a family Yi(X, E) of
admissible comparison arcs satisfying the side- and end-conditions, containing the given arc Yi for E = 0, and having ayJx, E)/aEI<=o = l1i'
Hilbert also established in his summer lectures in 1899 what is now
called Hilbert's differentiability condition by a simple modification of du
Bois-Raymond's lemma (see Whittemore [1901], p. 132). He showed that a
minimizing arc y(x) must satisfy the Euler equation in integral form
fr' = gofr dx + c; and hence near every point (x, y, y') not a corner and
where fry =1= 0, the function y has continuous derivatives of order nwhen f
has continuous partial derivatives of order .;;; n near (x, y, y') (see Bliss,
LEe, p. 13). What Hilbert noted is that the second derivative y" exists for
all values of x when fr' is differentiable and fry =1= 0.
Hahn [1906], p. 254, showed that every rectifiable curve which has a
defined tangent at each point must satisfy the Euler differential equation if
it furnishes the integralJ a minimum. Hahn [1902] also modified a proof of
Kneser's and gave the differentiability condition of Hilbert. Out of this he
showed it is not necessary to assume the existence of a continuous second
derivative y" for a minimizing arc. Hahn [1904] gives a nice discussion of
the multiplier rule for isoperimetric problems.
Hilbert acknowledges both the prior work of Mayer [1895] and Kneser
L V, Sections 56-58, on the multiplier rule. He considers the case of three
dependent functions y,z,s of x and two conditions on these quantities of
the form
f(y',z',s',y,z,s;x)
0,
g(y',z',s', y,z,s; x)
o.
(7.5)
He supposes that y(x), z(x), sex) are three functions satisfying conditions
(7.5) which have
ay'
ay'
az'
=1=0
(7.6)
az'
for all x between x = a] and x = a2 Let Y(x), Z(x), Sex) be any three
other functions also satisfying (7.5), for which
324
differ only a little from y, z, s and their derivatives, and finally that
Y(a 2)
(7.7)
He then states his result: "if this minimal condition is satisfied, then there
necessarily exist two functions A(X), p.(x) which do not both vanish identically
for all x, and which together with the functions y(x), z(x), s(x) satisfy the
Lagrange differential equations
..!l...
78
( . )
..!l...
78'
( . )
dx
dx
..!l...
a2
a,
I .
"
for
(7.8")
= Z(X'I'2)'
0' 1,0'2
of x, which
(7.9)
+ 20'2(X)
I, 2
0(lY(a2'\'2)
0(lY(a2,p2)
+ mz(a2,\'2)] =
o\
+ mz(a2,\'2)] =
o2
0,
(7.11 )
0,
IOHilbert [1906], p. 352. It is not unreasonable to call (7.8), (7.8'), and (7.8") the EulerLagrange equations since both men contributed significantly.
325
where the subscript zero means that the relevant expressions are evaluated
for f) = f2 = o.
Consider now the linear, homogeneous differential equations (7.8) and
(7.8') in the functions ;\(x), p,(x) together with the end-conditions at x = a2
O(;\~;, p,g) = I,
O(;\~:, p,g) ]0 = m;
(7.12)
of [OY] of [OZ'] of
[ 0Y']
Of)
0 oy' + ~ 0 oy +
Of)
0 oz'
OZ] of
, of
of
0 oz + a) os' + a) os
+[ ~
[ 0Y']
Of)
= 0,
[0
og
Y] og [OZ'] og
oy' + ~ 0 oy + ~ 0 oz'
as
OZ] og
, og
og
0 oz + a) os' + a)
= 0,
+[ ~
of [OY] of [OZ'] of
[ 0Y']
Of2 0 oy' + Of2 0 oy +
Of2
0 oz'
+[
[ 0Y']
Of2
+[
OZ] of
, of
of _
Of2 0 OZ + 02 os' + 02 os - 0,
og
oy' +
[0 Y]
Of2
og
oy +
[OZ']
Of 2
, og
og
OZ] og
0 OZ + O 2 os' + O 2 oS
Of 2
og
oz'
o.
He multiplies the first equation by ;\, the second by p" adds the two, and
integrates the resulting relation between the limits x = a), x = a 2 ; he also
does the same with the third and fouth equations. This gives him the
relations
o(;\f + p,g) [ oZ
0Z
0fa
(a= 1,2).
+
0
326
[az] afa
0-
(a=1,2);
az]
afa 0 = 0
(a = 1,2).
Hilbert now applies the usual integration by parts-he calls this integration of a product-to equations (7.13), taking account of relations (7.8) and
(7.8'). This yields
Lo.02{ a(AJa'+
ILg),
a(Af + ILg) } _
aa+
as aa dx-O
s
(a = 1,2).
non-identically vanishing solution system A, IL of the differential equations ... [(7.8) and (7.8')], so that there results
(AIL, a l )
Hilbert now goes on to show that either for this pair A, IL, (AIL, a) = 0 for
all a or that there is another pair A', IL' of solutions of (7.8) and (7.8') for
which (A' IL', a) = 0 for all a. Suppose that for the functions A, IL there is a a3
for which
(7.14)
Then by what went before there is a nonidentically zero solution system
A', IL' of (7.8) and (7.8') for which
(N1L',a 3 )
= O.
(7.14')
(7.14")
(A"IL",ap)
(,8 = 3,4).
(7.14"')
Notice that A, IL;A', IL';A", IL" are solutions of a system of two homogeneous, linear differential equations of the first order. They must therefore
be linearly dependent; i.e., there must be constants a, a', a" not all zero
such that
aA + a'A' + a"A" = 0,
Now by relations (7.14), (7.14') and (7.14"') it would follow directly that
II Hilbert
[1906), p. 355.
327
a
a'
its minimal value among comparison curves through the same end-points.
He then poses the problem:
We consider now the integral
J* =
[F= F(p,q,y,z;x),
b {F
+ (y -
Fp =
p) Fp
+ (z' -
aF(p,q,y,z;x)
ap
,
q)Fq } dx
_ aF(p,q,y,z;x) ]
Fq -
aq
and ask how the functions p, q appearing therein are to be chosen so that the
value of this integral J* will be independent of the path of integration in
xyz-space, i.e., independent of the functions y(x), z(x).
328
y = 1f;(x,a),
= x(x,a);
(7.16)
WT ,
, aI + ayY'Y'J(x,y),X
ai, r
.)
F( Y'ax
-'(y,y,x),
_.m.
z'
= Ix + /yy',
q = Ix
+ /yp
cp(p,y;x)
14To see this point of Hilbert consider the first variation of the integral of 41 from a to b. It is
clearly expressible in terms of variations "I/(x), with the help of the Lagrange equations (7.15),
as
{41), . "1/
= O.
J:
= 0,
329
From this it follows that the invariance of the integral (7.17) implies the
integral
J* =
has the same value for all paths lying on T = 0 and joining points A and P.
Hilbert, moreover, has the relation
J(w)
= (
J(WT)
{F+(y'-p)~+(z'-q)Fq}dx+ (QFdx,
JP
(7.18)
J(W)
=(
J(WT)
{F+(y'-p)~(z'-q)Fq}dx+ (QFdx.
JP
(7.18')
The first integrals in the right-hand members of (7.18) and (7.18') have the
same value since both W T and wT lie on the surface T = 0 and join the same
points A and P. It therefore follows that the left-hand members of (7.18)
and (7.18') are equal, which establishes for Hilbert his independence
theorem for xyz-space.
Since the integral J* is independent of the path, he can write
J(x, y,z)
LX,y,Z {F + (y' -
moreover, he has
aJ_
ax - F - p~ - qFq'
(7.19)
Hilbert now notes that if p, q are eliminated between these equations, there
results the "Jacobi-Hamilton partial differential equation of the first order
for the function J."IS If the functions p, q of the field are so chosen that
az '
. aT . aT
F -pFp - qFq.Fp.Fq -- aT
ax . ay .
then on the surface T = 0 the integral J* will have the value 0 for all
paths. 16 In this case the function J(x, y, z) is clearly a solution of "the
Jacobi-Hamilton equation" which vanishes on T = O.
ISHilbert [1906], p. 36l.
16Let us set
F - pFp - qFq = Tx '
Fp = 'ry,
Fq = Tz
and solve the last two equations for p = P(x, y, z, TY' Tz } and q = Q(x, y, z, T , Tz }. If these
are substituted into the first of these equations, there results Tx + H(x, y, z,
Tz} = 0, the
familar Hamilton-Jacobi equation.
Y
f. ,
330
ab
aFb + (z , -
aFq }
p) ab
q) ab
dx.
Notice that y' = p, z' = q on each surface of the family and thus that the
integrands above vanish. It follows at once that
~~ = c,
~~ = d,
=0
(p = 1,2, ... , r)
(7.20)
which are solvable for r of the derivatives Y;, ... ,y~. Then among all
functions Yl' ... ,Yn of x which are continuous between the limits Xo and
17Bliss, LEC, p. 73.
18Hilbert [1906], pp. 362-370.
19B1iss says in his discussion of problems in parametric form (LEC, p. 125) that they are
families "whose slope functions form fields in every region F of y-space which they simply
cover." As we shall see, such families make the Hilbert integral independent of the path.
331
> x o, which pass through given points at Xo and x I' and which satisfy
equations (7.20), he seeks to find a set which renders the integral
XI
a maximum or a minimum.
The Lagrange differential equations are
d
an
dx aY; -
an
aYi
'
fp = 0,
(7.21 )
in which
Y;, ...
(7.23)
an
(7.24)
Mayer, in effect, now observers that every extremal Yi,A p' i.e., every set
Yi,Ap satisfying the differential equations (7.21) defines by means of equations (7.22) a solution Yi' Vi of equations (7.24). Conversely, every solution
Yi,Vi of (7.24) defines an extremal Yi' Ap with the help of the last r equations
(7.23).
He now writes In I, If I, Ifpl to mean the functions n, f, fp with the
expressions Y; = Pi' Ap = JL p of (7.23) substituted for Y;, Ap in those functions.
Thus, e.g., 1101 = 1o(X'YI' ,Yn'PI' ,Pn) and
r
Inl = If I + L pJLplfpl
I
He poses what he calls Problem II: "To determine the Pi' JL p as functions of
x, YI' ... 'Yn so that the expression
(7.25)
332
~ .p. alnl = av
Inl- ft
api
ax'
(7.26)
aav
' ... , aav)
'
YI
Yn
~/'1V,... , ~/'nV).
(7.23')
The first of equations (7.26) with these values for Pi' /L p then becomes the
Hamilton-Jacobi partial differential equation
~~ +H(X,YI,'"
'Yn'
(7.27)
Inl + ~h(Yh 1
alnl _
aB;
aBh
aYh
aYi
must be satisfied. It follows with the help of the relations
---=0
(7.28)
333
that
a{
aYi = aYi IQI-
aB
alQI
a/tp
a alQI
t hPh alQI}
aPh = aYi + t p aYi lipl- t hPh aYi aph
n
lipl = O.
a alQI
aYh api
a alQI
= aYi aPh '
"
aPk
Yk =
aPk
ax + ~hay
Ph'
Ap =
a/tp
n
I
a/tp
-a+ ~h-aPh
x
Yh
raJ.
lipl = 0 then
d
dx
aQ aQ
ay; = aYi '
i = O.
p
334
What he is in effect saying is that given a solution V of the HamiltottJacobi equation (7.27), one finds an n-parameter family of extremals by
integrating the system of first order differential equations
y; =
when these solutions y;(x, a), ... , an) are substituted into the equations for
the JLp
y;
V;
(7.30)
y;
, ... ,
V;
, ... ,
~~) =
335
= a,
the Yi
(7.31 )
i x{n:t hVh aH
aVh - H } dx,
av
aak
... ,
an)
Ja
aak aVh
aYh aak
'
but
and consequently the sum under the integral sign above becomes
n {_
aYh
aVh aYh }
Lh
Vh
axaa
+
ax
aak
I
k
Moreover for x
= ax
n _ aYh
LhVh
aak .
I
_ ayh]X n (_ aYh
aA aah )
n _ aYh
aA
[~
~hVhaa = Lh Vhaa - aa aa = LhVhaa - aa
I
ka
I
k
h k
I
k
k
It follows from this directly that
(7.32)
Mayer now assumes that the equations Yi
solved for the ai and writes their solutions as
(7.33)
He writes (V) = W(x, YI' ... ,Yn) to mean V[x, al(x, Y), ... , an(x, y).
Recall that for x = a, V= A(a,a l , ... , an) and hence that
aw = k(
aYi
336
aaW
'Y;
(v;).
(7.34)
dV
dx
= aw + h aw aYh = aw + h V ail .
ax
a'Yh ax
ax
aVh
aw aw
(- aw
aw)_
ax + H = ax + H x, 'YI' ... ,'Yn' a'Yl , ... , aYn = O.
He replaces the a; by a;(x'YI'" . ,Yn) and has
~~ +H(X,YI,'"
'Yn'
Cj?;(XICI, ,
Ap = 8ix, CI,
C2n),
CI, ,
C2n)
aa~
] = aa Aa; ,
Y; x=a
av = aa
aA [n _ ail -]
aa
- ~hVh av -H
I
n _ aYh
x-a
[n _ ayh ]
+ ~hVhaa - ~hVhaa
I
x=a
but for x
337
a he has
and consequently
~~ = ~~
+H(a,al, ...
,an,:~
, ... ,
:~)+*hVh~'
and thus
aa~ = ~~
::1"'"
~~).
W= A(a'YI>'" ,Yn)'
Mayer sums up his analysis in his Theorem VI. He observes that "After
a complete integration of the [Euler-Lagrange] differential equations . .. [(7.21)] the theorem V permits one in general to obtain every system
of solutions of those differential equations which with some system of solutions
of Problem II, satisfy the relations
In his last section Mayer repeats in essence a part of Hilbert's paper
[1906] which appeared in the same year in the Gottinger Nachrichten. What
Mayer showed is that the total variation I:lI of the integral I is expressible
as
I:lI =
X1
Xo
Edx,
338
where E is given by
He did this with the help of the invariant integral just as Hilbert did for the
simplest problem.
21
correctly: "Kneser has given a proof by means of power series on the assumption that all
functions considered are analytic. This assumption is unfortunate, inasmuch as it introduces
restrictions without being accompanied by any simplification."
22See, e.g., Bolza, VOR, pp. 333. The theorem is in Gauss, DIS.
339
(7.35)
gives then the length of any arc ~, u = u( t), v = v( t) on the surface, joining
two points A and B with A corresponding to to and B to t \. The value of J
when evaluated on the geodesic through A and B is j = u\ - Uo and
consequently
M
=j
- J
(II Uu,2
)10
Figure 7.1
23Darboux, TDS, Vol. III, p. 88.
340
_ _- - - - : : : :........ 6
Figure 7.2
+ I(D46)
holds for every position of the point 4 preceding the point 6 on D.24
Lx, j(x,y,y')dx
X2
341
Jt9
dx.
Zermelo considered the extremal arc A B in Figure 7.1 and the comparison
arc ACB made up of the extremal arc AC neighboring to AB and the arc
CB of the envelope. By the Weierstrass theorem, he has
l(ACB) - l(AB) =
t9(x, y, p,P)dx
Y -lP(x,a) =
lPx(x,ao)(Y - y)
+x
0,
(7.36)
- x = 0
(7.36')
for a near ao. He assumes that the functional determinant of these equations with respect to y and x
1
!lPAx,a
o)
-lPx1(x,a)!
is different from zero for x = x, a = ao. There is then the desired correspondence between x, y and x, y. In fact (x, y) is the point at which the
normal at (x, y) to ~ cuts the curve (7.36). The x, yare regular functions of
a. Kneser then forms the variations
where the expressions in brackets are power series in (a - ao) whose lowest
terms are of the first order. He indicates by (x2' h) and (X3' Y3) the points
on ~' corresponding to (X2' Y2) and (X3' Y3) on ~; and he writes
1 23
= (3 j (x,y,p)dx,
JX2
J23 =
CX3 j(x,y,p)dX,
JX2
342
Kneser calculates the first variation of his integral and determines that
6ivf(X,Y,P)dx= i t t -
- (J - p/p)x=u 6u + [I, 6y
J:.
Along an extremal the integrand in the right-hand member of this expression vanishes, and it is easy to evaluate IlJ = 123 - J 23 . He finds without
difficulty that
123 -
where, e.g., (DB) is the value of J evaluated along the extremal arc DB.
Kneser then is able to conclude that
[a l - aoh.
343
This means directly that (AB) = (AC) + (CB) so that the integral J has
the same value when evaluated on the extremal joining A and B as it does
along "countless other" integration paths deviating arbitrarily little from
this path, i.e., along the extremal A C plus the arc CB of the envelope.
"Therefore the integral J along the arc AB is surely neither a maximum nor
a minimum." The points A and B are then conjugate points. Kneser goes
on to comment that the points C and B can, in general, be joined by an
extremal arc if they are close enough together; he designates by (CB) this
value of J. Then surely (CB) > (CB) since CB is an arc of the envelope and
cannot satisfy the Euler equation; (The reason for this is that the unique
extremal with values x, y, y' at point B is the extremal through A and B
itself; i.e., by the theory of differential equations, there is exactly one
solution of the second-order Euler equation with the given common values
x, y, y' of the extremal and the envelope at B.) It results that
(AB) >(AC)
+ (CB),
He goes on to say that this removes the uncertainty left by the results of
Mayer and Scheeffer.28
To discuss the existence of an envelope Kneser ([1898], pp. 36ff) starts
with a two-parameter family of extremalsy = g(x,a,b) containing a given
arc <ir for a = ao, b = boo By simple differen~iation with respect both to a
27Kneser, LV, pp. 89-97.
28In Erdmann [1877], p. 327, we find that when the second variation 8 2J = 0, then
8 3J = - R(x')'PaAx', aO)'Poa(x', ao),
hy'
second variation vanishes in the situations mentioned in Kneser's result. (Recall that !lJ
= e8J + (e 2 j2)8 2J + (e 3 j6)8 3J + ....)
344
(7.37)
cp(X, a) - cp(x,ao) = 0
would hold for sufficiently small values of X - XI and a - ao. But the
left-hand member may be written as
= (a -
xdl)
xdd,
and this expression is not zero for Ix - xII and la - aol small enough.
Kneser [1898] concludes (p. 39): "When therefore the curves [y = cp(x, a)]
have points in common with ~ for arbitrarily small values of la - aol,
which accumulate without bound in the neighborhood of at least one point
(XI' Yl)' one has for this [point] the equation
CPa(xl,ao) = O.
(7.38)
The accumulation point is different from (xo' Yo) since in the vicinity of
this [point] each curve ... [y = cp(x, a)] has no second point in common
with ~, as one can easily see."
Kneser now considers identity (7.37) and writes
CPa - ga
db.
gb da '
db
0= ga(xo,a,b) + gb(xo,a,b) da .
It follows from the equation (7.38) that
0= gaCxpao,bo) + gb(xl,ao,bo)(
~b)
ua
0=
a=ao
I'
(7.39)
345
gb(xI,ao,bo)
ga(xo,ao,bo)
b) = &(xI,ao,bo)
(
gb xo,ao, 0
He also has
so that
gxb(xl,ao,bo)
gb (xo, ao, bo)
and thus that
gxb(X1,ao, bo)
gb(xl,ao,bo)
Now the right-hand member of this equation is different from zero since by
assumption the equations y = g(x,a,b), p = gx(x,a,b) are such that at
x= Xo
ao = gx( x o, ao, bo)
Yo = bo = g( xo, ao, bo),
b = g(xo,a,b),
a = gx(xo,a,b).
and & are different from zero. Kneser then concludes that qlxa(x 1, ao) =1= o.
He also remarks that the right-hand member of (7.39) is the quantity Mayer
designated by a(xO,x l ).29
He concludes this discussion by observing that the relations
qla(xl,ao) = 0,
qlxa(xl,ao) =1= 0
xI)qlax(xl,ao) + (a - ao)qlaa(xl,ao)
:i9 Kneser has, in effect, shown that .:1(xo, x) is not identically zero and that its first zero
the point conjugate to Xo on the arc Cii.
XI
is
346
furthermore, he sets
1]
= cr(~,a) = cr(xl,ao) + [a -
aoJI = YI
+ [a -
aoJI
and notes that the point (~, 1]) traces out a curve, which may in a special
case reduce to the point (XI' YI)' This curve is clearly the envelope.
J=
II
F(x,y,x',y')dt=
10
IXI f(x,y,p)dx,
Xo
where
" ) -X
- 'F( x,y,_
+ 1,_y
+ '/ X') -X
'f( x,y,p.
) 30
F( x,y,x,y
(a<3),
and says "the most important special case is that in which Xo and Yo are
fixed but between XI and YI there is a given relation." In this case the
end-conditions become g(x l , YI) = 0, and he has the so-called transversality condition (a = 1 in this case)
Fx ,8x + ~,8YII = f
ag
ag_
o.
He now says that two directions (8x,8y) and (dx, pdx) with pdx
transversal in case
Fx ,8x + ~,8y
= (f -
pf,,) 8x
= dy
are
+ f" 8y = O.
30 Note that F is positively homogeneous, i.e., F(x, y, ax', al') = aF(x, y, x', l') for a
(Kneser, LV, p. 7).
>0
347
He then states that an extremal 01 joining fixed point 0 with point I on the
curve g = 0 can render the integral J an extremum only when the curve is
cut transversally by the extremal at point 1. (Osgood [1901], p. 112, on the
other hand, uses a different terminology; he says the extremal is cut
transversally by the curve.)
Now consider (LV, p. 43) a one-parameter family of extremals
x
= ~(t,a),
y=1)(t,a)
(7.40)
with T < t < T, la - aol < y, with X,2 + y,2 =~? + 1)t2 =1= 0, and with F
regular at every value (x, y, x', y') determined by (7.40); furthermore,
suppose that for a = ao the resulting extremal L\: defined for T < t < T is
regular and does not intersect itself. Thus along an arc )8 of the extremal L\:
different values of t correspond to different points, and there is therefore "a
uniquely defined system (t, ao)" corresponding to each point (x, y). Finally,
suppose that
a(~, 1)
~= a(t,a) =~t1)a-~a1)t=l=O
T < t < T, la - aol < y.
Kneser now says that the region of x, y space determined by the family
(7.40) with T < t < T and la - aol < y is a field about the arc )8 and that
the curves of family (7.40) are the extremals of the field. He considers a
regular curve L\:o inside the field whose equation is g(x o, Yo) = O. Through
each point (xo, Yo) on it there passes a unique extremal with Xo = ~(to, a),
Yo = 1) (to' a), and g[~(to, a), 1)(to, a)] = O. He then considers the function of t
and a defined by
for
u =JtF(t1),~t,1)t)dt,
to
= ~, y = 1) is an extremal of the
348
au =
aa
o
F1 dt
da + FAa + E'y'l1a
= FAa + E'y'l1all -
Fx{
II
~ + ~, ~: ) -
~: )
Dx
= ~ + ~, dt
d: 1da,
dtolo
Dy = 11a +11, da da,
(7.41)
Suppose that FA~, 11, ~" 11,), E'y,(~, 11, ~" 11,) do not both vanish. Since in
Kneser's fields regular functions of t and a are also regular functions of x
and y, the relation at t = to,
FA~'11,~,'11,)Dx
+ E'y{~,11,~,,11,)Dy =
0,
(7.42)
du =
FA~,dt
= Fx'dx
+ E'y,dy,
du
= Fx,dx + E'y,dy = 0,
define another set of curves <it I at each point in the field with the same
properties as those of the <ito, i.e., they also cut transversally all extremals of
31 Kneser,
LV, p. 46. Note that lito is thetcurve described by the end-point to.
349
Figure 7.3
the field as in Figure 7.3. (In this figure of Kneser's the straight lines 01 are
the extremals of the field.) He concludes that the value of J,JOI' evaluated
along the extremals 01 of the field between the transversal curves is a
constant. 32 It is also very easy to see the converse proposition. He has thus
made an elegant generalization of Gauss's result on geodesics: two transversals to the same set of extremals intercept arcs on these extremals along
which the integral J is a constant. (Recall that for geodesics, the notion of
transversality is the same as the notion of orthogonality.)
It is worth noticing that the envelope property is entirely analogous to
the elegant property of the evolute which is sometimes called the string
property. In Figure 7.4 we see the evolute D of a curve C-the locus of
centers of curvature of curve C is called its evolute. It has the property that
the end of a taut string wound around the evolute and then unrolled will
generate the given curve. At end 3 of the string E the string is orthogonal to
C. The length of the arc made up of E34 and D 46 equals that of E56 for each
position of point 4.
The results of Kneser are easy to demonstrate with the help of a scheme
used by Bliss but certainly derived from Kneser (LV, p. 96). Bliss supposes
that the problem is that of minimizing the integral
X2
f( X, y, y') dx
XI
in a class of arcs Jommg the points (XI' YI) and (X2' Y2) in the plane;
consider a variable arc E with end-points 3 and 4, which describe two given
Figure 7.4
32Kneser, LV, p. 48. This follows at once since the value of J01 is u, which is a constant.
350
l(t)=
X4(t)
X3( I)
f{x,y[x,a(t)],y'[x,a(t)]}dx,
finding
I'(t)
JX3
JyYa
+f
,_
Jy'Ya -
d(/Y'Ya)
dx
and
1 ,(t)
dX
[f
tit + da
dt '/Y'Ya J43'
It is clear that
and consequently
1* = f[Jdx
+ (dy - y'dx)/y.]
o
c
Figure 7.5
33Bliss, LEe, p. 20.
351
Figure 7.6
~=I~I1/1 1/~Ia
Figure 7.7
352
vanishes after 0 is called by him the focal point of the curve ~o on the
extremal ~. He notes that when ~o degenerates into a point the notions of
focal and conjugate points coincide. With the help of the quantity
= ~Ll=.=~=a=~~? + TJ}
+ M w' + N w = 0
that 34
Fx'x'
Fx'y'
~'y'
* O.
(7.43)
X1
with F regular, F > 0, ~'y' > 0 in a region R of the x, y-plane, and y'
arbitrary (Hilbert calls such problems regular); he seeks a minimum among
curves in R joining two fixed and nonintersecting curves D and E.
I. C must be an extremal, in other words, a solution of the differential
equation
d
Fy - dx Fy' = O.
II. C must cut both D and E transversally, i.e., in its intersection points
1 and 2 with the curves D and E the condition must hold,
+ (y' - y')E'y' = 0
where y, y' refer always to the curve C, and y, y' to D or E.
F
is not difficult to show that ~ satisfies Jacobi's differential equation, which may be written
in the form
34It
d(F dU)
=0
dt
'
F 2 u- -
dt
1-
where F1 is given above and F2 is a related function of t (see Bolza, VOR, pp. 224ff).
[1904], p. 74.
35 Bliss
353
If C is a curve joining the two given curves D and E, and satisfying the
necessary conditions I and II, then a third necessary condition for a minimum
of the integral J is
III. The critical point d of D must not lie between the point 1 . .. and the
critical point e of E.36
= G(u,v,du,dv) = g(u,v,s)du, ~~ = s;
= xuu' + xvv',
u' = uxx' + uyy',
x'
since Dx
= xuDu +
= a = const, s = O.
xvDv, Dy
354
Figure 7.8
that Fx,Dx + ~,Dy = 0 by (7.42) when Dx, Dy are calculated along the
transversals, u = const. From this it is clear that
Gu,Du
+ Gv,Dv =
+ gsDv = 0;
(g - sgs)Du
in this the derivatives Gu" Gv ' are understood to be evaluated along the
extremals of the field, v = const. But for s = 0, Du = 0, and the last
equation implies that gs(u, v, 0) = O. Kneser then has for a Taylor's expansion of g the following
g(u,v,s) = 1 +
S2
101 < 1.
(7.44)
Now he supposes (Figure 7.8) that his field contains the extremal (;
through the points 0, 1,2 with 0 the point where (; cuts the given curve (;0
transversally-he seeks to minimize the integral 1 among arcs joining (;0 to
2. He also draws an arbitrary curve x = X(T),y = yeT) through the point 2
cutting (;0 in the point 3. Then
_Jr F
T2
132 -
T3
Moreover, since u
dx dy ) _ T2 (
du dv)
x,y, dT 'dr dr- J 3 G u,v, dr 'dr dr.
T2
dr
12
U 0
> 0,
U,
v,
~~ , ~~ ) - ~~ ] dr,
"2
du
gss(u,v,Os) dr .
355
= FI(x,J" - yv x ,)2,
---'-,
u
= FI(X,J'
,,2
- yvx)
= FI(X,J'u
- xuYv)
2 ,2
U
(Note that the expression xuYv - x,J'u 0.) From this Kneser can now infer
that J32 - J02 > 0 provided that A 0, F 0, and FI(x, y, cos y, sin y) > 0
along the extremal (i$; for every value of y.
In the same issue of the Transactions of the American Mathematical
Society, Bolza [1901] and Osgood [1901] give proofs of a suffiency theorem
for a strong minimum using the Weierstrass E-function. Bolza's proof is
very simple and depends on recognizing the fact that the E-function is
invariant under the transformation x = X(u,v),y = Y(u,v). He calculates
without difficulty that
A= a(t'l/) *0 for a = ao
aU,a)
and that
for every y. He can infer from this that in a sufficiently small neighborhood
RK in t,a-space to < t < t l , la - aol < Ie and hence in a neighborhood SK of
x, y-space he has A 0, FI(x, y, cos y, sin y) > 0, and a one-to-one correspondence between RK and SK.
BoIza can then embed the given extremal C in the field SK: He considers
AI = Ic(AB) - Ic(AB) =
T\
TO
where x', y' are the values of the slope functions at the point x, y of the
field and x', Y' are the values of x', y' for C. [In his 1882 lectures
Weierstrass argues that AI > 0 since
356
ii'
B" : I--~~:::""---------;b------=::::::~B~,,----I
,== ..
L-------------------------------~.=n.-.
Figure 7.9
III =
=
LE'(u, v;
'I
So
L(1 - cosw)G,(u,v,cosO*,sinO*)ds.
SI
So
In these expressions the integrals are taken along the image C' in the u,v
plane of curve C; w is the angle between the positive tangent at u, v to C'
and the positive u-axis; and s is arc-length measured along C'. It results
that there is a constant m such that for every u, v in R" and every A,
D2F, = G,(u,v, cos A, sin A) ~ m > 0; as a consequence,
III
L
SI
So
to,
where I is the length of the curve C' and D = a(X, y)/a(u, v).
To finish the proof, BoIza now supposes that C is not wholly within Sh
and therefore passes through some point P of one of the extremals defined
by a = ao h. Then C' passes through a point P' whose ordinate is
v = ao h as in Figure 7.9. Then I is not less than the length of the broken
line A' P' B'. He now chooses Q' on the same line v = const. as P' so that
A'Q' = B'Q' and has A' P' B' ~ A'Q' B'. It now follows that
.11>
2m[ h' + ( I, ;
357
!:::.I
provided that it is assumed that F[g, 1/, gl' 1/1] > 0 in the region. 39
to
x = x(t),
y = y(t),
z = z(t)
with
(K
> 0).
Fx'x'
Fy'x'
Fz'x'
x'
Fx'y' Fx'z'
Fy'y' ~'z'
Fz'y' Fz'z'
z'
y'
x'
y' = _ F](X,2 + y,2 + Z,2) 2,
z'
0
and they then choose arc-length as their parameter. If F] =1= 0 for all values
of its arguments, there is one and only one extremal through each point
x o, Yo, Zo in a given direction x' 0' y' 0' z' 0' These extremals are representable
in the form
x = 'P( S, xo, Yo, Zo, x~, y~, z~),
yo,zo,x~, y~,z~),
39For the details, see Bolza, LEe, p. 191. In fact, the discussion there and on pp. 187-199 is
very good and complete.
358
where X,2
x.(O,xo,Yo,zo,xo,yo,zo)
xo,
zoo
The functions (Ji, 0/, X, (Jis' o/s' x. are of class C' for all values of their arguments, which define points in the fundamental region R of definition.
Mason and Bliss first show that the determinant
x = (Ji(s,u,v),
Y = o/(s,u,v),
and the associated determinant
a(s,u,v)
X(s, u,v)
(7.45)
as au a v
(Jis (Jiu (Jiv
O/S
x.
0/" O/V
Xu
Xv
359
Figure 7.10
a.
au a v
CPs CPu CPv
tPs tPu tPv
a(s,u,v) = 0,
cp(s,u,v) = x,
tP(s,u,v) = y
s = s(x, y),
u = u(x, y),
v = v(x,y)
near to X2' 12. If these values are substituted into z = X( x, u, v), a surface
z = z(x, y) is found as in their figure (Figure 7.10). This can be shown to
be the envelope of the two-parameter family (7.45).
They next consider the envelope theorem and the Jacobi condition. By a
direct calculation they show the following envelope theorem41 :
Suppose that an extremal COl is contained in a one-parameter family of
extremals, each of which passes through a fixed point 0 and which have an
enveloping curve touching COl at a point 2 [see Figure 7.10]. Then if 3 is a
neighboring point to 2 on d, the value of J taken along C03 plus the value of J
taken along d32 is always equal to the value of J taken along CO2 In other
words
When the extremals are all cut transversally by a surface S, the same
equation is true if 0 is understood to denote the variable intersection of C03
with S.
They show without difficulty that the envelope d cannot satisfy Euler's
equation, and thus that points 2 and 3 can be joined by a curve d32 such
41
360
that
After this Mason and Bliss establish the Legendre and Weierstrass
conditions without complication and then state and prove sufficient conditions for strong and weak minima.
Hahn [1911] undertook to finish the investigation begun by Mason and
Bliss on problems with variable end-point problems. In the course of his
analysis Hahn proved a theorem of basic importance for establishing
sufficient conditions. This result has been called by Bliss the theorem of
Hahn. Hahn considers the Lagrange problem of making
Jf(x'YI""
,Yn,Y;,
,y~)dx
He assumes that he is dealing with a normal arc so that there are multipliers AI' ... , Am such that
Fy, -~F.=O
dx y,
(.1= 12
, ,
where
F= f+
... ,
n ),
2: AkCPk'
k=1
~;Y; CPky;
CPIy;
where Y?'
(x o, yO, w~.
361
Iy; -
and all values of YI' ... ,y~. (The equality can hold only for Y; = Y;
(i = 1,2, ... , n).); second, that if A < a < b < B, the arc of the extremal
(xoo, Yoo, w~ on a .;;; x .;;; b has on it no point conjugate to a.
In terms of these hypotheses, Hahn's theorem (p. 129) is this:
There are three positive numbers 13, , 1/ such that for
Ix o - xool .;;; 13, IYiO - Yiool .;;; 13, Iw? - wiool .;;; 13
(i
= 1,2, ... , n)
the arc of the extremal (x o, yO, w~ on a - .;;; X .;;; b + furnishes a minimum relative to every admissible comparison curve which has the same
initial- and end-points and remains entirely in the 1/-neighborhood of the
extremal (xoo, Yoo, w~ on (a - ,b + ).
(By an "1/-neighborhood of an arc Yi(X) on a .;;; x .;;; p," Hahn means
the set of all points x, Yl' ... 'Yn such that a .;;; x .;;; p, IYl - Yi(x)1 .;;; 1/
and by a, the Mayer determinant (6.71).)
He shows directly that la(x, xti)1 =I=- 0 on a .;;; x .;;; a + ~, where xti is some
point on a - ~ < xti < a and consequently that it has a positive lower
bound on that interval. Also a(x,a) =I=- 0 for a < x .;;; b since the given
extremal has on it no point conjugate to a, and so la(x, a)1 has a positive
lower bound on a + ~ .;;; x .;;; b. Notice that a(x, x o) is a continuous function of x o; by means of this fact Hahn infers that la(x,x6)1 ~ some positive
bound for a + ~ .;;; x .;;; b and that xti can, in fact, be so chosen that
la(x, xti)1 also has this property on a .;;; x .;;; b.42
Hahn now argues that the determinant
(i,j = 1,2, ... , n)
lyO - yO*I';;;
H,
Iwo - wO*I';;;
H.
*
Yi - Yi ( X,XO'Yl""
,Yn'w 1 , , wn0) -- 0
42Perhaps the easiest way to see Hahn's result here is to notice that if an arc on a .;; x .;; b has
on it no point conjugate to a, there is always a point a' to the left of a such that the arc
defined on this later interval has on it no point conjugate to a' (see Bliss, LEe, p. 30). With the
help of this remark, one can always find a family of conjugate solutions whose determinant is
nonzero on a .;; x .;; b and hence, by continuity, has a nonzero lower bound.
362
,y2)
(i = 1,2, ... , n)
serve Hahn to define a Mayer field which embeds the given extremal and
in which the E-function is positive. Hahn's theorem then follows directly.
Hahn uses that result to prove a general sufficiency theorem:
Let the proposed variational problem be to make the integral ... a
minimum subject to the side-conditions . .. among all sufficiently near arcs
whose initial-and end-point coordinates satisfy any conditions whatsoever. Let
the arc of the extremal
Yi = Yi(X),
Wi = wi(x)
(where A < a <. b < B) furnish a weak minimum to our variational problem. If the E-function is positive definite in the region characterized by the
inequalities . .. [A .;;; x .;;; B, IYi - Yi(x)1 .;;; p, Iy; - y;(x)1 .;;; p', IAk - Ak(X)1
.;;; p"], this extremal arc also provides a strong minimum for our variational
problem.
363
b( d
J(y)=llx
)2 dx
(7.46)
K(y) =
lbAy2
dx
= 1,
(7.47)
= dy2 + AA (x)y = 0,
A(x)
dx
subject to the boundary conditions
yea)
= 0,
y(b)
> 0,
= o.
(7.48)
(7.48')
Ao - J(Yh) =
lb y,,/J,
dx,
rbyJh dx = O.
h=oo a
44Mason [1904], pp. 528-544; Cairns [1907]; and Richardson [1910], [1911].
364
Vh
(h = 1,2,3, ... , )
(7.50)
LbYhgh dx = 0.
45
h= co a
He makes an indirect proof. Suppose that infinite subsequences Y;', v;' could
be picked out of Yh' Vh so that either
L
b
y;'g;' dx ~
or";
for all h, where in the first case p > 0 and in the second case p < 0; and p
is, in both cases, independent of h. [The superscript' does not here signify
the derivative; II., g;' are defined with the help of (7.49) and (7.49').] He
then could form TJh = Y;' + cv;' with c a constant, and write
d4r,h + I\.CY'1jh
). .d = Ji'h + cgh'
,
dx
From these relations it would follow, as before, that
--2
Lb
a
(vhJj, - y;'g;') dx
Y
= J(b(
v;' (ddx 2;' + AoAy;' ) a
y;'
2
(d
dxv;'2 + AnA v;' )) dx = 0,
L
b
yhih dx
+ 1)2.
45Mason [1904), pp. 534-536. Repeated indices here do not mean summations.
> c2B;
and
365
Since the integral in the right-hand members of these relations has limit
zero and 8 2 is independent of h, he could find a value h = H so that
> O.
XoK('T/H) - J('T/H)
It is clear from this that 'T/H would not be identically zero, and so he could
y(x)
l'T/H(x)
a
would be twice differentiable, vanish at a and b, and K(y) = 1. As a
consequence, he would have >"0> J(y), which is a contradiction of the
definition of >"0' and thus
=
dX2
+ AoAVh = AYh
vh(a)
= 0,
(h
= 1,2,3, ... ),
vh(b)
=0
could be solved for the Vh' It remairis for him to demonstrate that these Vh
satisfy the conditions (7.50), which he does by an appeal to Fredholm's
theory of integral equations. The Vh would satisfy an integral equation of
the form
dy / dX2 =
0 on [a,b]:
(b-~)(x-a)
b-a
(b -
x)(~
- a)
b-a
366
would have
ILb VhAYhdxl
where m = (a + b)(l + /l(b - a/2 since IG(x,~)1 ,.:;; (a + b)/2. The argument is then completed by use of Schwarz's inequality, from which it would
follow that
which is contradiction of the fact that K(Yh) = I for every h. The lower
bound Au is then an eigenvalue, and there is a Yo =fo 0 such that
dyo
dx 2 + AoAyo = 0,
Yo(a) = 0,
Yo(b) = 0,
K(yo) = 1.
Mason remarks that for this Yo he has J(yo) = Au, and Yo is the solution
of the variational problem, and Ao is the smallest eigenvalue. He then goes
on to show the existence of a second eigenvalue and eigensolution by
considering the new variational problem
J(y) =
L~
b
dx,
Fredholm's theory requires in this case that D(~) must vanish. Mason goes
to show that this is impossible and concludes that A( > Ao. He summarizes
his results in the following theorem:
There exists an infinite sequence of increasing eigenvalues and their
corresponding solutions Yi of the differential equation dy / dx 2 + M (x)y = 0
under the boundary conditions y(a) = 0, y(b) = 0: The junction Yi is the
46Mason [1904), pp. 537-539.
367
(He also briefly takes up the case in which A is periodic with period b - a.)
In his last section Mason generalizes his results to the case of a system
LI(y,z) =
d~
d 2z
+ A22(X)Z) = f2(X),
dx
+ A(A21(X)Y
dx
subject to the boundary conditions
L 2(y,z) =
y(a) =
-2
y(b) =
C I,
C2 ,
(7.51)
He knows from Hilbert's results that when k ;;. 0 on [0, I], there are an
infinite number of positive eigenvalues AI' A2' . .. and corresponding eigenfunctions UI' U2, ....47 Let K(x,g) be k(x) km times the Green's
function for the self-adjoint differential expression
d(pu')
L( u) = -----;IX
+ quo
47Richardson notes (p. 281) that when k .;; 0, the isoperimetric condition needs to be changed
to JJku 2 dx = - J. He also notes that if k changes sign, there are infinitely many positive and
infinitely many negative eigenvalues.
368
Then
k(x)u(x) = Afo\K(X,~U(~d~.
fkUldx=O,
the integral
D(u) = f(pu,2 - qu 2)dx
has a smallest value
~n;
u = aUI(x,~),
VI =
(7.52)
a2foXk(x)u?dx.
This requires that for no value of x > on the interval 0, I can the two
corresponding equations of the family ... [(7.52)] formed by the variation of the parameters ~ and a:
UI(X'~I) 6a
[alfoXk(X)U?dX] 6a
+ al
au(x'~I)
a~
6~ = 0,
+ [ ar foxk(x)ul
aa~1
dX]
6~ = 0,
(7.53)
369
If there is a point X
(7.54)
Theorem 2.
(pu'(x'
and at
+ qu(x) + Ak(x)u(x) = 0
L
Q
'k(x)u 2(x)dx>
then
o.
= -[puu'J:'+
2}
dx
a'[pu,2_ qu2]dx>0
+ qu + Ak(x)u =
(p(x)
> 0, q(x)
..;; 0),
which satisfy the initial conditions u(O) = 0, u*(O) = 0 and correspond to the
parameter values A, A* respectively; if A* > A > 0, then the second,
third, . .. zero of u* precedes the second, third, ... zero of u.
Richardson says that for his purposes it suffices to prove this last result
when A* = A + t: with t: > 0 arbitrarily small. He considers the equations
+ qu + Aku = 0,
(pu*')' + qu* + [A + t: Jku* = 0,
(pu')'
= t:foakuu* dx.
370
E[ foaku2 dx + E' ]"where E' is an infinitesimal quantity of the same order as E." 48 But by
Richardson's Theorem 2 above, the integral in this last expression is
positive and hence u'(a)u*(a) > O. If now a = a l is the first zero of u, then
u'(a l ) < 0 and consequently u*(a l ) < O. But he assumes that at x = 0 his
solutions have u = 0 and u' > 0, and so u*(x) > 0 near to x = O. This
implies at once that there is a zero of u*' between x = 0 and x = al' If
a = a2 is the second zero of u, then u'(a2) > 0 and consequently u*(a2) > O.
Since u*(a l ) < 0, there must be a second root of u* between a l and a2' The
remainder of the argument is similar.
Richardson now arrays the zeros a l > 0, a2,a3' ... of UI on the positive
real axis. He has ui(al) < 0, ui(a2) > 0, ui(a3) < 0, .... Moreover, for
x = ai' he finds that the function DI of (7.54) is expressible as
In his fifth section Richardson considers the Jacobi condition for the
case of two conditions
In this case he shows "that the Jacobi condition requires the minimal
function U2 be a one-time oscillating function on the interval 0, I." Then in
his sixth section he takes up the general case. He finds 50 :
48Richardson [1910], p. 290.
49Richardson [1910], p. 291.
50 Richardson [1910], p. 300.
371
[p(x)
> 0, q(x)
.;;; 0, u(O)
= u(l) = 0]
folk(x)Un(x)u(x)dX= 0
oscillates exactly n times in the interval.
fo 1{(X)u 2(x)dx= 1,
fo1r(X)u2(X)dx= O.
372
(p. 49) that this is not necessarily true. Consider the problem of minimizing
the integral
J =JI xy,2dx
-I
among all suitable curves of class C' joining the point (-l,a) to (+ l,b)
when a b. It can be shown that the lower bound of the integral is zero.
To see this note that J ;;. 0 and that the curves
+b
b - a arctan x / t:
<
+I
- I
(X2 + t: 2)y,2dx=
t:(b-a)2
.
2 arctan 1/ t:
(This last expression goes to zero with t:.) There is however no curve with y'
continuous which gives the integral the value zero since this would require
the integrand xy,2 to vanish and this would mean that y' = 0; but y
= const. cannot satisfy the end-conditions. 51
Hilbert [1899] investigated this question of whether a given integral of
the calculus of variations actually attains its lower bound calculated over
the class of admissible arcs. He indicated how it is possible to restrict the
integrand F or the class of admissible arcs so that there is an a priori
assurance of the existence of a solution. Bolza52 notes that Hilbert "illustrates the gist of his method by the example of the shortest line upon a
surface and by Dirichlet's problem. In a subsequent course of lectures
(G6ttingen, summer, 1900) he gave the details of his method for the
shortest line on a surface and some indications ... concerning its extension
to the problem of minimizing the integral
J
Jxo
373
To do this, he calls I the lower bound of the lengths of all curves on the
surface, joining the points given above, and chooses a sequence of these
curves C I , C2 , C3 , whose lengths L I , L 2 , L 3 , approach the bound I.
He then marks off on C I from the first point P the length LI/2 and calls
that end-point p} 1/2); similarly, he marks off from P on C2 the length L2/2
and finds pi l / 2), etc. The points p}I/2),pi l / 2),pf/ 2), ... have an accumulation point p(l/2), which is again on the surface z = j(x,y). By
extension he can then produce p(l/4),p(3/4) as well as p(l/8),p(3/8),
p(5/8),p(7/8);p(I/16), .... Hilbert concludes that "all these points and
their accumulation points form on the surface z = j(x, y) a continuous
curve, which is the desired shortest line." 53
Hilbert remarks that the proof for this statement is easy to make if one
views the length of a curve as the limit of the lengths of inscribed polygons.
He then states his next problem:
II. To find a potential function z = f(x, y), which takes on preassigned
boundary values along a given boundary curve.
We will not pursue this further. The subject of existence theorems has
been carried forward steadily by Caratheodory, Tonelli, McShane, and
others.
374
=0
,y~)
(a
=0
minimum. 55
(y
= 1,2, ... , r)
(7.57)
a<Jla
ay; ,
at[;p
aYI'
m= p
+ q)
(7.56')
@O:Yi=Yi(t)
10<:.t<:./1
(Y
=0
S4Bliss later showed that this new problem which he called the problem of Bolza is actually
equivalent to the other two problems. (Bliss, LEe, pp. 189ff).
ss Although he does not make the point explicitly, the functions f and CPa are positively
homogeneous of dimension I in the y;, ... ,y~.
375
II aylO
aWg . aWg
aWg
' ... , aYno ' aYll ' ... , aYnl
+ I of the matrix
II
vanish identically while at least one of order R does not vanish identically
in that neighborhood. This implies that R ..; 2q + rand R ..; 2n. He explicitly does not permit Ao to be a place where all determinants of order R
vanish by supposing that at least one determinant of order R is different
from zero at Ao. In the contrary case, Bolza says that the end-points are
singular; whereas if R = 2q + r, he says that the end-conditions are reduced. If R < 2q + r, some of the WI' ,W2q+r can be expressed as
functions of the others. In this case the relations
WR+I
= iJR+/(w l , , w R ),
, W2q+r
= iJ2q + r(w p
, w R )
must hold identically at each place (YIO' ... 'Yno; Yll' ... ,Ynl)' In particular, when this place is A o, then
It follows "that every place (YIO' ... ,Yno; Yll' ... ,Ynl) m some given
neighborhood of A o, which satisfies the R equations
WI = 0, ... , W R = 0,
(7.58)
ao/fJ
aylO
ao/fJ
aYno
0
\
ao/fJ
aYll
ao/fJ
aYnl
is different from zero at the place Ao and R ~ 2q. He also concludes that at
least one determinant of order 2q of the matrix
ao/ fJ
aWl
ao/ fJ
aWl
at the place
W\ =
ao/fJ
aWR
1
ao/ fJ
aWR
376
o I
functions If; P' 1/J P have been expressed in terms of the independent functions
wI' . . . , w R He summarizes:
Theorem I. If we exclude the case of singular end-points, we are
permitted to assume without restriction of generality that the system ...
[(7.57), (7.56')] of initial conditions is reduced, i.e., that 2q + r .;;; 2n, and
that at least one determinant of order (2q + r) of the matrix
0
aljip
aljip
aylO
aYno
aljip
aYnl
aXy
aYnl
aXy
aylO
aXy
aYll
aXy
aYno
aYll
P = 1,2, ... , q;
aljip
y = 1,2,'... , r
is different from zero at the place Ao(Y 10' ... , Y nO; Y II' ... , Y nl)'
There is then no system of 2q + r multipliers CI,c2,"" C2q+ro not all
zero, for which at the place Ao
aljip
aXy
2: cP-a- + 2:c2q+y-a- = 0,
P
YiO
Y
YiO
(7.59)
From here on Bolza assumes that the end-points are not singular and that
the initial conditions are reduced. 56
In his second section (p. 435) Bolza takes up the problem of finding the
Euler-Lagrange equations for this case. To do this, he first establishes an
embedding theorem which shows in effect that the given admissible arc ~o
is not an isolated point.
Let q + r
Lemma.
(J
(J
+ 1 systems of functions
(J
(J
= 1,2, ... , q + r + 1
be given, which are twice continuously differentiable on the interval [to, til
and satisfy the m = p + q differential equations
k = 1,2, ... ,m
56 Notice
(7.60)
that Bolza assumes in this theorem that the finite conditions are of the form (7.56').
377
where ...
(7.60')
one can always construct a (q
+ r + I)-parameter family
y; = ~l;(t'I'2"
of curves
(7.61)
. , q+r+I)'
... ,
= 1,2, ... , m
(i = 1,2, ... , n)
= 0, 2 = 0, ... , q+r+ I = O.
(J
+ LE.r1Jm+p(t)
(1
CJlk(YI' ... , Ym' IDm+ I' ... , IDn' Yi, ... ,y,:" ID:"+ I' ... , ID~)
(k=I,2, ... ,m)
for the quantities YI' ... ,Ym given the initial conditions
Yk It 0 = Y0 kO+ ""
LI Ea 1J(J k(tO)
(1
If the q + r + 1 quantities
Ea
(7.62)
are satisfied, then the relations (7.61) define an admissible variation of the
curve@o
57 What Bolza has done here is to modify his sets of side- and end-conditions to eliminate the
finite equations 1/Ip(YI' . ,Yn) = 0; he enlarges the set of conditions CPa = 0 (0: = 1,2, ... ,p)
by appending the additional new conditions
d
dt
a1/lp ,
cP +P = - 1/Ip = ~ P
; ay;
y., = 0
378
He now writes
U(I'2'' q+r+l) = (t1f(IDI' ... ' IDn,ID;, ... , ID~)dt
Jlo
and observes that at 1 = 0, ... , q+r+ I = 0 this function must be a minimum, subject to the side conditions (7.62). By the theory of extrema of
ordinary functions, there must then exist constant multipliers 10 ,
II' ... , Iq + r , not all zero, such that
10 V( 'I] ) + ~ Ip 'It p( 'I] )
"
0"
{J
+ ~ Iq + yXy( 'I]" ) = 0
( a = 1,2, ... , q + r + 1)
(7.63)
where
(7.64)
and where the arguments of the functions that enter are those of the curve
~o
Bolza then asserts that, as he showed in his 1907 paper, the multipliers I
are independent of the choice of at least one of the q + r + 1 systems of
functions ~ I'
... ,
(7.63')
holds with the same multipliers for all 'I] I'
satisfy the m differential equations
cIk('I])
=t
( afPk
aYi 'l]i
afPk)
ayI'I];
.. ,
=0
(7.65)
The proof given in Bolza [1907] is concerned with the rank of the matrix58
"
0"
"
II V( '1]),
'It p( 'I] ),Xy( '1])11
S8See
(a
Bolza [1907], p. 372 or Bliss, LEe, p. 201. The matrix above must have a maximal rank
p less than q + r + I for some system ~; otherwise, the equation U( E) = U(O) + v together
with (7.62) would have their functional determinant different from zero at an initial solution
(E, v) = (0, 0); in this case there would be a solution ,,(v) near v = 0 and U(E) < U(O) for v
small and .negative, which is a contradiction. If now there were a set '1/ for which the relation
(7.63') were not valid, the rank of the matrix could be made p + I, which is a contradiction.
379
so that
10
cI>p+p dt
2:, (at/lp
---a:-l1i II y,
at/lp
---a:-l1i
y,
10
1
o.
(7.66)
,,( a<p"
a<p" ,)_
f aYi l1i + ay; l1i - 0
(a = I, 2, . . . ,p)
(7.67)
~ aYi
II'l1i(tl)-~
at/lp
aYi
10
(7.68)
l1i(to)=O
2: A"<p,, + 2: JLpt/lp,
aQ
Hi o = - -a'
Yi
"
f3
10
IO
f3
Yi
YiO
(7.70)
Id
To complete the argument, the functions A,,(t), JLp(t) and the constants
are so determined that the equations
(k
= I, 2, ... , m)
380
(a{2
a{2 )T/m+p dt + ""
0
-a-- ddt -a-'L.J Hi T/i(tO)
Ym+p
Ym+p
i
(p
_.!!...
~ =0
dt ily;
i = 1,2, ... , n
(7.71)
H;o=O,
(7.72)
are satisfied and besides the q + r + 1 multipliers lo,~, Iq+r are not all equal
to zero. A t the same time the quantities g, H;o, H; ,Ip are defined by the
equations . .. [(7.70)].
for which 10 =1= 0, and for which the differential equations [(7.71)] and the
boundary conditions [(7.72)] hold, then V(1J) = 0 for every system of admissible variations, i.e., for every two times continuously differentiable system of
functions 1J I, . . . , 1Jn which satisfy the q finite equations
ifp(1J) =
ill/lp
Li -1J;=
0
ily;
(7.73)
(7.74)
[as well].
+L
(\'ucIu{T/)dt+
a )to
Lp It(
JJ.pi' p{T/)dt+ L I3$p{T/)
to
p
+ L1Ji'{T/) + Llq+rXr{T/)
p
is equal to V(T/) for all T/i satisfying equations (7.65), (7.73), and (7.74).
381
+ r + 1 systems of functions
~ n from ~ so that at least one determinant of degree q + r of the
~ I,
... ,
matrix
0=
1,2, ... , q + r + 1
(7.75)
~o
Case II. All determinants of the matrix ... [(7.75)] vanish however the
l1
l1
1/ I, . . . , 1/ n of ~ may be chosen.
+ r + I systems of functions
(7.76)
382
ago
aYi
_!
dt
ago = 0
i = 1,2, ... , n
ay;
hold with
(7.77)
+ r constants
Ip = Ipo + lpI
i"'0 JLp(t)dt.
The proof is not difficult. The necessity follows at once from his
Theorem II with 10 = O. To show the sufficiency he proceeds as follows: the
systems of equations (7.66), (7.67), and (7.68) are valid for every." in ~;
multiply these equations with the multipliers Aa , JLp,IJ whose existence is
guaranteed by the theorem, integrate the first two sets from 10 to II' sum
them as to a and /3, and add up the results. He next applies the usual
integration by parts and uses the equations (7.77). It follows directly that
(7.76) holds.
For the normal case BoIza states and proves:
Theorem IV.
If the curve
@o
>.,.(t), JLp(t),
13,
lJ, lq+r
for which the differential equations . .. [(7.71)] and the boundary conditions ... [(7.72)] are satisfied with 10 = 1, then there is exactly one such set
of multipliers.
H/ = 0 imply that
383
a(n - 0)
aYi
A.. a(n dt
0) = 0
ay;
= extremum in ID1
and Problem II
XYo(YtO, ... ,Yna; YlI' ... ,Ynt) = extremum in ID1'.
BoIza closes his paper with a discussion of some special cases and the
problems that arise in applying the general theory to them.
X2
p= dy
dx
f(x, y, p)dx,
XI
/J
fJ
CPx
Jcp; + cP;
.
,Sln
/J
fJ
CPy
= -;:====Jcp; + cP;
The distance dn between two curves defined by "A and "A + d"A is given by
d"A = CPxdx + cpydy = dn(cp; + cp;)t/2. Caratheodory now draws a curve segment, meeting the x-axis in the angle a as in Figure 7.11, and asks that it be
384
Figure 7.11
ds =
dn
cos(a - 13) ,
dx =
dncosa
cos (a - 13) ,
f(x,y,p)d"A.
fdx=---CPx
since p
+ PCPy
f - pJ;, =
CPy
J;, .
(7.78')
The sign of
385
+ fXPy)1/1(<P),
(7.79)
which reduces to
Ix dx + /y dy = (<Pxx + fXPxy) dx + (<Pxy + fXPyy) dy
because of the second of equations (7.80). This yields the two equations
Ix
<Pxx
+ fXPxy'
/y = <Pxy + P<Pyy .
(7.81 )
He next differentiates the second of the equations (7.80) along a curve for
which dy / dx = p and finds that
<Pxy
+ fXPyy
= dx
/p,
which combines with the second of equations (7.81) to yield the Euler
condition
(7.82)
62Re1ation (7.78) may be considered as follows: let y(x) Xl .;;; X .;;; X2 be an admissible arc
nowhere tangent to 'P(x, y) = A. The equation 'PIx, y(x) = A then determines x as a function
of A, X(A), so that X'(A) = I/('Px + P'Py). The integral J taken alongy(x) from Xl to x2 = X(A)
is such that d.J / dA = f /('Px + p'py )' At each point (x, y) this expression may be viewed as a
function of y' = p; the direction I : p in which it has its least value is the direction of quickest
descent (see Bliss LEe, pp. 77ff). Then the arc y(x) has the direction of quickest descent at a
point (x, y) where it intersects the curve 'P(x, y) = A if and only if the curve 'P = A is
transversal to the arc at the point. For the direction 'Py : - 'Px to Be transversal to the direction
1 : y' = p, the integrand f dx + (dy - p dx)J, of the Hilbert invariant integral must vanish; but
this is f'Py - J,(f(!x + P'Py) = 0, which is an immediate consequence of (7.78).
63Two curves of the family are geodesically equidistant in case the values of the integral J
calculated along the segments of curves of quickest descent, bounded by the two curves of the
family, are all equal.
386
y = g(x,a),
be a one-parameter family of integrals of the differential equation (7.82),
the curves of which are cut transversally by tp = A. The solution of the
equation of y = g(x,a) is a = a(x, y), and p = gAx, a).
Caratheodory wishes to show that the curves tp = A, which these extremals-the solutions of (7.82)-cut transversally, are geodesically equidistant. To do this he wishes to show that tpx = f - p/p, tpy = /p or equivalently
that the equation
a:
holds at every point (x, y) of a region of the plane. Caratheodory uses a bar
over a function, such as p, to indicate that it is to be viewed as a function of
x, y alone. He thus has J= f[x, y, p(x, y)]. With the help of this notation,
he writes this relation as
(7.83)
since p = p(x. y) = gx [x, a(x, y) and y
lates for the same reason that
gxlaxa }.
and hence that .t, = /px + /pygx + /ppgxx holds identically in x, a. If the
values a(x, y), p(x, y) are put in place of a and p, then this relation holds
identically in x, y. This enables Caratheodory to write
gxa
ga '
from which it is evident that (7.83) is valid. This establishes the converse he
desired.
Later Caratheodory noticed that the curves tp = A satisfy a Hamilton64Caratheodory [1904]. p. 74.
387
(7.84)
tl
to
,y~)dt
,y~) =
388
Hahn supposes that along the arc in question, not all subdeterminants of
the matrix
I a!Pi I
(i
ay~
vanish identically, and that f and the !Pi are positively homogeneous in the
y'.
Jto
which are
m (
a!Pk
d
3!Pk )
(i = 0, 1, . . . , n)
~ Ak -a- - {[Ak-a' = 0
k=O
Yi
t
Yi
are satisfied. He goes on to note that for the problem of Lagrange the first
of these equations reduces to dAo/ dt = 0, and the remaining equations are
then expressible as
\( af
d af) + ~
aYi - dt ay;
k~l
"0
(Ak a!Pk
aYi
d A a!Pk) - dt k ay; -
(i
k=l
Ak -a!Pk
aYi,
d A -a!Pk, )
-d
k a
t
Yi
=0
(i
(7.86)
Hahn now says: "We wish to say that our extremal shows itself in an
abnormal condition on each interval in which the equations ... [(7.86)] can
be satisfied with functions A, which are not all identically zero." He goes on
to assume that the extremals are not abnormal on the interval (tot I). He
calls this the principal case. His result is then:
"Each extremal of the Lagrange problem, which does not behave abnormally on any sub-interval of (tot I)' must satisfy the equations . .. [(7.85) with
AO = 1]."
The reader will recall that Mayer [1886] was the first to observe the
difference between normal and abnormal arcs and that von Escherich
([1899], p. 1290) discussed the distinction in some detail. After that various
66For a full discussion of the equivalence of problems, see Bliss, LEe, pp. 189-193.
389
papers were written on the problem of Lagrange, assuming that the arc in
question was normal on every subinterval or on even stronger assumptions.
In recent times Bliss, Morse and Myers, Graves, and McShane have shown
that the multiplier rule as well as the conditions of Weierstrass and Clebsch
can be established with no assumptions of normality. It was Morse in a
series of papers in 1930-31 who "was the first to avoid the use of a
hypothesis on an extension of the interval of the minimizing arc. Hestenes ... , Morse ... , and Reid ... have proved sufficiency theorems applicable to arcs which are normal but not necessarily normal on every
subinterval, and to abnormal arcs of a special type.,,67
Since the work of Hahn there has been a great deal of research on
general problems, mainly the problem of BoIza, so that by the middle of
this century what one might well call the "classical theory of the calculus of
variations" was completed. In very large measure this work was accomplished by Bliss and his Chicago school of students and colleagues. This
group included Graves, Hestenes, McShane, Reid, and many others. Their
work is admirably summarized in Bliss, LEe.
67Bliss, LEe, p. 219. An arc is said to be normal or abnormal on an interval for the problem
of Bolza if it is normal or abnormal for the corresponding problem with fixed end-points on
that interval.
Bibliography
Adams CAT:
John Couch Adams, A Catalogue of the Portsmouth Collection of Books and
Papers Written by or Belonging to Sir Isaac Newton, Cambridge, 1888.
Andrade [1958]:
Edward N. da C. Andrade, Sir Isaac Newton, His Life and Work, London,
1954. Reprinted Garden City, N. Y., 1958.
Armanini [1900]:
Egidio Armanini, Sulla superficie di minima resistenza, Ann. di Mat., Ser. IlIa,
Vol. IV, 1900, pp. 131-149.
Beltrami
OPERE: Eugenio Beltrami, Opere Matematiche, 4 Vols., Milan, 1902-1920.
[1868]:--Sulla teoria delle linee geodetiche, Rend. d R. 1st. Lombardo (2),
Vol. I, 1868, pp. 708-718 = OPERE, Vol. I., pp. 366-373.
Bernoulli, Ja.
0: James Bernoulli, Jacobi Bernoulli, Basileensis, Opera, 2 vols., Geneva, 1744.
Reprin ted Brussels, 1967.
CL:---Curvatura laminae elasticae, Act. Erud., Leipzig, June 1694, p.
262 = 0, Vol. I, pp. 576-600.
JB:---Jacobi Bernoulli solutio problematum fraternorum, Act. Erud.,
Leipzig, May 1697, p. 214 = 0, Vol. II, pp. 768-778. German translation
(partial) by P. Stackel in No. 46, Ostwald's Klassiker der exacten Wissenschaften, Leipzig, 1894, pp. 14-20. English translation (partial) in D. J.
Struik, ed., A Source Book in Mathematics, 1200-1800, Cambridge, 1969, pp.
396-399.
SO:---Jacobi Bernoulli solutio propria problema tis isoperimetrici, Act.
Erud. , Leipzig, June 1700, p. 261 = 0, Vol. II, pp. 874-887.
MP:---Analysis magni problematis isoperimetrici, Act. Erud., Leipzig, May
1701, p. 213 = 0, Vol. II, pp. 895-920.
Bernoulli, Jo.
00: John Bernoulli, Johannis Bernoulli, Opera Omnia, J. E. Hofmann, ed., 4
Vols., Lausanne and Geneva, 1742, Reprinted Hildesheim, 1968.
LB:---"Lectori benevolo," Act. Erud., Leipzig, December 1696, p. 560
= 00, Vol. I, p. 165.
PN:---Problema novum ad cujus solution em mathematici invitantur, Act.
Erud., Leipzig, June 1696, p. 269 = 00, Vol. I, p. 161. German translation
by P. Stackel in No. 46 of Ostwald's KLASSIKER, p. 3.
PR:--Programma, 00, Vol. I, pp. 166-169. German translation by Stackel, KLASSIKER, No. 46, pp. 3-6.
CR:---Curvatura radii in diaphanis non uniformibus solutioque problematis a se in Actis 1696, p. 269, propositi, de invenienda linea brachystochrona,
id est, in qua grave a dato puncto ad datum punctum brevissimo tempore
390
Bibliography
391
392
Bibliography
[1907]:---Die Lagrangesche Multiplikatorenregel in der Variationsrechnung fUr den Fall von gemischten Bedingungen und die zugeh6rigen Grenzgleichungen bei variablen Endpunkten, Math. Ann., Vol. LXIV, 1907, pp.
370-386.
VOR:---Vorlesungen uber Variationsrechnung, Leipzig u. Berlin, 1909.
[1912/13]:---Bemerkungen zu Newtons Beweis seines Satzes uber den
Rotationsk6rper kleinsten Widerstandes, Bibl. Math., Ser. 3, Vol. XIII,
1912-1913, pp. 146-149.
[1913]:---Uber den "Anormalen Fall" beim Lagrangeschen und Mayerschen Problem mit gemischten Bedingungen und variablen Endpunk ten, Math. Ann., Vol. LXXIV, 1913, pp. 430-446.
Borda [1767]:
Jean-Charles, Chevalier de Borda, Ec1aircissement sur les methodes de trouver
les courbes qui jouissent de quelque propriete du maximum ou du minimum,
Mem. Acad. Sci., 1767, pp. 551-563.
Brunet ACT:
Pierre Brunet, Etude historique sur Ie Principe de la Moindre Action, Paris, 1938.
Cairns [1907]:
W. Cairns, Die Anwendung der Integralgleichungen auf die zweite Variation bei
isoperimetischen Problemen, Dissertation, G6ttingen, 1907.
Caratheodory
GMS: Constantin Caratheodory, Gesammelte Mathematische Schriften, 5 vols.,
Munich 1954-1957.
[1904]:---Uber die diskontinuierlichen Losungen in der Variationsrechnung,
Dissertation, G6ttingen, 1904 = GMS, Vol. I, pp. 3-79.
[1906]:---Uber die starken Maxima and Minima bei einfachen Integralen,
Math. Ann., Vol. LXII, 1906, pp. 449-503 = GMS, Vol. I, (Habilitationschrift, G6ttingen, 1905), pp. 80-142.
[1930]:---Untersuchungen tiber das De1aunaysche Problem der Variationsrechnung, Abh. Hamburg, Vol. VIII, 1930, pp. 32-55 = GMS, Vol. II,
pp. 12-39.
[1937]:---The beginning of research in the calculus of variations," Osiris,
Vol. III, 1937, Part I, pp. 224-240 = GMS, Vol. II, pp. 93-107.
[1945]:---Basel und der Beginn der Variationsrechnung, Festschrift zum 60.
Geburtstag von Prof Dr. Andreas Speiser, Zurich, 1945, pp. 1-18 = GMS,
Vol. II, pp. 108-128.
Clebsch
[1858]: Rudolff F. A. Clebsch, Ueber die Reduktion der zweiten Variation auf
ihre einfachste Form, Jour. fur Math., Vol. LV, 1858, pp. 254-273.
[1858']:---Ueber diejenigen Probleme der Variationsrechnung, welche nur
eine unabhangige Variable en thai ten, Jour. fur Math., Vol. LV, 1858, pp.
335-355.
Cramer [1752]:
Gabriel Cramer, Memoire posthume de geometrie, Abh. Akad. Berlin, 1752, pp.
283-290.
Darboux TDS:
Gaston Darboux, Lec;,ons sur la Theorie generale des Surfaces, 4 vols., Paris,
1887-1896.
Delaunay
[1841]: Charles Delaunay, Sur la distinction des maxima et des minima dans les
questions qui dependant de la methode des variations, Jour. d. Math., Vol.
VI, 1841, pp. 209-237.
[1843]:---Memoire sur Ie calcul des. variations, Jour. ['Ecole Poly., Vol.
XVII, 1843, pp. 37-120.
Bibliography
393
du Bois-Raymond
[1879]: Paul du Bois-Raymond, Erlauterungen zu den Anfangsgriinden der
Variationsrechnung, Math. Ann., Vol. XV, 1879, pp. 283-314.
[I879']:---Fortsetzung der Erlauterungen zu den Anfangsgriinden der
Variationsrechnung, Math Ann., Vol. XV, 1879, pp. 564-578.
Egorow [1906]:
D. Egorow, Die hinreichenden Bedingungen des Extremums in der Theorie des
Mayerschen Problems, Math. Ann., Vol. LXII, 1906, pp. 371-380.
Erdmann
[1877]: G. Erdmann, Untersuchung der hOheren Variationen einfacher Integrale, Zei~J Math. u. Phy., Vol. XXII, 1877, pp. 327.
[1877']:---Uber unstetige Losungen in der Variationsrechnung, Jour. for
Math., Vol. LXXXII, 1877, pp. 21-30.
[I878]:---Zur Untersuchung der zweiten Variation einfacher Integrale,
Zeit. f. Math. u. Phy., Vol. XXIII, 1878, pp. 362-379.
Euler
OPERA: Leonard Euler, Leonhardi Euleri Opera Omnia, 72 vols., Bern, 19111975. Series I, Opera Mathematica.
I, XXIV:---Methodus Inveniendi Lineas Curvas Maximi Minimive Proprietate Gaudentes sive Solutio Problematis Isoperimetrici Latissimo Sensu
Accepti, Lausanne and Geneva, 1744 = OPERA, I, Vol. XXIV, C. Caratheodory, ed., Bern, 1952.
I, XXV:-Commentationes analyticae ad calculum variationum pertinentes, OPERA, I, Vol. XXV, C. Caratheodory, ed., Bern, 1952.
[1755]:--Euler Ii Lagrange, Berolini, die 6 Sept. 1755, Oeuvres de Lagrange,
Vol. XIV, pp. 144-146. This is Euler's response to Lagrange's first letter to
him.
[I764]:--Elementa calculi variationum, I, XXV, pp. 141-176.
[I764']:---Analytica explicatio methodi maximorum et minimorum, I,
XXV, pp. 177-207.
[I77I]:---Methodus nova et facilis calculum variationum tractandi, I,
XXV, pp. 208-235.
Fatio [1699]:
Fatio de Duillier, Investigatio Geometrica Solidi Rotundi, in quod Minima fiat
Resistentia. This is an appendix to his Lineae Brevissimi Descensus Investigatio Geometrica Duplex, London, 1699.
Fermat OP:
Pierre de Fermat, Oeuvres de Fermat, P. Tannery and C. Henry, eds., 4 vols.,
Paris, 1891-1912; Supplement, C. de Waard, ed., Paris, 1922.
Galileo TWO:
Galileo Galilei, Discourses and Mathematical Demonstrations Concerning Two
New Sciences, Pertaining to Mechanics and Local Motions, with an Appendix
on Centers of Gravity of Solids, Leyden, 1638, (English translation by S.
Drake, Madison, 1974).
Gauss
WERKE: Carl Friedrich Gauss, WERKE, 12 vols., Gottingen, 1870-1933.
DIS:---Disquisitiones generales circa superficies curvas, WERKE, Vol. IV,
p. 241 = Allgemeine FHichentheorie, Klassiker, Vol. 4, pp. I-51.
Gregory N M F:
David Gregory, Newtoni methodus fluxionum. This appears as an appendix to
Andrew Motte's 1729 translation of the Principia, The Mathematical Principles of Natural Philosophy, by Sir Isaac Newton, Vol. II, Appendix, pp. v-viii.
Hadamard
LEC: J. Hadamard, Ler"ons sur Ie Calcul des Variations, Paris, 1910.
394
Bibliography
Bibliography
395
[1901]:--Uber das Dirichletsche Prinzip, Math. Ann., Vol. LIX, 1901, pp.
161-186.
[1905]:--Uber das Dirichletsche Prinzip, Jour. flir Math., Vol. CXXIX,
1905, pp. 63-67.
[1906]:--Zur Variationsrechnung, Math. Ann., Vol. LXII, 1906, pp. 351370 = Gott. Nach. 1905, pp. 159-180.
[1912]:--Grundzuge einer allegemeinen Theorie der linearen Integralgleichungen, Leipzig u. Berlin, 1912.
Huygens OC:
Christiaan Huygens, Oeuvres completes, 22 vols., The Hague, 1888-1950.
Jacobi
WERKE: C. G. J. Jacobi, C. G. J. Jacobi's Gesammelte Werke, K. Weierstrass,
ed., 8 vols., Berlin, 1881-1891.
[1838]:--Zur Theorie der Variations-Rechnung und der DifferentialGleichungen, Jour. flir Math., Vol. XVII, 1937, pp. 68-82 = WERKE, Vol.
IV, pp. 39-55 = Sur Ie calcul des variations et sur la theorie des equations
differentielles, Jour. de Math., Vol. III, 1838, pp. 44-59. This paper is also
reprinted by Stackel in Ostwald's Klassiker, Vol. 47, pp. 87-98.
[1838']:---U ber die Reduction der Integration der partiellen
Differentialgleichungen erster Ordnung zwischen irgend einer Zahl Variabeln auf die Integration eines einzigen Systemes gewohnlicher Differentialgleichungen, Jour. flir Math., Vol. XVII, 1838, pp. 97-162 = WERKE,
Vol. IV, pp. 57-127.
NACH:--Uber diejenigen Probleme der Mechanik in welch en eine
Kraftefunction existirt und fiber die Theorie der Storungen, WERKE,.Vol.
V, pp. 217-395.
NACH':--Uber die vollstandigen LOsungen einer partiellen Differentialgleichung erster Ordnung, WERKE, Vol. V, pp. 399-438.
Jordan COURS:
C. Jordan, Cours d'Analyse de ['Ecole Poly technique. 3 vols., Paris, 1882.
Kneser
[1898]: Adolph Kneser, Zur Variationsrechnung, Math. Ann., Vol. L, 1898, pp.
27-50.
[1899]:--Ableitung hinreichender Bedingungen des Maximums oder
Minimums einfacher Integrale aus der Theorie der zweiten Variation, Math.
Ann., Vol. LI, 1899, pp. 321-345.
LV:--Lehrbuch der Variationsrechnung, Braunschweig, 1900.
LV':--Lehrbuch der Variationsrechnung, Braunschweig, 1925.
Kobb [1892/93]:
Gustaf Kobb, Sur les maxima et les minima des integrales doubles, Acta Math.,
Vol. XVI, 1892/93, pp. 65 - 140; and Vol. XVII, 1893, pp. 321-344.
Lagrange
OEUVRES: Joseph Louis Lagrange, OEUVRES, J. - A. Serret and G. Darboux, eds., 14 vols., Paris, 1867-1892.
MA:--Mecanique Analytique, 2 vols., Paris, 1788 and 2nd aug. ed. 1811 15 = OEUVRES, Vois. XI and XII.
[1755]:--Lagrange a Euler. Die 12 Augusti [1755], OEUVRES, Vol. XIV,
pp. 138-144. In fact, pp. 135-245 contain the correspondence between the
two men.
[1759]:--Recherches sur la methode de maximis et minimis, Misc. Soc.
Tur., Vol. I, 1759 = OEUVRES, Vol. I, pp. 3-20.
[l760]:--Essai sur une nouvelle methode pour determiner les maxima et
les minima des formules integrales indefinies, OEUVRES, Vol. I, pp.
Bibliography
396
333-362 = Misc. Soc. Tur., Vol. 11,1760-1761, pp. 173-195. German translation by P. Stackel, Klassiker, Vol. 47, pp. 3-56 and English translation (in
part) by D. J. Struik, A Source Book in Mathematics, pp. 407-418.
[1760']:---Application de la methode exposee dans Ie memoire precedent a
la solution de differents problemes de dynamique, OEUVRES, Vol. I, pp.
363-468 = Misc. Soc., Tur., Vol. II.
[1766/69]:--Sur la methodes des variations, Misc. Soc. Tur., Vol. IV,
1766 = OEUVRES, Vol. II, pp. 37-63.
TAF:---Theorie des Fonctions analytiques, contenant les Principes du Calcul
differentiel, degages de toute Consideration d'injiniment Petits, d'Evanouissans,
de Limites et de Fluxions, et reduits a l'Analyse algebrique des Quantires jinies,
Paris 1797, 2nd ed. 1813 = OEUVRES, Vol. IX.
LCF:---Lec,ons sur Ie Calcul des Fonctions, Paris, 1806 = OEUVRES, Vol.
x.
Lebesgue, H. [1902]:
Henri Lebesque, Integrale, longueur, aire, Ann. di Mat., (3), Vol. VII, 1902, pp.
342-359.
Lebesgue, V.-A. [1841]:
V.-A. Lebesgue, Memoire sur une formule de Vandermonde, et son application
a la demonstration d'un theoreme de M. Jacobi, Jour. de Math., Vol. VI,
1841, pp. 17-35.
Lecat
BIB: Maurice M. A. Lecat, Bibliographie du Calcul des Variations 1850-1913,
Ghent and Paris, 1913.
BIB':---Bibliographie du Calcul des Variations, depuis les Origines jusqu'a
1850 comprenant la Liste des Travaux qui ont Prepare ce Calcul, Ghent and
Paris, 1916.
Legendre [1786]:
Adrien-Marie Legendre, Sur la mainere de distinguer les maxima des minima
dans Ie calcul des variations, Mem. Acad. Sci., (1786) 1788, pp.7-37.
German translation by P. Stackel, Klassiker, Vol. 47, pp. 57-86.
Leibniz LMS:
G. W. Leibniz, Leibnizens mathematische Schriften, C. I. Gerhardt, ed., Halle,
Part I, Vol. III, 1855.
Linde1of-Moigno LEC:
Ernst L. Linde10f and Fran~ois N. M. Moigno, Lec,ons de Calcul des Variations,
Paris, 1861.
Lundstrom [1869]:
O. Lundstrom, Distinction des maxima et des mimma dans un probleme
isoperimetrique, Nova Act. Upsal., Ser. 3, Vol. VII, 1869.
Mach MEC:
Ernst Mach, Die Mechanik in ihrer Entwickelung historisch-kritisch dargestellt,
Leipzig, 1933.
Maclaurin FLUX:
Colin Maclaurin, A Treatise of Fluxions, 2 vols., Edinburgh, 1742; 2nd end.
London, 1801. French translation. Paris, 1749.
MacMillan SAD:
William D. MacMillan, Statics and the Dynamics of a Particle, New York, 1927.
Mainardi [1852]:
G. Mainardi, Ricerche suI calcolo delle variazioni, Ann. Sc. Math. e. Fis., Vol.
III, 1852, pp. 149-192,379-383.
Mason [1904]:
Max Mason, Zur Theorie der Randwertaufgaben, Math. Ann., Vol. LVIII,
1904, pp. 528-544. See also Bliss.
Bibliography
397
Maupertuis
[1740]: Pierre Louis Moreau de Maupertuis, Loi du repos des corps, Mem.
A cad. Sci., Paris, 1740, pp. 170-176.
[1744]:---Accord de differentes lois de la nature qui avaient jusqu'ici paru
incompatibles, Mem. Acad. Sci., 1744, pp. 417-426.
Mayer
[1866]: Adolph Mayer, Beitriige zur Theorie der Maxima und Minima der
einfachen Integrale, Leipzig, 1866.
.
[I868]:---Ueber die Kriterien des Maximums und Minimums der
einfachen Integrale, Jour. fur Math., Vol. LXIX, 1868, pp. 238-263.
[1870]:---Der Satz der Variationsrechnung, welcher dem Princip der
kleinsten Wirkung in der Mechanik entspricht, Math. Ann., Vol. '11, 1870,
pp. 143-149.
GP:---Geschichte des Princips der kleinsten Action, Leipzig, 1877.
[1877]:---Die Kriterien des Maximums und Minimums der einfachen
Integrale in den isoperimetrischen Problemen, Math. Ann., Vol. XIII, 1878,
pp. 53-68 = Leip. Ber., Vol. XXIX, 1877, pp. 114-132.
[I 878]:---Ueber das allgemeinste Problem der Variationsrechnung bei einer
einzigen unabhiingigen Variablen, Leip. Ber., Vol. XXX, 1878, pp. 16-32.
[I 884]:---Zur Aufstellung der Kriterien des Maximums und Minimums der
einfachen Integrale bei variablen Grenzwerthen, Leip. Ber., Vol. XXXVI,
1884, pp. 99-128.
[1886]:---Begriindung der Lagrange'schen Multiplicatorenmethode in der
Variationsrechnung, Math. Ann., Vol. XXVI, 1886, pp. 74-82.
[1886']:---Die beiden allgemeinen Siitze der Variationsrechnung, welche
den beiden Formen des Princips der kleinsten Action in der Dynamik
entsprechen, Leip. Ber., Vol. XXXVIII, 1886, pp. 343-355.
[1895]:---Die Lagrang'sche Multiplicatorenmethode und das allgemeinste
Problem der Variationsrechnung bei einer unabhiingigen VariabeIn, Leip.
Ber., Vol. XLVII, 1895, pp. 129-144.
[I896]:---Die Kriterien des Minimums einfacher Integrale bei variablen
Grenzwerthen, Lep. Ber., Vol. XLVIII, 1896, pp. 436-465.
[1903], [1905]:--Uber den Hilbertschen Uniibhangigkeitsatz in der Theorie
des Maximums und Minimums der einfachen Integrale, Math. Ann., Vol.
LVIII, 1904, pp. 235-248 = Leip. Ber., Vol. LV, 1903, pp. 131-145. II
Mitteilung, Math. Ann., Vol. LXII, 1906, pp. 335-350 = Leip. Ber., Vol.
LXII, pp. 49-67, 313-314.
Morse [1934]:
Marston Morse, The Calculus of Variations in the Large, New York, 1934. (Here
there is an excellent bibliography of his earlier papers. The interested reader
should certainly consult these.)
Moulton BAL:
F. R. Moulton, New Methods in Exterior Ballistics, Chicago, 1925.
Newton
PAPERS: Isaac Newton, The Mathematical Papers of Isaac Newton, 7 vols., D.
T. Whiteside, ed., Cambridge, 1967-1976.
SOL:---The solid of revolution of least resistance to motion in a uniform
fluid, PAPERS, Vol. VI, late 1685, pp. 456-465.
A PP 1 :---Appendix 1. The resistance of a sphere to 'rapid' rectilinear
motion, PAPERS, Vol. VI, pp. 466-469.
APP 2:---Appendix 2. Recomputation of surfaces of least resistance
(1694), PAPERS, Vol. VI, pp. 470-480.
PRIN:---Philosophiae Naturalis Principia Mathematica, 1st ed. London,
1687, 2nd ed. 1713, 3rd ed., London 1725/26. English translation by A.
398
Bibliography
Bibliography
399
400
Bibliography
Index
Abnormality
Bliss on 389
Bolza on 381 - 383
Conditions of Weierstrass and
Clebsch 389
vs. Escherich on 250-251,387-388
Graves on 389
Hahn on 286,381,387-389
Hestenes on 389
Mayer on 250-251,285-286,
387-388
McShane on 389
Morse and Myers on 389
Reid on 389
Sufficiency conditions 389
Accessory equations and extremals
Clebsch on 252, 271 - 272
Jacobi on 158-160
Mayer on 271- 272 (see also Jacobi
differential equation)
Adams 19 n
Andrade 34 n
Archimedes 10
Armanini 25
Basnage 32, 35, 48
Beltrami (see Beltrami - Hilbert theorem;
Fields of extremals; Hamilton - Jacobi
equation)
Beltrami -=-Hilbert theorem 317 n
Bernoulli, Daniel 67, 108
Bernoulli, James 7,21,31,68, 101 (see
also James Bernoulli's isoperimetric
problems; Brachystochrone problem;
Elastic curves; Heavy chain problem;
Isochrone)
James Bernoulli's isoperimetric problems
John Bernoulli's responses 48 - 50,
52, 58-63
Euler on 67
His challenge and solutions 47 -59,63
Bernoulli, John 2,7,21,39,50,67-68,
108 (see also James Bernoulli's
isoperimetric problems;
Brachystochrone problem,
Conservation of energy; Fermat's
principle; Fundamental equations of
John Bernoulli; Geodesics; Heavy
chain problem; Law of uniformity;
Orthogonal trajectories; Sufficient
conditions; Synchrones)
Bertrand (see Fundamental lemma;
Jacobi's theorem)
Bliss 119 n, 314 n, 316 n (see also
Abnormality; Bolza problem;
Caratheodory's method; Conjugate
points; Conjugate sets; Envelope
theorem; Equivalence of problems;
Existences of fields; Existence
theorems; Fields of extremals; Focal
points; Hahn theorem; Hilbert
differentiability condition; Hilbert
integral; Implicit-function theorems;
Jacobi condition; Legendre condition;
Mayer families; Multiplier rule;
Necessary conditions; Parametric
problem; Second variation; Strong and
weak extrema; Sufficient conditions;
Transversality condition;
Transversals; Variable end-point
problems; Weierstrass condition)
Bocher 363
Bolza 150 n, 190 n, 201 n, 223 n, 251
(see also Abnormality; Bolza
problem; Clebsch condition;
Conjugate points; Conjugate sets;
Existence of fields; Existence
theorems; Fundamental lemma;
Geodesics; Hilbert integral;
401
402
Index
Bolza [cont.]
Implicit-function theorems;
Isoperimetric problem; Jacobi
condition; Jacobi differential
equation; Lagrange problem;
Legendre condition; Mayer
determinant; Mayer problem;
Multiplier rule; Newton problem;
Normal coordinates of a field;
Parametric problem; Second variation;
Strong and weak extrema; Sufficient
conditions)
Bolza problem 389
Bliss on 374, 378 n, 389
Bolza on 373 - 383
Chicago School 389
Multiplier rule for 378-383
Borda (see Brachystochrone problem)
Boundary-value problems
w. Cairns on 363
Hilbert on 362, 367 - 368
Mason on 363 - 367
Richardson on 363 - 371 (see also
Oscillation theorems)
Bousquet 67
Brachystochrone problem
Analytic solution 32-34
James Bernoulli's solution 44-47
John Bernoulli's proof of
sufficiency 65-66
John Bernoulli's use of Fermat's
Principle 38 -44
John Bernoulli's use of wave
fronts 42-44
Borda's criticism 119, 138
Caratheodory on 383, 387
Challenge to solve 30-31
Euler on 78 -84
First solution by John
Bernoulli 30-32, 35, 38-44
Formulation by John
Bernoulli 30-31,34
Galileo 32
Lagrange on 117 - 120
Lagrange's treatment for variable
end-points 120-121,136-138
Lagrange's treatment of on a
surface 119-120
Leibnizon 31,35-38,44,46,48,51,
64
Named Tachystoptote by Leibniz 31
Second solution by John
Bernoulli 63-66
Solution by James Bernoulli 7, 31,
39 n, 45-46
Index
Conjugate points
Bliss's discussion of 163 n
Bolza on 231 n
Erdmann on 241
For isoperimetric problem 220
Hesse on 186-189
Jacobi's analysis of 140,151-168
Jacobi's example 163 - 168
Kneser on 281, 343 - 346
Lundstrom on 220, 235 n, 293 n
Mayer on 276-281,303-305,343
Relation to sign of second
variation 410
Scheeffer on 239, 343
Weierstrass on 197-201,204-207,
220, 228 - 229 (see also Envelope of a
family; Focal points; Isoperimetric
problem; Jacobi condition; Second
variation)
Conjugate sets 253
Bliss on 280 n
Bolza on 280 n
Clebsch and Mayer on 253,272-276,
304
Escherich on 253 (see also Accessory
equations and extremals)
Conservation of energy 164
John Bernoulli on 108
Control theory 315
Cramer (see Closed polygon of greatest
area)
Critical points (see Focal points)
D'Alembert 109
Darboux (see Envelope of a family;
Geodesics)
De la Chambre 1, 6
Delaunay 156, 186 (see also Delaunay's
problem; Jacobi's theorem)
Delaunay's problem 169 n
Descartes 4, 6 (see also Descartes' or
Snell's rule; Fermat's principle)
Descartes' or Snell's rule 1-6
Dini (see Implicit-function theorems)
Dirichlet 372
du Bois-Raymond 191 n-192 n (see
also Fundamental lemma;
Isoperimetric problem)
Elastic curves
James Bernoulli on
101
403
Euler on 101
Emerson 8 n
Encke 176 n
Enestrom 44
Envel{)pe of a family 155-156
Darboux on 156 n, 339-340
Jacobi on 162-163, 167 -168
Kneser on 156 n, 339-346
Relation to conjugate points 162, 218
Zermelo on 156 n, 338 n, 340-341
(see also Envelope theorem)
Envelope theorem 67
Bliss on 340 n, 349-351, 358-359
Gauss on 338
Kneser on 338 - 343
Relation to Jacobi condition 338 - 343
Zermelo on 338-341
Equivalence of problems
Bliss on 388 n
Erdmann (see Conjugate points; Second
variation; Third variation;
Weierstrass - Erdmann comer
condition)
Escherich 250 (see also Abnormality;
Conjugate sets; Escherich's
fundamental form; Second variation)
Escherich's fundamental form 254
Euler 7, 51, 63, 130
Names the subject 68
100 special problems 67 (see also
Brachystochrone problem; Elastic
curves; Euler equations;
Euler- Lagrange equations; Euler's
methods; Euler's modification of
Brachystochrone problem; Geodesics;
Heavy chain problem; Invariance of
Euler equation; Isoperimetric
problem; Lagrange problem; Least
action principle; Multiplier rule;
Relation between Euler and Lagrange;
Sufficient conditions; Variable
end-point problems; Variations)
Euler equations 17-18,26,33,68,
72-73,78 (see also Euler-Lagrange
equations)
Euler-Lagrange equations
Euler on 72-78
Lagrange on 113-116,119-124,
148-150
Euler's methods 21
For Lagrange problems 73 -84
For simple problems 67 - 73
Functions of functionals 89-92
Variational method 115 (see also
Lagrange's method of variations)
404
Index
Osgod on 247-248
Relation to Hilbert integral 321-3.22
Schwarz on 223 n, 247-248, 317,
321
Slope-functions 316
Weierstrass on 190,207-210,214,
232-236, 247
Zermelo on 226, 248 n (see also
Existence of fields; Mayer families)
Focal points
Bliss on 352
Kneser on 351-352
Fontaine 129
Fredholm 363, 365-366
Fundamental equations of John
Bernoulli 52, 59-60, 63, 93, 292
Fundamental lemma 287 - 293
Bertrand on 275 n, 289
Bolza on 288
du Bois-Raymond on 192 n,
288-293
Heine on 192 n, 289
Reiff on 291
Stegmann on 288
Weierstrass on 192 n
Fatio de Duillier 8, 51
Fermat 2-3 (see also Fermat's principle;
Least action principle)
Fermat's principle 1-6, 38-39
Analysis 2-4
Maupertuis on 109
Synthesis 4-7
Use by John Bernoulli 38-39
Weierstrass on 219 (see also
Descartes' or Snell's rule)
Fields of extremals
Beltrami on 317 n
Bliss on 248 n, 316 n, 319-322
Called strip by Weierstrass 214-217
Comparison of Hilbert- Bliss and
Schwarz definitions 322
Connection with Hamilton - Jacobi
theory 334-337,387
For isoperimetric problem 232-236
Hilbert on 248 n, 314-317, 322,
327-330
In three space 327 - 330
Kneser on 223 n, 226, 248 n, 321,
347-349
Mayer on 214, 248 n, 322, 334-337
Normal coordinates of 353
Index
Hadamard 381 n (see also Existence
theorems; Mayer problem)
Hahn (see Abnormality; Hahn theorem;
Hilbert differentiability condition;
Isoperimetric problem; Mayer
families; Mayer problem; Multiplier
rule; Transformations of problems)
Hahn theorem
Bliss on 360, 362
Hahn on 360-362
Hamilton 50 n, 107 (see also
Hamiltonian or Principal function;
Hamilton - Jacobi equation;
Hamilton's principle)
Hamilton - Jacobi equation
Beltrami 186
Caratheodory 386- 387
Clebsch on 257, 259 - 260
Hamilton on 176-182, 186
Hilbert on 327-329
Jacobi on 182-186,309-313
Mayer on 309-313,330-337
Relation to geodesic
equivalence 385 n
Hamiltonian or Principal function
Hamilton on 179, 181,270
Jacobi on 183
Hamilton's principle 107, 181 (see also
Canonical equations and variables;
Hamilton - Jacobi equation)
Heath 10 n
Heavy chain problem
Bernoulli brothers on 32, 50 n,
56-58, 63
Euler on 89-92
Galileo on 32
Huygens's solution of 32 n
Leibniz's solution of 32 n
Mayer on 298 - 299
Hedrick 246, 314 n, 320 n
Heine (see Fundamental lemma)
Helmholtz 107
Hesse 156, 257 (see also Conjugate
points; Jacobi condition; Jacobi's
theorem)
Hestenes (see Abnormality)
Hiebert 78 n
Hilbert 314, 373 (see also
Boundary-value problems; Existence
theorems; Fields of extremals;
Hamilton - Jacobi equation; Hilbert
differentiability condition; Hilbert
integral; Hilbert's methods; Mayer
problem; Multiple integral problems;
405
Implicit-function theorems
Bliss on 318
Bolza on 318-319
Dini on 248,318-319
Mason and Bliss on 319 n
Osgood on 318-319
Weierstrass on 204-207 (see also
Existence theorems)
Invariance of Euler equation 84 - 89, 179
Invariant integral (see Hilbert integral)
Isochrone
James Bernoulli on 46
Huygens on 46
Isoperimetric problem
Bolza on 94
Clebsch on 257
du Bois-Raymond on 291-293
Euler on 92-101
Euler's rule 291-293
First necessary condition 221-222
General case treated incorrectly by
Euler 100
Hahn on 323
Jacobi condition for 228, 235, 293 n
Jordan on 291 n
Lundstrom's observation 220 n, 235,
293 n
406
Index
Index
Euler on 101-107
Jacobi on 106
Lagrange on 106-107, 129-130
Maupertuis on 101, 108-109
Newton's laws of motion 104, 184
Relation to Fermat's principle 109
Relation to Snell's rule 109
Lebesgue, V.-A 156 n, 168-176 (see
also Jacobi's theorem)
Lebesgue, H. 372 (see also Existence
theorems)
Legendre 111, 193 (see also Legendre
condition; Legendre differential
equation;"Legendre function; Newton
problem; Second variation; Sufficient
conditions)
Legendre condition 193, 201, 245
Bolza on 146 n
Jacobi on 151,157-159
Lagrange on 140, 145-146
Lagrange's criticism 140, 145 -146,
157
Legendre on 139-143
Mason and Bliss on 360
Relation to Weierstrass condition 214
Weierstrass on 146-147, 193-196
Legendre differential equation
Jacobi on 157 - 158
Lagrange on 145-146, 158
Legendre on 140
Weierstrass on 146-147,195-196
Legendre function 139-142, 157 -158
Leibniz 8, 32,44, 64, 108 n (see also
Brachystochrone problem; Heavy
chain problem)
Le Seur 129
Lipschitz 237
Lundstrom (see Conjugate points;
Isoperimetric problem)
Mach 41 n
Maclaurin 8 n
MacMillan 106 n - 107 n
Mainardi (see Jacobi's theorem)
Mason (see Bliss; Boundary-value
problems; Inplicit-function theorems;
Jacobi condition; Legendre condition;
Parametric problem; Strong and weak
extrema; Transversality condition;
Variable end-point problems;
Weierstrass condition)
Maupertuis 68 n (see also Fermat's
principle; Least action principle)
407
408
Index
Necessary conditions
As formulated by Weierstrass 192, 201
Bliss on 352-353
For isoperimetric problem 221 (see
also Clebsch condition; Hilbert
differentiability condition; Jacobi
condition; Legendre condition;
Transversality condition;
Weierstrass- Erdmann condition)
Newton 51, 104, 108 (see also
Brachystochrone problem; Newton
problem; Transversality condition;
Variable end-point problems)
Newton problem 33, 39, 59, 145
Bolza on 15, 19, 29
Discontinuous solution of 16,
144-145
First necessary condition 15, 17,
19-21
Frustrum of cone 7, II - 15
Huygens on 8 n, 28-29
Legendre's solution 16, 143 - 145
Modem treatment 17 - 19
Newton on 7-29
Parametric representation 18, 26-28
Ship's bow 1I, 13
Spheres and cyclinders 7 - II
Variable end-points 26-28
Weierstrass on 236
Noble (see Existence theorems)
Normal coordinates of a field
Bolza on 355-357
Kneser on 353-355
Normality 251 (see also Abnormality)
Orthogonal trajectories
Discovered by John Bernoulli 42-44
Synchrones 42, 387 (see also
Transversality condition;
Transversals)
Oscillation theorems
Richardson on 370-371
Osgood 542 (see also Existence of fields;
Fields of extremals; Implicit-function
theorems; Osgood's summary; Strong
and weak extrema; Sufficient
conditions; Transversality condition;
Transversals; Weierstrass ~-function;
Weierstrass's methods)
Osgood's summary 246-249,314
Ostwald 31 n, 39 n
Parametric problem
Bolza on 355-357
Formulated by Weierstrass 191-194
Mason and Bliss on 357-360
Pascal 249 n
Peano 318
Picard (see Existence theorems for
differential equations)
Plateau problem
Lagrange on 123
Poisson 107
Problem of Bolza (see Bolza problem)
Problem of Lagrange (see Lagrange
problem)
Problem of Mayer (see Mayer problem)
Regularity 320
Reid 389 (see also Abnormality)
Reiff (see Fundamental leinma;
Isoperimetric problem)
Relation between Euler and
Lagrange 11O-lIl, 1I4-115
Richardson (see Boundary-value
problems; Oscillation theorems)
Richelot 280
Rothe 190
Sabra 1 n, 4 n-6 n
Scheeffer 191 n, 343 (see also Conjugate
points; Jacobi condition; Second
variation; Strong and weak extrema;
Sufficient conditions)
Schwarz 190, 366 (see also Fields of
extremals; Jacobi condition; Second
variation; Weierstrass fundamental
relation)
Index
Second variation
Bliss on 278 n, 280 n
Bolza on 279 n
Clebsch on 250-257,261-265,294,
304
Erdmann on 241
vs. Escherich's transformation of
251 n, 254
250,
302-305
278-279
409
Osgood on 248-249
Scheeffer on 201 n, 237-241
Weierstrass on 161, 190, 201-204,
214-219,223-226,234-237,249
Tautochrone
Huygens on 34, 41
Taylor 8 n, 71 n
Third variation
Erdmann on 200,343 n
Tonelli (see Existence theorems)
Total variation
Hilbert on 317, 337
Kneser on 341
Mayer on 337 - 338
Weierstrass on 202-204,210-217,
Sommerfeld on 245
Weierstrass on 193-201 ~ee also
Clebsch transformation)
Simpson 8 n
Sommerfeld (see Jacobi condition;
Multiple integral problems; Second
variation)
Spitzer (see Jacobi's theorem)
Stiickel
Translates texts 31 n, 39 n, 41 n, 43,
44 n, 161 n
Stegmann ~ee Fundamental lemma)
Strong and weak extrema
Bolza on 355-357
Kneser on 343
Mason and Bliss on 362
Mayer on 277-282,343
Osgood's discussion 246-249
Scheeffer on 237-245
Weierstrass on 161,201-204,
Struik
Translates texts 39 n, 43 n
Sturm 243
Sufficient conditions
John Bernoulli on 51,65-66
Bliss on 320, 353
Bolza on 355-357
Euler on 73
Hilbert on 246
Jacobi on 161-162
Kneser on 161, 354-355
Legendre on 140
222-226
Transformations of problems
Hahn on 388
Transversality condition
Kneser on 346-349
Lagrange on 113, 115-117, 119,
132-133, 136-138
410
Index
Variations
Adopted by Euler 68
Discovered by Lagrange 68,
110-112, 115-116
Restricted 222, 236
Strong and weak 246 (see also
Lagrange's method of variations)
Varignon 48, 50
Weil 84 n
Weierstrass 156 n, 159 n, 237-238,
250, 269 (see also Conjugate points;
Existence of fields; Existence
theorems; Fermat's principle; Fields
of extremals; Implicit-function
theorems; Isoperimetric problem;
Jacobi condition; Legendre condition;
Legendre differential equation;
Necessary conditions; Newton
problem; Parametric problems;
Second variation; Strong and weak
extrema; Sufficient conditions; Total
variation; Variable end-point
problems, Weierstrass condition;
Weierstrass S-function;
Weierstrass - Erdmann comer
condition; Weierstrass's methods)
Weierstrass condition
Hilbert on 316-317
Mason and Bliss on 360
Relation to Legendre condition 219
Weierstrass on 190, 210-214,
223-225 (see also Abnormality)