Sie sind auf Seite 1von 373

# Introduction to

Tensor Calculus
and
Continuum Mechanics

by J.H. Heinbockel
Department of Mathematics and Statistics
Old Dominion University
PREFACE

This is an introductory text which presents fundamental concepts from the subject
areas of tensor calculus, differential geometry and continuum mechanics. The material
presented is suitable for a two semester course in applied mathematics and is flexible
enough to be presented to either upper level undergraduate or beginning graduate students
majoring in applied mathematics, engineering or physics. The presentation assumes the
students have some knowledge from the areas of matrix theory, linear algebra and advanced
calculus. Each section includes many illustrative worked examples. At the end of each
section there is a large collection of exercises which range in difficulty. Many new ideas
are presented in the exercises and so the students should be encouraged to read all the
exercises.
The purpose of preparing these notes is to condense into an introductory text the basic
definitions and techniques arising in tensor calculus, differential geometry and continuum
mechanics. In particular, the material is presented to (i) develop a physical understanding
of the mathematical concepts associated with tensor calculus and (ii) develop the basic
equations of tensor calculus, differential geometry and continuum mechanics which arise
in engineering applications. From these basic equations one can go on to develop more
sophisticated models of applied mathematics. The material is presented in an informal
manner and uses mathematics which minimizes excessive formalism.
The material has been divided into two parts. The first part deals with an introduc-
tion to tensor calculus and differential geometry which covers such things as the indicial
notation, tensor algebra, covariant differentiation, dual tensors, bilinear and multilinear
forms, special tensors, the Riemann Christoffel tensor, space curves, surface curves, cur-
vature and fundamental quadratic forms. The second part emphasizes the application of
tensor algebra and calculus to a wide variety of applied areas from engineering and physics.
The selected applications are from the areas of dynamics, elasticity, fluids and electromag-
netic theory. The continuum mechanics portion focuses on an introduction of the basic
concepts from linear elasticity and fluids. The Appendix A contains units of measurements
from the SysteŐÄme International d‚ÄôUniteŐÄs along with some selected physical constants. The
Appendix B contains a listing of Christoffel symbols of the second kind associated with
various coordinate systems. The Appendix C is a summary of useful vector identities.

## J.H. Heinbockel, 1996

Reproduction and distribution of these notes is allowable provided it is for non-profit
purposes only.
INTRODUCTION TO
TENSOR CALCULUS
AND
CONTINUUM MECHANICS
PART 1: INTRODUCTION TO TENSOR CALCULUS

## ¬ß1.1 INDEX NOTATION . . . . . . . . . . . . . . . . . . 1

Exercise 1.1 . . . . . . . . . . . . . . . . . . . . . . . . . . 28
¬ß1.2 TENSOR CONCEPTS AND TRANSFORMATIONS . . . . 35
Exercise 1.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
¬ß1.3 SPECIAL TENSORS . . . . . . . . . . . . . . . . . . 65
Exercise 1.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
¬ß1.4 DERIVATIVE OF A TENSOR . . . . . . . . . . . . . . 108
Exercise 1.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
¬ß1.5 DIFFERENTIAL GEOMETRY AND RELATIVITY . . . . 129
Exercise 1.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . 162

## ¬ß2.1 TENSOR NOTATION FOR VECTOR QUANTITIES . . . . 171

Exercise 2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
¬ß2.2 DYNAMICS . . . . . . . . . . . . . . . . . . . . . . 187
Exercise 2.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
¬ß2.3 BASIC EQUATIONS OF CONTINUUM MECHANICS . . . 211
Exercise 2.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . 238
¬ß2.4 CONTINUUM MECHANICS (SOLIDS) . . . . . . . . . 243
Exercise 2.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . 272
¬ß2.5 CONTINUUM MECHANICS (FLUIDS) . . . . . . . . . 282
Exercise 2.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . 317
¬ß2.6 ELECTRIC AND MAGNETIC FIELDS . . . . . . . . . . 325
Exercise 2.6 . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . 352
APPENDIX A UNITS OF MEASUREMENT . . . . . . . 353
APPENDIX B CHRISTOFFEL SYMBOLS OF SECOND KIND 355
APPENDIX C VECTOR IDENTITIES . . . . . . . . . . 362
INDEX . . . . . . . . . . . . . . . . . . . . . . . . . . 363
1

## PART 1: INTRODUCTION TO TENSOR CALCULUS

A scalar field describes a one-to-one correspondence between a single scalar number and a point. An n-
dimensional vector field is described by a one-to-one correspondence between n-numbers and a point. Let us
generalize these concepts by assigning n-squared numbers to a single point or n-cubed numbers to a single
point. When these numbers obey certain transformation laws they become examples of tensor fields. In
general, scalar fields are referred to as tensor fields of rank or order zero whereas vector fields are called
tensor fields of rank or order one.
Closely associated with tensor calculus is the indicial or index notation. In section 1 the indicial
notation is defined and illustrated. We also define and investigate scalar, vector and tensor fields when they
are subjected to various coordinate transformations. It turns out that tensors have certain properties which
are independent of the coordinate system used to describe the tensor. Because of these useful properties,
we can use tensors to represent various fundamental laws occurring in physics, engineering, science and
mathematics. These representations are extremely useful as they are independent of the coordinate systems
considered.

## ¬ß1.1 INDEX NOTATION

~ and B
Two vectors A ~ can be expressed in the component form

~ = A1 b
A e1 + A2 b
e2 + A3 b
e3 and ~ = B1 b
B e1 + B2 b
e2 + B3 b
e3 ,

where be1 , b
e2 and b ~ and
e3 are orthogonal unit basis vectors. Often when no confusion arises, the vectors A
~ are expressed for brevity sake as number triples. For example, we can write
B

~ = (A1 , A2 , A3 )
A and ~ = (B1 , B2 , B3 )
B

~ and B
where it is understood that only the components of the vectors A ~ are given. The unit vectors would
be represented
b
e1 = (1, 0, 0), b
e2 = (0, 1, 0), b
e3 = (0, 0, 1).

~ and B
A still shorter notation, depicting the vectors A ~ is the index or indicial notation. In the index notation,
the quantities
Ai , i = 1, 2, 3 and Bp , p = 1, 2, 3

~ and B.
represent the components of the vectors A ~ This notation focuses attention only on the components of
the vectors and employs a dummy subscript whose range over the integers is specified. The symbol Ai refers
~ simultaneously. The dummy subscript i can have any of the integer
to all of the components of the vector A
~ Setting i = 2 focuses
values 1, 2 or 3. For i = 1 we focus attention on the A1 component of the vector A.
attention on the second component A2 of the vector A ~ and similarly when i = 3 we can focus attention on
the third component of A.~ The subscript i is a dummy subscript and may be replaced by another letter, say
p, so long as one specifies the integer values that this dummy subscript can have.
2

It is also convenient at this time to mention that higher dimensional vectors may be defined as ordered
n‚ąítuples. For example, the vector
~ = (X1 , X2 , . . . , XN )
X

## with components Xi , i = 1, 2, . . . , N is called a N ‚ąídimensional vector. Another notation used to represent

this vector is
~ = X1 b
X e1 + X2 b
e2 + ¬∑ ¬∑ ¬∑ + XN b
eN

where
b
e1 , b
e2 , . . . , b
eN

are linearly independent unit base vectors. Note that many of the operations that occur in the use of the
index notation apply not only for three dimensional vectors, but also for N ‚ąídimensional vectors.
In future sections it is necessary to define quantities which can be represented by a letter with subscripts
or superscripts attached. Such quantities are referred to as systems. When these quantities obey certain
transformation laws they are referred to as tensor systems. For example, quantities like

## Akij eijk őīij őīij Ai Bj aij .

The subscripts or superscripts are referred to as indices or suffixes. When such quantities arise, the indices
must conform to the following rules:
1. They are lower case Latin or Greek letters.
2. The letters at the end of the alphabet (u, v, w, x, y, z) are never employed as indices.

The number of subscripts and superscripts determines the order of the system. A system with one index
is a first order system. A system with two indices is called a second order system. In general, a system with
N indices is called a N th order system. A system with no indices is called a scalar or zeroth order system.
The type of system depends upon the number of subscripts or superscripts occurring in an expression.
For example, Aijk and Bst
m
, (all indices range 1 to N), are of the same type because they have the same
number of subscripts and superscripts. In contrast, the systems Aijk and Cpmn are not of the same type
because one system has two superscripts and the other system has only one superscript. For certain systems
the number of subscripts and superscripts is important. In other systems it is not of importance. The
meaning and importance attached to sub- and superscripts will be addressed later in this section.
In the use of superscripts one must not confuse ‚Äúpowers ‚ÄĚof a quantity with the superscripts. For
example, if we replace the independent variables (x, y, z) by the symbols (x1 , x2 , x3 ), then we are letting
y = x2 where x2 is a variable and not x raised to a power. Similarly, the substitution z = x3 is the
replacement of z by the variable x3 and this should not be confused with x raised to a power. In order to
write a superscript quantity to a power, use parentheses. For example, (x2 )3 is the variable x2 cubed. One
of the reasons for introducing the superscript variables is that many equations of mathematics and physics
can be made to take on a concise and compact form.
There is a range convention associated with the indices. This convention states that whenever there
is an expression where the indices occur unrepeated it is to be understood that each of the subscripts or
superscripts can take on any of the integer values 1, 2, . . . , N where N is a specified integer. For example,
3

the Kronecker delta symbol őīij , defined by őīij = 1 if i = j and őīij = 0 for i 6= j, with i, j ranging over the
values 1,2,3, represents the 9 quantities

## őī11 = 1 őī12 = 0 őī13 = 0

őī21 = 0 őī22 = 1 őī23 = 0
őī31 = 0 őī32 = 0 őī33 = 1.

The symbol őīij refers to all of the components of the system simultaneously. As another example, consider
the equation
b
em ¬∑ b
en = őīmn m, n = 1, 2, 3 (1.1.1)

the subscripts m, n occur unrepeated on the left side of the equation and hence must also occur on the right
hand side of the equation. These indices are called ‚Äúfree ‚ÄĚindices and can take on any of the values 1, 2 or 3
as specified by the range. Since there are three choices for the value for m and three choices for a value of
n we find that equation (1.1.1) represents nine equations simultaneously. These nine equations are

e1 ¬∑ b
b e1 = 1 b
e1 ¬∑ b
e2 = 0 b
e1 ¬∑ b
e3 = 0
e2 ¬∑ b
b e1 = 0 e2 ¬∑ b
b e2 = 1 e2 ¬∑ b
b e3 = 0
e3 ¬∑ b
b e1 = 0 e3 ¬∑ b
b e2 = 0 e3 ¬∑ b
b e3 = 1.

## Symmetric and Skew-Symmetric Systems

A system defined by subscripts and superscripts ranging over a set of values is said to be symmetric
in two of its indices if the components are unchanged when the indices are interchanged. For example, the
third order system Tijk is symmetric in the indices i and k if

## Tijk = Tkji for all values of i, j and k.

A system defined by subscripts and superscripts is said to be skew-symmetric in two of its indices if the
components change sign when the indices are interchanged. For example, the fourth order system Tijkl is
skew-symmetric in the indices i and l if

## Tijkl = ‚ąíTljki for all values of ijk and l.

As another example, consider the third order system aprs , p, r, s = 1, 2, 3 which is completely skew-
symmetric in all of its indices. We would then have

## aprs = ‚ąíapsr = aspr = ‚ąíasrp = arsp = ‚ąíarps .

It is left as an exercise to show this completely skew- symmetric systems has 27 elements, 21 of which are
zero. The 6 nonzero elements are all related to one another thru the above equations when (p, r, s) = (1, 2, 3).
This is expressed as saying that the above system has only one independent component.
4

Summation Convention

The summation convention states that whenever there arises an expression where there is an index which
occurs twice on the same side of any equation, or term within an equation, it is understood to represent a
summation on these repeated indices. The summation being over the integer values specified by the range. A
repeated index is called a summation index, while an unrepeated index is called a free index. The summation
convention requires that one must never allow a summation index to appear more than twice in any given
expression. Because of this rule it is sometimes necessary to replace one dummy summation symbol by
some other dummy symbol in order to avoid having three or more indices occurring on the same side of
the equation. The index notation is a very powerful notation and can be used to concisely represent many
complex equations. For the remainder of this section there is presented additional definitions and examples
to illustrated the power of the indicial notation. This notation is then employed to define tensor components
and associated operations with tensors.

## EXAMPLE 1.1-1 The two equations

y1 = a11 x1 + a12 x2
y2 = a21 x1 + a22 x2

can be represented as one equation by introducing a dummy index, say k, and expressing the above equations
as
yk = ak1 x1 + ak2 x2 , k = 1, 2.

The range convention states that k is free to have any one of the values 1 or 2, (k is a free index). This
equation can now be written in the form
2
X
yk = aki xi = ak1 x1 + ak2 x2
i=1

where i is the dummy summation index. When the summation sign is removed and the summation convention
yk = aki xi i, k = 1, 2.

Since the subscript i repeats itself, the summation convention requires that a summation be performed by
letting the summation subscript take on the values specified by the range and then summing the results.
The index k which appears only once on the left and only once on the right hand side of the equation is
called a free index. It should be noted that both k and i are dummy subscripts and can be replaced by other
letters. For example, we can write
yn = anm xm n, m = 1, 2

where m is the summation index and n is the free index. Summing on m produces

yn = an1 x1 + an2 x2

and letting the free index n take on the values of 1 and 2 we produce the original two equations.
5

EXAMPLE 1.1-2. For yi = aij xj , i, j = 1, 2, 3 and xi = bij zj , i, j = 1, 2, 3 solve for the y variables in
terms of the z variables.
Solution: In matrix form the given equations can be expressed:
Ô£ę Ô£∂ Ô£ę Ô£∂Ô£ę Ô£∂ Ô£ę Ô£∂ Ô£ę Ô£∂Ô£ę Ô£∂
y1 a11 a12 a13 x1 x1 b11 b12 b13 z1
Ô£≠ y2 Ô£ł = Ô£≠ a21 a22 a23 Ô£ł Ô£≠ x2 Ô£ł and Ô£≠ x2 Ô£ł = Ô£≠ b21 b22 b23 Ô£ł Ô£≠ z2 Ô£ł .
y3 a31 a32 a33 x3 x3 b31 b32 b33 z3

Now solve for the y variables in terms of the z variables and obtain
Ô£ę Ô£∂ Ô£ę Ô£∂Ô£ę Ô£∂Ô£ę Ô£∂
y1 a11 a12 a13 b11 b12 b13 z1
Ô£≠ y2 Ô£ł = Ô£≠ a21 a22 a23 Ô£ł Ô£≠ b21 b22 b23 Ô£ł Ô£≠ z2 Ô£ł .
y3 a31 a32 a33 b31 b32 b33 z3

The index notation employs indices that are dummy indices and so we can write

## yn = anm xm , n, m = 1, 2, 3 and xm = bmj zj , m, j = 1, 2, 3.

Here we have purposely changed the indices so that when we substitute for xm , from one equation into the
other, a summation index does not repeat itself more than twice. Substituting we find the indicial form of
the above matrix equation as
yn = anm bmj zj , m, n, j = 1, 2, 3

where n is the free index and m, j are the dummy summation indices. It is left as an exercise to expand
both the matrix equation and the indicial equation and verify that they are different ways of representing
the same thing.

EXAMPLE 1.1-3. The dot product of two vectors Aq , q = 1, 2, 3 and Bj , j = 1, 2, 3 can be represented
with the index notation by the product Ai Bi = AB cos őł i = 1, 2, 3, A = |A|, ~ ~ Since the
B = |B|.
subscript i is repeated it is understood to represent a summation index. Summing on i over the range
specified, there results
A1 B1 + A2 B2 + A3 B3 = AB cos őł.

Observe that the index notation employs dummy indices. At times these indices are altered in order to
conform to the above summation rules, without attention being brought to the change. As in this example,
the indices q and j are dummy indices and can be changed to other letters if one desires. Also, in the future,
if the range of the indices is not stated it is assumed that the range is over the integer values 1, 2 and 3.

To systems containing subscripts and superscripts one can apply certain algebraic operations. We
present in an informal way the operations of addition, multiplication and contraction.
6

The algebraic operation of addition or subtraction applies to systems of the same type and order. That
is we can add or subtract like components in systems. For example, the sum of Aijk and Bjk
i
is again a
i
system of the same type and is denoted by Cjk = Aijk + Bjk
i
, where like components are added.
The product of two systems is obtained by multiplying each component of the first system with each
component of the second system. Such a product is called an outer product. The order of the resulting
product system is the sum of the orders of the two systems involved in forming the product. For example,
if Aij is a second order system and B mnl is a third order system, with all indices having the range 1 to N,
then the product system is fifth order and is denoted Cjimnl = Aij B mnl . The product system represents N 5
terms constructed from all possible products of the components from Aij with the components from B mnl .
The operation of contraction occurs when a lower index is set equal to an upper index and the summation
convention is invoked. For example, if we have a fifth order system Cjimnl and we set i = j and sum, then
we form the system
C mnl = Cjjmnl = C11mnl + C22mnl + ¬∑ ¬∑ ¬∑ + CN
N mnl
.

Here the symbol C mnl is used to represent the third order system that results when the contraction is
performed. Whenever a contraction is performed, the resulting system is always of order 2 less than the
original system. Under certain special conditions it is permissible to perform a contraction on two lower case
indices. These special conditions will be considered later in the section.
The above operations will be more formally defined after we have explained what tensors are.

## The e-permutation symbol and Kronecker delta

Two symbols that are used quite frequently with the indicial notation are the e-permutation symbol
and the Kronecker delta. The e-permutation symbol is sometimes referred to as the alternating tensor. The
e-permutation symbol, as the name suggests, deals with permutations. A permutation is an arrangement of
things. When the order of the arrangement is changed, a new permutation results. A transposition is an
interchange of two consecutive terms in an arrangement. As an example, let us change the digits 1 2 3 to
3 2 1 by making a sequence of transpositions. Starting with the digits in the order 1 2 3 we interchange 2 and
3 (first transposition) to obtain 1 3 2. Next, interchange the digits 1 and 3 ( second transposition) to obtain
3 1 2. Finally, interchange the digits 1 and 2 (third transposition) to achieve 3 2 1. Here the total number
of transpositions of 1 2 3 to 3 2 1 is three, an odd number. Other transpositions of 1 2 3 to 3 2 1 can also be
written. However, these are also an odd number of transpositions.
7

EXAMPLE 1.1-4. The total number of possible ways of arranging the digits 1 2 3 is six. We have
three choices for the first digit. Having chosen the first digit, there are only two choices left for the second
digit. Hence the remaining number is for the last digit. The product (3)(2)(1) = 3! = 6 is the number of
permutations of the digits 1, 2 and 3. These six permutations are

1 2 3 even permutation
1 3 2 odd permutation
3 1 2 even permutation
3 2 1 odd permutation
2 3 1 even permutation
2 1 3 odd permutation.

Here a permutation of 1 2 3 is called even or odd depending upon whether there is an even or odd number
of transpositions of the digits. A mnemonic device to remember the even and odd permutations of 123
is illustrated in the figure 1.1-1. Note that even permutations of 123 are obtained by selecting any three
consecutive numbers from the sequence 123123 and the odd permutations result by selecting any three
consecutive numbers from the sequence 321321.

## Figure 1.1-1. Permutations of 123.

In general, the number of permutations of n things taken m at a time is given by the relation

## P (n, m) = n(n ‚ąí 1)(n ‚ąí 2) ¬∑ ¬∑ ¬∑ (n ‚ąí m + 1).

By selecting a subset of m objects from a collection of n objects, m ‚Č§ n, without regard to the ordering is
called a combination of n objects taken m at a time. For example, combinations of 3 numbers taken from
the set {1, 2, 3, 4} are (123), (124), (134), (234). Note that ordering of a combination is not considered. That
is, the permutations (123), (132), (231), (213), (312), (321) are considered equal. In general, the number of
n n! 
n
combinations of n objects taken m at a time is given by C(n, m) = = where m are the
m m!(n ‚ąí m)!
binomial coefficients which occur in the expansion
n 
X n  n‚ąím m
(a + b)n = a b .
m=0
m
8

## Definition: (e-Permutation symbol or alternating tensor)

The e-permutation symbol is defined
Ô£Ī
Ô£ī
Ô£≤1 if ijk . . . l is an even permutation of the integers 123 . . . n
e ijk...l
= eijk...l = ‚ąí1 if ijk . . . l is an odd permutation of the integers 123 . . . n
Ô£ī
Ô£≥
0 in all other cases

## EXAMPLE 1.1-5. Find e612453 .

Solution: To determine whether 612453 is an even or odd permutation of 123456 we write down the given
numbers and below them we write the integers 1 through 6. Like numbers are then connected by a line and
we obtain figure 1.1-2.

## Figure 1.1-2. Permutations of 123456.

In figure 1.1-2, there are seven intersections of the lines connecting like numbers. The number of
intersections is an odd number and shows that an odd number of transpositions must be performed. These
results imply e612453 = ‚ąí1.

Another definition used quite frequently in the representation of mathematical and engineering quantities
is the Kronecker delta which we now define in terms of both subscripts and superscripts.

## Definition: (Kronecker delta) The Kronecker delta is defined:


1 if i equals j
őīij = őīij =
0 if i is different from j
9

EXAMPLE 1.1-6. Some examples of the e‚ąípermutation symbol and Kronecker delta are:

## e123 = e123 = +1 őī11 = 1 őī12 = 0

e213 = e213 = ‚ąí1 őī21 = 0 őī22 = 1
112
e112 = e =0 őī31 =0 őī32 = 0.

EXAMPLE 1.1-7. When an index of the Kronecker delta őīij is involved in the summation convention,
the effect is that of replacing one index with a different index. For example, let aij denote the elements of an
N √ó N matrix. Here i and j are allowed to range over the integer values 1, 2, . . . , N. Consider the product

aij őīik

where the range of i, j, k is 1, 2, . . . , N. The index i is repeated and therefore it is understood to represent
a summation over the range. The index i is called a summation index. The other indices j and k are free
indices. They are free to be assigned any values from the range of the indices. They are not involved in any
summations and their values, whatever you choose to assign them, are fixed. Let us assign a value of j and
k to the values of j and k. The underscore is to remind you that these values for j and k are fixed and not
to be summed. When we perform the summation over the summation index i we assign values to i from the
range and then sum over these values. Performing the indicated summation we obtain

## aij őīik = a1j őī1k + a2j őī2k + ¬∑ ¬∑ ¬∑ + akj őīkk + ¬∑ ¬∑ ¬∑ + aN j őīN k .

In this summation the Kronecker delta is zero everywhere the subscripts are different and equals one where
the subscripts are the same. There is only one term in this summation which is nonzero. It is that term
where the summation index i was equal to the fixed value k This gives the result

## akj őīkk = akj

where the underscore is to remind you that the quantities have fixed values and are not to be summed.
Dropping the underscores we write
aij őīik = akj

Here we have substituted the index i by k and so when the Kronecker delta is used in a summation process
it is known as a substitution operator. This substitution property of the Kronecker delta can be used to
simplify a variety of expressions involving the index notation. Some examples are:

## Bij őījs = Bis

őījk őīkm = őījm
eijk őīim őījn őīkp = emnp .

Some texts adopt the notation that if indices are capital letters, then no summation is to be performed.
For example,
aKJ őīKK = aKJ
10

as őīKK represents a single term because of the capital letters. Another notation which is used to denote no
summation of the indices is to put parenthesis about the indices which are not to be summed. For example,

## a(k)j őī(k)(k) = akj ,

since őī(k)(k) represents a single term and the parentheses indicate that no summation is to be performed.
At any time we may employ either the underscore notation, the capital letter notation or the parenthesis
notation to denote that no summation of the indices is to be performed. To avoid confusion altogether, one
can write out parenthetical expressions such as ‚Äú(no summation on k)‚ÄĚ.

EXAMPLE 1.1-8. In the Kronecker delta symbol őīji we set j equal to i and perform a summation. This
operation is called a contraction. There results őīii , which is to be summed over the range of the index i.
Utilizing the range 1, 2, . . . , N we have

## őīii = őī11 + őī22 + ¬∑ ¬∑ ¬∑ + őīN

N

őīii = 1 + 1 + ¬∑ ¬∑ ¬∑ + 1
őīii = N.

## őīkk = őī11 + őī22 + őī33 = 3.

In certain circumstances the Kronecker delta can be written with only subscripts. For example,
őīij , i, j = 1, 2, 3. We shall find that these circumstances allow us to perform a contraction on the lower
indices so that őīii = 3.

EXAMPLE 1.1-9. The determinant of a matrix A = (aij ) can be represented in the indicial notation.
Employing the e-permutation symbol the determinant of an N √ó N matrix is expressed

## |A| = eij a1i a2j

where the summation is over the range 1,2 and the e-permutation symbol is of order 2. In the special case
of a 3 √ó 3 matrix we have

a11 a12 a13

|A| = a21 a22 a23 = eijk ai1 aj2 ak3 = eijk a1i a2j a3k
a31 a32 a33

where i, j, k are the summation indices and the summation is over the range 1,2,3. Here eijk denotes the
e-permutation symbol of order 3. Note that by interchanging the rows of the 3 √ó 3 matrix we can obtain
11

more general results. Consider (p, q, r) as some permutation of the integers (1, 2, 3), and observe that the
determinant can be expressed
ap1 ap2 ap3

‚ąÜ = aq1 aq2 aq3 = eijk api aqj ark .
ar1 ar2 ar3
If (p, q, r) is an even permutation of (1, 2, 3) then ‚ąÜ = |A|
If (p, q, r) is an odd permutation of (1, 2, 3) then ‚ąÜ = ‚ąí|A|
If (p, q, r) is not a permutation of (1, 2, 3) then ‚ąÜ = 0.
We can then write
eijk api aqj ark = epqr |A|.

Each of the above results can be verified by performing the indicated summations. A more formal proof of
the above result is given in EXAMPLE 1.1-25, later in this section.

EXAMPLE 1.1-10. The expression eijk Bij Ci is meaningless since the index i repeats itself more than
twice and the summation convention does not allow this. If you really did want to sum over an index which
occurs more than twice, then one must use a summation sign. For example the above expression would be
X
n
written eijk Bij Ci .
i=1

EXAMPLE 1.1-11.
The cross product of the unit vectors e1 , b
b e2 , b
e3 can be represented in the index notation by
Ô£Ī
Ô£≤b
Ô£ī ek if (i, j, k) is an even permutation of (1, 2, 3)
ei √ó b
b ej = ‚ąí b ek if (i, j, k) is an odd permutation of (1, 2, 3)
Ô£ī
Ô£≥
0 in all other cases

ej = ekij b
ei √ó b
This result can be written in the form b ek . This later result can be verified by summing on the
index k and writing out all 9 possible combinations for i and j.

EXAMPLE 1.1-12. Given the vectors Ap , p = 1, 2, 3 and Bp , p = 1, 2, 3 the cross product of these two
vectors is a vector Cp , p = 1, 2, 3 with components

Ci = eijk Aj Bk , i, j, k = 1, 2, 3. (1.1.2)

~ =A
C ~ = C1 b
~√óB e1 + C2 b
e2 + C3 b
e3 .

## ~ is to be summed over each of the indices which

The equation (1.1.2), which defines the components of C,
repeats itself. We have summing on the index k

## Ci = eij1 Aj B1 + eij2 Aj B2 + eij3 Aj B3 . (1.1.3)

12

We next sum on the index j which repeats itself in each term of equation (1.1.3). This gives

## Ci = ei11 A1 B1 + ei21 A2 B1 + ei31 A3 B1

+ ei12 A1 B2 + ei22 A2 B2 + ei32 A3 B2 (1.1.4)
+ ei13 A1 B3 + ei23 A2 B3 + ei33 A3 B3 .

Now we are left with i being a free index which can have any of the values of 1, 2 or 3. Letting i = 1, then
letting i = 2, and finally letting i = 3 produces the cross product components

C1 = A2 B3 ‚ąí A3 B2
C2 = A3 B1 ‚ąí A1 B3
C3 = A1 B2 ‚ąí A2 B1 .

## The cross product can also be expressed in the form A ~ = eijk Aj Bk b

~√óB ei . This result can be verified by
summing over the indices i,j and k.

## eijk = ‚ąíeikj = ejki for i, j, k = 1, 2, 3

Solution: The array i k j represents an odd number of transpositions of the indices i j k and to each
transposition there is a sign change of the e-permutation symbol. Similarly, j k i is an even transposition
of i j k and so there is no sign change of the e-permutation symbol. The above holds regardless of the
numerical values assigned to the indices i, j, k.

## The e-őī Identity

An identity relating the e-permutation symbol and the Kronecker delta, which is useful in the simpli-
fication of tensor expressions, is the e-őī identity. This identity can be expressed in different forms. The
subscript form for this identity is

## eijk eimn = őījm őīkn ‚ąí őījn őīkm , i, j, k, m, n = 1, 2, 3

where i is the summation index and j, k, m, n are free indices. A device used to remember the positions of
the subscripts is given in the figure 1.1-3.
The subscripts on the four Kronecker delta‚Äôs on the right-hand side of the e-őī identity then are read

(first)(second)-(outer)(inner).

This refers to the positions following the summation index. Thus, j, m are the first indices after the sum-
mation index and k, n are the second indices after the summation index. The indices j, n are outer indices
when compared to the inner indices k, m as the indices are viewed as written on the left-hand side of the
identity.
13

## Figure 1.1-3. Mnemonic device for position of subscripts.

Another form of this identity employs both subscripts and superscripts and has the form

## eijk eimn = őīm őīn ‚ąí őīnj őīm

j k k
. (1.1.5)

One way of proving this identity is to observe the equation (1.1.5) has the free indices j, k, m, n. Each
of these indices can have any of the values of 1, 2 or 3. There are 3 choices we can assign to each of j, k, m
or n and this gives a total of 34 = 81 possible equations represented by the identity from equation (1.1.5).
By writing out all 81 of these equations we can verify that the identity is true for all possible combinations
that can be assigned to the free indices.
An alternate proof of the e ‚ąí őī identity is to consider the determinant
1
őī1 őī21 őī31 1 0 0
2
őī őī2 őī32 = 0 1 0 = 1.
13 2
őī1 őī23 őī33 0 0 1

By performing a permutation of the rows of this matrix we can use the permutation symbol and write
i
őī1 őī2i őī3i
j
őī j j ijk
k1 őīk2 őīk3 = e .
őī1 őī2 őī3

## By performing a permutation of the columns, we can write

i
őīr őīsi őīti
j
őī őīsj őītj = eijk erst .
kr
őīr őīsk őītk

## Now perform a contraction on the indices i and r to obtain

i
őīi őīsi őīti
j
őī j j ijk
i őīs őīt = e eist .
őīk őīk őīk
i s t

Summing on i we have őīii = őī11 + őī22 + őī33 = 3 and expand the determinant to obtain the desired result

14

## Generalized Kronecker delta

The generalized Kronecker delta is defined by the (n √ó n) determinant
i
őīm őīni ¬∑ ¬∑ ¬∑ őīpi
j
őīm őīnj ¬∑ ¬∑ ¬∑ őīpj
ij...k
őīmn...p = . .. .. . .
.. . ..
.
őīk őīnk ¬∑ ¬∑ ¬∑ őīpk
m

## For example, in three dimensions we can write

i
őīm őīni őīpi
j
ijk
őīmnp = őīm őīnj őīpj = eijk emnp .
őīk őīnk őīpk
m

Performing a contraction on the indices k and p we obtain the fourth order system

rs
őīmn rsp
= őīmnp = ersp emnp = eprs epmn = őīm őīn ‚ąí őīnr őīm
r s s
.

As an exercise one can verify that the definition of the e-permutation symbol can also be defined in terms
of the generalized Kronecker delta as
ej1 j2 j3 ¬∑¬∑¬∑jN = őīj11 j22 j33 ¬∑¬∑¬∑j
¬∑¬∑¬∑ N
N
.

Additional definitions and results employing the generalized Kronecker delta are found in the exercises.
In section 1.3 we shall show that the Kronecker delta and epsilon permutation symbol are numerical tensors
which have fixed components in every coordinate system.
Additional Applications of the Indicial Notation
The indicial notation, together with the e ‚ąí őī identity, can be used to prove various vector identities.

## EXAMPLE 1.1-14. ~√óB

Show, using the index notation, that A ~ = ‚ąíB
~ √óA
~
Solution: Let
~ =A
C ~ = C1 b
~√óB e1 + C2 b
e2 + C3 b
e3 = Ci b
ei and let
~ =B
D ~ = D1 b
~ √óA e1 + D2 b
e2 + D3 b
e3 = Di b
ei .
We have shown that the components of the cross products can be represented in the index notation by

## Ci = eijk Aj Bk and Di = eijk Bj Ak .

We desire to show that Di = ‚ąíCi for all values of i. Consider the following manipulations: Let Bj = Bs őīsj
and Ak = Am őīmk and write
Di = eijk Bj Ak = eijk Bs őīsj Am őīmk (1.1.6)

where all indices have the range 1, 2, 3. In the expression (1.1.6) note that no summation index appears
more than twice because if an index appeared more than twice the summation convention would become
meaningless. By rearranging terms in equation (1.1.6) we have

## Di = eijk őīsj őīmk Bs Am = eism Bs Am .

15

In this expression the indices s and m are dummy summation indices and can be replaced by any other
letters. We replace s by k and m by j to obtain

## Di = eikj Aj Bk = ‚ąíeijk Aj Bk = ‚ąíCi .

~ = ‚ąíC
Consequently, we find that D ~ or B
~ √óA
~ = ‚ąíA
~ √ó B. ~ = Di b
~ That is, D ei = ‚ąíCi b ~
ei = ‚ąíC.
Note 1. The expressions
Ci = eijk Aj Bk and Cm = emnp An Bp

with all indices having the range 1, 2, 3, appear to be different because different letters are used as sub-
scripts. It must be remembered that certain indices are summed according to the summation convention
and the other indices are free indices and can take on any values from the assigned range. Thus, after
summation, when numerical values are substituted for the indices involved, none of the dummy letters
used to represent the components appear in the answer.
Note 2. A second important point is that when one is working with expressions involving the index notation,
the indices can be changed directly. For example, in the above expression for Di we could have replaced
j by k and k by j simultaneously (so that no index repeats itself more than twice) to obtain

## Di = eijk Bj Ak = eikj Bk Aj = ‚ąíeijk Aj Bk = ‚ąíCi .

Note 3. Be careful in switching back and forth between the vector notation and index notation. Observe that a
~ can be represented
vector A
~ = Ai b
A ei

## or its components can be represented

~¬∑ b
A ei = Ai , i = 1, 2, 3.

~ = Ai as this is a
Do not set a vector equal to a scalar. That is, do not make the mistake of writing A
misuse of the equal sign. It is not possible for a vector to equal a scalar because they are two entirely
different quantities. A vector has both magnitude and direction while a scalar has only magnitude.

## EXAMPLE 1.1-15. Verify the vector identity

~ ¬∑ (B
A ~ √ó C)
~ =B
~ ¬∑ (C
~ √ó A)
~

Solution: Let
~ √óC
B ~ = Di b
~ =D ei where Di = eijk Bj Ck and let
~ = F~ = Fi b
~ √óA
C ei where Fi = eijk Cj Ak
where all indices have the range 1, 2, 3. To prove the above identity, we have

~ ¬∑ (B
A ~ √ó C)
~ =A
~ ¬∑D
~ = Ai Di = Ai eijk Bj Ck

= Bj (eijk Ai Ck )
= Bj (ejki Ck Ai )
16

Fi = eijk Cj Ak

## that we may obtain, by permuting the symbols, the equivalent expression

Fj = ejki Ck Ai .

## This allows us to write

~ ¬∑ (B
A ~ √ó C) ~ ¬∑ F~ = B
~ = Bj Fj = B ~ ¬∑ (C
~ √ó A)
~

## which was to be shown.

The quantity A~ ¬∑ (B
~ √ó C)
~ is called a triple scalar product. The above index representation of the triple
scalar product implies that it can be represented as a determinant (See example 1.1-9). We can write

A1 A2 A3

A ¬∑ (B √ó C) = B1
~ ~ ~ B2 B3 = eijk Ai Bj Ck
C1 C2 C3

A physical interpretation that can be assigned to this triple scalar product is that its absolute value represents
the volume of the parallelepiped formed by the three noncoplaner vectors A, ~ B,
~ C.~ The absolute value is
needed because sometimes the triple scalar product is negative. This physical interpretation can be obtained
from an analysis of the figure 1.1-4.

## Figure 1.1-4. Triple scalar product and volume

17

~ √ó C|
In figure 1.1-4 observe that: (i) |B ~ is the area of the parallelogram P QRS. (ii) the unit vector

~ √óC
B ~
b
en =
~ ~
|B √ó C|

~ and C.
is normal to the plane containing the vectors B ~ (iii) The dot product

~
~¬∑ B√óC =h
~
A en = A
~¬∑ b
~
~ √ó C|
|B

~ on b
equals the projection of A en which represents the height of the parallelepiped. These results demonstrate
that
~ ~ ~ = |B
A ¬∑ (B √ó C) ~ √ó C|
~ h = (area of base)(height) = volume.

## EXAMPLE 1.1-16. Verify the vector identity

~ √ó B)
(A ~ √ó (C
~ √ó D)
~ = C(
~ D~ ¬∑A
~ √ó B)
~ ‚ąí D(
~ C~ ¬∑A
~ √ó B)
~

Solution: Let F~ = A ~ = Fi b
~√óB ~ =C
ei and E ~ = Ei b
~ √óD ei . These vectors have the components

## where all indices have the range 1, 2, 3. The vector G ~ = Gi b

~ = F~ √ó E ei has the components

## Gq = (emqi emnp )eijk Aj Bk Cn Dp

which is now in a form where we can use the e ‚ąí őī identity applied to the term in parentheses to produce

## Simplifying this expression we have:

Gq = eijk [(Dp őīip )(Cn őīqn )Aj Bk ‚ąí (Dp őīqp )(Cn őīin )Aj Bk ]
= eijk [Di Cq Aj Bk ‚ąí Dq Ci Aj Bk ]
= Cq [Di eijk Aj Bk ] ‚ąí Dq [Ci eijk Aj Bk ]

## which are the vector components of the vector

~ D
C( ~ ¬∑A
~ √ó B)
~ ‚ąí D(
~ C~ ¬∑A
~ √ó B).
~
18

Transformation Equations

Consider two sets of N independent variables which are denoted by the barred and unbarred symbols
xi and xi with i = 1, . . . , N. The independent variables xi , i = 1, . . . , N can be thought of as defining
the coordinates of a point in a N ‚ąídimensional space. Similarly, the independent barred variables define a
point in some other N ‚ąídimensional space. These coordinates are assumed to be real quantities and are not
complex quantities. Further, we assume that these variables are related by a set of transformation equations.

xi = xi (x1 , x2 , . . . , xN ) i = 1, . . . , N. (1.1.7)

It is assumed that these transformation equations are independent. A necessary and sufficient condition that
these transformation equations be independent is that the Jacobian determinant be different from zero, that
is 1
‚ąāx1 ‚ąāx1
¬∑¬∑¬∑ ‚ąāx1
‚ąāx2 ‚ąāxN
i ‚ąāx 2
‚ąāx2 ‚ąāx2
x ‚ąāx ‚ąāx ¬∑¬∑¬∑
‚ąāx1 ‚ąāx2 ‚ąāxN
J( ) = j = .
x ‚ąā xŐĄ .. .. .. 6= 0.
.. . . .
N
‚ąāx 1 ‚ąāxN
¬∑¬∑¬∑ ‚ąāxN
‚ąāx ‚ąāx2 ‚ąāx N

## This assumption allows us to obtain a set of inverse relations

xi = xi (x1 , x2 , . . . , xN ) i = 1, . . . , N, (1.1.8)

where the x0 s are determined in terms of the x0 s. Throughout our discussions it is to be understood that the
given transformation equations are real and continuous. Further all derivatives that appear in our discussions
are assumed to exist and be continuous in the domain of the variables considered.

EXAMPLE 1.1-17. The following is an example of a set of transformation equations of the form
defined by equations (1.1.7) and (1.1.8) in the case N = 3. Consider the transformation from cylindrical
coordinates (r, őĪ, z) to spherical coordinates (ŌĀ, ő≤, őĪ). From the geometry of the figure 1.1-5 we can find the
transformation equations
r = ŌĀ sin ő≤
őĪ=őĪ 0 < őĪ < 2ŌÄ
z = ŌĀ cos ő≤ 0<ő≤<ŌÄ
with inverse transformation p
ŌĀ= r2 + z 2
őĪ=őĪ
r
ő≤ = arctan( )
z
Now make the substitutions

19

## Figure 1.1-5. Cylindrical and Spherical Coordinates

The resulting transformations then have the forms of the equations (1.1.7) and (1.1.8).

Calculation of Derivatives

We now consider the chain rule applied to the differentiation of a function of the bar variables. We
represent this differentiation in the indicial notation. Let ő¶ = ő¶(x1 , x2 , . . . , xn ) be a scalar function of the
variables xi , i = 1, . . . , N and let these variables be related to the set of variables xi , with i = 1, . . . , N by
the transformation equations (1.1.7) and (1.1.8). The partial derivatives of ő¶ with respect to the variables
xi can be expressed in the indicial notation as
‚ąāő¶ ‚ąāő¶ ‚ąāxj ‚ąāő¶ ‚ąāx1 ‚ąāő¶ ‚ąāx2 ‚ąāő¶ ‚ąāxN
= = + + ¬∑ ¬∑ ¬∑ + (1.1.9)
‚ąāxi ‚ąāxj ‚ąāxi ‚ąāx1 ‚ąāxi ‚ąāx2 ‚ąāxi ‚ąāxN ‚ąāxi
for any fixed value of i satisfying 1 ‚Č§ i ‚Č§ N.
The second partial derivatives of ő¶ can also be expressed in the index notation. Differentiation of
equation (1.1.9) partially with respect to xm produces
 
‚ąā2ő¶ ‚ąāő¶ ‚ąā 2 xj ‚ąā ‚ąāő¶ ‚ąāxj
= + . (1.1.10)
‚ąāxi ‚ąāxm ‚ąāxj ‚ąāxi ‚ąāxm ‚ąāxm ‚ąāxj ‚ąāxi
This result is nothing more than an application of the general rule for differentiating a product of two
quantities. To evaluate the derivative of the bracketed term in equation (1.1.10) it must be remembered that
the quantity inside the brackets is a function of the bar variables. Let
‚ąāő¶
G= = G(x1 , x2 , . . . , xN )
‚ąāxj
to emphasize this dependence upon the bar variables, then the derivative of G is
‚ąāG ‚ąāG ‚ąāxk ‚ąā 2 ő¶ ‚ąāxk
= = . (1.1.11)
‚ąāxm ‚ąāxk ‚ąāxm ‚ąāxj ‚ąāxk ‚ąāxm
This is just an application of the basic rule from equation (1.1.9) with ő¶ replaced by G. Hence the derivative
from equation (1.1.10) can be expressed
‚ąā2ő¶ ‚ąāő¶ ‚ąā 2 xj ‚ąā 2 ő¶ ‚ąāxj ‚ąāxk
= j + (1.1.12)
i
‚ąāx ‚ąāx m i
‚ąāx ‚ąāx ‚ąāx m
‚ąāxj ‚ąāxk ‚ąāxi ‚ąāxm
where i, m are free indices and j, k are dummy summation indices.
20

EXAMPLE 1.1-18. Let ő¶ = ő¶(r, őł) where r, őł are polar coordinates related to the Cartesian coordinates
‚ąāő¶ ‚ąā2ő¶
(x, y) by the transformation equations x = r cos őł y = r sin őł. Find the partial derivatives and
‚ąāx ‚ąāx2
Solution: The partial derivative of ő¶ with respect to x is found from the relation (1.1.9) and can be written

‚ąāő¶ ‚ąāő¶ ‚ąār ‚ąāő¶ ‚ąāőł
= + . (1.1.13)
‚ąāx ‚ąār ‚ąāx ‚ąāőł ‚ąāx

The second partial derivative is obtained by differentiating the first partial derivative. From the product
rule for differentiation we can write
   
‚ąā2ő¶ ‚ąāő¶ ‚ąā 2 r ‚ąār ‚ąā ‚ąāő¶ ‚ąāő¶ ‚ąā 2 őł ‚ąāőł ‚ąā ‚ąāő¶
= + + + . (1.1.14)
‚ąāx2 ‚ąār ‚ąāx2 ‚ąāx ‚ąāx ‚ąār ‚ąāőł ‚ąāx2 ‚ąāx ‚ąāx ‚ąāőł

To further simplify (1.1.14) it must be remembered that the terms inside the brackets are to be treated as
functions of the variables r and őł and that the derivative of these terms can be evaluated by reapplying the
‚ąāő¶ ‚ąāő¶
basic rule from equation (1.1.13) with ő¶ replaced by ‚ąār and then ő¶ replaced by ‚ąāőł . This gives
 
‚ąā 2ő¶ ‚ąāő¶ ‚ąā 2 r ‚ąār ‚ąā 2 ő¶ ‚ąār ‚ąā 2 ő¶ ‚ąāőł
= + +
‚ąāx2 ‚ąār ‚ąāx2 ‚ąāx ‚ąār2 ‚ąāx ‚ąār‚ąāőł ‚ąāx
  (1.1.15)
‚ąāő¶ ‚ąā 2 őł ‚ąāőł ‚ąā 2 ő¶ ‚ąār ‚ąā 2 ő¶ ‚ąāőł
+ + + .
‚ąāőł ‚ąāx2 ‚ąāx ‚ąāőł‚ąār ‚ąāx ‚ąāőł2 ‚ąāx
y
From the transformation equations we obtain the relations r2 = x2 + y 2 and and from
tan őł =
x
these relations we can calculate all the necessary derivatives needed for the simplification of the equations
(1.1.13) and (1.1.15). These derivatives are:

‚ąār ‚ąār x
2r = 2x or = = cos őł
‚ąāx ‚ąāx r
‚ąāőł y ‚ąāőł y sin őł
sec2 őł = ‚ąí 2 or =‚ąí 2 =‚ąí
‚ąāx x ‚ąāx r r
‚ąā2r ‚ąāőł sin2 őł ‚ąā2őł ‚ąír cos őł ‚ąāx
‚ąāőł ‚ąār
+ sin őł ‚ąāx 2 sin őł cos őł
= ‚ąí sin őł = = = .
‚ąāx2 ‚ąāx r ‚ąāx2 r2 r2

Therefore, the derivatives from equations (1.1.13) and (1.1.15) can be expressed in the form

‚ąāő¶ ‚ąāő¶ ‚ąāő¶ sin őł
= cos őł ‚ąí
‚ąāx ‚ąār ‚ąāőł r
2
2
‚ąā ő¶ ‚ąāő¶ sin őł ‚ąāő¶ sin őł cos őł ‚ąā 2 ő¶ 2 ‚ąā 2 ő¶ cos őł sin őł ‚ąā 2 ő¶ sin2 őł
= + 2 + cos őł ‚ąí 2 + .
‚ąāx2 ‚ąār r ‚ąāőł r2 ‚ąār2 ‚ąār‚ąāőł r ‚ąāőł2 r2

## By letting x1 = r, x2 = őł, x1 = x, x2 = y and performing the indicated summations in the equations (1.1.9)

and (1.1.12) there is produced the same results as above.

## Employing the substitutions x1 = x, x2 = y, x3 = z, where superscript variables are employed and

e1 , b
denoting the unit vectors in Cartesian coordinates by b e2 , b
e3 , we illustrated how various vector operations
are written by using the index notation.
21

## Gradient. In Cartesian coordinates the gradient of a scalar field is

‚ąāŌÜ ‚ąāŌÜ ‚ąāŌÜ
e1 + b
e2 + b
e3 .
‚ąāx ‚ąāy ‚ąāz

The index notation focuses attention only on the components of the gradient. In Cartesian coordinates these
components are represented using a comma subscript to denote the derivative

‚ąāŌÜ
b
ej ¬∑ grad ŌÜ = ŌÜ,j = , j = 1, 2, 3.
‚ąāxj

The comma notation will be discussed in section 4. For now we use it to denote derivatives. For example
‚ąāŌÜ ‚ąā2ŌÜ
ŌÜ ,j = j
, ŌÜ ,jk = , etc.
‚ąāx ‚ąāxj ‚ąāxk

## Divergence. ~ is a scalar field and can be

In Cartesian coordinates the divergence of a vector field A
represented
~ = ‚ąāA1 + ‚ąāA2 + ‚ąāA3 .
~ = div A
‚ąá¬∑A
‚ąāx ‚ąāy ‚ąāz
Employing the summation convention and index notation, the divergence in Cartesian coordinates can be
represented
‚ąāAi ‚ąāA1 ‚ąāA2 ‚ąāA3
~ = div A
‚ąá¬∑A ~ = Ai,i = = + +
‚ąāxi ‚ąāx1 ‚ąāx2 ‚ąāx3
where i is the dummy summation index.
Curl. To represent the vector B ~ = curl A~ = ‚ąá√óA ~ in Cartesian coordinates, we note that the index
~ can
notation focuses attention only on the components of this vector. The components Bi , i = 1, 2, 3 of B
be represented
Bi = b ~ = eijk Ak,j ,
ei ¬∑ curl A for i, j, k = 1, 2, 3

## where eijk is the permutation symbol introduced earlier and Ak,j = ‚ąāA k

‚ąāxj . To verify this representation of the
~ we need only perform the summations indicated by the repeated indices. We have summing on j that
curl A

## Now summing each term on the repeated index k gives us

Bi = ei12 A2,1 + ei13 A3,1 + ei21 A1,2 + ei23 A3,2 + ei31 A1,3 + ei32 A2,3

Here i is a free index which can take on any of the values 1, 2 or 3. Consequently, we have

‚ąāA3 ‚ąāA2
For i = 1, B1 = A3,2 ‚ąí A2,3 = ‚ąí
‚ąāx2 ‚ąāx3
‚ąāA1 ‚ąāA3
For i = 2, B2 = A1,3 ‚ąí A3,1 = ‚ąí
‚ąāx3 ‚ąāx1
‚ąāA2 ‚ąāA1
For i = 3, B3 = A2,1 ‚ąí A1,2 = ‚ąí
‚ąāx1 ‚ąāx2

~ in Cartesian coordinates.
which verifies the index notation representation of curl A
22

Other Operations. The following examples illustrate how the index notation can be used to represent
additional vector operators in Cartesian coordinates.
~ ¬∑ ‚ąá)A
1. In index notation the components of the vector (B ~ are

~ ¬∑ ‚ąá)A}
{(B ~ ¬∑b
ep = Ap,q Bq p, q = 1, 2, 3

This can be verified by performing the indicated summations. We have by summing on the repeated
index q
Ap,q Bq = Ap,1 B1 + Ap,2 B2 + Ap,3 B3 .

The index p is now a free index which can have any of the values 1, 2 or 3. We have:

## for p = 1, A1,q Bq = A1,1 B1 + A1,2 B2 + A1,3 B3

‚ąāA1 ‚ąāA1 ‚ąāA1
= B1 + B2 + B3
‚ąāx1 ‚ąāx2 ‚ąāx3
for p = 2, A2,q Bq = A2,1 B1 + A2,2 B2 + A2,3 B3
‚ąāA2 ‚ąāA2 ‚ąāA2
= B1 + B2 + B3
‚ąāx1 ‚ąāx2 ‚ąāx3
for p = 3, A3,q Bq = A3,1 B1 + A3,2 B2 + A3,3 B3
‚ąāA3 ‚ąāA3 ‚ąāA3
= B1 + B2 + B3
‚ąāx1 ‚ąāx2 ‚ąāx3

~ ¬∑ ‚ąá)ŌÜ has the following form when expressed in the index notation:
2. The scalar (B

## ~ ¬∑ ‚ąá)ŌÜ = Bi ŌÜ,i = B1 ŌÜ,1 + B2 ŌÜ,2 + B3 ŌÜ,3

(B
‚ąāŌÜ ‚ąāŌÜ ‚ąāŌÜ
= B1 1 + B2 2 + B3 3 .
‚ąāx ‚ąāx ‚ąāx

## ~ √ó ‚ąá)ŌÜ is expressed in the index notation by

3. The components of the vector (B
h i
b ~ √ó ‚ąá)ŌÜ = eijk Bj ŌÜ,k .
ei ¬∑ (B

This can be verified by performing the indicated summations and is left as an exercise.
~ √ó ‚ąá) ¬∑ A
4. The scalar (B ~ may be expressed in the index notation. It has the form

~ √ó ‚ąá) ¬∑ A
(B ~ = eijk Bj Ai,k .

This can also be verified by performing the indicated summations and is left as an exercise.
5. The vector components of ‚ąá2 A ~ in the index notation are represented

ep ¬∑ ‚ąá2 A
b ~ = Ap,qq .

23

## EXAMPLE 1.1-19. In Cartesian coordinates prove the vector identity

~ = ‚ąá √ó (f A)
curl (f A) ~ = (‚ąáf ) √ó A
~ + f (‚ąá √ó A).
~

~ = curl (f A)
Solution: Let B ~ and write the components as

Bi = eijk (f Ak ),j
= eijk [f Ak,j + f,j Ak ]
= f eijk Ak,j + eijk f,j Ak .

## This index form can now be expressed in the vector form

~ = curl (f A)
B ~ = f (‚ąá √ó A)
~ + (‚ąáf ) √ó A
~

~ + B)
EXAMPLE 1.1-20. Prove the vector identity ‚ąá ¬∑ (A ~ = ‚ąá¬∑A ~ +‚ąá¬∑B ~
~+B
Solution: Let A ~ =C
~ and write this vector equation in the index notation as Ai + Bi = Ci . We then
have
~ + ‚ąá ¬∑ B.
~ = Ci,i = (Ai + Bi ),i = Ai,i + Bi,i = ‚ąá ¬∑ A
‚ąá¬∑C ~

~ ¬∑ ‚ąá)f = A
EXAMPLE 1.1-21. In Cartesian coordinates prove the vector identity (A ~ ¬∑ ‚ąáf
Solution: In the index notation we write
~ ¬∑ ‚ąá)f = Ai f,i = A1 f,1 + A2 f,2 + A3 f,3
(A
‚ąāf ‚ąāf ‚ąāf ~ ¬∑ ‚ąáf.
= A1 1 + A2 2 + A3 3 = A
‚ąāx ‚ąāx ‚ąāx

## EXAMPLE 1.1-22. In Cartesian coordinates prove the vector identity

~ √ó B)
‚ąá √ó (A ~ = A(‚ąá
~ ~ ‚ąí B(‚ąá
¬∑ B) ~ ~ + (B
¬∑ A) ~ ¬∑ ‚ąá)A
~ ‚ąí (A
~ ¬∑ ‚ąá)B
~

~ √ó B)
Solution: The pth component of the vector ‚ąá √ó (A ~ is

b ~ √ó B)]
ep ¬∑ [‚ąá √ó (A ~ = epqk [ekji Aj Bi ],q

## = epqk ekji Aj Bi,q + epqk ekji Aj,q Bi

By applying the e ‚ąí őī identity, the above expression simplifies to the desired result. That is,

b ~ √ó B)]
ep ¬∑ [‚ąá √ó (A ~ = (őīpj őīqi ‚ąí őīpi őīqj )Aj Bi,q + (őīpj őīqi ‚ąí őīpi őīqj )Aj,q Bi

## In vector form this is expressed

~ √ó B)
‚ąá √ó (A ~ = A(‚ąá
~ ~ ‚ąí (A
¬∑ B) ~ ¬∑ ‚ąá)B
~ + (B
~ ¬∑ ‚ąá)A
~ ‚ąí B(‚ąá
~ ~
¬∑ A)
24

## EXAMPLE 1.1-23. In Cartesian coordinates prove the vector identity ‚ąá √ó (‚ąá √ó A) ~ ‚ąí ‚ąá2 A

~ = ‚ąá(‚ąá ¬∑ A) ~
~ is given by b
Solution: We have for the ith component of ‚ąá √ó A ~ = eijk Ak,j and consequently the
ei ¬∑ [‚ąá √ó A]
pth component of ‚ąá √ó (‚ąá √ó A)~ is

## b ~ = epqr [erjk Ak,j ],q

ep ¬∑ [‚ąá √ó (‚ąá √ó A)]
= epqr erjk Ak,jq .

## b ~ = (őīpj őīqk ‚ąí őīpk őīqj )Ak,jq

ep ¬∑ [‚ąá √ó (‚ąá √ó A)]
= Ak,pk ‚ąí Ap,qq .

~ ‚ąí ‚ąá2 A.
~ = ‚ąá(‚ąá ¬∑ A)
Expressing this result in vector form we have ‚ąá √ó (‚ąá √ó A) ~

## Indicial Form of Integral Theorems

The divergence theorem, in both vector and indicial notation, can be written
ZZZ ZZ Z Z
div ¬∑ F~ dŌĄ = b dŌÉ
F~ ¬∑ n Fi,i dŌĄ = Fi ni dŌÉ i = 1, 2, 3 (1.1.16)
V S V S

where ni are the direction cosines of the unit exterior normal to the surface, dŌĄ is a volume element and dŌÉ
is an element of surface area. Note that in using the indicial notation the volume and surface integrals are
to be extended over the range specified by the indices. This suggests that the divergence theorem can be
applied to vectors in n‚ąídimensional spaces.
The vector form and indicial notation for the Stokes theorem are
ZZ Z Z Z
b dŌÉ =
(‚ąá √ó F~ ) ¬∑ n F~ ¬∑ d~r eijk Fk,j ni dŌÉ = Fi dxi i, j, k = 1, 2, 3 (1.1.17)
S C S C

and the Green‚Äôs theorem in the plane, which is a special case of the Stoke‚Äôs theorem, can be expressed
ZZ   Z Z Z
‚ąāF2 ‚ąāF1
‚ąí dxdy = F1 dx + F2 dy e3jk Fk,j dS = Fi dxi i, j, k = 1, 2 (1.1.18)
‚ąāx ‚ąāy C S C

## Other forms of the above integral theorems are

ZZZ ZZ
‚ąáŌÜ dŌĄ = b dŌÉ
ŌÜn
V S

## obtained from the divergence theorem by letting F~ = ŌÜC

~ where C~ is a constant vector. By replacing F~ by
F~ √ó C
~ in the divergence theorem one can derive
ZZZ   ZZ
~
‚ąá √ó F dŌĄ = ‚ąí F~ √ó ~n dŌÉ.
V S

## In the divergence theorem make the substitution F~ = ŌÜ‚ąáŌą to obtain

ZZZ ZZ
 
(ŌÜ‚ąá2 Ōą + (‚ąáŌÜ) ¬∑ (‚ąáŌą) dŌĄ = b dŌÉ.
(ŌÜ‚ąáŌą) ¬∑ n
V S
25

## The Green‚Äôs identity ZZZ ZZ


ŌÜ‚ąá2 Ōą ‚ąí Ōą‚ąá2 ŌÜ dŌĄ = b dŌÉ
(ŌÜ‚ąáŌą ‚ąí Ōą‚ąáŌÜ) ¬∑ n
V S

is obtained by first letting F~ = ŌÜ‚ąáŌą in the divergence theorem and then letting F~ = Ōą‚ąáŌÜ in the divergence
theorem and then subtracting the results.
Determinants, Cofactors

## det A = |A| = ei1 i2 i3 ...in a1i1 a2i2 a3i3 . . . anin .

This gives a summation of the n! permutations of products formed from the elements of the matrix A. The
result is a single number called the determinant of A.

## EXAMPLE 1.1-24. In the case n = 2 we have

a11 a12

|A| = = enm a1n a2m
a21 a22
= e1m a11 a2m + e2m a12 a2m
= e12 a11 a22 + e21 a12 a21
= a11 a22 ‚ąí a12 a21

## EXAMPLE 1.1-25. In the case n = 3 we can use either of the notations

Ô£ę Ô£∂ Ô£ę 1 Ô£∂
a11 a12 a13 a1 a12 a13
A = Ô£≠ a21 a22 a23 Ô£ł or A = Ô£≠ a21 a22 a23 Ô£ł
a31 a32 a33 a31 a32 a33

## det A = eijk a1i a2j a3k

det A = eijk ai1 aj2 ak3
det A = eijk ai1 aj2 ak3
det A = eijk a1i a2j a3k .

## These represent row and column expansions of the determinant.

An important identity results if we examine the quantity Brst = eijk air ajs akt . It is an easy exercise to
change the dummy summation indices and rearrange terms in this expression. For example,

Brst = eijk air ajs akt = ekji akr ajs ait = ekji ait ajs akr = ‚ąíeijk ait ajs akr = ‚ąíBtsr ,

and by considering other permutations of the indices, one can establish that Brst is completely skew-
symmetric. In the exercises it is shown that any third order completely skew-symmetric system satisfies
Brst = B123 erst . But B123 = det A and so we arrive at the identity

26

## Other forms of this identity are

eijk ari asj atk = |A|erst and eijk air ajs akt = |A|erst . (1.1.19)

## Consider the representation of the determinant

1
a a12 a13
1
|A| = a21 a22 a23
a31 a32 a33

by use of the indicial notation. By column expansions, this determinant can be represented

## i in the determinant |A|. From the equation (1.1.20) the cofactor

Define Aim as the cofactor of the element am
of ar1 is obtained by deleting this element and we find

## The result (1.1.20) can then be expressed in the form

|A| = ar1 A1r = a11 A11 + a21 A12 + a31 A13 . (1.1.23)

That is, the determinant |A| is obtained by multiplying each element in the first column by its corresponding
cofactor and summing the result. Observe also that from the equation (1.1.20) we find the additional
cofactors
A2s = erst ar1 at3 and A3t = erst ar1 as2 . (1.1.24)

Hence, the equation (1.1.20) can also be expressed in one of the forms

## |A| = as2 A2s = a12 A21 + a22 A22 + a32 A23

|A| = at3 A3t = a13 A31 + a23 A32 + a33 A33

The results from equations (1.1.22) and (1.1.24) can be written in a slightly different form with the indicial
notation. From the notation for a generalized Kronecker delta defined by

ijk
eijk elmn = őīlmn ,

## the above cofactors can be written in the form

1 1jk 1 1jk s t
A1r = e123 erst as2 at3 = e erst asj atk = őīrst aj ak
2! 2!
1 1 2jk s t
A2r = e123 esrt as1 at3 = e2jk erst asj atk = őīrst aj ak
2! 2!
1 1 3jk s t
A3r = e123 etsr at1 as2 = e3jk erst asj atk = őīrst aj ak .
2! 2!
27

## These cofactors are then combined into the single equation

1 ijk s t
Air = őī a a (1.1.25)
2! rst j k

which represents the cofactor of ari . When the elements from any row (or column) are multiplied by their
corresponding cofactors, and the results summed, we obtain the value of the determinant. Whenever the
elements from any row (or column) are multiplied by the cofactor elements from a different row (or column),
and the results summed, we get zero. This can be illustrated by considering the summation

1 ijk s t m 1
am i
r Am = őīmst aj ak ar = eijk emst am s t
r aj ak
2! 2!
1 1 ijk
= eijk erjk |A| = őīrjk |A| = őīri |A|
2! 2!
Here we have used the e ‚ąí őī identity to obtain

ijk
őīrjk = eijk erjk = ejik ejrk = őīri őīkk ‚ąí őīki őīrk = 3őīri ‚ąí őīri = 2őīri

## which was used to simplify the above result.

As an exercise one can show that an alternate form of the above summation of elements by its cofactors
is
i = |A|őīi .
arm Am r

## EXAMPLE 1.1-26. In N-dimensions the quantity őīkj11jk22...jN

...kN is called a generalized Kronecker delta. It
can be defined in terms of permutation symbols as

## ej1 j2 ...jN ek1 k2 ...kN = őīkj11jk22...jN

...kN (1.1.26)

Observe that
őīkj11jk22...jN
...kN e
k1 k2 ...kN
= (N !) ej1 j2 ...jN

This follows because ek1 k2 ...kN is skew-symmetric in all pairs of its superscripts. The left-hand side denotes
a summation of N ! terms. The first term in the summation has superscripts j1 j2 . . . jN and all other terms
have superscripts which are some permutation of this ordering with minus signs associated with those terms
having an odd permutation. Because ej1 j2 ...jN is completely skew-symmetric we find that all terms in the
summation have the value +ej1 j2 ...jN . We thus obtain N ! of these terms.
28

EXERCISE 1.1

I 1. Simplify each of the following by employing the summation property of the Kronecker delta. Perform
sums on the summation indices only if your are unsure of the result.

(a) eijk őīkn (c) eijk őīis őījm őīkn (e) őīij őījn
(b) eijk őīis őījm (d) aij őīin (f ) őīij őījn őīni

## (a) őīii (c) eijk Ai Aj Ak (e) eijk őījk

(b) őīij őīij (d) eijk eijk (f ) Ai Bj őīji ‚ąí Bm An őīmn

I 3. ~ = Ai
Express each of the following in index notation. Be careful of the notation you use. Note that A
is an incorrect notation because a vector can not equal a scalar. The notation A~¬∑ b
ei = Ai should be used to
express the ith component of a vector.
~ ¬∑ (B
(a) A ~ √ó C)
~ ~ A
(c) B( ~ ¬∑ C)
~
~ √ó (B
(b) A ~ √ó C)
~ (d) ~ A
B( ~ ¬∑ C)
~ ‚ąí C(
~ A~ ¬∑ B)
~

I 4. Show the e permutation symbol satisfies: (a) eijk = ejki = ekij (b) eijk = ‚ąíejik = ‚ąíeikj = ‚ąíekji

I 5. ~ √ó (B
Use index notation to verify the vector identity A ~ √ó C)
~ = B(
~ A~ ¬∑ C)
~ ‚ąí C(
~ A~ ¬∑ B)
~

## I 6. Let yi = aij xj and xm = aim zi where the range of the indices is 1, 2

(a) Solve for yi in terms of zi using the indicial notation and check your result
to be sure that no index repeats itself more than twice.
(b) Perform the indicated summations and write out expressions
for y1 , y2 in terms of z1 , z2
(c) Express the above equations in matrix form. Expand the matrix
equations and check the solution obtained in part (b).

I 7. Use the e ‚ąí őī identity to simplify (a) eijk ejik (b) eijk ejki

## I 8. Prove the following vector identities:

(a) ~ ¬∑ (B
A ~ √ó C)
~ =B
~ ¬∑ (C
~ √ó A)
~ =C
~ ¬∑ (A
~ √ó B)
~ triple scalar product
~ √ó B)
(b) (A ~ √óC
~ = B(
~ A~ ¬∑ C)
~ ‚ąí A(
~ B~ ¬∑ C)
~

~ √ó B)
(a) (A ~ ¬∑ (C
~ √ó D)
~ = (A
~ ¬∑ C)(
~ B ~ ¬∑ D)
~ ‚ąí (A
~ ¬∑ D)(
~ B ~ ¬∑ C)
~
~ √ó (B
(b) A ~ √ó C)
~ +B
~ √ó (C
~ √ó A)
~ +C
~ √ó (A
~ √ó B)
~ = ~0

(c) ~ √ó B)
(A ~ √ó (C
~ √ó D)
~ = B(
~ A~¬∑C
~ √ó D)
~ ‚ąí A(
~ B~ ¬∑C
~ √ó D)
~
29

## I 10. ~ = (1, ‚ąí1, 0) and B

For A ~ = (4, ‚ąí3, 2) find using the index notation,

(a) Ci = eijk Aj Bk , i = 1, 2, 3
(b) Ai Bi
(c) What do the results in (a) and (b) represent?

dy1 dy2
I 11. Represent the differential equations = a11 y1 + a12 y2 and = a21 y1 + a22 y2
dt dt
using the index notation.

I 12.
Let ő¶ = ő¶(r, őł) where r, őł are polar coordinates related to Cartesian coordinates (x, y) by the transfor-
mation equations x = r cos őłand y = r sin őł.
‚ąāő¶ ‚ąā2ő¶
(a) Find the partial derivatives , and
‚ąāy ‚ąāy 2
(b) Combine the result in part (a) with the result from EXAMPLE 1.1-18 to calculate the Laplacian

‚ąā2ő¶ ‚ąā2ő¶
‚ąá2 ő¶ = +
‚ąāx2 ‚ąāy 2

in polar coordinates.

## I 13. (Index notation) Let a11 = 3, a12 = 4, a21 = 5, a22 = 6.

Calculate the quantity C = aij aij , i, j = 1, 2.

## I 14. Show the moments of inertia Iij defined by

ZZZ ZZZ
I11 = (y 2 + z 2 )ŌĀ(x, y, z) dŌĄ I23 = I32 = ‚ąí yzŌĀ(x, y, z) dŌĄ
ZR
ZZ ZR
ZZ
2 2
I22 = (x + z )ŌĀ(x, y, z) dŌĄ I12 = I21 = ‚ąí xyŌĀ(x, y, z) dŌĄ
R R
ZZZ ZZZ
I33 = (x2 + y 2 )ŌĀ(x, y, z) dŌĄ I13 = I31 = ‚ąí xzŌĀ(x, y, z) dŌĄ,
R R
ZZZ

can be represented in the index notation as Iij = xm xm őīij ‚ąí xi xj ŌĀ dŌĄ, where ŌĀ is the density,
R
x1 = x, x2 = y, x3 = z and dŌĄ = dxdydz is an element of volume.

I 15. Determine if the following relation is true or false. Justify your answer.

ei ¬∑ ( b
b ej √ó b
ek ) = ( b
ei √ó b
ej ) ¬∑ b
ek = eijk , i, j, k = 1, 2, 3.

Hint: Let b
em = (őī1m , őī2m , őī3m ).

I 16. Without substituting values for i, l = 1, 2, 3 calculate all nine terms of the given quantities

## (a) B il = (őīji Ak + őīki Aj )ejkl (b) Ail = (őīim B k + őīik B m )emlk

I 17. Let Amn xm y n = 0 for arbitrary xi and y i , i = 1, 2, 3, and show that Aij = 0 for all values of i, j.
30

I 18.
(a) For amn , m, n = 1, 2, 3 skew-symmetric, show that amn xm xn = 0.
(b) Let amn xm xn = 0, m, n = 1, 2, 3 for all values of xi , i = 1, 2, 3 and show that amn must be skew-
symmetric.

I 19. Let A and B denote 3 √ó 3 matrices with elements aij and bij respectively. Show that if C = AB is a
matrix product, then det(C) = det(A) ¬∑ det(B).
Hint: Use the result from example 1.1-9.

I 20.
(a) Let u1 , u2 , u3 be functions of the variables s1 , s 2 , s3 . Further, assume that s1 , s2 , s3 are in turn each
‚ąāum ‚ąā(u1 , u2 , u3 )
functions of the variables x1 , x2 , x3 . Let n = denote the Jacobian of the u0 s with
‚ąāx ‚ąā(x1 , x2 , x3 )
respect to the x0 s. Show that
i i
‚ąāu ‚ąāu ‚ąāsj ‚ąāui ‚ąāsj
= = ¬∑
‚ąāxm ‚ąāsj ‚ąāxm ‚ąāsj ‚ąāxm .

## ‚ąāxi ‚ąā xŐĄj ‚ąāxi i

(b) Note that j m
= = őīm and show that J( xxŐĄ )¬∑J( xxŐĄ ) = 1, where J( xxŐĄ ) is the Jacobian determinant
‚ąā xŐĄ ‚ąāx ‚ąāxm
of the transformation (1.1.7).

I 21. A third order system a`mn with `, m, n = 1, 2, 3 is said to be symmetric in two of its subscripts if the
components are unaltered when these subscripts are interchanged. When a`mn is completely symmetric then
a`mn = am`n = a`nm = amn` = anm` = an`m . Whenever this third order system is completely symmetric,
then: (i) How many components are there? (ii) How many of these components are distinct?
Hint: Consider the three cases (i) ` = m = n (ii) ` = m 6= n (iii) ` 6= m 6= n.

I 22. A third order system b`mn with `, m, n = 1, 2, 3 is said to be skew-symmetric in two of its subscripts
if the components change sign when the subscripts are interchanged. A completely skew-symmetric third
order system satisfies b`mn = ‚ąíbm`n = bmn` = ‚ąíbnm` = bn`m = ‚ąíb`nm . (i) How many components does
a completely skew-symmetric system have? (ii) How many of these components are zero? (iii) How many
components can be different from zero? (iv) Show that there is one distinct component b123 and that
b`mn = e`mn b123 .
Hint: Consider the three cases (i) ` = m = n (ii) ` = m 6= n (iii) ` 6= m 6= n.

I 23. Let i, j, k = 1, 2, 3 and assume that eijk ŌÉjk = 0 for all values of i. What does this equation tell you
about the values ŌÉij , i, j = 1, 2, 3?

I 24. Assume that Amn and Bmn are symmetric for m, n = 1, 2, 3. Let Amn xm xn = Bmn xm xn for arbitrary
values of xi , i = 1, 2, 3, and show that Aij = Bij for all values of i and j.

I 25. Assume Bmn is symmetric and Bmn xm xn = 0 for arbitrary values of xi , i = 1, 2, 3, show that Bij = 0.
31

I 26. (Generalized Kronecker delta) Define the generalized Kronecker delta as the n √ó n determinant
i
őīm őīni ¬∑ ¬∑ ¬∑ őīpi
j
őīm őīnj ¬∑ ¬∑ ¬∑ őīpj

ij...k
őīmn...p = . .. .. . where őīsr is the Kronecker delta.
.. . ..
.
őīk őīnk ¬∑ ¬∑ ¬∑ őīpk
m

123
(a) Show eijk = őīijk
ijk
(b) Show eijk = őī123
ij
(c) Show őīmn = eij emn
rs rsp
(d) Define őīmn = őīmnp (summation on p)
and show rs
őīmn = őīm őīn ‚ąí őīnr őīm
r s s

Note that by combining the above result with the result from part (c)
we obtain the two dimensional form of the e ‚ąí őī identity ers emn = őīm őīn ‚ąí őīnr őīm
r s s
.
r
(e) Define őīm = 12 őīmn
rn
(summation on n) and show rst
őīpst = 2őīpr
rst
(f ) Show őīrst = 3!

1
a1 a12 a13

I 27. Let Ar denote the cofactor of ai in the determinant a21
i r
a22 a23 as given by equation (1.1.25).
a31 a32 a33

(a) Show erst Air = eijk asj atk (b) Show erst Ari = eijk ajs akt

I 28. (a) Show that if Aijk = Ajik , i, j, k = 1, 2, 3 there is a total of 27 elements, but only 18 are distinct.
(b) Show that for i, j, k = 1, 2, . . . , N there are N 3 elements, but only N 2 (N + 1)/2 are distinct.

I 29. Let aij = Bi Bj for i, j = 1, 2, 3 where B1 , B2 , B3 are arbitrary constants. Calculate det(aij ) = |A|.

I 30.
(a) For A = (aij ), i, j = 1, 2, 3, show |A| = eijk ai1 aj2 ak3 .
(b) For A = (aij ), i, j = 1, 2, 3, show |A| = eijk ai1 aj2 ak3 .
(c) For A = (aij ), i, j = 1, 2, 3, show |A| = eijk a1i a2j a3k .
(d) For I = (őīji ), i, j = 1, 2, 3, show |I| = 1.

I 31. Let |A| = eijk ai1 aj2 ak3 and define Aim as the cofactor of aim . Show the determinant can be
expressed in any of the forms:

## (a) |A| = Ai1 ai1 where Ai1 = eijk aj2 ak3

(b) |A| = Aj2 aj2 where Ai2 = ejik aj1 ak3
(c) |A| = Ak3 ak3 where Ai3 = ejki aj1 ak2
32

## I 32. Show the results in problem 31 can be written in the forms:

1 1 1 1
Ai1 = e1st eijk ajs akt , Ai2 = e2st eijk ajs akt , Ai3 = e3st eijk ajs akt , or Aim = emst eijk ajs akt
2! 2! 2! 2!

I 33. Use the results in problems 31 and 32 to prove that apm Aim = |A|őīip .
Ô£ę Ô£∂
1 2 1
I 34. Let (aij ) = Ô£≠ 1 0 3 Ô£ł and calculate C = aij aij , i, j = 1, 2, 3.
2 3 2

I 35. Let
a111 = ‚ąí1, a112 = 3, a121 = 4, a122 = 2
a211 = 1, a212 = 5, a221 = 2, a222 = ‚ąí2
and calculate the quantity C = aijk aijk , i, j, k = 1, 2.

I 36. Let
a1111 = 2, a1112 = 1, a1121 = 3, a1122 = 1
a1211 = 5, a1212 = ‚ąí2, a1221 = 4, a1222 = ‚ąí2
a2111 = 1, a2112 = 0, a2121 = ‚ąí2, a2122 = ‚ąí1
a2211 = ‚ąí2, a2212 = 1, a2221 = 2, a2222 = 2
and calculate the quantity C = aijkl aijkl , i, j, k, l = 1, 2.

## I 37. Simplify the expressions:

‚ąāxi
(a) (Aijkl + Ajkli + Aklij + Alijk )xi xj xk xl (c)
‚ąāxj
(b) (Pijk + Pjki + Pkij )xi xj xk ‚ąā 2 xi ‚ąāxj ‚ąā 2 xm ‚ąāxi
(d) aij r ‚ąí ami
t s
‚ąāx ‚ąāx ‚ąāx ‚ąāxs ‚ąāxt ‚ąāxr

I 38. Let g denote the determinant of the matrix having the components gij , i, j = 1, 2, 3. Show that

g1r g1s g1t gir gis git

(a) g erst = g2r g2s g2t (b) g erst eijk = gjr gjs gjt
g3r g3s g3t gkr gks gkt

i
őīm őīni őīpi
j
I 39. Show that eijk emnp = őīmnp
ijk
= őīm őīnj őīpj
őīk őīnk őīpk
m

I 40. Show that eijk emnp Amnp = Aijk ‚ąí Aikj + Akij ‚ąí Ajik + Ajki ‚ąí Akji
Hint: Use the results from problem 39.

## (a) eij eij = 2! (c) eijkl eijkl = 4!

(b) eijk eijk = 3! (d) Guess at the result ei1 i2 ...in ei1 i2 ...in
33

I 42. Determine if the following statement is true or false. Justify your answer. eijk Ai Bj Ck = eijk Aj Bk Ci .

## I 43. Let aij , i, j = 1, 2 denote the components

of a 2 √ó 2 matrix A, which are functions of time t.
a11 a12
(a) Expand both |A| = eij ai1 aj2 and |A| = to verify that these representations are the same.
a21 a22
(b) Verify the equivalence of the derivative relations

d|A| dai1 daj2 d|A| dadt11 da12
dt
a11
+ da a12
= eij aj2 + eij ai1 and = 21 da22
dt dt dt dt a21 a22 dt dt

(c) Let aij , i, j = 1, 2, 3 denote the components of a 3 √ó 3 matrix A, which are functions of time t. Develop
appropriate relations, expand them and verify, similar to parts (a) and (b) above, the representation of
a determinant and its derivative.

I 44. For f = f (x1 , x2 , x3 ) and ŌÜ = ŌÜ(f ) differentiable scalar functions, use the indicial notation to find a
formula to calculate grad ŌÜ .

## I 45. Use the indicial notation to prove (a) ‚ąá √ó ‚ąáŌÜ = ~0 ~=0

(b) ‚ąá ¬∑ ‚ąá √ó A

I 46. If Aij is symmetric and Bij is skew-symmetric, i, j = 1, 2, 3, then calculate C = Aij Bij .

I 47. Assume Aij = Aij (x1 , x2 , x3 ) and Aij = Aij (x1 , x2 , x3 ) for i, j = 1, 2, 3 are related by the expression
‚ąāxi ‚ąāxj ‚ąāAmn
Amn = Aij m n . Calculate the derivative .
‚ąāx ‚ąāx ‚ąāxk

I 48. Prove that if any two rows (or two columns) of a matrix are interchanged, then the value of the
determinant of the matrix is multiplied by minus one. Construct your proof using 3 √ó 3 matrices.

I 49. Prove that if two rows (or columns) of a matrix are proportional, then the value of the determinant
of the matrix is zero. Construct your proof using 3 √ó 3 matrices.

I 50. Prove that if a row (or column) of a matrix is altered by adding some constant multiple of some other
row (or column), then the value of the determinant of the matrix remains unchanged. Construct your proof
using 3 √ó 3 matrices.

## I 51. Simplify the expression ŌÜ = eijk e`mn Ai` Ajm Akn .

I 52. Let Aijk denote a third order system where i, j, k = 1, 2. (a) How many components does this system
have? (b) Let Aijk be skew-symmetric in the last pair of indices, how many independent components does
the system have?

I 53. Let Aijk denote a third order system where i, j, k = 1, 2, 3. (a) How many components does this
system have? (b) In addition let Aijk = Ajik and Aikj = ‚ąíAijk and determine the number of distinct
nonzero components for Aijk .
34

I 54. Show that every second order system Tij can be expressed as the sum of a symmetric system Aij and
skew-symmetric system Bij . Find Aij and Bij in terms of the components of Tij .

## I 55. Consider the system Aijk , i, j, k = 1, 2, 3, 4.

(a) How many components does this system have?
(b) Assume Aijk is skew-symmetric in the last pair of indices, how many independent components does this
system have?
(c) Assume that in addition to being skew-symmetric in the last pair of indices, Aijk + Ajki + Akij = 0 is
satisfied for all values of i, j, and k, then how many independent components does the system have?

## I 56. ~ in indicial form. (b) Write the equation of the plane

(a) Write the equation of a line ~r = ~r0 + t A
~n ¬∑ (~r ‚ąí ~r0 ) = 0 in indicial form. (c) Write the equation of a general line in scalar form. (d) Write the
equation of a plane in scalar form. (e) Find the equation of the line defined by the intersection of the
planes 2x + 3y + 6z = 12 and 6x + 3y + z = 6. (f) Find the equation of the plane through the points
(5, 3, 2), (3, 1, 5), (1, 3, 3). Find also the normal to this plane.

I 57. The angle 0 ‚Č§ őł ‚Č§ ŌÄ between two skew lines in space is defined as the angle between their direction
vectors when these vectors are placed at the origin. Show that for two lines with direction numbers ai and
bi i = 1, 2, 3, the cosine of the angle between these lines satisfies

ai b i
cos őł = ‚ąö ‚ąö
ai ai b i b i

I 58. Let aij = ‚ąíaji for i, j = 1, 2, . . . , N and prove that for N odd det(aij ) = 0.
‚ąāőĽ ‚ąā2őĽ
I 59. Let őĽ = Aij xi xj where Aij = Aji and calculate (a) (b)
‚ąāxm ‚ąāxm ‚ąāxk
I 60. Given an arbitrary nonzero vector Uk , k = 1, 2, 3, define the matrix elements aij = eijk Uk , where eijk
is the e-permutation symbol. Determine if aij is symmetric or skew-symmetric. Suppose Uk is defined by
the above equation for arbitrary nonzero aij , then solve for Uk in terms of the aij .

I 61. If Aij = Ai Bj 6= 0 for all i, j values and Aij = Aji for i, j = 1, 2, . . . , N , show that Aij = őĽBi Bj
where őĽ is a constant. State what őĽ is.

I 62. Assume that Aijkm , with i, j, k, m = 1, 2, 3, is completely skew-symmetric. How many independent
components does this quantity have?

I 63. Consider Rijkm , i, j, k, m = 1, 2, 3, 4. (a) How many components does this quantity have? (b) If
Rijkm = ‚ąíRijmk = ‚ąíRjikm then how many independent components does Rijkm have? (c) If in addition
Rijkm = Rkmij determine the number of independent components.

I 64. Let xi = aij xŐĄj , i, j = 1, 2, 3 denote a change of variables from a barred system of coordinates to an
unbarred system of coordinates and assume that AŐĄi = aij Aj where aij are constants, AŐĄi is a function of the
‚ąā AŐĄi
xŐĄj variables and Aj is a function of the xj variables. Calculate .
‚ąā xŐĄm
35

## ¬ß1.2 TENSOR CONCEPTS AND TRANSFORMATIONS

e1 , b
For b e2 , b ~ as
e3 independent orthogonal unit vectors (base vectors), we may write any vector A

~ = A1 b
A e1 + A2 b
e2 + A3 b
e3

## ~ relative to the base vectors chosen. These components are the

where (A1 , A2 , A3 ) are the coordinates of A
~ onto the base vectors and
projection of A

~ = (A
A e1 ) b
~¬∑ b ~¬∑ b
e1 + (A e2 ) b ~¬∑b
e2 + (A e3 ) b
e3 .

## Select any three independent orthogonal vectors, (E ~ 2, E

~ 1, E ~ 3 ), not necessarily of unit length, we can then
write
~1
E ~2
E ~3
E
b
e1 = , b
e2 = , b
e3 = ,
~ 1|
|E ~ 2|
|E ~ 3|
|E
~ can be expressed as
and consequently, the vector A
! ! !
A~ ¬∑E~1 A~¬∑E ~2 A~¬∑E ~3
A~= ~1 +
E ~2 +
E ~ 3.
E
~1 ¬∑ E
E ~1 E~2 ¬∑ E
~2 ~3 ¬∑ E
E ~3

~¬∑E
A ~ (i)
, i = 1, 2, 3
~ (i) ¬∑ E
E ~ (i)

## ~ relative to the chosen base vectors E

are the components of A ~ 2, E
~ 1, E ~ 3 . Recall that the parenthesis about
the subscript i denotes that there is no summation on this subscript. It is then treated as a free subscript
which can have any of the values 1, 2 or 3.

Reciprocal Basis

## Consider a set of any three independent vectors (E~ 1, E

~ 2, E
~ 3 ) which are not necessarily orthogonal, nor of
~ in terms of these vectors we must find components (A1 , A2 , A3 )
unit length. In order to represent the vector A
such that
~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~ 3.

This can be done by taking appropriate projections and obtaining three equations and three unknowns from
which the components are determined. A much easier way to find the components (A1 , A2 , A3 ) is to construct
~ 1, E
a reciprocal basis (E ~ 2, E
~ 3 ). Recall that two bases (E
~ 1, E
~ 2, E ~ 1, E
~ 3 ) and (E ~ 2, E
~ 3 ) are said to be reciprocal
if they satisfy the condition 
~ j = őīj =
~i ¬∑ E 1 if i = j
E i .
0 6 j
if i =
Note that E ~ 1 = őī21 = 0 and E
~2 ¬∑ E ~ 1 = őī31 = 0 so that the vector E
~3 ¬∑ E ~ 1 is perpendicular to both the
~ 2 and E
vectors E ~ 3 . (i.e. A vector from one basis is orthogonal to two of the vectors from the other basis.)
We can therefore write E ~ 1 = V ‚ąí1 E
~2 √ó E ~ 3 where V is a constant to be determined. By taking the dot
product of both sides of this equation with the vector E ~ 1 we find that V = E ~ 1 ¬∑ (E
~2 √ó E
~ 3 ) is the volume
of the parallelepiped formed by the three vectors E ~ 2, E
~ 1, E ~ 3 when their origins are made to coincide. In a
36

~ 1, E
similar manner it can be demonstrated that for (E ~ 2, E
~ 3 ) a given set of basis vectors, then the reciprocal
basis vectors are determined from the relations
E~1 = 1 E ~2 √ó E~ 3, ~2 = 1 E
E ~3 √ó E
~ 1, ~3 = 1 E
E ~1 √ó E~ 2,
V V V
where V = E ~2 √ó E
~ 1 ¬∑ (E ~ 3 ) 6= 0 is a triple scalar product and represents the volume of the parallelepiped
having the basis vectors for its sides.
Let (E ~ 2, E
~ 1, E ~ 1, E
~ 3 ) and (E ~ 2, E
~ 3 ) denote a system of reciprocal bases. We can represent any vector A
~
with respect to either of these bases. If we select the basis (E ~ 2, E
~ 1, E ~ 3 ) and represent A
~ in the form

~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~ 3, (1.2.1)

## then the components (A1 , A2 , A3 ) of A

~ relative to the basis vectors (E
~ 1, E
~ 2, E
~ 3 ) are called the contravariant
~ These components can be determined from the equations
components of A.

A ~ 1 = A1 ,
~ ¬∑E ~ 2 = A2 ,
~¬∑E
A ~ 3 = A3 .
~¬∑E
A
~ 1, E
Similarly, if we choose the reciprocal basis (E ~ 2, E
~ 3 ) and represent A
~ in the form
~ 1 + A2 E
~ = A1 E
A ~ 2 + A3 E
~ 3, (1.2.2)
~ 1, E
then the components (A1 , A2 , A3 ) relative to the basis (E ~ 2, E
~ 3 ) are called the covariant components of
~ These components can be determined from the relations
A.
~ ¬∑E
A ~ 1 = A1 , ~ ¬∑E
A ~ 2 = A2 , ~ ¬∑E
A ~ 3 = A3 .

The contravariant and covariant components are different ways of representing the same vector with respect
to a set of reciprocal basis vectors. There is a simple relationship between these components which we now
develop. We introduce the notation

E ~ j = gij = gji ,
~i ¬∑ E and ~i ¬∑ E
E ~ j = g ij = g ji (1.2.3)

where gij are called the metric components of the space and g ij are called the conjugate metric components
of the space. We can then write
~ ¬∑E
A ~1 ¬∑ E
~ 1 = A1 (E ~2 ¬∑ E
~ 1 ) + A2 (E ~3 ¬∑ E
~ 1 ) + A3 (E ~ 1 ) = A1

A ~ 1 = A1 (E
~ ¬∑E ~ 1 ) + A2 (E
~1 ¬∑ E ~ 1 ) + A3 (E
~2 ¬∑ E ~3 ¬∑ E
~ 1 ) = A1
or
A1 = A1 g11 + A2 g12 + A3 g13 . (1.2.4)
~¬∑E
In a similar manner, by considering the dot products A ~ ¬∑E
~ 2 and A ~ 3 one can establish the results

## A2 = A1 g21 + A2 g22 + A3 g23 A3 = A1 g31 + A2 g32 + A3 g33 .

These results can be expressed with the index notation as

Ai = gik Ak . (1.2.6)

## Forming the dot products A ~ 1,

~ ¬∑E ~ 2,
~ ¬∑E
A ~ 3 it can be verified that
~¬∑E
A
Ai = g ik Ak . (1.2.7)

The equations (1.2.6) and (1.2.7) are relations which exist between the contravariant and covariant compo-
nents of the vector A. ~ Similarly, if for some value j we have E ~1 + ő≤ E
~j = őĪE ~2 + ő≥ E~ 3 , then one can show
~ =g E
that E j ij ~ i . This is left as an exercise.
37

Coordinate Transformations

Consider a coordinate transformation from a set of coordinates (x, y, z) to (u, v, w) defined by a set of
transformation equations
x = x(u, v, w)
y = y(u, v, w) (1.2.8)
z = z(u, v, w)
It is assumed that these transformations are single valued, continuous and possess the inverse transformation

u = u(x, y, z)
v = v(x, y, z) (1.2.9)
w = w(x, y, z).

These transformation equations define a set of coordinate surfaces and coordinate curves. The coordinate
surfaces are defined by the equations
u(x, y, z) = c1
v(x, y, z) = c2 (1.2.10)
w(x, y, z) = c3
where c1 , c2 , c3 are constants. These surfaces intersect in the coordinate curves

## ~r(u, c2 , c3 ), ~r(c1 , v, c3 ), ~r(c1 , c2 , w), (1.2.11)

where
e1 + y(u, v, w) b
~r(u, v, w) = x(u, v, w) b e2 + z(u, v, w) b
e3 .

## The general situation is illustrated in the figure 1.2-1.

Consider the vectors

~ 1 = grad u = ‚ąáu,
E ~ 2 = grad v = ‚ąáv,
E ~ 3 = grad w = ‚ąáw
E (1.2.12)

evaluated at the common point of intersection (c1 , c2 , c3 ) of the coordinate surfaces. The system of vectors
~ 1, E
(E ~ 2, E
~ 3 ) can be selected as a system of basis vectors which are normal to the coordinate surfaces.
Similarly, the vectors
~ 1 = ‚ąā~r ,
E ~ 2 = ‚ąā~r ,
E ~ 3 = ‚ąā~r
E (1.2.13)
‚ąāu ‚ąāv ‚ąāw
~ 1, E
when evaluated at the common point of intersection (c1 , c2 , c3 ) forms a system of vectors (E ~ 2, E
~ 3 ) which
we can select as a basis. This basis is a set of tangent vectors to the coordinate curves. It is now demonstrated
that the normal basis (E ~ 1, E
~ 2, E
~ 3 ) and the tangential basis (E
~ 1, E
~ 2, E
~ 3 ) are a set of reciprocal bases.
e1 + y b
Recall that ~r = x b e2 + z b
e3 denotes the position vector of a variable point. By substitution for
x, y, z from (1.2.8) there results

e1 + y(u, v, w) b
~r = ~r(u, v, w) = x(u, v, w) b e2 + z(u, v, w) b
e3 . (1.2.14)
38

## A small change in ~r is denoted

‚ąā~r ‚ąā~r ‚ąā~r
e1 + dy b
d~r = dx b e2 + dz b
e3 = du + dv + dw (1.2.15)
‚ąāu ‚ąāv ‚ąāw
where
‚ąā~r ‚ąāx ‚ąāy ‚ąāz
= b
e1 + b
e2 + b
e3
‚ąāu ‚ąāu ‚ąāu ‚ąāu
‚ąā~r ‚ąāx ‚ąāy ‚ąāz
= b
e1 + b
e2 + b
e3 (1.2.16)
‚ąāv ‚ąāv ‚ąāv ‚ąāv
‚ąā~r ‚ąāx ‚ąāy ‚ąāz
= b
e1 + b
e2 + b
e3 .
‚ąāw ‚ąāw ‚ąāw ‚ąāw
In terms of the u, v, w coordinates, this change can be thought of as moving along the diagonal of a paral-
‚ąā~r ‚ąā~r ‚ąā~r
lelepiped having the vector sides du, dv, and dw.
‚ąāu ‚ąāv ‚ąāw
Assume u = u(x, y, z) is defined by equation (1.2.9) and differentiate this relation to obtain
‚ąāu ‚ąāu ‚ąāu
du = dx + dy + dz. (1.2.17)
‚ąāx ‚ąāy ‚ąāz
The equation (1.2.15) enables us to represent this differential in the form:

du = grad u ¬∑ d~r
 
‚ąā~r ‚ąā~r ‚ąā~r
du = grad u ¬∑ du + dv + dw
‚ąāu ‚ąāv ‚ąāw (1.2.18)
     
‚ąā~r ‚ąā~r ‚ąā~r
‚ąāu ‚ąāv ‚ąāw
By comparing like terms in this last equation we find that

~1 ¬∑ E
E ~ 1 = 1, ~1 ¬∑ E
E ~ 2 = 0, ~1 ¬∑ E
E ~ 3 = 0. (1.2.19)

Similarly, from the other equations in equation (1.2.9) which define v = v(x, y, z), and w = w(x, y, z) it
can be demonstrated that
     
‚ąā~r ‚ąā~r ‚ąā~r
dv = grad v ¬∑ du + grad v ¬∑ dv + grad v ¬∑ dw (1.2.20)
‚ąāu ‚ąāv ‚ąāw
39

and      
‚ąā~r ‚ąā~r ‚ąā~r
dw = grad w ¬∑ du + grad w ¬∑ dv + grad w ¬∑ dw. (1.2.21)
‚ąāu ‚ąāv ‚ąāw
By comparing like terms in equations (1.2.20) and (1.2.21) we find

~2 ¬∑ E
E ~ 1 = 0, ~2 ¬∑ E
E ~ 2 = 1, ~2 ¬∑ E
E ~3 = 0
(1.2.22)
~3 ¬∑ E
E ~ 1 = 0, ~3 ¬∑ E
E ~ 2 = 0, ~3 ¬∑ E
E ~ 3 = 1.

The equations (1.2.22) and (1.2.19) show us that the basis vectors defined by equations (1.2.12) and (1.2.13)
are reciprocal.
Introducing the notation

## (x1 , x2 , x3 ) = (u, v, w) (y 1 , y 2 , y 3 ) = (x, y, z) (1.2.23)

where the x0 s denote the generalized coordinates and the y 0 s denote the rectangular Cartesian coordinates,
the above equations can be expressed in a more concise form with the index notation. For example, if

## then the reciprocal basis vectors can be represented

~ i = grad xi ,
E i = 1, 2, 3 (1.2.25)

and
~ i = ‚ąā~r ,
E i = 1, 2, 3. (1.2.26)
‚ąāxi
We now show that these basis vectors are reciprocal. Observe that ~r = ~r(x1 , x2 , x3 ) with

‚ąā~r
d~r = dxm (1.2.27)
‚ąāxm

and consequently

‚ąā~r  
= E~i ¬∑ E
~ m dxm = őīm
i
dxm , i = 1, 2, 3 (1.2.28)
‚ąāxm

Comparing like terms in this last equation establishes the result that

E ~ m = őīi ,
~i ¬∑ E i, m = 1, 2, 3 (1.2.29)
m

40

## Scalars, Vectors and Tensors

Tensors are quantities which obey certain transformation laws. That is, scalars, vectors, matrices
and higher order arrays can be thought of as components of a tensor quantity. We shall be interested in
finding how these components are represented in various coordinate systems. We desire knowledge of these
transformation laws in order that we can represent various physical laws in a form which is independent of
the coordinate system chosen. Before defining different types of tensors let us examine what we mean by a
coordinate transformation.
Coordinate transformations of the type found in equations (1.2.8) and (1.2.9) can be generalized to
higher dimensions. Let xi , i = 1, 2, . . . , N denote N variables. These quantities can be thought of as
representing a variable point (x1 , x2 , . . . , xN ) in an N dimensional space VN . Another set of N quantities,
call them barred quantities, xi , i = 1, 2, . . . , N, can be used to represent a variable point (x1 , x2 , . . . , xN ) in
an N dimensional space V N . When the x0 s are related to the x0 s by equations of the form

xi = xi (x1 , x2 , . . . , xN ), i = 1, 2, . . . , N (1.2.30)

then a transformation is said to exist between the coordinates xi and xi , i = 1, 2, . . . , N. Whenever the
relations (1.2.30) are functionally independent, single valued and possess partial derivatives such that the
Jacobian of the transformation
‚ąāx1
  1 ‚ąāx1
... ‚ąāx1
x 1 2 N ‚ąāx ‚ąāx2 ‚ąāxN
x ,x ,...,x ..
J =J = ... .. (1.2.31)
x x1 , x2 , . . . , xN N . ... .
‚ąāx ‚ąāxN
... ‚ąāxN
‚ąāx1 ‚ąāx2 ‚ąāxN

## is different from zero, then there exists an inverse transformation

xi = xi (x1 , x2 , . . . , xN ), i = 1, 2, . . . , N. (1.2.32)

For brevity the transformation equations (1.2.30) and (1.2.32) are sometimes expressed by the notation

## ¬Į coordinates. For simplicity

Consider a sequence of transformations from x to xŐĄ and then from xŐĄ to xŐĄ
¬Į = z. If we denote by T1 , T2 and T3 the transformations
let xŐĄ = y and xŐĄ

T1 : y i = y i (x1 , . . . , xN ) i = 1, . . . , N or T1 x = y
i i 1 N
T2 : z = z (y , . . . , y ) i = 1, . . . , N or T2 y = z

Then the transformation T3 obtained by substituting T1 into T2 is called the product of two successive
transformations and is written

T3 : z i = z i (y 1 (x1 , . . . , xN ), . . . , y N (x1 , . . . , xN )) i = 1, . . . , N or T3 x = T2 T1 x = z.

## This product transformation is denoted symbolically by T3 = T2 T1 .

The Jacobian of the product transformation is equal to the product of Jacobians associated with the
product transformation and J3 = J2 J1 .
41

## Transformations Form a Group

A group G is a nonempty set of elements together with a law, for combining the elements. The combined
elements are denoted by a product. Thus, if a and b are elements in G then no matter how you define the
law for combining elements, the product combination is denoted ab. The set G and combining law forms a
group if the following properties are satisfied:
(i) For all a, b ‚ąą G, then ab ‚ąą G. This is called the closure property.
(ii) There exists an identity element I such that for all a ‚ąą G we have Ia = aI = a.
(iii) There exists an inverse element. That is, for all a ‚ąą G there exists an inverse element a‚ąí1 such that
a a‚ąí1 = a‚ąí1 a = I.
(iv) The associative law holds under the combining law and a(bc) = (ab)c for all a, b, c ‚ąą G.
For example, the set of elements G = {1, ‚ąí1, i, ‚ąíi}, where i2 = ‚ąí1 together with the combining law of
ordinary multiplication, forms a group. This can be seen from the multiplication table.

√ó 1 -1 i -i
1 1 -1 i -i
-1 -1 1 -i i
-i -i i 1 -1
i i -i -1 1

The set of all coordinate transformations of the form found in equation (1.2.30), with Jacobian different
from zero, forms a group because:
(i) The product transformation, which consists of two successive transformations, belongs to the set of
transformations. (closure)
(ii) The identity transformation exists in the special case that x and x are the same coordinates.
(iii) The inverse transformation exists because the Jacobian of each individual transformation is different
from zero.
(iv) The associative law is satisfied in that the transformations satisfy the property T3 (T2 T1 ) = (T3 T2 )T1 .
When the given transformation equations contain a parameter the combining law is often times repre-
sented as a product of symbolic operators. For example, we denote by TőĪ a transformation of coordinates
having a parameter őĪ. The inverse transformation can be denoted by TőĪ‚ąí1 and one can write TőĪ x = x or
x = TőĪ‚ąí1 x. We let Tő≤ denote the same transformation, but with a parameter ő≤, then the transitive property
is expressed symbolically by TőĪ Tő≤ = Tő≥ where the product TőĪ Tő≤ represents the result of performing two
successive transformations. The first coordinate transformation uses the given transformation equations and
uses the parameter őĪ in these equations. This transformation is then followed by another coordinate trans-
formation using the same set of transformation equations, but this time the parameter value is ő≤. The above
symbolic product is used to demonstrate that the result of applying two successive transformations produces
a result which is equivalent to performing a single transformation of coordinates having the parameter value
ő≥. Usually some relationship can then be established between the parameter values őĪ, ő≤ and ő≥.
42

## Figure 1.2-2. Cylindrical coordinates.

In this symbolic notation, we let Tőł denote the identity transformation. That is, using the parameter
value of őł in the given set of transformation equations produces the identity transformation. The inverse
transformation can then be expressed in the form of finding the parameter value ő≤ such that TőĪ Tő≤ = Tőł .

Cartesian Coordinates

## At times it is convenient to introduce an orthogonal Cartesian coordinate system having coordinates

i
y, i = 1, 2, . . . , N. This space is denoted EN and represents an N-dimensional Euclidean space. Whenever
the generalized independent coordinates xi , i = 1, . . . , N are functions of the y 0 s, and these equations are
functionally independent, then there exists independent transformation equations

y i = y i (x1 , x2 , . . . , xN ), i = 1, 2, . . . , N, (1.2.34)

with Jacobian different from zero. Similarly, if there is some other set of generalized coordinates, say a barred
system xi , i = 1, . . . , N where the x0 s are independent functions of the y 0 s, then there will exist another set
of independent transformation equations

y i = y i (x1 , x2 , . . . , xN ), i = 1, 2, . . . , N, (1.2.35)

with Jacobian different from zero. The transformations found in the equations (1.2.34) and (1.2.35) imply
that there exists relations between the x0 s and x0 s of the form (1.2.30) with inverse transformations of the
form (1.2.32). It should be remembered that the concepts and ideas developed in this section can be applied
to a space VN of any finite dimension. Two dimensional surfaces (N = 2) and three dimensional spaces
(N = 3) will occupy most of our applications. In relativity, one must consider spaces where N = 4.

## x = x(r, őł, z) = r cos őł y = y(r, őł, z) = r sin őł z = z(r, őł, z) = z

from rectangular coordinates (x, y, z) to cylindrical coordinates (r, őł, z), illustrated in the figure 1.2-2. By
letting
y 1 = x, y 2 = y, y3 = z x1 = r, x2 = őł, x3 = z
the above set of equations are examples of the transformation equations (1.2.8) with u = r, v = őł, w = z as
the generalized coordinates.
43

## EXAMPLE 1.2.2. (Spherical Coordinates) (ŌĀ, őł, ŌÜ)

Consider the transformation

## from rectangular coordinates (x, y, z) to spherical coordinates (ŌĀ, őł, ŌÜ). By letting

y 1 = x, y 2 = y, y 3 = z x1 = ŌĀ, x2 = őł , x3 = ŌÜ

the above set of equations has the form found in equation (1.2.8) with u = ŌĀ, v = őł, w = ŌÜ the generalized
coordinates. One could place bars over the x0 s in this example in order to distinguish these coordinates from
the x0 s of the previous example. The spherical coordinates (ŌĀ, őł, ŌÜ) are illustrated in the figure 1.2-3.

## Scalar Functions and Invariance

We are now at a point where we can begin to define what tensor quantities are. The first definition is
for a scalar invariant or tensor of order zero.
44

## Definition: ( Absolute scalar field) Assume there exists a coordinate

transformation of the type (1.2.30) with Jacobian J different from zero. Let
the scalar function
f = f (x1 , x2 , . . . , xN ) (1.2.36)

## be a function of the coordinates xi , i = 1, . . . , N in a space VN . Whenever

there exists a function
f = f (x1 , x2 , . . . , xN ) (1.2.37)

## which is a function of the coordinates xi , i = 1, . . . , N such that f = J W f,

then f is called a tensor of rank or order zero of weight W in the space VN .
Whenever W = 0, the scalar f is called the component of an absolute scalar
field and is referred to as an absolute tensor of rank or order zero.

That is, an absolute scalar field is an invariant object in the space VN with respect to the group of
coordinate transformations. It has a single component in each coordinate system. For any scalar function
of the type defined by equation (1.2.36), we can substitute the transformation equations (1.2.30) and obtain

## In VN consider a curve C defined by the set of parametric equations

C: xi = xi (t), i = 1, . . . , N

## where t is a parameter. The tangent vector to the curve C is the vector

 
dx1 dx2 dxN
T~ = , ,..., .
dt dt dt

In index notation, which focuses attention on the components, this tangent vector is denoted

dxi
Ti = , i = 1, . . . , N.
dt

For a coordinate transformation of the type defined by equation (1.2.30) with its inverse transformation
defined by equation (1.2.32), the curve C is represented in the barred space by

## xi = xi (x1 (t), x2 (t), . . . , xN (t)) = xi (t), i = 1, . . . , N,

with t unchanged. The tangent to the curve in the barred system of coordinates is represented by

## dxi ‚ąāxi dxj

= , i = 1, . . . , N. (1.2.39)
dt ‚ąāxj dt
45

i
Letting T , i = 1, . . . , N denote the components of this tangent vector in the barred system of coordinates,
the equation (1.2.39) can then be expressed in the form

i ‚ąāxi j
T = T , i, j = 1, . . . , N. (1.2.40)
‚ąāxj

This equation is said to define the transformation law associated with an absolute contravariant tensor of
rank or order one. In the case N = 3 the matrix form of this transformation is represented
Ô£ę Ô£ę 1 Ô£∂Ô£ę
1Ô£∂ ‚ąāx ‚ąāx1 ‚ąāx1 Ô£∂
T ‚ąāx1 ‚ąāx2 ‚ąāx3 T1
2 Ô£¨
Ô£≠ T Ô£ł = Ô£≠ ‚ąāx21 ‚ąāx2 ‚ąāx2 Ô£∑Ô£≠ 2Ô£ł
‚ąāx ‚ąāx2 ‚ąāx3 Ô£ł T (1.2.41)
3
T ‚ąāx3
1
‚ąāx3 ‚ąāx3 T3
‚ąāx ‚ąāx2 ‚ąāx3

## Definition: (Contravariant tensor) Whenever N quantities Ai in

i
a coordinate system (x1 , . . . , xN ) are related to N quantities A in a
coordinate system (x1 , . . . , xN ) such that the Jacobian J is different
from zero, then if the transformation law

i ‚ąāxi j
A = JW A
‚ąāxj

## is satisfied, these quantities are called the components of a relative tensor

of rank or order one with weight W . Whenever W = 0 these quantities
are called the components of an absolute tensor of rank or order one.

We see that the above transformation law satisfies the group properties.

## EXAMPLE 1.2-3. (Transitive Property of Contravariant Transformation)

Show that successive contravariant transformations is also a contravariant transformation.
Solution: Consider the transformation of a vector from an unbarred to a barred system of coordinates. A
vector or absolute tensor of rank one Ai = Ai (x), i = 1, . . . , N will transform like the equation (1.2.40) and

i ‚ąāxi j
A (x) = A (x). (1.2.42)
‚ąāxj

## Another transformation from x ‚Üí x coordinates will produce the components

i
i ‚ąāx j
A (x) = A (x) (1.2.43)
‚ąāxj

Here we have used the notation Aj (x) to emphasize the dependence of the components Aj upon the x
coordinates. Changing indices and substituting equation (1.2.42) into (1.2.43) we find

i
i ‚ąāx ‚ąāxj m
A (x) = A (x). (1.2.44)
‚ąāxj ‚ąāxm
46

## From the fact that

i i
‚ąāx ‚ąāxj ‚ąāx
= ,
‚ąāxj ‚ąāxm ‚ąāxm
the equation (1.2.44) simplifies to
i
i ‚ąāx m
A (x) = A (x) (1.2.45)
‚ąāxm
and hence this transformation is also contravariant. We express this by saying that the above are transitive
with respect to the group of coordinate transformations.
Note that from the chain rule one can write

## ‚ąāxm ‚ąāxj ‚ąāxm ‚ąāx1 ‚ąāxm ‚ąāx2 ‚ąāxm ‚ąāx3 ‚ąāxm m

j ‚ąāxn = 1 ‚ąāxn + 2 ‚ąāxn + 3 ‚ąāxn = ‚ąāxn = őīn .
‚ąāx ‚ąāx ‚ąāx ‚ąāx

## ‚ąāxm ‚ąāx2 ‚ąāxm ‚ąāxm ‚ąāx3 ‚ąāxm

2 ‚ąāxn = ‚ąāxn or 3 ‚ąāxn = ‚ąāxn
‚ąāx ‚ąāx

as these expressions are incorrect. Note that there are no summations in these terms, whereas there is a
summation index in the representation of the chain rule.

## Vector Transformation, Covariant Components

Consider a scalar invariant A(x) = A(x) which is a shorthand notation for the equation

A(x1 , x2 , . . . , xn ) = A(x1 , x2 , . . . , xn )

involving the coordinate transformation of equation (1.2.30). By the chain rule we differentiate this invariant
and find that the components of the gradient must satisfy

‚ąāA ‚ąāA ‚ąāxj
i = ‚ąāxj . (1.2.46)
‚ąāx ‚ąāxi

Let
‚ąāA ‚ąāA
Aj = and Ai = ,
‚ąāxj ‚ąāxi
then equation (1.2.46) can be expressed as the transformation law

‚ąāxj
Ai = Aj . (1.2.47)
‚ąāxi

This is the transformation law for an absolute covariant tensor of rank or order one. A more general definition
is
47

## Definition: (Covariant tensor) Whenever N quantities Ai in a

1 N
coordinate system (x , . . . , x ) are related to N quantities Ai in a co-
ordinate system (x1 , . . . , xN ), with Jacobian J different from zero, such
that the transformation law

‚ąāxj
Ai = J W Aj (1.2.48)
‚ąāxi

## is satisfied, then these quantities are called the components of a relative

covariant tensor of rank or order one having a weight of W . When-
ever W = 0, these quantities are called the components of an absolute
covariant tensor of rank or order one.

Again we note that the above transformation satisfies the group properties. Absolute tensors of rank or
order one are referred to as vectors while absolute tensors of rank or order zero are referred to as scalars.
EXAMPLE 1.2-4. (Transitive Property of Covariant Transformation)
Consider a sequence of transformation laws of the type defined by the equation (1.2.47)

‚ąāxj
x‚Üíx Ai (x) = Aj (x)
‚ąāxi
x‚Üíx ‚ąāxm
Ak (x) = Am (x) k
‚ąāx
We can therefore express the transformation of the components associated with the coordinate transformation
x ‚Üí x and  
‚ąāxj ‚ąāxm ‚ąāxj
Ak (x) = Aj (x) m k
= Aj (x) k
,
‚ąāx ‚ąāx ‚ąāx
which demonstrates the transitive property of a covariant transformation.

## Higher Order Tensors

We have shown that first order tensors are quantities which obey certain transformation laws. Higher
order tensors are defined in a similar manner and also satisfy the group properties. We assume that we are
given transformations of the type illustrated in equations (1.2.30) and (1.2.32) which are single valued and
continuous with Jacobian J different from zero. Further, the quantities xi and xi , i = 1, . . . , n represent the
coordinates in any two coordinate systems. The following transformation laws define second order and third
order tensors.
48

## Definition: (Second order contravariant tensor) Whenever N-squared quantities Aij

mn
in a coordinate system (x1 , . . . , xN ) are related to N-squared quantities A in a coordinate
1 N
system (x , . . . , x ) such that the transformation law

mn ‚ąāxm ‚ąāxn
A (x) = Aij (x)J W (1.2.49)
‚ąāxi ‚ąāxj

is satisfied, then these quantities are called components of a relative contravariant tensor of
rank or order two with weight W . Whenever W = 0 these quantities are called the components
of an absolute contravariant tensor of rank or order two.

## Definition: (Second order covariant tensor) Whenever N-squared quantities

Aij in a coordinate system (x1 , . . . , xN ) are related to N-squared quantities Amn
in a coordinate system (x1 , . . . , xN ) such that the transformation law

‚ąāxi ‚ąāxj
Amn (x) = Aij (x)J W (1.2.50)
‚ąāxm ‚ąāxn

is satisfied, then these quantities are called components of a relative covariant tensor
of rank or order two with weight W . Whenever W = 0 these quantities are called
the components of an absolute covariant tensor of rank or order two.

## Definition: (Second order mixed tensor) Whenever N-squared quantities

1 m
Aij N
in a coordinate system (x , . . . , x ) are related to N-squared quantities An in
a coordinate system (x1 , . . . , xN ) such that the transformation law

m ‚ąāxm ‚ąāxj
An (x) = Aij (x)J W (1.2.51)
‚ąāxi ‚ąāxn

is satisfied, then these quantities are called components of a relative mixed tensor of
rank or order two with weight W . Whenever W = 0 these quantities are called the
components of an absolute mixed tensor of rank or order two. It is contravariant
of order one and covariant of order one.

Higher order tensors are defined in a similar manner. For example, if we can find N-cubed quantities
Am
np such that
i ‚ąāxi ‚ąāxőĪ ‚ąāxő≤
Ajk (x) = Aő≥őĪő≤ (x)J W (1.2.52)
‚ąāxő≥ ‚ąāxj ‚ąāxk
then this is a relative mixed tensor of order three with weight W . It is contravariant of order one and
covariant of order two.
49

General Definition

Tji11ji22...j
...im
n
(1.2.53)

## is contravariant of order m and covariant of order n if it obeys the transformation law

h  x iW i1
‚ąāxi2 ‚ąāxim ‚ąāxb1 ‚ąāxb2 ‚ąāxbn
i1 i2 ...im ...am ‚ąāx
T j1 j2 ...jn = J Tba11ba22...b ¬∑ ¬∑ ¬∑ ¬∑ ¬∑ ¬∑ ¬∑ (1.2.54)
x n
‚ąāxa1 ‚ąāxa2 ‚ąāxam ‚ąāxj1 ‚ąāxj2 ‚ąāxjn

where x
‚ąāx ‚ąā(x1 , x2 , . . . , xN )
J = =
x ‚ąāx ‚ąā(x1 , x2 , . . . , xN )
is the Jacobian of the transformation. When W = 0 the tensor is called an absolute tensor, otherwise it is
called a relative tensor of weight W.
Here superscripts are used to denote contravariant components and subscripts are used to denote covari-
ant components. Thus, if we are given the tensor components in one coordinate system, then the components
in any other coordinate system are determined by the transformation law of equation (1.2.54). Throughout
the remainder of this text one should treat all tensors as absolute tensors unless specified otherwise.

Note that vectors can be represented in bold face type with the notation

A = Ai Ei

This notation can also be generalized to tensor quantities. Higher order tensors can also be denoted by bold
face type. For example the tensor components Tij and Bijk can be represented in terms of the basis vectors
Ei , i = 1, . . . , N by using a notation which is similar to that for the representation of vectors. For example,

T = Tij Ei Ej
B = Bijk Ei Ej Ek .

Here T denotes a tensor with components Tij and B denotes a tensor with components Bijk . The quantities
Ei Ej are called unit dyads and Ei Ej Ek are called unit triads. There is no multiplication sign between the
basis vectors. This notation is called a polyad notation. A further generalization of this notation is the
representation of an arbitrary tensor using the basis and reciprocal basis vectors in bold type. For example,
a mixed tensor would have the polyadic representation

ij...k
T = Tlm...n Ei Ej . . . Ek El Em . . . En .

A dyadic is formed by the outer or direct product of two vectors. For example, the outer product of the
vectors
a = a 1 E 1 + a2 E 2 + a3 E 3 and b = b1 E1 + b2 E2 + b3 E3
50

ab =a1 b1 E1 E1 + a1 b2 E1 E2 + a1 b3 E1 E3
a2 b 1 E 2 E 1 + a2 b 2 E 2 E 2 + a2 b 3 E 2 E 3
a3 b 1 E 3 E 1 + a3 b 2 E 3 E 2 + a3 b 3 E 3 E 3 .
In general, a dyad can be represented

A = Aij Ei Ej i, j = 1, . . . , N

where the summation convention is in effect for the repeated indices. The coefficients Aij are called the
coefficients of the dyad. When the coefficients are written as an N √ó N array it is called a matrix. Every
second order tensor can be written as a linear combination of dyads. The dyads form a basis for the second
order tensors. As the example above illustrates, the nine dyads {E1 E1 , E1 E2 , . . . , E3 E3 }, associated with
the outer products of three dimensional base vectors, constitute a basis for the second order tensor A = ab
having the components Aij = ai bj with i, j = 1, 2, 3. Similarly, a triad has the form

## T = Tijk Ei Ej Ek Sum on repeated indices

where i, j, k have the range 1, 2, . . . , N. The set of outer or direct products { Ei Ej Ek }, with i, j, k = 1, . . . , N
i
constitutes a basis for all third order tensors. Tensor components with mixed suffixes like Cjk are associated
with triad basis of the form
i
C = Cjk Ei Ej Ek

where i, j, k have the range 1, 2, . . . N. Dyads are associated with the outer product of two vectors, while triads,
tetrads,... are associated with higher-order outer products. These higher-order outer or direct products are
The polyad notation is a generalization of the vector notation. The subject of how polyad components
transform between coordinate systems is the subject of tensor calculus.

## In Cartesian coordinates we have Ei = Ei = b

A = Aij b
ei b
ej or
A =A11 b
e1 b
e1 + A12 b
e1 b
e2 + A13 b
e1 b
e3
A21 b
e2 b
e1 + A22 b
e2 b
e2 + A23 b
e2 b
e3
A31 b
e3 b
e1 + A32 b
e3 b
e2 + A33 b
e3 b
e3
ei b
where the terms b ej are called unit dyads. Note that a dyadic has nine components as compared with a
vector which has only three components. The conjugate dyadic Ac is defined by a transposition of the unit
vectors in A, to obtain
Ac =A11 b
e1 b
e1 + A12 b
e2 b
e1 + A13 b
e3 b
e1
A21 b
e1 b
e2 + A22 b
e2 b
e2 + A23 b
e3 b
e2
A31 b
e1 b
e3 + A32 b
e2 b
e3 + A33 b
e3 b
e3
51

If a dyadic equals its conjugate A = Ac , then Aij = Aji and the dyadic is called symmetric. If a dyadic
equals the negative of its conjugate A = ‚ąíAc , then Aij = ‚ąíAji and the dyadic is called skew-symmetric. A
e1 b
J= b e2 b
e1 + b e3 b
e2 + b e3 .
~ produces the
This dyadic has the property that pre or post dot product multiplication of J with a vector V
same vector V~ . For example,
~ ¬∑ J = (V1 b
V e1 + V2 b
e2 + V3 b
e3 ) ¬∑ J
= V1 b e1 b
e1 ¬∑ b e1 + V2 b e2 b
e2 ¬∑ b e2 + V3 b e3 b
e3 ¬∑ b ~
e3 = V
~ = J ¬∑ (V1 b
and J ¬∑ V e1 + V2 b
e2 + V3 b
e3 )
= V1 b e1 ¬∑ b
e1 b e1 + V2 b e2 ¬∑ b
e2 b e2 + V3 b e3 ¬∑ b
e3 b ~
e3 = V
A dyadic operation often used in physics and chemistry is the double dot product A : B where A and
B are both dyadics. Here both dyadics are expanded using the distributive law of multiplication, and then
ej : b
ei b
each unit dyad pair b em b
en are combined according to the rule
ei b
b em b
ej : b en = ( b
ei ¬∑ b
em )( b
ej ¬∑ b
en ).
For example, if A = Aij b
ei b
ej and B = Bij b
ei b
ej , then the double dot product A : B is calculated as follows.
A : B = (Aij b
ei b
ej ) : (Bmn b
em b ei b
en ) = Aij Bmn ( b em b
ej : b en ) = Aij Bmn ( b
ei ¬∑ b
em )( b
ej ¬∑ b
en )
= Aij Bmn őīim őījn = Amj Bmj
= A11 B11 + A12 B12 + A13 B13
+ A21 B21 + A22 B22 + A23 B23
+ A31 B31 + A32 B32 + A33 B33
~ = Ai b
components are represented. For example, for A ~ = Bi b
ei and B ei vectors with outer product
~B
A ~ = Am Bn b
em b
en = ŌÜ
there is produced the dyadic ŌÜ with components Am Bn . In comparison, the outer product
~A
B ~ = Bm An b
em b
en = Ōą
produces the dyadic Ōą with components Bm An . That is
ŌÜ=A ~B~ =A1 B1 be1 b
e1 + A1 B2 b
e1 b
e2 + A1 B3 b
e1 b
e3
A2 B1 b
e2 b
e1 + A2 B2 b
e2 b
e2 + A2 B3 b
e2 b
e3
A3 B1 b
e3 b
e1 + A3 B2 b
e3 b
e2 + A3 B3 b
e3 b
e3
~A
and Ōą = B ~ =B1 A1 b
e1 b
e1 + B1 A2 b
e1 b
e2 + B1 A3 b
e1 b
e3
B2 A1 b
e2 b
e1 + B2 A2 b
e2 b
e2 + B2 A3 b
e2 b
e3
B3 A1 b
e3 b
e1 + B3 A2 b
e3 b
e2 + B3 A3 b
e3 b
e3
~ is defined for both pre and post multiplication as
The scalar dot product of a dyad with a vector C
ŌÜ¬∑C~ =A ~B~ ¬∑C
~ =A(~ B~ ¬∑ C)
~

C~ ¬∑ŌÜ=C
~ ¬∑A
~B~ =(C
~ ¬∑ A)
~ B~
These products are, in general, not equal.
52

## Operations Using Tensors

The following are some important tensor operations which are used to derive special equations and to
prove various identities.

Tensors of the same type and weight can be added or subtracted. For example, two third order mixed
tensors, when added, produce another third order mixed tensor. Let Aijk and Bjk
i
denote two third order
mixed tensors. Their sum is denoted
i
Cjk = Aijk + Bjk
i
.
That is, like components are added. The sum is also a mixed tensor as we now verify. By hypothesis Aijk
i
and Bjk are third order mixed tensors and hence must obey the transformation laws
i ‚ąāxi ‚ąāxn ‚ąāxp
Ajk = Am
np
‚ąāxm ‚ąāxj ‚ąāxk
i n p
i m ‚ąāx ‚ąāx ‚ąāx
B jk = Bnp j .
‚ąāxm ‚ąāx ‚ąāxk
i i i
We let C jk = Ajk + B jk denote the sum in the transformed coordinates. Then the addition of the above
transformation equations produces
 i   ‚ąāxi ‚ąāxn ‚ąāxp i n p
i i m ‚ąāx ‚ąāx ‚ąāx
C jk = Ajk + B jk = Am np + B m
np = Cnp .
‚ąāxm ‚ąāxj ‚ąāxk ‚ąāxm ‚ąāxj ‚ąāxk
Consequently, the sum transforms as a mixed third order tensor.

## Multiplication (Outer Product)

The product of two tensors is also a tensor. The rank or order of the resulting tensor is the sum of
the ranks of the tensors occurring in the multiplication. As an example, let Aijk denote a mixed third order
l
tensor and let Bm denote a mixed second order tensor. The outer product of these two tensors is the fifth
order tensor
il
Cjkm = Aijk Bm
l
, i, j, k, l, m = 1, 2, . . . , N.
i l
Here all indices are free indices as i, j, k, l, m take on any of the integer values 1, 2, . . . , N. Let Ajk and B m
il
denote the components of the given tensors in the barred system of coordinates. We define C jkm as the
il
outer product of these components. Observe that Cjkm is a tensor for by hypothesis Aijk and Bm
l
are tensors
and hence obey the transformation laws
‚ąāxőĪ ‚ąāxj ‚ąāxk
őĪ
Aő≤ő≥ = Aijk
‚ąāxi ‚ąāxő≤ ‚ąāxő≥ (1.2.55)
őī m
őī l ‚ąāx ‚ąāx
B  = Bm .
‚ąāxl ‚ąāx
The outer product of these components produces
őĪőī őĪ őī ‚ąāxőĪ ‚ąāxj ‚ąāxk ‚ąāxőī ‚ąāxm
C ő≤ő≥ = Aő≤ő≥ B  = Aijk Bm
l
‚ąāxi ‚ąāxő≤ ‚ąāxő≥ ‚ąāxl ‚ąāx
(1.2.56)
il ‚ąāxőĪ ‚ąāxj ‚ąāxk ‚ąāxőī ‚ąāxm
= Cjkm i
‚ąāx ‚ąāxő≤ ‚ąāxő≥ ‚ąāxl ‚ąāx
il
which demonstrates that Cjkm transforms as a mixed fifth order absolute tensor. Other outer products are
analyzed in a similar way.
53

Contraction

The operation of contraction on any mixed tensor of rank m is performed when an upper index is
set equal to a lower index and the summation convention is invoked. When the summation is performed
over the repeated indices the resulting quantity is also a tensor of rank or order (m ‚ąí 2). For example, let
Aijk , i, j, k = 1, 2, . . . , N denote a mixed tensor and perform a contraction by setting j equal to i. We obtain
Aiik = A11k + A22k + ¬∑ ¬∑ ¬∑ + AN
N k = Ak (1.2.57)
i
where k is a free index. To show that Ak is a tensor, we let Aik = Ak denote the contraction on the
transformed components of Aijk . By hypothesis Aijk is a mixed tensor and hence the components must
satisfy the transformation law
i‚ąāxi ‚ąāxn ‚ąāxp
Ajk = Am
np .
‚ąāxm ‚ąāxj ‚ąāxk
Now execute a contraction by setting j equal to i and perform a summation over the repeated index. We
find
i ‚ąāxi ‚ąāxn ‚ąāxp ‚ąāxn ‚ąāxp
Aik = Ak = Am
np i k
= Am np
m
‚ąāx ‚ąāx ‚ąāx ‚ąāxm ‚ąāxk (1.2.58)
p p
m n ‚ąāx n ‚ąāx ‚ąāxp
= Anp őīm k = Anp k = Ap k .
‚ąāx ‚ąāx ‚ąāx
Hence, the contraction produces a tensor of rank two less than the original tensor. Contractions on other
mixed tensors can be analyzed in a similar manner.
New tensors can be constructed from old tensors by performing a contraction on an upper and lower
index. This process can be repeated as long as there is an upper and lower index upon which to perform the
contraction. Each time a contraction is performed the rank of the resulting tensor is two less than the rank
of the original tensor.

## The inner product of two tensors is obtained by:

(i) first taking the outer product of the given tensors and
(ii) performing a contraction on two of the indices.

## EXAMPLE 1.2-5. (Inner product)

Let Ai and Bj denote the components of two first order tensors (vectors). The outer product of these
tensors is
Cji = Ai Bj , i, j = 1, 2, . . . , N.
The inner product of these tensors is the scalar
C = Ai Bi = A1 B1 + A2 B2 + ¬∑ ¬∑ ¬∑ + AN BN .

Note that in some situations the inner product is performed by employing only subscript indices. For
example, the above inner product is sometimes expressed as

C = Ai Bi = A1 B1 + A2 B2 + ¬∑ ¬∑ ¬∑ AN BN .
This notation is discussed later when Cartesian tensors are considered.
54

Quotient Law

Assume Brqs and Cps are arbitrary absolute tensors. Further assume we have a quantity A(ijk) which
we think might be a third order mixed tensor Aijk . By showing that the equation

## Arqp Brqs = Cps

is satisfied, then it follows that Arqp must be a tensor. This is an example of the quotient law. Obviously,
this result can be generalized to apply to tensors of any order or rank. To prove the above assertion we shall
show from the above equation that Aijk is a tensor. Let xi and xi denote a barred and unbarred system of
coordinates which are related by transformations of the form defined by equation (1.2.30). In the barred
system, we assume that
r qs s
Aqp B r = C p (1.2.59)

## where by hypothesis Bkij and Cm

l
are arbitrary absolute tensors and therefore must satisfy the transformation
equations
qs ‚ąāxq ‚ąāxs ‚ąāxk
B r = Bkij
‚ąāxi ‚ąāxj ‚ąāxr
s
s l ‚ąāx ‚ąāxm
C p = Cm .
‚ąāx ‚ąāxp
l
qs s
We substitute for B r and C p in the equation (1.2.59) and obtain the equation
 q s k
  s m

r ij ‚ąāx ‚ąāx ‚ąāx l ‚ąāx ‚ąāx
Aqp Bk = Cm l
‚ąāxi ‚ąāxj ‚ąāxr ‚ąāx ‚ąāxp
‚ąāxs ‚ąāxm
= Arqm Brql l .
‚ąāx ‚ąāxp
Since the summation indices are dummy indices they can be replaced by other symbols. We change l to j,
q to i and r to k and write the above equation as
 
‚ąāxs q
r ‚ąāx ‚ąāx
k
k ‚ąāx
m
Aqp i ‚ąí Aim p Bkij = 0.
‚ąāxj ‚ąāx ‚ąāxr ‚ąāx
‚ąāxn
Use inner multiplication by ‚ąāxs and simplify this equation to the form
 q k m

r ‚ąāx ‚ąāx k ‚ąāx
n
őīj Aqp i ‚ąí Aim p Bkij = 0 or
‚ąāx ‚ąāxr ‚ąāx
 q k m

r ‚ąāx ‚ąāx k ‚ąāx
Aqp i ‚ąí Aim p Bkin = 0.
‚ąāx ‚ąāxr ‚ąāx

Because Bkin is an arbitrary tensor, the quantity inside the brackets is zero and therefore

r ‚ąāxq ‚ąāxk k ‚ąāx
m
Aqp r ‚ąí Aim = 0.
i
‚ąāx ‚ąāx ‚ąāxp
‚ąāxi ‚ąāxl
This equation is simplified by inner multiplication by ‚ąāxj ‚ąāxk
to obtain

## r ‚ąāxm ‚ąāxi ‚ąāxl

őījq őīrl Aqp ‚ąí Akim =0 or
‚ąāxp ‚ąāxj ‚ąāxk
l ‚ąāxm ‚ąāxi ‚ąāxl
Ajp = Akim p
‚ąāx ‚ąāxj ‚ąāxk
which is the transformation law for a third order mixed tensor.
55

EXERCISE 1.2

## I 1. Consider the transformation equations representing a rotation of axes through an angle őĪ.


x1 = x1 cos őĪ ‚ąí x2 sin őĪ
TőĪ :
x2 = x1 sin őĪ + x2 cos őĪ

Treat őĪ as a parameter and show this set of transformations constitutes a group by finding the value of őĪ
which:
(i) gives the identity transformation.
(ii) gives the inverse transformation.
(iii) show the transformation is transitive in that a transformation with őĪ = őł1 followed by a transformation
with őĪ = őł2 is equivalent to the transformation using őĪ = őł1 + őł2 .
I 2. Show the transformation 
x1 = őĪx1
TőĪ :
x2 = őĪ1 x2
forms a group with őĪ as a parameter. Find the value of őĪ such that:
(i) the identity transformation exists.
(ii) the inverse transformation exists.
(iii) the transitive property is satisfied.
I 3. Show the given transformation forms a group with parameter őĪ.
( x1
x1 = 1‚ąíőĪx1
TőĪ :
x2
x2 = 1‚ąíőĪx1

I 4. Consider the Lorentz transformation from relativity theory having the velocity parameter V, c is the
speed of light and x4 = t is time. Ô£Ī 1 1 4
x ‚ąíV x
Ô£ī
Ô£ī x = p
Ô£ī
Ô£ī
V2
1‚ąí
Ô£ī
Ô£ī
c2
Ô£≤ x2 = x2
TV :
Ô£ī
Ô£ī x3 = x3
Ô£ī
Ô£ī
Ô£ī
Ô£ī x4 x4 ‚ąí Vcx2
1

Ô£≥ = p 2
1‚ąí V2
c

## Show this set of transformations constitutes a group, by establishing:

(i) V = 0 gives the identity transformation T0 .
(ii) TV2 ¬∑ TV1 = T0 requires that V2 = ‚ąíV1 .
(iii) TV2 ¬∑ TV1 = TV3 requires that
V1 + V2
V3 = .
1 + V1c2V2

I 5. ~ 1, E
For (E ~ 2, E
~ 3 ) an arbitrary independent basis, (a) Verify that

~1 = 1 E
E ~2 √ó E
~ 3, ~2 = 1 E
E ~3 √ó E
~ 1, ~3 = 1 E
E ~1 √ó E
~2
V V V

~ 1 ¬∑ (E
is a reciprocal basis, where V = E ~2 √ó E
~ 3) (b) Show that E ~ i.
~ j = g ij E
56

## I 6. For the cylindrical coordinates (r, ő≤, z) illustrated in the figure 1.2-4.

(a) Write out the transformation equations from rectangular (x, y, z) coordinates to cylindrical (r, ő≤, z)
coordinates. Also write out the inverse transformation.
(b) Determine the following basis vectors in cylindrical coordinates and represent your results in terms of
cylindrical coordinates.
(i) The tangential basis E ~ 2, E
~ 1, E ~ 1, E
~ 3 . (ii)The normal basis E ~ 2, E
~ 3 . (iii) eŐār , eŐāő≤ , eŐāz
where eŐār , eŐāő≤ , eŐāz are normalized vectors in the directions of the tangential basis.
(c) A vector A ~ = Ax b
e1 + Ay b
e2 + Az b
e3 can be represented in any of the forms:

~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~3
~ 1 + A2 E
~ = A1 E
A ~ 2 + A3 E
~3
~ = Ar eŐār + Aő≤ eŐāő≤ + Az eŐāz
A

## depending upon the basis vectors selected . In terms of the components Ax , Ay , Az

(i) Solve for the contravariant components A1 , A2 , A3 .
(ii) Solve for the covariant components A1 , A2 , A3 .
(iii) Solve for the components Ar , Aő≤ , Az . Express all results in cylindrical coordinates. (Note the
components Ar , Aő≤ , Az are referred to as physical components. Physical components are considered in
more detail in a later section.)
57

## I 7. For the spherical coordinates (ŌĀ, őĪ, ő≤) illustrated in the figure 1.2-5.

(a) Write out the transformation equations from rectangular (x, y, z) coordinates to spherical (ŌĀ, őĪ, ő≤) co-
ordinates. Also write out the equations which describe the inverse transformation.
(b) Determine the following basis vectors in spherical coordinates
(i) The tangential basis E ~ 2, E
~ 1, E ~ 3.
~ 1, E
(ii) The normal basis E ~ 2, E
~ 3.
(iii) eŐāŌĀ , eŐāőĪ , eŐāő≤ which are normalized vectors in the directions of the tangential basis. Express all results
in terms of spherical coordinates.
(c) A vector A~ = Ax b
e1 + Ay be2 + Az be3 can be represented in any of the forms:

~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~3
~ 1 + A2 E
~ = A1 E
A ~ 2 + A3 E
~3
~ = AŌĀ eŐāŌĀ + AőĪ eŐāőĪ + Aő≤ eŐāő≤
A

depending upon the basis vectors selected . Calculate, in terms of the coordinates (ŌĀ, őĪ, ő≤) and the
components Ax , Ay , Az
(i) The contravariant components A1 , A2 , A3 .
(ii) The covariant components A1 , A2 , A3 .
(iii) The components AŌĀ , AőĪ , Aő≤ which are called physical components.

I 8. Work the problems 6,7 and then let (x1 , x2 , x3 ) = (r, ő≤, z) denote the coordinates in the cylindrical
system and let (x1 , x2 , x3 ) = (ŌĀ, őĪ, ő≤) denote the coordinates in the spherical system.
(a) Write the transformation equations x ‚Üí x from cylindrical to spherical coordinates. Also find the
inverse transformations. ( Hint: See the figures 1.2-4 and 1.2-5.)
(b) Use the results from part (a) and the results from problems 6,7 to verify that

‚ąāxj
Ai = Aj for i = 1, 2, 3.
‚ąāxi

## (i.e. Substitute Aj from problem 6 to get AŐĄi given in problem 7.)

58

(c) Use the results from part (a) and the results from problems 6,7 to verify that

i ‚ąāxi
A = Aj for i = 1, 2, 3.
‚ąāxj

## (i.e. Substitute Aj from problem 6 to get AŐĄi given by problem 7.)

I 9. Pick two arbitrary noncolinear vectors in the x, y plane, say

~1 = 5 b
V e1 + b
e2 ~2 = b
and V e1 + 5 b
e2

~3 = b
and let V e3 be a unit vector perpendicular to both V~1 and V
~2 . The vectors V
~1 and V~2 can be thought of
as defining an oblique coordinate system, as illustrated in the figure 1.2-6.
~ 1 , V~ 2 , V~ 3 ).
(a) Find the reciprocal basis (V
(b) Let
e1 + y b
~r = x b e2 + z b
e3 = őĪV~1 + ő≤ V~2 + ő≥ V
~3

## and show that

5x y
őĪ= ‚ąí
24 24
x 5y
ő≤=‚ąí +
24 24
ő≥=z
(c) Show
x = 5őĪ + ő≤
y = őĪ + 5ő≤
z=ő≥
(d) For ő≥ = ő≥0 constant, show the coordinate lines are described by őĪ = constant and ő≤ = constant,
and sketch some of these coordinate lines. (See figure 1.2-6.)
(e) Find the metrics gij and conjugate metrices g ij associated with the (őĪ, ő≤, ő≥) space.

59

x = x(u, v, w)
y = y(u, v, w)
z = z(u, v, w)

## substituted into the position vector

e1 + y b
~r = x b e2 + z b
e3 .

## Define the basis vectors  

~ 2, E
~ 1, E ~ 3) = ‚ąā~r ‚ąā~r ‚ąā~r
(E , ,
‚ąāu ‚ąāv ‚ąāw
with the reciprocal basis

~1 = 1 E
E ~2 √ó E
~ 3, ~2 = 1 E
E ~3 √ó E
~ 1, ~3 = 1 E
E ~1 √ó E
~ 2.
V V V

where
V =E ~2 √ó E
~ 1 ¬∑ (E ~ 3 ).

~ 1 ¬∑ (E
Let v = E ~2 √ó E
~ 3 ) and show that v ¬∑ V = 1.
I 11. Given the coordinate transformation

x = ‚ąíu ‚ąí 2v y = ‚ąíu ‚ąí v z=z

## (a) Find and illustrate graphically some of the coordinate curves.

(b) For ~r = ~r(u, v, z) a position vector, define the basis vectors

~ 1 = ‚ąā~r ,
E ~ 2 = ‚ąā~r ,
E ~ 3 = ‚ąā~r .
E
‚ąāu ‚ąāv ‚ąāz

~ 1, E
Calculate these vectors and then calculate the reciprocal basis E ~ 2, E
~ 3.
(c) With respect to the basis vectors in (b) find the contravariant components Ai associated with the vector

~ = őĪ1 b
A e1 + őĪ2 b
e2 + őĪ3 b
e3

## where (őĪ1 , őĪ2 , őĪ3 ) are constants.

~ given in part (c).
(d) Find the covariant components Ai associated with the vector A
(e) Calculate the metric tensor gij and conjugate metric tensor g ij .
(f) From the results (e), verify that gij g jk = őīik
(g) Use the results from (c)(d) and (e) to verify that Ai = gik Ak
(h) Use the results from (c)(d) and (e) to verify that Ai = g ik Ak
~ on unit vectors in the directions E
(i) Find the projection of the vector A ~ 1, E
~ 2, E
~ 3.
~ 1, E
~ on unit vectors the directions E
(j) Find the projection of the vector A ~ 2, E
~ 3.
60

## I 12. ei where y i = y i (x1 , x2 , x3 ), i = 1, 2, 3 we have by definition

For ~r = y i b

i m
~ j = ‚ąā~r = ‚ąāy b
E ei . From this relation show that ~ m = ‚ąāx b
E ej
‚ąāxj ‚ąāxj ‚ąāy j

and consequently
m m i j
gij = E ~ j = ‚ąāy ‚ąāy ,
~i ¬∑ E ~ j = ‚ąāx ‚ąāx ,
~i ¬∑ E
and g ij = E i, j, m = 1, . . . , 3
‚ąāxi ‚ąāxj ‚ąāy m ‚ąāy m

## I 13. Consider the set of all coordinate transformations of the form

y i = aij xj + bi

where aij and bi are constants and the determinant of aij is different from zero. Show this set of transforma-
tions forms a group.

## I 14. For őĪi , ő≤i constants and t a parameter, xi = őĪi + t ő≤i ,i = 1, 2, 3 is the parametric representation of

a straight line. Find the parametric equation of the line which passes through the two points (1, 2, 3) and
(14, 7, ‚ąí3). What does the vector d~
r
dt represent?

I 15. A surface can be represented using two parameters u, v by introducing the parametric equations

## xi = xi (u, v), i = 1, 2, 3, a < u < b and c < v < d.

The parameters u, v are called the curvilinear coordinates of a point on the surface. A point on the surface
can be represented by the position vector ~r = ~r(u, v) = x1 (u, v) b
e1 + x2 (u, v) b
e2 + x3 (u, v) b
e3 . The vectors ‚ąā~
r
‚ąāu
‚ąā~
r
and ‚ąāv are tangent vectors to the coordinate surface curves ~r(u, c2 ) and ~r(c1 , v) respectively. An element of
surface area dS on the surface is defined as the area of the elemental parallelogram having the vector sides
‚ąā~
r ‚ąā~
r
‚ąāu du and ‚ąāv dv. Show that

‚ąā~r ‚ąā~r p
dS = | √ó | dudv = g11 g22 ‚ąí (g12 )2 dudv
‚ąāu ‚ąāv

where
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
g11 = ¬∑ g12 = ¬∑ g22 = ¬∑ .
‚ąāu ‚ąāu ‚ąāu ‚ąāv ‚ąāv ‚ąāv
~ √ó B)
Hint: (A ~ ¬∑ (A
~ √ó B)
~ = |A ~ 2 See Exercise 1.1, problem 9(c).
~ √ó B|

I 16.
(a) Use the results from problem 15 and find the element of surface area of the circular cone

## x = u sin őĪ cos v y = u sin őĪ sin v z = u cos őĪ

őĪ a constant 0‚Č§u‚Č§b 0 ‚Č§ v ‚Č§ 2ŌÄ

## (b) Find the surface area of the above cone.

61

I 17. The equation of a plane is defined in terms of two parameters u and v and has the form

xi = őĪi u + ő≤i v + ő≥i i = 1, 2, 3,

where őĪi ő≤i and ő≥i are constants. Find the equation of the plane which passes through the points (1, 2, 3),
(14, 7, ‚ąí3) and (5, 5, 5). What does this problem have to do with the position vector ~r(u, v), the vectors
‚ąā~
r ‚ąā~ r
‚ąāu , ‚ąāv and ~r(0, 0)? Hint: See problem 15.

I 18. Determine the points of intersection of the curve x1 = t, x2 = (t)2 , x3 = (t)3 with the plane

8 x1 ‚ąí 5 x2 + x3 ‚ąí 4 = 0.

I 19. ~k = E
Verify the relations V eijk E ~i √ó E
~j and v ‚ąí1 eijk E
~k = E
~i √ó E ~ 1 ¬∑ (E
~ j where v = E ~2 √ó E
~ 3 ) and
V =E ~2 √ó E
~ 1 ¬∑ (E ~ 3 )..

I 20. Let xŐĄi and xi , i = 1, 2, 3 be related by the linear transformation xŐĄi = cij xj , where cij are constants
such that the determinant c = det(cij ) is different from zero. Let ő≥m
n
denote the cofactor of cm
n divided by
the determinant c.
(a) Show that cij ő≥kj = ő≥ji cjk = őīki .
(b) Show the inverse transformation can be expressed xi = ő≥ji xŐĄj .
(c) Show that if Ai is a contravariant vector, then its transformed components are AŐĄp = cpq Aq .
(d) Show that if Ai is a covariant vector, then its transformed components are AŐĄi = ő≥ip Ap .

I 21. Show that the outer product of two contravariant vectors Ai and B i , i = 1, 2, 3 results in a second
order contravariant tensor.

## I 22. Show that for the position vector ~r = y i (x1 , x2 , x3 ) b

ei the element of arc length squared is
m m
2 ‚ąāy ‚ąāy
i j ~i ¬∑ E
ds = d~r ¬∑ d~r = gij dx dx where gij = E ~j = .
‚ąāxi ‚ąāxj
p i k i
I 23. For Aijk , Bnm and Ctq absolute tensors, show that if Aijk Bnk = Cjn
i
then Ajk B n = C jn .

I 24. Let Aij denote an absolute covariant tensor of order 2. Show that the determinant A = det(Aij ) is
p
an invariant of weight 2 and (A) is an invariant of weight 1.

I 25. Let B ij denote an absolute contravariant tensor of order 2. Show that the determinant B = det(B ij )
‚ąö
is an invariant of weight ‚ąí2 and B is an invariant of weight ‚ąí1.

I 26.
(a) Write out the contravariant components of the following vectors

~1
(i) E ~2
(ii) E ~3
(iii) E where ~ i = ‚ąā~r
E for i = 1, 2, 3.
‚ąāxi

## (b) Write out the covariant components of the following vectors

~1
(i) E ~2
(ii) E ~3
(ii) E ~ i = grad xi ,
where E for i = 1, 2, 3.
62

I 27. Let Aij and Aij denote absolute second order tensors. Show that őĽ = Aij Aij is a scalar invariant.

I 28. Assume that aij , i, j = 1, 2, 3, 4 is a skew-symmetric second order absolute tensor. (a) Show that

## ‚ąāajk ‚ąāaki ‚ąāaij

bijk = i
+ j
+
‚ąāx ‚ąāx ‚ąāxk

is a third order tensor. (b) Show bijk is skew-symmetric in all pairs of indices and (c) determine the number
of independent components this tensor has.

I 29. Show the linear forms A1 x + B1 y + C1 and A2 x + B2 y + C2 , with respect to the group of rotations
and translations x = x cos őł ‚ąí y sin őł + h and y = x sin őł + y cos őł + k, have the forms A1 x + B 1 y + C 1 and
A2 x + B 2 y + C 2 . Also show that the quantities A1 B2 ‚ąí A2 B1 and A1 A2 + B1 B2 are invariants.

I 30. Show that the curvature of a curve y = f (x) is őļ = ¬Ī y 00 (1 + y 02 )‚ąí3/2 and that this curvature remains
dy dy dx
invariant under the group of rotations given in the problem 1. Hint: Calculate dx = dx dx .

I 31. Show that when the equation of a curve is given in the parametric form x = x(t), y = y(t), then
xŐáyŐą ‚ąí yŐáxŐą
the curvature is őļ = ¬Ī 2 and remains invariant under the change of parameter t = t(t), where
(xŐá + yŐá 2 )3/2
xŐá = dx
dt , etc.

## I 32. Let Aij ij

k denote a third order mixed tensor. (a) Show that the contraction Ai is a first order
contravariant tensor. (b) Show that contraction of i and j produces Aii
k which is not a tensor. This shows
that in general, the process of contraction does not always apply to indices at the same level.

I 33. Let ŌÜ = ŌÜ(x1 , x2 , . . . , xN ) denote an absolute scalar invariant. (a) Is the quantity ‚ąāŌÜ
‚ąāxi a tensor? (b)
2
‚ąā ŌÜ
Is the quantity ‚ąāxi ‚ąāxj a tensor?

I 34. Consider the second order absolute tensor aij , i, j = 1, 2 where a11 = 1, a12 = 2, a21 = 3 and a22 = 4.
Find the components of aij under the transformation of coordinates x1 = x1 + x2 and x2 = x1 ‚ąí x2 .

I 35. Let Ai , Bi denote the components of two covariant absolute tensors of order one. Show that
Cij = Ai Bj is an absolute second order covariant tensor.

I 36. Let Ai denote the components of an absolute contravariant tensor of order one and let Bi denote the
components of an absolute covariant tensor of order one, show that Cji = Ai Bj transforms as an absolute
mixed tensor of order two.

I 37. (a) Show the sum and difference of two tensors of the same kind is also a tensor of this kind. (b) Show
that the outer product of two tensors is a tensor. Do parts (a) (b) in the special case where one tensor Ai
is a relative tensor of weight 4 and the other tensor Bkj is a relative tensor of weight 3. What is the weight
of the outer product tensor Tkij = Ai Bkj in this special case?

## I 38. Let Aij j ij

km denote the components of a mixed tensor of weight M . Form the contraction Bm = Aim
j
and determine how Bm transforms. What is its weight?

I 39. Let Aij denote the components of an absolute mixed tensor of order two. Show that the scalar
contraction S = Aii is an invariant.
63

I 40. Let Ai = Ai (x1 , x2 , . . . , xN ) denote the components of an absolute contravariant tensor. Form the
‚ąāAi
quantity Bji = ‚ąāxj and determine if Bji transforms like a tensor.
‚ąāAi ‚ąāAj
I 41. Let Ai denote the components of a covariant vector. (a) Show that aij = j
‚ąí are the
‚ąāx ‚ąāxi
‚ąāaij ‚ąāajk ‚ąāaki
components of a second order tensor. (b) Show that + + = 0.
‚ąāxk ‚ąāxi ‚ąāxj
I 42. Show that xi = K eijk Aj Bk , with K 6= 0 and arbitrary, is a general solution of the system of equations
Ai xi = 0, Bi xi = 0, i = 1, 2, 3. Give a geometric interpretation of this result in terms of vectors.

## I 43. Given the vector A~ = yb e1 + z b

e2 + x b
e3 where b
e1 , be2 , b
e3 denote a set of unit basis vectors which
~1 = 3 b
define a set of orthogonal x, y, z axes. Let E e1 + 4 b ~2 = 4 b
e2 , E e1 + 7 b ~3 = b
e2 and E e3 denote a set of
basis vectors which define a set of u, v, w axes. (a) Find the coordinate transformation between these two
~ 1, E
sets of axes. (b) Find a set of reciprocal vectors E ~ 3, E
~ 3 . (c) Calculate the covariant components of A.
~
(d) Calculate the contravariant components of A. ~

## I 44. Let A = Aij b

ei b
ej denote a dyadic. Show that

A : Ac = A11 A11 + A12 A21 + A13 A31 + A21 A12 + A22 A22 + A23 A32 + A31 A13 + A32 A23 + A23 A33

I 45. ~ = Ai b
Let A ~ = Bi b
ei , B ~ = Ci b
ei , C ~ = Di b
ei , D ~ B,
ei denote vectors and let ŌÜ = A ~ Ōą =C
~D~ denote
dyadics which are the outer products involving the above vectors. Show that the double dot product satisfies

~B
ŌÜ:Ōą=A ~ :C
~D~ = (A
~ ¬∑ C)(
~ B ~ ¬∑ D)
~

I 46. Show that if aij is a symmetric tensor in one coordinate system, then it is symmetric in all coordinate
systems.

I 47. Write the transformation laws for the given tensors. (a) Akij (b) Aij
k (c) Aijk
m

‚ąāxj ‚ąāxj
I 48. Show that if Ai = Aj i , then Ai = Aj ‚ąāxi . Note that this is equivalent to interchanging the bar
‚ąāx
and unbarred systems.

I 49.
(a) Show that under the linear homogeneous transformation

x1 =a11 x1 + a21 x2
x2 =a12 x1 + a22 x2

Q(x1 , x2 ) = g11 (x1 )2 + 2g12 x1 x2 + g22 (x2 )2 becomes Q(x1 , x2 ) = g11 (x1 )2 + 2g12 x1 x2 + g 22 (x2 )2

where g ij = g11 aj1 ai1 + g12 (ai1 aj2 + aj1 ai2 ) + g22 ai2 aj2 .
(b) Show F = g11 g22 ‚ąí (g12 )2 is a relative invariant of weight 2 of the quadratic form Q(x1 , x2 ) with respect
to the group of linear homogeneous transformations. i.e. Show that F = ‚ąÜ2 F where F = g 11 g22 ‚ąí(g12 )2
and ‚ąÜ = (a11 a22 ‚ąí a21 a12 ).
64

I 50. Let ai and bi for i = 1, . . . , n denote arbitrary vectors and form the dyadic

ő¶ = a1 b1 + a2 b2 + ¬∑ ¬∑ ¬∑ + an bn .

## By definition the first scalar invariant of ő¶ is

ŌÜ1 = a1 ¬∑ b1 + a2 ¬∑ b2 + ¬∑ ¬∑ ¬∑ + an ¬∑ bn

where a dot product operator has been placed between the vectors. The first vector invariant of ő¶ is defined
~ = a1 √ó b1 + a2 √ó b2 + ¬∑ ¬∑ ¬∑ + an √ó bn
ŌÜ

where a vector cross product operator has been placed between the vectors.
(a) Show that the first scalar and vector invariant of

e1 b
ő¶= b e2 b
e2 + b e3 b
e3 + b e3

## are respectively 1 and b

e1 + b
e3 .
(b) From the vector f = f1 b
e1 + f2 b
e2 + f3 b
e3 one can form the dyadic ‚ąáf having the matrix components
Ô£ę ‚ąāf1 ‚ąāf2 ‚ąāf3 Ô£∂
‚ąāx ‚ąāx ‚ąāx
‚ąáf = Ô£≠ Ô£ł.
‚ąāf1 ‚ąāf2 ‚ąāf3
‚ąāy ‚ąāy ‚ąāy
‚ąāf1 ‚ąāf2 ‚ąāf3
‚ąāz ‚ąāz ‚ąāz
Show that this dyadic has the first scalar and vector invariants given by
‚ąāf1 ‚ąāf2 ‚ąāf3
‚ąá¬∑f = + +
‚ąāx ‚ąāy ‚ąāz
     
‚ąāf3 ‚ąāf2 ‚ąāf1 ‚ąāf3 ‚ąāf2 ‚ąāf1
‚ąá√óf = ‚ąí b
e1 + ‚ąí b
e2 + ‚ąí b
e3
‚ąāy ‚ąāz ‚ąāz ‚ąāx ‚ąāx ‚ąāy

I 51. Let ő¶ denote the dyadic given in problem 50. The dyadic ő¶2 defined by
1X
ő¶2 = ai √ó aj bi √ó bj
2 i,j
is called the Gibbs second dyadic of ő¶, where the summation is taken over all permutations of i and j. When
i = j the dyad vanishes. Note that the permutations i, j and j, i give the same dyad and so occurs twice
in the final sum. The factor 1/2 removes this doubling. Associated with the Gibbs dyad ő¶2 are the scalar
invariants
1X
ŌÜ2 = (ai √ó aj ) ¬∑ (bi √ó bj )
2 i,j
1X
ŌÜ3 = (ai √ó aj ¬∑ ak )(bi √ó bj ¬∑ bk )
6
i,j,k
ő¶ = as + tq + cu
has
the first scalar invariant ŌÜ1 = a ¬∑ s + b ¬∑ t + c ¬∑ u
~ = a√ós+b√ót+c√óu
the first vector invariant ŌÜ
Gibbs second dyad ő¶2 = b √ó ct √ó u + c √ó au √ó s + a √ó bs √ó t
second scalar of ő¶ ŌÜ2 = (b √ó c) ¬∑ (t ¬∑ u) + (c √ó a) ¬∑ (u √ó s) + (a √ó b) ¬∑ (s √ó t)
third scalar of ő¶ ŌÜ3 = (a √ó b ¬∑ c)(s √ó t ¬∑ u)
65

I 52. (Spherical Trigonometry) Construct a spherical triangle ABC on the surface of a unit sphere with
sides and angles less than 180 degrees. Denote by a,b c the unit vectors from the origin of the sphere to the
vertices A,B and C. Make the construction such that a ¬∑ (b √ó c) is positive with a, b, c forming a right-handed
system. Let őĪ, ő≤, ő≥ denote the angles between these unit vectors such that

## a ¬∑ b = cos ő≥ c ¬∑ a = cos ő≤ b ¬∑ c = cos őĪ. (1)

The great circles through the vertices A,B,C then make up the sides of the spherical triangle where side őĪ
is opposite vertex A, side ő≤ is opposite vertex B and side ő≥ is opposite the vertex C. The angles A,B and C
between the various planes formed by the vectors a, b and c are called the interior dihedral angles of the
spherical triangle. Note that the cross products

## a √ó b = sin ő≥ c b √ó c = sin őĪ a c √ó a = sin ő≤ b (2)

define unit vectors a, b and c perpendicular to the planes determined by the unit vectors a, b and c. The
dot products
a ¬∑ b = cos ő≥ b ¬∑ c = cos őĪ c ¬∑ a = cos ő≤ (3)

define the angles őĪ,ő≤ and ő≥ which are called the exterior dihedral angles at the vertices A,B and C and are
such that
őĪ=ŌÄ‚ąíA ő≤ =ŌÄ‚ąíB ő≥ = ŌÄ ‚ąí C. (4)

(a) Using appropriate scaling, show that the vectors a, b, c and a, b, c form a reciprocal set.
(b) Show that a ¬∑ (b √ó c) = sin őĪ a ¬∑ a = sin ő≤ b ¬∑ b = sin ő≥ c ¬∑ c
(c) Show that a ¬∑ (b √ó c) = sin őĪ a ¬∑ a = sin ő≤ b ¬∑ b = sin ő≥ c ¬∑ c
(d) Using parts (b) and (c) show that
sin őĪ sin ő≤ sin ő≥
= =
sin őĪ sin ő≤ sin ő≥
(e) Use the results from equation (4) to derive the law of sines for spherical triangles
sin őĪ sin ő≤ sin ő≥
= =
sin A sin B sin C
(f) Using the equations (2) show that

## and hence show that

cos őĪ = cos ő≤ cos ő≥ ‚ąí sin ő≤ sin ő≥ cos őĪ.

## cos őĪ = cos ő≤ cos ő≥ ‚ąí sin ő≤ sin ő≥ cos őĪ.

(g) Using part (f) derive the law of cosines for spherical triangles
cos őĪ = cos ő≤ cos ő≥ + sin ő≤ sin ő≥ cos A
cos A = ‚ąí cos B cos C + sin B sin C cos őĪ
A cyclic permutation of the symbols produces similar results involving the other angles and sides of the
spherical triangle.
65

## ¬ß1.3 SPECIAL TENSORS

Knowing how tensors are defined and recognizing a tensor when it pops up in front of you are two
different things. Some quantities, which are tensors, frequently arise in applied problems and you should
learn to recognize these special tensors when they occur. In this section some important tensor quantities
are defined. We also consider how these special tensors can in turn be used to define other tensors.

Metric Tensor

## Define y i , i = 1, . . . , N as independent coordinates in an N dimensional orthogonal Cartesian coordinate

system. The distance squared between two points y i and y i + dy i , i = 1, . . . , N is defined by the
expression
ds2 = dy m dy m = (dy 1 )2 + (dy 2 )2 + ¬∑ ¬∑ ¬∑ + (dy N )2 . (1.3.1)

Assume that the coordinates y i are related to a set of independent generalized coordinates xi , i = 1, . . . , N
by a set of transformation equations

y i = y i (x1 , x2 , . . . , xN ), i = 1, . . . , N. (1.3.2)

To emphasize that each y i depends upon the x coordinates we sometimes use the notation y i = y i (x), for
i = 1, . . . , N. The differential of each coordinate can be written as

‚ąāy m j
dy m = dx , m = 1, . . . , N, (1.3.3)
‚ąāxj

and consequently in the x-generalized coordinates the distance squared, found from the equation (1.3.1),
becomes a quadratic form. Substituting equation (1.3.3) into equation (1.3.1) we find

‚ąāy m ‚ąāy m i j
ds2 = dx dx = gij dxi dxj (1.3.4)
‚ąāxi ‚ąāxj

where
‚ąāy m ‚ąāy m
gij = , i, j = 1, . . . , N (1.3.5)
‚ąāxi ‚ąāxj
are called the metrices of the space defined by the coordinates xi , i = 1, . . . , N. Here the gij are functions of
the x coordinates and is sometimes written as gij = gij (x). Further, the metrices gij are symmetric in the
indices i and j so that gij = gji for all values of i and j over the range of the indices. If we transform to
another coordinate system, say xi , i = 1, . . . , N , then the element of arc length squared is expressed in terms
of the barred coordinates and ds2 = g ij dxi dxj , where gij = g ij (x) is a function of the barred coordinates.
The following example demonstrates that these metrices are second order covariant tensors.
66

EXAMPLE 1.3-1. Show the metric components gij are covariant tensors of the second order.
Solution: In a coordinate system xi , i = 1, . . . , N the element of arc length squared is

## ds2 = gij dxi dxj (1.3.6)

while in a coordinate system xi , i = 1, . . . , N the element of arc length squared is represented in the form

## gmn dxm dxn = gij dxi dxj (1.3.8)

Here it is assumed that there exists a coordinate transformation of the form defined by equation (1.2.30)
together with an inverse transformation, as in equation (1.2.32), which relates the barred and unbarred
coordinates. In general, if xi = xi (x), then for i = 1, . . . , N we have

‚ąāxi ‚ąāxj
dxi = dxm and dxj = dxn (1.3.9)
‚ąāxm ‚ąāxn

## Substituting these differentials in equation (1.3.8) gives us the result

 
‚ąāxi ‚ąāxj ‚ąāxi ‚ąāxj
g mn dx dx = gij m n dxm dxn
m n
or g mn ‚ąí gij m n dxm dxn = 0
‚ąāx ‚ąāx ‚ąāx ‚ąāx

‚ąāxi ‚ąāxj
For arbitrary changes in dxm this equation implies that g mn = gij and consequently gij transforms
‚ąāxm ‚ąāxn
as a second order absolute covariant tensor.

EXAMPLE 1.3-2. (Curvilinear coordinates) Consider a set of general transformation equations from
rectangular coordinates (x, y, z) to curvilinear coordinates (u, v, w). These transformation equations and the
corresponding inverse transformations are represented

x = x(u, v, w) u = u(x, y, z)
y = y(u, v, w) v = v(x, y, z) (1.3.10)
z = z(u, v, w). w = w(x, y, z)

## Here y 1 = x, y 2 = y, y 3 = z and x1 = u, x2 = v, x3 = w are the Cartesian and generalized coordinates

and N = 3. The intersection of the coordinate surfaces u = c1 ,v = c2 and w = c3 define coordinate curves
of the curvilinear coordinate system. The substitution of the given transformation equations (1.3.10) into
e1 + y b
the position vector ~r = x b e2 + z b
e3 produces the position vector which is a function of the generalized
coordinates and
e1 + y(u, v, w) b
~r = ~r(u, v, w) = x(u, v, w) b e2 + z(u, v, w) b
e3
67

## ‚ąā~r ‚ąā~r ‚ąā~r

and consequently d~r = du + dv + dw, where
‚ąāu ‚ąāv ‚ąāw

~ 1 = ‚ąā~r =
E
‚ąāx
b
e1 +
‚ąāy
b
e2 +
‚ąāz
b
e3
‚ąāu ‚ąāu ‚ąāu ‚ąāu
E~ 2 = ‚ąā~r = ‚ąāx
b
e1 +
‚ąāy
b
e2 +
‚ąāz
be3 (1.3.11)
‚ąāv ‚ąāv ‚ąāv ‚ąāv
~3 = r =
E
‚ąā~ ‚ąāx
b
e1 +
‚ąāy
b
e2 +
‚ąāz
b
e3 .
‚ąāw ‚ąāw ‚ąāw ‚ąāw
are tangent vectors to the coordinate curves. The element of arc length in the curvilinear coordinates is

## ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r

ds2 = d~r ¬∑ d~r = ¬∑ dudu + ¬∑ dudv + ¬∑ dudw
‚ąāu ‚ąāu ‚ąāu ‚ąāv ‚ąāu ‚ąāw
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
+ ¬∑ dvdu + ¬∑ dvdv + ¬∑ dvdw (1.3.12)
‚ąāv ‚ąāu ‚ąāv ‚ąāv ‚ąāv ‚ąāw
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
+ ¬∑ dwdu + ¬∑ dwdv + ¬∑ dwdw.
‚ąāw ‚ąāu ‚ąāw ‚ąāv ‚ąāw ‚ąāw
Utilizing the summation convention, the above can be expressed in the index notation. Define the
quantities
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
g11 = ¬∑ g12 = ¬∑ g13 = ¬∑
‚ąāu ‚ąāu ‚ąāu ‚ąāv ‚ąāu ‚ąāw
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
g21 = ¬∑ g22 = ¬∑ g23 = ¬∑
‚ąāv ‚ąāu ‚ąāv ‚ąāv ‚ąāv ‚ąāw
‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r
g31 = ¬∑ g32 = ¬∑ g33 = ¬∑
‚ąāw ‚ąāu ‚ąāw ‚ąāv ‚ąāw ‚ąāw
and let x1 = u, x2 = v, x3 = w. Then the above element of arc length can be expressed as

ds2 = E
~i ¬∑ E
~ j dxi dxj = gij dxi dxj , i, j = 1, 2, 3

where
m m
gij = E ~ j = ‚ąā~r ¬∑ ‚ąā~r = ‚ąāy ‚ąāy ,
~i ¬∑ E i, j free indices (1.3.13)
‚ąāxi ‚ąāxj ‚ąāxi ‚ąāxj
are called the metric components of the curvilinear coordinate system. The metric components may be
thought of as the elements of a symmetric matrix, since gij = gji . In the rectangular coordinate system
x, y, z, the element of arc length squared is ds2 = dx2 + dy 2 + dz 2 . In this space the metric components are
Ô£ę Ô£∂
1 0 0
gij = Ô£≠ 0 1 0Ô£ł.
0 0 1
68

## EXAMPLE 1.3-3. (Cylindrical coordinates (r, őł, z))

The transformation equations from rectangular coordinates to cylindrical coordinates can be expressed
as x = r cos őł, y = r sin őł, z = z. Here y 1 = x, y 2 = y, y 3 = z and x1 = r, x2 = őł, x3 = z, and the
e1 + r sin őł b
position vector can be expressed ~r = ~r(r, őł, z) = r cos őł b e2 + z b
e3 . The derivatives of this position
vector are calculated and we find

~ 1 = ‚ąā~r = cos őł b
E e1 + sin őł b
e2 , ~ 2 = ‚ąā~r = ‚ąír sin őł b
E e1 + r cos őł b
e2 , ~ 3 = ‚ąā~r = b
E e3 .
‚ąār ‚ąāőł ‚ąāz

From the results in equation (1.3.13), the metric components of this space are
Ô£ę Ô£∂
1 0 0
gij = Ô£≠ 0 r2 0Ô£ł.
0 0 1

## We note that since gij = 0 when i 6= j, the coordinate system is orthogonal.

Given a set of transformations of the form found in equation (1.3.10), one can readily determine the
metric components associated with the generalized coordinates. For future reference we list several differ-
ent coordinate systems together with their metric components. Each of the listed coordinate systems are
orthogonal and so gij = 0 for i 6= j. The metric components of these orthogonal systems have the form
Ô£ę Ô£∂
h21 0 0
gij = Ô£≠ 0 h22 0 Ô£ł
0 0 h23

## 1. Cartesian coordinates (x, y, z)

x=x h1 = 1
y=y h2 = 1
z=z h3 = 1
The coordinate curves are formed by the intersection of the coordinate surfaces
x =Constant, y =Constant and z =Constant.
69

## 2. Cylindrical coordinates (r, őł, z)

x = r cos őł r‚Č•0 h1 = 1
y = r sin őł 0 ‚Č§ őł ‚Č§ 2ŌÄ h2 = r
z=z ‚ąí‚ąě<z <‚ąě h3 = 1

The coordinate curves, illustrated in the figure 1.3-1, are formed by the intersection of the coordinate
surfaces
x2 + y 2 = r2 , Cylinders
y/x = tan őł Planes
z = Constant Planes.

## x = ŌĀ sin őł cos ŌÜ ŌĀ‚Č•0 h1 = 1

y = ŌĀ sin őł sin ŌÜ 0‚Č§őł‚Č§ŌÄ h2 = ŌĀ
z = ŌĀ cos őł 0 ‚Č§ ŌÜ ‚Č§ 2ŌÄ h3 = ŌĀ sin őł

The coordinate curves, illustrated in the figure 1.3-2, are formed by the intersection of the coordinate
surfaces
x2 + y 2 + z 2 = ŌĀ2 Spheres
x2 + y 2 = tan2 őł z 2 Cones
y = x tan ŌÜ Planes.
4. Parabolic cylindrical coordinates (őĺ, ő∑, z)
p
x = őĺő∑ ‚ąí‚ąě<őĺ <‚ąě h1 = őĺ 2 + ő∑2
1 p
y = (őĺ 2 ‚ąí ő∑ 2 ) ‚ąí‚ąě<z <‚ąě h2 = őĺ 2 + ő∑ 2
2
z=z ő∑‚Č•0 h3 = 1
70

## Figure 1.3-2. Spherical coordinates.

The coordinate curves, illustrated in the figure 1.3-3, are formed by the intersection of the coordinate
surfaces
őĺ2
x2 = ‚ąí2őĺ 2 (y ‚ąí ) Parabolic cylinders
2
ő∑2
x2 = 2ő∑ 2 (y + ) Parabolic cylinders
2
z = Constant Planes.

## 5. Parabolic coordinates (őĺ, ő∑, ŌÜ)

p
x = őĺő∑ cos ŌÜ őĺ‚Č•0 h1 = őĺ 2 + ő∑2
p
y = őĺő∑ sin ŌÜ ő∑‚Č•0 h2 = őĺ 2 + ő∑ 2
1
z = (őĺ 2 ‚ąí ő∑ 2 ) 0 < ŌÜ < 2ŌÄ h3 = őĺő∑
2
71

The coordinate curves, illustrated in the figure 1.3-4, are formed by the intersection of the coordinate
surfaces
őĺ2
x2 + y 2 = ‚ąí2őĺ 2 (z ‚ąí ) Paraboloids
2
ő∑2
x2 + y 2 = 2ő∑ 2 (z + ) Paraboloids
2
y = x tan ŌÜ Planes.

## 6. Elliptic cylindrical coordinates (őĺ, ő∑, z)

q
x = cosh őĺ cos ő∑ őĺ‚Č•0 h1 = sinh2 őĺ + sin2 ő∑
q
y = sinh őĺ sin ő∑ 0 ‚Č§ ő∑ ‚Č§ 2ŌÄ h2 = sinh2 őĺ + sin2 ő∑
z=z ‚ąí‚ąě<z <‚ąě h3 = 1

The coordinate curves, illustrated in the figure 1.3-5, are formed by the intersection of the coordinate
surfaces
x2 y2
+ =1 Elliptic cylinders
cosh2 őĺ sinh2 őĺ
x2 y2
‚ąí =1 Hyperbolic cylinders
cos2 ő∑ sin2 ő∑
z = Constant Planes.
72

## 7. Elliptic coordinates (őĺ, ő∑, ŌÜ)

s
p őĺ 2 ‚ąí ő∑2
h1 =
x = (1 ‚ąí ő∑ 2 )(őĺ 2 ‚ąí 1) cos ŌÜ 1‚Č§őĺ<‚ąě őĺ2 ‚ąí 1
p s
y = (1 ‚ąí ő∑ 2 )(őĺ 2 ‚ąí 1) sin ŌÜ ‚ąí1‚Č§ő∑ ‚Č§1 őĺ 2 ‚ąí ő∑2
h2 =
z = őĺő∑ 0 ‚Č§ ŌÜ < 2ŌÄ 1 ‚ąí ő∑2
p
h3 = (1 ‚ąí ő∑ 2 )(őĺ 2 ‚ąí 1)

The coordinate curves, illustrated in the figure 1.3-6, are formed by the intersection of the coordinate
surfaces
x2 y2 z2
+ + =1 Prolate ellipsoid
őĺ2 ‚ąí 1 őĺ2 ‚ąí 1 őĺ2
z2 x2 y2
‚ąí ‚ąí =1 Two-sheeted hyperboloid
ő∑2 1 ‚ąí ő∑2 1 ‚ąí ő∑2
y = x tan ŌÜ Planes
8. Bipolar coordinates (u, v, z)

## a sinh v h21 = h22

x= , 0 ‚Č§ u < 2ŌÄ
cosh v ‚ąí cos u
a2
a sin u
‚ąí‚ąě < v < ‚ąě h22 =
y=
cosh v ‚ąí cos u
, (cosh v ‚ąí cos u)2
z=z ‚ąí‚ąě<z <‚ąě h23 = 1
73

## Figure 1.3-7. Bipolar coordinates.

The coordinate curves, illustrated in the figure 1.3-7, are formed by the intersection of the coordinate
surfaces
a2
(x ‚ąí a coth v)2 + y 2 = Cylinders
sinh2 v
a2
x2 + (y ‚ąí a cot u)2 = Cylinders
sin2 u
z = Constant Planes.
74

## 9. Conical coordinates (u, v, w)

uvw
x= , b 2 > v 2 > a2 > w 2 , u‚Č•0 h21 = 1
ab
r
u (v 2 ‚ąí a2 )(w2 ‚ąí a2 ) u2 (v 2 ‚ąí w2 )
h22 =
y=
a a2 ‚ąí b 2 (v 2
‚ąí a2 )(b2 ‚ąí v 2 )
r u2 (v 2 ‚ąí w2 )
u (v 2 ‚ąí b2 )(w2 ‚ąí b2 ) h23 =
z=
b b 2 ‚ąí a2 (w ‚ąí a2 )(w2 ‚ąí b2 )
2

The coordinate curves, illustrated in the figure 1.3-8, are formed by the intersection of the coordinate
surfaces
x2 + y 2 + z 2 = u2 Spheres
2 2 2
x y z
+ 2 + 2 =0, Cones
v2 v ‚ąí a2 v ‚ąí b2
x2 y2 z2
+ + = 0, Cones.
w2 w 2 ‚ąí a2 w 2 ‚ąí b2

## x = a sinh u sin v cos ŌÜ, u‚Č•0 h21 = h22

y = a sinh u sin v sin ŌÜ, 0‚Č§v‚Č§ŌÄ h22 = a2 (sinh2 u + sin2 v)
z = a cosh u cos v, 0 ‚Č§ ŌÜ < 2ŌÄ h23 = a2 sinh2 u sin2 v

The coordinate curves, illustrated in the figure 1.3-9, are formed by the intersection of the coordinate
surfaces
x2 y2 z2
2
+ 2
+ = 1, Prolate ellipsoids
(a sinh u) (a sinh u) (a cosh u)2
z2 x2 y2
2
‚ąí 2
‚ąí = 1, Two-sheeted hyperboloid
(a cos v) (a sin v) (a sin v)2
y = x tan ŌÜ, Planes.
75

## x = a cosh őĺ cos ő∑ cos ŌÜ, őĺ‚Č•0 h21 = h22

ŌÄ ŌÄ
y = a cosh őĺ cos ő∑ sin ŌÜ, ‚ąí ‚Č§ő∑‚Č§ h22 = a2 (sinh2 őĺ + sin2 ő∑)
2 2
z = a sinh őĺ sin ő∑, 0 ‚Č§ ŌÜ ‚Č§ 2ŌÄ h23 = a2 cosh2 őĺ cos2 ő∑

The coordinate curves, illustrated in the figure 1.3-10, are formed by the intersection of the coordinate
surfaces
x2 y2 z2
2
+ 2
+ = 1, Oblate ellipsoids
(a cosh őĺ) (a cosh őĺ) (a sinh őĺ)2
x2 y2 z2
2
+ 2
‚ąí = 1, One-sheet hyperboloids
(a cos ő∑) (a cos ő∑) (a sin ő∑)2
y = x tan ŌÜ, Planes.
12. Toroidal coordinates (u, v, ŌÜ)

x=
a sinh v cos ŌÜ
, 0 ‚Č§ u < 2ŌÄ h21 = h22
cosh v ‚ąí cos u a2
a sinh v sin ŌÜ h22 =
y= , ‚ąí‚ąě < v < ‚ąě (cosh v ‚ąí cos u)2
cosh v ‚ąí cos u
a sin u a2 sinh2 v
z= , 0 ‚Č§ ŌÜ < 2ŌÄ h23 =
cosh v ‚ąí cos u (cosh v ‚ąí cos u)2

The coordinate curves, illustrated in the figure 1.3-11, are formed by the intersection of the coordinate
surfaces  a cos u 2 a2
x2 + y 2 + z ‚ąí = , Spheres
sin u sin2 u
p 2
cosh v a2
x2 + y 2 ‚ąí a + z2 = , Torus
sinh v sinh2 v
y = x tan ŌÜ, planes
76

## Figure 1.3-11. Toroidal coordinates

EXAMPLE 1.3-4. Show the Kronecker delta őīji is a mixed second order tensor.
Solution: Assume we have a coordinate transformation xi = xi (x), i = 1, . . . , N of the form (1.2.30) and
i
possessing an inverse transformation of the form (1.2.32). Let őī j and őīji denote the Kronecker delta in the
barred and unbarred system of coordinates. By definition the Kronecker delta is defined

i 0, if i 6= j
őīj = őīji = .
1, if i=j
77

## ‚ąāxm ‚ąāxm ‚ąāxi ‚ąāxm ‚ąāxk i

n = = őī (1.3.14)
‚ąāx ‚ąāxi ‚ąāx n
‚ąāxi ‚ąāxn k
‚ąāxm m
By hypothesis, the xi , i = 1, . . . , N are independent coordinates and therefore we have ‚ąāxn = őī n and (1.3.14)
simplifies to
m ‚ąāxm ‚ąāxk
őī n = őīki .
‚ąāxi ‚ąāxn
Therefore, the Kronecker delta transforms as a mixed second order tensor.

## Conjugate Metric Tensor

Let g denote the determinant of the matrix having the metric tensor gij , i, j = 1, . . . , N as its elements.
In our study of cofactor elements of a matrix we have shown that

cof (g1j )g1k + cof (g2j )g2k + . . . + cof (gN j )gN k = gőīkj . (1.3.15)

We can use this fact to find the elements in the inverse matrix associated with the matrix having the
components gij . The elements of this inverse matrix are

1
g ij = cof (gij ) (1.3.16)
g

and are called the conjugate metric components. We examine the summation g ij gik and find:

## g ij gik = g 1j g1k + g 2j g2k + . . . + g N j gN k

1
= [cof (g1j )g1k + cof (g2j )g2k + . . . + cof (gN j )gN k ]
g
1 h ji
= gőīk = őīkj
g

The equation
g ij gik = őīkj (1.3.17)

is an example where we can use the quotient law to show g ij is a second order contravariant tensor. Because
of the symmetry of g ij and gij the equation (1.3.17) can be represented in other forms.

EXAMPLE 1.3-5. Let Ai and Ai denote respectively the covariant and contravariant components of a
~ Show these components are related by the equations
vector A.

Ai = gij Aj (1.3.18)
k jk
A = g Aj (1.3.19)

where gij and g ij are the metric and conjugate metric components of the space.
78

Solution: We multiply the equation (1.3.18) by g im (inner product) and use equation (1.3.17) to simplify
the results. This produces the equation g im Ai = g im gij Aj = őījm Aj = Am . Changing indices produces the
result given in equation (1.3.19). Conversely, if we start with equation (1.3.19) and multiply by gkm (inner
product) we obtain gkm Ak = gkm g jk Aj = őīm
j
Aj = Am which is another form of the equation (1.3.18) with
the indices changed.
Notice the consequences of what the equations (1.3.18) and (1.3.19) imply when we are in an orthogonal
Cartesian coordinate system where
Ô£ę Ô£∂ Ô£ę Ô£∂
1 0 0 1 0 0
gij = Ô£≠ 0 1 0Ô£ł and g ij = Ô£≠0 1 0Ô£ł.
0 0 1 0 0 1

## In this special case, we have

A1 = g11 A1 + g12 A2 + g13 A3 = A1
A2 = g21 A1 + g22 A2 + g23 A3 = A2
A3 = g31 A1 + g32 A2 + g33 A3 = A3 .
These equations tell us that in a Cartesian coordinate system the contravariant and covariant components
are identically the same.

EXAMPLE 1.3-6. We have previously shown that if Ai is a covariant tensor of rank 1 its components in
a barred system of coordinates are
‚ąāxj
Ai = Aj . (1.3.20)
‚ąāxi
Solve for the Aj in terms of the Aj . (i.e. find the inverse transformation).
‚ąāxi
Solution: Multiply equation (1.3.20) by ‚ąāxm (inner product) and obtain

## ‚ąāxi ‚ąāxj ‚ąāxi

Ai = Aj . (1.3.21)
‚ąāxm ‚ąāxi ‚ąāxm

## ‚ąāxj ‚ąāxi ‚ąāxj j

In the above product we have = = őīm since xj and xm are assumed to be independent
‚ąāxi ‚ąāxm ‚ąāxm
coordinates. This reduces equation (1.3.21) to the form

‚ąāxi j
Ai = Aj őīm = Am (1.3.22)
‚ąāxm

## which is the desired inverse transformation.

This result can be obtained in another way. Examine the transformation equation (1.3.20) and ask the
question, ‚ÄúWhen we have two coordinate systems, say a barred and an unbarred system, does it matter which
system we call the barred system?‚ÄĚ With some thought it should be obvious that it doesn‚Äôt matter which
system you label as the barred system. Therefore, we can interchange the barred and unbarred symbols in
‚ąāxj
equation (1.3.20) and obtain the result Ai = Aj i which is the same form as equation (1.3.22), but with
‚ąāx
a different set of indices.
79

Associated Tensors

Associated tensors can be constructed by taking the inner product of known tensors with either the
metric or conjugate metric tensor.

## Definition: (Associated tensor) Any tensor constructed by multiplying (inner

product) a given tensor with the metric or conjugate metric tensor is called an
associated tensor.

Associated tensors are different ways of representing a tensor. The multiplication of a tensor by the
metric or conjugate metric tensor has the effect of lowering or raising indices. For example the covariant
and contravariant components of a vector are different representations of the same vector in different forms.
These forms are associated with one another by way of the metric and conjugate metric tensor and

g ij Ai = Aj gij Aj = Ai .

## EXAMPLE 1.3-7. The following are some examples of associated tensors.

Aj = g ij Ai Aj = gij Ai
Am
.jk = g
mi
Aijk Ai.k
m = gmj A
ijk

A.nm
i.. = g
mk nj
g Aijk Amjk = gim Ai.jk

Sometimes ‚Äėdots‚Äôare used as indices in order to represent the location of the index that was raised or lowered.
If a tensor is symmetric, the position of the index is immaterial and so a dot is not needed. For example, if
Amn is a symmetric tensor, then it is easy to show that An.m and A.n
m are equal and therefore can be written
as Anm without confusion.
Higher order tensors are similarly related. For example, if we find a fourth order covariant tensor Tijkm
we can then construct the fourth order contravariant tensor T pqrs from the relation

T pqrs = g pi g qj g rk g sm Tijkm .

This fourth order tensor can also be expressed as a mixed tensor. Some mixed tensors associated with
the given fourth order covariant tensor are:

p pq p
T.jkm = g pi Tijkm , T..km = g qj T.jkm .
80

Riemann Space VN

A Riemannian space VN is said to exist if the element of arc length squared has the form

## ds2 = gij dxi dxj (1.3.23)

where the metrices gij = gij (x1 , x2 , . . . , xN ) are continuous functions of the coordinates and are different
from constants. In the special case gij = őīij the Riemannian space VN reduces to a Euclidean space EN .
The element of arc length squared defined by equation (1.3.23) is called the Riemannian metric and any
geometry which results by using this metric is called a Riemannian geometry. A space VN is called flat if
it is possible to find a coordinate transformation where the element of arclength squared is ds2 = i (dxi )2
where each i is either +1 or ‚ąí1. A space which is not flat is called curved.

Geometry in VN

## Given two vectors A ~ i and B

~ = Ai E ~ j , then their dot product can be represented
~ = Bj E

~¬∑B
A ~i ¬∑ E
~ = Ai B j E ~ j = gij Ai B j = Aj B j = Ai Bi = g ij Aj Bi = |A||
~ B|~ cos őł. (1.3.24)

~ and B
Consequently, in an N dimensional Riemannian space VN the dot or inner product of two vectors A ~
is defined:
gij Ai B j = Aj B j = Ai Bi = g ij Aj Bi = AB cos őł. (1.3.25)

In this definition A is the magnitude of the vector Ai , the quantity B is the magnitude of the vector Bi and
őł is the angle between the vectors when their origins are made to coincide. In the special case that őł = 90‚ó¶
we have gij Ai B j = 0 as the condition that must be satisfied in order that the given vectors Ai and B i are
orthogonal to one another. Consider also the special case of equation (1.3.25) when Ai = B i and őł = 0. In
this case the equations (1.3.25) inform us that

## g in An Ai = Ai Ai = gin Ai An = (A)2 . (1.3.26)

From this equation one can determine the magnitude of the vector Ai . The magnitudes A and B can be
1 1
written A = (gin Ai An ) 2 and B = (gpq B p B q ) 2 and so we can express equation (1.3.24) in the form

gij Ai B j
cos őł = 1 . (1.3.27)
(gmn Am An ) 2 (g p B q ) 12
pq B

An import application of the above concepts arises in the dynamics of rigid body motion. Note that if a
dAi
vector Ai has constant magnitude and the magnitude of dt is different from zero, then the vectors Ai and
i j
dA
dt must be orthogonal to one another due to the fact that gij Ai dA
dt = 0. As an example, consider the unit
e1 , b
vectors b e2 and b
e3 on a rotating system of Cartesian axes. We have for constants ci , i = 1, 6 that

= c1 b
e 2 + c2 b
e3 = c3 b
e 3 + c4 b
e1 = c5 b
e 1 + c6 b
e2
dt dt dt

## because the derivative of any b

ei (i fixed) constant vector must lie in a plane containing the vectors b
ej and
b
ek , (j 6= i , k 6= i and j 6= k), since any vector in this plane must be perpendicular to b
ei .
81

The above definition of a dot product in VN can be used to define unit vectors in VN .

## Definition: (Unit vector) Whenever the magnitude of a vec-

i
tor A is unity, the vector is called a unit vector. In this case we
have
gij Ai Aj = 1. (1.3.28)

## EXAMPLE 1.3-8. (Unit vectors)

In VN the element of arc length squared is expressed ds2 = gij dxi dxj which can be expressed in the
dxi dxj dxi
form 1 = gij . This equation states that the vector , i = 1, . . . , N is a unit vector. One application
ds ds ds
of this equation is to consider a particle moving along a curve in VN which is described by the parametric
dxi
equations xi = xi (t), for i = 1, . . . , N. The vector V i = dt , i = 1, . . . , N represents a velocity vector of the
particle. By chain rule differentiation we have

## dxi dxi ds dxi

Vi = = =V , (1.3.29)
dt ds dt ds
ds dxi
where V = dt is the scalar speed of the particle and ds is a unit tangent vector to the curve. The equation
(1.3.29) shows that the velocity is directed along the tangent to the curve and has a magnitude V. That is
 2
ds
= (V )2 = gij V i V j .
dt

## EXAMPLE 1.3-9. (Curvilinear coordinates)

Find an expression for the cosine of the angles between the coordinate curves associated with the
transformation equations

82

## Solution: Let y 1 = x, y 2 = y, y 3 = z and x1 = u, x2 = v, x3 = w denote the Cartesian and curvilinear

coordinates respectively. With reference to the figure 1.3-12 we can interpret the intersection of the surfaces
v = c2 and w = c3 as the curve ~r = ~r(u, c2 , c3 ) which is a function of the parameter u. By moving only along
‚ąā~r
this curve we have d~r = du and consequently
‚ąāu
‚ąā~r ‚ąā~r
ds2 = d~r ¬∑ d~r = ¬∑ dudu = g11 (dx1 )2 ,
‚ąāu ‚ąāu
or  2
d~r d~r dx1
1= ¬∑ = g11 .
ds ds ds
dx1 ‚ąö1
This equation shows that the vector ds = g11 is a unit vector along this curve. This tangent vector can
be represented by tr(1) = ‚ąö 1 őī1
r
.
g11
The curve which is defined by the intersection of the surfaces u = c1 and w = c3 has the unit tangent
vector tr(2) = ‚ąö 1 őī2
r
. Similarly, the curve which is defined as the intersection of the surfaces u = c1 and
g22
v = c2 has the unit tangent vector tr(3) = ‚ąö 1 őī3
r
. The cosine of the angle őł12 , which is the angle between the
g33
unit vectors tr(1) and tr(2) , is obtained from the result of equation (1.3.25). We find
1 1 g12
cos őł12 = gpq tp(1) tq(2) = gpq ‚ąö őī1p ‚ąö őī2q = ‚ąö ‚ąö .
g11 g22 g11 g22
For őł13 the angle between the directions ti(1) and ti(3) we find
g13
cos őł13 = ‚ąö ‚ąö .
g11 g33
Finally, for őł23 the angle between the directions ti(2) and ti(3) we find
g23
cos őł23 = ‚ąö ‚ąö .
g22 g33
When őł13 = őł12 = őł23 = 90‚ó¶ , we have g12 = g13 = g23 = 0 and the coordinate curves which make up the
curvilinear coordinate system are orthogonal to one another.
In an orthogonal coordinate system we adopt the notation

83

## Epsilon Permutation Symbol

Associated with the e‚ąípermutation symbols there are the epsilon permutation symbols defined by the
relations
‚ąö 1
ijk = geijk and ijk = ‚ąö eijk (1.3.30)
g
where g is the determinant of the metrices gij .
It can be demonstrated that the eijk permutation symbol is a relative tensor of weight ‚ąí1 whereas the
ijk permutation symbol is an absolute tensor. Similarly, the eijk permutation symbol is a relative tensor of
weight +1 and the corresponding ijk permutation symbol is an absolute tensor.
EXAMPLE 1.3-10. ( permutation symbol)
Show that eijk is a relative tensor of weight ‚ąí1 and the corresponding ijk permutation symbol is an
absolute tensor.
Solution: Examine the Jacobian 1
‚ąāx1 ‚ąāx1 ‚ąāx1
x ‚ąāx ‚ąāx2 ‚ąāx3
2 ‚ąāx2 ‚ąāx2
J = ‚ąāx
x ‚ąāx13 ‚ąāx2 ‚ąāx3
‚ąāx1 ‚ąāx3 ‚ąāx3
‚ąāx ‚ąāx2 ‚ąāx3
and make the substitution
‚ąāxi
aij = , i, j = 1, 2, 3.
‚ąāxj
From the definition of a determinant we may write
x
eijk aim ajn akp = J( )emnp . (1.3.31)
x
By definition, emnp = emnp in all coordinate systems and hence equation (1.3.31) can be expressed in the
form h x i‚ąí1 ‚ąāxi ‚ąāxj ‚ąāxk
J( ) eijk m n p = emnp (1.3.32)
x ‚ąāx ‚ąāx ‚ąāx
which demonstrates that eijk transforms as a relative tensor of weight ‚ąí1.
We have previously shown the metric tensor gij is a second order covariant tensor and transforms
‚ąāxm ‚ąāxn
according to the rule gij = gmn . Taking the determinant of this result we find
‚ąāxi ‚ąāxj
m 2 h x i2
‚ąāx
g = |gij | = |gmn | i = g J( ) (1.3.33)
‚ąāx x

where g is the determinant of (gij ) and g is the determinant of (g ij ). This result demonstrates that g is a
scalar invariant of weight +2. Taking the square root of this result we find that
p ‚ąö x
g= gJ( ). (1.3.34)
x
‚ąö
Consequently, we call g a scalar invariant of weight +1. Now multiply both sides of equation (1.3.32) by
‚ąö
g and use (1.3.34) to verify the relation

## ‚ąö ‚ąāxi ‚ąāxj ‚ąāxk p

g eijk m n p = g emnp . (1.3.35)
‚ąāx ‚ąāx ‚ąāx
‚ąö
This equation demonstrates that the quantity ijk = g eijk transforms like an absolute tensor.
84

## Figure 1.3-14. Translation followed by rotation of axes

In a similar manner one can show eijk is a relative tensor of weight +1 and ijk = ‚ąö1 eijk is an absolute
g
tensor. This is left as an exercise.

Another exercise found at the end of this section is to show that a generalization of the e ‚ąí őī identity
is the epsilon identity
g ij ipt jrs = gpr gts ‚ąí gps gtr . (1.3.36)

Cartesian Tensors

Consider the motion of a rigid rod in two dimensions. No matter how complicated the movement of
the rod is we can describe the motion as a translation followed by a rotation. Consider the rigid rod AB
illustrated in the figure 1.3-13.

## Figure 1.3-13. Motion of rigid rod

In this figure there is a before and after picture of the rod‚Äôs position. By moving the point B to B 0 we
have a translation. This is then followed by a rotation holding B fixed.
85

## Figure 1.3-15. Rotation of axes

A similar situation exists in three dimensions. Consider two sets of Cartesian axes, say a barred and
unbarred system as illustrated in the figure 1.3-14. Let us translate the origin 0 to 0 and then rotate the
(x, y, z) axes until they coincide with the (x, y, z) axes. We consider first the rotation of axes when the
origins 0 and 0 coincide as the translational distance can be represented by a vector bk , k = 1, 2, 3. When
the origin 0 is translated to 0 we have the situation illustrated in the figure 1.3-15, where the barred axes
can be thought of as a transformation due to rotation.
Let
e1 + y b
~r = x b e2 + z b
e3 (1.3.37)

denote the position vector of a variable point P with coordinates (x, y, z) with respect to the origin 0 and the
e1 , b
unit vectors b e2 , b
e3 . This same point, when referenced with respect to the origin 0 and the unit vectors
eŐā1 , eŐā2 , eŐā3 , has the representation
~r = x eŐā1 + y eŐā2 + z eŐā3 . (1.3.38)

By considering the projections of ~r upon the barred and unbarred axes we can construct the transformation
equations relating the barred and unbarred axes. We calculate the projections of ~r onto the x, y and z axes
and find:
e1 = x = x( eŐā1 ¬∑ b
~r ¬∑ b e1 ) + y( eŐā2 ¬∑ b
e1 ) + z( eŐā3 ¬∑ b
e1 )
e2 = y = x( eŐā1 ¬∑ b
~r ¬∑ b e2 ) + y( eŐā2 ¬∑ b
e2 ) + z( eŐā3 ¬∑ b
e2 ) (1.3.39)
e3 = z = x( eŐā1 ¬∑ b
~r ¬∑ b e3 ) + y( eŐā2 ¬∑ b
e3 ) + z( eŐā3 ¬∑ b
e3 ).
We also calculate the projection of ~r onto the x, y, z axes and find:

~r ¬∑ eŐā1 = x = x( b
e1 ¬∑ eŐā1 ) + y( b
e2 ¬∑ eŐā1 ) + z( b
e3 ¬∑ eŐā1 )
~r ¬∑ eŐā2 = y = x( b
e1 ¬∑ eŐā2 ) + y( b
e2 ¬∑ eŐā2 ) + z( b
e3 ¬∑ eŐā2 ) (1.3.40)
~r ¬∑ eŐā3 = z = x( b
e1 ¬∑ eŐā3 ) + y( b
e2 ¬∑ eŐā3 ) + z( b
e3 ¬∑ eŐā3 ).

By introducing the notation (y1 , y2 , y3 ) = (x, y, z) (y 1 , y2 , y3 ) = (x, y, z) and defining őłij as the angle
between the unit vectors b
ei and eŐāj , we can represent the above transformation equations in a more concise
86

## form. We observe that the direction cosines can be written as

`11 = b
e1 ¬∑ eŐā1 = cos őł11 `12 = b
e1 ¬∑ eŐā2 = cos őł12 `13 = b
e1 ¬∑ eŐā3 = cos őł13
`21 = b
e2 ¬∑ eŐā1 = cos őł21 `22 = b
e2 ¬∑ eŐā2 = cos őł22 `23 = b
e2 ¬∑ eŐā3 = cos őł23 (1.3.41)
`31 = b
e3 ¬∑ eŐā1 = cos őł31 `32 = b
e3 ¬∑ eŐā2 = cos őł32 `33 = b
e3 ¬∑ eŐā3 = cos őł33

which enables us to write the equations (1.3.39) and (1.3.40) in the form

## Using the index notation we represent the unit vectors as:

eŐār = `pr b
ep or b
ep = `pr eŐār (1.3.43)

where `pr are the direction cosines. In both the barred and unbarred system the unit vectors are orthogonal
and consequently we must have the dot products

## eŐār ¬∑ eŐāp = őīrp and b

em ¬∑ b
en = őīmn (1.3.44)

where őīij is the Kronecker delta. Substituting equation (1.3.43) into equation (1.3.44) we find the direction
cosines `ij must satisfy the relations:

## eŐār ¬∑ eŐās = `pr b

ep ¬∑ `ms b
em = `pr `ms b
ep ¬∑ b
em = `pr `ms őīpm = `mr `ms = őīrs
and b
er ¬∑ b
es = `rm eŐām ¬∑ `sn eŐān = `rm `sn eŐām ¬∑ eŐān = `rm `sn őīmn = `rm `sm = őīrs .

The relations
`mr `ms = őīrs and `rm `sm = őīrs , (1.3.45)

with summation index m, are important relations which are satisfied by the direction cosines associated with
a rotation of axes.
Combining the rotation and translation equations we find

yi = `ij y j + bi . (1.3.46)
| {z } |{z}
rotation translation

We multiply this equation by `ik and make use of the relations (1.3.45) to find the inverse transformation

## These transformations are called linear or affine transformations.

Consider the xi axes as fixed, while the xi axes are rotating with respect to the xi axes where both sets
of axes have a common origin. Let A~ = Ai bei denote a vector fixed in and rotating with the xi axes. We

~
dA ~
dA
denote by and the derivatives of A~ with respect to the fixed (f) and rotating (r) axes. We can
dt f dt r
87

~
dA i
db db
write, with respect to the fixed axes, that = dA b ei + Ai
ei
. Note that
ei
is the derivative of a

dt f dt dt dt
vector with constant magnitude. Therefore there exists constants ŌČi , i = 1, . . . , 6 such that
dbe1 dbe2 dbe3
= ŌČ3 b
e2 ‚ąí ŌČ 2 b
e3 = ŌČ1 b
e3 ‚ąí ŌČ 4 b
e1 = ŌČ5 b
e1 ‚ąí ŌČ 6 b
e2
dt dt dt

## i.e. see page 80. From the dot product b

e1 ¬∑ b
e2 = 0 we obtain by differentiation b be2 + d be1 ¬∑ b
e1 ¬∑ ddt e2 = 0
dt
which implies ŌČ4 = ŌČ3 . Similarly, from the dot products b
e1 ¬∑ b
e3 and b
e2 ¬∑ b
e3 we obtain by differentiation the
~
additional relations ŌČ5 = ŌČ2 and ŌČ6 = ŌČ1 . The derivative of A with respect to the fixed axes can now be
represented

dA~ i ~
= dA b e + (ŌČ A ‚ąí ŌČ A ) b
e + (ŌČ A ‚ąí ŌČ A ) b
e + (ŌČ A ‚ąí ŌČ A ) b
e =
dA + ~ŌČ √ó A
~
dt f dt r
i 2 3 3 2 1 3 1 1 3 2 1 2 2 1 3
dt

where ~ŌČ = ŌČi bei is called an angular velocity vector of the rotating system. The term ~ŌČ √ó A ~ represents the
dA~ i
velocity of the rotating system relative to the fixed system and = dA b ei represents the derivative with
dt r dt
respect to the rotating system.
Employing the special transformation equations (1.3.46) let us examine how tensor quantities transform
when subjected to a translation and rotation of axes. These are our special transformation laws for Cartesian
tensors. We examine only the transformation laws for first and second order Cartesian tensor as higher order
transformation laws are easily discerned. We have previously shown that in general the first and second order
tensor quantities satisfy the transformation laws:
‚ąāyj
Ai = Aj (1.3.48)
‚ąāy i
i ‚ąāy
A = Aj i (1.3.49)
‚ąāyj
mn ‚ąāy ‚ąāy
A = Aij m n (1.3.50)
‚ąāyi ‚ąāyj
‚ąāyi ‚ąāyj
Amn = Aij (1.3.51)
‚ąāy m ‚ąāyn
m ‚ąāy ‚ąāyj
An = Aij m (1.3.52)
‚ąāyi ‚ąāy n
For the special case of Cartesian tensors we assume that yi and y i , i = 1, 2, 3 are linearly independent. We
differentiate the equations (1.3.46) and (1.3.47) and find
‚ąāyi ‚ąāyj ‚ąāy k ‚ąāyi
= `ij = `ij őījk = `ik , and = `ik = `ik őīim = `mk .
‚ąāyk ‚ąāyk ‚ąāym ‚ąāym
Substituting these derivatives into the transformation equations (1.3.48) through (1.3.52) we produce the
transformation equations
Ai = Aj `ji
i
A = Aj `ji
mn
A = Aij `im `jn
Amn = Aij `im `jn
m
An = Aij `im `jn .
88

## Figure 1.3-16. Transformation to curvilinear coordinates

These are the transformation laws when moving from one orthogonal system to another. In this case the
direction cosines `im are constants and satisfy the relations given in equation (1.3.45). The transformation
laws for higher ordered tensors are similar in nature to those given above.
In the unbarred system (y1 , y2 , y3 ) the metric tensor and conjugate metric tensor are:

## gij = őīij and g ij = őīij

where őīij is the Kronecker delta. In the barred system of coordinates, which is also orthogonal, we have

‚ąāym ‚ąāym
g ij = .
‚ąāy i ‚ąāyj

## We examine the associated tensors

Ai = g ij Aj Ai = gij Aj
Aij = g im g jn Amn Amn = gmi gnj Aij
Ain = g im Amn Ain = gnj Aij

and find that the contravariant and covariant components are identical to one another. This holds also in
the barred system of coordinates. Also note that these special circumstances allow the representation of
contractions using subscript quantities only. This type of a contraction is not allowed for general tensors. It
is left as an exercise to try a contraction on a general tensor using only subscripts to see what happens. Note
that such a contraction does not produce a tensor. These special situations are considered in the exercises.

Physical Components

## ~ can be represented in many forms depending upon

We have previously shown an arbitrary vector A
the coordinate system and basis vectors selected. For example, consider the figure 1.3-16 which illustrates a
Cartesian coordinate system and a curvilinear coordinate system.
89

## Figure 1.3-17. Physical components

~ as
In the Cartesian coordinate system we can represent a vector A

~ = Ax b
A e1 + Ay b
e2 + Az b
e3

where ( b
e1 , b
e2 , b
e3 ) are the basis vectors. Consider a coordinate transformation to a more general coordinate
system, say (x , x2 , x3 ). The vector A
1 ~ can be represented with contravariant components as

~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~3 (1.3.53)

## with respect to the tangential basis vectors (E ~ 2, E

~ 1, E ~ 3 ). Alternatively, the same vector A
~ can be represented
in the form
~ 1 + A2 E
~ = A1 E
A ~ 2 + A3 E
~3 (1.3.54)

~ 1, E
having covariant components with respect to the gradient basis vectors (E ~ 2, E
~ 3 ). These equations are
just different ways of representing the same vector. In the above representations the basis vectors need not
be orthogonal and they need not be unit vectors. In general, the physical dimensions of the components Ai
and Aj are not the same.
The physical components of the vector A ~ in a direction is defined as the projection of A ~ upon a unit
vector in the desired direction. For example, the physical component of A ~ in the direction E
~ 1 is

~
~ ¬∑ E1 = A1 = projection of A
A ~ on E
~ 1. (1.3.58)
~ 1|
|E ~ 1|
|E

~ 1 is
~ in the direction E
Similarly, the physical component of A

~1 1
~ ¬∑ E = A = projection of A
A ~ 1.
~ on E (1.3.59)
~ 1|
|E ~ 1|
|E

EXAMPLE 1.3-11. (Physical components) Let őĪ, ő≤, ő≥ denote nonzero positive constants such that the
product relation őĪő≥ = 1 is satisfied. Consider the nonorthogonal basis vectors

~1 = őĪ b
E e1 , ~2 = ő≤ b
E e1 + ő≥ b
e2 , ~3 = b
E e3

90

## It is readily verified that the reciprocal basis is

~1 = ő≥ b
E e1 ‚ąí ő≤ b
e2 , ~2 = őĪb
E e2 , ~3 = b
E e3 .

~ = Ax b
Consider the problem of representing the vector A e1 + Ay b
e2 in the contravariant vector form

~ = A1 E
A ~ 1 + A2 E
~2 or tensor form Ai , i = 1, 2.

## This vector has the contravariant components

A1 = A ~ 1 = ő≥Ax ‚ąí ő≤Ay
~ ¬∑E and A2 = A ~ 2 = őĪAy .
~ ¬∑E

## Alternatively, this same vector can be represented as the covariant vector

A ~ 1 + A2 E
~ = A1 E ~2 which has the tensor form Ai , i = 1, 2.

## The covariant components are found from the relations

~ ¬∑E
A1 = A ~ 1 = őĪAx ~ ¬∑E
A2 = A ~ 2 = ő≤Ax + ő≥Ay .

~ 1 and E
~ in the directions E
The physical components of A ~ 2 are found to be:

~1 1
x ‚ąí ő≤Ay
~ ¬∑ E = A = ő≥A
A p = A(1)
~ 1
|E | ~ 1
|E | ő≥2 + ő≤2
~2 2
~ ¬∑ E = A = őĪAy = Ay = A(2).
A
~ 2|
|E ~ 2|
|E őĪ

~
Note that these same results are obtained from the dot product relations using either form of the vector A.
For example, we can write

~1 ~1 ~1 ~2 ~1
~ ¬∑ E = A1 (E ¬∑ E ) + A2 (E ¬∑ E ) = A(1)
A
~ 1|
|E |E~ 1|
~2 ~1 ~2 ~2 ~2
and ~ ¬∑ E = A1 (E ¬∑ E ) + A2 (E ¬∑ E ) = A(2).
A
~ 2|
|E |E~ 2|

## ~ in a direction of a unit vector őĽi is the generalized

In general, the physical components of a vector A
dot product in VN . This dot product is an invariant and can be expressed

~ in direction of őĽi
gij Ai őĽj = Ai őĽi = Ai őĽi = projection of A
91

## In orthogonal coordinates observe the element of arc length squared in V3 is

ds2 = gij dxi dxj = (h1 )2 (dx1 )2 + (h2 )2 (dx2 )2 + (h3 )2 (dx3 )2

where Ô£ę Ô£∂
(h1 )2 0 0
gij = Ô£≠ 0 (h2 )2 0 Ô£ł. (1.3.60)
0 0 (h3 )2
In this case the curvilinear coordinates are orthogonal and

## h2(i) = g(i)(i) i not summed and gij = 0, i 6= j.

At an arbitrary point in this coordinate system we take őĽi , i = 1, 2, 3 as a unit vector in the direction
of the coordinate x1 . We then obtain

dx1
őĽ1 = , őĽ2 = 0, őĽ3 = 0.
ds

## This is a unit vector since

1 = gij őĽi őĽj = g11 őĽ1 őĽ1 = h21 (őĽ1 )2
1
or őĽ1 = h1 . Here the curvilinear coordinate system is orthogonal and in this case the physical component
of a vector Ai , in the direction xi , is the projection of Ai on őĽi in V3 . The projection in the x1 direction is
determined from
1
A(1) = gij Ai őĽj = g11 A1 őĽ1 = h21 A1 = h1 A1 .
h1
Similarly, we choose unit vectors ¬Ķi and őĹ i , i = 1, 2, 3 in the x2 and x3 directions. These unit vectors
can be represented
¬Ķ1 =0, dx2 1 ¬Ķ3 =0
¬Ķ2 = = ,
ds h2 dx3 1
őĹ 1 =0, őĹ 2 =0, őĹ3 = =
ds h3
and the physical components of the vector Ai in these directions are calculated as

## A(2) = h2 A2 and A(3) = h3 A3 .

In summary, we can say that in an orthogonal coordinate system the physical components of a contravariant
tensor of order one can be determined from the equations

‚ąö
A(i) = h(i) A(i) = g(i)(i) A(i) , i = 1, 2 or 3 no summation on i,

which is a short hand notation for the physical components (h1 A1 , h2 A2 , h3 A3 ). In an orthogonal coordinate
system the nonzero conjugate metric components are

1
g (i)(i) = , i = 1, 2, or 3 no summation on i.
g(i)(i)
92

These components are needed to calculate the physical components associated with a covariant tensor of
order one. For example, in the x1 ‚ąídirection, we have the covariant components
1
őĽ1 = g11 őĽ1 = h21 = h1 , őĽ2 = 0, őĽ3 = 0
h1
and consequently the projection in V3 can be represented
1 A1
gij Ai őĽj = gij Ai g jm őĽm = Aj g jm őĽm = A1 őĽ1 g 11 = A1 h1 = = A(1).
h21 h1

## In a similar manner we calculate the relations

A2 A3
A(2) = and A(3) =
h2 h3

for the other physical components in the directions x2 and x3 . These physical components can be represented
in the short hand notation
A(i) A(i)
A(i) = =‚ąö , i = 1, 2 or 3 no summation on i.
h(i) g(i)(i)

In an orthogonal coordinate system the physical components associated with both the contravariant and
covariant components are the same. To show this we note that when Ai gij = Aj is summed on i we obtain

## Another form for this equation is

‚ąö A(i)
A(i) = A(i) g(i)(i) = ‚ąö i not summed,
g(i)(i)

which demonstrates that the physical components associated with the contravariant and covariant compo-
nents are identical.
NOTATION The physical components are sometimes expressed by symbols with subscripts which represent
the coordinate curve along which the projection is taken. For example, let H i denote the contravariant
components of a first order tensor. The following are some examples of the representation of the physical
components of H i in various coordinate systems:
orthogonal coordinate tensor physical
coordinates system components components

## general (x1 , x2 , x3 ) Hi H(1), H(2), H(3)

i
rectangular (x, y, z) H Hx , Hy , Hz
i
cylindrical (r, őł, z) H Hr , Hőł , Hz
i
spherical (ŌĀ, őł, ŌÜ) H HŌĀ , Hőł , HŌÜ
i
general (u, v, w) H Hu , Hv , Hw
93

## Higher Order Tensors

The physical components associated with higher ordered tensors are defined by projections in VN just
like the case with first order tensors. For an nth ordered tensor Tij...k we can select n unit vectors őĽi , ¬Ķi , . . . , őĹ i
and form the inner product (projection)
Tij...k őĽi ¬Ķj . . . őĹ k .

When projecting the tensor components onto the coordinate curves, there are N choices for each of the unit
vectors. This produces N n physical components.
The above inner product represents the physical component of the tensor Tij...k along the directions of
the unit vectors őĽi , ¬Ķi , . . . , őĹ i . The selected unit vectors may or may not be orthogonal. In the cases where
the selected unit vectors are all orthogonal to one another, the calculation of the physical components is
greatly simplified. By relabeling the unit vectors őĽi(m) , őĽi(n) , . . . , őĽi(p) where (m), (n), ..., (p) represent one of
the N directions, the physical components of a general nth order tensor is represented

## EXAMPLE 1.3-12. (Physical components)

In an orthogonal curvilinear coordinate system V3 with metric gij , i, j = 1, 2, 3, find the physical com-
ponents of
(i) the second order tensor Aij . (ii) the second order tensor Aij . (iii) the second order tensor Aij .
Solution: The physical components of Amn , m, n = 1, 2, 3 along the directions of two unit vectors őĽi and
¬Ķi is defined as the inner product in V3 . These physical components can be expressed

A(ij) = Amn őĽm n
(i) ¬Ķ(j) i, j = 1, 2, 3,

where the subscripts (i) and (j) represent one of the coordinate directions. Dropping the subscripts (i) and
(j), we make the observation that in an orthogonal curvilinear coordinate system there are three choices for
the direction of the unit vector őĽi and also three choices for the direction of the unit vector ¬Ķi . These three
choices represent the directions along the x1 , x2 or x3 coordinate curves which emanate from a point of the
curvilinear coordinate system. This produces a total of nine possible physical components associated with
the tensor Amn .
For example, we can obtain the components of the unit vector őĽi , i = 1, 2, 3 in the x1 direction directly
from an examination of the element of arc length squared

## By setting dx2 = dx3 = 0, we find

dx1 1
= = őĽ1 , őĽ2 = 0, őĽ3 = 0.
ds h1

This is the vector őĽi(1) , i = 1, 2, 3. Similarly, if we choose to select the unit vector őĽi , i = 1, 2, 3 in the x2
direction, we set dx1 = dx3 = 0 in the element of arc length squared and find the components

dx2 1
őĽ1 = 0, őĽ2 = = , őĽ3 = 0.
ds h2
94

This is the vector őĽi(2) , i = 1, 2, 3. Finally, if we select őĽi , i = 1, 2, 3 in the x3 direction, we set dx1 = dx2 = 0
in the element of arc length squared and determine the unit vector

dx3 1
őĽ1 = 0, őĽ2 = 0, őĽ3 = = .
ds h3

This is the vector őĽi(3) , i = 1, 2, 3. Similarly, the unit vector ¬Ķi can be selected as one of the above three
directions. Examining all nine possible combinations for selecting the unit vectors, we calculate the physical
components in an orthogonal coordinate system as:

## A11 A12 A13

A(11) = A(12) = A(13) =
h1 h1 h1 h2 h1 h3
A21 A22 A23
A(21) = A(22) = A(23) =
h1 h2 h2 h2 h2 h3
A31 A32 A33
A(31) = A(32) = A(33) =
h3 h1 h3 h2 h3 h3

## These results can be written in the more compact form

A(i)(j)
A(ij) = no summation on i or j . (1.3.61)
h(i) h(j)

## Aij = g im Amj = g i1 A1j + g i2 A2j + g i3 A3j . (1.3.62)

From the fact g ij = 0 for i 6= j, together with the physical components from equation (1.3.61), the equation
(1.3.62) reduces to

(i) 1
A(j) = g (i)(i) A(i)(j) = ¬∑ h(i) h(j) A(ij) no summation on i and i, j = 1, 2 or 3.
h2(i)

## This can also be written in the form

(i) h(i)
A(ij) = A(j) no summation on i or j. (1.3.63)
h(j)

Hence, the physical components associated with the mixed tensor Aij in an orthogonal coordinate system
can be expressed as
A(11) = A11 h1 h1
A(12) = A12 A(13) = A13
h2 h2 h3
A(21) = A21 A(22) = A22 h2
h1 A(23) = A23
h3 h3 h3
A(31) = A31 A(32) = A32 A(33) = A33 .
h1 h2
For second order contravariant tensors we may write

## Aij gjm = Aim = Ai1 g1m + Ai2 g2m + Ai3 g3m .

95

We use the fact gij = 0 for i 6= j together with the physical components from equation (1.3.63) to reduce the
(i)
above equation to the form A(m) = A(i)(m) g(m)(m) no summation on m . In terms of physical components
we have

h(m)
A(im) = A(i)(m) h2(m) or A(im) = A(i)(m) h(i) h(m) . no summation i, m = 1, 2, 3 (1.3.64)
h(i)

Examining the results from equation (1.3.64) we find that the physical components associated with the
contravariant tensor Aij , in an orthogonal coordinate system, can be written as:

## A(11) = A11 h1 h1 A(12) = A12 h1 h2 A(13) = A13 h1 h3

A(21) = A21 h2 h1 A(22) = A22 h2 h2 A(23) = A23 h2 h3
A(31) = A31 h3 h1 A(32) = A32 h3 h2 A(33) = A33 h3 h3 .

## Physical Components in General

In an orthogonal curvilinear coordinate system, the physical components associated with the nth order
tensor Tij...kl along the curvilinear coordinate directions can be represented:

T(i)(j)...(k)(l)
T (ij . . . kl) = no summations.
h(i) h(j) . . . h(k) h(l)

These physical components can be related to the various tensors associated with Tij...kl . For example, in
ij...m
an orthogonal coordinate system, the physical components associated with the mixed tensor Tn...kl can be
expressed as:
(i)(j)...(m) h(i) h(j) . . . h(m)
T (ij . . . m n . . . kl) = T(n)...(k)(l) no summations. (1.3.65)
h(n) . . . h(k) h(l)
EXAMPLE 1.3-13. (Physical components) Let xi = xi (t), i = 1, 2, 3 denote the position vector of a
particle which moves as a function of time t. Assume there exists a coordinate transformation xi = xi (x), for
i = 1, 2, 3, of the form given by equations (1.2.33). The position of the particle when referenced with respect
to the barred system of coordinates can be found by substitution. The generalized velocity of the particle
in the unbarred system is a vector with components

dxi
vi = , i = 1, 2, 3.
dt

The generalized velocity components of the same particle in the barred system is obtained from the chain
rule. We find this velocity is represented by

vi = = j
= v .
dt ‚ąāx dt ‚ąāxj

## dx1 dx2 dx3

(v 1 , v 2 , v 3 ) = ( , , )
dt dt dt
96

are tensor quantities. These quantities are called the components of the generalized velocity. The coordinates
x1 , x2 , x3 are generalized coordinates. This means we can select any set of three independent variables for
the representation of the motion. The variables selected might not have the same dimensions. For example,
in cylindrical coordinates we let (x1 = r, x2 = őł, x3 = z). Here x1 and x3 have dimensions of distance but x2
has dimensions of angular displacement. The generalized velocities are

## dx1 dr dx2 dőł dx3 dz

v1 = = , v2 = = , v3 = = .
dt dt dt dt dt dt

Here v 1 and v 3 have units of length divided by time while v 2 has the units of angular velocity or angular
change divided by time. Clearly, these dimensions are not all the same. Let us examine the physical
components of the generalized velocities. We find in cylindrical coordinates h1 = 1, h2 = r, h3 = 1 and the
physical components of the velocity have the forms:

dr dőł dz
vr = v(1) = v 1 h1 = , vőł = v(2) = v 2 h2 = r , vz = v(3) = v 3 h3 = .
dt dt dt

Now the physical components of the velocity all have the same units of length divided by time.

Additional examples of the use of physical components are considered later. For the time being, just
remember that when tensor equations are derived, the equations are valid in any generalized coordinate
system. In particular, we are interested in the representation of physical laws which are to be invariant and
independent of the coordinate system used to represent these laws. Once a tensor equation is derived, we
can chose any type of generalized coordinates and expand the tensor equations. Before using any expanded
tensor equations we must replace all the tensor components by their corresponding physical components in
order that the equations are dimensionally homogeneous. It is these expanded equations, expressed in terms
of the physical components, which are used to solve applied problems.

## Tensors and Multilinear Forms

Tensors can be thought of as being created by multilinear forms defined on some vector space V. Let
us define on a vector space V a linear form, a bilinear form and a general multilinear form. We can then
illustrate how tensors are created from these forms.

## Definition: (Linear form) Let V denote a vector space which

contains vectors ~x, ~x1 , ~x2 , . . . . A linear form in ~x is a scalar function
Ōē(~x) having a single vector argument ~x which satisfies the linearity
properties:

## (i) Ōē(~x1 + ~x2 ) = Ōē(~x1 ) + Ōē(~x2 )

(1.3.66)
(ii) Ōē(¬Ķ~x1 ) = ¬ĶŌē(~x1 )

for all arbitrary vectors ~x1 , ~x2 in V and all real numbers ¬Ķ.
97

## An example of a linear form is the dot product relation

~ ¬∑ ~x
Ōē(~x) = A (1.3.67)

## ~ is a constant vector and ~x is an arbitrary vector belonging to the vector space V.

where A
Note that a linear form in ~x can be expressed in terms of the components of the vector ~x and the base
e1 , b
vectors ( b e2 , b
e3 ) used to represent ~x. To show this, we write the vector ~x in the component form

ei = x1 b
~x = xi b e1 + x2 b
e2 + x3 b
e3 ,

## where xi , i = 1, 2, 3 are the components of ~x with respect to the basis vectors ( b

e1 , b
e2 , b
e3 ). By the linearity
property of Ōē we can write
ei ) = Ōē(x1 b
Ōē(~x) = Ōē(xi b e1 + x2 b
e2 + x3 b
e3 )
= Ōē(x1 b
e1 ) + Ōē(x2 b
e2 ) + Ōē(x3 b
e3 )
= x1 Ōē( b
e1 ) + x2 Ōē( b
e2 ) + x3 Ōē( b
e3 ) = xi Ōē( b
ei )

## Thus we can write Ōē(~x) = xi Ōē( b

ei ) and by defining the quantity Ōē( b ei ) = ai as a tensor we obtain Ōē(~x) = xi ai .
Note that if we change basis from ( b e1 , b
e2 , b ~ 1, E
e3 ) to (E ~ 2, E
~ 3 ) then the components of ~x also must change.
Letting xi denote the components of ~x with respect to the new basis, we would have

~i
~x = xi E ~ i ) = xi Ōē(E
and Ōē(~x) = Ōē(xi E ~ i ).

The linear form Ōē defines a new tensor ai = Ōē(E ~ i ) so that Ōē(~x) = xi ai . Whenever there is a definite relation
e1 , b
between the basis vectors ( b e2 , b ~ 1, E
e3 ) and (E ~ 2, E
~ 3 ), say,
j
~ i = ‚ąāx b
E ej ,
‚ąāxi
then there exists a definite relation between the tensors ai and ai . This relation is
‚ąāxj ‚ąāxj ‚ąāxj
~ i ) = Ōē(
ai = Ōē(E b
i ej ) =
b
i Ōē( ej ) = aj .
‚ąāx ‚ąāx ‚ąāxi
This is the transformation law for an absolute covariant tensor of rank or order one.
The above idea is now extended to higher order tensors.

## Definition: ( Bilinear form) A bilinear form in ~x and ~y is a

scalar function Ōē(~x, ~y) with two vector arguments, which satisfies
the linearity properties:

## (i) Ōē(~x1 + ~x2 , ~y1 ) = Ōē(~x1 , ~y1 ) + Ōē(~x2 , ~y1 )

(ii) Ōē(~x1 , ~y1 + ~y2 ) = Ōē(~x1 , ~y1 ) + Ōē(~x1 , ~y2 )
(1.3.68)
(iii) Ōē(¬Ķ~x1 , ~y1 ) = ¬ĶŌē(~x1 , ~y1 )
(iv) Ōē(~x1 , ¬Ķ~y1 ) = ¬ĶŌē(~x1 , ~y1 )

for arbitrary vectors ~x1 , ~x2 , ~y1 , ~y2 in the vector space V and for all
real numbers ¬Ķ.
98

Note in the definition of a bilinear form that the scalar function Ōē is linear in both the arguments ~x and
~y . An example of a bilinear form is the dot product relation

## where both ~x and ~y belong to the same vector space V.

The definition of a bilinear form suggests how multilinear forms can be defined.

## Definition: (Multilinear forms) A multilinear form of degree M or a M degree

linear form in the vector arguments

## ~x1 , ~x2 , . . . , ~xM

is a scalar function
Ōē(~x1 , ~x2 , . . . , ~xM )

of M vector arguments which satisfies the property that it is a linear form in each of its
arguments. That is, Ōē must satisfy for each j = 1, 2, . . . , M the properties:

(i) Ōē(~x1 , . . . , ~xj1 + ~xj2 , . . . ~xM ) = Ōē(~x1 , . . . , ~xj1 , . . . , ~xM ) + Ōē(~x1 , . . . , ~xj2 , . . . , ~xM )
(ii) Ōē(~x1 , . . . , ¬Ķ~xj , . . . , ~xM ) = ¬ĶŌē(~x1 , . . . , ~xj , . . . , ~xM )
(1.3.70)
for all arbitrary vectors ~x1 , . . . , ~xM in the vector space V and all real numbers ¬Ķ.

An example of a third degree multilinear form or trilinear form is the triple scalar product

## Ōē(~x, ~y , ~z) = ~x ¬∑ (~y √ó ~z). (1.3.71)

Note that multilinear forms are independent of the coordinate system selected and depend only upon the
e1 , b
vector arguments. In a three dimensional vector space we select the basis vectors ( b e2 , b
e3 ) and represent
all vectors with respect to this basis set. For example, if ~x, ~y , ~z are three vectors we can represent these
vectors in the component forms

~x = xi b
ei , ~y = y j b
ej , ~z = z k b
ek (1.3.72)

where we have employed the summation convention on the repeated indices i, j and k. Substituting equations
(1.3.72) into equation (1.3.71) we obtain

Ōē(xi b
ei , y j b
ej , z k b
ek ) = xi y j z k Ōē( b
ei , b
ej , b
ek ), (1.3.73)

## since Ōē is linear in all its arguments. By defining the tensor quantity

ei , b
Ōē( b ej , b
ek ) = eijk (1.3.74)
99

(See exercise 1.1, problem 15) the trilinear form, given by equation (1.3.71), with vectors from equations
(1.3.72), can be expressed as
Ōē(~x, ~y , ~z) = eijk xi y j z k , i, j, k = 1, 2, 3. (1.3.75)
The coefficients eijk of the trilinear form is called a third order tensor. It is the familiar permutation symbol
considered earlier.
In a multilinear form of degree M , Ōē(~x, ~y , . . . , ~z), the M arguments can be represented in a component
e1 , b
form with respect to a set of basis vectors ( b e2 , b
e3 ). Let these vectors have components xi , y i , z i , i = 1, 2, 3
with respect to the selected basis vectors. We then can write

~x = xi b
ei , ~y = y j b
ej , ~z = z k b
ek .

## Substituting these vectors into the M degree multilinear form produces

Ōē(xi b
ei , y j b
ej , . . . , z k b
ek ) = xi y j ¬∑ ¬∑ ¬∑ z k Ōē( b
ei , b
ej , . . . , b
ek ). (1.3.76)

## Consequently, the multilinear form defines a set of coefficients

aij...k = Ōē( b
ei , b
ej , . . . , b
ek ) (1.3.77)

which are referred to as the components of a tensor of order M. The tensor is thus created by the multilinear
form and has M indices if Ōē is of degree M.
Note that if we change to a different set of basis vectors, say, (E ~ 2, E
~ 1, E ~ 3 ) the multilinear form defines
a new tensor
~i, E
aij...k = Ōē(E ~j, . . . , E
~ k ). (1.3.78)
This new tensor has a bar over it to distinguish it from the previous tensor. A definite relation exists between
the new and old basis vectors and consequently there exists a definite relation between the components of
the barred and unbarred tensors components. Recall that if we are given a set of transformation equations

y i = y i (x1 , x2 , x3 ), i = 1, 2, 3, (1.3.79)

from rectangular to generalized curvilinear coordinates, we can express the basis vectors in the new system
by the equations
j
E~ i = ‚ąāy bej , i = 1, 2, 3. (1.3.80)
‚ąāxi
For example, see equations (1.3.11) with y 1 = x, y 2 = y, y 3 = z, x1 = u, x2 = v, x3 = w. Substituting
equations (1.3.80) into equations (1.3.78) we obtain
‚ąāy őĪ ‚ąāy ő≤ ‚ąāy ő≥
b
eőĪ , j b
aij...k = Ōē(
i
eő≤ , . . . , k b eő≥ ).
‚ąāx ‚ąāx ‚ąāx
By the linearity property of Ōē, this equation is expressible in the form
‚ąāy őĪ ‚ąāy ő≤ ‚ąāy ő≥
aij...k =
i j
. . . k Ōē( b eőĪ , b eő≤ , . . . , b
eő≥ )
‚ąāx ‚ąāx ‚ąāx
‚ąāy őĪ ‚ąāy ő≤ ‚ąāy ő≥
aij...k = . . . aőĪő≤...ő≥
‚ąāxi ‚ąāxj ‚ąāxk
This is the familiar transformation law for a covariant tensor of degree M. By selecting reciprocal basis
vectors the corresponding transformation laws for contravariant vectors can be determined.
The above examples illustrate that tensors can be considered as quantities derivable from multilinear
forms defined on some vector space.
100

Dual Tensors
The e-permutation symbol is often used to generate new tensors from given tensors. For Ti1 i2 ...im a
skew-symmetric tensor, we define the tensor
1 j1 j2 ...jn‚ąím i1 i2 ...im
TŐā j1 j2 ...jn‚ąím = e Ti1 i2 ...im m‚Č§n (1.3.81)
m!
as the dual tensor associated with Ti1 i2 ...im . Note that the e-permutation symbol or alternating tensor has
a weight of +1 and consequently the dual tensor will have a higher weight than the original tensor.
The e-permutation symbol has the following properties

## ei1 i2 ...iN ei1 i2 ...iN = N !

ei1 i2 ...iN ej1 j2 ...jN = őīji11 ij22...iN
...jN
(1.3.82)
ek1 k2 ...km i1 i2 ...iN ‚ąím ej1 j2 ...jm i1 i2 ...iN ‚ąím = (N ‚ąí m)!őīkj11jk22...jm
...km

őīkj11jk22...jm
...km Tj1 j2 ...jm = m!Tk1 k2 ...km .

Using the above properties we can solve for the skew-symmetric tensor in terms of the dual tensor. We find
1
Ti1 i2 ...im = ei i ...i j j ...j TŐā j1 j2 ...jn‚ąím . (1.3.83)
(n ‚ąí m)! 1 2 m 1 2 n‚ąím
For example, if Aij i, j = 1, 2, 3 is a skew-symmetric tensor, we may associate with it the dual tensor
1 ijk
Vi = e Ajk ,
2!
which is a first order tensor or vector. Note that Aij has the components
Ô£ę Ô£∂
0 A12 A13
Ô£≠ ‚ąíA12 0 A23 Ô£ł (1.3.84)
‚ąíA13 ‚ąíA23 0

~ are
and consequently, the components of the vector V

## (V 1 , V 2 , V 3 ) = (A23 , A31 , A12 ). (1.3.85)

Note that the vector components have a cyclic order to the indices which comes from the cyclic properties
of the e-permutation symbol.
As another example, consider the fourth order skew-symmetric tensor Aijkl , i, j, k, l = 1, . . . , n. We can
associate with this tensor any of the dual tensor quantities
1 ijkl
V = e Aijkl
4!
1
V i = eijklm Ajklm
4!
1
V ij = eijklmn Aklmn (1.3.86)
4!
1
V ijk = eijklmnp Almnp
4!
1
V ijkl = eijklmnpr Amnpr
4!
Applications of dual tensors can be found in section 2.2.
101

EXERCISE 1.3

I 1.
‚ąāxa ‚ąāxb
(a) From the transformation law for the second order tensor g ij = gab
‚ąāxi ‚ąāxj
solve for the gab in terms of gij .
(b) Show that if gij is symmetric in one coordinate system it is symmetric in all coordinate systems.
p ‚ąö x
(c) Let g = det(gij ) and g = det(gij ) and show that g = gJ 2 ( xx ) and consequently g = gJ( ). This
‚ąö x
shows that g is a scalar invariant of weight 2 and g is a scalar invariant of weight 1.

I 2. For
‚ąāy m ‚ąāy m ‚ąāxi ‚ąāxj
gij = show that g ij =
‚ąāxi ‚ąāxj ‚ąāy m ‚ąāy m

## (a) g = det(gij ) = g11 g22 g33

(b) gmn = g mn = 0 for m 6= n
1
(c) gN N = for N = 1, 2, 3 (no summation on N)
gN N

i 2
‚ąāy
I 4. Show that g = det(gij ) = j = J 2 , where J is the Jacobian.
‚ąāx

## ‚ąā~r ‚ąā~r ‚ąā~r

I 5. Define the quantities h1 = hu = | |, h2 = h v = | |, h3 = h w = | | and construct the unit
‚ąāu ‚ąāv ‚ąāw
vectors
1 ‚ąā~r 1 ‚ąā~r 1 ‚ąā~r
b
eu = , b
ev = , b
ew = .
h1 ‚ąāu h2 ‚ąāv h3 ‚ąāw
(a) Assume the coordinate system is orthogonal and show that
 2  2  2
‚ąāx ‚ąāy ‚ąāz
g11 = h21 = + + ,
‚ąāu ‚ąāu ‚ąāu
 2  2
2 
‚ąāx ‚ąāy ‚ąāz
g22 = h22 = + + ,
‚ąāv ‚ąāv ‚ąāv
 2  2  2
‚ąāx ‚ąāy ‚ąāz
g33 = h23 = + + .
‚ąāw ‚ąāw ‚ąāw

## (b) Show that d~r can be expressed in the form d~r = h1 b

eu du + h2 b
ev dv + h3 b
ew dw.

(c) Show that the volume of the elemental parallelepiped having d~r as diagonal can be represented

‚ąö ‚ąā(x, y, z)
dŌĄ = g dudvdw = J dudvdw = dudvdw.
‚ąā(u, v, w)
Hint:
A1 A2 A3

|A ¬∑ (B √ó C)| = B1
~ ~ ~ B2 B3
C1 C2 C3
102

## Figure 1.3-18 Oblique cylindrical coordinates.

I 6. For the change d~r given inqproblem 5, show the elemental parallelepiped with diagonal d~r has:
(a) the element of area dS1 = g22 g33 ‚ąí g232 dvdw in the u =constant surface.

q
(b) The element of area dS2 = 2 dudw in the v =constant surface.
g33 g11 ‚ąí g13
q
(c) the element of area dS3 = 2 dudv in the w =constant surface.
g11 g22 ‚ąí g12

## (d) What do the above elements

q of area reduce to in the special case the curvilinear coordinates are orthog-
~ √ó B|
|A ~ = (A ~ √ó B)
~ ¬∑ (A ~ √ó B)
~
onal? Hint: q .
= (A ~ ¬∑ A)(
~ B ~ ¬∑ B)
~ ‚ąí (A ~ ¬∑ B)(
~ A ~ ¬∑ B)
~

I 7. In Cartesian coordinates you are given the affine transformation. xi = `ij xj where

1 1 1
x1 = (5x1 ‚ąí 14x2 + 2x3 ), x2 = ‚ąí (2x1 + x2 + 2x3 ), x3 = (10x1 + 2x2 ‚ąí 11x3 )
15 3 15

## (a) Show the transformation is orthogonal.

~ 1 , x2 , x3 ) in the unbarred system has the components
(b) A vector A(x

## Find the components of this vector in the barred system of coordinates.

I 8. Calculate the metric and conjugate metric tensors in cylindrical coordinates (r, őł, z).
I 9. Calculate the metric and conjugate metric tensors in spherical coordinates (ŌĀ, őł, ŌÜ).
I 10. Calculate the metric and conjugate metric tensors in parabolic cylindrical coordinates (őĺ, ő∑, z).
I 11. Calculate the metric and conjugate metric components in elliptic cylindrical coordinates (őĺ, ő∑, z).
I 12. Calculate the metric and conjugate metric components for the oblique cylindrical coordinates (r, ŌÜ, ő∑),
illustrated in figure 1.3-18, where x = r cos ŌÜ, y = r sin ŌÜ + ő∑ cos őĪ, z = ő∑ sin őĪ and őĪ is a parameter
0<őĪ‚Č§ ŌÄ
2. Note: When őĪ = ŌÄ
2 cylindrical coordinates result.
103

I 13. Calculate the metric and conjugate metric tensor associated with the toroidal surface coordinates
(őĺ, ő∑) illustrated in the figure 1.3-19, where

## x = (a + b cos őĺ) cos ő∑ a>b>0

y = (a + b cos őĺ) sin ő∑ 0 < őĺ < 2ŌÄ
z = b sin őĺ 0 < ő∑ < 2ŌÄ

## Figure 1.3-19. Toroidal surface coordinates

I 14. Calculate the metric and conjugate metric tensor associated with the spherical surface coordinates
(őł, ŌÜ), illustrated in the figure 1.3-20, where

## y = a sin őł sin ŌÜ 0 < ŌÜ < 2ŌÄ

ŌÄ
z = a cos őł 0<őł<
2

## I 15. Consider gij , i, j = 1, 2

g22 ‚ąíg12 g11
(a) Show that g 11 = , g 12 = g 21 = , g 22 = where ‚ąÜ = g11 g22 ‚ąí g12 g21 .
‚ąÜ ‚ąÜ ik k
‚ąÜ
(b) Use the results in part (a) and verify that gij g = őīj , i, j, k = 1, 2.

I 16. Let Ax , Ay , Az denote the constant components of a vector in Cartesian coordinates. Using the
transformation laws (1.2.42) and (1.2.47) to find the contravariant and covariant components of this vector
upon changing to (a) cylindrical coordinates (r, őł, z). (b) spherical coordinates (ŌĀ, őł, ŌÜ) and (c) Parabolic
cylindrical coordinates.

I 17. Find the relationship which exists between the given associated tensors.

(a) Apqk
r. and Apq
rs (c) Ai.j.
.l.m and A.s.p
r.t.

## (b) Ap.mrs and Apq

..rs (d) Amnk and Aij
..k
104

## Figure 1.3-20. Spherical surface coordinates

I 18. Given the fourth order tensor Cikmp = őĽőīik őīmp + ¬Ķ(őīim őīkp + őīip őīkm ) + őĹ(őīim őīkp ‚ąí őīip őīkm ) where őĽ, ¬Ķ
and őĹ are scalars and őīij is the Kronecker delta. Show that under an orthogonal transformation of rotation of
axes with xi = `ij xj where `rs `is = `mr `mi = őīri the components of the above tensor are unaltered. Any
tensor whose components are unaltered under an orthogonal transformation is called an ‚Äėisotropic‚Äô tensor.
Another way of stating this problem is to say ‚ÄúShow Cikmp is an isotropic tensor.‚ÄĚ

I 19. Assume Aijl is a third order covariant tensor and B pqmn is a fourth order contravariant tensor. Prove
that Aikl B klmn is a mixed tensor of order three, with one covariant and two contravariant indices.

I 20. Assume that Tmnrs is an absolute tensor. Show that if Tijkl + Tijlk = 0 in the coordinate system xr
then T ijkl + T ijlk = 0 in any other coordinate system xr .

## I 21. Show that

gir gis git

ijk rst = gjr gjs gjt
gkr gks gkt
Hint: See problem 38, Exercise 1.1

I 22. Determine if the tensor equation mnp mij + mnj mpi = mni mpj is true or false. Justify your answer.

I 23. Prove the epsilon identity g ij ipt jrs = gpr gts ‚ąí gps gtr . Hint: See problem 38, Exercise 1.1

1
I 24. Let Ars denote a skew-symmetric contravariant tensor and let cr = rmn Amn where
‚ąö 2
rmn = germn . Show that cr are the components of a covariant tensor. Write out all the components.

1 rmn 1
I 25. Let Ars denote a skew-symmetric covariant tensor and let cr = Amn where rmn = ‚ąö ermn .
2 g
Show that cr are the components of a contravariant tensor. Write out all the components.
105

## I 26. Let Apq Brqs = Cpr

s
where Brqs is a relative tensor of weight ŌČ1 and Cpr
s
is a relative tensor of weight
ŌČ2 . Prove that Apq is a relative tensor of weight (ŌČ2 ‚ąí ŌČ1 ).

‚ąö i
I 27. When Aij is an absolute tensor prove that gAj is a relative tensor of weight +1.

I 28. When Aij is an absolute tensor prove that ‚ąö1 Ai is a relative tensor of weight ‚ąí1.
g j

I 29.
(a) Show eijk is a relative tensor of weight +1.
(b) Show ijk = ‚ąö1 eijk is an absolute tensor. Hint: See example 1.1-25.
g

I 30. The equation of a surface can be represented by an equation of the form ő¶(x1 , x2 , x3 ) = constant.
Show that a unit normal vector to the surface can be represented by the vector
‚ąāő¶
g ij ‚ąāx j
ni = ‚ąāő¶ ‚ąāő¶ 2 1 .
(g mn ‚ąāx m ‚ąāxn )

I 31. Assume that gij = őĽgij with őĽ a nonzero constant. Find and calculate gij in terms of g ij .

I 32. Determine if the following tensor equation is true. Justify your answer.

## Hint: See problem 21, Exercise 1.1.

I 33. Show that for Ci and C i associated tensors, and C i = ijk Aj Bk , then Ci = ijk Aj B k

I 34. Prove that ijk and ijk are associated tensors. Hint: Consider the determinant of gij .

## I 35. Show ijk Ai Bj Ck = ijk Ai B j C k .

I 36. Let Tji , i, j = 1, 2, 3 denote a second order mixed tensor. Show that the given quantities are scalar
invariants.
(i) I1 = Tii
1 i 2 
(ii) I2 = (Ti ) ‚ąí Tm
i
Tim
2
(iii) I3 = det|Tji |

I 37.
(a) Assume Aij and B ij , i, j = 1, 2, 3 are absolute contravariant tensors, and determine if the inner product
C ik = Aij B jk is an absolute tensor?
‚ąāxj ‚ąāxj
(b) Assume that the condition = őīnm is satisfied, and determine whether the inner product in
‚ąāxn ‚ąāxm
part (a) is a tensor?
(c) Consider only transformations which are a rotation and translation of axes y i = `ij yj + bi , where `ij are
‚ąāy j ‚ąāyj
direction cosines for the rotation of axes. Show that = őīnm
‚ąāyn ‚ąāym
106

I 38. For Aijk a Cartesian tensor, determine if a contraction on the indices i and j is allowed. That
is, determine if the quantity Ak = Aiik , (summation on i) is a tensor. Hint: See part(c) of the previous
problem.
I 39. Prove the e-őī identity eijk eimn = őīm őīn ‚ąí őīnj őīm
j k k
.

I 40. Consider the vector Vk , k = 1, 2, 3 and define the matrix (aij ) having the elements aij = eijk Vk ,
where eijk is the e‚ąípermutation symbol.
(a) Solve for Vi in terms of amn by multiplying both sides of the given equation by eijl and note the e ‚ąí őī
identity allows us to simplify the result.
(b) Sum the given expression on k and then assign values to the free indices (i,j=1,2,3) and compare your
results with part (a).
(c) Is aij symmetric, skew-symmetric, or neither?

I 41. It can be shown that the continuity equation of fluid dynamics can be expressed in the tensor form

1 ‚ąā ‚ąö ‚ąā%
‚ąö r
( g%V r ) + = 0,
g ‚ąāx ‚ąāt

where % is the density of the fluid, t is time, V r , with r = 1, 2, 3 are the velocity components and g = |gij |
is the determinant of the metric tensor. Employing the summation convention and replacing the tensor
components of velocity by their physical components, express the continuity equation in

## (a) Cartesian coordinates (x, y, z) with physical components Vx , Vy , Vz .

(b) Cylindrical coordinates (r, őł, z) with physical components Vr , Vőł , Vz .
(c) Spherical coordinates (ŌĀ, őł, ŌÜ) with physical components VŌĀ , Vőł , VŌÜ .

I 42. Let x1 , x2 , x3 denote a set of skewed coordinates with respect to the Cartesian coordinates y 1 , y 2 , y 3 .
Assume that E~ 1, E ~ 3 are unit vectors in the directions of the x1 , x2 and x3 axes respectively. If the unit
~ 2, E
vectors satisfy the relations
E ~1 = 1
~1 ¬∑ E ~1 ¬∑ E
E ~ 2 = cos őł12

E ~2 = 1
~2 ¬∑ E ~1 ¬∑ E
E ~ 3 = cos őł13

E ~3 = 1
~3 ¬∑ E ~2 ¬∑ E
E ~ 3 = cos őł23 ,

## I 43. Let Aij , i, j = 1, 2, 3, 4 denote the skew-symmetric second rank tensor

Ô£ę Ô£∂
0 a b c
Ô£¨ ‚ąía 0 d eÔ£∑
Aij = Ô£≠ Ô£ł,
‚ąíb ‚ąíd 0 f
‚ąíc ‚ąíe ‚ąíf 0

where a, b, c, d, e, f are complex constants. Calculate the components of the dual tensor

1 ijkl
V ij = e Akl .
2
107

I 44. In Cartesian coordinates the vorticity tensor at a point in a fluid medium is defined
 
1 ‚ąāVj ‚ąāVi
ŌČij = ‚ąí
2 ‚ąāxi ‚ąāxj
where Vi are the velocity components of the fluid at the point. The vorticity vector at a point in a fluid
1
medium in Cartesian coordinates is defined by ŌČ i = eijk ŌČjk . Show that these tensors are dual tensors.
2
I 45. Write out the relation between each of the components of the dual tensors
1
TŐā ij = eijkl Tkl i, j, k, l = 1, 2, 3, 4
2
and show that if ijkl is an even permutation of 1234, then TŐā ij = Tkl .

I 46. Consider the general affine transformation xŐĄi = aij xj where (x1 , x2 , x3 ) = (x, y, z) with inverse
transformation xi = bij xŐĄj . Determine (a) the image of the plane Ax + By + Cz + D = 0 under this
transformation and (b) the image of a second degree conic section

Ax2 + 2Bxy + Cy 2 + Dx + Ey + F = 0.

I 47. Using a multilinear form of degree M, derive the transformation law for a contravariant vector of
degree M.
‚ąāg ‚ąāgij
I 48. Let g denote the determinant of gij and show that = gg ij k .
‚ąāxk ‚ąāx
I 49. We have shown that for a rotation of xyz axes with respect to a set of fixed xŐĄyŐĄ zŐĄ axes, the derivative
~ with respect to an observer on the barred axes is given by
of a vector A

dA~ ~
= dA + ŌČ~ √ó A.~
dt f dt r
Introduce the operators
~
dA
~=
Df A = derivative in fixed system
dt f

~
~ = dA = derivative in rotating system
Dr A
dt r
(a) Show that Df A ~ = (Dr + ~ ~
ŌČ √ó)A.
(b) Consider the ~ is the position vector ~r. Show that Df ~r = (Dr + ~ŌČ√ó)~r
special case that the vector
A

produces V ~ = V~ + ~ ŌČ √ó ~r where V~ represents the velocity of a particle relative to the fixed system

f r f

and V~ represents the velocity of a particle with respect to the rotating system of coordinates.
r

(c) Show that ~a = ~a + ~ ŌČ √ó (~ŌČ √ó ~r) where ~a represents the acceleration of a particle relative to the
f r f

fixed system and ~a represents the acceleration of a particle with respect to the rotating system.
r
(d) Show in the special case ~
ŌČ is a constant that

~a = 2~ŌČ √ó V
~ +ŌČ
~ √ó (~ŌČ √ó ~r)
f
~ is the velocity of the particle relative to the rotating system. The term 2~ŌČ √ó V
where V ~ is referred to
as the Coriolis acceleration and the term ~ŌČ √ó (~ŌČ √ó ~r) is referred to as the centripetal acceleration.
108

## ¬ß1.4 DERIVATIVE OF A TENSOR

In this section we develop some additional operations associated with tensors. Historically, one of the
basic problems of the tensor calculus was to try and find a tensor quantity which is a function of the metric
‚ąāgij ‚ąā 2 gij
tensor gij and some of its derivatives , , . . . . A solution of this problem is the fourth order
‚ąāxm ‚ąāxm ‚ąāxn
Riemann Christoffel tensor Rijkl to be developed shortly. In order to understand how this tensor was arrived
at, we must first develop some preliminary relationships involving Christoffel symbols.

Christoffel Symbols

Let us consider the metric tensor gij which we know satisfies the transformation law

‚ąāxa ‚ąāxb
g őĪő≤ = gab .
‚ąāxőĪ ‚ąāxő≤

## ‚ąāgőĪő≤ ‚ąāgab ‚ąāxc ‚ąāxa ‚ąāxb ‚ąā 2 xa ‚ąāxb ‚ąāxa ‚ąā 2 xb

(őĪ, ő≤, ő≥) = ő≥ = ő≥ őĪ + g ab őĪ ő≥ + g ab
‚ąāx ‚ąāxc ‚ąāx ‚ąāx ‚ąāxő≤ ‚ąāx ‚ąāx ‚ąāxő≤ ‚ąāxőĪ ‚ąāxő≤ ‚ąāxő≥

1
and form the combination of terms [(őĪ, ő≤, ő≥) + (ő≤, ő≥, őĪ) ‚ąí (ő≥, őĪ, ő≤)] to obtain the result
2
   
1 ‚ąāgőĪő≤ ‚ąāgő≤ő≥ ‚ąāg ő≥őĪ 1 ‚ąāgab ‚ąāgbc ‚ąāgca ‚ąāxa ‚ąāxb ‚ąāxc ‚ąāxb ‚ąā 2 xa
+ ‚ąí = + ‚ąí + g ab . (1.4.1)
2 ‚ąāxő≥ ‚ąāxőĪ ‚ąāxő≤ 2 ‚ąāxc ‚ąāxa őĪ
‚ąāxb ‚ąāx ‚ąāxő≤ ‚ąāx ő≥
‚ąāxő≤ ‚ąāxőĪ ‚ąāxő≥

In this equation the combination of derivatives occurring inside the brackets is called a Christoffel symbol
of the first kind and is defined by the notation
 
1 ‚ąāgab ‚ąāgbc ‚ąāgac
[ac, b] = [ca, b] = + ‚ąí . (1.4.2)
2 ‚ąāxc ‚ąāxa ‚ąāxb

The equation (1.4.1) defines the transformation for a Christoffel symbol of the first kind and can be expressed
as
‚ąāxa ‚ąāxb ‚ąāxc ‚ąā 2 xa ‚ąāxb
[őĪ ő≥, ő≤] = [ac, b] + g ab . (1.4.3)
‚ąāxőĪ ‚ąāxő≤ ‚ąāxő≥ ‚ąāxőĪ ‚ąāxő≥ ‚ąāxő≤
Observe that the Christoffel symbol of the first kind [ac, b] does not transform like a tensor. However, it is
symmetric in the indices a and c.
At this time it is convenient to use the equation (1.4.3) to develop an expression for the second derivative
term which occurs in that equation as this second derivative term arises in some of our future considerations.
‚ąāxő≤ de
To solve for this second derivative we can multiply equation (1.4.3) by g and simplify the result to the
‚ąāxd
form
‚ąā 2 xe ‚ąāxa ‚ąāxc ‚ąāxő≤ de
= ‚ąíg de
[ac, d] + [őĪ ő≥, ő≤] g . (1.4.4)
‚ąāxőĪ ‚ąāxő≥ ‚ąāxőĪ ‚ąāxő≥ ‚ąāxd
‚ąāxd ‚ąāxe
The transformation g de = g őĽ¬Ķ allows us to express the equation (1.4.4) in the form
‚ąāxőĽ ‚ąāx¬Ķ

## ‚ąā 2 xe ‚ąāxa ‚ąāxc ‚ąāxe

ő≥ = ‚ąíg [ac, d]
de ő≤¬Ķ
őĪ őĪ ő≥ +g [őĪ ő≥, ő≤] ¬Ķ . (1.4.5)
‚ąāx ‚ąāx ‚ąāx ‚ąāx ‚ąāx
109

## Define the Christoffel symbol of the second kind as

     
i i 1 ‚ąāgkőĪ ‚ąāgjőĪ ‚ąāgjk
= = g [jk, őĪ] = g iőĪ
iőĪ
+ ‚ąí . (1.4.6)
jk kj 2 ‚ąāxj ‚ąāxk ‚ąāxőĪ

This Christoffel symbol of the second kind is symmetric in the indices j and k and from equation (1.4.5) we
see that it satisfies the transformation law
   
¬Ķ ‚ąāxe e ‚ąāxa ‚ąāxc ‚ąā 2 xe
= ő≥ + . (1.4.7)
őĪő≥ ‚ąāx¬Ķ ac őĪ
‚ąāx ‚ąāx ‚ąāxőĪ ‚ąāxő≥

Observe that the Christoffel symbol of the second kind does not transform like a tensor quantity. We can use
the relation defined by equation (1.4.7) to express the second derivative of the transformation equations in
terms of the Christoffel symbols of the second kind. At times it will be convenient to represent the Christoffel
symbols with a subscript to indicate the metric from which they are calculated. Thus, an alternative notation
   
i i
for j k is the notation j k .
g
EXAMPLE 1.4-1. (Christoffel symbols) Solve for the Christoffel symbol of the first kind in terms of
the Christoffel symbol of the second kind.
Solution: By the definition from equation (1.4.6) we have
 
i
= g iőĪ [jk, őĪ].
jk

## We multiply this equation by gő≤i and find

 
i
gő≤i = őīő≤őĪ [jk, őĪ] = [jk, ő≤]
jk

and so      
i 1 N
[jk, őĪ] = gőĪi = gőĪ1 + ¬∑ ¬∑ ¬∑ + gőĪN .
jk jk jk

## EXAMPLE 1.4-2. (Christoffel symbols of first kind)

Derive formulas to find the Christoffel symbols of the first kind in a generalized orthogonal coordinate
system with metric coefficients

## where i is not summed.

Solution: In an orthogonal coordinate system where gij = 0 for i 6= j we observe that
 
1 ‚ąāgac ‚ąāgbc ‚ąāgab
[ab, c] = b
+ a
‚ąí . (1.4.8)
2 ‚ąāx ‚ąāx ‚ąāxc

110

## CASE I Let a = b = c = i, then the equation (1.4.8) simplifies to

1 ‚ąāgii
[ab, c] = [ii, i] = (no summation on i). (1.4.9)
2 ‚ąāxi
From this equation we can calculate any of the Christoffel symbols

## CASE II Let a = b = i 6= c, then the equation (1.4.8) simplifies to the form

1 ‚ąāgii
[ab, c] = [ii, c] = ‚ąí (no summation on i and i 6= c). (1.4.10)
2 ‚ąāxc
since, gic = 0 for i 6= c. This equation shows how we may calculate any of the six Christoffel symbols

[11, 2], [11, 3], [22, 1], [22, 3], [33, 1], [33, 2].

CASE III Let a = c = i 6= b, and noting that gib = 0 for i 6= b, it can be verified that the equation (1.4.8)
simplifies to the form
1 ‚ąāgii
[ab, c] = [ib, i] = [bi, i] = (no summation on i and i 6= b). (1.4.11)
2 ‚ąāxb
From this equation we can calculate any of the twelve Christoffel symbols
[12, 1] = [21, 1] [31, 3] = [13, 3]
[32, 3] = [23, 3] [21, 2] = [12, 2]
[13, 1] = [31, 1] [23, 2] = [32, 2]
CASE IV Let a 6= b 6= c and show that the equation (1.4.8) reduces to

[ab, c] = 0, (a 6= b 6= c.)

## [12, 3] = [21, 3] = [23, 1] = [32, 1] = [31, 2] = [13, 2] = 0.

From the Cases I,II,III,IV all twenty seven Christoffel symbols of the first kind can be determined. In
practice, only the nonzero Christoffel symbols are listed.

EXAMPLE 1.4-3. (Christoffel symbols of the first kind)Find the nonzero Christoffel symbols of the
first kind in cylindrical coordinates.
Solution: From the results of example 1.4-2 we find that for x1 = r, x2 = őł, x3 = z and

## g11 = 1, g22 = (x1 )2 = r2 , g33 = 1

the nonzero Christoffel symbols of the first kind in cylindrical coordinates are:
1 ‚ąāg22
[22, 1] = ‚ąí = ‚ąíx1 = ‚ąír
2 ‚ąāx1
1 ‚ąāg22
[21, 2] = [12, 2] = = x1 = r.
2 ‚ąāx1
111

## EXAMPLE 1.4-4. (Christoffel symbols of the second kind)

Find formulas for the calculation of the Christoffel symbols of the second kind in a generalized orthogonal
coordinate system with metric coefficients

## where i is not summed.

Solution: By definition we have
 
i
= g im [jk, m] = g i1 [jk, 1] + g i2 [jk, 2] + g i3 [jk, 3] (1.4.12)
jk

## By hypothesis the coordinate system is orthogonal and so

1
g ij = 0 for i 6= j and g ii = i not summed.
gii

The only nonzero term in the equation (1.4.12) occurs when m = i and consequently
 
i [jk, i]
= g ii [jk, i] = no summation on i. (1.4.13)
jk gii

We can now consider the four cases considered in the example 1.4-2.
CASE I Let j = k = i and show
 
i [ii, i] 1 ‚ąāgii 1 ‚ąā
= = = ln gii no summation on i. (1.4.14)
ii gii 2gii ‚ąāxi 2 ‚ąāxi

## CASE II Let k = j 6= i and show

 
i [jj, i] ‚ąí1 ‚ąāgjj
= = no summation on i or j. (1.4.15)
jj gii 2gii ‚ąāxi

## CASE III Let i = j 6= k and verify that

   
j j [jk, j] 1 ‚ąāgjj 1 ‚ąā
= = = = ln gjj no summation on i or j. (1.4.16)
jk kj gjj 2gjj ‚ąāxk 2 ‚ąāxk

## CASE IV For the case i 6= j 6= k we find

 
i [jk, i]
= = 0, i 6= j 6= k no summation on i.
jk gii

## The above cases represent all 27 terms.

112

EXAMPLE 1.4-5. (Notation) In the case of cylindrical coordinates we can use the above relations and
find the nonzero Christoffel symbols of the second kind:
 
1 1 ‚ąāg22
=‚ąí = ‚ąíx1 = ‚ąír
22 2g11 ‚ąāx1
   
2 2 1 ‚ąāg22 1 1
= = = 1 =
12 21 2g22 ‚ąāx1 x r

Note 1: The notation for the above Christoffel symbols are based upon the assumption that x1 = r, x2 = őł
and x3 = z. However, in tensor calculus the choice of the coordinates can be arbitrary. We could just as well
have defined x1 = z, x2 = r and x3 = őł. In this latter case, the numbering system of the Christoffel symbols
changes. To avoid confusion, an alternate method of writing the Christoffel symbols is to use coordinates in
place of the integers 1,2 and 3. For example, in cylindrical coordinates we can write
     
őł őł 1 r
= = and = ‚ąír.
rőł őłr r őłőł

     
2 2 1 1
= = and = ‚ąír.
12 21 r 22

## In contrast, if we define x1 = z, x2 = r, x3 = őł, then the nonzero Christoffel symbols are written

     
3 3 1 2
= = and = ‚ąír.
23 32 r 33

Note 2: Some textbooks use the notation őďa,bc for Christoffel symbols of the first kind and őďdbc = g da őďa,bc for
Christoffel symbols of the second kind. This notation is not used in these notes since the notation suggests
that the Christoffel symbols are third order tensors, which is not true. The Christoffel symbols of the first
and second kind are not tensors. This fact is clearly illustrated by the transformation equations (1.4.3) and
(1.4.7).

Covariant Differentiation

Let Ai denote a covariant tensor of rank 1 which obeys the transformation law

‚ąāxi
AőĪ = Ai . (1.4.17)
‚ąāxőĪ
Differentiate this relation with respect to xő≤ and show

## ‚ąāAőĪ ‚ąā 2 xi ‚ąāAi ‚ąāxj ‚ąāxi

= Ai őĪ ő≤ + . (1.4.18)
‚ąāxő≤
‚ąāx ‚ąāx ‚ąāxj ‚ąāxő≤ ‚ąāxőĪ
Now use the relation from equation (1.4.7) to eliminate the second derivative term from (1.4.18) and express
it in the form "    #
‚ąāAőĪ ŌÉ ‚ąāxi i ‚ąāxj ‚ąāxk ‚ąāAi ‚ąāxj ‚ąāxi
= Ai ‚ąí + . (1.4.19)
‚ąāxő≤ őĪő≤ ‚ąāxŌÉ jk ‚ąāxőĪ ‚ąāxő≤ ‚ąāxj ‚ąāxő≤ ‚ąāxőĪ
113

Employing the equation (1.4.17), with őĪ replaced by ŌÉ, the equation (1.4.19) is expressible in the form
   
‚ąāAőĪ ŌÉ ‚ąāAj ‚ąāxj ‚ąāxk i ‚ąāxj ‚ąāxk
‚ąí A ŌÉ = őĪ ‚ąí Ai (1.4.20)
‚ąāxő≤ őĪő≤ k
‚ąāx ‚ąāx ‚ąāx ő≤ j k ‚ąāxőĪ ‚ąāxő≤

or alternatively "  #   
‚ąāAőĪ ŌÉ ‚ąāAj i ‚ąāxj ‚ąāxk
‚ąí AŌÉ = ‚ąí Ai . (1.4.21)
‚ąāxő≤ őĪő≤ ‚ąāxk jk ‚ąāxőĪ ‚ąāxő≤

## Define the quantity  

‚ąāAj i
Aj,k = ‚ąí Ai (1.4.22)
‚ąāxk jk
as the covariant derivative of Aj with respect to xk . The equation (1.4.21) demonstrates that the covariant
derivative of a covariant tensor produces a second order tensor which satisfies the transformation law

‚ąāxj ‚ąāxk
AőĪ,ő≤ = Aj,k . (1.4.23)
‚ąāxőĪ ‚ąāxő≤

## Aj,k = Aj;k = Aj/k = ‚ąák Aj = Aj |k . (1.4.24)

In the special case where gij are constants the Christoffel symbols of the second kind are zero, and conse-
‚ąāAj
quently the covariant derivative reduces to Aj,k = . That is, under the special circumstances where the
‚ąāxk
Christoffel symbols of the second kind are zero, the covariant derivative reduces to an ordinary derivative.

## Covariant Derivative of Contravariant Tensor

i ‚ąāxi
A contravariant tensor Ai obeys the transformation law A = AőĪ which can be expressed in the
‚ąāxőĪ
form
őĪ ‚ąāxi
Ai = A (1.4.24)
‚ąāxőĪ
by interchanging the barred and unbarred quantities. We write the transformation law in the form of equation
(1.4.24) in order to make use of the second derivative relation from the previously derived equation (1.4.7).
Differentiate equation (1.4.24) with respect to xj to obtain the relation

2 i őĪ
‚ąāAi őĪ ‚ąā x ‚ąāxő≤ ‚ąāA ‚ąāxő≤ ‚ąāxi
= A + . (1.4.25)
‚ąāxj ‚ąāxőĪ ‚ąāxő≤ ‚ąāxj ‚ąāxő≤ ‚ąāxj ‚ąāxőĪ

Changing the indices in equation (1.4.25) and substituting for the second derivative term, using the relation
from equation (1.4.7), produces the equation
"    # őĪ
‚ąāAi őĪ ŌÉ ‚ąāxi i ‚ąāxm ‚ąāxk ‚ąāxő≤ ‚ąāA ‚ąāxő≤ ‚ąāxi
=A ‚ąí + . (1.4.26)
‚ąāxj őĪő≤ ‚ąāxŌÉ mk ‚ąāxőĪ ‚ąāxő≤ ‚ąāxj ‚ąāxő≤ ‚ąāxj ‚ąāxőĪ

Applying the relation found in equation (1.4.24), with i replaced by m, together with the relation

‚ąāxő≤ ‚ąāxk
= őījk ,
‚ąāxj ‚ąāxő≤
114

## we simplify equation (1.4.26) to the form

    " ŌÉ   # ő≤
‚ąāAi i m ‚ąāA ŌÉ őĪ ‚ąāx ‚ąāx
i
+ A = + A ŌÉ. (1.4.27)
‚ąāxj mj ‚ąāxő≤ őĪő≤ j
‚ąāx ‚ąāx

## Define the quantity  

i ‚ąāAi i
A ,j = + Am (1.4.28)
‚ąāxj mj
as the covariant derivative of the contravariant tensor Ai . The equation (1.4.27) demonstrates that a covariant
derivative of a contravariant tensor will transform like a mixed second order tensor and
ő≤
ŌÉ ‚ąāx ‚ąāxi
Ai ,j = A ,ő≤ . (1.4.29)
‚ąāx ‚ąāxŌÉ
j

‚ąāAi
Again it should be observed that for the condition where gij are constants we have Ai ,j = and the
‚ąāxj
covariant derivative of a contravariant tensor reduces to an ordinary derivative in this special case.
In a similar manner the covariant derivative of second rank tensors can be derived. We find these
derivatives have the forms:    
‚ąāAij ŌÉ ŌÉ
Aij,k = k
‚ąí AŌÉj ‚ąí AiŌÉ
‚ąāx ik jk
   
‚ąāAij i ŌÉ
Aij ,k = + AŌÉj ‚ąí AiŌÉ (1.4.30)
‚ąāxk ŌÉk jk
   
‚ąāAij i j
Aij ,k = + A ŌÉj
+ A iŌÉ
.
‚ąāxk ŌÉk ŌÉk
In general, the covariant derivative of a mixed tensor

Aij...k
lm...p

## of rank n has the form

     
‚ąāAij...k
lm...p i j k
Aij...k
lm...p,q = + AŌÉj...k
lm...p + AiŌÉ...k
lm...p + ¬∑ ¬∑ ¬∑ + A ij...ŌÉ
lm...p
‚ąāxq ŌÉq ŌÉq ŌÉq
      (1.4.31)
ŌÉ ij...k ŌÉ ij...k ŌÉ
‚ąí Aij...k
ŌÉm...p ‚ąí AlŌÉ...p ‚ąí ¬∑ ¬∑ ¬∑ ‚ąí Alm...ŌÉ
lq mq pq

and this derivative is a tensor of rank n + 1. Note the pattern of the + signs for the contravariant indices
and the ‚ąí signs for the covariant indices.
Observe that the covariant derivative of an nth order tensor produces an n+ 1st order tensor, the indices
of these higher order tensors can also be raised and lowered by multiplication by the metric or conjugate
metric tensor. For example we can write

115

## Rules for Covariant Differentiation

The rules for covariant differentiation are the same as for ordinary differentiation. That is:
(i) The covariant derivative of a sum is the sum of the covariant derivatives.
(ii) The covariant derivative of a product of tensors is the first times the covariant derivative of the second
plus the second times the covariant derivative of the first.
(iii) Higher derivatives are defined as derivatives of derivatives. Be careful in calculating higher order deriva-
tives as in general
Ai,jk 6= Ai,kj .

EXAMPLE 1.4-6. (Covariant differentiation) Calculate the second covariant derivative Ai,jk .
Solution: The covariant derivative of Ai is
 
‚ąāAi ŌÉ
Ai,j = ‚ąí AŌÉ .
‚ąāxj ij

By definition, the second covariant derivative is the covariant derivative of a covariant derivative and hence
      
‚ąā ‚ąāAi ŌÉ m m
Ai,jk = (Ai,j ) ,k = ‚ąí AŌÉ ‚ąí Am,j ‚ąí Ai,m .
‚ąāxk ‚ąāxj ij ik jk

## Simplifying this expression one obtains

   
‚ąā 2 Ai ‚ąāAŌÉ ŌÉ ‚ąā ŌÉ
Ai,jk = ‚ąí ‚ąí AŌÉ
‚ąāxj ‚ąāxk ‚ąāxk i j ‚ąāxk i j
         
‚ąāAm ŌÉ m ‚ąāAi ŌÉ m
‚ąí ‚ąí AŌÉ ‚ąí ‚ąí AŌÉ .
‚ąāxj mj ik ‚ąāxm im jk

Rearranging terms, the second covariant derivative can be expressed in the form
     
‚ąā 2 Ai ‚ąāAŌÉ ŌÉ ‚ąāAm m ‚ąāAi m
Ai,jk = ‚ąí ‚ąí ‚ąí
‚ąāxj ‚ąāxk ‚ąāxk i j ‚ąāxj i k ‚ąāxm j k
         (1.4.32)
‚ąā ŌÉ ŌÉ m m ŌÉ
‚ąí AŌÉ ‚ąí ‚ąí .
‚ąāxk i j im jk ik mj
116

## Ai,jk ‚ąí Ai,kj = AŌÉ Rijk

ŌÉ

where          
‚ąā ŌÉ ‚ąā ŌÉ m ŌÉ m ŌÉ
ŌÉ
Rijk = ‚ąí + ‚ąí (1.4.33)
‚ąāxj ik ‚ąāxk ij ik mj ij mk
is called the Riemann Christoffel tensor. The covariant form of this tensor is

i
Rhjkl = gih Rjkl . (1.4.34)

It is an easy exercise to show that this covariant form can be expressed in either of the forms
   
‚ąā ‚ąā s s
Rinjk = [nk, i] ‚ąí k [nj, i] + [ik, s] ‚ąí [ij, s]
‚ąāxj ‚ąāx nj nk
 2 
1 ‚ąā gil ‚ąā 2 gjl ‚ąā 2 gik ‚ąā 2 gjk
or Rijkl = ‚ąí i k ‚ąí j l + i l + g őĪő≤ ([jk, ő≤][il, őĪ] ‚ąí [jl, ő≤][ik, őĪ]) .
2 ‚ąāxj ‚ąāxk ‚ąāx ‚ąāx ‚ąāx ‚ąāx ‚ąāx ‚ąāx

From these forms we find that the Riemann Christoffel tensor is skew symmetric in the first two indices
and the last two indices as well as being symmetric in the interchange of the first pair and last pairs of
indices and consequently

## Rjikl = ‚ąíRijkl Rijlk = ‚ąíRijkl Rklij = Rijkl .

In a two dimensional space there are only four components of the Riemann Christoffel tensor to consider.
These four components are either +R1212 or ‚ąíR1212 since they are all related by

## R1212 = ‚ąíR2112 = R2121 = ‚ąíR1221 .

In a Cartesian coordinate system Rhijk = 0. The Riemann Christoffel tensor is important because it occurs
in differential geometry and relativity which are two areas of interest to be considered later. Additional
properties of this tensor are found in the exercises of section 1.5.
117

## In a system of generalized coordinates (x1 , x2 , x3 ) we can construct the basis vectors (E

~ 1, E
~ 2, E
~ 3 ). These
basis vectors change with position. That is, each basis vector is a function of the coordinates at which they
are evaluated. We can emphasize this dependence by writing

E ~ i (x1 , x2 , x3 ) = ‚ąā~r
~i = E i = 1, 2, 3.
‚ąāxi
Associated with these basis vectors we have the reciprocal basis vectors

E ~ i (x1 , x2 , x3 ),
~i = E i = 1, 2, 3

## ~ can be represented in terms of contravariant components as

which are also functions of position. A vector A

~ = A1 E
A ~ 1 + A2 E
~ 2 + A3 E
~ 3 = Aj E
~j (1.4.35)

## or it can be represented in terms of covariant components as

~ 1 + A2 E
~ = A1 E
A ~ 2 + A3 E
~ 3 = Aj E
~ j. (1.4.36)

~ is represented as
A change in the vector A

‚ąāA~
~=
dA dxk
‚ąāxk
where from equation (1.4.35) we find

‚ąāA~ ~ ‚ąāAj ~
j ‚ąā Ej
= A + Ej (1.4.37)
‚ąāxk ‚ąāxk ‚ąāxk
or alternatively from equation (1.4.36) we may write

‚ąāA~ ~j
‚ąāE ‚ąāAj ~ j
k
= Aj k
+ E . (1.4.38)
‚ąāx ‚ąāx ‚ąāxk
We define the covariant derivative of the covariant components as

‚ąāA~ ~j
Ai,k = ~ i = ‚ąāAi + Aj ‚ąā E ¬∑ E
¬∑E ~ i. (1.4.39)
‚ąāxk ‚ąāxk ‚ąāxk
The covariant derivative of the contravariant components are defined by the relation

‚ąāA~ i ~
Ai ,k = ~ i = ‚ąāA + Aj ‚ąā Ej ¬∑ E
¬∑E ~ i. (1.4.40)
‚ąāxk ‚ąāxk ‚ąāxk
Introduce the notation
~j   ~j  
‚ąāE m ~ ‚ąāE j
= Em and =‚ąí E~ m. (1.4.41)
‚ąāxk jk ‚ąāxk mk

We then have      
~
~ i ¬∑ ‚ąā Ej =
E
m ~ ~ i = m őīi =
Em ¬∑ E
i
(1.4.42)
‚ąāxk jk jk m jk
118

and      
~j
~ i ¬∑ ‚ąāE = ‚ąí j E
E ~ i = ‚ąí j őīim = ‚ąí j .
~m ¬∑ E (1.4.43)
‚ąāxk mk mk ik
Then equations (1.4.39) and (1.4.40) become

‚ąāAi j
Ai,k = ‚ąí Aj
‚ąāxk ik
 
‚ąāAi i
Ai ,k = k
+ Aj ,
‚ąāx jk

which is consistent with our earlier definitions from equations (1.4.22) and (1.4.28). Here the first term of
the covariant derivative represents the rate of change of the tensor field as we move along a coordinate curve.
The second term in the covariant derivative represents the change in the local basis vectors as we move
along the coordinate curves. This is the physical interpretation associated with the Christoffel symbols of
the second kind.
We make the observation that the derivatives of the basis vectors in equations (1.4.39) and (1.4.40) are
related since
~ j = őīj
~i ¬∑ E
E i

and consequently
‚ąā ~ ~j ~j ~
~ i ¬∑ ‚ąā E + ‚ąā Ei ¬∑ E
(Ei ¬∑ E ) = E ~j = 0
‚ąāx k ‚ąāx k ‚ąāxk
~j ~
or ~ i ¬∑ ‚ąā E = ‚ąíE
E ~ j ¬∑ ‚ąā Ei
‚ąāx k ‚ąāxk
Hence we can express equation (1.4.39) in the form

‚ąāAi ~
Ai,k = ~ j ¬∑ ‚ąā Ei .
‚ąí Aj E (1.4.44)
‚ąāxk ‚ąāxk
We write the first equation in (1.4.41) in the form

~j  
‚ąāE m ~ i = [jk, i]E
~i
= gim E (1.4.45)
‚ąāxk jk

and consequently
~j      
‚ąāE i ~ ~m i m
k
~
¬∑E =m
Ei ¬∑ E = m
őīi =
‚ąāx jk jk jk
(1.4.46)
~
‚ąā Ej ~
and ~ ¬∑E
¬∑ Em =[jk, i]E i ~ m = [jk, i]őī = [jk, m].
i
m
‚ąāxk
These results also reduce the equations (1.4.40) and (1.4.44) to our previous forms for the covariant deriva-
tives.
~i
‚ąāE ~j
‚ąāE
The equations (1.4.41) are representations of the vectors ‚ąāxk
and ‚ąāxk
in terms of the basis vectors and
reciprocal basis vectors of the space. The covariant derivative relations then take into account how these
vectors change with position and affect changes in the tensor field.
The Christoffel symbols in equations (1.4.46) are symmetric in the indices j and k since

~j     ~k
‚ąāE ‚ąā ‚ąā~r ‚ąā ‚ąā~r ‚ąāE
= = = . (1.4.47)
‚ąāxk ‚ąāxk ‚ąāxj ‚ąāxj ‚ąāxk ‚ąāxj
119

## The equations (1.4.46) and (1.4.47) enable us to write

" #
‚ąā ~j
E 1 ‚ąā ~j
E ‚ąā ~k
E
[jk, m] =E~m ¬∑ = E~m ¬∑ +E ~m ¬∑
‚ąāxk 2 ‚ąāxk ‚ąāxj
" #
1 ‚ąā ~  ‚ąā ~  ‚ąāE~m ‚ąāE~m
= ~ ~
Em ¬∑ Ej + j Em ¬∑ Ek ‚ąí Ej ¬∑ ~ ~
‚ąí Ek ¬∑
2 ‚ąāxk ‚ąāx ‚ąāxk ‚ąāxj
" #
1 ‚ąā ~  ‚ąā   ‚ąā ~k
E ‚ąā ~j
E
= ~j +
Em ¬∑ E E ~k ‚ąí E
~m ¬∑ E ~j ¬∑ ‚ąíE ~k ¬∑
2 ‚ąāxk ‚ąāxj ‚ąāxm ‚ąāxm
 
1 ‚ąā ~ 
~j + ‚ąā E
 
~k ‚ąí ‚ąā

= Em ¬∑ E ~m ¬∑ E E ~k
~j ¬∑ E
2 ‚ąāxk ‚ąāxj ‚ąāxm
 
1 ‚ąāgmj ‚ąāgmk ‚ąāgjk
= k
+ j
‚ąí m = [kj, m]
2 ‚ąāx ‚ąāx ‚ąāx
which again agrees with our previous result.
~ is represented in the form A
For future reference we make the observation that if the vector A ~j,
~ = Aj E
involving contravariant components, then we may write
!
~ ‚ąāAj ~ ~
~ = ‚ąā A dxk =
dA Ej + A j ‚ąā Ej
dxk
‚ąāxk ‚ąāxk ‚ąāxk
 j   
‚ąāA ~ j i ~ (1.4.48)
= Ej + A Ei dxk
‚ąāxk jk
 j   
‚ąāA j m ~ j dxk = Aj dxk E
~j.
= + A E ,k
‚ąāxk mk
~ is represented in the form A
Similarly, if the vector A ~ j involving covariant components it is left as
~ = Aj E
an exercise to show that
dA ~j
~ = Aj,k dxk E (1.4.49)

Ricci‚Äôs Theorem

Ricci‚Äôs theorem states that the covariant derivative of the metric tensor vanishes and gik,l = 0.
Proof: We have
   
‚ąāgik m m
gik,l = ‚ąí gim ‚ąí gmk
‚ąāxl kl il
‚ąāgik
gik,l = ‚ąí [kl, i] ‚ąí [il, k]
‚ąāxl    
‚ąāgik 1 ‚ąāgik ‚ąāgil ‚ąāgkl 1 ‚ąāgik ‚ąāgkl ‚ąāgil
gik,l = ‚ąí + k ‚ąí ‚ąí + ‚ąí k = 0.
‚ąāxl 2 ‚ąāxl ‚ąāx ‚ąāxi 2 ‚ąāxl ‚ąāxi ‚ąāx
Because of Ricci‚Äôs theorem the components of the metric tensor can be regarded as constants during covariant
differentiation.
i
EXAMPLE 1.4-7. (Covariant differentiation) Show that őīj,k = 0.
Solution        
‚ąāőīji i ŌÉ i i
i
őīj,k = ŌÉ
+ őīj ‚ąí őīŌÉ
i
= ‚ąí = 0.
‚ąāxk ŌÉk jk jk jk
120

## EXAMPLE 1.4-8. (Covariant differentiation) Show that g ij,k = 0.

Solution: Since gij g jk = őīik we take the covariant derivative of this expression and find

## (gij g jk ),l = őīi,l

k
=0
gij g jk,l + gij,l g jk = 0.

But gij,l = 0 by Ricci‚Äôs theorem and hence gij g jk,l = 0. We multiply this expression by g im and obtain

## g im gij g jk,l = őījm g jk,l = g mk

,l = 0

which demonstrates that the covariant derivative of the conjugate metric tensor is also zero.

## EXAMPLE 1.4-9. (Covariant differentiation) Some additional examples of covariant differentiation

are:
(i) (gil Al ),k = gil Al ,k = Ai,k
(ii) (gim gjn Aij ) ,k = gim gjn Aij,k = Amn,k

## Intrinsic or Absolute Differentiation

The intrinsic or absolute derivative of a covariant vector Ai taken along a curve xi = xi (t), i = 1, . . . , N
is defined as the inner product of the covariant derivative with the tangent vector to the curve. The intrinsic
derivative is represented
őīAi dxj
= Ai,j
őīt dt
   j
őīAi ‚ąāAi őĪ dx
= ‚ąí AőĪ (1.4.50)
őīt ‚ąāxj ij dt
  j
őīAi dAi őĪ dx
= ‚ąí AőĪ .
őīt dt i j dt
Similarly, the absolute or intrinsic derivative of a contravariant tensor Ai is represented
 
őīAi dxj dAi i dxj
= Ai ,j = + Ak .
őīt dt dt jk dt

The intrinsic or absolute derivative is used to differentiate sums and products in the same manner as used
in ordinary differentiation. Also if the coordinate system is Cartesian the intrinsic derivative becomes an
ordinary derivative.
The intrinsic derivative of higher order tensors is similarly defined as an inner product of the covariant
derivative with the tangent vector to the given curve. For example,

őīAij dxp
klm
= Aij
klm,p
őīt dt

## is the intrinsic derivative of the fifth order mixed tensor Aij

klm .
121

EXAMPLE 1.4-10. (Generalized velocity and acceleration) Let t denote time and let xi = xi (t)
for i = 1, . . . , N , denote the position vector of a particle in the generalized coordinates (x1 , . . . , xN ). From
the transformation equations (1.2.30), the position vector of the same particle in the barred system of
coordinates, (x1 , x2 , . . . , xN ), is

## xi = xi (x1 (t), x2 (t), . . . , xN (t)) = xi (t), i = 1, . . . , N.

dxi
The generalized velocity is v i = dt , i = 1, . . . , N. The quantity v i transforms as a tensor since by definition

## dxi ‚ąāxi dxj ‚ąāxi j

vi = = = v . (1.4.51)
dt ‚ąāxj dt ‚ąāxj

Let us now find an expression for the generalized acceleration. Write equation (1.4.51) in the form

‚ąāxj
vj = v i (1.4.52)
‚ąāxi

## and differentiate with respect to time to obtain

dv j ‚ąā 2 xj dxk dv i ‚ąāxj
= vi i k + (1.4.53)
dt ‚ąāx ‚ąāx dt dt ‚ąāxi
dv i
The equation (1.4.53) demonstrates that dt does not transform like a tensor. From the equation (1.4.7)
previously derived, we change indices and write equation (1.4.53) in the form
"    #
dv j dxk ŌÉ ‚ąāxj j ‚ąāxa ‚ąāxc ‚ąāxj dv i
= vi ‚ąí + .
dt dt ik ‚ąāxŌÉ i
a c ‚ąāx ‚ąāx k
‚ąāxi dt

## Rearranging terms we find

   c k  
‚ąāv j dxk j ‚ąāxa i ‚ąāx dx ‚ąāxj ‚ąāv i dxk ŌÉ ‚ąāxj dxk
+ i
v k dt
= i k dt
+ vi ŌÉ or
‚ąāxk dt ac ‚ąāx ‚ąāx ‚ąāx ‚ąāx ik ‚ąāx dt
 j    k "   # k
ŌÉ j
‚ąāv j a dx ‚ąāv ŌÉ i dx ‚ąāx
+ v = + v
‚ąāxk ak dt ‚ąāxk ik dt ‚ąāxŌÉ
őīv j őīv ŌÉ ‚ąāxj
= .
őīt őīt ‚ąāxŌÉ
The above equation illustrates that the intrinsic derivative of the velocity is a tensor quantity. This derivative
is called the generalized acceleration and is denoted
    m n
i őīv i dxj dv i i d2 xi i dx dx
f = = v i,j = + m n
v v = 2
+ , i = 1, . . . , N (1.4.54)
őīt dt dt mn dt m n dt dt

## xi = xi (t), i = 1, . . . , N is the generalized position vector, then

i
dx
vi = , i = 1, . . . , N is the generalized velocity, and
dt
i
őīv dxj
fi = = v i,j , i = 1, . . . , N is the generalized acceleration.
őīt dt
122

## Parallel Vector Fields

Let y i = y i (t), i = 1, 2, 3 denote a space curve C in a Cartesian coordinate system and let Y i define a
constant vector in this system. Construct at each point of the curve C the vector Y i . This produces a field
of parallel vectors along the curve C. What happens to the curve and the field of parallel vectors when we
transform to an arbitrary coordinate system using the transformation equations

y i = y i (x1 , x2 , x3 ), i = 1, 2, 3

## with inverse transformation

xi = xi (y 1 , y 2 , y 3 ), i = 1, 2, 3?

The space curve C in the new coordinates is obtained directly from the transformation equations and can
be written
xi = xi (y 1 (t), y 2 (t), y 3 (t)) = xi (t), i = 1, 2, 3.

## The field of parallel vectors Y i become X i in the new coordinates where

‚ąāy i
Y i = Xj . (1.4.55)
‚ąāxj

Since the components of Y i are constants, their derivatives will be zero and consequently we obtain by
differentiating the equation (1.4.55), with respect to the parameter t, that the field of parallel vectors X i
must satisfy the differential equation

dX j ‚ąāy i ‚ąā 2 y i dxm dY i
j
+ Xj j m = = 0. (1.4.56)
dt ‚ąāx ‚ąāx ‚ąāx dt dt

Changing symbols in the equation (1.4.7) and setting the Christoffel symbol to zero in the Cartesian system
of coordinates, we represent equation (1.4.7) in the form
  i
‚ąā 2yi őĪ ‚ąāy
=
‚ąāxj ‚ąāxm j m ‚ąāxőĪ

## and consequently, the equation (1.4.56) can be reduced to the form

 
őīX j dX j j dxm
= + Xk = 0. (1.4.57)
őīt dt km dt

The equation (1.4.57) is the differential equation which must be satisfied by a parallel field of vectors X i
along an arbitrary curve xi (t).
123

EXERCISE 1.4

I 1. Find the nonzero Christoffel symbols of the first and second kind in cylindrical coordinates
(x , x , x3 ) = (r, őł, z), where x = r cos őł,
1 2
y = r sin őł, z = z.

I 2. Find the nonzero Christoffel symbols of the first and second kind in spherical coordinates
(x , x , x3 ) = (ŌĀ, őł, ŌÜ), where x = ŌĀ sin őł cos ŌÜ,
1 2
y = ŌĀ sin őł sin ŌÜ, z = ŌĀ cos őł.

I 3. Find the nonzero Christoffel symbols of the first and second kind in parabolic cylindrical coordinates
1
(x , x , x3 ) = (őĺ, ő∑, z), where x = őĺő∑, y = (őĺ 2 ‚ąí ő∑ 2 ), z = z.
1 2
2

I 4. Find the nonzero Christoffel symbols of the first and second kind in parabolic coordinates
1
(x , x , x3 ) = (őĺ, ő∑, ŌÜ), where x = őĺő∑ cos ŌÜ, y = őĺő∑ sin ŌÜ, z = (őĺ 2 ‚ąí ő∑ 2 ).
1 2
2

I 5. Find the nonzero Christoffel symbols of the first and second kind in elliptic cylindrical coordinates
(x , x , x3 ) = (őĺ, ő∑, z), where x = cosh őĺ cos ő∑,
1 2
y = sinh őĺ sin ő∑, z = z.

I 6. Find the nonzero Christoffel symbols of the first and second kind for the oblique cylindrical coordinates
(x , x2 , x3 ) = (r, ŌÜ, ő∑), where x = r cos ŌÜ,
1
y = r sin ŌÜ+ő∑ cos őĪ, z = ő∑ sin őĪ with 0 < őĪ < ŌÄ
2 and őĪ constant.
Hint: See figure 1.3-18 and exercise 1.3, problem 12.

‚ąāgik
I 7. Show [ij, k] + [kj, i] = .
‚ąāxj

I 8. 

r
(a) Let = g ri [st, i] and solve for the Christoffel symbol of the first kind in terms of the Christoffel
st
symbol of the secondkind. 
n
(b) Assume [st, i] = gni and solve for the Christoffel symbol of the second kind in terms of the
st
Christoffel symbol of the first kind.

I 9.
(a) Write down the transformation law satisfied by the fourth order tensor ijk,m .
(b) Show that ijk,m = 0 in all coordinate systems.
‚ąö
(c) Show that ( g),k = 0.

,m = 0.

## I 11. Calculate the second covariant derivative Ai ,kj .

‚ąāŌÜ
I 12. The gradient of a scalar field ŌÜ(x1 , x2 , x3 ) is the vector grad ŌÜ = E
. ~i
‚ąāxi
(a) Find the physical components associated with the covariant components ŌÜ ,i
dŌÜ Ai ŌÜ,i
(b) Show the directional derivative of ŌÜ in a direction Ai is = .
dA (gmn Am An )1/2
124

I 13.
‚ąö
(a) Show g is a relative scalar of weight +1.
(b) Use the results from
 problem
 9(c) and problem 44, Exercise 1.4, to show that
‚ąö
‚ąö ‚ąā g m ‚ąö
( g),k = k
‚ąí g = 0.
‚ąāx
  km
m ‚ąā ‚ąö 1 ‚ąāg
(c) Show that = k
ln( g) = .
km ‚ąāx 2g ‚ąāxk
 
m ‚ąā ‚ąö 1 ‚ąāg
I 14. Use the result from problem 9(b) to show = ln( g) = .
km ‚ąāxk 2g ‚ąāxk‚ąö
Hint: Expand the covariant derivative rst,p and then substitute rst = gerst . Simplify by inner
rst
e‚ąö
multiplication with g and note the Exercise 1.1, problem 26.

I 15. Calculate the covariant derivative Ai,m and then contract on m and i to show that

1 ‚ąā ‚ąö i
Ai,i = ‚ąö gA .
g ‚ąāxi

 
1 ‚ąā ‚ąö ij  i
I 16. Show ‚ąö gg + g pq = 0. Hint: See problem 14.
g ‚ąāxj pq

I 17. Prove that the covariant derivative of a sum equals the sum of the covariant derivatives.
Hint: Assume Ci = Ai + Bi and write out the covariant derivative for Ci,j .

I 18. Let Cji = Ai Bj and prove that the covariant derivative of a product equals the first term times the
covariant derivative of the second term plus the second term times the covariant derivative of the first term.

‚ąāxőĪ ‚ąāxő≤
I 19. Start with the transformation law AŐĄij = AőĪő≤ and take an ordinary derivative of both sides
‚ąā xŐĄi ‚ąā xŐĄj
k
with respect to xŐĄ and hence derive the relation for Aij,k given in (1.4.30).

‚ąāxi ‚ąāxj
I 20. Start with the transformation law Aij = AŐĄőĪ ő≤ and take an ordinary derivative of both sides
‚ąā xŐĄőĪ ‚ąā xŐĄő≤
with respect to xk and hence derive the relation for Aij,k given in (1.4.30).

## (a) Aijk (b) Aijk (c) Aijk (d) Aijk

I 22. Find the intrinsic derivative along the curve xi = xi (t), i = 1, . . . , N for

## (a) Aijk (b) Aijk (c) Aijk (d) Aijk

I 23.
(a) Assume A ~ i and show that dA
~ = Ai E ~ = Ai dxk E ~ i.
,k
(b) Assume A ~ and show that dA
~ = Ai E i ~ = Ai,k dx E
k ~ i.
125

I 24. (parallel vector field) Imagine a vector field Ai = Ai (x1 , x2 , x3 ) which is a function of position.
Assume that at all points along a curve xi = xi (t), i = 1, 2, 3 the vector field points in the same direction,
we would then have a parallel vector field or homogeneous vector field. Assume A ~ is a constant, then
~= ‚ąāA~
dA ‚ąāxk dxk = 0. Show that for a parallel vector field the condition Ai,k = 0 must be satisfied.
   
‚ąā[ik, n] ‚ąā ŌÉ ŌÉ
I 25. Show that = gnŌÉ j + ([nj, ŌÉ] + [ŌÉj, n]) .
‚ąāxj ‚ąāx ik ik

‚ąāAr ‚ąāAs
I 26. Show Ar,s ‚ąí As,r = ‚ąí .
‚ąāxs ‚ąāxr

I 27. In cylindrical coordinates you are given the contravariant vector components

A1 = r A2 = cos őł A3 = z sin őł

## Arr Arőł Arz

(b) Denote the physical components of Ai,j , i, j = 1, 2, 3, by Aőłr Aőłőł Aőłz
Azr Azőł Azz .
Find these physical components.

I 28. Find the covariant form of the contravariant tensor C i = ijk Ak,j .

1
I 29. In Cartesian coordinates let x denote the magnitude of the position vector xi . Show that (a) x ,j = xj
x
1 1 2 1 ‚ąíőīij 3xi xj
(b) x ,ij = őīij ‚ąí 3 xi xj (c) x ,ii = . (d) LetU = , x 6= 0, and show that U ,ij = 3
+ and
x x x x x x5
U ,ii = 0.

I 30. Consider a two dimensional space with element of arc length squared
 
2 1 2 2 2 g11 0
ds = g11 (du ) + g22 (du ) and metric gij =
0 g22

## where u1 , u2 are surface coordinates.

(a) Find formulas to calculate the Christoffel symbols of the first kind.
(b) Find formulas to calculate the Christoffel symbols of the second kind.

I 31. Find the metric tensor and Christoffel symbols of the first and second kind associated with the
two dimensional space describing points on a cylinder of radius a. Let u1 = őł and u2 = z denote surface
coordinates where
x = a cos őł = a cos u1
y = a sin őł = a sin u1
z = z = u2
126

I 32. Find the metric tensor and Christoffel symbols of the first and second kind associated with the
two dimensional space describing points on a sphere of radius a. Let u1 = őł and u2 = ŌÜ denote surface
coordinates where
x = a sin őł cos ŌÜ = a sin u1 cos u2
y = a sin őł sin ŌÜ = a sin u1 sin u2
z = a cos őł = a cos u1

I 33. Find the metric tensor and Christoffel symbols of the first and second kind associated with the
two dimensional space describing points on a torus having the parameters a and b and surface coordinates
u1 = őĺ, u2 = ő∑. illustrated in the figure 1.3-19. The points on the surface of the torus are given in terms
of the surface coordinates by the equations

x = (a + b cos őĺ) cos ő∑
y = (a + b cos őĺ) sin ő∑
z = b sin őĺ

I 34. Prove that eijk am bj ck ui,m + eijk ai bm ck uj,m + eijk ai bj cm uk,m = ur,r eijk ai bj ck . Hint: See Exercise 1.3,
problem 32 and Exercise 1.1, problem 21.

## I 35. Calculate the second covariant derivative Ai,jk .

 
1 ‚ąā ‚ąö ij  i
I 36. Show that ŌÉ ij,j = ‚ąö gŌÉ + ŌÉ mn
g ‚ąāxj mn

I 37. Find the contravariant, covariant and physical components of velocity and acceleration in (a) Cartesian
coordinates and (b) cylindrical coordinates.

I 38. Find the contravariant, covariant and physical components of velocity and acceleration in spherical
coordinates.

I 39. In spherical coordinates (ŌĀ, őł, ŌÜ) show that the acceleration components can be represented in terms
of the velocity components as

## vőł2 + vŌÜ2 vŌĀ vőł vŌÜ2 vŌĀ vŌÜ vőł vŌÜ

fŌĀ = vŐáŌĀ ‚ąí , főł = vŐáőł + ‚ąí , fŌÜ = vŐáŌÜ + +
ŌĀ ŌĀ ŌĀ tan őł ŌĀ ŌĀ tan őł
Hint: Calculate vŐáŌĀ , vŐáőł , vŐáŌÜ .

I 40. The divergence of a vector Ai is Ai,i . That is, perform a contraction on the covariant derivative
Ai,j to obtain Ai,i . Calculate the divergence in (a) Cartesian coordinates (b) cylindrical coordinates and (c)
spherical coordinates.

I 41. If S is a scalar invariant of weight one and Aijk is a third order relative tensor of weight W , show
that S ‚ąíW Aijk is an absolute tensor.
127

I 42. Let YŐĄ i ,i = 1, 2, 3 denote the components of a field of parallel vectors along the curve C defined by
dyŐĄ i
the equations y i = yŐĄ i (t), i = 1, 2, 3 in a space with metric tensor gŐĄij , i, j = 1, 2, 3. Assume that YŐĄ i and dt
are unit vectors such that at each point of the curve CŐĄ we have

dyŐĄ j
gŐĄij YŐĄ i = cos őł = Constant.
dt

(i.e. The field of parallel vectors makes a constant angle őł with the tangent to each point of the curve CŐĄ.)
Show that if YŐĄ i and yŐĄ i (t) undergo a transformation xi = xi (yŐĄ 1 , yŐĄ 2 , yŐĄ 3 ), i = 1, 2, 3 then the transformed
m
vector X m = YŐĄ i ‚ąāx
‚ąā yŐĄ j makes a constant angle with the tangent vector to the transformed curve C given by
xi = xi (yŐĄ 1 (t), yŐĄ 2 (t), yŐĄ 3 (t)).

-
‚ąāxi
I 43. Let J denote the Jacobian determinant | |. Differentiate J with respect to xm and show that
‚ąāxj
   
‚ąāJ őĪ ‚ąāxp r
= J ‚ąí J .
‚ąāxm őĪ p ‚ąāxm rm

## Hint: See Exercise 1.1, problem 27 and (1.4.7).

I 44. Assume that ŌÜ is a relative scalar of weight W so that ŌÜ = J W ŌÜ. Differentiate this relation with
respect to xk . Use the result from problem 43 to obtain the transformation law:
"   #     m
‚ąāŌÜ őĪ ‚ąāŌÜ r ‚ąāx
‚ąíW ŌÜ =J W
‚ąíW ŌÜ .
‚ąāxk őĪk ‚ąāxm mr ‚ąāxk

The quantity inside the brackets is called the covariant derivative of a relative scalar of weight W. The
covariant derivative of a relative scalar of weight W is defined as
 
‚ąāŌÜ r
ŌÜ ,k = ‚ąíW ŌÜ
‚ąāxk kr

## and this definition has an extra term involving the weight.

It can be shown that similar results hold for relative tensors of weight W. For example, the covariant
derivative of first and second order relative tensors of weight W have the forms
   
‚ąāT i i r
i
T ,k = + T ‚ąíW
m
Ti
‚ąāxk km kr
     
‚ąāTji i ŌÉ r
Tji ,k = + Tj ‚ąí
ŌÉ
TŌÉ ‚ąí W
i
Ti
‚ąāxk kŌÉ jk kr j

When the weight term is zero these covariant derivatives reduce to the results given in our previous definitions.

dxi
I 45. Let dt = v i denote a generalized velocity and define the scalar function of kinetic energy T of a
particle with mass m as
1 1
T = m gij v i v j = m gij xŐái xŐáj .
2 2
őīT dT
Show that the intrinsic derivative of T is the same as an ordinary derivative of T. (i.e. Show that őīT = dt .)
128

## I 46. Verify the relations

‚ąāgij ‚ąāg nm
= ‚ąígmj gni
‚ąāxk ‚ąāxk
‚ąāg in ‚ąāgjm
= ‚ąíg mn g ij
‚ąāxk ‚ąāxk

1 ‚ąā ‚ąö ijk 
I 47. Assume that B ijk is an absolute tensor. Is the quantity T jk = ‚ąö gB a tensor? Justify
g ‚ąāxi
impose upon B ijk such that the above quantity will be a tensor?

I 48. The e-permutation symbol can be used to define various vector products. Let Ai , Bi , Ci , Di
i = 1, . . . , N denote vectors, then expand and verify the following products:
(a) In two dimensions
R =eij Ai Bj a scalar determinant.
Ri =eij Aj a vector (rotation).
(b) In three dimensions
S =eijk Ai Bj Ck a scalar determinant.
Si =eijk Bj Ck a vector cross product.
Sij =eijk Ck a skew-symmetric matrix
(c) In four dimensions

## T =eijkm Ai Bj Ck Dm a scalar determinant.

Ti =eijkm Bj Ck Dm 4-dimensional cross product.
Tij =eijkm Ck Dm skew-symmetric matrix.
Tijk =eikm Dm skew-symmetric tensor.

## I 49. Expand the curl operator for:

(a) Two dimensions B = eij Aj,i
(b) Three dimensions Bi = eijk Ak,j
(c) Four dimensions Bij = eijkm Am,k
129

## ¬ß1.5 DIFFERENTIAL GEOMETRY AND RELATIVITY

In this section we will examine some fundamental properties of curves and surfaces. In particular, at
each point of a space curve we can construct a moving coordinate system consisting of a tangent vector, a
normal vector and a binormal vector which is perpendicular to both the tangent and normal vectors. How
these vectors change as we move along the space curve brings up the subjects of curvature and torsion
associated with a space curve. The curvature is a measure of how the tangent vector to the curve is changing
and the torsion is a measure of the twisting of the curve out of a plane. We will find that straight lines have
zero curvature and plane curves have zero torsion.
In a similar fashion, associated with every smooth surface there are two coordinate surface curves and
a normal surface vector through each point on the surface. The coordinate surface curves have tangent
vectors which together with the normal surface vectors create a set of basis vectors. These vectors can be
used to define such things as a two dimensional surface metric and a second order curvature tensor. The
coordinate curves have tangent vectors which together with the surface normal form a coordinate system at
each point of the surface. How these surface vectors change brings into consideration two different curvatures.
A normal curvature and a tangential curvature (geodesic curvature). How these curvatures are related to
the curvature tensor and to the Riemann Christoffel tensor, introduced in the last section, as well as other
interesting relationships between the various surface vectors and curvatures, is the subject area of differential
geometry.
Also presented in this section is a brief introduction to relativity where again the Riemann Christoffel
tensor will occur. Properties of this important tensor are developed in the exercises of this section.

## Space Curves and Curvature

For xi = xi (s),i = 1, 2, 3, a 3-dimensional space curve in a Riemannian space Vn with metric tensor gij ,
dxi
and arc length parameter s, the vector T i = ds represents a tangent vector to the curve at a point P on
i
the curve. The vector T is a unit vector because
dxi dxj
gij T i T j = gij = 1. (1.5.1)
ds ds
Differentiate intrinsically, with respect to arc length, the relation (1.5.1) and verify that

őīT j őīT i j
gij T i + gij T = 0, (1.5.2)
őīs őīs
which implies that
őīT i
gij T j = 0. (1.5.3)
őīs
őīT i
Hence, the vector őīs is perpendicular to the tangent vector T i . Define the unit normal vector N i to the
őīT i
space curve to be in the same direction as the vector őīs and write

1 őīT i
Ni = (1.5.4)
őļ őīs
where őļ is a scale factor, called the curvature, and is selected such that

őīT i őīT j
gij N i N j = 1 which implies gij = őļ2 . (1.5.5)
őīs őīs
130

The reciprocal of curvature is called the radius of curvature. The curvature measures the rate of change of
the tangent vector to the curve as the arc length varies. By differentiating intrinsically, with respect to arc
length s, the relation gij T i N j = 0 we find that
őīN j őīT i j
gij T i + gij N = 0. (1.5.6)
őīs őīs
Consequently, the curvature őļ can be determined from the relation
őīN j őīT i j
gij T i = ‚ąígij N = ‚ąígij őļN i N j = ‚ąíőļ (1.5.7)
őīs őīs
which defines the sign of the curvature. In a similar fashion we differentiate the relation (1.5.5) and find that
őīN j
gij N i = 0. (1.5.8)
őīs
őīN j
This later equation indicates that the vector őīs is perpendicular to the unit normal N i . The equation
(1.5.3) indicates that T i is also perpendicular to N i and hence any linear combination of these vectors will
also be perpendicular to N i . The unit binormal vector is defined by selecting the linear combination
őīN j
+ őļT j (1.5.9)
őīs
and then scaling it into a unit vector by defining
 
j 1 őīN j
B = + őļT j (1.5.10)
ŌĄ őīs
where ŌĄ is a scalar called the torsion. The sign of ŌĄ is selected such that the vectors T i , N i and B i form a
right handed system with ijk T i N j B k = 1 and the magnitude of ŌĄ is selected such that B i is a unit vector
satisfying
gij B i B j = 1. (1.5.11)

The triad of vectors T i , N i , B i at a point on the curve form three planes. The plane containing T i and B i is
called the rectifying plane. The plane containing N i and B i is called the normal plane. The plane containing
T i and N i is called the osculating plane. The reciprocal of the torsion is called the radius of torsion. The
torsion measures the rate of change of the osculating plane. The vectors T i , N i and B i form a right-handed
orthogonal system at a point on the space curve and satisfy the relation

B i = ijk Tj Nk . (1.5.12)

By using the equation (1.5.10) it can be shown that B i is perpendicular to both the vectors T i and N i since

## gij B i T j = 0 and gij B i N j = 0.

őīB i
It is left as an exercise to show that the binormal vector B i satisfies the relation őīs = ‚ąíŌĄ N i . The three
relations
őīT i
= őļN i
őīs
őīN i
= ŌĄ B i ‚ąí őļT i (1.5.13)
őīs
őīB i
= ‚ąíŌĄ N i
őīs
131

## Surfaces and Curvature

Let us examine surfaces in a Cartesian frame of reference and then later we can generalize our results
to other coordinate systems. A surface in Euclidean 3-dimensional space can be defined in several different
ways. Explicitly, z = f (x, y), implicitly, F (x, y, z) = 0 or parametrically by defining a set of parametric
equations of the form
x = x(u, v), y = y(u, v), z = z(u, v)

which contain two independent parameters u, v called surface coordinates. For example, the equations

## x = a sin őł cos ŌÜ, y = a sin őł sin ŌÜ, z = a cos őł

are the parametric equations which define a spherical surface of radius a with parameters u = őł and v = ŌÜ.
See for example figure 1.3-20 in section 1.3. By eliminating the parameters u, v one can derive the implicit
form of the surface and by solving for z one obtains the explicit form of the surface. Using the parametric
form of a surface we can define the position vector to a point on the surface which is then represented in
terms of the parameters u, v as

e1 + y(u, v) b
~r = ~r(u, v) = x(u, v) b e2 + z(u, v) b
e3 . (1.5.14)

The coordinates (u, v) are called the curvilinear coordinates of a point on the surface. The functions
x(u, v), y(u, v), z(u, v) are assumed to be real and differentiable such that ‚ąā~
r
‚ąāu √ó ‚ąā~
r
‚ąāv 6= 0. The curves

## ~r(u, c2 ) and ~r(c1 , v) (1.5.15)

with c1 , c2 constants, then define two surface curves called coordinate curves, which intersect at the surface
coordinates (c1 , c2 ). The family of curves defined by equations (1.5.15) with equally spaced constant values
‚ąā~
r ‚ąā~
r
ci , ci + ‚ąÜci , ci + 2‚ąÜci , . . . define a surface coordinate grid system. The vectors ‚ąāu and ‚ąāv evaluated at the
surface coordinates (c1 , c2 ) on the surface, are tangent vectors to the coordinate curves through the point
and are basis vectors for any vector lying in the surface. Letting (x, y, z) = (y 1 , y 2 , y 3 ) and (u, v) = (u1 , u2 )
and utilizing the summation convention, we can write the position vector in the form

~r = ~r(u1 , u2 ) = y i (u1 , u2 ) b
ei . (1.5.16)

The tangent vectors to the coordinate curves at a point P can then be represented as the basis vectors
i
~ őĪ = ‚ąā~r = ‚ąāy b
E ei , őĪ = 1, 2 (1.5.17)
‚ąāu őĪ ‚ąāu őĪ

where the partial derivatives are to be evaluated at the point P where the coordinate curves on the surface
intersect. From these basis vectors we construct a unit normal vector to the surface at the point P by
‚ąā~
r ‚ąā~
r
calculating the cross product of the tangent vector ~ru = ‚ąāu and ~rv = ‚ąāv . A unit normal is then

~1 √ó E
E ~2 ~ru √ó ~rv
b=n
n b(u, v) = = (1.5.18)
~ ~
|E1 √ó E2 | |~ru √ó ~rv |
132

~ 1, E
and is such that the vectors E ~ 2 and n
b form a right-handed system of coordinates.
If we transform from one set of curvilinear coordinates (u, v) to another set (uŐĄ, vŐĄ), which are determined
by a set of transformation laws
u = u(uŐĄ, vŐĄ), v = v(uŐĄ, vŐĄ),

## e1 + y(u(uŐĄ, vŐĄ), v(uŐĄ, vŐĄ)) b

~r = ~r(uŐĄ, vŐĄ) = x(u(uŐĄ, vŐĄ), v(uŐĄ, vŐĄ)) b e2 + z(u(uŐĄ, vŐĄ), v(uŐĄ, vŐĄ)) b
e3

## and the tangent vectors to the new coordinate curves are

‚ąā~r ‚ąā~r ‚ąāu ‚ąā~r ‚ąāv ‚ąā~r ‚ąā~r ‚ąāu ‚ąā~r ‚ąāv
= + and = + .
‚ąā uŐĄ ‚ąāu ‚ąā uŐĄ ‚ąāv ‚ąā uŐĄ ‚ąāvŐĄ ‚ąāu ‚ąāvŐĄ ‚ąāv ‚ąāvŐĄ
Using the indicial notation this result can be represented as

‚ąāy i ‚ąāy i ‚ąāuő≤
őĪ
= .
‚ąā uŐĄ ‚ąāuő≤ ‚ąā uŐĄőĪ
This is the transformation law connecting the two systems of basis vectors on the surface.
A curve on the surface is defined by a relation f (u, v) = 0 between the curvilinear coordinates. Another
way to represent a curve on the surface is to represent it in a parametric form where u = u(t) and v = v(t),
where t is a parameter. The vector
d~r ‚ąā~r du ‚ąā~r dv
= +
dt ‚ąāu dt ‚ąāv dt
is tangent to the curve on the surface.
An element of arc length with respect to the surface coordinates is represented by
‚ąā~r ‚ąā~r
ds2 = d~r ¬∑ d~r = őĪ
¬∑ duőĪ duő≤ = aőĪő≤ duőĪ duő≤ (1.5.19)
‚ąāu ‚ąāuő≤
where aőĪő≤ = ‚ąā~
r
‚ąāuőĪ ¬∑ ‚ąā~
r
‚ąāuő≤ with őĪ, ő≤ = 1, 2 defines a surface metric. This element of arc length on the surface is
often written as the quadratic form
1 EG ‚ąí F 2 2
A = ds2 = E(du)2 + 2F du dv + G(dv)2 = (E du + F dv)2 + dv (1.5.20)
E E
and called the first fundamental form of the surface. Observe that for ds2 to be positive definite the quantities
E and EG ‚ąí F 2 must be positive.
The surface metric associated with the two dimensional surface is defined by

~őĪ ¬∑ E
~ő≤ = ‚ąā~r ‚ąā~r ‚ąāy i ‚ąāy i
aőĪő≤ = E őĪ
¬∑ ő≤
= , őĪ, ő≤ = 1, 2 (1.5.21)
‚ąāu ‚ąāu ‚ąāuőĪ ‚ąāuő≤
with conjugate metric tensor aőĪő≤ defined such that aőĪő≤ aő≤ő≥ = őīő≥őĪ . Here the surface is embedded in a three
dimensional space with metric gij and aőĪő≤ is the two dimensional surface metric. In the equation (1.5.20)
the quantities E, F, G are functions of the surface coordinates u, v and are determined from the relations
‚ąā~r ‚ąā~r ‚ąāy i ‚ąāy i
E =a11 = ¬∑ =
‚ąāu ‚ąāu ‚ąāu1 ‚ąāu1
‚ąā~r ‚ąā~r ‚ąāy i ‚ąāy i
F =a12 = ¬∑ = (1.5.22)
‚ąāu ‚ąāv ‚ąāu1 ‚ąāu2
‚ąā~r ‚ąā~r ‚ąāy i ‚ąāy i
G =a22 = ¬∑ =
‚ąāv ‚ąāv ‚ąāu2 ‚ąāu2
133

Here and throughout the remainder of this section, we adopt the convention that Greek letters have the
range 1,2, while Latin letters have the range 1,2,3.
b at this point. Also construct a
Construct at a general point P on the surface the unit normal vector n
b. Observe that there are an infinite number of planes
plane which contains this unit surface normal vector n
which contain this unit surface normal. For now, select one of these planes, then later on we will consider
all such planes. Let ~r = ~r(s) denote the position vector defining a curve C which is the intersection of the
selected plane with the surface, where s is the arc length along the curve, which is measured from some fixed
point on the curve. Let us find the curvature of this curve of intersection. The vector Tb = d~r , evaluated
ds
at the point P, is a unit tangent vector to the curve C and lies in the tangent plane to the surface at the
point P. Here we are using ordinary differentiation rather than intrinsic differentiation because we are in
a Cartesian system of coordinates. Differentiating the relation Tb ¬∑ Tb = 1, with respect to arc length s we
find that Tb ¬∑ dTb = 0 which implies that the vector dTb is perpendicular to the tangent vector Tb. Since the
ds ds
coordinate system is Cartesian we can treat the curve of intersection C as a space curve, then the vector
K~ = dTb , evaluated at point P, is defined as the curvature vector with curvature |K| ~ = őļ and radius of
ds
curvature R = 1/őļ. A unit normal N b to the space curve is taken in the same direction as dTb so that the
ds

~ b dTb
curvature will always be positive. We can then write K = őļN = . Consider the geometry of figure 1.5-1
ds
b=n
and define on the surface a unit vector u b √ó Tb which is perpendicular to both the surface tangent vector
Tb and the surface normal vector nb, such that the vectors T i ,ui and ni forms a right-handed system.

Figure 1.5-1 Surface curve with tangent plane and a normal plane.
134

## The direction of u b in relation to Tb is in the same sense as the surface tangents E

~ 1 and E
~ 2 . Note that
the vector ddsTb is perpendicular to the tangent vector Tb and lies in the plane which contains the vectors n b
b. We can therefore write the curvature vector K in the component form
and u ~

b
~ = dT = őļ(n) n
K b + őļ(g) u ~n +K
b=K ~g (1.5.23)
ds

where őļ(n) is called the normal curvature and őļ(g) is called the geodesic curvature. The subscripts are not
b ¬∑ Tb = 0 we obtain
indices. These curvatures can be calculated as follows. From the orthogonality condition n
b
dT db
n
b¬∑
by differentiation with respect to arc length s the result n + Tb ¬∑ = 0. Consequently, the normal
ds ds
curvature is determined from the dot product relation

~ = őļ(n) = ‚ąíTb ¬∑ db
b¬∑K
n
n d~r db
=‚ąí ¬∑
n
. (1.5.24)
ds ds ds

## b with equation (1.5.23) we find that the geodesic curvature is determined

By taking the dot product of u
from the triple scalar product relation

dTb dTb
b¬∑
őļ(g) = u n √ó Tb) ¬∑
= (b . (1.5.25)
ds ds
Normal Curvature

n. (1.5.26)

## b and position vector ~r are functions of the surface coordinates u, v with

The unit normal to the surface n

‚ąā~r ‚ąā~r ‚ąāb
n ‚ąāb
n
d~r = du + dv and db
n= du + dv. (1.5.27)
‚ąāu ‚ąāv ‚ąāu ‚ąāv

## We define the quadratic form

   
‚ąā~r ‚ąā~r ‚ąāb
n ‚ąāb
n
B = ‚ąíd~r ¬∑ db
n=‚ąí du + dv ¬∑ du + dv
‚ąāu ‚ąāv ‚ąāu ‚ąāv (1.5.28)
2 2 őĪ ő≤
B = e(du) + 2f du dv + g(dv) = bőĪő≤ du du

where  
‚ąā~r ‚ąāb
n ‚ąā~r ‚ąāb
n ‚ąāb n ‚ąā~r ‚ąā~r ‚ąāb
n
e=‚ąí ¬∑ , 2f = ‚ąí ¬∑ + ¬∑ , g=‚ąí ¬∑ (1.5.29)
‚ąāu ‚ąāu ‚ąāu ‚ąāv ‚ąāu ‚ąāv ‚ąāv ‚ąāv
and bőĪő≤ őĪ, ő≤ = 1, 2 is called the curvature tensor and aőĪő≥ bőĪő≤ = bő≥ő≤ is an associated curvature tensor.
The quadratic form of equation (1.5.28) is called the second fundamental form of the surface. Alternative
methods for calculating the coefficients of this quadratic form result from the following considerations. The
unit surface normal is perpendicular to the tangent vectors to the coordinate curves at the point P and
therefore we have the orthogonality relationships

‚ąā~r ‚ąā~r
¬∑n
b=0 and ¬∑n
b = 0. (1.5.30)
‚ąāu ‚ąāv
135

Observe that by differentiating the relations in equation (1.5.30), with respect to both u and v, one can
derive the results
‚ąā 2~r ‚ąā~r ‚ąāb n
e=
2
¬∑n
b=‚ąí ¬∑ = b11
‚ąāu ‚ąāu ‚ąāu
‚ąā 2~r ‚ąā~r ‚ąāb n ‚ąāb
n ‚ąā~r
f= ¬∑nb=‚ąí ¬∑ =‚ąí ¬∑ = b21 = b12 (1.5.31)
‚ąāu‚ąāv ‚ąāu ‚ąāv ‚ąāu ‚ąāv
‚ąā 2~r ‚ąā~r ‚ąāb n
g= 2
¬∑n
b=‚ąí ¬∑ = b22
‚ąāv ‚ąāv ‚ąāv
and consequently the curvature tensor can be expressed as

‚ąā~r ‚ąāb
n
bőĪő≤ = ‚ąí őĪ
¬∑ . (1.5.32)
‚ąāu ‚ąāuő≤

The quadratic forms from equations (1.5.20) and (1.5.28) enable us to represent the normal curvature
in the form of a ratio of quadratic forms. We find from equation (1.5.26) that the normal curvature in the
du
direction dv is
B e(du)2 + 2f du dv + g(dv)2
őļ(n) = = . (1.5.33)
A E(du)2 + 2F du dv + G(dv)2
If we write the unit tangent vector to the curve in the form Tb = d~r r duőĪ
‚ąā~
ds = ‚ąāuőĪ ds and express the derivative
of the unit surface normal with respect to arc length as ddsb
n ‚ąāb
n duő≤
= ‚ąāuő≤ ds , then the normal curvature can be

## expressed in the form  

db
n ‚ąā~r ‚ąāb
n duőĪ duő≤
őļ(n) = ‚ąíTb ¬∑ =‚ąí ¬∑
ds ‚ąāuőĪ ‚ąāuő≤ ds ds
(1.5.34)
bőĪő≤ duőĪ duő≤ bőĪő≤ duőĪ duő≤
= = .
ds2 aőĪő≤ duőĪ duő≤
Observe that the curvature tensor is a second order symmetric tensor.
In the previous discussions, the plane containing the unit normal vector was arbitrary. Let us now
consider all such planes that pass through this unit surface normal. As we vary the plane containing the unit
b at P we get different curves of intersection with the surface. Each curve has a curvature
surface normal n
associated with it. By examining all such planes we can find the maximum and minimum normal curvatures
associated with the surface. We write equation (1.5.33) in the form

e + 2f őĽ + gőĽ2
őļ(n) = (1.5.35)
E + 2F őĽ + GőĽ2
dv
where őĽ = du . From the theory of proportions we can also write this equation in the form

(e + f őĽ) + őĽ(f + gőĽ) f + gőĽ e + főĽ
őļ(n) = = = . (1.5.36)
(E + F őĽ) + őĽ(F + GőĽ) F + GőĽ E + FőĽ

## (e ‚ąí őļE)du + (f ‚ąí őļF )dv = 0 and (f ‚ąí őļF )du + (g ‚ąí őļG)dv = 0. (1.5.37)

dőļ(n)
The maximum and minimum curvatures occur in those directions őĽ where dőĽ = 0. Calculating the deriva-
tive of őļ(n) with respect to őĽ and setting the derivative to zero we obtain a quadratic equation in őĽ

136

## This equation has two roots őĽ1 and őĽ2 which satisfy

Eg ‚ąí Ge Ef ‚ąí F e
őĽ1 + őĽ2 = ‚ąí and őĽ1 őĽ2 = , (1.5.38)
F g ‚ąí Gf F g ‚ąí Gf

where F g ‚ąí Gf 6= 0. The curvatures őļ(1) ,őļ(2) corresponding to the roots őĽ1 and őĽ2 are called the principal
curvatures at the point P. Several quantities of interest that are related to őļ(1) and őļ(2) are: (1) the principal
1
radii of curvature Ri = 1/őļi ,i = 1, 2; (2) H = 2 (őļ(1) + őļ(2) ) called the mean curvature and K = őļ(1) őļ(2)
called the total curvature or Gaussian curvature of the surface. Observe that the roots őĽ1 and őĽ2 determine
two directions on the surface

## d~r1 ‚ąā~r ‚ąā~r d~r2 ‚ąā~r ‚ąā~r

= + őĽ1 and = + őĽ2 .
du ‚ąāu ‚ąāv du ‚ąāu ‚ąāv

## d~r1 d~r2 ‚ąā~r ‚ąā~r ‚ąā~r ‚ąā~r

¬∑ =( + őĽ1 )( + őĽ2 ) = 0.
du du ‚ąāu ‚ąāv ‚ąāu ‚ąāv

## This requires that

GőĽ1 őĽ2 + F (őĽ1 + őĽ2 ) + E = 0. (1.5.39)

It is left as an exercise to verify that this is indeed the case and so the directions determined by the principal
curvatures must be orthogonal. In the case where F g ‚ąí Gf = 0 we have that F = 0 and f = 0 because the
coordinate curves are orthogonal and G must be positive. In this special case there are still two directions
determined by the differential equations (1.5.37) with dv = 0, du arbitrary, and du = 0, dv arbitrary. From
the differential equations (1.5.37) we find these directions correspond to

e g
őļ(1) = and őļ(2) = .
E G
duőĪ
We let őĽőĪ = ds denote a unit vector on the surface satisfying aőĪő≤ őĽőĪ őĽő≤ = 1. Then the equation (1.5.34)
can be written as őļ(n) = bőĪő≤ őĽőĪ őĽő≤ or we can write (bőĪő≤ ‚ąí őļ(n) aőĪő≤ )őĽőĪ őĽő≤ = 0. The maximum and minimum
normal curvature occurs in those directions őĽőĪ where

## (bőĪő≤ ‚ąí őļ(n) aőĪő≤ )őĽőĪ = 0

and so őļ(n) must be a root of the determinant equation |bőĪő≤ ‚ąí őļ(n) aőĪő≤ | = 0 or
1
b ‚ąíőļ b12
|a őĪő≥
bőĪő≤ ‚ąí őļ(n) őīő≤ő≥ | = 1 2 (n) = őļ2 ‚ąí bőĪő≤ aőĪő≤ őļ(n) + b = 0. (1.5.40)
b1 b22 ‚ąí őļ(n) (n)
a

This is a quadratic equation in őļ(n) of the form őļ2(n) ‚ąí (őļ(1) + őļ(2) )őļ(n) + őļ(1) őļ(2) = 0. In other words the
principal curvatures őļ(1) and őļ(2) are the eigenvalues of the matrix with elements bő≥ő≤ = aőĪő≥ bőĪő≤ . Observe that
from the determinant equation in őļ(n) we can directly find the total curvature or Gaussian curvature which
is an invariant given by K = őļ(1) őļ(2) = |bőĪ
ő≤ | = |a bő≥ő≤ | = b/a. The mean curvature is also an invariant
őĪő≥

1 1 őĪő≤
obtained from H = 2 (őļ(1) + őļ(2) ) = 2a bőĪő≤ , where a = a11 a22 ‚ąí a12 a21 and b = b11 b22 ‚ąí b12 b21 are the
determinants formed from the surface metric tensor and curvature tensor components.
137

## The equations of Gauss, Weingarten and Codazzi

At each point on a space curve we can construct a unit tangent T~ , a unit normal N
~ and unit binormal
~ The derivatives of these vectors, with respect to arc length, can also be represented as linear combinations
B.
of the base vectors T~ , N
~ , B.
~ See for example the Frenet-Serret formulas from equations (1.5.13). In a similar
b form a basis and the derivatives of these basis vectors with respect to
fashion the surface vectors ~ru , ~rv , n
b. For
the surface coordinates u, v can also be expressed as linear combinations of the basis vectors ~ru , ~rv , n
b. We can write
example, the derivatives ~ruu , ~ruv , ~rvv can be expressed as linear combinations of ~ru , ~rv , n

b
~ruu = c1~ru + c2~rv + c3 n
b
~ruv = c4~ru + c5~rv + c6 n (1.5.41)
b
~rvv = c7~ru + c8~rv + c9 n

where c1 , . . . , c9 are constants to be determined. It is an easy exercise (see exercise 1.5, problem 8) to show
that these equations can be written in the indicial notation as
 
‚ąā 2~r ő≥ ‚ąā~r
őĪ ő≤
= b.
+ bőĪő≤ n (1.5.42)
‚ąāu ‚ąāu őĪ ő≤ ‚ąāuő≥
These equations are known as the Gauss equations.
In a similar fashion the derivatives of the normal vector can be represented as linear combinations of
the surface basis vectors. If we write
‚ąāb
n ‚ąā~r ‚ąāb
n ‚ąāb
n
= c1~ru + c2~rv = c‚ąó1 + c‚ąó2
‚ąāu or ‚ąāu ‚ąāu ‚ąāv (1.5.43)
‚ąāb
n ‚ąā~r ‚ąāb
n ‚ąāb
n
= c3~ru + c4~rv = c‚ąó3 + c‚ąó4
‚ąāv ‚ąāv ‚ąāu ‚ąāv
where c1 , . . . , c4 and c‚ąó1 , . . . , c‚ąó4 are constants. These equations are known as the Weingarten equations. It
is easily demonstrated (see exercise 1.5, problem 9) that the Weingarten equations can be written in the
indicial form
‚ąāb
n ‚ąā~r
= ‚ąíbő≤őĪ ő≤ (1.5.44)
‚ąāuőĪ ‚ąāu
where bő≤őĪ = aő≤ő≥ bő≥őĪ is the mixed second order form of the curvature tensor.
The equations of Gauss produce a system of partial differential equations defining the surface coordinates
i
x as a function of the curvilinear coordinates u and v. The equations are not independent as certain
compatibility conditions must be satisfied. In particular, it is required that the mixed partial derivatives
must satisfy
‚ąā 3~r ‚ąā 3~r
= .
‚ąāuőĪ ‚ąāuő≤ ‚ąāuőī ‚ąāuőĪ ‚ąāuőī ‚ąāuő≤
We calculate  
  ‚ąā
ő≥
‚ąā 3~r ő≥ ‚ąā 2~r őĪő≤ ‚ąā~r ‚ąāb
n ‚ąābőĪő≤
= + + bőĪő≤ őī + b
n
‚ąāuőĪ ‚ąāuő≤ ‚ąāuőī őĪő≤ ‚ąāuő≥ ‚ąāuőī ‚ąāuőī ‚ąāuő≥ ‚ąāu ‚ąāuőī
and use the equations of Gauss and Weingarten to express this derivative in the form
Ô£ģ   Ô£Ļ
‚ąā
ŌČ
      
‚ąā 3~r Ô£Į őĪő≤ ő≥ ŌČ Ô£ļ r
ŌČ Ô£ļ ‚ąā~ ő≥ ‚ąābőĪő≤
= Ô£Į + ‚ąí b b + b + b.
n
‚ąāuőĪ ‚ąāuő≤ ‚ąāuőī Ô£į ‚ąāuőī őĪő≤ ő≥őī
őĪő≤ őī Ô£Ľ
‚ąāuŌČ őĪő≤
ő≥őī
‚ąāuőī
138

## Forming the difference

‚ąā 3~r ‚ąā 3~r
‚ąí =0
‚ąāuőĪ ‚ąāuő≤ ‚ąāuőī ‚ąāuőĪ ‚ąāuőī ‚ąāuő≤
b and
we find that the coefficients of the independent vectors n ‚ąā~
r
‚ąāuŌČ b
must be zero. Setting the coefficient of n
equal to zero produces the Codazzi equations
   
ő≥ ő≥ ‚ąābőĪő≤ ‚ąābőĪőī
bő≥őī ‚ąí bő≥ő≤ + ‚ąí = 0. (1.5.45)
őĪő≤ őĪőī ‚ąāuőī ‚ąāuő≤

These equations are sometimes referred to as the Mainardi-Codazzi equations. Equating to zero the coefficient
of ‚ąā~
r
‚ąāuŌČ we find that RőīőĪő≥ő≤ = bőĪő≤ bőīő≥ ‚ąí bőĪő≥ bőīő≤ or changing indices we have the covariant form

## aŌČőī RőīőĪő≤ő≥ = RŌČőĪő≤ő≥ = bŌČő≤ bőĪő≥ ‚ąí bŌČő≥ bőĪő≤ , (1.5.46)

where          
‚ąā őī ‚ąā őī ŌČ őī ŌČ őī
RőīőĪő≥ő≤ = ‚ąí + ‚ąí (1.5.47)
‚ąāuő≥ őĪő≤ ‚ąāuő≤ őĪő≥ őĪő≤ ŌČő≥ őĪő≥ ŌČő≤
is the mixed Riemann curvature tensor.
EXAMPLE 1.5-1
Show that the Gaussian or total curvature K = őļ(1) őļ(2) depends only upon the metric aőĪő≤ and is
R1212
K= where a = det(aőĪő≤ ).
a
Solution:
Utilizing the two-dimensional alternating tensor eőĪő≤ and the property of determinants we can write
eő≥őī K = eőĪő≤ bő≥őĪ bőīő≤ where from page 137, K = |bő≥ő≤ | = |aőĪő≥ bőĪő≤ |. Now multiply by eő≥ő∂ and then contract on
ő∂ and őī to obtain
eő≥őī eő≥őī K = eő≥őī eőĪő≤ bő≥őĪ bőīő≤ = 2K

2K = eő≥őī eőĪő≤ (aő≥¬Ķ bőĪu ) aőīőĹ bő≤őĹ
‚ąö
But eő≥őī aő≥¬Ķ aőīőĹ = ae¬ĶőĹ so that 2K = eőĪő≤ a e¬ĶőĹ bőĪ¬Ķ bő≤őĹ . Using ae¬ĶőĹ = ¬ĶőĹ we have 2K = ¬ĶőĹ őĪő≤ bőĪ¬Ķ bő≤őĹ .
Interchanging indices we can write

## 2K = ő≤ő≥ ŌČőĪ bŌČő≤ bőĪő≥ and 2K = ő≥ő≤ ŌČőĪ bŌČő≥ bőĪő≤ .

Adding these last two results we find that 4K = ő≤ő≥ ŌČő≥ (bŌČő≤ bőĪő≥ ‚ąí bŌČő≥ bőĪő≤ ) = ő≤ő≥ ŌČő≥ RŌČőĪő≤ő≥ . Now multiply
ő≤ő≥ ŌČőĪ
both sides by ŌÉŌĄ őĽőĹ to obtain 4KŌÉŌĄ őĽőĹ = őīŌÉŌĄ őīőĽőĹ RŌČőĪő≤ő≥ . From exercise 1.5, problem 16, the Riemann
curvature tensor Rijkl is skew symmetric in the (i, j), (k, l) as well as being symmetric in the (ij), (kl) pair
ő≤ő≥ ŌČőĪ
of indices. Consequently, őīŌÉŌĄ őīőĽőĹ RŌČőĪő≤ő≥ = 4RőĽőĹŌÉŌĄ and hence RőĽőĹŌÉŌĄ = KŌÉŌĄ őĽőĹ and we have the special case
‚ąö ‚ąö R1212 b
where K ae12 ae12 = R1212 or K = . A much simpler way to obtain this result is to observe K =
a a
(bottom of page 137) and note from equation (1.5.46) that R1212 = b11 b22 ‚ąí b12 b21 = b.

Note that on a surface ds2 = aőĪő≤ duőĪ duő≤ where aőĪő≤ are the metrices for the surface. This metric is a
‚ąāuőĪ ‚ąāuő≤
tensor and satisfies aŐĄő≥őī = aőĪő≤ ő≥ and by taking determinants we find
‚ąā uŐĄ ‚ąā uŐĄőī
‚ąāuőĪ ‚ąāuő≤

aŐĄ = aŐĄő≥őī ő≥ őī = aJ 2
‚ąā uŐĄ ‚ąā uŐĄ
139

where J is the Jacobian of the surface coordinate transformation. Here the curvature tensor for the surface
RőĪő≤ő≥őī has only one independent component since R1212 = R2121 = ‚ąíR1221 = ‚ąíR2112 (See exercises 20,21).
From the transformation law
‚ąāuőĪ ‚ąāuő≤ ‚ąāuő≥ ‚ąāuőī
RŐĄő∑őĽ¬Ķ = RőĪő≤ő≥őī
‚ąā uŐĄ ‚ąā uŐĄő∑ ‚ąā uŐĄőĽ ‚ąā uŐĄ¬Ķ
one can sum over the repeated indices and show that RŐĄ1212 = R1212 J 2 and consequently

RŐĄ1212 R1212
= =K
aŐĄ a

## which shows that the Gaussian curvature is a scalar invariant in V2 .

Geodesic Curvature

## ~ associated with this curve, is

For C an arbitrary curve on a given surface the curvature vector K,
b and geodesic curvature őļ(g) u
the vector sum of the normal curvature őļ(n) n b and lies in a plane which
is perpendicular to the tangent vector to the given curve on the surface. The geodesic curvature őļ(g) is
obtained from the equation (1.5.25) and can be represented
!
dT~ dT~ dT~
őļ(g) =u ~ =u
b¬∑K b¬∑ n √ó T~ ) ¬∑
= (b = T~ √ó ¬∑n
b.
ds ds ds

## Substituting into this expression the vectors

d~r du dv
T~ = = ~ru + ~rv
ds ds ds
~
dT
=K~ = ~ruu (u 0 )2 + 2~ruv u 0 v 0 + ~rvv (v 0 )2 + ~ru u 00 + ~rv v 00 ,
ds
0 d
where = ds , and by utilizing the results from problem 10 of the exercises following this section, we find
that the geodesic curvature can be represented as
      
2 0 3 2 1
őļ(g) = (u ) + 2 ‚ąí (u 0 )2 v 0 +
11 12 11
      p (1.5.48)
2 1 1
‚ąí2 u 0 (v 0 )2 ‚ąí (v 0 )3 + (u 0 v 00 ‚ąí u 00 v 0 ) EG ‚ąí F 2 .
22 12 22

This equation indicates that the geodesic curvature is only a function of the surface metrices E, F, G and
the derivatives u 0 , v 0 , u 00 , v 00 . When the geodesic curvature is zero the curve is called a geodesic curve. Such
curves are often times, but not always, the lines of shortest distance between two points on a surface. For
example, the great circle on a sphere which passes through two given points on the sphere is a geodesic curve.
If you erase that part of the circle which represents the shortest distance between two points on the circle
you are left with a geodesic curve connecting the two points, however, the path is not the shortest distance
between the two points.
For plane curves we let u = x and v = y so that the geodesic curvature reduces to

dŌÜ
kg = u 0 v 00 ‚ąí u 00 v 0 =
ds
140

where ŌÜ is the angle between the tangent T~ to the curve and the unit vector b
e1 .
Geodesics are curves on the surface where the geodesic curvature is zero. Since kg = 0 along a geodesic
surface curve, then at every point on this surface curve the normal N ~ to the curve will be in the same
b to the surface. In this case, we have ~ru ¬∑ n
direction as the normal n b = 0 and ~rv ¬∑ n
b = 0 which reduces to

dT~ dT~
¬∑ ~ru = 0 and ¬∑ ~rv = 0, (1.5.49)
ds ds
~
b and
since the vectors n dT
ds have the same direction. In particular, we may write

## d~r ‚ąā~r du ‚ąā~r dv

T~ = = + = ~ru u0 + ~rv v 0
ds ‚ąāu ds ‚ąāv ds
dT~
= ~ruu (u 0 )2 + 2~ruv u 0 v 0 + ~rvv (v 0 )2 + ~ru u 00 + ~rv v 00
ds

## Consequently, the equations (1.5.49) become

dT~
¬∑ ~ru = (~ruu ¬∑ ~ru ) (u 0 )2 + 2(~ruv ¬∑ ~ru ) u 0 v 0 + (~rvv ¬∑ ~ru ) (v 0 )2 + Eu 00 + F v 00 = 0
ds . (1.5.50)
dT~ 0 2 0 0 0 2 00 00
¬∑ ~rv = (~ruu ¬∑ ~rv ) (u ) + 2(~ruv ¬∑ ~rv ) u v + (~rvv ¬∑ ~rv ) (v ) + F u + Gv = 0.
ds

Utilizing the results from exercise 1.5,(See problems 4,5 and 6), we can eliminate v 00 from the equations
(1.5.50) to obtain
  2     2
d2 u 1 du 1 du dv 1 dv
+ +2 + =0
ds2 11 ds 12 ds ds 22 ds
and eliminating u 00 from the equations (1.5.50) produces the equation
  2     2
d2 v 2 du 2 du dv 2 dv
+ +2 + = 0.
ds2 11 ds 12 ds ds 22 ds

## In tensor form, these last two equations are written

 
d2 uőĪ őĪ duő≤ duő≥
+ = 0, őĪ, ő≤, ő≥ = 1, 2 (1.5.51)
ds2 ő≤ő≥ a ds ds

where u = u1 and v = u2 . The equations (1.5.51) are the differential equations defining a geodesic curve on
a surface. We will find that these same type of equations arise in considering the shortest distance between
two points in a generalized coordinate system. See for example problem 18 in exercise 2.2.
141

Tensor Derivatives
Let uőĪ = uőĪ (t) denote the parametric equations of a curve on the surface defined by the parametric
equations xi = xi (u1 , u2 ). We can then represent the surface curve in the spatial geometry since the surface
curve can be represented in the spatial coordinates through the representation xi = xi (u1 (t), u2 (t)) = xi (t).
Recall that for xi = xi (t) a given curve C , the intrinsic derivative of a vector field Ai along C is defined as
the inner product of the covariant derivative of the vector field with the tangent vector to the curve. This
intrinsic derivative is written
"   #
őīAi i dx
j
‚ąāAi i k dx
j
= A ,j = j
+ A
őīt dt ‚ąāx jk g dt

or  
őīAi dAi i dxj
= + Ak
őīt dt jk g dt
where the subscript g indicates that the Christoffel symbol is formed from the spatial metric gij . If AőĪ is a
surface vector defined along the curve C, the intrinsic derivative is represented
 őĪ    ő≤
őīAőĪ duő≤ ‚ąāA őĪ ő≥ du
= AőĪ,ő≤ = + A
őīt dt ‚ąāuő≤ ő≤ő≥ a dt

or  
őīAőĪ dAőĪ őĪ duő≤
= + Aő≥
őīt dt ő≤ő≥ a dt
where the subscript a denotes that the Christoffel is formed from the surface metric aőĪő≤ .
Similarly, the formulas for the intrinsic derivative of a covariant spatial vector Ai or covariant surface
vector AőĪ are given by  
őīAi dAi k dxj
= ‚ąí Ak
őīt dt ij g dt
and  
őīAőĪ dAőĪ ő≥ duő≤
= ‚ąí AőĪ .
őīt dt őĪő≤ a dt
Consider a mixed tensor TőĪi which is contravariant with respect to a transformation of space coordinates
x and covariant with respect to a transformation of surface coordinates uőĪ . For TőĪi defined over the surface
i

curve C, which can also be viewed as a space curve C, define the scalar invariant ő® = ő®(t) = TőĪi Ai B őĪ where
Ai is a parallel vector field along the curve C when it is viewed as a space curve and B őĪ is also a parallel
vector field along the curve C when it is viewed as a surface curve. Recall that these parallel vector fields
must satisfy the differential equations
   
őīAi dAi k dxj őīB őĪ dB őĪ őĪ duő≤
= ‚ąí Ak = 0 and = + Bő≥ = 0. (1.5.52)
őīt dt ij g dt őīt dt ő≤ő≥ a dt

The scalar invariant ő® is a function of the parameter t of the space curve since both the tensor and the
parallel vector fields are to be evaluated along the curve C. By differentiating the function ő® with respect
to the parameter t there results

dő® dT i dAi őĪ dB őĪ
= őĪ Ai B őĪ + TőĪi B + TőĪi Ai . (1.5.53)
dt dt dt dt
142

But the vectors Ai and B őĪ are parallel vector fields and must satisfy the relations given by equations (1.5.52).
This implies that equation (1.5.53) can be written in the form
"     #
dő® dTőĪi i k dx
j
ő≥ i du
ő≤
= + T ‚ąí T Ai B őĪ . (1.5.54)
dt dt k j g őĪ dt ő≤ őĪ a ő≥ dt

The quantity inside the brackets of equation (1.5.54) is defined as the intrinsic tensor derivative with respect
to the parameter t along the curve C. This intrinsic tensor derivative is written
   
őīTőĪi dTőĪi i k dx
j
ő≥ duő≤
= + TőĪ ‚ąí Tő≥i . (1.5.55)
dt dt kj g dt ő≤őĪ a dt

The spatial representation of the curve C is related to the surface representation of the curve C through the
defining equations. Therefore, we can express the equation (1.5.55) in the form
"     #
őīTőĪi ‚ąāTőĪi i k ‚ąāx
j
ő≥ i du
ő≤
= + T ‚ąí T (1.5.56)
dt ‚ąāuő≤ k j g őĪ ‚ąāuő≤ ő≤ őĪ a ő≥ dt

The quantity inside the brackets is a mixed tensor which is defined as the tensor derivative of TőĪi with
respect to the surface coordinates uő≤ . The tensor derivative of the mixed tensor TőĪi with respect to the
surface coordinates uő≤ is written
   
‚ąāTőĪi i ‚ąāxj ő≥
i
TőĪ,ő≤ = + TőĪk ‚ąí Tő≥i .
‚ąāuő≤ kj g ‚ąāuő≤ ő≤őĪ a

i...j
In general, given a mixed tensor TőĪ...ő≤ which is contravariant with respect to transformations of the
space coordinates and covariant with respect to transformations of the surface coordinates, then we can
define the scalar field along the surface curve C as

i...j
ő®(t) = TőĪ...ő≤ Ai ¬∑ ¬∑ ¬∑ Aj B őĪ ¬∑ ¬∑ ¬∑ B ő≤ (1.5.57)

where Ai , . . . , Aj and B őĪ , . . . , B ő≤ are parallel vector fields along the curve C. The intrinsic tensor derivative
is then derived by differentiating the equation (1.5.57) with respect to the parameter t.
Tensor derivatives of the metric tensors gij , aőĪő≤ and the alternating tensors ijk , őĪő≤ and their associated
tensors are all zero. Hence, they can be treated as constants during the tensor differentiation process.
Generalizations
In a Riemannian space Vn with metric gij and curvilinear coordinates xi , i = 1, 2, 3, the equations of a
surface can be written in the parametric form xi = xi (u1 , u2 ) where uőĪ , őĪ = 1, 2 are called the curvilinear
coordinates of the surface. Since
‚ąāxi őĪ
dxi = du (1.5.58)
‚ąāuőĪ
then a small change duőĪ on the surface results in change dxi in the space coordinates. Hence an element of
arc length on the surface can be represented in terms of the curvilinear coordinates of the surface. This same
element of arc length can also be represented in terms of the curvilinear coordinates of the space. Thus, an
element of arc length squared in terms of the surface coordinates is represented

## ds2 = aőĪő≤ duőĪ duő≤ (1.5.59)

143

where aőĪő≤ is the metric of the surface. This same element when viewed as a spatial element is represented

## By equating the equations (1.5.59) and (1.5.60) we find that

‚ąāxi ‚ąāxj őĪ ő≤
gij dxi dxj = gij du du = aőĪő≤ duőĪ duő≤ . (1.5.61)
‚ąāuőĪ ‚ąāuő≤
The equation (1.5.61) shows that the surface metric is related to the spatial metric and can be calculated
‚ąāxi ‚ąāxj
from the relation aőĪő≤ = gij őĪ ő≤ . This equation reduces to the equation (1.5.21) in the special case of
‚ąāu ‚ąāu
Cartesian coordinates. In the surface coordinates we define the quadratic form A = aőĪő≤ duőĪ duő≤ as the first
fundamental form of the surface. The tangent vector to the coordinate curves defining the surface are given
‚ąāxi
by ‚ąāuőĪ and can be viewed as either a covariant surface vector or a contravariant spatial vector. We define
this vector as
‚ąāxi
xiőĪ = , i = 1, 2, 3, őĪ = 1, 2. (1.5.62)
‚ąāuőĪ
Any vector which is a linear combination of the tangent vectors to the coordinate curves is called a surface
vector. A surface vector AőĪ can also be viewed as a spatial vector Ai . The relation between the spatial
representation and surface representation is Ai = AőĪ xiőĪ . The surface representation AőĪ , őĪ = 1, 2 and the
spatial representation Ai , i = 1, 2, 3 define the same direction and magnitude since

## gij Ai Aj = gij AőĪ xiőĪ Aő≤ xjő≤ = gij xiőĪ xjő≤ AőĪ Aő≤ = aőĪő≤ AőĪ Aő≤ .

Consider any two surface vectors AőĪ and B őĪ and their spatial representations Ai and B i where

## Ai = AőĪ xiőĪ and B i = B őĪ xiőĪ . (1.5.63)

These vectors are tangent to the surface and so a unit normal vector to the surface can be defined from the
cross product relation
ni AB sin őł = ijk Aj B k (1.5.64)

where A, B are the magnitudes of Ai , B i and őł is the angle between the vectors when their origins are made
to coincide. Substituting equations (1.5.63) into the equation (1.5.64) we find

## ni AB sin őł = ijk AőĪ xjőĪ B ő≤ xkő≤ . (1.5.65)

In terms of the surface metric we have AB sin őł = őĪő≤ AőĪ B ő≤ so that equation (1.5.65) can be written in the
form
(ni őĪő≤ ‚ąí ijk xjőĪ xkő≤ )AőĪ B ő≤ = 0 (1.5.66)

## which for arbitrary surface vectors implies

1 őĪő≤
ni őĪő≤ = ijk xjőĪ xkő≤ or ni =  ijk xjőĪ xkő≤ . (1.5.67)
2
The equation (1.5.67) defines a unit normal vector to the surface in terms of the tangent vectors to the
coordinate curves. This unit normal vector is related to the covariant derivative of the surface tangents as
144

is now demonstrated. By using the results from equation (1.5.50), the tensor derivative of equation (1.5.59),
with respect to the surface coordinates, produces
   
‚ąā 2 xi i ŌÉ
xiőĪ,ő≤ = + xpőĪ xqő≤ ‚ąí xiŌÉ (1.5.68)
‚ąāuőĪ ‚ąāuő≤ pq g őĪő≤ a

where the subscripts on the Christoffel symbols refer to the metric from which they are calculated. Also the
tensor derivative of the equation (1.5.57) produces the result

## gij xiőĪ,ő≥ xjő≤ + gij xiőĪ xjő≤,ő≥ = aőĪő≤,ő≥ = 0. (1.5.69)

Interchanging the indices őĪ, ő≤, ő≥ cyclically in the equation (1.5.69) one can verify that

## gij xiőĪ,ő≤ xjő≥ = 0. (1.5.70)

The equation (1.5.70) indicates that in terms of the space coordinates, the vector xiőĪ,ő≤ is perpendicular to
the surface tangent vector xiő≥ and so must have the same direction as the unit surface normal ni . Therefore,
there must exist a second order tensor bőĪő≤ such that

## bőĪő≤ ni = xiőĪ,ő≤ . (1.5.71)

By using the relation gij ni nj = 1 we can transform equation (1.5.71) to the form

1 ő≥őī
bőĪő≤ = gij nj xiőĪ,ő≤ =  ijk xiőĪ,ő≤ xjő≥ xkőī . (1.5.72)
2

The second order symmetric tensor bőĪő≤ is called the curvature tensor and the quadratic form

## is called the second fundamental form of the surface.

Consider also the tensor derivative with respect to the surface coordinates of the unit normal vector to
the surface. This derivative is  
‚ąāni i
ni, őĪ = + nj xkőĪ . (1.5.74)
‚ąāuőĪ jk g

Taking the tensor derivative of gij ni nj = 1 with respect to the surface coordinates produces the result
gij ni nj,őĪ = 0 which shows that the vector nj,őĪ is perpendicular to ni and must lie in the tangent plane to the
surface. It can therefore be expressed as a linear combination of the surface tangent vectors xiőĪ and written
in the form
ni,őĪ = ő∑őĪő≤ xiő≤ (1.5.75)

where the coefficients ő∑őĪő≤ can be written in terms of the surface metric components aőĪő≤ and the curvature
components bőĪő≤ as follows. The unit vector ni is normal to the surface so that

## gij ni xjőĪ = 0. (1.5.76)

145

The tensor derivative of this equation with respect to the surface coordinates gives

## gij niő≤ xjőĪ + gij ni xjőĪ,ő≤ = 0. (1.5.77)

Substitute into equation (1.5.77) the relations from equations (1.5.57), (1.5.71) and (1.5.75) and show that

## ő∑ő≤ő≥ = ‚ąíaőĪő≥ bőĪő≤ . (1.5.79)

Now substituting equation (1.5.79) into the equation (1.5.75) produces the Weingarten formula

## ni,őĪ = ‚ąíaő≥ő≤ bő≥őĪ xiő≤ . (1.5.80)

This is a relation for the derivative of the unit normal in terms of the surface metric, curvature tensor and
surface tangents.
A third fundamental form of the surface is given by the quadratic form

## cőĪő≤ = gij ni,őĪ nj,ő≤ . (1.5.82)

By using the Weingarten formula in the equation (1.5.81) one can verify that

## cőĪő≤ = aő≥őī bőĪő≥ bő≤őī . (1.5.83)

Geodesic Coordinates
In a Cartesian coordinate system the metric tensor gij is a constant and consequently the Christoffel
symbols are zero at all points of the space. This is because the Christoffel symbols are dependent upon
the derivatives of the metric tensor which is constant. If the space VN is not Cartesian then the Christoffel
symbols do not vanish at all points of the space. However, it is possible to find a coordinate system where
the Christoffel symbols will all vanish at a given point P of the space. Such coordinates are called geodesic
coordinates of the point P.
Consider a two dimensional surface with surface coordinates uőĪ and surface metric aőĪő≤ . If we transform
to some other two dimensional coordinate system, say uŐĄőĪ with metric aŐĄőĪő≤ , where the two coordinates are
related by transformation equations of the form

uőĪ = uőĪ (uŐĄ 1 , uŐĄ 2 ), őĪ = 1, 2, (1.5.84)
146

then from the transformation equation (1.4.7) we can write, after changing symbols,
   
őī ‚ąāuőĪ őĪ ‚ąāuőī ‚ąāu ‚ąā 2 uőĪ
= + . (1.5.85)
ő≤ ő≥ aŐĄ ‚ąā uŐĄ őī őī  a ‚ąā uŐĄ ő≤ ‚ąā uŐĄ ő≥ ‚ąā uŐĄ ő≤ ‚ąā uŐĄ ő≥
 
őī
This is a relationship between the Christoffel symbols in the two coordinate systems. If ő≤ő≥
vanishes at
aŐĄ
a point P , then for that particular point the equation (1.5.85) reduces to
 
‚ąā 2 uőĪ őĪ ‚ąāuőī ‚ąāu
ő≤ ő≥
= ‚ąí (1.5.86)
‚ąā uŐĄ ‚ąā uŐĄ őī  a ‚ąā uŐĄ ő≤ ‚ąā uŐĄ ő≥

## where all terms are evaluated

 at 
the point P. Conversely, if the equation (1.5.86) is satisfied at the point P,
őī
then the Christoffel symbol ő≤ ő≥ must be zero at this point. Consider the special coordinate transforma-
aŐĄ
tion  
1 őĪ
0 + uŐĄ ‚ąí
uőĪ = uőĪ őĪ
uŐĄ ő≤ uŐĄ őĪ (1.5.87)
2 ő≤ő≥ a
where uőĪ
0 are the surface coordinates of the point P. The point P in the new coordinates is given by
uŐĄ őĪ = 0. We now differentiate the relation (1.5.87) to see if it satisfies the equation (1.5.86). We calculate
the derivatives    
‚ąāuőĪ 1 őĪ 1 őĪ ő≥
= őī őĪ
ŌĄ ‚ąí uŐĄ ő≤
‚ąí uŐĄ őĪ (1.5.88)
‚ąā uŐĄ ŌĄ 2 ő≤ŌĄ a 2 ŌĄő≥ a u =0

and  
‚ąā 2 uőĪ őĪ
=‚ąí (1.5.89)
‚ąā uŐĄ ŌĄ ‚ąā uŐĄ ŌÉ ŌĄ ŌÉ a uőĪ =0
where these derivative are evaluated at uŐĄ őĪ = 0. We find the derivative equations (1.5.88) and (1.5.89) do
satisfy the equation (1.5.86) locally at the point P. Hence, the Christoffel symbols will all be zero at this
particular point. The new coordinates can then be called geodesic coordinates.

## Riemann Christoffel Tensor

Consider the Riemann Christoffel tensor defined by the equation (1.4.33). Various properties of this
tensor are derived in the exercises at the end of this section. We will be particularly interested in the
Riemann Christoffel tensor in a two dimensional space with metric aőĪő≤ and coordinates uőĪ . We find the
Riemann Christoffel tensor has the form
         
‚ąā őī ‚ąā őī ŌĄ őī ŌĄ őī
őī
R. őĪő≤ő≥ = ‚ąí + ‚ąí (1.5.90)
‚ąāuő≤ őĪ ő≥ ‚ąāuő≥ őĪ ő≤ őĪő≥ ő≤ŌĄ őĪő≤ ő≥ŌĄ
where the Christoffel symbols are evaluated with respect to the surface metric. The above tensor has the
associated tensor
RŌÉőĪő≤ő≥ = aŌÉőī R.őīőĪő≤ő≥ (1.5.91)

## The two dimensional alternating tensor is used to define the constant

1 őĪő≤ ő≥őī
K=   RőĪő≤ő≥őī (1.5.93)
4
147

(see example 1.5-1) which is an invariant of the surface and called the Gaussian curvature or total curvature.
In the exercises following this section it is shown that the Riemann Christoffel tensor of the surface can be
expressed in terms of the total curvature and the alternating tensors as

## Consider the second tensor derivative of xrőĪ which is given by

     
‚ąāxrőĪ,ő≤ r őī őī
xrőĪ,ő≤ő≥ = + xrőĪ,ő≤ xnő≥ ‚ąí xrőī,ő≤ ‚ąí xrőĪ,ő≥ (1.5.95)
‚ąāuő≥ mn g őĪő≥ a ő≤ő≥ a

## xrőĪ,ő≤ő≥ ‚ąí xrőĪ,ő≥ő≤ = Rőī.őĪő≤ő≥ xrőī . (1.5.96)

Using the relation (1.5.96) we can now derive some interesting properties relating to the tensors aőĪő≤ , bőĪő≤ ,
cőĪő≤ , RőĪő≤ő≥őī , the mean curvature H and the total curvature K.
Consider the tensor derivative of the equation (1.5.71) which can be written

## xiőĪ,ő≤ő≥ = bőĪő≤,ő≥ ni + bőĪő≤ ni,ő≥ (1.5.97)

where    
‚ąābőĪő≤ ŌÉ ŌÉ
bőĪő≤,ő≥ = ‚ąí bŌÉő≤ ‚ąí bőĪŌÉ . (1.5.98)
‚ąāuőĪ őĪő≥ a ő≤ő≥ a

By using the Weingarten formula, given in equation (1.5.80), the equation (1.5.97) can be expressed in the
form
xiőĪ,ő≤ő≥ = bőĪő≤,ő≥ ni ‚ąí bőĪő≤ aŌĄ ŌÉ bŌĄ ő≥ xiŌÉ (1.5.99)

and by using the equations (1.5.98) and (1.5.99) it can be established that

## xrőĪ,ő≤ő≥ ‚ąí xrőĪ,ő≥ő≤ = (bőĪő≤,ő≥ ‚ąí bőĪő≥,ő≤ )nr ‚ąí aŌĄ őī (bőĪő≤ bŌĄ ő≥ ‚ąí bőĪő≥ bŌĄ ő≤ )xrőī . (1.5.100)

Now by equating the results from the equations (1.5.96) and (1.5.100) we arrive at the relation

## Rőī.őĪő≤ő≥ xrőī = (bőĪő≤,ő≥ ‚ąí bőĪő≥,ő≤ )nr ‚ąí aŌĄ őī (bőĪő≤ bŌĄ ő≥ ‚ąí bőĪő≥ bŌĄ ő≤ )xrőī . (1.5.101)

Multiplying the equation (1.5.101) by nr and using the results from the equation (1.5.76) there results the
Codazzi equations
bőĪő≤,ő≥ ‚ąí bőĪő≥,ő≤ = 0. (1.5.102)

## Multiplying the equation (1.5.101) by grm xm

ŌÉ and simplifying one can derive the Gauss equations of the
surface
RŌÉőĪő≤ő≥ = bőĪő≥ bŌÉő≤ ‚ąí bőĪő≤ bŌÉő≥ . (1.5.103)

By using the Gauss equations (1.5.103) the equation (1.5.94) can be written as

## KŌÉőĪ ő≤ő≥ = bőĪő≥ bŌÉő≤ ‚ąí bőĪő≤ bŌÉő≥ . (1.5.104)

148

Another form of equation (1.5.104) is obtained by using the equation (1.5.83) together with the relation
aőĪő≤ = ‚ąíaŌÉő≥ ŌÉőĪ ő≤ő≥ . It is left as an exercise to verify the resulting form

## Define the quantity

1 ŌÉő≥
H= a bŌÉő≥ (1.5.107)
2
as the mean curvature of the surface, then the equation (1.5.106) can be written in the form

## By multiplying the equation (1.5.108) by duőĪ duő≤ and summing, we find

C ‚ąí 2H B + K A = 0 (1.5.109)

## is a relation connecting the first, second and third fundamental forms.

EXAMPLE 1.5-2
In a two dimensional space the Riemann Christoffel tensor has only one nonzero independent component
R1212 . ( See Exercise 1.5, problem number 21.) Consequently, the equation (1.5.104) can be written in the
‚ąö ‚ąö
form K ae12 ae12 = b22 b11 ‚ąí b21 b12 and solving for the Gaussian curvature K we find

## b22 b11 ‚ąí b12 b21 b R1212

K= = = . (1.5.110)
a11 a22 ‚ąí a12 a21 a a

Surface Curvature
For a surface curve uőĪ = uőĪ (s),őĪ = 1, 2 lying upon a surface xi = xi (u1 , u2 ),i = 1, 2, 3, we have a two
duőĪ
dimensional space embedded in a three dimensional space. Thus, if tőĪ = is a unit tangent vector to
ds
őĪ ő≤
du du
the surface curve then aőĪő≤ = aőĪő≤ tőĪ tő≤ = 1. This same vector can be represented as the unit tangent
ds ds
dxi dxi dxj
vector to the space curve xi = xi (u1 (s), u2 (s)) with T i = . That is we will have gij = gij T i T j = 1.
ds ds ds
The surface vector tőĪ and the space vector T i are related by

‚ąāxi duőĪ
Ti = = xiőĪ tőĪ . (1.5.111)
‚ąāuőĪ ds

The surface vector tőĪ is a unit vector so that aőĪő≤ tőĪ tő≤ = 1. If we differentiate this equation intrinsically with
ő≤
őītőĪ
respect to the parameter s, we find that aőĪő≤ tőĪ őītőīs = 0. This shows that the surface vector őīs is perpendicular
őĪ őĪ
to the surface vector t . Let u denote a unit normal vector in the surface plane which is orthogonal to the
tangent vector tőĪ . The direction of uőĪ is selected such that őĪő≤ tőĪ uő≤ = 1. Therefore, there exists a scalar őļ(g)
such that
őītőĪ
= őļ(g) uőĪ (1.5.112)
őīs
149

őīuőĪ
where őļ(g) is called the geodesic curvature of the curve. In a similar manner it can be shown that őīs
őīuőĪ
is a surface vector orthogonal to tőĪ . Let őīs = őĪtőĪ where őĪ is a scalar constant to be determined. By
differentiating the relation aőĪő≤ tőĪ uő≤ = 0 intrinsically and simplifying we find that őĪ = ‚ąíőļ(g) and therefore

őīuőĪ
= ‚ąíőļ(g) tőĪ . (1.5.113)
őīs

The equations (1.5.112) and (1.5.113) are sometimes referred to as the Frenet-Serret formula for a curve
relative to a surface.
Taking the intrinsic derivative of equation (1.5.111), with respect to the parameter s, we find that

őīT i őītőĪ duő≤ őĪ
= xiőĪ + xiőĪ,ő≤ t . (1.5.114)
őīs őīs ds

Treating the curve as a space curve we use the Frenet formulas (1.5.13). If we treat the curve as a surface
curve, then we use the Frenet formulas (1.5.112) and (1.5.113). In this way the equation (1.5.114) can be
written in the form
őļN i = xiőĪ őļ(g) uőĪ + xiőĪ,ő≤ tő≤ tőĪ . (1.5.115)

## őļN i = őļ(g) ui + bőĪő≤ ni tőĪ tő≤ (1.5.116)

where ui is the space vector counterpart of the surface vector uőĪ . Let őł denote the angle between the surface
normal ni and the principal normal N i , then we have that cos őł = ni N i . Hence, by multiplying the equation
(1.5.116) by ni we obtain
őļ cos őł = bőĪő≤ tőĪ tő≤ . (1.5.117)

Consequently, for all curves on the surface with the same tangent vector tőĪ , the quantity őļ cos őł will remain
constant. This result is known as Meusnier‚Äôs theorem. Note also that őļ cos őł = őļ(n) is the normal component
of the curvature and őļ sin őł = őļ(g) is the geodesic component of the curvature. Therefore, we write the
equation (1.5.117) as
őļ(n) = bőĪő≤ tőĪ tő≤ (1.5.118)

which represents the normal curvature of the surface in the direction tőĪ . The equation (1.5.118) can also be
written in the form
duőĪ duő≤ B
őļ(n) = bőĪő≤ = (1.5.119)
ds ds A
which is a ratio of quadratic forms.
The surface directions for which őļ(n) has a maximum or minimum value is determined from the equation
(1.5.119) which is written as
(bőĪő≤ ‚ąí őļ(n) aőĪő≤ )őĽőĪ őĽő≤ = 0. (1.5.120)

The direction giving a maximum or minimum value to őļ(n) must then satisfy

150

## The expanded form of equation (1.5.122) can be written as

b
őļ2(n) ‚ąí aőĪő≤ bőĪő≤ őļ(n) + =0 (1.5.123)
a

where a = a11 a22 ‚ąí a12 a21 and b = b11 b22 ‚ąí b12 b21 . Using the definition given in equation (1.5.107) and using
the result from equation (1.5.110), the equation (1.5.123) can be expressed in the form

## őļ2(n) ‚ąí 2H őļ(n) + K = 0. (1.5.124)

The roots őļ(1) and őļ(2) of the equation (1.5.124) then satisfy the relations

1
H= (őļ(1) + őļ(2) ) (1.5.125)
2

and
K = őļ(1) őļ(2) . (1.5.126)

Here H is the mean value of the principal curvatures and K is the Gaussian or total curvature which is the
product of the principal curvatures. It is readily verified that

Eg ‚ąí 2f F + eG eg ‚ąí f 2
H= and K =
2(EG ‚ąí F 2 ) EG ‚ąí F 2

are invariants obtained from the surface metric and curvature tensor.

Relativity
Sir Isaac Newton and Albert Einstein viewed the world differently when it came to describing gravity and
the motion of the planets. In this brief introduction to relativity we will compare the Newtonian equations
with the relativistic equations in describing planetary motion. We begin with an examination of Newtonian
systems.
Newton‚Äôs viewpoint of planetary motion is a multiple bodied problem, but for simplicity we consider
only a two body problem, say the sun and some planet where the motion takes place in a plane. Newton‚Äôs
law of gravitation states that two masses m and M are attracted toward each other with a force of magnitude
GmM
ŌĀ2 , where G is a constant, ŌĀ is the distance between the masses, m is the mass of the planet and M is the
mass of the sun. One can construct an x, y plane containing the two masses with the origin located at the
eŌĀ = cos ŌÜ b
center of mass of the sun. Let b e1 + sin ŌÜ b
e2 denote a unit vector at the origin of this coordinate
system and pointing in the direction of the mass m. The vector force of attraction of mass M on mass m is
given by the relation
‚ąíGmM
F~ = b
eŌĀ . (1.5.127)
ŌĀ2
151

## Figure 1.5-2. Parabolic and elliptic conic sections

The equation of motion of mass m with respect to mass M is obtained from Newton‚Äôs second law. Let
~ = ŌĀb
ŌĀ eŌĀ denote the position vector of mass m with respect to the origin. Newton‚Äôs second law can then be
written in any of the forms

‚ąíGmM d2 ŌĀ
~ ~
dV ‚ąíGmM
F~ = b
e ŌĀ = m = m = ŌĀ~ (1.5.128)
ŌĀ2 dt2 dt ŌĀ3

and from this equation we can show that the motion of the mass m can be described as a conic section.
Recall that a conic section is defined as a locus of points p(x, y) such that the distance of p from a fixed
point (or points), called a focus (foci), is proportional to the distance of the point p from a fixed line, called
a directrix, that does not contain the fixed point. The constant of proportionality is called the eccentricity
and is denoted by the symbol . For  = 1 a parabola results; for 0 ‚Č§  ‚Č§ 1 an ellipse results; for  > 1 a
hyperbola results; and if  = 0 the conic section is a circle.
FP
With reference to figure 1.5-2, a conic section is defined in terms of the ratio PD
=  where F P = ŌĀ and
P D = 2q ‚ąí ŌĀ cos ŌÜ. From the  ratio we solve for ŌĀ and obtain the polar representation for the conic section

p
ŌĀ= (1.5.129)
1 +  cos ŌÜ
152

where p = 2q and the angle ŌÜ is known as the true anomaly associated with the orbit. The quantity p is
ŌÄ
called the semi-parameter of the conic section. (Note that when ŌÜ = 2, then ŌĀ = p.) A more general form
of the above equation is

p 1
ŌĀ= or u = = A[1 +  cos(ŌÜ ‚ąí ŌÜ0 )], (1.5.130)
1 +  cos(ŌÜ ‚ąí ŌÜ0 ) ŌĀ

where ŌÜ0 is an arbitrary starting anomaly. An additional symbol a, known as the semi-major axes of an
elliptical orbit can be introduced where q, p, , a are related by

p
= q = a(1 ‚ąí ) or p = a(1 ‚ąí 2 ). (1.5.131)
1+

To show that the equation (1.5.128) produces a conic section for the motion of mass m with respect to
mass M we will show that one form of the solution of equation (1.5.128) is given by the equation (1.5.129).
To verify this we use the following vector identities:

~√ó b
ŌĀ eŌĀ =0
 
d d~
ŌĀ d2 ŌĀ
~
~√ó
ŌĀ ŌĀ√ó 2
=~
dt dt dt
dbeŌĀ (1.5.132)
b
eŌĀ ¬∑ =0
 dt
dbeŌĀ dbeŌĀ
eŌĀ √ó b
b eŌĀ √ó =‚ąí .
dt dt

## From the equation (1.5.128) we find that

 
d d~
ŌĀ d2 ŌĀ
~ GM
ŌĀ~√ó =ŌĀ ~√ó b
~√ó 2 =‚ąí 2 ŌĀ eŌĀ = ~0 (1.5.133)
dt dt dt ŌĀ

## so that an integration of equation (1.5.133) produces

d~
ŌĀ ~
~√ó
ŌĀ = h = constant. (1.5.134)
dt

The quantity H ~ = ŌĀ ~ = ŌĀ
~ √ó mV ~ √ó m d~
ŌĀ ~
dt is the angular momentum of the mass m so that the quantity h
represents the angular momentum per unit mass. The equation (1.5.134) tells us that ~h is a constant for our
two body system. Note that because ~h is constant we have
 
d  ~ ~  dV
~ GM d~
ŌĀ
V √óh = √óh=‚ąí 2 b
~ eŌĀ √ó ŌĀ ~√ó
dt dt ŌĀ dt
GM dbeŌĀ dŌĀ
=‚ąí 2 b ŌĀb
eŌĀ √ó [~ eŌĀ √ó (ŌĀ + b
eŌĀ )]
ŌĀ dt dt
GM dbeŌĀ 2 dbeŌĀ
=‚ąí 2 b
eŌĀ √ó ( b
eŌĀ √ó )ŌĀ = GM
ŌĀ dt dt

~ √ó ~h = GM b
V ~
eŌĀ + C
153

## ~ is a vector constant of integration. The triple scalar product formula gives us

where C
d~
ŌĀ
ŌĀ~ ¬∑ (V ŌĀ √ó ) = h2 = GM ŌĀ
~ √ó ~h) = ~h ¬∑ (~ ~¬∑ b ~
eŌĀ + ŌĀ~ ¬∑ C
dt
or
h2 = GM ŌĀ + CŌĀ cos ŌÜ (1.5.135)
~ and ŌĀ~. From the equation (1.5.135) we find that
where ŌÜ is the angle between the vectors C
p
ŌĀ= (1.5.136)
1 +  cos ŌÜ

where p = h2 /GM and  = C/GM. This result is known as Kepler‚Äôs first law and implies that when  < 1
the mass m describes an elliptical orbit with the sun at one focus.
We present now an alternate derivation of equation (1.5.130) for later use. From the equation (1.5.128)
we have  
ŌĀ d2 ŌĀ
d~ ~ d d~
ŌĀ d~ŌĀ GM d~
ŌĀ GM d
2 ¬∑ 2 = ¬∑ = ‚ąí2 3
~¬∑
ŌĀ =‚ąí 3 ŌĀ¬∑ŌĀ
(~ ~) . (1.5.137)
dt dt dt dt dt ŌĀ dt ŌĀ dt
Consider the equation (1.5.137) in spherical coordinates ŌĀ, őł, ŌÜ. The tensor velocity components are V 1 = dŌĀ
dt ,
V2 = dőł
dt , V3 = dŌÜ
dt and the physical components of velocity are given by VŌĀ = dŌĀ
dt , Vőł = ŌĀ dőł
dt , VŌÜ = ŌĀ sin őł dŌÜ
dt
so that the velocity can be written
d~
ŌĀ dŌĀ dőł dŌÜ
V~ = = b
eŌĀ + ŌĀ b eőł + ŌĀ sin őł b
eŌÜ . (1.5.138)
dt dt dt dt
Substituting equation (1.5.138) into equation (1.5.137) gives the result
"   2  2 #  
2
d dŌĀ 2 dőł 2 2 dŌÜ GM d 2 2GM dŌĀ d 1
+ŌĀ + ŌĀ sin őł =‚ąí 3 (ŌĀ ) = ‚ąí 2 = 2GM
dt dt dt dt ŌĀ dt ŌĀ dt dt ŌĀ

## which can be integrated directly to give

 2  2  2
dŌĀ dőł dŌÜ 2GM
+ ŌĀ2 + ŌĀ2 sin2 őł = ‚ąíE (1.5.139)
dt dt dt ŌĀ

## where ‚ąíE is a constant of integration. In the special case of a planar orbit we set őł = ŌÄ

2 constant so that
the equation (1.5.139) reduces to
 2  2
dŌĀ 2 dŌÜ 2GM
+ŌĀ = ‚ąíE
dt dt ŌĀ
 2  2 (1.5.140)
dŌĀ dŌÜ 2 dŌÜ 2GM
+ŌĀ = ‚ąí E.
dŌÜ dt dt ŌĀ

## Also for this special case of planar motion we have

d~
ŌĀ dŌÜ
|~
ŌĀ√ó | = ŌĀ2 = h. (1.5.141)
dt dt
dŌÜ
By eliminating dt from the equation (1.5.140) we obtain the result
 2
dŌĀ 2GM 3 E
+ ŌĀ2 = ŌĀ ‚ąí 2 ŌĀ4 . (1.5.142)
dŌÜ h2 h
154

## Figure 1.5-3. Relative motion of two inertial systems.

1
The substitution ŌĀ = u can be used to represent the equation (1.5.142) in the form
 2
du 2GM E
+ u2 ‚ąí 2
u+ 2 =0 (1.5.143)
dŌÜ h h

which is a form we will return to later in this section. Note that we can separate the variables in equations
(1.5.142) or (1.5.143). The results can then be integrate to produce the equation (1.5.130).
Newton also considered the relative motion of two inertial systems, say S and S. Consider two such
systems as depicted in the figure 1.5-3 where the S system is moving in the x‚ąídirection with speed v relative
to the system S.
For a Newtonian system, if at time t = 0 we have clocks in both systems which coincide, than at time t
a point P (x, y, z) in the S system can be described by the transformation equations

x =x + vt x =x ‚ąí vt
y =y y =y
or (1.5.144)
z =z z =z
t =t t =t.

These are the transformation equation of Newton‚Äôs relativity sometimes referred to as a Galilean transfor-
mation.
Before Einstein the principle of relativity required that velocities be additive and obey Galileo‚Äôs velocity
VP/R = VP/Q + VQ/R . (1.5.145)
155

That is, the velocity of P with respect to R equals the velocity of P with respect to Q plus the velocity of Q
with respect to R. For example, a person (P ) running north at 3 km/hr on a train (Q) moving north at 60
km/hr with respect to the ground (R) has a velocity of 63 km/hr with respect to the ground. What happens
when (P ) is a light wave moving on a train (Q) which is moving with velocity V relative to the ground? Are
the velocities still additive? This type of question led to the famous Michelson-Morley experiment which
has been labeled as the starting point for relativity. Einstein‚Äôs answer to the above question was ‚ÄĚNO‚ÄĚ and
required that VP/R = VP/Q = c =speed of light be a universal constant.
In contrast to the Newtonian equations, Einstein considered the motion of light from the origins 0 and
0 of the systems S and S. If the S system moves with velocity v relative to the S system and at time t = 0
a light signal is sent from the S system to the S system, then this light signal will move out in a spherical
wave front and lie on the sphere
x2 + y 2 + z 2 = c2 t2 (1.5.146)

where c is the speed of light. Conversely, if a light signal is sent out from the S system at time t = 0, it will
lie on the spherical wave front
2
x2 + y 2 + z 2 = c2 t . (1.5.147)

Observe that the Newtonian equations (1.5.144) do not satisfy the equations (1.5.146) and (1.5.147) identi-
cally. If y = y and z = z then the space variables (x, x) and time variables (t, t) must somehow be related.
Einstein suggested the following transformation equations between these variables

## from which we obtain the ratios

dx ő≥(dx ‚ąí v dt) 1 v
= or v = ő≥(1 ‚ąí dx
). (1.5.150)
ő≥(dx + v dt) dx ő≥(1 + dx ) dt
dt

dx dx
When = = c, the speed of light, the equation (1.5.150) requires that
dt dt
v 2 ‚ąí1 v 2 ‚ąí1/2
ő≥ 2 = (1 ‚ąí ) or ő≥ = (1 ‚ąí ) . (1.5.151)
c2 c2
From the equations (1.5.148) we eliminate x and find
v
t = ő≥(t ‚ąí x). (1.5.152)
c2
We can now replace the Newtonian equations (1.5.144) by the relativistic transformation equations

## x =ő≥(x + vt) x =ő≥(x ‚ąí vt)

y =y y =y
or (1.5.153)
z =z z =z
v v
t =ő≥(t + x) t =ő≥(t ‚ąí x)
c2 c2
156

where ő≥ is given by equation (1.5.151). These equations are also known as the Lorentz transformation.
v
Note that for v << c, then 2 ‚Čą 0, ő≥ ‚Čą 1 , then the equations (1.5.153) closely approximate the equations
c
(1.5.144). The equations (1.5.153) also satisfy the equations (1.5.146) and (1.5.147) identically as can be
readily verified by substitution. Further, by using chain rule differentiation we obtain from the relations
(1.5.148) that
dx
dx dt
+v
= dx
. (1.5.154)
dt v
1+ dt
c c

The equation (1.5.154) is the Einstein relative velocity addition rule which replaces the previous Newtonian
rule given by equation (1.5.145). We can rewrite equation (1.5.154) in the notation of equation (1.5.145) as

VP/Q + VQ/R
VP/R = VP/Q VQ/R
. (1.5.155)
1+ c c

Observe that when VP/Q << c and VQ/R << c then equation (1.5.155) approximates closely the equation
(1.5.145). Also as VP/Q and VQ/R approach the speed of light we have

VP/Q + VQ/R
lim VP/Q VQ/R
=c (1.5.156)
VP/Q ‚ÜíC
VQ/R ‚ÜíC
1+ c c

which agrees with Einstein‚Äôs hypothesis that the speed of light is an invariant.
Let us return now to the viewpoint of what gravitation is. Einstein thought of space and time as being
related and viewed the motion of the planets as being that of geodesic paths in a space-time continuum.
Recall the equations of geodesics are given by
 
d2 xi i dxj dxk
+ = 0, (1.5.157)
ds2 jk ds ds

where s is arc length. These equations are to be associated with a 4-dimensional space-time metric gij
where the indices i, j take on the values 1, 2, 3, 4 and the xi are generalized coordinates. Einstein asked
the question, ‚ÄĚCan one introduce a space-time metric gij such that the equations (1.5.157) can somehow
d2 ŌĀ
~ GM
reproduce the law of gravitational attraction dt2 + ŌĀ3 ŌĀ
~ = 0?‚ÄĚ Then the motion of the planets can be
viewed as optimized motion in a space-time continuum where the metrices of the space simulate the law of
gravitational attraction. Einstein thought that this motion should be related to the curvature of the space
which can be obtained from the Riemann-Christoffel tensor Rijkl . The metric we desire gij , i, j = 1, 2, 3, 4
has 16 components. The conjugate metric tensor g ij is defined such that g ij gjk = őīki and an element of
arc length squared is given by ds2 = gij dxi dxj . Einstein thought that the metrices should come from the
Riemann-Christoffel curvature tensor which, for n = 4 has 256 components, but only 20 of these are linearly
independent. This seems like a large number of equations from which to obtain the law of gravitational
attraction and so Einstein considered the contracted tensor
         
‚ąā n ‚ąā n m n m n
Gij = Rtijt = ‚ąí + ‚ąí . (1.5.158)
‚ąāxj i n ‚ąāxn i j in mj ij mn

## ds2 = ‚ąí(dŌĀ)2 ‚ąí ŌĀ2 (dőł)2 ‚ąí ŌĀ2 sin2 őł(dŌÜ)2 + c2 (dt)2

157

where g11 = ‚ąí1, g22 = ‚ąíŌĀ2 , g33 = ‚ąíŌĀ2 sin2 őł, g44 = c2 and gij = 0 for i 6= j. The negative signs are
2
introduced so that ds
dt = c2 ‚ąí v 2 is positive when v < c and the velocity is not greater than c. However,
this metric will not work since the curvature tensor vanishes. The spherical symmetry of the problem suggest
that g11 and g44 change while g22 and g33 remain fixed. Let (x1 , x2 , x3 , x4 ) = (ŌĀ, őł, ŌÜ, t) and assume

## g11 = ‚ąíeu , g22 = ‚ąíŌĀ2 , g33 = ‚ąíŌĀ2 sin2 őł, g44 = ev (1.5.159)

where u and v are unknown functions of ŌĀ to be determined. This gives the conjugate metric tensor

‚ąí1 ‚ąí1
g 11 = ‚ąíe‚ąíu , g 22 = , g 33 = , g 44 = e‚ąív (1.5.160)
ŌĀ2 ŌĀ2 sin2 őł

## together with the nonzero Christoffel symbols

   
1 1 du 3 1
=   =
11 2 dŌĀ 2 1 13 ŌĀ
  =    
1 12 ŌĀ 3 cos őł 4 1 dv
= ‚ąí ŌĀe‚ąíu   = =
22 2 1 23 sin őł 14 2 dŌĀ
  =     (1.5.162)
1 21 ŌĀ 3 1 4 1 dv
= ‚ąí ŌĀe‚ąíu sin2 őł   = = .
33 2 31 ŌĀ 41 2 dŌĀ
  = ‚ąí sin őł cos őł  
1 1 dv 33 3 cos őł
= ev‚ąíu =
44 2 dr 32 sin őł

The equation (1.5.158) is used to calculate the nonzero Gij and we find that
 2
1 d2 v 1 dv 1 du dv 1 du
G11 = 2
+ ‚ąí ‚ąí
2 dŌĀ 4 dŌĀ 4 dŌĀ dŌĀ ŌĀ dŌĀ
 
1 dv 1 du
G22 =e‚ąíu 1 + ŌĀ ‚ąí ŌĀ ‚ąí eu
2 dŌĀ 2 dŌĀ
  (1.5.163)
1 dv 1 du
G33 =e‚ąíu 1 + ŌĀ ‚ąí ŌĀ ‚ąí eu sin2 őł
2 dŌĀ 2 dŌĀ
 2 !
2
1 d v 1 du dv 1 dv 1 dv
G44 = ‚ąí ev‚ąíu ‚ąí + +
2 dŌĀ2 4 dŌĀ dŌĀ 4 dŌĀ ŌĀ dŌĀ

and Gij = 0 for i 6= j. The assumption that Gij = 0 for all i, j leads to the differential equations
 2
d2 v 1 dv
1 du dv 2 du
+ ‚ąí ‚ąí
=0
dŌĀ2 2 dŌĀ
2 dŌĀ dŌĀ ŌĀ dŌĀ
1 dv 1 du
1+ ŌĀ ‚ąí ŌĀ ‚ąí eu =0 (1.5.164)
2 dŌĀ 2 dŌĀ
 2
d2 v 1 dv 1 du dv 2 dv
2
+ ‚ąí + =0.
dŌĀ 2 dŌĀ 2 dŌĀ dŌĀ ŌĀ dŌĀ
158

## Subtracting the first equation from the third equation gives

du dv
+ =0 or u + v = c1 = constant. (1.5.165)
dŌĀ dŌĀ

## The second equation in (1.5.164) then becomes

du
ŌĀ = 1 ‚ąí eu (1.5.166)
dŌĀ

Separate the variables in equation (1.5.166) and integrate to obtain the result

1
eu = (1.5.167)
1 ‚ąí cŌĀ2

## where c2 is a constant of integration and consequently

 
c1 ‚ąíu c2
v
e =e =e 1‚ąí
c1
. (1.5.168)
ŌĀ

The constant c1 is selected such that g44 approaches c2 as ŌĀ increases without bound. This produces the
metrices
‚ąí1 c2
g11 = , g22 = ‚ąíŌĀ2 , g33 = ‚ąíŌĀ2 sin2 őł, g44 = c2 (1 ‚ąí ) (1.5.169)
1 ‚ąí cŌĀ2 ŌĀ
where c2 is a constant still to be determined. The metrices given by equation (1.5.169) are now used to
expand the equations (1.5.157) representing the geodesics in this four dimensional space. The differential
equations representing the geodesics are found to be
 2  2  2  2
d2 ŌĀ 1 du dŌĀ dőł dŌÜ 1 v‚ąíu dv dt
+ ‚ąí ŌĀe‚ąíu ‚ąí ŌĀe‚ąíu sin2 őł + e =0 (1.5.170)
ds2 2 dŌĀ ds ds ds 2 dŌĀ ds
 2
d2 őł 2 dőł dŌĀ dŌÜ
+ ‚ąí sin őł cos őł =0 (1.5.171)
ds2 ŌĀ ds ds ds
d2 ŌÜ 2 dŌÜ dŌĀ cos őł dŌÜ dőł
+ +2 =0 (1.5.172)
ds2 ŌĀ ds ds sin őł ds ds
d2 t dv dt dŌĀ
+ = 0. (1.5.173)
ds2 dŌĀ ds ds

ŌÄ
The equation (1.5.171) is identically satisfied if we examine planar orbits where őł = 2 is a constant. This
value of őł also simplifies the equations (1.5.170) and (1.5.172). The equation (1.5.172) becomes an exact
differential equation  
d dŌÜ dŌÜ
ŌĀ2 =0 or ŌĀ2 = c4 , (1.5.174)
ds ds ds
and the equation (1.5.173) also becomes an exact differential
 
d dt v dt v
e =0 or e = c5 , (1.5.175)
ds ds ds

where c4 and c5 are constants of integration. This leaves the equation (1.5.170) which determines ŌĀ. Substi-
tuting the results from equations (1.5.174) and (1.5.175), together with the relation (1.5.161), the equation
(1.5.170) reduces to
d2 ŌĀ c2 c2 c24 c2 c24
+ + ‚ąí (1 ‚ąí ) = 0. (1.5.176)
ds2 2ŌĀ2 2ŌĀ4 ŌĀ ŌĀ3
159

## By the chain rule we have

 2  2  
d2 ŌĀ d2 ŌĀ dŌÜ dŌĀ d2 ŌÜ d2 ŌĀ c24 dŌĀ ‚ąí2c24
2
= 2 + 2
= +
ds dŌÜ ds dŌÜ ds dŌÜ2 ŌĀ4 dŌÜ ŌĀ5

## and so equation (1.5.176) can be written in the form

 2  
d2 ŌĀ 2 dŌĀ c2 ŌĀ 2 c2 c2
‚ąí + + ‚ąí 1‚ąí ŌĀ = 0. (1.5.177)
dŌÜ 2 ŌĀ dŌÜ 2 c24 2 ŌĀ
1
The substitution ŌĀ = u reduces the equation (1.5.177) to the form

d2 u c2 3
+ u ‚ąí 2 = c2 u 2 . (1.5.178)
dŌÜ2 2c4 2
du
Multiply the equation (1.5.178) by 2 dŌÜ and integrate with respect to ŌÜ to obtain
 2
du c2
+ u2 ‚ąí u = c2 u 3 + c 6 . (1.5.179)
dŌÜ c24

where c6 is a constant of integration. To determine the constant c6 we write the equation (1.5.161) in the
ŌÄ
special case őł = 2 and use the substitutions from the equations (1.5.174) and (1.5.175) to obtain
 2  2  2  2
dŌĀ dŌĀ dŌÜ 2 dŌÜ dt
e u
=e u
=1‚ąíŌĀ +e v
ds dŌÜ ds ds ds
or  2    
dŌĀ c2 c2 c2 ŌĀ 4
+ 1‚ąí ŌĀ2 + 1 ‚ąí ‚ąí 52 = 0. (1.5.180)
dŌÜ ŌĀ ŌĀ c c24
1
The substitution ŌĀ = u reduces the equation (1.5.180) to the form
 2
du 1 c2 c2
+ u 2 ‚ąí c2 u 3 + 2 ‚ąí 2 u ‚ąí 2 5 2 = 0. (1.5.181)
dŌÜ c4 c4 c c4

 2 
c5 1
c6 = ‚ąí 1
c2 c24

## so that the equation (1.5.179) takes on the form

 2  
du c2
2 c25 1
+ u ‚ąí 2u + 1 ‚ąí 2 = c2 u 3 (1.5.182)
dŌÜ c4 c c24

Now we can compare our relativistic equation (1.5.182) with our Newtonian equation (1.5.143). In order
that the two equations almost agree we select the constants c2 , c4 , c5 so that
c2
c2 2GM 1 ‚ąí c52 E
2 = and 2 = 2. (1.5.183)
c4 h2 c4 h

The equations (1.5.183) are only two equations in three unknowns and so we use the additional equation

dŌÜ dŌÜ ds
lim ŌĀ2 = lim ŌĀ2 =h (1.5.184)
ŌĀ‚Üí‚ąě dt ŌĀ‚Üí‚ąě ds dt
160

which is obtained from equation (1.5.141). Substituting equations (1.5.174) and (1.5.175) into equation
(1.5.184), rearranging terms and taking the limit we find that

c4 c2
= h. (1.5.185)
c5

## From equations (1.5.183) and (1.5.185) we obtain the results that

 
c2 2GM 1 h
c25 = , c2 = , c4 = p (1.5.186)
1 + cE2 c2 1 + E/c2 c 1 + E/c2

These values substituted into equation (1.5.181) produce the differential equation
 2  
du 22GM E 2GM 1
+u ‚ąí 2
u+ 2 = u3 . (1.5.187)
dŌÜ h h c2 1 + E/c2
c2 2GM 2GM 1
Let őĪ = c24
= h2 and ő≤ = c2 = c2 ( 1+E/c2 ) then the differential equation (1.5.178) can be written as

d2 u őĪ 3
+ u ‚ąí = ő≤u2 . (1.5.188)
dŌÜ2 2 2

## We know the solution to equation (1.5.143) is given by

1
u= = A(1 +  cos(ŌÜ ‚ąí ŌÜ0 )) (1.5.189)
ŌĀ

and so we assume a solution to equation (1.5.188) of this same general form. We know that A is small and so
we make the assumption that the solution of equation (1.5.188) given by equation (1.5.189) is such that ŌÜ0 is
approximately constant and varies slowly as a function of AŌÜ. Observe that if ŌÜ0 = ŌÜ0 (AŌÜ), then dŌÜ0
dŌÜ = ŌÜ00 A
d 2 ŌÜ0
and dŌÜ2 = ŌÜ000 A2 , where primes denote differentiation with respect to the argument of the function. (i.e.
AŌÜ for this problem.) The derivatives of equation (1.5.189) produce

du
= ‚ąí A sin(ŌÜ ‚ąí ŌÜ0 )(1 ‚ąí ŌÜ00 A)
dŌÜ
d2 u
=A3 sin(ŌÜ ‚ąí ŌÜ0 )ŌÜ000 ‚ąí A cos(ŌÜ ‚ąí ŌÜ0 )(1 ‚ąí 2AŌÜ00 + A2 (ŌÜ00 )2 )
dŌÜ2
= ‚ąí A cos(ŌÜ ‚ąí ŌÜ0 ) + 2A2 ŌÜ00 cos(ŌÜ ‚ąí ŌÜ0 ) + O(A3 ).

Substituting these derivatives into the differential equation (1.5.188) produces the equations

őĪ 3ő≤ 
2A2 ŌÜ00 cos(ŌÜ ‚ąí ŌÜ0 ) + A ‚ąí = A2 + 2A2 cos(ŌÜ ‚ąí ŌÜ0 ) + 2 A2 cos2 (ŌÜ ‚ąí ŌÜ0 ) + O(A3 ).
2 2

Now A is small so that terms O(A3 ) can be neglected. Equating the constant terms and the coefficient of
the cos(ŌÜ ‚ąí ŌÜ0 ) terms we obtain the equations

őĪ 3ő≤ 2 3ő≤ 2 2
A‚ąí = A 2A2 ŌÜ00 = 3ő≤A2 +  A cos(ŌÜ ‚ąí ŌÜ0 ).
2 2 2

Treating ŌÜ0 as essentially constant, the above system has the approximate solutions

őĪ 3ő≤ 3ő≤
A‚Čą ŌÜ0 ‚Čą AŌÜ + A sin(ŌÜ ‚ąí ŌÜ0 ) (1.5.190)
2 2 4
161

The solutions given by equations (1.5.190) tells us that ŌÜ0 varies slowly with time. For  less than 1, the
elliptical motion is affected by this change in ŌÜ0 . It causes the semi-major axis of the ellipse to slowly rotate
dŌÜ0
at a rate given by dt . Using the following values for the planet Mercury

## G =6.67(10‚ąí8) dyne cm2 /g2

M =1.99(1033 ) g
a =5.78(1012 ) cm
 =0.206
c =3(1010 ) cm/sec (1.5.191)
2GM
ő≤ ‚Čą 2 = 2.95(105) cm
pc
h ‚Čą GM a(1 ‚ąí 2 ) = 2.71(1019) cm2 /sec
 1/2
dŌÜ GM
‚Čą sec‚ąí1 Kepler‚Äôs third law
dt a3

## we calculate the slow rate of rotation of the semi-major axis to be approximately

 2  1/2
dŌÜ0 dŌÜ0 dŌÜ 3 dŌÜ GM GM
= ‚Čą ő≤A ‚Čą3 =6.628(10‚ąí14) rad/sec
dt dŌÜ dt 2 dt ch a3 (1.5.192)
=43.01 seconds of arc per century.

This slow variation in Mercury‚Äôs semi-major axis has been observed and measured and is in agreement with
the above value. Newtonian mechanics could not account for the changes in Mercury‚Äôs semi-major axis, but
Einstein‚Äôs theory of relativity does give this prediction. The resulting solution of equation (1.5.188) can be
viewed as being caused by the curvature of the space-time continuum.
The contracted curvature tensor Gij set equal to zero is just one of many conditions that can be assumed
in order to arrive at a metric for the space-time continuum. Any assumption on the value of Gij relates to
imposing some kind of curvature on the space. Within the large expanse of our universe only our imaginations
limit us as to how space, time and matter interact. You can also imagine the existence of other tensor metrics
in higher dimensional spaces where the geodesics within the space-time continuum give rise to the motion
of other physical quantities.
This short introduction to relativity is concluded with a quote from the NASA News@hg.nasa.gov news
release, spring 1998, Release:98-51. ‚ÄúAn international team of NASA and university researchers has found
the first direct evidence of a phenomenon predicted 80 years ago using Einstein‚Äôs theory of general relativity‚Äď
that the Earth is dragging space and time around itself as it rotates.‚ÄĚThe news release explains that the
effect is known as frame dragging and goes on to say ‚ÄúFrame dragging is like what happens if a bowling
ball spins in a thick fluid such as molasses. As the ball spins, it pulls the molasses around itself. Anything
stuck in the molasses will also move around the ball. Similarly, as the Earth rotates it pulls space-time in
its vicinity around itself. This will shift the orbits of satellites near the Earth.‚ÄĚThis research is reported in
the journal Science.
162

EXERCISE 1.5
~
~ and ŌĄ = őīN~ ¬∑ B.
I 1. Let őļ = ¬∑N
őīT
őīs őīs
~ Assume in turn that each of the intrinsic derivatives of T~ , N
~,B
~ are
some linear combination of T~ , N
~,B ~ and hence derive the Frenet-Serret formulas of differential geometry.

I 2. Determine the given surfaces. Describe and sketch the curvilinear coordinates upon each surface.
2uv 2 2u2 v
e1 + v b
(a) ~r(u, v) = u b e2 (b) ~r(u, v) = u cos v b
e1 + u sin v b
e2 (c) ~r(u, v) = b
e 1 + b
e2 .
u2 + v 2 u2 + v 2
I 3. Determine the given surfaces and describe the curvilinear coordinates upon the surface. Use some
graphics package to plot the surface and illustrate the coordinate curves on the surface. Find element of
area dS in terms of u and v.
e1 + b sin u sin v b
(a) ~r(u, v) = a sin u cos v b e2 + c cos u b
e3 a, b, c constants 0 ‚Č§ u, v ‚Č§ 2ŌÄ
u u u
(b) ~r(u, v) = (4 + v sin ) cos u b e1 + (4 + v sin ) sin u b e2 + v cos b e3 ‚ąí 1 ‚Č§ v ‚Č§ 1, 0 ‚Č§ u ‚Č§ 2ŌÄ
2 2 2
e1 + bu sin v b
(c) ~r(u, v) = au cos v b e2 + cu be3
(d) ~r(u, v) = u cos v b
e1 + u sin v b
e2 + őĪv b
e3 őĪ constant
e1 + b sin v b
(e) ~r(u, v) = a cos v b e2 + u b
e3 a, b constant
2
e2 + u b
e1 + u sin v b
(f ) ~r(u, v) = u cos v b e3
 
E F
I 4. Consider a two dimensional space with metric tensor (aőĪő≤ ) = . Assume that the surface is
F G
described by equations of the form y i = y i (u, v) and that any point on the surface is given by the position
vector ~r = ~r(u, v) = y i b
ei . Show that the metrices E, F, G are functions of the parameters u, v and are given
by
‚ąā~r ‚ąā~r
E = ~ru ¬∑ ~ru , F = ~ru ¬∑ ~rv , G = ~rv ¬∑ ~rv where ~ru = and ~rv = .
‚ąāu ‚ąāv
I 5. For the metric given in problem 4 show that the Christoffel symbols of the first kind are given by
[1 1, 1] = ~ru ¬∑ ~ruu [1 2, 1] = [2 1, 1] = ~ru ¬∑ ~ruv [2 2, 1] = ~ru ¬∑ ~rvv
[1 1, 2] = ~rv ¬∑ ~ruu [1 2, 2] = [2 1, 2] = ~rv ¬∑ ~ruv [2 2, 2] = ~rv ¬∑ ~rvv
‚ąā 2~r ‚ąā~r
which can be represented [őĪ ő≤, ő≥] = ¬∑ , őĪ, ő≤, ő≥ = 1, 2.
‚ąāu ‚ąāu ‚ąāuő≥
őĪ ő≤

I 6. Show that the results in problem 5 can also be written in the form
1 1 1
[1 1, 1] = Eu Ev
[1 2, 1] = [2 1, 1] = [2 2, 1] = Fv ‚ąí Gu
2 2 2
1 1 1
[1 1, 2] = Fu ‚ąí Ev [1 2, 2] = [2 1, 2] = Gu [2 2, 2] = Gv
2 2 2
where the subscripts indicate partial differentiation.
I 7. For the metricgivenin problem 4, show that the Christoffel symbols of the second kind can be
ő≥
expressed in the form = aő≥őī [őĪ ő≤, őī], őĪ, ő≤, ő≥ = 1, 2 and produce the results
őĪő≤
       
1 GEu ‚ąí 2F Fu + F Ev 1 1 GEv ‚ąí F Gu 2 2EFu ‚ąí EEv ‚ąí F Eu
= = = =
11 2(EG ‚ąí F 2 ) 12 21 2(EG ‚ąí F 2 ) 11 2(EG ‚ąí F 2 )
       
1 2GFv ‚ąí GGu ‚ąí F Gv 2 2 EGu ‚ąí F Ev 2 EGv ‚ąí 2F Fv + F Gu
= 2
= = 2
=
22 2(EG ‚ąí F ) 12 21 2(EG ‚ąí F ) 22 2(EG ‚ąí F 2 )
where the subscripts indicate partial differentiation.
163

## I 8. Derive the Gauss equations by assuming that

b,
~ruu = c1~ru + c2~rv + c3 n b,
~ruv = c4~ru + c5~rv + c6 n b
~rvv = c7~ru + c8~rv + c9 n

## where c1 , . . . , c9 are constants

 determined by  taking dot products
 of  the above
 vectors
 with the vectors ~ru , ~rv ,
1 2 1 2
and nb. Show that c1 = , c2 = , c3 = e, c4 = , c5 = , c6 = f,
   1 1 11 12 12  
1 2 ‚ąā 2~r ő≥ ‚ąā~r
c7 = , c8 = , c9 = g Show the Gauss equations can be written őĪ ő≤
= + bőĪő≤ n b.
22 22 ‚ąāu ‚ąāu őĪ ő≤ ‚ąāuő≥
I 9. Derive the Weingarten equations

b u = c1~ru + c2~rv
n ~ru = c‚ąó1 n
bu + c‚ąó2 n
bv
and
b v = c3~ru + c4~rv
n ~rv = c‚ąó3 n
bu + c‚ąó4 n
bv

and show
f F ‚ąí eG gF ‚ąí f G f F ‚ąí gE f G ‚ąí gF
c1 = c3 = c‚ąó1 = c‚ąó3 =
EG ‚ąí F 2 EG ‚ąí F 2 eg ‚ąí f 2 eg ‚ąí f 2
eF ‚ąí f E f F ‚ąí gE f E ‚ąí eF f F ‚ąí eG
c2 = c4 = c‚ąó2 = c‚ąó4 =
EG ‚ąí F 2 EG ‚ąí F 2 eg ‚ąí f 2 eg ‚ąí f 2
The constants in the above equations are determined in a manner similar to that suggested in problem 8.
Show that the Weingarten equations can be written in the form

‚ąāb
n ‚ąā~r
= ‚ąíbő≤őĪ ő≤ .
‚ąāuőĪ ‚ąāu

~ru √ó ~rv
I 10. b= ‚ąö
Using n , the results from exercise 1.1, problem 9(a), and the results from problem 5,
EG ‚ąí F 2
verify that
 p
2
(~ru √ó ~ruu ) ¬∑ n
b= EG ‚ąí F 2
11  p
 p 1
2 (~rv √ó ~ruv ) ¬∑ n
b=‚ąí EG ‚ąí F 2
(~ru √ó ~ruv ) ¬∑ n
b= EG ‚ąí F 2 21
12  p
 p 1
1 (~rv √ó ~rvv ) ¬∑ nb=‚ąí EG ‚ąí F 2
(~rv √ó ~ruu ) ¬∑ n
b=‚ąí EG ‚ąí F 2 22
11 p
 p b = EG ‚ąí F 2
(~ru √ó ~rv ) ¬∑ n
2
(~ru √ó ~rvv ) ¬∑ n
b= EG ‚ąí F 2
22

and then derive the formula for the geodesic curvature given by equation (1.5.48).
 
dT~ dT~ őĪ
n √ó T~ ) ¬∑
Hint:(b = (T~ √ó )¬∑n
b and aőĪőī ]ő≤ ő≥, őī] = .
ds ds ő≤ő≥
164

I 11. Verify the equation (1.5.39) which shows that the normal curvature directions are orthogonal. i.e.
verify that GőĽ1 őĽ2 + F (őĽ1 + őĽ2 ) + E = 0.
I 12. ő≤ő≥ ŌČőĪ
Verify that őīŌÉŌĄ őīőĽőĹ RŌČőĪő≤ő≥ = 4RőĽőĹŌÉŌĄ .
I 13. Find the first fundamental form and unit normal to the surface defined by z = f (x, y).
I 14. Verify
Ai,jk ‚ąí Ai,kj = AŌÉ R.ijk
ŌÉ

where          
‚ąā ŌÉ ‚ąā ŌÉ n ŌÉ n ŌÉ
ŌÉ
R.ijk = ‚ąí + ‚ąí .
‚ąāxj ik ‚ąāxk ij ik nj ij nk
which is sometimes written
   
‚ąā s
‚ąā s
Rinjk
= ‚ąāx j ‚ąāx k
+
n j n k

[nj, k] [nk, i] [ij, s] [ik, s]

I 15. ŌÉ
For Rijkl = giŌÉ R.jkl show
   
‚ąā ‚ąā s s
Rinjk = [nk, i] ‚ąí [nj, i] + [ik, s] ‚ąí [ij, s]
‚ąāxj ‚ąāxk nj nk

## which is sometimes written

   
‚ąā n

‚ąā
n
‚ąāxj ik
‚ąāxk
ij
ŌÉ
R.ijk =     +  
ŌÉ
 

ŌÉ ŌÉ ŌÉ
ij ik nk nj

I 16. Show
 
1 ‚ąā 2 gil ‚ąā 2 gjl ‚ąā 2 gik ‚ąā 2 gjk
Rijkl = ‚ąí ‚ąí + + g őĪő≤ ([jk, ő≤][il, őĪ] ‚ąí [jl, ő≤][ik, őĪ]) .
2 ‚ąāxj ‚ąāxk ‚ąāxi ‚ąāxk ‚ąāxj ‚ąāxl ‚ąāxi ‚ąāxl

## (i) Rjikl = ‚ąíRijkl , (ii) Rijlk = ‚ąíRijkl , (iii) Rklij = Rijkl

Hence, the tensor Rijkl is skew-symmetric in the indices i, j and k, l. Also the tensor Rijkl is symmetric with
respect to the (ij) and (kl) pair of indices.
I 18. Verify the following cyclic properties of the Riemann Christoffel symbol:

## (i) Rnijk + Rnjki + Rnkij = 0 first index fixed

(ii) Rinjk + Rjnki + Rknij = 0 second index fixed
(iii) Rijnk + Rjkni + Rkinj = 0 third index fixed
(iv) Rikjn + Rkjin + Rjikn = 0 fourth index fixed

I 19. By employing the results from the previous problems, show all components of the form:
Riijk , Rinjj , Riijj , Riiii , (no summation on i or j) must be zero.
165

I 20. Find the number of independent components associated with the Riemann Christoffel tensor
Rijkm , i, j, k, m = 1, 2, . . . , N. There are N 4 components to examine in an N ‚ąídimensional space. Many of
these components are zero and many of the nonzero components are related to one another by symmetries
or the cyclic properties. Verify the following cases:
CASE I We examine components of the form Rinin , i 6= n with no summation of i or n. The first index
can be chosen in N ways and therefore with i 6= n the second index can be chosen in N ‚ąí 1 ways. Observe
that Rinin = Rnini , (no summation on i or n) and so one half of the total combinations are repeated. This
1
leaves M1 = 2 N (N ‚ąí 1) components of the form Rinin . The quantity M1 can also be thought of as the
number of distinct pairs of indices (i, n).
CASE II We next examine components of the form Rinji , i 6= n 6= j where there is no summation on
the index i. We have previously shown that the first pair of indices can be chosen in M1 ways. Therefore,
the third index can be selected in N ‚ąí 2 ways and consequently there are M2 = 12 N (N ‚ąí 1)(N ‚ąí 2) distinct
components of the form Rinji with i 6= n 6= j.
CASE III Next examine components of the form Rinjk where i 6= n 6= j 6= k. From CASE I the first pairs
of indices (i, n) can be chosen in M1 ways. Taking into account symmetries, it can be shown that the second
pair of indices can be chosen in 12 (N ‚ąí 2)(N ‚ąí 3) ways. This implies that there are 14 N (N ‚ąí 1)(N ‚ąí 2)(N ‚ąí 3)
ways of choosing the indices i, n, j and k with i 6= n 6= j 6= k. By symmetry the pairs (i, n) and (j, k) can be
interchanged and therefore only one half of these combinations are distinct. This leaves
1
N (N ‚ąí 1)(N ‚ąí 2)(N ‚ąí 3)
8
distinct pairs of indices. Also from the cyclic relations we find that only two thirds of the above components
are distinct. This produces
N (N ‚ąí 1)(N ‚ąí 2)(N ‚ąí 3)
M3 =
12
distinct components of the form Rinjk with i 6= n 6= j 6= k.
Adding the above components from each case we find there are

N 2 (N 2 ‚ąí 1)
M4 = M1 + M2 + M3 =
12
distinct and independent components.
Verify the entries in the following table:

Dimension of space N 1 2 3 4 5
Number of components N 4 1 16 81 256 625
M4 = Independent components of Rijkm 0 1 6 20 50
Note 1: A one dimensional space can not be curved and all one dimensional spaces are Euclidean. (i.e. if we have
an element of arc length squared given by ds2 = f (x)(dx)2 , we can make the coordinate transformation
p
f (x)dx = du and reduce the arc length squared to the form ds2 = du2 .)
Note 2: In a two dimensional space, the indices can only take on the values 1 and 2. In this special case there
are 16 possible components. It can be shown that the only nonvanishing components are:

## R1212 = ‚ąíR1221 = ‚ąíR2112 = R2121 .

166

For these nonvanishing components only one independent component exists. By convention, the com-
ponent R1212 is selected as the single independent component and all other nonzero components are
expressed in terms of this component.
Find the nonvanishing independent components Rijkl for i, j, k, l = 1, 2, 3, 4 and show that

## R1212 R3434 R2142 R4124

R1313 R1231 R2342 R4314
R2323 R1421 R3213 R4234
R1414 R1341 R3243 R1324
R2424 R2132 R3143 R1432

## can be selected as the twenty independent components.

I 21.
(a) For N = 2 show R1212 is the only nonzero independent component and
R1212 = R2121 = ‚ąíR1221 = ‚ąíR2112 .
(b) Show that on the surface of a sphere of radius r0 we have R1212 = r02 sin2 őł.
I 22. Show for N = 2 that 2
‚ąāx
R1212 = R1212 J = R1212
2
‚ąāx
I 23. s
Define Rij = R.ijs as the Ricci tensor and Gij = Rji ‚ąí 12 őīji R as the Einstein tensor, where Rji = g ik Rkj
and R = Rii . Show that

## (a) Rjk = g ab Rjabk

‚ąö   ‚ąö     
‚ąā 2 log g b ‚ąā log g ‚ąā a b a
(b) Rij = i j
‚ąí b
‚ąí a +
‚ąāx ‚ąāx ij ‚ąāx ‚ąāx i j ia jb
i
(c) Rijk =0

I 24. By employing the results from the previous problem show that in the case N = 2 we have

= = =‚ąí
g11 g22 g12 g

## where g is the determinant of gij .

I 25. Consider the case N = 2 where we have g12 = g21 = 0 and show that

2R1221
(a) R12 = R21 = 0 (c) R=
g11 g22
(b) R11 g22 = R22 g11 = R1221 1
(d) Rij = Rgij , where R = g ij Rij
2

The scalar invariant R is known as the Einstein curvature of the surface and the tensor Gij = Rji ‚ąí 12 őīji R is
known as the Einstein tensor.
I 26. For N = 3 show that R1212 , R1313 , R2323 , R1213 , R2123 , R3132 are independent components of the
Riemann Christoffel tensor.
167
 
a11 0
I 27. For N = 2 and aőĪő≤ = show that
0 a22
    
R1212 1 ‚ąā 1 ‚ąāa22 ‚ąā 1 ‚ąāa11
K= =‚ąí ‚ąö ‚ąö + ‚ąö .
a 2 a ‚ąāu1 a ‚ąāu1 ‚ąāu2 a ‚ąāu2
 
a11 a12
I 28. For N = 2 and aőĪő≤ = show that
a21 a22
    
1 ‚ąā a12 ‚ąāa11 1 ‚ąāa22 ‚ąā 2 ‚ąāa12 1 ‚ąāa11 a12 ‚ąāa11
K= ‚ąö ‚ąö ‚ąí‚ąö + ‚ąö ‚ąí‚ąö ‚ąí ‚ąö .
2 a ‚ąāu1 a11 a ‚ąāu2 a ‚ąāu1 ‚ąāu2 a ‚ąāu1 a ‚ąāu2 a11 a ‚ąāu1

Check your results by setting a12 = a21 = 0 and comparing this answer with that given in the problem 27.
I 29. Write out the Frenet-Serret formulas (1.5.112)(1.5.113) for surface curves in terms of Christoffel
symbols of the second kind.
I 30.
(a) Use the fact that for n = 2 we have R1212 = R2121 = ‚ąíR2112 = ‚ąíR1221 together with eőĪő≤ , eőĪő≤ the two
dimensional alternating tensors to show that the equation (1.5.110) can be written as

‚ąö 1
RőĪő≤ő≥őī = KőĪő≤ ő≥őī where őĪő≤ = aeőĪő≤ and őĪő≤ = ‚ąö eőĪő≤
a

## are the corresponding epsilon tensors.

1
(b) Show that from the result in part (a) we obtain RőĪő≤ő≥őī őĪő≤ ő≥őī = K.
4
Hint: See equations (1.3.82),(1.5.93) and (1.5.94).
I 31. Verify the result given by the equation (1.5.100).
I 32. Show that aőĪő≤ cőĪő≤ = 4H 2 ‚ąí 2K.
I 33. Find equations for the principal curvatures associated with the surface

x = u, y = v, z = f (u, v).

I 34. Geodesics on a sphere Let (őł, ŌÜ) denote the surface coordinates of the sphere of radius ŌĀ defined
by the parametric equations

## x = ŌĀ sin őł cos ŌÜ, y = ŌĀ sin őł sin ŌÜ, z = ŌĀ cos őł. (1)

Consider also a plane which passes through the origin with normal having the direction numbers (n1 , n2 , n3 ).
This plane is represented by n1 x + n2 y + n3 z = 0 and intersects the sphere in a great circle which is described
by the relation
n1 sin őł cos ŌÜ + n2 sin őł sin ŌÜ + n3 cos őł = 0. (2)

This is an implicit relation between the surface coordinates őł, ŌÜ which describes the great circle lying on the
sphere. We can write this later equation in the form

‚ąín3
n1 cos ŌÜ + n2 sin ŌÜ = (3)
tan őł
168

and in the special case where n1 = cos ő≤, n2 = sin ő≤,n3 = ‚ąí tan őĪ is expressible in the form
 
tan őĪ tan őĪ
cos(ŌÜ ‚ąí ő≤) = or ŌÜ ‚ąí ő≤ = cos‚ąí1 . (4)
tan őł tan őł

The above equation defines an explicit relationship between the surface coordinates which defines a great
circle on the sphere. The arc length squared relation satisfied by the surface coordinates together with the
equation obtained by differentiating equation (4) with respect to arc length s gives the relations

dŌÜ tan őĪ dőł
sin2 őł =q (5)
ds tan2 őĪ ds
1 ‚ąí tan2 őł
ds2 = ŌĀ2 dőł2 + ŌĀ2 sin2 őł dŌÜ2 (6)

The above equations (1)-(6) are needed to consider the following problem.

(a) Show that the differential equations defining the geodesics on the surface of a sphere (equations (1.5.51))
are
 2
d2 őł dŌÜ
‚ąí sin őł cos őł =0 (7)
ds2 ds
d2 ŌÜ dőł dŌÜ
+ 2 cot őł =0 (8)
ds2 ds ds

dŌÜ
sin2 őł = c1 (9)
ds

## where c1 is a constant of integration.

dőł
(c) Multiply equation (7) by ds and use the result of equation (9) to show that an integration produces
 2
dőł ‚ąíc21
= + c22 (10)
ds sin2 őł

## where c22 is a constant of integration.

sin őĪ
(d) Use the equations (5)(6) to show that c2 = 1/ŌĀ and c1 = ŌĀ .
(e) Show that equations (9) and (10) imply that

dŌÜ tan őĪ sec2 őł
= q
dőł tan2 őł 1 ‚ąí tan2 őĪ
tan2 őł

tan őĪ
and making the substitution u = tan őł this equation can be integrated to obtain the equation (4). We
can now expand the equation (4) and express the results in terms of x, y, z to obtain the equation (3).
This produces a plane which intersects the sphere in a great circle. Consequently, the geodesics on a
sphere are great circles.
169

I 35. Find the differential equations defining the geodesics on the surface of a cylinder.
I 36. Find the differential equations defining the geodesics on the surface of a torus. (See problem 13,
Exercise 1.3)
I 37. Find the differential equations defining the geodesics on the surface of revolution

## x = r cos ŌÜ, y = r sin ŌÜ, z = f (r).

Note the curve z = f (x) gives a profile of the surface. The curves r = Constant are the parallels, while the
curves ŌÜ = Constant are the meridians of the surface and

## ds2 = (1 + f 02 ) dr2 + r2 dŌÜ2 .

I 38. Find the unit normal and tangent plane to an arbitrary point on the right circular cone

## x = u sin őĪ cos ŌÜ, y = u sin őĪ sin ŌÜ, z = u cos őĪ.

This is a surface of revolution with r = u sin őĪ and f (r) = r cot őĪ with őĪ constant.
I 39. Let s denote arc length and assume the position vector ~r(s) is analytic about a point s0 . Show that
h2 h3
the Taylor series ~r(s) = ~r(s0 ) + h~r 0 (s0 ) + ~r 00 (s0 ) + ~r 000 (s0 ) + ¬∑ ¬∑ ¬∑ about the point s0 , with h = s ‚ąí s0 is
2! 3!
given by ~r(s) = ~r(s0 ) + hT~ + 12 őļh2 N ~ + 1 h3 (‚ąíőļ2 T~ + őļ0 N
6
~ + őļŌĄ B) ~ + ¬∑ ¬∑ ¬∑ which is obtained by differentiating
the Frenet formulas.
I 40.
(a) Show that the circular helix defined by x = a cos t, y = a sin t, z = bt with a, b constants, has the
property that any tangent to the curve makes a constant angle with the line defining the z-axis.
(i.e. T~ ¬∑ b
e3 = cos őĪ = constant.)
(b) Show also that N e3 = 0 and consequently b
~ ¬∑ b e3 is parallel to the rectifying plane, which implies that
b
e3 = T~ cos őĪ + B
~ sin őĪ.
(c) Differentiate the result in part (b) and show that őļ/ŌĄ = tan őĪ is a constant.
I 41. Consider a space curve xi = xi (s) in Cartesian coordinates.
dT~ p

(a) Show that őļ = = x0i x0i
ds
1
(b) Show that ŌĄ = eijk x0i x00j x000 r 0 ¬∑ ~r 00 √ó ~r 000
k . Hint: Consider ~
őļ2

I 42.
(a) Find the direction cosines of a normal to a surface z = f (x, y).
(b) Find the direction cosines of a normal to a surface F (x, y, z) = 0.
(c) Find the direction cosines of a normal to a surface x = x(u, v), y = y(u, v), z = z(u, v).
I 43. Show that for a smooth surface z = f (x, y) the Gaussian curvature at a point on the surface is given
by
2
fxx fyy ‚ąí fxy
K= .
(fx2 + fy2 + 1)2
170

I 44. Show that for a smooth surface z = f (x, y) the mean curvature at a point on the surface is given by

## (1 + fy2 )fxx ‚ąí 2fx fy fxy + (1 + fx2 )fyy

H= .
2(fx2 + fy2 + 1)3/2

I 45. Express the Frenet-Serret formulas (1.5.13) in terms of Christoffel symbols of the second kind.
I 46. Verify the relation (1.5.106).
I 47. In Vn assume that Rij = ŌĀgij and show that ŌĀ = R
n where R = g ij Rij . This result is known as
Einstein‚Äôs gravitational equation at points where matter is present. It is analogous to the Poisson equation
‚ąá2 V = ŌĀ from the Newtonian theory of gravitation.
I 48. In Vn assume that Rijkl = K(gik gjl ‚ąí gil gjk ) and show that R = Kn(1 ‚ąí n). (Hint: See problem 23.)
I 49. Assume gij = 0 for i 6= j and verify the following.
(a) Rhijk = 0 for h 6= i 6= j 6= k
 2‚ąö ‚ąö ‚ąö ‚ąö ‚ąö 
‚ąö ‚ąā gii ‚ąā gii ‚ąā log ghh ‚ąā gii ‚ąā log gkk
(b) Rhiik = gii ‚ąí ‚ąí for h, i, k unequal.
‚ąāxhÔ£ģ‚ąāxk ‚ąāxh ‚ąāxk ‚ąāxk ‚ąāxh Ô£Ļ
 ‚ąö   ‚ąö  Xn ‚ąö ‚ąö
‚ąö ‚ąö Ô£Į ‚ąā 1 ‚ąā gii ‚ąā 1 ‚ąā ghh ‚ąā gii ‚ąā ghh Ô£ļ
(c) Rhiih = gii ghh Ô£į h ‚ąö + i ‚ąö + Ô£Ľ where h 6= i.
‚ąāx ghh ‚ąāxh ‚ąāx gii ‚ąāxi m=1
‚ąāxm ‚ąāxm
m6=h m6=i

I 50. Consider a surface of revolution where x = r cos őł, y = r sin őł and z = f (r) is a given function of r.
(a) Show in this V2 we have ds2 = (1 + (f 0 )2 )dr2 + r2 dőł2 where 0 = d
ds .
(b) Show the geodesic equations in this V2 are
 2  2
d2 r f 0 f 00 dr r dőł
2
+ 0 2
‚ąí 0 2
=0
ds 1 + (f ) ds 1 + (f ) ds
d2 őł 2 dőł dr
+ =0
ds2 r ds ds
dőł a
(c) Solve the second equation in part (b) to obtain = 2 . Substitute this result for ds in part (a) to show
p ds r
a 1 + (f 0 )2
dőł = ¬Ī ‚ąö dr which theoretically can be integrated.
r r 2 ‚ąí a2
171

## PART 2: INTRODUCTION TO CONTINUUM MECHANICS

In the following sections we develop some applications of tensor calculus in the areas of dynamics,
elasticity, fluids and electricity and magnetism. We begin by first developing generalized expressions for the
vector operations of gradient, divergence, and curl. Also generalized expressions for other vector operators
are considered in order that tensor equations can be converted to vector equations. We construct a table to
aid in the translating of generalized tensor equations to vector form and vice versa.
The basic equations of continuum mechanics are developed in the later sections. These equations are
developed in both Cartesian and generalized tensor form and then converted to vector form.

## ¬ß2.1 TENSOR NOTATION FOR SCALAR AND VECTOR QUANTITIES

We consider the tensor representation of some vector expressions. Our goal is to develop the ability to
convert vector equations to tensor form as well as being able to represent tensor equations in vector form.
In this section the basic equations of continuum mechanics are represented using both a vector notation and
the indicial notation which focuses attention on the tensor components. In order to move back and forth
between these notations, the representation of vector quantities in tensor form is now considered.

## For ő¶ = ő¶(x1 , x2 , . . . , xN ) a scalar function of the coordinates xi , i = 1, . . . , N , the gradient of ő¶ is

defined as the covariant vector
‚ąāő¶
ő¶,i = , i = 1, . . . , N. (2.1.1)
‚ąāxi
The contravariant form of the gradient is
g im ő¶,m . (2.1.2)

Note, if C i = g im ő¶,m , i = 1, 2, 3 are the tensor components of the gradient then in an orthogonal coordinate
system we will have
C 1 = g 11 ő¶,1 , C 2 = g 22 ő¶,2 , C 3 = g 33 ő¶,3 .

We note that in an orthogonal coordinate system that g ii = 1/h2i , (no sum on i), i = 1, 2, 3 and hence
replacing the tensor components by their equivalent physical components there results the equations

## C(1) 1 ‚ąāő¶ C(2) 1 ‚ąāő¶ C(3) 1 ‚ąāő¶

= 2 1, = 2 2, = 2 3.
h1 h1 ‚ąāx h2 h2 ‚ąāx h3 h3 ‚ąāx

## Simplifying, we find the physical components of the gradient are

1 ‚ąāő¶ 1 ‚ąāő¶ 1 ‚ąāő¶
C(1) = , C(2) = , C(3) = .
h1 ‚ąāx1 h2 ‚ąāx2 h3 ‚ąāx3

These results are only valid when the coordinate system is orthogonal and gij = 0 for i 6= j and gii = h2i ,
with i = 1, 2, 3, and where i is not summed.
172

Divergence

The divergence of a contravariant tensor Ar is obtained by taking the covariant derivative with respect
to xk and then performing a contraction. This produces

## div Ar = Ar,r . (2.1.3)

Still another form for the divergence is obtained by simplifying the expression (2.1.3). The covariant deriva-
tive can be represented  
‚ąāAr r
Ar,k = + Am .
‚ąāxk mk
Upon contracting the indices r and k and using the result from Exercise 1.4, problem 13, we obtain
‚ąö
‚ąāAr 1 ‚ąā( g) m
Ar,r = + ‚ąö A
‚ąāxr g ‚ąāxm
 ‚ąö 
1 ‚ąö ‚ąāAr r‚ąā g
Ar,r = ‚ąö g r +A (2.1.4)
g ‚ąāx ‚ąāxr
1 ‚ąā ‚ąö r
Ar,r = ‚ąö ( gA ) .
g ‚ąāxr

EXAMPLE 2.1-1. (Divergence) Find the representation of the divergence of a vector Ar in spherical
coordinates (ŌĀ, őł, ŌÜ). Solution: In spherical coordinates we have

## x1 = ŌĀ, x2 = őł, x3 = ŌÜ with gij = 0 for i 6= j and

g11 = h21 = 1, g22 = h22 = ŌĀ2 , g33 = h23 = ŌĀ2 sin2 őł.
‚ąö
The determinant of gij is g = |gij | = ŌĀ4 sin2 őł and g = ŌĀ2 sin őł. Employing the relation (2.1.4) we find
 
r1 ‚ąā ‚ąö 1 ‚ąā ‚ąö 2 ‚ąā ‚ąö 3
div A = ‚ąö ( gA ) + 2 ( gA ) + 3 ( gA ) .
g ‚ąāx1 ‚ąāx ‚ąāx

## In terms of the physical components this equation becomes

 
r 1 ‚ąā ‚ąö A(1) ‚ąā ‚ąö A(2) ‚ąā ‚ąö A(3)
div A = ‚ąö ( g )+ ( g )+ ( g ) .
g ‚ąāŌĀ h1 ‚ąāőł h2 ‚ąāŌÜ h3

## By using the notation

A(1) = AŌĀ , A(2) = Aőł , A(3) = AŌÜ

for the physical components, the divergence can be expressed in either of the forms:
 
1 ‚ąā 2 ‚ąā 2 Aőł ‚ąā 2 AŌÜ
div Ar = (ŌĀ sin őłA ŌĀ ) + (ŌĀ sin őł ) + (ŌĀ sin őł ) or
ŌĀ2 sin őł ‚ąāŌĀ ‚ąāőł ŌĀ ‚ąāŌÜ ŌĀ sin őł
1 ‚ąā 1 ‚ąā 1 ‚ąāAŌÜ
div Ar = 2 (ŌĀ2 AŌĀ ) + (sin őłAőł ) + .
ŌĀ ‚ąāŌĀ ŌĀ sin őł ‚ąāőł ŌĀ sin őł ‚ąāŌÜ
173

Curl

~ = curl A
The contravariant components of the vector C ~ are represented

 
1 1 ‚ąāA3 ‚ąāA2
C =‚ąö ‚ąí
g ‚ąāx2 ‚ąāx3
 
2 1 ‚ąāA1 ‚ąāA3
C =‚ąö 3
‚ąí (2.1.6)
g ‚ąāx ‚ąāx1
 
1 ‚ąāA2 ‚ąāA1
C3 = ‚ąö 1
‚ąí .
g ‚ąāx ‚ąāx2

## EXAMPLE 2.1-2. (Curl) ~ in spherical coordinates

Find the representation for the components of curl A
(ŌĀ, őł, ŌÜ).
Solution: In spherical coordinates we have :x1 = ŌĀ, x2 = őł, x3 = ŌÜ with gij = 0 for i 6= j and

## g11 = h21 = 1, g22 = h22 = ŌĀ2 , g33 = h23 = ŌĀ2 sin2 őł.

‚ąö
The determinant of gij is g = |gij | = ŌĀ4 sin2 őł with g = ŌĀ2 sin őł. The relations (2.1.6) are tensor equations
representing the components of the vector curl A. ~ To find the components of curl A~ in spherical components
we write the equations (2.1.6) in terms of their physical components. These equations take on the form:
 
C(1) 1 ‚ąā ‚ąā
=‚ąö (h3 A(3)) ‚ąí (h2 A(2))
h1 g ‚ąāőł ‚ąāŌÜ
 
C(2) 1 ‚ąā ‚ąā
=‚ąö (h1 A(1)) ‚ąí (h3 A(3)) (2.1.7)
h2 g ‚ąāŌÜ ‚ąāŌĀ
 
C(3) 1 ‚ąā ‚ąā
=‚ąö (h2 A(2)) ‚ąí (h1 A(1)) .
h3 g ‚ąāŌĀ ‚ąāőł

## C(1) = CŌĀ , C(2) = Cőł , C(3) = CŌÜ , A(1) = AŌĀ , A(2) = Aőł , A(3) = AŌÜ

~ in spherical coordinates,
to denote the physical components, and find the components of the vector curl A,
are expressible in the form:  
1 ‚ąā ‚ąā
CŌĀ = 2 (ŌĀ sin őłAŌÜ ) ‚ąí (ŌĀAőł )
ŌĀ sin őł ‚ąāőł ‚ąāŌÜ
 
1 ‚ąā ‚ąā
Cőł = (AŌĀ ) ‚ąí (ŌĀ sin őłAŌÜ ) (2.1.8)
ŌĀ sin őł ‚ąāŌÜ ‚ąāŌĀ
 
1 ‚ąā ‚ąā
CŌÜ = (ŌĀAőł ) ‚ąí (AŌĀ ) .
ŌĀ ‚ąāŌĀ ‚ąāőł
174

Laplacian

## The Laplacian ‚ąá2 U has the contravariant form

 
2 ij ij ij ‚ąāU
‚ąá U = g U,ij = (g U,i ),j = g . (2.1.9)
‚ąāxi ,j

## Expanding this expression produces the equations:

   
‚ąā ij ‚ąāU im ‚ąāU j
‚ąá2 U = g + g
‚ąāxj ‚ąāxi ‚ąāxi m j
  ‚ąö
‚ąā ij ‚ąāU 1 ‚ąā g ij ‚ąāU
‚ąá2 U = g + ‚ąö g
‚ąāxj ‚ąāxi g ‚ąāxj ‚ąāxi
   ‚ąö  (2.1.10)
1 ‚ąö ‚ąā ‚ąāU ‚ąāU ‚ąā g
‚ąá2 U = ‚ąö g j g ij i + g ij i
g ‚ąāx ‚ąāx ‚ąāx ‚ąāxj
 
1 ‚ąā ‚ąö ij ‚ąāU
‚ąá2 U = ‚ąö gg .
g ‚ąāxj ‚ąāxi

## and so (2.1.10) when expanded reduces to the form

      
2 1 ‚ąā h2 h3 ‚ąāU ‚ąā h1 h3 ‚ąāU ‚ąā h1 h2 ‚ąāU
‚ąá U= + 2 + 3 . (2.1.11)
h1 h2 h3 ‚ąāx1 h1 ‚ąāx1 ‚ąāx h2 ‚ąāx2 ‚ąāx h3 ‚ąāx3

## EXAMPLE 2.1-3. (Laplacian) Find the Laplacian in spherical coordinates.

Solution: Utilizing the results given in the previous example we find the Laplacian in spherical coordinates
has the form       
2 1 ‚ąā 2 ‚ąāU ‚ąā ‚ąāU ‚ąā 1 ‚ąāU
‚ąá U= 2 ŌĀ sin őł + sin őł + . (2.1.12)
ŌĀ sin őł ‚ąāŌĀ ‚ąāŌĀ ‚ąāőł ‚ąāőł ‚ąāŌÜ sin őł ‚ąāŌÜ
This simplifies to
‚ąā2U 2 ‚ąāU 1 ‚ąā 2U cot őł ‚ąāU 1 ‚ąā2U
‚ąá2 U = 2
+ + 2 2
+ 2 + 2 2 . (2.1.13)
‚ąāŌĀ ŌĀ ‚ąāŌĀ ŌĀ ‚ąāőł ŌĀ ‚ąāőł ŌĀ sin őł ‚ąāŌÜ2

The table 1 gives the vector and tensor representation for various quantities of interest.
175

## VECTOR GENERAL TENSOR CARTESIAN TENSOR

~
A Ai or Ai Ai

~¬∑B
A ~ Ai Bi = gij Ai B j = Ai B i Ai Bi
Ai Bi = g ij Ai Bj
1
~ =A
C ~√óB
~ C i = ‚ąö eijk Aj Bk Ci = eijk Aj Bk
g

‚ąāő¶
‚ąá ő¶ = grad ő¶ g im ő¶,m ő¶,i =
‚ąāxi

1 ‚ąā ‚ąö r ‚ąāAi
~ = div A
‚ąá¬∑A ~ g mn Am,n = Ar,r = ‚ąö ( gA ) Ai,i =
g ‚ąāxr ‚ąāxi

‚ąāAk
~=C
‚ąá√óA ~ = curl A
~ C i = ijk Ak,j Ci = eijk
‚ąāxj
   
1 ‚ąā ‚ąö ij ‚ąāU ‚ąā ‚ąāU
‚ąá2 U g mn U ,mn = ‚ąö gg
g ‚ąāxj ‚ąāxi ‚ąāxi ‚ąāxi

‚ąāBi
C ~ ¬∑ ‚ąá)B
~ = (A ~ C i = Am B i,m Ci = Am
‚ąāxm

‚ąāBm
~ = A(‚ąá
C ~ ~
¬∑ B) C i = Ai B j,j Ci = Ai
‚ąāxm
 
~ = ‚ąá2 A ‚ąā ‚ąāAi
C ~ C i = g jm Ai ,mj or Ci = g jm Ai,mj Ci =
‚ąāxm ‚ąāxm
 
A~¬∑‚ąá ŌÜ g im Ai ŌÜ ,m Ai ŌÜ,i

   ‚ąā 2 Ar
~
‚ąá ‚ąá¬∑A g im Ar,r ,m ‚ąāxi ‚ąāxr
   ‚ąā 2 Aj ‚ąā 2 Ai
~
‚ąá√ó ‚ąá√óA ijk g jm kst At,s ,m
‚ąí
‚ąāxj ‚ąāxi ‚ąāxj ‚ąāxj

## Table 1 Vector and tensor representations.

176

EXAMPLE 2.1-4. (Maxwell‚Äôs equations) In the study of electrodynamics there arises the following
vectors and scalars:
~ =Electric force vector, [E]
E ~ = Newton/coulomb
~ = Weber/m2
~ =Magnetic force vector, [B]
B
~ = coulomb/m2
~ =Displacement vector, [D]
D
~ =Auxilary magnetic force vector, [H]
H ~ = ampere/m
~ = ampere/m2
J~ =Free current density, [J]
% =free charge density, [%] = coulomb/m3
The above quantities arise in the representation of the following laws:
Faraday‚Äôs Law This law states the line integral of the electromagnetic force around a loop is proportional
to the rate of flux of magnetic induction through the loop. This gives rise to the first electromagnetic field
equation:
~
‚ąāB ‚ąāB i
~ =‚ąí
‚ąá√óE or ijk Ek,j = ‚ąí . (2.1.15)
‚ąāt ‚ąāt
Ampere‚Äôs Law This law states the line integral of the magnetic force vector around a closed loop is
proportional to the sum of the current through the loop and the rate of flux of the displacement vector
through the loop. This produces the second electromagnetic field equation:
~
‚ąāD ‚ąāDi
~ = J~ +
‚ąá√óH or ijk Hk,j = J i + . (2.1.16)
‚ąāt ‚ąāt
Gauss‚Äôs Law for Electricity This law states that the flux of the electric force vector through a closed
surface is proportional to the total charge enclosed by the surface. This results in the third electromagnetic
field equation:
1 ‚ąā ‚ąö i
~ =%
‚ąá¬∑D or ‚ąö gD = %. (2.1.17)
g ‚ąāxi
Gauss‚Äôs Law for Magnetism This law states the magnetic flux through any closed volume is zero. This
produces the fourth electromagnetic field equation:
1 ‚ąā ‚ąö i
~ =0
‚ąá¬∑B or ‚ąö gB = 0. (2.1.18)
g ‚ąāxi

The four electromagnetic field equations are referred to as Maxwell‚Äôs equations. These equations arise
in the study of electrodynamics and can be represented in other forms. These other forms will depend upon
such things as the material assumptions and units of measurements used. Note that the tensor equations
(2.1.15) through (2.1.18) are representations of Maxwell‚Äôs equations in a form which is independent of the
coordinate system chosen.
In applications, the tensor quantities must be expressed in terms of their physical components. In a
general orthogonal curvilinear coordinate system we will have

## g11 = h21 , g22 = h22 , g33 = h23 , and gij = 0 for i 6= j.

‚ąö
This produces the result g = h1 h2 h3 . Further, if we represent the physical components of

## Di , Bi , Ei , Hi by D(i), B(i), E(i), and H(i)

177

the Maxwell equations can be represented by the equations in table 2. The tables 3, 4 and 5 are the
representation of Maxwell‚Äôs equations in rectangular, cylindrical, and spherical coordinates. These latter
tables are special cases associated with the more general table 2.

 
1 ‚ąā ‚ąā 1 ‚ąāB(1)
2
(h3 E(3)) ‚ąí 3 (h2 E(2)) = ‚ąí
h1 h2 h3 ‚ąāx ‚ąāx h1 ‚ąāt
 
1 ‚ąā ‚ąā 1 ‚ąāB(2)
3
(h1 E(1)) ‚ąí 1 (h3 E(3)) = ‚ąí
h1 h2 h3 ‚ąāx ‚ąāx h2 ‚ąāt
 
1 ‚ąā ‚ąā 1 ‚ąāB(3)
1
(h2 E(2)) ‚ąí 2 (h1 E(1)) = ‚ąí
h1 h2 h3 ‚ąāx ‚ąāx h3 ‚ąāt

 
1 ‚ąā ‚ąā J(1) 1 ‚ąāD(1)
2
(h3 H(3)) ‚ąí 3 (h2 H(2)) = +
h1 h2 h3 ‚ąāx ‚ąāx h1 h1 ‚ąāt
 
1 ‚ąā ‚ąā J(2) 1 ‚ąāD(2)
(h1 H(1)) ‚ąí 1 (h3 H(3)) = +
h1 h2 h3 ‚ąāx3 ‚ąāx h2 h2 ‚ąāt
 
1 ‚ąā ‚ąā J(3) 1 ‚ąāD(3)
(h2 H(2)) ‚ąí 2 (h1 H(1)) = +
h1 h2 h3 ‚ąāx1 ‚ąāx h3 h3 ‚ąāt

      
1 ‚ąā D(1) ‚ąā D(2) ‚ąā D(3)
h1 h2 h3 + 2 h1 h2 h3 + 3 h1 h2 h3 =%
h1 h2 h3 ‚ąāx1 h1 ‚ąāx h2 ‚ąāx h3

      
1 ‚ąā B(1) ‚ąā B(2) ‚ąā B(3)
h1 h2 h3 + 2 h1 h2 h3 + 3 h1 h2 h3 =0
h1 h2 h3 ‚ąāx1 h1 ‚ąāx h2 ‚ąāx h3

## Table 2 Maxwell‚Äôs equations in generalized orthogonal coordinates.

Note that all the tensor components have been replaced by their physical components.
178

## ‚ąāEz ‚ąāEy ‚ąāBx ‚ąāHz ‚ąāHy ‚ąāDx ‚ąāDx

‚ąí =‚ąí ‚ąí = Jx + +
‚ąāDy
+
‚ąāDz
=%
‚ąāy ‚ąāz ‚ąāt ‚ąāy ‚ąāz ‚ąāt ‚ąāx ‚ąāy ‚ąāz
‚ąāEx ‚ąāEz ‚ąāBy ‚ąāHx ‚ąāHz ‚ąāDy
‚ąí =‚ąí ‚ąí = Jy +
‚ąāz ‚ąāx ‚ąāt ‚ąāz ‚ąāx ‚ąāt
‚ąāEy ‚ąāEx ‚ąāBz ‚ąāHy ‚ąāHx ‚ąāDz ‚ąāBx ‚ąāBy ‚ąāBz
‚ąí =‚ąí ‚ąí = Jz + + + =0
‚ąāx ‚ąāy ‚ąāt ‚ąāx ‚ąāy ‚ąāt ‚ąāx ‚ąāy ‚ąāz

## Dx = D(1) Bx = B(1) Hx = H(1) Jx = J(1) Ex = E(1)

Dy = D(2) By = B(2) Hy = H(2) Jy = J(2) Ey = E(2)
Dz = D(3) Bz = B(3) Hz = H(3) Jz = J(3) Ez = E(3)

with x1 = x, x2 = y, x3 = z, h1 = h 2 = h 3 = 1

## 1 ‚ąāEz ‚ąāEőł ‚ąāBr 1 ‚ąāHz ‚ąāHőł ‚ąāDr

‚ąí =‚ąí ‚ąí = Jr +
r ‚ąāőł ‚ąāz ‚ąāt r ‚ąāőł ‚ąāz ‚ąāt
‚ąāEr ‚ąāEz ‚ąāBőł ‚ąāHr ‚ąāHz ‚ąāDőł
‚ąí =‚ąí ‚ąí = Jőł +
‚ąāz ‚ąār ‚ąāt ‚ąāz ‚ąār ‚ąāt
1 ‚ąā 1 ‚ąāEr ‚ąāBz 1 ‚ąā 1 ‚ąāHr ‚ąāDz
(rEőł ) ‚ąí =‚ąí (rHőł ) ‚ąí = Jz +
r ‚ąār r ‚ąāőł ‚ąāt r ‚ąār r ‚ąāőł ‚ąāt
1 ‚ąā 1 ‚ąāDőł ‚ąāDz 1 ‚ąā 1 ‚ąāBőł ‚ąāBz
(rDr ) + + =% (rBr ) + + =0
r ‚ąār r ‚ąāőł ‚ąāz r ‚ąār r ‚ąāőł ‚ąāz

## Dr = D(1) Br = B(1) Hr = H(1) Jr = J(1) Er = E(1)

Dőł = D(2) Bőł = B(2) Hőł = H(2) Jőł = J(2) Eőł = E(2)
Dz = D(3) Bz = B(3) Hz = H(3) Jz = J(3) Ez = E(3)

with x1 = r, x2 = őł, x3 = z, h1 = 1, h2 = r, h3 = 1.

## Table 4 Maxwell‚Äôs equations in cylindrical coordinates.

179

   
1 ‚ąā ‚ąāEőł ‚ąāBŌĀ 1 ‚ąā ‚ąāHőł ‚ąāDŌĀ
(sin őłEŌÜ ) ‚ąí =‚ąí (sin őłHŌÜ ) ‚ąí = JŌĀ +
ŌĀ sin őł ‚ąāőł ‚ąāŌÜ ‚ąāt ŌĀ sin őł ‚ąāőł ‚ąāŌÜ ‚ąāt
1 ‚ąāEŌĀ 1 ‚ąā ‚ąāBőł 1 ‚ąāHŌĀ 1 ‚ąā ‚ąāDőł
‚ąí (ŌĀEŌÜ ) = ‚ąí ‚ąí (ŌĀHŌÜ ) = Jőł +
ŌĀ sin őł ‚ąāŌÜ ŌĀ ‚ąāŌĀ ‚ąāt ŌĀ sin őł ‚ąāŌÜ ŌĀ ‚ąāŌĀ ‚ąāt
1 ‚ąā 1 ‚ąāEŌĀ ‚ąāBŌÜ 1 ‚ąā 1 ‚ąāHŌĀ ‚ąāDŌÜ
(ŌĀEőł ) ‚ąí =‚ąí (ŌĀHőł ) ‚ąí = JŌÜ +
ŌĀ ‚ąāŌĀ ŌĀ ‚ąāőł ‚ąāt ŌĀ ‚ąāŌĀ ŌĀ ‚ąāőł ‚ąāt

1 ‚ąā 2 1 ‚ąā 1 ‚ąāDŌÜ
2
(ŌĀ DŌĀ ) + (sin őłDőł ) + =%
ŌĀ ‚ąāŌĀ ŌĀ sin őł ‚ąāőł ŌĀ sin őł ‚ąāŌÜ
1 ‚ąā 2 1 ‚ąā 1 ‚ąāBŌÜ
(ŌĀ BŌĀ ) + (sin őłBőł ) + =0
ŌĀ2 ‚ąāŌĀ ŌĀ sin őł ‚ąāőł ŌĀ sin őł ‚ąāŌÜ

## DŌĀ = D(1) BŌĀ = B(1) HŌĀ = H(1) JŌĀ = J(1) EŌĀ = E(1)

Dőł = D(2) Bőł = B(2) Hőł = H(2) Jőł = J(2) Eőł = E(2)
DŌÜ = D(3) BŌÜ = B(3) HŌÜ = H(3) JŌÜ = J(3) EŌÜ = E(3)

with x1 = ŌĀ, x2 = őł, x3 = ŌÜ, h1 = 1, h2 = ŌĀ, h3 = ŌĀ sin őł

## Consider the equation

Tij Aj = őĽAi , i, j = 1, 2, 3, (2.1.19)

where Tij = Tji is symmetric, Ai are the components of a vector and őĽ is a scalar. Any nonzero solution
Ai of equation (2.1.19) is called an eigenvector of the tensor Tij and the associated scalar őĽ is called an
eigenvalue. When expanded these equations have the form

## (T11 ‚ąí őĽ)A1 + T12 A2 + T13 A3 = 0

T21 A1 + (T22 ‚ąí őĽ)A2 + T23 A3 = 0
T31 A1 + T32 A2 + (T33 ‚ąí őĽ)A3 = 0.

The condition for equation (2.1.19) to have a nonzero solution Ai is that the characteristic equation
should be zero. This equation is found from the determinant equation

T11 ‚ąí őĽ T12 T13

f (őĽ) = T21 T22 ‚ąí őĽ T23 = 0, (2.1.20)
T31 T32 T33 ‚ąí őĽ
180

## where I1 , I2 and I3 are invariants defined by the relations

I1 = Tii
1 1
I2 = Tii Tjj ‚ąí Tij Tij (2.1.22)
2 2
I3 = eijk Ti1 Tj2 Tk3 .

When Tij is subjected to an orthogonal transformation, where TŐĄmn = Tij `im `jn , then

`im `jn (Tmn ‚ąí őĽ őīmn ) = TŐĄij ‚ąí őĽ őīij and det (Tmn ‚ąí őĽ őīmn ) = det TŐĄij ‚ąí őĽ őīij .

Hence, the eigenvalues of a second order tensor remain invariant under an orthogonal transformation.
If Tij is real and symmetric then

## ‚ÄĘ the eigenvalues of Tij will be real, and

‚ÄĘ the eigenvectors corresponding to distinct eigenvalues will be orthogonal.

Proof: To show a quantity is real we show that the conjugate of the quantity equals the given quantity. If
(2.1.19) is satisfied, we multiply by the conjugate Ai and obtain

## Ai Tij Aj = őĽAi Ai . (2.1.25)

The right hand side of this equation has the inner product Ai Ai which is real. It remains to show the left
hand side of equation (2.1.25) is also real. Consider the conjugate of this left hand side and write

## Ai Tij Aj = Ai T ij Aj = Ai Tji Aj = Ai Tij Aj .

Consequently, the left hand side of equation (2.1.25) is real and the eigenvalue őĽ can be represented as the
ratio of two real quantities.
Assume that őĽ(1) and őĽ(2) are two distinct eigenvalues which produce the unit eigenvectors LŐā1 and LŐā2
with components `i1 and `i2 , i = 1, 2, 3 respectively. We then have

Tij `j1 = őĽ(1) `i1 and Tij `j2 = őĽ(2) `i2 . (2.1.26)

## Consider the products

őĽ(1) `i1 `i2 = Tij `j1 `i2 ,
(2.1.27)
őĽ(2) `i1 `i2 = `i1 Tij `j2 = `j1 Tji `i2 .
and subtract these equations. We find that

## [őĽ(1) ‚ąí őĽ(2) ]`i1 `i2 = 0. (2.1.28)

By hypothesis, őĽ(1) is different from őĽ(2) and consequently the inner product `i1 `i2 must be zero. Therefore,
the eigenvectors corresponding to distinct eigenvalues are orthogonal.
181

Therefore, associated with distinct eigenvalues őĽ(i) , i = 1, 2, 3 there are unit eigenvectors

## The unit eigenvectors satisfy the relations

Tij `j1 = őĽ(1) `i1 Tij `j2 = őĽ(2) `i2 Tij `j3 = őĽ(3) `i3

## Consider the transformation

xi = `ij xj or xm = `mj xj

which represents a rotation of axes, where `ij are the direction cosines from the eigenvectors of Tij . This is a
linear transformation where the `ij satisfy equation (2.1.23). Such a transformation is called an orthogonal
transformation. In the new x coordinate system, called principal axes, we have

‚ąāxi ‚ąāxj
T mn = Tij = Tij `im `jn = őĽ(n) `in `im = őĽ(n) őīmn (no sum on n). (2.1.24)
‚ąāxm ‚ąāxn

This equation shows that in the barred coordinate system there are the components
Ô£ģ Ô£Ļ
 őĽ(1) 0 0
T mn =Ô£į 0 őĽ(2) 0 Ô£Ľ.
0 0 őĽ(3)

That is, along the principal axes the tensor components Tij are transformed to the components T ij where
T ij = 0 for i 6= j. The elements T (i)(i) , i not summed, represent the eigenvalues of the transformation
(2.1.19).
182

EXERCISE 2.1

## I 1. In cylindrical coordinates (r, őł, z) with f = f (r, őł, z) find the gradient of f.

~ = A(r,
I 2. In cylindrical coordinates (r, őł, z) with A ~ őł, z) find div A.
~

~ = A(r,
I 3. In cylindrical coordinates (r, őł, z) for A ~ őł, z) find curl A.
~

## I 5. In spherical coordinates (ŌĀ, őł, ŌÜ) with f = f (ŌĀ, őł, ŌÜ) find the gradient of f.

~ = A(ŌĀ,
I 6. In spherical coordinates (ŌĀ, őł, ŌÜ) with A ~ őł, ŌÜ) find div A.
~

~ = A(ŌĀ,
I 7. In spherical coordinates (ŌĀ, őł, ŌÜ) for A ~ őł, ŌÜ) find curl A.
~

## I 8. In spherical coordinates (ŌĀ, őł, ŌÜ) for f = f (ŌĀ, őł, ŌÜ) find ‚ąá2 f.

I 9. Let ~r = x eŐā1 + y eŐā2 + z eŐā3 denote the position vector of a variable point (x, y, z) in Cartesian coordinates.
Let r = |~r| denote the distance of this point from the origin. Find in terms of ~r and r:

1
r

## where ŌÜ = ŌÜ(r) is an arbitrary function of r.

I 10. Let ~r = x eŐā1 +y eŐā2 +z eŐā3 denote the position vector of a variable point (x, y, z) in Cartesian coordinates.
Let r = |~r| denote the distance of this point from the origin. Find:

(a) div (~r) (b) div (rm~r) (c) div (r‚ąí3 ~r) (d) div (ŌÜ ~r)

## where ŌÜ = ŌÜ(r) is an arbitrary function or r.

I 11. Let ~r = x eŐā1 + y eŐā2 + z eŐā3 denote the position vector of a variable point (x, y, z) in Cartesian
coordinates. Let r = |~r| denote the distance of this point from the origin. Find: (a) curl ~r (b) curl (ŌÜ ~r)
where ŌÜ = ŌÜ(r) is an arbitrary function of r.
~
I 12. Expand and simplify the representation for curl (curl A).

I 13. Show that the curl of the gradient is zero in generalized coordinates.

I 14. Write out the physical components associated with the gradient of ŌÜ = ŌÜ(x1 , x2 , x3 ).

## I 15. Show that

1 ‚ąā ‚ąö im  1 ‚ąā ‚ąö i 
g im Ai,m = ‚ąö i
gg Am = Ai,i = ‚ąö gA .
g ‚ąāx g ‚ąāxi
183
p
I 16. Let r = (~r ¬∑ ~r)1/2 = x2 + y 2 + z 2 ) and calculate (a) ‚ąá2 (r) (b) ‚ąá2 (1/r) (c) ‚ąá2 (r2 ) (d) ‚ąá2 (1/r2 )
1
I 17. Given the tensor equations Dij = 2 (vi,j + vj,i ), i, j = 1, 2, 3. Let v(1), v(2), v(3) denote the
physical components of v1 , v2 , v3 and let D(ij) denote the physical components associated with Dij . Assume
the coordinate system (x1 , x2 , x3 ) is orthogonal with metric coefficients g(i)(i) = h2i , i = 1, 2, 3 and gij = 0
for i 6= j.
(a) Find expressions for the physical components D(11), D(22) and D(33) in terms of the physical compo-
1 ‚ąāV (i) X V (j) ‚ąāhi
nents v(i), i = 1, 2, 3. Answer: D(ii) = + no sum on i.
hi ‚ąāxi hi hj ‚ąāxj
j6=i

## (b) Find expressions for the physical components

 D(12),
 D(13) and D(23)
 in terms
 of the physical compo-
1 hi ‚ąā V (i) hj ‚ąā V (j)
nents v(i), i = 1, 2, 3. Answer: D(ij) = +
2 hj ‚ąāxj hi hi ‚ąāxi hj
I 18. Write out the tensor equations in problem 17 in Cartesian coordinates.

## I 21. Express the vector equation (őĽ + 2¬Ķ)‚ąáő¶ ‚ąí 2¬Ķ‚ąá √ó ~ŌČ + F~ = ~0 in tensor form.

I 22. Write out the equations in problem 21 for a generalized orthogonal coordinate system in terms of
physical components.

## I 24. Write out the equations in problem 22 for spherical coordinates.

I 25. Use equation (2.1.4) to represent the divergence in parabolic cylindrical coordinates (őĺ, ő∑, z).

I 26. Use equation (2.1.4) to represent the divergence in parabolic coordinates (őĺ, ő∑, ŌÜ).

I 27. Use equation (2.1.4) to represent the divergence in elliptic cylindrical coordinates (őĺ, ő∑, z).

## Change the given equations from a vector notation to a tensor notation.

I 28. ~ = ~v ‚ąá ¬∑ A
B ~ + (‚ąá ¬∑ ~v ) A
~
~ ~ ~
I 29.
d ~ ~ ~ = dA ¬∑ (B
[A ¬∑ (B √ó C)] ~ √ó C) ~ ¬∑ ( dB √ó C)
~ +A ~ √ó dC )
~ ¬∑ (B
~ +A
dt dt dt dt
d~v ‚ąā~v
I 30. = + (~v ¬∑ ‚ąá)~v
dt ‚ąāt
1 ‚ąāH~
I 31. = ‚ąícurl E ~
c ‚ąāt
dB~
I 32. ‚ąí (B~ ¬∑ ‚ąá)~v + B(‚ąá
~ ¬∑ ~v ) = ~0
dt
184

## I 33. ijk Bk,j + F i = 0

I 34. gij jkl Bl,k + Fi = 0
‚ąā%
I 35. + (%vi ), i = 0
‚ąāt
‚ąāvi ‚ąāvi ‚ąāP ‚ąā 2 vi
I 36. %( + vm m ) = ‚ąí i + ¬Ķ m m + Fi
‚ąāt ‚ąāx ‚ąāx ‚ąāx ‚ąāx

Z Z
I 37. The moment of inertia of an area or second moment of area is defined by Iij = (ym ym őīij ‚ąíyi yj ) dA
A
where dA is an element of area. Calculate
 1 the3 moment of inertia
 Iij , i, j = 1, 2 for the triangle illustrated in
1 2 2
12 bh ‚ąí 24 b h
the figure 2.1-1 and show that Iij = 1 2 2 1 3 .
‚ąí 24 b h 12 b h

## Figure 2.1-1 Moments of inertia for a triangle

I 38. Use the results from problem 37 and rotate the axes in figure 2.1-1 through an angle őł to a barred
system of coordinates.
(a) Show that in the barred system of coordinates
   
I11 + I22 I11 ‚ąí I22
I 11 = + cos 2őł + I12 sin 2őł
2 2
 
I11 ‚ąí I22
I 12 = I 21 =‚ąí sin 2őł + I12 cos 2őł
2
   
I11 + I22 I11 ‚ąí I22
I 22 = ‚ąí cos 2őł ‚ąí I12 sin 2őł
2 2

## (b) For what value of őł will I 11 have a maximum value?

(c) Show that when I 11 is a maximum, we will have I 22 a minimum and I 12 = I 21 = 0.
185

## Figure 2.1-2 Mohr‚Äôs circle

I 39. Otto Mohr1 gave the following physical interpretation to the results obtained in problem 38:
‚ÄĘ Plot the points A(I11 , I12 ) and B(I22 , ‚ąíI12 ) as illustrated in the figure 2.1-2
‚ÄĘ Draw the line AB and calculate the point C where this line intersects the I axes. Show the point C
has the coordinates
I11 + I22
( , 0)
2
‚ÄĘ Calculate the radius of the circle with center at the point C and with diagonal AB and show this
 2
I11 ‚ąí I22 2
r= + I12
2
‚ÄĘ Show the maximum and minimum values of I occur where the constructed circle intersects the I axes.
I11 + I22 I11 + I22
Show that Imax = I 11 = +r Imin = I 22 = ‚ąí r.
2 2
 
I11 I12
I 40. Show directly that the eigenvalues of the symmetric matrix Iij = are őĽ1 = Imax and
I21 I22
őĽ2 = Imin where Imax and Imin are given in problem 39.

I 41. Find the principal axes and moments of inertia for the triangle given in problem 37 and summarize
your results from problems 37,38,39, and 40.

## I 42. Verify for orthogonal coordinates the relations

h i 3
X e(i)jk ‚ąā(h(k) A(k))
~ ¬∑ eŐā(i) =
‚ąá√óA h(i)
h1 h2 h3 ‚ąāxj
k=1

or
h1 eŐā1 h2 eŐā2 h3 eŐā3
1
~=
‚ąá√óA ‚ąā ‚ąā ‚ąā .
h1 h2 h3 ‚ąāx1 ‚ąāx2 ‚ąāx3
h1 A(1) h2 A(2) h3 A(3)

## I 43. Verify for orthogonal coordinates the relation

" #
h i X3
h(i) ‚ąā h2(r) ‚ąā(h(m) A(m))
~ ¬∑ eŐā(i) =
‚ąá √ó (‚ąá √ó A) e(i)jr ersm
m=1
h1 h2 h3 ‚ąāxj h1 h2 h3 ‚ąāxs

1
Christian Otto Mohr (1835-1918) German civil engineer.
186

## I 44. Verify for orthogonal coordinates the relation

h  i   
1 ‚ąā 1 ‚ąā(h2 h3 A(1)) ‚ąā(h1 h3 A(2)) ‚ąā(h1 h2 A(3))
‚ąá ‚ąá¬∑A ~ ¬∑ eŐā(i) = + +
h(i) ‚ąāx(i) h1 h2 h3 ‚ąāx1 ‚ąāx2 ‚ąāx3
I 45. Verify the relation
h i 3
X  
~ ¬∑ ‚ąá)B~ ¬∑ eŐā(i) = A(k) ‚ąāB(i) X B(k) ‚ąāh(i) ‚ąāhk
(A + A(i) ‚ąí A(k)
h(k) ‚ąāxk hk h(i) ‚ąāxk ‚ąāx(i)
k=1 k6=i

## I 46. The Gauss divergence theorem is written

ZZZ  1  ZZ
‚ąāF ‚ąāF 2 ‚ąāF 3 
+ + dŌĄ = n1 F 1 + n2 F 2 + n3 F 3 dŌÉ
V ‚ąāx ‚ąāy ‚ąāz S
where V is the volume within a simple closed surface S. Here it is assumed that F i = F i (x, y, z) are
continuous functions with continuous first order derivatives throughout V and ni are the direction cosines
of the outward normal to S, dŌĄ is an element of volume and dŌÉ is an element of surface area.
(a) Show that in a Cartesian coordinate system
‚ąāF 1 ‚ąāF 2 ‚ąāF 3
F,ii = + +
‚ąāx ‚ąāy ‚ąāz
ZZZ ZZ
and that the tensor form of this theorem is F,ii dŌĄ = F i ni dŌÉ.
V S
(b) Write the vector form of this theorem.
(c) Show that if we define
‚ąāu ‚ąāv
ur = , vr = and Fr = grm F m = uvr
‚ąāxr ‚ąāxr
then F,ii = g im Fi,m = g im (uvi,m + um vi )
(d) Show that another form of the Gauss divergence theorem is
ZZZ ZZ ZZZ
im m
g um vi dŌĄ = uvm n dŌÉ ‚ąí ug im vi,m dŌĄ
V S V
Write out the above equation in Cartesian coordinates.
Ô£ę Ô£∂
1 1 2
I 47. Find the eigenvalues and eigenvectors associated with the matrix A = Ô£≠ 1 2 1Ô£ł.
2 1 1
Show that the eigenvectors are orthogonal.
Ô£ę Ô£∂
1 2 1
I 48. Find the eigenvalues and eigenvectors associated with the matrix A = Ô£≠ 2 1 0Ô£ł.
1 0 1
Show that the eigenvectors are orthogonal.
Ô£ę Ô£∂
1 1 0
I 49. Find the eigenvalues and eigenvectors associated with the matrix A = Ô£≠ 1 1 1Ô£ł.
0 1 1
Show that the eigenvectors are orthogonal.

I 50. The harmonic and biharmonic functions or potential functions occur in the mathematical modeling
of many physical problems. Any solution of Laplace‚Äôs equation ‚ąá2 ő¶ = 0 is called a harmonic function and
any solution of the biharmonic equation ‚ąá4 ő¶ = 0 is called a biharmonic function.
(a) Expand the Laplace equation in Cartesian, cylindrical and spherical coordinates.
(b) Expand the biharmonic equation in two dimensional Cartesian and polar coordinates.
Hint: Consider ‚ąá4 ő¶ = ‚ąá2 (‚ąá2 ő¶). In Cartesian coordinates ‚ąá2 ő¶ = ő¶,ii and ‚ąá4 ő¶ = ő¶,iijj .
187

¬ß2.2 DYNAMICS

Dynamics is concerned with studying the motion of particles and rigid bodies. By studying the motion
of a single hypothetical particle, one can discern the motion of a system of particles. This in turn leads to
the study of the motion of individual points in a continuous deformable medium.

Particle Movement

The trajectory of a particle in a generalized coordinate system is described by the parametric equations

xi = xi (t), i = 1, . . . , N (2.2.1)

where t is a time parameter. If the coordinates are changed to a barred system by introducing a coordinate
transformation
xi = xi (x1 , x2 , . . . , xN ), i = 1, . . . , N

## The generalized velocity of the particle in the unbarred system is defined by

dxi
vi = , i = 1, . . . , N. (2.2.3)
dt

By the chain rule differentiation of the transformation equations (2.2.2) one can verify that the velocity in
the barred system is
dxr ‚ąāxr dxj ‚ąāxr j
vr = = j
= v , r = 1, . . . , N. (2.2.4)
dt ‚ąāx dt ‚ąāxj
Consequently, the generalized velocity v i is a first order contravariant tensor. The speed of the particle is
obtained from the magnitude of the velocity and is

v 2 = gij v i v j .

The generalized acceleration f i of the particle is defined as the intrinsic derivative of the generalized velocity.
The generalized acceleration has the form
    m n
iőīv i i dx
n
dv i i m n d2 xi i dx dx
f = = v,n = + v v = + (2.2.5)
őīt dt dt mn dt2 m n dt dt

## and the magnitude of the acceleration is

f 2 = gij f i f j .
188

## Figure 2.2-1 Tangent, normal and binormal to point P on curve.

Frenet-Serret Formulas

The parametric equations (2.2.1) describe a curve in our generalized space. With reference to the figure
2.2-1 we wish to define at each point P of the curve the following orthogonal unit vectors:

## T i = unit tangent vector at each point P.

N i = unit normal vector at each point P.
B i = unit binormal vector at each point P.

These vectors define the osculating, normal and rectifying planes illustrated in the figure 2.2-1.
In the generalized coordinates the arc length squared is

## ds2 = gij dxi dxj .

dxi
Define T i = ds as the tangent vector to the parametric curve defined by equation (2.2.1). This vector is a
unit tangent vector because if we write the element of arc length squared in the form

dxi dxj
1 = gij = gij T i T j , (2.2.6)
ds ds

we obtain the generalized dot product for T i . This generalized dot product implies that the tangent vector
is a unit vector. Differentiating the equation (2.2.6) intrinsically with respect to arc length s along the curve
produces
őīT m n őīT n
gmn T + gmn T m = 0,
őīs őīs
which simplifies to
őīT m
gmn T n = 0. (2.2.7)
őīs
189

őīT m
The equation (2.2.7) is a statement that the vector őīs is orthogonal to the vector T m . The unit normal
vector is defined as
1 őīT i 1 őīTi
Ni = or Ni = , (2.2.8)
őļ őīs őļ őīs
where őļ is a scalar called the curvature and is chosen such that the magnitude of N i is unity. The reciprocal
1
of the curvature is R = őļ, which is called the radius of curvature. The curvature of a straight line is zero
while the curvature of a circle is a constant. The curvature measures the rate of change of the tangent vector
as the arc length varies.
The equation (2.2.7) can be expressed in the form

gij T i N j = 0. (2.2.9)

Taking the intrinsic derivative of equation (2.2.9) with respect to the arc length s produces

őīN j őīT i j
gij T i + gij N =0
őīs őīs
or
őīN j őīT i j
gij T i = ‚ąígij N = ‚ąíőļgij N i N j = ‚ąíőļ. (2.2.10)
őīs őīs
The generalized dot product can be written

gij T i T j = 1,

## and consequently we can express equation (2.2.10) in the form

j
 
i őīN őīN j
gij T = ‚ąíőļgij T Ti j
or gij T i
+ őļT j = 0. (2.2.11)
őīs őīs

## Consequently, the vector

őīN j
+ őļT j (2.2.12)
őīs
is orthogonal to T i . In a similar manner, we can use the relation gij N i N j = 1 and differentiate intrinsically
with respect to the arc length s to show that

őīN j
gij N i = 0.
őīs
This in turn can be expressed in the form
 
i őīN j
gij N + őļT j = 0.
őīs

This form of the equation implies that the vector represented in equation (2.2.12) is also orthogonal to the
unit normal N i . We define the unit binormal vector as
   
i 1 őīN i i 1 őīNi
B = + őļT or Bi = + őļTi (2.2.13)
ŌĄ őīs ŌĄ őīs

where ŌĄ is a scalar called the torsion. The torsion is chosen such that the binormal vector is a unit vector.
The torsion measures the rate of change of the osculating plane and consequently, the torsion ŌĄ is a measure
190

of the twisting of the curve out of a plane. The value ŌĄ = 0 corresponds to a plane curve. The vectors
T i , N i , B i , i = 1, 2, 3 satisfy the cross product relation

B i = ijk Tj Nk .

## If we differentiate this relation intrinsically with respect to arc length s we find

 
őīB i őīNk őīTj
= ijk Tj + Nk
őīs őīs őīs
= ijk [Tj (ŌĄ Bk ‚ąí őļTk ) + őļNj Nk ] (2.2.14)

= ŌĄ ijk Tj Bk = ‚ąíŌĄ ikj Bk Tj = ‚ąíŌĄ N i .

The relations (2.2.8),(2.2.13) and (2.2.14) are now summarized and written

őīT i
= őļN i
őīs
őīN i
= ŌĄ B i ‚ąí őļT i (2.2.15)
őīs
őīB i
= ‚ąíŌĄ N i .
őīs
These equations are known as the Frenet-Serret formulas of differential geometry.

## Chain rule differentiation of the generalized velocity is expressible in the form

dxi dxi ds
vi = = = T i v, (2.2.16)
dt ds dt
ds
where v = dt is the speed of the particle and is the magnitude of v i . The vector T i is the unit tangent vector
to the trajectory curve at the time t. The equation (2.2.16) is a statement of the fact that the velocity of a
particle is always in the direction of the tangent vector to the curve and has the speed v.
By chain rule differentiation, the generalized acceleration is expressible in the form

őīv r dv r őīT r
fr = = T +v
őīt dt őīt
dv r őīT r ds
= T +v (2.2.17)
dt őīs dt
dv r
= T + őļv 2 N r .
dt

The equation (2.2.17) states that the acceleration lies in the osculating plane. Further, the equation (2.2.17)
dv
indicates that the tangential component of the acceleration is dt , while the normal component of the accel-
2
eration is őļv .
191

## Work and Potential Energy

Define M as the constant mass of the particle as it moves along the curve defined by equation (2.2.1).
Also let Qr denote the components of a force vector (in appropriate units of measurements) which acts upon
the particle. Newton‚Äôs second law of motion can then be expressed in the form

Qr = M f r or Qr = M fr . (2.2.18)

The work done W in moving a particle from a point P0 to a point P1 along a curve xr = xr (t), r = 1, 2, 3,
with parameter t, is represented by a summation of the tangential components of the forces acting along the
path and is defined as the line integral
Z P1 Z P1 Z t1 Z t1
dxr r dxr
W = Qr ds = Qr dx = Qr dt = Qr v r dt (2.2.19)
P0 ds P0 t0 dt t0

where Qr = grs Qs is the covariant form of the force vector, t is the time parameter and s is arc length along
the curve.

Conservative Systems

If the force vector is conservative it means that the force is derivable from a scalar potential function

‚ąāV
V = V (x1 , x2 , . . . , xN ) such that Qr = ‚ąíV ,r = ‚ąí , r = 1, . . . , N. (2.2.20)
‚ąāxr

In this case the equation (2.2.19) can be integrated and we find that to within an additive constant we will
have V = ‚ąíW. The potential function V is called the potential energy of the particle and the work done
becomes the change in potential energy between the starting and end points and is independent of the path
connecting the points.

## Lagrange‚Äôs Equations of Motion

The kinetic energy T of the particle is defined as one half the mass times the velocity squared and can
be expressed in any of the forms
 2
1 ds 1 1 1
T = M = M v 2 = M gmn v m v n = M gmn xŐám xŐán , (2.2.21)
2 dt 2 2 2

where the dot notation denotes differentiation with respect to time. It is an easy exercise to calculate the
derivatives
‚ąāT
r
= M grmxŐám
 ‚ąā xŐá   
d ‚ąāT m ‚ąāgrm n m
= M g rm xŐą + xŐá xŐá (2.2.22)
dt ‚ąā xŐár ‚ąāxn
‚ąāT 1 ‚ąāgmn m n
r
= M xŐá xŐá ,
‚ąāx 2 ‚ąāxr
and thereby verify the relation
 
d ‚ąāT ‚ąāT
‚ąí = M fr = Qr , r = 1, . . . , N. (2.2.23)
dt ‚ąā xŐár ‚ąāxr
192

## This equation is called the Lagrange‚Äôs form of the equations of motion.

EXAMPLE 2.2-1. (Equations of motion in spherical coordinates) Find the Lagrange‚Äôs form of
the equations of motion in spherical coordinates.
Solution: Let x1 = ŌĀ, x2 = őł, x3 = ŌÜ then the element of arc length squared in spherical coordinates has
the form
ds2 = (dŌĀ)2 + ŌĀ2 (dőł)2 + ŌĀ2 sin2 őł(dŌÜ)2 .

The element of arc length squared can be used to construct the kinetic energy. For example,
 2
1 ds 1 h i
T = M = M (ŌĀŐá)2 + ŌĀ2 (őłŐá)2 + ŌĀ2 sin2 őł(ŌÜŐá)2 .
2 dt 2

The Lagrange form of the equations of motion of a particle are found from the relations (2.2.23) and are
calculated to be:
  h i
d ‚ąāT ‚ąāT
M f1 = Q1 = ‚ąí = M ŌĀŐą ‚ąí ŌĀ(őłŐá)2 ‚ąí ŌĀ sin2 őł(ŌÜŐá)2
dt ‚ąā ŌĀŐá ‚ąāŌĀ
     
d ‚ąāT ‚ąāT d
M f2 = Q2 = ‚ą