Beruflich Dokumente
Kultur Dokumente
Field Theory
July 9, 2010
Preface
These notes are from the transparencies written for the course “Field Theory”.
The parts related to Electrodynamics were based mainly on the book by
• J.D. Jackson Classical Electrodynamics while other books like Glassical
Electrodynamics by W. Greiner
while we have also used
• The Classical Theory of Fields by L.D. Landau & E.M. Lifshitz
• Eric Poisson’s notes
• Electrodynamics by Fulvio Melia
• Classical Electrodynamics by Walter Greiner
For the parts related to General Theory of Relativity we have used the
books
• General Relativity: An Introduction for Physicists by M.P.Hobson, G. Ef-
stathiou & A.N. Lasenby
• A first course in General Relativity by B.F. Schutz.
• Gravity: an Introduction to Einstein’s General Relativity by James B. Har-
tle
Contents
1 Electrostatics-Magnetostatics (****) . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Electrostatics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Green Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Laplace Equation in Spherical Coordinates . . . . . . . . . . . . . . . . . 5
1.4 Legendre Equation and Legendre Polynomials . . . . . . . . . . . . . . . 6
1.4.1 Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.5 Legendre Polynomials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6 Associated Legendre Functions and Spherical Harmonics . . . . . 10
1.6.1 Spherical Harmonics Ylm (θ, φ) . . . . . . . . . . . . . . . . . . . . . . 11
1.6.2 Addition Theorem for Spherical Harmonics . . . . . . . . . . . 12
1.7 Multipole Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.7.1 Monopole moment l = 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.7.2 Dipole moment l = 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.7.3 Quadrupole moment l = 2 . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.7.4 Multipole Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.8 Energy of a Charge Distribution in an External Field . . . . . . . . 18
1.8.1 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.9 Magnetostatics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.10 Biot & Savart Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.10.1 Diff. Equations for Magnetostatics & Ampere’s Law . . . 21
1.11 Vector Potential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.1 Electrostatics
The behavior of an electrostatic field can be described by two differential
equations:
∇ · E = 4πρ (1.1)
(Gauss law) and
∇×E=0 (1.2)
the latter equation being equivalent to the statement that E is the gradient
of a scalar function, the scalar potential Φ:
E = −∇Φ (1.3)
Eqns (1.1) and (1.3) can be combined into one differential equation for a single
scalar function Φ(x):
∇2 Φ = −4πρ (1.4)
This equation is called Poisson equation.
In the regions of space where there is no charge density, the scalar potential
satisfies the Laplace equation:
∇2 Φ = 0 (1.5)
ρ(x0 ) 3 0
Z
Φ(x) = d x (1.6)
|x − x0 |
Thus, we have not only shown that the potential from Coulomb’s law satisfies
Poisson’s eqn, but we have established (through the solution of Poisson’s eqn)
the important result that :
the potential from a distributed source is the superposition of the individual
potentials from the constituent parcels of charge.
We may consider situations in which ρ is comprised of N discrete charges
qi , positioned at x0i so that
N
X
ρ(x0 ) = qi δ 3 (x0 − x0i ) (1.10)
i=1
In this case the solution for the potential is a combination of terms propor-
tional to 1/|x − x0 |.
Now we can apply Poisson’s equation (1.8) for discrete charge, substituting
for ψ = 1/|x − x0 |
4πρ(x0 ) 3
Z
3 0 0
−4πδ (x − x )Φ(x ) + d x
V |x − x0 |
I
∂ 1 1 ∂Φ
= Φ 0 − da0 (1.16)
S ∂n |x − x0 | |x − x0 | ∂n0
Integrating the Dirac delta function over all values of x0 within V and for
x within the volume V yields a nonzero result
ρ(x0 ) 3 0
Z I
1 1 ∂Φ ∂ 1
Φ(x) = 0
d x + −Φ 0 da0
V |x − x | 4π S |x − x0 | ∂n0 ∂n |x − x0 |
(1.17)
• The (blue) correction term goes to zero as the surface S goes to infinity
(because S falls of faster than 1/|x − x0 |)
• If the integration volume is free of charges, then the first term of equation
(1.17) becomes zero, and the potential is determined only by the values of the
potential and the values of its derivatives at the boundary of the integration
region (the surface S).
• Physical experience leads us to believe that specification of the po-
tential on a closed surface defines a unique potential problem. This is called
Dirichlet problem or Dirichlet boundary conditions.
• Similarly it is plausible that specification of the electric field (normal
derivative of the potential) everywhere on the surface (corresponding to a
4 1 Electrostatics-Magnetostatics (****)
where hΦiS is the average value of the potential over the whole surface
I
1
hΦiS ≡ Φ(x0 )da0 (1.28)
S S
In most cases S is extremely large (or even infinite), in which case hΦiS → 0.
The physical meaning of F (x, x0 ) : it is a solution of the Laplace eqn
inside V and so represents the potential of charges external to the volume V
chosen as to satisfy the homogeneous BC of zero potential on the surface S.
• It is important to understand that no matter how the source is dis-
tributed, finding the Green function is completely independent or ρ(x0 ).
• G(x, x0 ) depends exclusively on the geometry of the problem, is a “tem-
plate”, potential and not the actual potential for a given physical problem.
• In other words G(x, x0 ) is the potential due to a unit charge, positioned
arbitrarily within the surface S consistent either with GD = 0 or ∂GN /∂n =
−4π/S on the surface.
• The “true potential” is a convolution of this template with the given
ρ(x0 ).
1 d2 U 1 d2 Q
2 2 1 d dP
r sin θ + sin θ + =0 (1.31)
U dr2 r2 sin θP dθ dθ Q dφ2
We see that the therms depending on φ have been isolated and we can set
1 d2 Q
= −m2 (1.32)
Q dφ2
with solution
Q = e±imφ (1.33)
m2
1 d dP
sin θ + l(l + 1) − P =0 (1.34)
r2 sin θ dθ dθ sin2 θ
d2 U l(l + 1)
2
− U =0 (1.35)
dr r2
The radial equation will have a solution
m2
d 2 dP
(1 − x ) + l(l + 1) − P =0 (1.37)
dx dx 1 − x2
This is the generalized Legendre equation and its solutions are the associated
Legendre functions.
We will consider the solution of (1.54) for m2 = 0
d dP
(1 − x2 ) + l(l + 1)P = 0 (1.38)
dx dx
The solution should be single valued, finite, and continuous on the interval
−1 ≤ x ≤ 1 in order that it represents a physical potential.
The solution can be found in the form of a power series
∞
X
P (x) = xk aj xj (1.39)
j=0
1.4 Legendre Equation and Legendre Polynomials 7
(k + j)(k + j + 1) − l(l + 1)
aj+2 = aj (1.40)
(k + j + 1)(k + j + 2)
while for j = 0, 1 we find that
P0 (x) = 1
P1 (x) = x
1
P2 (x) = (3x2 − 1) (1.43)
2
1
P3 (x) = (5x3 − 3x)
2
1
P4 (x) = (35x4 − 30x2 + 3)
8
The Legendre polynomials can be taken from Rodrigue’s formula:
1 dl 2
Pl (x) = (x − 1)l (1.44)
2l l! dxl
The Legendre polynomials form a complete orthonormal set of functions
on the interval −1 ≤ x ≤ 1 (prove it)
Z 1
2
Pl0 (x)Pl (x) = δl 0 l (1.45)
−1 2l + 1
8 1 Electrostatics-Magnetostatics (****)
where (how?)
Z 1
2l + 1
Al = f (x)Pl (x)dx (1.47)
2 −1
Thus for problems with azimuthal symmetry i.e. m = 0 the general solution
is:
∞ h
X i
Φ(r, θ) = Al rl + Bl r−(l+1) Pl (cos θ) (1.48)
l=0
where the coefficients Al [it is not the same as in eqn (1.47)] and Bl can be
determined from the boundary conditions.
1.4.1 Example
If, for example V (θ) = ±V on the two hemispheres then the coefficients
can be derived easily and the potential inside the sphere is (how?):
3 r 7 r 3 11 r 5
Φ(r, θ) = V P1 (cos θ) − P3 (cos θ) + P5 (cos θ) − . . .
2 R 8 R 16 R
(1.51)
To find the potential outside the sphere we merely replace (r/R)l by
(R/r)l+1 and the resulting potential will be (how?):
2 " 2 #
3 R 7 R
Φ(r, θ) = V P1 (cos θ) − P3 (cos θ) + . . . (1.52)
2 r 12 r
1.5 Legendre Polynomials 9
Fig. 1.1.
where r< (r> ) is the smaller (larger) of |x| and |x0 | and γ is the angle between
|x| and |x0 |.
Fig. 1.2.
m2
d dP
(1 − x2 ) + l(l + 1) − P =0 (1.54)
dx dx 1 − x2
dm
Plm (x) = (−1)m (1 − x2 )m/2 Pl (x) (1.55)
dxm
If Rodrigues’ formula is used an expression valid for both positive and negative
m is obtained:
(−1)m 2 m/2 d
m
Plm (x) = (1 − x ) Pl (x) (1.56)
2l l! dxm
(l − m)! m
Pl−m (x) = (−1)m P (x) (1.57)
(l + m)! l
For fixed m the functions Plm (x) form an orthonormal set in the index l on
the interval −1 ≤ x ≤ 1. The orthogonality relation is
Z 1
2 (l + m)!
Plm m
0 (x)Pl (x)dx = δl 0 l (1.58)
−1 2l + 1 (l − m)!
We have found that Qm (φ) = eimφ , this function forms a complete set of
orthogonal functions in the index m on the interval 0 ≤ φ ≤ 2π.
The product Plm Qm forms also a complete orthonormal set on the surface
of the unit sphere in the two indices l, m.
From the normalization condition (1.58) we can conclude that the suitably
normalized functions, denoted by Ylm (θ, φ), are :
1.6 Associated Legendre Functions and Spherical Harmonics 11
s
2l + 1 (l − m)! m
Ylm (θ, φ) = P (cos θ)eimφ (1.59)
4π (l + m)! l
and also
∗
Yl,−m (θ, φ) = (−1)m Ylm (θ, φ) (1.60)
r
1 35
l = 3 Y33 = − sin3 θ e3iφ
4 4π
r
1 105
Y32 = sin2 θ cos θe2iφ
4 2π
r
1 21
Y31 = − sin θ(5 cos2 θ − 1) eiφ
4 4π
r
1 7
sin θ 5 cos3 θ − 3 cos θ
Y30 =
2 4π
(1.65)
Fig. 1.3. Schematic representation of Ylm on the unit sphere. Ylm is equal to 0 along
m great circles passing through the poles, and along l − m circles of equal latitude.
The function changes sign each time it crosses one of these lines.
Fig. 1.4.
l
4π X
∗
Pl (cos γ) = Ylm (θ0 , φ0 )Ylm (θ, φ) (1.66)
l(l + 1)
m=−l
The addition theorem offer the possibility to extend the expansion valid
for a point charge (axially symmetric distribution) to an arbitrary charge
distribution.
By substituting (1.66) into (1.53) we obtain
∞ X
l l
1 X 1 r<
= 4π Y ∗ (θ0 , φ0 )Ylm (θ, φ) (1.68)
0
|x − x | 2l + 1 r> lm
l+1
l=0 m=−l
This equation gives the potential in a completely factorized form in the coor-
dinates x and x0 . This is useful in any integration over charge densities, when
one is the variable of integration and the other the observation point.
Fig. 1.5.
ρ(x0 ) 3 0
Z
Φ(x) = d x (1.70)
|x − x0 |
∗ 1 q
Φ(x) = 4πq00 Y00 + · · · = + . . . (1.74)
r r
NOTE:
The moments with m ≥ 0 are connected (for real charge density) too the
moments with m < 0 through
∗
ql,−m = (−1)m qlm (1.75)
Z
∗
q22 = ρ(x)r02 Y22
∗
(θ0 , φ0 )d3 x0
r Z
1 15 2
= ρ(x) [r0 sin θ0 (cos φ0 − i sin φ0 )] d3 x0
4 2π
r Z
1 15
= ρ(x)(x0 − iy 0 )2 d3 x0
4 2π
1 02
because (x0 − iy 0 )2 = (3x − r02 ) − 6ix0 y 0 − (3y 02 − r02 )
r 3
1 15
= (Q11 − 2iQ12 − Q22 ) (1.79)
12 2π
where Qij is the traceless (why?) quadrupole moment tensor:
Z
3x0i x0j − r2 δij ρ(x0 )d3 x0
Qij = (1.80)
Analogously
Z r Z
∗ 0 02 ∗ 15
q21 = ρ(x )r Y21 (θ0 , φ0 )d3 x0 =− ρ(x0 )z 0 (x0 − iy 0 )d3 x0
8π
r
1 15
=− (Q13 − iQ23 ) (1.81)
3 8π
and
1.7 Multipole Expansion 17
Z r Z
∗ 1 5
q20 = ρ(x0 )r02 Y20
∗
(θ0 , φ0 )d3 x0 = ρ(x0 )(3z 0 − r02 )d3 x0
2 4π
r
1 5
= Q33 (1.82)
2 4π
From eqn (1.60) we can get the moments with m < 0 through
∗
ql,−m = (−1)m qlm (1.83)
The electric field components for a given multipole can be expressed most
easily in terms of spherical coordinates. From E = −∇Φ and (1.69) for fixed
(l, m) we get(how?):
4π(l + 1) 1
Er = qlm l+2 Ylm
2l + 1 r
4π 1 ∂
Eθ = − qlm l+2 Ylm (1.85)
2l + 1 r ∂θ
4π 1 1 ∂
Eφ = − qlm l+2 Ylm
2l + 1 r sin θ ∂φ
For a dipole p along the z-axis they reduce to:
2p cos θ p sin θ
Er = , Eθ = , Eφ = 0 (1.86)
r3 r3
Fig. 1.6.
When this inserted into (1.89) the energy takes the form:
3 3
1 XX ∂Ej
W = qΦ(0) − p · E(0) − Qij (0) + . . . (1.93)
6 i=1 j=1 ∂xi
Notice, that:
• the total charge interacts with the potential,
• the dipole moment with the electric field,
• the quadrupole with the electric field gradient, etc
1.8.1 Examples
EXAMPLE 1 : Show that for a uniform charged sphere all multipole mo-
ments vanish except q00 .
If the sphere has a radius R0 and constant charge density ρ(x) = ρ then
Z Z R0
Rl+3
Z
∗
qlm =ρ r0l Ylm
∗
(θ0 , φ0 )r02 dr0 dΩ 0 = ρ 0 Y ∗ (θ0 , φ0 )dΩ 0
Ω0 0 l + 3 Ω 0 lm
√
but since Y00 = 1/ 4π we have from the orthogonality relation
Rl+3 √ Rl+3 √
Z
∗ ∗
qlm =ρ 0 4π Ylm Y00 dΩ 0 = ρ 0 4πδl0 δm0
l+3 Ω0 l+3
EXAMPLE 2: Perform multipole decomposition of a uniform charge dis-
tribution whose surface is a weakly deformed sphere:
2
!
X
∗
R = R0 1 + a2m Y2m (θ, φ) , |a2m | << 1
m=−2
Z Z R(θ 0 ,φ0 )
∗
qlm = ρr0l Ylm
∗ 02 0
r dr dΩ 0
Ω0 0
2
!l+3
Rl+3
Z X
∗
=ρ 0 Ylm (θ0 , φ0 ) 1+ a∗2m Y2m
∗
(θ0 , φ0 ) dΩ 0
l+3 Ω0 m=−2
20 1 Electrostatics-Magnetostatics (****)
2
!
Rl+3
Z X
∗ ∗
qlm ≈ρ 0 Ylm (θ0 , φ0 ) 1 + (l + 3) a∗2m Y2m
∗
(θ0 , φ0 ) dΩ 0
l+3 Ω0 m=−2
Apart from the monopole moment q00 of the previous example, we find
2
R0l+3 X Z X
∗
qlm =ρ (l + 3) a∗2µ ∗
Ylm Y2µ dΩ 0 = ρR0l+3 a∗2µ δmµ δl2 (l > 0)
l+3 µ=−2 Ω 0
µ
∗
Thus the nonvanishing multipole moments are : q2m = ρR05 a∗2m .
1.9 Magnetostatics
• There is a radical difference between magnetostatics and electrostatics :
there are no free magnetic charges
• The basic entity in magnetic studies is the magnetic dipole.
• In the presence of magnetic materials the dipole tends to align itself in
a certain direction. That is by definition the direction of the magnetic flux
density , denoted by B (also called magnetic induction).
• The magnitude of the flux density can be defined by the magnetic
torque N exerted on the magnetic dipole:
N=µ×B (1.94)
∂ρ/∂t + ∇ · J = 0 (1.95)
∇·J=0 (1.96)
Fig. 1.7.
The basic law (1.97) for the magnetic induction can be written down in general
form for a current density J(x):
(x − x0 ) 3 0
Z
1
B(x) = J(x0 ) × d x (1.99)
c |x − x0 |3
This expression for B(x) is the magnetic analog of electric field in terms of
the charge density:
(x − x0 ) 3 0
Z
1
E(x) = ρ(x0 ) d x (1.100)
c |x − x0 |3
(x − x0 )
1 1 r
= −∇ as ∇ =− 3 (1.101)
|x − x0 |3 |x − x0 | r r
J(x0 ) 3 0
Z
1
B(x) = ∇ × d x (1.102)
c |x − x0 |
∇·B=0 (1.103)
J(x0 ) 3 0
Z
1
∇×B= ∇×∇× d x (1.104)
c |x − x0 |
1
which for steady-state phenomena (∇ · J = 0) reduces to (how?)
4π
∇×B= J (1.105)
c
• This is the 2nd equation of magnetostatics and corresponds to ∇ · E =
4πρ in electrostatics.
The integral equivalent of (1.105) is called Ampere’s law It can be ob-
tained by applying Stokes’s theorem to the integral of the normal component
of (1.105) over the open surface S bounded by a closed curve C. Thus
Z Z
4π
∇ × B · n da = J · nda (1.106)
S c S
is transformed into I Z
4π
B · d` = J · nda (1.107)
C c S
Since the surface integral of the current density is the total current I passing
through the closed curve C, Ampére’s law can be written in the form:
I
4π
B · d` = I (1.108)
C c
1
Remember : ∇ × (∇ × A) = ∇ (∇ · A) − ∇2 A
1.11 Vector Potential 23
Fig. 1.8.
4π
∇2 A = − J (1.114)
c
The solution for A in unbounded space is (1.111) with Ψ =constant:
J(x0 ) 3 0
Z
1
A(x) = d x (1.115)
c |x − x0 |
2
Classical Field Theory: Maxwell Equations
2.1 Introduction
• Electrostatics and Magnetostatics deal with steady-state problems in
electricity and in magnetism.
• The almost independent nature of electric and magnetic fields phenomena
disappears when we consider time-dependent problems.
• Time varying magnetic fields give rise to electric fields and vice-versa. We
then must speak of electromagnetic fields rather than electric or magnetic
fields.
• The interconnection between electric and magnetic fields and their essen-
tial sameness becomes clear only within the framework of Special Theory
of Relativity
Fig. 2.1.
Fig. 2.2.
If the circuit C is moving with a velocity v in some direction the total time
derivative in eqn (2.4) must take into account the motion e.g. convective
derivative
d ∂
= + v · ∇ and
dt ∂t
dB ∂B ∂B
= + (v · ∇) B = + ∇ × (B × v) + v (∇ · B)
dt ∂t ∂t
The flux through the circuit may change because:
• the flux changes with time at a point
• the translation of the circuit changes the location of the boundary.
Eqn (2.4) can be written as (why?):
I Z
∂B
[E0 − k (v × B)] · d` = −k · n da . (2.5)
C S ∂t
E0 = E + k (v × B) (2.7)
With the present choice of units for charge and current, Galilean covariance
requires that k = 1/c (why?).
Finally, the transformation of the electromotive force integral into a surface
integral leads to (how?)
Z
1 ∂B
∇×E+ · n da = 0 (2.8)
S c ∂t
28 2 Classical Field Theory: Maxwell Equations
Since the circuit C and the bounding surface S are abritrary, the integrant
must vanish at all points in space. Thus the differential form of Faraday’s
law is:
1 ∂B
∇×E+ = 0. (2.9)
c ∂t
Note that for time-independent electrostatic fields : ∇ × E = 0.
KELVIN-STOKES THEOREM :
relates the surface integral of the curl of a vector field over a surface S in
Euclidean 3-space to the line integral of the vector field over its boundary
Z I
∇ × F · dS = F · d` (2.10)
S ∂S
z
n
da
S
C
0
x y
Fig. 2.3.
• For such time-varying fields there are induced electromotive forces which
cause the sources of current to do work.
• Since the energy in the field is by definition the total work done to establish
it, we must consider these contributions.
• Suppose that we have a circuit with a constant current I flowing in it.
• If the flux through the circuit changes an electromotive force E is induced
around it.
• In order to keep the current constant, the sources of current must
do work.
• To determine the rate we note that the time rate of change of energy of
a particle with velocity v acted on by a force F is
dE
=v·F
dt
With a changing flux the added field E0 on each conducting electron of charge
q and drift velocity v gives rise to a change in energy per unit time of qv · E0
per electron.
Summing over all the electrons in the circuit, we find that the sources do
work to maintain the current at the rate
dW 1 dF
= −IE = I
dt c dt
Fig. 2.4. Distribution of current density broken up into elementary current loops
but J∆σd` ≡ Jd3 x since l is parallel to J. Evidently the sum over all such
elemental loops will be the volume integral.
• Hence the total increment of work done by an external source due to a
change δA(x) in the vector potential is
Z
1
δW = δA · J d3 x (2.14)
c
which is the analog of the electrostatic equation for the energy change
Z
1
δW = E · δDd3 x (2.18)
4π
where D = E + 4πP is the electric displacement and P the electric
polarization (dipole moment per unit volume).
If we bring the fields from zero to the final values the total magnetic
energy will be (why?) Z
1
W = H · B d3 x (2.19)
8π
which is the magnetic analog of the total electrostatic energy
Z
1
W = E · D d3 x (2.20)
8π
The energy of a system of charges in free space i.e. electrostatic energy
is: Z
1
W = ρ(x)Φ(x) d3 x (2.21)
2
The magnetic equivalent for this expression i.e. the magnetic energy is
Z
1
W = J · A d3 x (2.22)
2c
4. Ampère’s law: I Z
4π
H · dl = J · nda (2.26)
C c S
The basic laws of electricity and magnetism can be summarized in differ-
ential form:
Coulomb’s law ∇ · D = 4πρ (2.27)
4π 1 ∂D
Ampère’s law (∇ · J = 0) ∇ × H = J+ (2.28)
c c ∂t
1 ∂B
Faraday’s law ∇ × E + =0 (2.29)
c ∂t
no free magnetic poles ∇ · B = 0 (2.30)
where E and B are the averaged E and B of the microscopic or vacuum
Maxwell equations. The two extra field quantities D and H usually called the
electric displacement and magnetic field (B is then called the magnetic
induction and M is the macroscopic magnetization)
!
X ∂Q0
ab
Da = Ea + 4π Pa − + . . . → D = E + 4πP + . . . (2.31)
∂xb
b
Ha = Ba − 4π(Ma + . . . ) → H = B − 4πM + . . .(2.32)
B=∇×A (2.36)
1 ∂B
Then Faraday’s law ∇ × E + = 0 can be written
c ∂t
1 ∂A
∇× E+ =0 (2.37)
c ∂t
Thus the quantity with vanishing curl can be written as the gradient of a
scalar potential Φ:
1 ∂A 1 ∂A
E+ = −∇Φ or E = −∇Φ − (2.38)
c ∂t c ∂t
The definition of B and E in terms of the potentials A and Φ satisfies inden-
tically the 2 homogeneous Maxwell equations. While A and Φ are determined
by the 2 inhomogeneous Maxwell equations.
If we restrict our considerations to the vacuum form of the Maxwell equa-
tions the inhomogeneous form of Maxwell equations can be write in terms of
the potentials as:
1 ∂
∇2 Φ + (∇ · A) = −4πρ (2.39)
c ∂t
1 ∂2A
1 ∂Φ 4π
∇2 A − 2 2 − ∇ ∇ · A + =− J (2.40)
c ∂t c ∂t c
These eqns are equivalent to Maxwell eqns but they are still coupled. Thanks
to the abritrariness in the definition of the potentials we can choose transfor-
mations of the form
34 2 Classical Field Theory: Maxwell Equations
A → A0 = A + ∇Λ (2.41)
1 ∂Λ
Φ → Φ0 = Φ − (2.42)
c ∂t
Thus we can choose a set of potentials (A, Φ) such that
1 ∂Φ
∇·A+ =0 (2.43)
c ∂t
This way we uncouple the pair of equations (2.39) and (2.40) and leave
two inhomogeneous wave equations, one for Φ and one for A
1 ∂2Φ
∇2 Φ − = −4πρ (2.44)
c2 ∂t2
1 ∂2A 4π
∇2 A − 2 2 = − J (2.45)
c ∂t c
This set of equations is equivalent in all respects to the Maxwell
equations.
The transformations
A → A0 = A + ∇Λ (2.46)
1 ∂Λ
Φ → Φ0 = Φ − (2.47)
c ∂t
are called gauge transformation and the invariance of the fields under these
transformations is called gauge invariance.
The relation between A and Φ :
1 ∂Φ
∇·A+ =0 (2.48)
c ∂t
is called Lorenz condition 1 . [Prove that there will always exist potentials
satisfying the Lorentz condition].
The Lorenz gauge is commonly used because:
• It leads to the wave equations (2.44) and (2.45) which treat Φ
and A on equal footings
• It is a coordinate independent concept and fits naturally into the
considerations of special relativity.
1
The condition is from the Danish mathematician and physicist Ludvig Valentin Lorenz (1829-1891) and not
from the Dutch physicist Hendrik Lorentz (1853-1928)
2.5 Green Functions for the Wave Equations (***) 35
In this gauge
∇·A=0 (2.49)
From eqn (2.39) we see that the scalar potential satisfies the Poisson eqn :
∇2 Φ = −4πρ (2.50)
with solution
ρ(x0 , t) 3 0
Z
Φ(x, t) = d x (2.51)
|x − x0 |
The scalar potential is just the instantaneous Coulomb potential due to the
charge density ρ(x, t). This is the origin of the name Coulomb gauge.
From eqn (2.40) we find that the vector potential satisfies:
1 ∂2A 4π 1 ∂Φ
∇2 A − 2 2
=− J+ ∇ (2.52)
c ∂t c c ∂t
1 ∂2A 4π
∇2 A − = − Jt (2.54)
c2 ∂t2 c
The Coulomb or transverse gauge is often used when no sources are
present. Then Φ = 0, and A satisfies the homogeneous wave equation. The
fields are given by
1 ∂A
E=− , and B = ∇ × A (2.55)
c ∂t
1 ∂2Ψ
∇2 Ψ − = −4πf (x, t) (2.56)
c2 ∂t2
36 2 Classical Field Theory: Maxwell Equations
When the representation (2.57) are inserted into (2.56) it is found that the
Fourier transform Ψ (x, ω) satisfies the inhomogeneous Helmholtz wave
equation for each value of ω and k = ω/c
1 d2
(RGk ) + k 2 Gk = 4πδ(R) (2.61)
R dR2
d2
(RGk ) + k 2 (RGk ) = 0
dR2
with solution :
RGk (R) = AeikR + Be−ikR
The general solution for the Green function is :
(+) (−)
Gk (R) = AGk (R) + BGk (R) (2.62)
where
(±) e±ikR
Gk (R) = (2.63)
R
2.5 Green Functions for the Wave Equations (***) 37
For a continuous distribution of charge and current, the total rate of doing
work by the fields in a finite volume V is
Z
J · Ed3 x (2.69)
V
∇ · (E × H) = H · (∇ × E) − E · (∇ × H)
The terms with the time derivatives can be interpreted as the time derivatives
of the electrostatic and magnetic energy densities.
If we also remember that the sum/integrals
Z Z
1 1
WE = E · Dd3 x and WB = H · Bd3 x (2.72)
8π 8π
represents the total EM energy (even for time varying fields). Then the total
energy density is denoted by
1 1 2 1 2
u= (E · D + B · H) ≡ 0 E + B (2.73)
8π 2 µ0
Since the volume is abritrary, this can be cast into the form of a differ-
ential continuity equation or conservation law
∂u
+ ∇ · S = −J · E (2.75)
∂t
2.5 Green Functions for the Wave Equations (***) 39
The vector S represents the energy flow and is called Poynting vector
c
S= (E × H) ≡ (E × H) (2.76)
4π
• Poynting’s theorem (Conservation of energy) : The physical meaning of
the above relations is that the time rate of change of EM energy within a
certain volume, plus the energy flowing out through the boundary surfaces of
the volume per unit time, is equal to the negative of the total work done by
the fields on the sources within the volume.
• In other words Poynting’s theorem for microscopic field (E, B) is
a statement of conservation of energy of the combined system of
particles and fields.
• If we denote the total energy of the particles within the volume V
as Emech and assume that no particles move out of the volume we have
Z
dEmech
= J · Ed3 x (2.77)
dt V
where (J ≡ ρv) we converted the sum over particles to an integral over charge
& current densities.
We can use Maxwell equations to eliminate ρ and J from (2.81) by using
40 2 Classical Field Theory: Maxwell Equations
1 c 1 ∂E
ρ= ∇ · E, J= ∇×B− (2.82)
4π 4π c ∂t
After some manipulations we can show (how?) that the rate of change
of mechanical momentum of eqn (2.81) can be written
Z
dPmech d 1
+ (E × B) d3 x (2.83)
dt dt V 4πc
Z
1
= [E(∇ · E) − E × (∇ × E) + B(∇ · B) − B × (∇ × B)] d3 x
4π V
We can identify the volume integral on the left as the total EM momen-
tum Pfield in the volume V
Z
1
Pfield = (E × B) d3 x (2.84)
4πc V
The integrant can be interpreted as the density of EM momentum
1
g= (E × B) (2.85)
4πc
The EM momentum density g is proportional to the energy-flux density S
with proportionality constant c−2 .
also
X ∂
(∇ · T)a = Tab (2.88)
∂xb
b
is the a-th component of the flow per unit area of momentum across the
surface S into the volume V .
In other words it is the force per unit area transmitted across the surface
S and acting on the combined system of particles and fields inside V .
The derivation of the EM angular momentum shares the same tactical ap-
proach as that of the linear momentum
Let us define the mechanical angular momentum of the system as
By using the definition of the Maxwell stress tensor (2.86), we can simplify
eqn (2.94) considerably
d
(Lmech + Lfield ) = r × ∇ · T (2.94)
dt
where
Lfield = r × g (2.95)
has the interpretation of being the EM field angular momentum density
In integral form since
42 2 Classical Field Theory: Maxwell Equations
r × ∇ · T = ∇ · (r × T) and ∇ × r = 0 (2.96)
we get Z Z
d
Lmech + Lfield d3 x = (r × T) · nda (2.97)
dt V S
The right-hand side of this equation represents the integrated torque density
due to the fields over the boundary surface S.
3
Electromagnetic Waves (****)
µ ∂ 2 B µ ∂ 2 E
∇2 B − =0 and ∇2 E − =0 (3.3)
c2 ∂t2 c2 ∂t2
In other words each component of B and E obeys a wave equation of the
form:
44 3 Electromagnetic Waves (****)
1 ∂2u c
∇2 u − =0 where v = √ (3.4)
v 2 ∂t2 µ
is a constant with dimensions of velocity characteristic of the medium.
The wave equation admits admits plane-wave solutions:
u = eik·x−iωt (3.5)
ikn·x−iωt ikn·x−iωt
E(x, t) = Ee and B(x, t) = Be (3.6)
where the relation between the frequency ω and the wave vector k is
ω √ ω ω 2
k = = µ or k · k = (3.7)
v c v
also the vectors n, E and B are constant in time and space.
If we consider waves propagating in one direction, say x-direction then the
fundamental solution is:
which represents waves traveling to the right and to the left with propagation
velocities v which is called phase velocity of the wave.
? From the divergence relations of (3.1) by applying (3.6) we get
This means that E (or E) and B (or B) are both perpendicular to the
direction of propagation n. Such a wave is called transverse wave.
? The curl equations provide a further restriction
√ 1
B= µ n × E and E = − √ n × B (3.10)
µ
The combination of equations (3.9) and (3.10) suggests that the vectors n, E
and B form an orthonormal set.
Also, if n is real, then (3.10) implies that that E and B have the same
phase.
It is then useful to introduce a set of real mutually orthogonal unit vectors
(1 , 2 , n).
In terms of these unit vectors the field strengths E and B are
√
E = 1 E0 , B = 2 µE0 (3.11)
or
√
E = 2 E00 , B = −1 µE00 (3.12)
E0 and E00 are constants, possibly complex.
In other words the most general way to write the electric/magnetic field
vector is:
3.2 Plane Electromagnetic Waves 45
Fig. 3.1.
Thus the wave described by (3.6) and (3.11) or (3.12) is a transverse wave
propagating in the direction n.
Fig. 3.2.
where E and H are the measured fields at the point where S is evaluated.1
The time averaged flux of energy is:
r
c
S= |E0 |2 n (3.15)
8π µ
The total time averaged density (and not just the energy density associ-
ated with the electric field component) is:
1 1
u= E · E∗ + B · B∗ = |E0 |2 (3.16)
16π µ 8π
The ratio of the magnitude of (3.15) to (3.16) is the speed of energy flow i.e.
√
v = c/ µ. 2 (Prove the above relations)
Project:
What will happen if n is not real?
What type of waves you will get?
What will be the form of E?
E1 = 1 E1 eik·x−iωt , E2 = 2 E2 eik·x−iωt
with (3.17)
√ k × Ei
Bi = µ , i = 1, 2
k
Can be combined to give the most general homogeneous plane waves prop-
agating in the direction k = kn,
Fig. 3.3.
The amplitudes E1 = |E1 |eiφ1 and E2 = |E2 |eiφ2 are complex numbers in
order to allow the possibility of a phase difference between waves of different
polarization.
LINEARLY POLARIZED
If the amplitudes E1 = |E1 |eiφ1 and E2 = |E2 |eiφ2 have the same phase
(3.18) represents a linearly polarized wave with the polarization vector
making an angle θ = tan−1 (<(E2 )/=(E1 )) (which remains constant
p as the
field evolves in space and time) with 1 and magnitude E = E12 + E22 .
ELLIPTICALLY POLARIZED
If E1 and E2 have the different phase the wave (3.18) is elliptically po-
larized and the electric vector rotates around k.
• E1 = E2 = E0
• φ1 − φ2 = ±π/2 and the wave becomes
At a fixed point in space, the fields are such that the electric vector is constant
in magnitude, but sweeps around in a circle at a frequency ω.
Fig. 3.4.
48 3 Electromagnetic Waves (****)
The components of the electric field, obtained by taking the real part of
(3.20)
For the upper sign (1 + i2 ) the rotation is counter-clockwise when the
observer is facing into the oncoming wave. The wave is called left circularly
polarized in optics while in modern physics such a wave is said to have
positive helicity.
For the lower sign (1 − i2 ) the wave is right circularly polarized or it
has negative helicity.
Fig. 3.5. The figure shows the general case of elliptical polarization and the ellipses
traced out by both E and B at a given point in space.
Fig. 3.6. The figure shows the linear, circular and elliptical polarization
A useful tool for this are the four Stokes parameters. These are
quadratic in the field strength and can be determined through intensity mea-
surements only. Their measurements determines completely the state of po-
larization of the wave.
For a wave propagating in the z-direction the scalar products
Fig. 3.7. Incident wave k strikes plane interface between different media, giving
rise to a reflected wave k00 and a refracted wave k0 . The media below and above the
plane z = 0 have permeabilities and dielectric constants
√ µ, and µ’, ’ respectively.
√
The indices of refraction are n = µ and n0 = µ0 0 .
• Since the E-fields are parallel to the surface the 1st BC of (3.38) yields
nothing
• The 3rd and 4th of of (3.38) give (how?):
E0 + E000 − E00 = 0
s
0 0
r
(E0 − E000 ) cos i − E cos r = 0 (3.38)
µ µ0 0
Fig. 3.8. Reflection and refraction with polarization perpendicular to the plane of
incidence. All the E-fields shown directed away from the viewer.
The relative amplitudes of the refracted and reflected waves can be found
from (3.38)
E00
2n cos i 2 2 sin r cos i
= = =
1 + µµ0 tan i
p
E0 n cos i + µµ0 n02 − n2 sin2 i tan r
sin(i + r) µ=µ0
p
E000 n cos i − µµ0 n02 − n2 sin2 i 1 − µµ0 tan i
tan r sin(r − i)
= = = (3.39)
1 + µµ0 tan i
p
E0 n cos i + µ0 n02 − n2 sin2 i tan r
sin(i + r) µ=µ0
µ
p
Note that n02 − n2 sin2 i = n0 cos r but Snell’s law has been used to express
it in terms of the angle of incidence.
For optical frequencies it is usually permitted to put µ = µ0 .
54 3 Electromagnetic Waves (****)
Fig. 3.9. Reflection and refraction with polarization parallel to the plane of inci-
dence.
The condition that normal E is continuous, plus Snell’s law, merely dubli-
cates the 2nd of the previous equations.
The relative amplitudes of refracted and reflected fields are therefore
(how?)
E00 2nn0 cos i
= p
E0 µ 02
0 n cos i + n n02 − n2 sin2 i
µ
0
2 nn0
2 sin r cos i
= =
1 + 0 tan i
tan r
sin(i + r) cos(i − r) µ=µ0
p
µ 02 02 2 2
E000 µ0 n cos i − n n − n sin i
= p
E0 µ 02 02 2 2
µ0 n cos i + n n − n sin i
1 − 0 tan i
tan r tan(i − r)
= = (3.41)
1 + 0 tan i
tan r
tan(i + r) µ=µ0
3.5 Reflection & Refraction of EM Waves 55
E00 2 2n
= q 0 → 0
E0 1 + µ n +n
µ0
q 0
E000 −1 + µ µ0 n0 − n
= q 0 → 0 (3.42)
E0 1 + µ n +n
µ0
EXERCISES:
What are the conditions for:
• Total reflection
• Total transmision
Fig. 3.10. The conversion of an incident circularly polarized wave into an elliptically
polarized wave. The ratio of semimajor to semiminor transmitted amplitude for this
example in 1.3
4
Simple Radiating Systems (****)
For the near zone where r λ (or kr 1) the exponential in (4.6) can
be replaced by unity.
Then by using:
∞ X
l `
1 X 1 r<
= 4π Y ∗ (θ0 , φ0 )Y`m (θ, φ)
`+1 `m
(4.7)
|x − x0 | 2` + 1 r>
`=0 m=−`
This shows that the near fields are quasi-stationary, oscillating harmonically
as e−iωt , but otherwise static in character.
In the far zone (kr 1) the exponential in (4.3) oscillates rapidly and deter-
mines the behavior of the vector potential.
In this region is sufficient to approximate
|x − x0 | ≈ r − n · x0 (4.9)
eikr
Z
0
lim A(x) = J(x0 )e−ikn·x d3 x0 (4.10)
kr→∞ cr
4.2 The Near Zone 59
I.e. in the far zone the vector potential behaves as an outgoing spherical wave.
It is easy to show that the fields calculated from (4.4) and (4.5) are trans-
verse to the radius vector (how?) and fall like 1/r. Thus they correspond to
radiation fields.
If the source size is small compared to a wavelength one can expand in
powers of k:
eikr X (−ik)n
Z
lim A(x) = J(x0 )(n · x0 )n d3 x0 (4.11)
kr→∞ cr n n!
xl x3
j` (x) → 1− + ... (4.14)
(2` + 1)!! 2(2` + 3)
Then the vector potential is of the form (4.8) but with the replacement
1 eikr
1 + a1 (ikr) + a2 (ikr)2 + · · · al (ikr)l
→ (4.15)
r`+1 r `+1
The coefficients ai come from explicit expressions for the spherical Hankel
functions.
1
[(2` + 1)!! = (2` + 1)(2` − 1)(2` − 3) · · · 5 · 3 · 1]
60 4 Simple Radiating Systems (****)
For a source varying with the time the analogue of (4.2) for a scalar potential
is
0 0
|x − x0 |
Z Z
3 0 0 ρ(x , t ) 0
Φ(x, t) = d x dt δ t + −t (4.16)
|x − x0 | c
The electric monopole contribution is obtained by replacing in the integral
|x − x0 | → |x| ≡ r. The result is
q(t0 = t − r/c)
Φmonopole (x, t) =
r
where q(t) is the total charge of the source. Since charge is conserved and a
localized source has not charge flowing into or away from it, the total charge
q is independent of time.
? Thus the electric monopole part of the potential (and fields) of
localized source is of necessity static.
? The fields with harmonic time dependence e−iωt with ω 6= 0 have no
monopole terms.
eikr
Z
A(x) = J(x0 )d3 x0 (4.17)
cr
Examination of (4.13) and (4.15) shows that (4.17) is the ` = 0 part of the
series and that it is valid everywhere outside the source, not just in the far
zone. Integrating by part we get
Z Z Z
Jd x = − x (∇ · J)d x = −iω x0 ρ(x0 )d3 x0
3 0 0 0 3 0
(4.18)
∇ · J = iωρ (4.19)
eikr
A(x) = −ik p (4.20)
r
Z
p(x) = x0 ρ(x0 )d3 x0 (4.21)
The electric dipole fields from (4.4) and (4.5) are (how?)
ikr
2 1 e
B = k (n × p) 1 − (4.22)
ikr r
ikr
2 e 1 ik
E = k (n × p) × n + [3n(n · p) − p] − 2 eikr (4.23)
r r3 r
Note that the magnetic induction is transverse to the radius vector at all
distances, but the electric field has components parallel and perpendicular to
n.
In the radiation zone (kr 1) the fields take the limiting form
eikr
B = k 2 (n × p) , E=B×n (4.24)
r
In the near zone (kr 1) the fields approach
1 1
B = ik(n × p) , E = [3n(n · p) − p] (4.25)
r2 r3
? The electric field apart from its oscillation in time is just the static
electric dipole field seen earlier
? The magnetic induction is smaller than the electric field by a factor
(kr) in the region where kr 1.
? Thus the fields in the near zone are dominantly electric in
nature.
? The magnetic induction vanishes, in the static limit k → 0.
? The following identities might be useful for future derivations
eikr eikr
B = −k 2 p sin θ eφ and E = −k 2 p sin θ eθ (4.28)
r r
62 4 Simple Radiating Systems (****)
dP c 4 2
= k |(n × p) × n| (4.31)
dΩ 8π
If all the components of p have the same phase, the angular distribution is a
typical dipole pattern (where θ is measured from the direction of p )
dP c 4 2 2
= k |p| sin θ (4.32)
dΩ 8π
The total power radiated, independent of the relative phases of the com-
ponents of p, is
1 2
P = ck 4 |p| (4.33)
3
4.4 Examples
4.4.1 Example I
For p = pez we can substitute (4.28) into (4.29) and we get (how?)
c 4 2 sin2 θ 2ikr
S= k p e n (4.34)
8π 2 r2
Since the energy flux oscillates in time after averaging over a complete wave
cycle we get (hcos2 (kr)i = 1/2)
ck 4 p2 sin2 θ
hSi = n (4.35)
16π 2 r2
From the angular dependence of this expression we see that:
• most of the energy is emitted near the equatorial plane (θ = π/2), and
• none of the energy propagates along the z-axis (θ = 0 or θ = π).
4.4 Examples 63
4.4.2 Example II
2iI0
ρ0 (z) = ± (4.39)
ωd
64 4 Simple Radiating Systems (****)
Fig. 4.2. Electric field lines of an oscillating dipole p(t) = p0 cos(ωt)ez at selected
moments of time for ω = 1. Witness the transition between near-zone behavior and
wave-zone behavior which occurs near r = λ = 2πc/ω ≡ 1.
Fig. 4.3. Electric field lines of an rotating dipole p(t) = p0 [cos(ωt)ez + sin(ωt)ey ]
at selected moments of time, for ω = 1. Witness the transition between near-zone
behavior and wave-zone behavior which occurs near r = λ = 2πc/ω ≡ 1.
4.5 Magnetic Dipole & Electric Quadrupole Fields 65
Fig. 4.4.
The next term in the expansion (4.11) and (4.15) leads to a vector potential
eikr 1
Z
A(x) = − ik J(x0 ) (n · x0 ) d3 x0 (4.43)
cr r
This vector potential can be written as the sum of two terms, one of which
gives a transverse magnetic induction and the other who gives a transverse
electric field.
This can be achieved by writing the integrand as the sum of a part sym-
metric in J and x0 and a part that is antisymmetric.
1 1 1 0
(n · x0 )J = [(n · x0 )J + (n · J)x0 ] + (x × J) × n (4.44)
c 2c 2c
The 1st symmetric term will be shown to be related to the electric
quadrupole moment density.
The 2nd antisymmetric part is the so called magnetization due to
the current J:
1
M= (x × J) (4.45)
2c
eikr
1
A(x) = ik(n × m) 1− (4.46)
r ikr
eikr
2 1 ik
B = k (n × m) × n + [3n(n · m] − m] − 2 eikr (4.48)
r r3 r
Similarly, the electric field for a magnetic dipole source is the negative of
the magnetic field for an electric dipole:
eikr
1
E = −k 2 (n × m) 1− (4.49)
r ikr
4.5 Magnetic Dipole & Electric Quadrupole Fields 67
• All the arguments concerning the behavior of the fields in the near and
far zones are the same as for the electric dipole source, with the interchange
E → B, B → −E, p → m.
• Similarly the radiation pattern and total power radiated are the same
for the two kinds of dipole. The only difference in the radiation fields is the
polarization.
• For an electric dipole the electric vector lies in the plane defined by n
and p, while for a magnetic dipole it is perpendicular to the plane defined by
n and m.
k 2 eikr
Z
1
A(x) = − 1− x0 (n · x0 )ρ(x0 )d3 x0 (4.53)
2 r ikr
Since the complete fields are complicated to write down, we will study fields
in the radiation zone. Then it is easy to see that
ik 3 eikr
Z
B=− (n × x0 )(n · x0 )ρ(x0 )d3 x0 (4.55)
2 r
68 4 Simple Radiating Systems (****)
ik 3 eikr
B=− n × Q(n) (4.59)
6 r
The time-averaged power radiated per unit solid angle
dP c 2
= k 6 |[n × Q(n)] × n| (4.60)
dΩ 288π
and the direction of the radiated electric field is given by the vector inside the
absolute value.
The general angular distribution is complicated. But the total power ra-
diated can be calculated in a straightforward way. We can write the angular
dependence as
2
|[n × Q(n)] × n| = Q∗ · Q − |n · Q|2
X X
= A∗ab Qac nb nc − Q∗ab Qcd na nb nc nd (4.61)
a,b,c a,b,c,d
As we have mentioned earlier the trace of Qab is zero and thus the total
power radiated by a quadrupole source is
c 6X
P = k |Qab |2 (4.64)
360
a,b
Notice that the radiated power varies as the 6th power of the frequency for
fixed quadrupole moments, compared to the 4th power for dipole radiation.
dP ck 6
= Q0 sin2 θ cos2 θ (4.67)
dΩ 128π
The total power radiated by this quadrupole is:
c
P = k 6 Q0 (4.68)
128π
The energy carried away by the radiation will then come to the expense of
the star’s rotational energy. For this we need the field’s orientation to differ
from star’s rotational axis (oblique rotator model).
If the NS maintains a magnetic dipole moment
m(t) = m0 (sin α cos Ωtex + sin α sin Ωtey + cos αez ) (4.72)
Because the NS is located within the near zone
1
B(t) = 3 [3(m · n)n − m] (4.73)
R
ρ(x0 , t − |x − x0 |/c) 3 0
Z
Φ(x, t) = d x (5.3)
|x − x0 |
J(x0 , t − |x − x0 |/c) 3 0
Z
1
A(x, t) = d x (5.4)
c |x − x0 |
It is not difficult to see that these retarded potentials take into account the
finite propagation speed of the EM disturbances since an effect measured at
x and t was produced at the position of the source at time
|x − x0 (t̃)|
t̃ = t − (5.5)
c
Thus, using our expressions for ρ and J from eqns (5.1) and (5.2) and putting
β ≡ v/c,
δ(x0 − x0 [t − |x − x0 |/c]) 3 0
Z
Φ(x, t) = q d x (5.6)
|x − x0 |
Note that for a given space-time point (x, t), there exists only one point
on the whole trajectory, the retarded coordinate x̃ coresponding to the re-
tarded time t̃ defined in (5.5) which produces a contribution
Fig. 5.1.
Fig. 5.2.
x1 − x̃1 v x2 − x̃2 v
= and = (5.20)
x − x̃1 c x − x̃2 c
That is, the effective length L̃ and the natural length L differ by the
factor (1 − x · β)−1 = (1 − v/c)−1 because the source is moving relative to
the observer and its velocity must be taken into account when calculating the
retardation effects.
where R(t0 ) ≡ |x−x0 (t0 )|. This can be verified by using the following property
of the Dirac delta function (how?)
5.2 Liénard - Wiechert potentials : radiation fields (****) 77
Z X g(x)
g(x)δ[f (x)]dx = (5.25)
i
|df /dx| f (xi )=0
which holds for regular functions g(x) and f (x) of the integration variable x
where xi are the zeros of f (x).
• The advantage in pursuing this path is that the derivatives in eqn (5.22)
can be carried out before the integration over the delta function.
This procedure simplifies the evaluation of the fields considerably since,
we do not need to keep track of the retarded time until the last step.
We get for the electric field
0
δ(t − t + R(t0 )/c)
Z
E(x, t) = −q ∇ dt0
R(t0 )
β(t0 )δ(t0 − t + R(t0 )/c)
Z
q ∂
− dt (5.26)
c ∂t R(t0 )
Thus, differentiating the integrand in the first term, we get (HOW?)
R(t0 ) R(t0 )
Z
n 0 n 0 0
E(x, t) = q δ t −t+ − δ t −t+ dt0
R2 c cR c
β(t0 )δ(t0 − t + R(t0 )/c)
Z
q ∂
− (5.27)
c ∂t R(t0 )
But (HOW?)
R(t0 ) R(t0 )
∂
δ 0 t0 − t + = − δ t0 − t + (5.28)
c ∂t c
R(t0 ) R(t0 )
(n − β)
Z Z
n 0 0 q ∂ 0
E(x, t) = q δ t − t + dt + δ t − t + dt0
R2 c c ∂t cR(t0 ) c
(5.29)
We evaluate the integrals using the Dirac delta function expressed in equa-
tion (5.25). But we need to know the derivatives of the delta function’s argu-
ments with respect to t0 . Using the chain rule of differentiation
R(t0 )
d 0
t − t + = (1 − n · β)t̃ (5.30)
dt0 c
with which we get the result (HOW?):
n q ∂ n−β
E(r, t) = q + (5.31)
(1 − n · β)R2 t̃ c ∂t (1 − n · β)R t̃
Since
∂t0 ∂t0 ∂t0
∂R ∂R ∂ t̃ 1
= = −n · v =c 1− ⇒ =
∂t ∂t0 ∂t ∂t ∂t ∂t (1 − n · β)
(5.32)
78 5 Radiation by Moving Charges
Thus
∂ n−β 1 ∂ n−β
= (5.33)
∂t (1 − n · β)R t̃ (1 − n · β) ∂ t̃ (1 − n · β)R2 t̃
• When the particle is at rest and unaccelerated with respect to us, the
field reduces simply to Coulomb’s law qn/R2 . whatever corrections are
introduced the do not alter the empirical law.
• We also see a clear separation into the near field (which falls off as 1/R2 )
and the radiation field (which falls off as 1/R)
• Unless the particle is accelerated (β̇ 6= 0), the field falls off rapidly at large
distances. But when the radiation field is present, it dominate over the
near field far from the source.
• As β → 1 with β̇ = 0 the field displays a “bunching” effect. This “bunch-
ing” is understood as being a retardation effect, resulting from the finite
velocity of EM waves.
Fig. 5.3.
Fig. 5.4.
dP c 2 2 q2
= R |E| = |n × (n × β̇)|2 (5.41)
dΩ 4π 4πc
and if Θ is the angle between the acceleration v̇ and n then the power radiated
can be written as
80 5 Radiation by Moving Charges
dP q2
= |v̇|2 sin2 Θ (5.42)
dΩ 4πc3
Fig. 5.5.
where dτ = dt/γ is the proper time and pµ is the charged particle’s energy-
momentum 4-vector. Obviously for small β it reduces to (5.44)
2 2 2 2
dpµ dpµ
dp 1 dE dp 2 dp
− = − 2 = −β (5.46)
dτ dτ dτ c dτ dτ dτ
If (5.45) is expressed in terms of the velocity & acceleration (E = γmc2 &
p = γmv with γ = 1/(1 − β 2 )1/2 ), we obtain the Liénard result (HOW?)
2 e2 6 2
2
P = γ β̇ − β × β̇ (5.47)
3 c
2 e2 2 2 2 2 e2 c 4 4
P = 2 3
γ ω |p| = β γ (5.52)
3m c 3 ρ2
2πρ 4π e2 3 4
δE = P = β γ (5.53)
cβ 3 ρ
[E(GeV)]4
δE(MeV) = 8.85 × 10−2 (5.54)
ρ(meters)
In a 10GeV electron sychrotron (Cornell with ρ ∼ 100m) the loss per revo-
lution is 8.85MeV. In LEP (CERN) with beams at 60 GeV (ρ ∼ 4300m) the
losses per orbit are about 300 MeV.
Note that the useful quantity is (S · n)(dt/dt0 ) i.e. the power radiated per unit
area in terms of the charge’s own time. Thus we define the power radiated
per unit solid angle to be
5.5 Angular Distribution of Radiation Emitted by an Accelerated Charge (****) 83
dP (t0 ) dt
= R2 (S · n) 0 = R2 (S · n) (1 − β · n) (5.57)
dΩ dt
If β and β̇ are nearly constant (e.g. if the particle is accelerated for short
time) then (5.57) is proportional to the angular distribution of the energy
radiated.
For the Poynting vector (5.55) the angular distribution is
The simplest example is linear motion in which β and β̇ are parallel i.e.
β × β̇ = 0 and (HOW?)
dP (t0 ) e2 v̇ 2 sin2 θ
= (5.59)
dΩ 4πc3 (1 − β cos θ)5
For β 1, this is the Larmor result (5.42). But as β → 1, the angular
distribution is tipped forward and increases in magnitude.
Fig. 5.6. radiation pattern for charge accelerated in its direction of motion. the two
patterns are not to scale, the relativistic one (appropriate for γ ∼ 2) having been
reduced by a factor ∼ 102 for the same acceleration.
dP (t0 ) 8 e2 v̇ 2 8 (γθ)2
≈ γ (5.61)
dΩ π c3 (1 + γ 2 θ2 )5
84 5 Radiation by Moving Charges
The peak occures at γθ = 1/2, and the half-power points at γθ = 0.23 and
γθ = 0.91.
• The root mean square angle of emission of radiation in the relativistic
limit is
1 mc2
hθ2 i1/2 = = (5.62)
γ E
The total power can be obtained by integrating (5.59) over all angles
2 e2 2 6
P (t0 ) = v̇ γ (5.63)
3 c3
in agreement with (5.47) and (5.48). In other words this is a generalization of
Larmor’s formula.
• It is instructive to express this is terms of the force acting on the particle.
This force is F = dp/dt where p = mγv is the particle’s relativistic
momentum. For linear motion in the x-direction we have px = mvγ and
dpx
= mv̇γ + mv̇β 2 γ 3 = mv̇γ 3
dt
and Larmor’s formula can be written as
2 e2 |F|2
P = (5.64)
3 c3 m2
This is the total charge radiated by a charge in instantaneous linear motion.
Fig. 5.8.
Fig. 5.9. A relativistic particle in periodic motion emits a spiral radiation pattern
that an observer at the point A detects as short bursts of radiation of time duration
T = L/c, occurring at regular intervals T0 = L0 /c. The pulse length is given by
(5.70), while the interval T0 = 2πρ/v ≈ 2πρ/c.
The general form of the power radiated per unit solid angle is
dP (t)
= |A(t)|2 (5.72)
dΩ
where c 2
A(t) = [R E]ret (5.73)
4π
and E is the electric field defined in (5.37).
• Notice that here we will use the observer’s time instead of the retarded
time since we study the observed spectrum.
The total energy radiated per unit solid angle is the time integral of (5.72):
Z ∞
dW
= |A(t)|2 dt (5.74)
dΩ −∞
This can be expressed via the Fourier transform’s as an integral over the
frequency.
The Fourier transform is:
Z ∞
1
A(ω) = √ A(t)eiωt dt (5.75)
2π −∞
and its inverse: Z ∞
1
A(t) = √ A(ω)e−iωt dω (5.76)
2π −∞
d2 I
= |A(ω)|2 + |A(−ω)|2 (5.80)
dωdΩ
If A(t) is real, form (5.75) - (5.76) it is evident that A(−ω) = A∗ (ω). Then
d2 I
= 2|A(ω)|2 (5.81)
dωdΩ
which relates the power radiated as a funtion of time to the frequency spec-
trum of the energy radiated.
NOTE : We rewrite eqn (5.37) for future use
h i
n−β n × (n − β) × β̇
E(r, t) = e + (5.82)
γ 2 (1 − n · β)3 R2 c(1 − n · β)3 R
ret
By using (5.82) we will try to derive a general expression for the energy
radiated per unit solid angle per unit frequency interval in terms of an integral
over the trajectory of the particle.
We must calculate the Fourier transform of (5.73) by using (5.82)
2 1/2 Z ∞ " #
e n × [(n − β) × β̇]
A(ω) = eiωt dt (5.83)
8π 2 c −∞ (1 − β · n)3
ret
since the observation point is assumed to be far away the unit vector n
can be assumed constant in time, while we can use the approximation
where x is the distance from the origin O to the observation point P , and
r(t0 ) is the position of the particle relative to O.
Then (5.84) becomes:
1/2 Z ∞
e2
n × [(n − β) × β̇]
A(ω) = eiω(t−n·r(t)/c) dt (5.86)
8π 2 c −∞ (1 − β · n)2
and the energy radiated per unit solid angle per unit frequency interval (5.81)
is Z 2
d2 I e2 ω 2 ∞ iω(t−n·r(t)/c) n × [(n − β) × β̇]
= e dt (5.87)
dωdΩ 4π 2 c −∞ (1 − β · n)2
90 5 Radiation by Moving Charges
Fig. 5.10.
For a specified motion r(t) is known, β(t) and β̇(t) can be computed, and the
integral can be evaluated as a function of ω and the direction of n.
If we study more than one accelerated charged particles, a coherent sum of
amplitudes Aj (ω) (one for each particle) must replace must replace the single
amplitude in (5.87).
If one notices that, the integrand in (5.86) is a perfect differential (exclud-
ing the exponential)
n × [(n − β) × β̇] d n × (n × β)
= (5.88)
(1 − β · n)3 dt 1−β·n
then by integration by parts we get to the following relation for the intensity
distribution:
2
d2 I e2 ω 2 ∞
Z
iω(t−n·r(t)/c)
= 2
n × (n × β)e dt (5.89)
dωdΩ 4π c −∞
a result that can be obtained from the direct solution of the inhomogeneous
wave equation for the vector potential.
5.9 What Is Synchrotron Light? (***) 91
Fig. 5.11.
5.9.1 Synchrotrons
Fig. 5.12. Components of a synchrotron light source typically include (1) an elec-
tron gun, (2) a linear accelerator, (3) a booster synchrotron, (4) a storage ring, (5)
beamlines, and (6) experiment stations.
Fig. 5.13. The largest light source facilities are campuses onto themselves with
administrative, office, and laboratory buildings in addition to the light source itself.
Fig. 5.14. The trajectory lies on the plane x − y with instantaneous radius of
curvature ρ. The unit vector n can be chosen to lie in the x − z plane, and θ is the
angle with the x-axis.
c2 3
n · r(t) ω 1 2
ω t− ≈ + θ t + 2t (5.95)
c 2 γ2 3ρ
d2 I e2 ω 2 2
= 2
−k Ak (ω) + ⊥ A⊥ (ω) (5.96)
dωdΩ 4π c
where the two amplitudes are (How?)
c ∞ c2 t 3
Z
ω 1 2
Ak (ω) ≈ t exp i + θ t + dt (5.97)
ρ −∞ 2 γ2 3ρ2
Z ∞
c2 t 3
ω 1 2
A⊥ (ω) ≈ θ exp i + θ t + dt (5.98)
−∞ 2 γ2 3ρ2
This shows the characteristic behavior seen in the circular motion case e.g. in
equation (5.66).
This result can be obtained directly, by integrating a slight generalization
of the power formula for circular motion, eqn (5.65), over all times. Again:
• The 1st term corresponds polarization parallel to the orbital plane.
• The 2nd term term to perpendicular polarization.
Integration over all angles shows that seven (7) times as much energy is
radiated with parallel polarization as with perpendicular polarization. In other
words:
The radiation from a relativistically moving charge is very strongly, but
not completely, polarized in the plane of motion.
• The radiation is largely confined to the plane containing the motion,
being more confined the higher the frequency relative to c/ρ.
• If ω gets too large, then ξ will be large at all angles, and then there
will be negligible power emitted at those high frequencies.
• The critical frequency beyond which there will be negligible total energy
emitted at any angle can be defined by ξ = 1/2 and θ = 0 (WHY?).
Then we find 3
3 c 3 E c
ωc = γ 3 = (5.106)
2 ρ 2 mc2 ρ
96 5 Radiation by Moving Charges
For ω ωc
d2 I 3 e2 2 ω −ω/ωc
|θ=0 ≈ γ e (5.109)
dωdΩ 4π c ωc
These limiting cases show that the spectrum at θ = 0 increases with frequency
roughly as ω 2/3 well bellow the critical frequency, reaches a maximum in the
neighborhood of ωc , and then drops exponentially to 0 above that frequency.
• The spread in angle at a fixed frequency can be estimated by deter-
mining the angle θc at which ξ(θc ) ≈ ξ(0) + 1.
• In the low frequency range (ω ωc ), ξ(0) ≈ 0 so ξ(θc ) ≈ 1 which
gives
1/3 1/3
3c 1 2ωc
θc ≈ = (5.110)
ωρ γ ω
We note that the low frequency components are emitted at much wider
angles than the average, hθ2 i1/2 ∼ γ −1 .
• In the high frequency limit (ω > ωc ), ξ(0) 1 and the intensity
falls off in angle as:
d2 I d2 I 2 2
≈ |θ=0 · e−3ωγ θ /2ω0 (5.111)
dωdΩ dωdΩ
Thus the critical angle defined by the 1/e point is
1/2
1 2ωc
θc ≈ (5.112)
γ 3ω
π/2 ∞
d2 I d2 I
Z Z
dI
= 2π cos θdθ ≈ 2π dθ (5.113)
dω −π/2 dωdΩ −∞ dωdΩ
• For the low-frequency range we can use (5.95) at θ = 0 and (5.108) at
θc , to get
dI d2 I e2 ωρ 1/3
∼ 2πθc |θ=0 ∼ (5.114)
dω dωdΩ c c
showing the the spectrum increases as ω 1/3 for ω ωc . This gives a very
broad flat spectrum at frequencies below ωc .
• For the high-frequency limit ω ωc we can integrate (5.111) over
angles to get: r 1/2
dI 3π e2 ω
≈ γ e−ω/ωc (5.115)
dω 2 c ωc
• These results have been compared with experiment at various energy syn-
chrotrons. The angular, polarization and frequency distributions are all in
good agreement with theory.
• Because of the broad frequency distribution shown in previous Figure,
covering the visible, ultraviolet and x-ray regions, synchrotron radiation is a
useful tool for studies in condensed matter and biology.
5.11 Thomson Scattering of Radiation (***) 99
Fig. 5.17.
dP e2 2
= |∗ · v̇| (5.119)
dΩ 4πc3
If the propagation vector k0 and its the polarization vector 0 can be written
100 5 Radiation by Moving Charges
Fig. 5.18.
The incident energy flux is the time averaging Poynting vector for the plane
wave i.e. c|E0 |2 /8π. Thus from eqn (5.121) we get the differential scattering
cross section 2 2
dσ e
= |∗ · 0 |2 (5.123)
dΩ mc2
The scattering geometry with a choice of polarization vectors for the out-
going wave is shown in the Figure.
5.11 Thomson Scattering of Radiation (***) 101
The Thomson cross section for electrons is 0.665 × 10−24 cm2 . The unit of
length e2 /mc2 = 2.82 × 10−13 cm is called classical electron radius.
• This classical Thomson formula is valid only for low frequencies where
the momentum of the incident photon can be ignored.
• When the photon momentum ~ω/c becomes comparable to or larger
than mc modifications occur.
• The most important is that the energy or momentum of the scattered
photon is less than the incident energy because the charged particle recoils
during the collision.
102 5 Radiation by Moving Charges
6.1 Introduction
Einstein’s theory of special relativity is based on the assumption (which might
be a deep-rooted superstition in physics) that all physical laws should be
invariant under transformation between inertial systems.
The demand that Maxwell’s equations should be invariant under trans-
formations, and the failure of Galilean transformations to do it led to the
Lorentz transformations (β = v/c, γ = (1 − β 2 )−1/2 )
x0 = γ(x00 − βx01 )
x1 = γ(x01 − βx00 ) (6.1)
x2 = x02
x3 = x03
x0 = γ(x00 + βx01 )
x1 = γ(x01 + βx00 ) (6.2)
x2 = x02
x3 = x03
c2 t2 − x2 + y 2 + z 2 = 0
(6.3)
is invariant.
104 6 Special Theory of Relativity (****)
1st Postulate : The laws of nature and the results of all experiments
performed in a given frame of reference are independent of the translational
motion of the system as a whole
2nd Postulate : The speed of light is finite and independent of the motion
of the source
From the 1st postulate it follows that the mathematical equations express-
ing the laws of nature must be covariant, that is, invariant in form, under
the Lorentz transformations.
These demands call for rules on the ways that the scalars, 4-vectors and
4-tensors will transform in a spacetime whose norm is defined by (6.4).
SPACETIME
The space-time continuum is defined in terms of a 4-dimensional space with
coordinates x0 , x1 , x2 , x3 .
6.2 Tensors
If we assume that there is a well defined transformation that yields from the
coordinates x0 , x1 , x2 , x3 a new set of coordinates x00 , x01 , x02 , x03 according
to the rule
x0α = x0α (x0 , x1 , x2 , x3 ) (α = 0, 1, 2, 3) (6.5)
Here we will defined the tensors under their transformation prop-
erties.
A scalar (tensor of rank 0) is a single quantity whose value is not changed
under the transformation. for example the interval s2 in (6.4) is a scalar.
Vectors are tensors of rank 1, and we distinguish two kinds.
The contravariant vector Aα whose components transformed according
to the rule
3
0α
X ∂x0α β ∂x0α β
A = A ≡ A (6.6)
∂xβ ∂xβ
β=0
where the partial derivatives are calculated from (6.5). Explicitly we have 4
equations of the form:
where the partial derivatives are calculated from the inverse of (6.5).
6.2 Tensors 105
∂x0α ∂x0β γδ
F 0αβ = F (6.9)
∂xγ ∂xδ
A covariant tensor of rank 2 Gαβ transforms as
∂xγ ∂xδ
G0αβ = Gγδ (6.10)
∂x0α ∂x0β
The mixed tensor of rank 2 H α β transforms as
∂x0α ∂xδ γ
H 0α β = H δ (6.11)
∂xγ ∂x0β
The generalization to arbitrary rank tensors is quite obvious extension of the
above relations.
The inner or scalar product of two vectors is defined as the product of
the components of a covariant and a contravariant vector
B · A ≡ B α Aα (6.12)
with this definition the scalar product is an invariant or scalar under the
transfomation (6.5):
This norm or metric is a special case of the general differential length element
g αβ = gαβ (6.17)
106 6 Special Theory of Relativity (****)
The previous examples show how the covariance of a physical law emerges
provided suitable Lorentz transformation properties are attributed to the
quantities entering the equation.
6.3 Invariance of Electric Charge; Covariance in Electrodynamics 107
pα = (p0 , p) = m (U0 , U)
dx0 dx0 dt dx dx dt
U0 ≡ = = γc , U ≡ = = γu (6.27)
dτ dt dτ dτ dt dτ
dp q
= (U0 E + U × B) (6.29)
dτ c
the left hand side is the space part of a 4-vector. The corresponding time
component equation is the rate of change of the energy of the particle
Z
dp0 q dEmech
= U·E ⇐ = J · Ed3 x (6.30)
dt c dt V
The right-hand sides of the previous two equations involve three factors, the
charge q, the 4-velocity and the electromagnetic fields.
If the transformation properties of two of the three factors are known and
Lorentz covariance is demanded, then the transformation properties of the
3rd factor can be established.
The experimental invariance of electric charge and the requirement of
Lorentz covariance of the Lorentz force eqn (6.29) and (6.30) determines the
Lorentz transformation properties of the EM field.
108 6 Special Theory of Relativity (****)
For example, the requirement from (6.30) that U·E be the time component
of a 4-vector establishes that the components of E are the time-space parts
of a 2nd rank tensor F αβ such that
U · E = F 0β Uβ
We will consider Maxwell equations and we begin with the charge density
ρ(x, t) and current density J(x, t) and the continuity equation
∂ρ
+∇·J=0 (6.31)
∂t
It is natural to postulate that ρ and J together form a 4-vector J α :
J α = (cρ, J) (6.32)
∂α J α = 0 (6.33)
1 ∂2A 4π
2 2
− ∇2 A = J
c ∂t c
(6.35)
1 ∂2Φ
− ∇2 Φ = 4πρ
c2 ∂t2
Notice that the differential operator in (6.35) is the invariant 4-D Laplacian
(6.25) while the right hand side are the components of the 4-vector (6.32).
Obviously, Lorentz covariance requires that the potentials Φ and A form
a 4-vector potential
Aα = (Φ, A) (6.36)
Then the wave equation (6.35) and the Lorentz condition (6.34) take the
covariant forms
4π α
Aα = J , ∂α Aα = 0 (6.37)
c
1 ∂B
∇ · B = 0, ∇×E+ =0
c ∂t
take the form (HOW?)
With the above definitions of the various quantities and the reformulation
of the wave and Maxwell equations the covariance of the equations of EM is
established.
Finally, the Lorentz force (6.29) and rate of change of energy (6.30) can
be set in manifestly covariant form
dpα dU α q
=m = F αβ Uβ (6.47)
dτ dτ c
E10 = E1 B10 = B1
E20 = γ(E2 − βB3 ) B20 = γ(B2 + βE3 ) (6.52)
E30 = γ(E3 + βB2 ) B30 = γ(B3 − βE2 )
This suggest that for a general Lorentz transformation between two systems
moving with a speed v relative to each other the transformation of the fields
can be written (HOW):
γ2
E0 = γ (E + β × B) − β (β · E)
γ+1
(6.53)
γ2
B0 = γ (B − β × E) − β (β · B)
γ+1
• These transformations show that E and B have no independent existence.
• A purely electric or magnetic field in one coordinate system will appear
as a mixture of electric and magnetic fields in another coordinate frame.
• Thus one should properly speak of the electromagnetic field F αβ rather
than E and B separately.
Finally, if no magnetic field exists in a frame K 0 the inverse of (6.53) shows
that in the frame K the magnetic field B and the electric field E are linked
by the simple relation
B=β×E (6.54)
note that E is the transformed field from K 0 to K.
Then using the inverse of (6.53) we find the transformed fields in the
system K:
qγvt
E1 = E10 = −
(b2 + γ 2 v 2 t2 )3/2
γqb
E2 = γE20 = 2 (6.56)
(b + γ 2 v 2 t2 )3/2
B3 = γβE20 = βE2 (6.57)
Fig. 6.1. Fields of a uniformly moving charged oarticle (a) Fields at the observation
point P as function of time. (b) Lines of electric force for a particle at rest and in
motion (γ = 3).
7
Dynamics of Relativistic Particles and EM
Fields (**)
is an extremum.
116 7 Dynamics of Relativistic Particles and EM Fields (**)
From the 1st postulate of STR the action integral must be a Lorentz
scalar, because the equations of motion described by the extremum condition
δA = 0. Then if we set in (7.4) dt = γdτ we get
Z τ2
A= γL dτ (7.6)
τ1
since the proper time is invariant the condition that A is invariant requires
that γL is also Lorentz invariant.
The Lagrangian for a free particle can be a function of the velocity (the
only invariant function of the velocity is ηαβ U α U β = Uα U α = c2 ) and the
mass of the particle but not its position.
r
2 u2
Lfree = −mc 1 − 2 (7.7)
c
and through (7.5) the free-particle equation of motion (remember that p =
γmu)
d
(γmu) = 0 (7.8)
dt
Lint → LN R
int = −eΦ (7.9)
If ones wants to use of proper covariant description, has to abandon the vec-
torial writing x, u and to replace them with the 4-vectors xα , U α . Then the
free particle Lagrangian (7.7) will be written as
mc p
Lfree = − Uα U α (7.21)
γ
This invariant form can be the starting point for a variational calculation
leading to the equation of motion dU α /dτ = 0. One can further make use of
the constraint
Uα U α = c2 (7.23)
or the equivalent one:
dU α
Uα =0 (7.24)
dτ
The integrant in (7.22) is:
r
dxα dxα
p q
Uα U α dτ = dτ = g αβ dxα dxβ
dτ dτ
i.e. the infinitesimal length element in 4-space. Thus the action integral can
be written as Z s2 r
dxα dxβ
A = −mc g αβ ds (7.25)
s1 ds ds
where the 4-vector coordinate of the particle is xα (s), where s is a parameter
monotonically increasing with τ , but otherwise arbitrary.
• The action integral is an integral along the world line of the particle
• The principle of least action is a statement that the actual path is the
longest path, the geodesic. We should keep in mind that
r
dxα dxβ
g αβ ds = cdτ (7.26)
ds ds
and then a straightforward variational calculation with (7.25) leads to
d dxα /ds
mc =0 (7.27)
ds dxβ dxβ 1/2
ds ds
or
7.1 Lagrangian Hamiltonian for a Relativistic Charged Particle 119
d2 xα
m =0 (7.28)
dτ 2
as expected for a free particle motion.
For a charged particle in an external field the form of the Lagrangian (7.11)
suggests that the manifesltly covariant form of the action integral is
Z s2 " r #
dx α dx β e dxα
A=− mc g αβ + Aα (x) ds (7.29)
s1 ds ds c ds
d2 xα e α β dxβ e dxβ
m 2
= ∂ A − ∂ β Aα = F αβ (7.32)
dτ c dτ c dτ
which is the covariant equation of motion (7.3).
∂ L̃ e
Pα = − = mU α + Aα (7.33)
∂(dxα /ds) c
H̃ = Pα U α + L̃ (7.34)
eAα
α eAα
P − Pα − = m2 c2
c c
dxα eAα
∂ H̃ 1
= = Pα −
dτ ∂Pα m c
and (7.36)
α
dP ∂ H̃ e eAβ
=− = Pβ − ∂ α Aβ
dτ ∂xα mc c
The path is a helix of radius a and pitch angle α = tan−1 (vk /ωB a).
The magnitude of the gyration radius a depends on the magnetic induction
B and the transverse momentum p⊥ of the particle
cp⊥ = eBa
This relation allows for the determination of particle momenta. For particle
with charge equal to electron charge the momentum can be written numeri-
cally as
Fig. 7.1. This three basic motions of charged particles in a magnetic field: gyro,
bounce between mirror points, and drift. The pitch angle α between the directions
of the magnetic field B and the electron velocity v.
The angle between the direction of the magnetic field and a particle’s
spiral trajectory is referred to as the ”pitch angle”, which in a non-uniform
magnetic field changes as the ratio between the perpendicular and parallel
components of the particle velocity changes. Pitch angle is important because
it is a key factor in determining whether a charged particle will be lost to the
Earth’s atmosphere or not.
The interaction Lagrangian was given by (7.11). In its simpler form the non-
relativistic Lagrangian for two charged particles
q1 q2
LN R
int = (7.47)
r
including lowest order relativistic effects is
q1 q2 1 (v1 · r)(v2 · r)
Lint = −1 + 2 v1 · v2 + (7.48)
r 2c r2
where rij = |xi − xj |, and r̂ij is the unit vector in the direction xi − xj and
the “tilde” (∼) in the summation indicates omission of the self-energy terms
i = j.
7.5 Lagrangian for the Electromagnetic Field 125
∂L
The term ∂(∂ β Aα )
in the Euler-Lagrange equations becomes (how?)
∂L 1 1
β α
= − Fβα = Fαβ (7.54)
∂(∂ A ) 4π 4π
126 7 Dynamics of Relativistic Particles and EM Fields (**)
1 µ2 1
LProca = − Fαβ F αβ + Aα Aα − J α Aα (7.58)
16π 8π c
The parameter µ has dimensions of inverse length and is the reciprocal Comp-
ton wavelength of the photon (µ = mγ c/~). Instead of (7.56) the Proca equa-
tions of motion are
4π
∂ β Fβα + µ2 Aα = Jα (7.59)
c
with the same homogeneous equations ∂α F αβ = 0 as in Maxwell theory. In
contrast to the Maxwell equations the potentials have real physical (observ-
able) significance through the mass term. In the Lorentz gauge (7.59) can be
written
4π
Aα + µ2 Aα = Jα (7.60)
c
If the source is a point charge q at rest in the origin then the only non-vanishing
component is A0 = Φ. the solution will be the spherically symmetric Yukawa
potential
q
Φ(x) = e−µr (7.62)
r
i.e. we observe an exponential falloff of the static potentials and fields, with
1/e distance equal to 1/µ.
Notice that the exponential factor alters the character of the Earth’s (and
other planets) magnetic fields sufficiently to permit us to set quite stringent
limits on the photon mass from geomagnetic data.
µ2 r2 e−µr 2 2 e−µr
B(x) = [3r̂(r̂ · m) − m] 1 + µr + − µ m 3
3 r3 3 r
The result shows that the Earth’s magnetic field will appear as a dipole an-
gular distribution plus an added constant magnetic field antiparallel to m.
Measurements show that this “ external” field is less than 0.004 times the
dipole field at the magnetic equator which leads to µ < 4 × 10−10 cm−1 or
mγ < 8 × 10−49 g.
The first factor in the sum is the field momentum canonically conjugate to
φk (x) and ∂ α φk (x) is equivalent to the velocity q̇i .
The Lorentz transformation properties of H suggest that the covariant
generalization of the Hamiltonian density is the canonical stress tensor:
X ∂L
T αβ = ∂ β φk − g αβ L (7.65)
∂(∂α φk )
k
1 1
T 00 = E2 + B2 +
∇ · (ΦE)
8π 4π
1 1
T 0i = (E × B)i + ∇ · (Ai E) (7.67)
4π 4π
1 1 ∂
T i0 = (E × B)i + (∇ × ΦB)i − (ΦEi )
4π 4π ∂x0
If the fields are localized in some finite region of space the integrals over
all 3-space at fixed time in some inertial frame of the components T 00 and
T 0i can be interpreted as the total energy and c times the total momentum
of the EM fields in that frame:
Z Z
1
T 00 d3 x = E2 + B2 d3 x = Efield
8π
(7.68)
Z Z
1
T i0 d3 x = (E × B)i d3 x = cPfield
i
4π
∂α T αβ = 0 (7.69)
7.8 Conservation Laws : Symmetric Stress Tensor 129
Consider
αβ
X ∂L
∂α T = ∂α ∂ φk − ∂ β L
β
∂(∂α φk )
k
X ∂L ∂L
β
= ∂α ∂ φk + ∂ α ∂ φk − ∂ β L
β
∂(∂α φk ) ∂(∂α φk )
k
but because of the equation of motion (7.50) the first term can be transformed
so that
X ∂L ∂L
αβ β
∂α T = ∂ φk + ∂ (∂α φk ) − ∂ β L
β
∂φk ∂(∂α φk )
k
α
since L = L(φk , ∂ φk ) the term in the square bracket is an implicit differen-
tiation, hence
∂α T αβ = ∂ β L(φk , ∂ α φk ) − ∂ β L = 0
The conservation law (or continuity equation) (7.69) yields the conserva-
tion of total energy and momentum upon integration over all of 3-space at
fixed time Z Z Z
0 = ∂α T αβ d3 x = ∂0 T 0β d3 x + ∂i T iβ d3 x
If the fields are localized the 2nd integral (divergence) gives no contribution.
Then with the identification (7.68) we get
d d
Efield = 0 , Pfield = 0 (7.70)
dt dt
The results are valid for an observer at rest in the frame in which the fields
are specified.
M αβγ = T αβ xγ − T αγ xβ (7.71)
∂α M αβγ = 0 (7.72)
then by using (7.69) one can see that the conservation of angular momentum
requires that T αβ be symmetric. Finally, the involvement of the potentials in
(7.66) makes it not gauge invariant.
We will construct a symmetric, traceless, gauge-invariant stress tensor Θαβ
from the canonical stress tensor T αβ of (7.66).
We substitute ∂ β Aλ = −F λβ + ∂ λ Aβ and obtain
1 1 1 αµ
T αβ = g αµ Fµλ F λβ + g αβ Fµν F µν − g Fµλ ∂ λ Aβ (7.73)
4π 4 4π
The first term in (7.73) is symmetric and gauge invariant while the last term
of (7.73), with the help of the source-free Maxwell equations, can be written
(how?)
αβ 1 αµ 1 λα
TD ≡− g Fµλ ∂ λ Aβ = F ∂ λ Aβ
4π 4π
1 1
F λα ∂λ Aβ + Aβ ∂λ F λα = ∂λ F λα Aβ
= (7.74)
4π 4π
with the following properties: (why?)
Z
αβ 0β 3
(i) ∂α TD = 0, (ii) TD d x=0
Thus the differential conservation law (7.69) will hold for the difference (T αβ −
αβ
TD ) if it holds for T αβ .
If we define the symmetric stress tensor Θαβ
αβ 1 1
Θαβ = T αβ − TD = g αµ Fµλ F λβ + g αβ Fµν F µν (7.75)
4π 4
then the differential conservation law (7.69) will hold if it holds for T αβ . The
explicit components of Θαβ are:
1
Θ00 = E2 + B2
8π
1 i
Θ0i = (E × B) (7.76)
4π
1 1 ij
Θij i j i j 2 2
=− E E +B B − δ E +B
4π 2
where the time-time component is the energy density (2.73) the time-space
component is the momentum density (2.85) while the space-space compo-
nents are the negative of the Maxwell stress tensor (2.86).
The various covariant or mixed forms of the stress tensor are
! !
u −cg α u −cg
Θαβ = (M ) Θ β= (M )
−cg −Tij cg Tij
!
β u cg
Θα = (M )
−cg Tij
The differential conservation law
∂α Θαβ = 0 (7.78)
The right-hand side is zero (why?) and thus the divergence of the stress
tensor is
1
∂α Θαβ = − F βλ Jλ (7.80)
c
The time and space components of this equation are
1 ∂u 1
+∇·S =− J·E (7.81)
c ∂t c
and
3
∂gi X ∂ (M ) 1
− T = − ρEi + (J × B)i (7.82)
∂t j=1
∂xj ij c
These are the conservation of energy and momentum equations for EM fields
interacting with sources described by J α = (cρ, J).
The negative of the right hand side term in (7.80) is called the Lorentz
force density,
1 1 1
f β ≡ F βλ Jλ = J · E, ρE + J × B (7.83)
c c c
If the sources are a number of charged particles then the volume integral
of f β leads through the Lorents force equation (7.1) to the time rate of change
of the sum of the energies or momenta of all particles:
β
dPparticles
Z
f β d3 x =
dt
The conservation of the 4-momentum for the combined system of particle
and fields:
Z
d β β
d3 x ∂α Θαβ + f β =
Pfield + Pparticles =0 (7.84)
dt
8
A Short Introduction to Tensor Analysis
Studying the way in which these values change with the coordinate system
leads to the concept of tensor.
With the help of this concept we can express the physical laws by tensor
equations, which have the same form in every coordinate system.
• ¡1-¿ Scalar field : is any physical quantity determined by a single numer-
ical value i.e. just one component which is independent of the coordinate
system (mass, charge,...)
• ¡2-¿ Vector field (contravariant): an example is the infinitesimal dis-
placement vector, leading from a point A with coordinates xµ to a neigh-
bouring point A0 with coordinates xµ + dxµ . The components of such a
vector are the differentials dxµ .
From the infinitesimal vector AA0 with components dxµ we can construct a
finite vector v µ defined at A. This will be the tangent vector of the curve
xµ = f µ (λ) where the points A and A0 correspond to the values λ and λ + dλ
of the parameter. Then
dxµ
vµ = (8.2)
dλ
Any transformation from xµ to x̃µ (xµ → x̃µ ) will be determined by n equa-
tions of the form: x̃µ = f µ (xν ) where µ , ν = 1, 2, ..., n.
This means that :
X ∂ x̃µ X ∂f µ
dx̃µ = ν
dxν = ν
dxν for ν = 1, ..., n (8.3)
ν
∂x ν
∂x
and
dx̃µ X ∂ x̃µ dxν X ∂ x̃µ
ṽ µ = = ν
= ν
vν (8.4)
dλ ν
∂x dλ ν
∂x
Then the transformation formulae for the components of the tensors of order
2 are (why?):
Tensor addition : Tensors of the same order (p, q) can be added, their sum
being again a tensor of the same order. For example:
∂ x̃ν µ
ãν + b̃ν = (a + bµ ) (8.9)
∂xµ
T µν = Aµ B ν or T µ ν = Aµ Bν or Tµν = Aµ Bν (8.11)
136 8 A Short Introduction to Tensor Analysis
T λµν λα = T µν α (8.12)
aλ = aλ (xα )
∂xµ ∂x0ρ 0ν
Aµ ,α = [A ,ρ + Γ 0ν σρ A0σ ] . (8.16)
∂x0ν ∂xα
Suggesting that the transformation (x̃µ → x0µ ) will be:
µ ∂ x̃µ ∂x0ρ 0ν
õ,α + Γ̃αλ Ãλ = (A ,ρ + Γ 0ν σρ A0σ ) (8.17)
∂x0ν ∂ x̃α
The necessary and sufficient condition for Aµ ,α to be a tensor is:
Aµ ;α = Aµ ,α + Γ µ αλ Aλ or ∇α Aµ = ∂α Aµ + Γ µ αλ Aλ (8.19)
Fig. 8.1.
The ”trip” of a parallel transported vector along a closed path The parallel
transport of the vector aλ from the point P to A leads to a change of the
vector by a quantity Γ λ µν (P )aµ dxν and the new vector is:
aλ (A) = aλ (P ) − Γµν
λ
(P )aµ dxν (8.28)
Since both dxν and δxν assumed to be small we can use the following
expression
8.2 Tensors: at last (****) 139
Fig. 8.2.
The quantity g ≡ det |gµν | is the determinant of the metric tensor. The deter-
minant transforms as :
∂xα ∂xβ
α β
∂x ∂x
g̃ = det g̃µν = det gαβ µ ν = det gαβ · det · det
∂ x̃ ∂ x̃ ∂ x̃µ ∂ x̃ν
2
∂xα
= det µ g = J 2g (8.46)
∂ x̃
where J is the Jacobian of the transformation.
This relation can be written also as:
p p
|g̃| = J |g| (8.47)
p
i.e. the quantity |g| is a scalar density of weight 1.
• The quantity
p p
|g|δV ≡ |g|dx1 dx2 . . . dxn (8.48)
is the invariant volume element of the Riemannian space.
• If the determinant vanishes at a point P the invariant volume is zero
and this point will be called a singular point.
142 8 A Short Introduction to Tensor Analysis
Fig. 8.3.
α 1 αν
Γµρ = g (gµν,ρ + gνρ,µ − gρµ,ν ) (8.54)
2
8.5 Geodesics in a Riemann Space (***) 143
Fig. 8.4.
A neighboring curve x̃µ (s) connecting the same points will be described
by the equation:
x̃µ (s) = xµ (s) + ξ µ (s) (8.55)
where ξ µ (A) = ξ µ (B) = 0. The length of the new curve will be:
B µ ν 1/2
Z
α dx̃ dx̃
S̃ = gµν (x̃ ) ds (8.56)
A ds ds
The last term does not contribute and the condition for the length of S to be
an extremum will be expressed by the relation:
Z B
∂f d ∂f
δS = − ξ α ds = 0 (8.58)
A ∂xα ds ∂uα
Since ξ α is arbitrary, we must have for each point of S:
d ∂f ∂f
µ
− =0 (8.59)
ds ∂u ∂xµ
Notice that the Langrangian of a freely moving particle with mass m = 2, is:
L = gµν uµ uν ≡ f 2 this leads to the following relations
∂f 1 ∂L −1/2 ∂f 1 ∂L −1/2
= L and = L (8.60)
∂uα 2 ∂uα ∂xα 2 ∂xα
and by substitution in (8.59) we come to the condition
d ∂L ∂L
− = 0. (8.61)
ds ∂uµ ∂xµ
thus
d dgµα µ duµ duµ
(2gµα uµ ) = 2 u + 2gµα = 2gµα,ν uν uµ + 2gµα
ds ds ds ds
ν µ µ ν duµ
= gµα,ν u u + gνα,µ u u + 2gµα (8.64)
ds
1
We make use of the following relations:
!
α α α α α α α α α ∂f α ∂f 2
f (x̃ , ũ ) = f (x + ξ ,u + ξ̇ ) = f (x , u ) + ξ + ξ̇ + O( )
∂xα ∂uα
! !
d ∂f α ∂f α d ∂f α
ξ = ξ̇ + ξ
ds ∂uα ∂uα ds ∂uα
8.5 Geodesics in a Riemann Space (***) 145
duµ 1
gµα + [gµα,ν + gαµ,ν − gµν,α ] uµ uν = 0
ds 2
if we multiply with g ρα we the geodesic equations
duρ ρ µ ν
+ Γµν u u = 0, or uρ ;ν uν = 0 (8.65)
ds
because duρ /ds = uρ ,µ uµ .
The Lagrangian for a freely moving particle is: L = gµν uµ uν and the Euler-
Lagrange equations:
d ∂L ∂L
− =0
ds ∂uµ ∂xµ
are equivalent to the geodesic equation
duρ ρ µ ν d2 xρ µ
ρ dx dx
ν
+ Γµν u u = 0 or 2
+ Γµν =0
ds ds ds ds
Notice that if the metric tensor does not depend from a specific coordinate
e.g. xk then
d ∂L
=0
ds ∂ ẋκ
which means that the quantity ∂L/∂ ẋκ is constant along the geodesic. Then
eq (8.62) implies that ∂∂L
ẋκ = gµκ u
µ
that is the κ component of the generalized
µ
momentum pκ = gµκ u remains constant along the geodesic.
Fig. 8.5.
d2 xκ µ
κ dx dx
ν
2
+ Γµν =0 (8.66)
dλ dλ dλ
and obiously:
dxµ dxν
gµν = 0. (8.67)
dλ dλ
d2 xµ α
µ dx dx
β
d2 σ/ds2 dxµ
2
+ Γαβ =− (8.68)
dσ dσ dσ (dσ/ds)2 dσ
where we have used
2
dxµ dxµ dσ d2 xµ d2 xµ dxµ d2 σ
dσ
= and = + (8.69)
ds dσ ds ds2 dσ 2 ds dσ ds2
The new geodesic equation (8.68), reduces to the original equation(8.66) when
the right hand side is zero. This is possible if
8.6 Riemann Tensor (***) 147
d2 σ
=0 (8.70)
ds2
which leads to a linear relation between s and σ i.e. σ = αs + β where α and
β are arbitrary constants. σ is called affine parameter.
Rλ βνµ = −Γβν,µ
λ λ
+ Γβµ,ν σ
− Γβν λ
Γσµ σ
+ Γβµ λ
Γσν (8.71)
1
Gµ ν;µ = Rµ ν − δ µ ν R = 0. (8.76)
2 ;µ
Rλ µ[νρ;σ] = 0 (8.77)
Fig. 8.6.
Prove that :
aλ ;µ;ν − aλ ;ν;µ = −Rλ κµν aκ
g λρ Cλµνρ = 0 (8.81)
∂A1 1 ∂A1
= −Γ22 A2 ⇒ = sin θ cos θA2
∂x2 ∂φ
∂A2 2 ∂A2
= −Γ12 A1 ⇒ = − cot θA1
∂x2 ∂φ
150 8 A Short Introduction to Tensor Analysis
Fig. 8.7.
and for an initial unit vector (A1 , A2 ) = (1, 0) at (θ, φ) = (θ0 , 0) the integration
constants will be α = 1 and β = 0. The solution is:
sin(2π cos θ)
A = A1 eθ + A2 eφ = cos(2π cos θ)eθ − eφ
sin θ
i.e. different components but the measure is still the same
2 2
|A|2 = gµν Aµ Aν = A1 + sin2 θ A2
sin2 (2π cos θ)
= cos2 (2π cos θ) + sin2 θ =1
sin2 θ
Question : What is the condition for the path followed by the vector to
be a geodesic?
9
Physics on Curved Spaces
fµ uµ = qFµν uµ uν = 0
Fµν = −Fνµ
Fig. 9.1.
ρ0 = γv ρ0
i.e the charge density is not a scalar but transforms like the component of a
vector.
Thus the obvious choice of a source term for the EM field equations should
be a 4-vector
j = ρ0 (x)u(x) and j · j = ρ20 c2
In an inertial frame S the components of the 4-current density j
j µ = ρ0 γu (c, u) = (cρ, j)
∇ · F = kj.
F µν ,µ = k j ν (9.2)
F µν ,µν = k j ν ,ν ⇒ j ν ,ν = 0 (why?)
F µν ,µ = k j µ (9.5)
F[µν,σ] ≡ Fµν,σ + Fσµ,ν + Fνσ,µ = 0 (9.6)
The last equation can be easily derived from equation (9.3) (how?).
Finally, the constant k can be found by demanding consistency with the
standard Maxwell equations. In SI units we have k = µ0 , where 0 µ0 = 1/c2 .
A(new)
µ = Aµ + Qµ (9.7)
This is not a coordinate transformation. The new EM field tensor will be:
(new)
Fµν = A(new)
ν,µ
(new)
− Aµ,ν = Aν,µ − Aµ,ν + Qν,µ − Qµ,ν (9.8)
154 9 Physics on Curved Spaces
A(new)
µ = Aµ + ψ,µ (9.9)
and still recover the same EM field tensor and the same EM field equations.
This is an example of a gauge transformation, which is not a coordinate
transformation.
In the field equations
the 2nd term in the left hand side can be written as Aµ ,µλ thus this term can
become zero by choosing a scalar field ψ such that
Aµ ,µ = 0 (9.11)
This condition is called Lorentz gauge1 and simplifies the EM field equations
• The last 2 equations are the EM field equations in the Lorentz gauge.
• In the absence of charges and currents Aµ has wave solutions traveling
with the speed of light.
• This is also true for the components of Fµν since in this case we also have
Fµν = 0.
Aµ = (φ/c, A)
ρ ∂B
∇·E = , ∇×E=− (9.15)
0 ∂t
∂E
∇ · B = 0, ∇ × B = µ0 j + µ0 0 (9.16)
∂t
Finally, by using eqns (9.3) and (9.14) we can find (how?) that the covariant
components of F µν in S are:
E 1 /c E 2 /c E 3 /c
0
−E 1 /c 0 −B 3 B 2
F µν = −E 2 /c B 3
0 −B 1
−E 3 /c −B 2 B 1 0
F µν ,µ = µ0 j µ (9.17)
F[µν,σ] ≡ Fµν,σ + Fσµ,ν + Fνσ,µ = 0 (9.18)
F µν ;µ = µ0 j µ (9.19)
F[µν;σ] ≡ Fµν;σ + Fσµ;ν + Fνσ;µ = 0 (9.20)
These new equations are fully covariant and if they are valid in one system of
coordinates then they will be valid in all coordinate systems.
In the same way we can write the EM field equations in terms of the
4-potential Aµ
Conclusion
The general procedure for converting an equation valid in Cartesian inertial
coordinates into one that is valid in an arbitrary coordinate system is as
follows:
• replace partial with covariant derivatives
• replace ordinary derivatives along curves with absolute (intrinsic) deriva-
tives
• replace ηµν with gµν .
EXAMPLE: The EM stress energy tensor for a constant electric field in the
x direction will be (prove it):
E2 E2 E2
T 00 = , T xx = − , T yy = T zz =
8π 8π 8π
0
what will be the components of this tensor in a frame S moving with a velocity
v at the x-direction?
To construct the gravitational field equations, we must find a covariant
way of expressing the source term.
We must find a tensor that describes the matter distribution at each event
in spacetime.
• Let’s start with dust, this is a time-dependent distribution of electrically
neutral non-interacting particles, each of rest mass m0 .
• At each event P of the spacetime we can characterize the distribution
completely by giving the matter density ρ and 3-velocity u as measured in
some inertial frame.
• For example in an instantaneous rest frame S at a given point P , u = 0
while the proper density is given by ρ0 = m0 n0 , where m0 is the rest mass of
each particle and n0 is the number of particles in the unit volume.
A perfect fluid is defined as one for which there are no forces between the
particles and no heat conduction or viscosity. Thus in an inertial reference
frame the components of T for a perfect fluid are given by:
2
ρc 0 0 0
0 p 0 0
T µν =
0 0 p 0
0 00p
For the general real fluid we have to take into account that:
• besides the bulk motion of the fluid, each particle has some random (ther-
mal) velocity
• there may be various forces between the particles that contribute potential
energies to the total.
In this case the components of the energy momentum tensor will be:
• T 00 : is the total energy density, including any potential energy contribu-
tions from forces between the particles and kinetic energy from the random
thermal motions.
160 9 Physics on Curved Spaces
T µν ,ν = 0 (9.30)
We will show that this relation leads to well known equations of motion and
continuity for a fluid in the Newtonian limit. Substituting eqn (9.28) into
equation (9.30) we get:
and equation (9.31) becomes the the relativistic equation of motion for
a perfect fluid in local inertial coordinates at some point P :
A slowly moving fluid is the one for which we may neglect u/c and so take
γu ≈ 1 and uµ ≈ (c, u) and the proper density becomes the “normal” density.
In this limit we get
∂ρ
T 0ν ,ν = 0 → eqn (9.32) ⇒ + ∇ · (ρu) = 0 (9.34)
∂t
which is the classical equation of continuity for a fluid, and
∂u 1
T jν ,ν = 0 → eqn (9.33) ⇒ + (u · ∇) u = − ∇p (9.35)
∂t ρ
which Euler’s equations of motion for a perfect fluid.
9.10 Linear Field Equations for Gravitation (**) 161
————–
We can easily obtain the condition for energy and momentum conservation
in arbitrary coordinates by replacing the partial with the covariant derivative,
and we get:
T µν ;ν = 0 (9.36)
This is a fundamental equation which imposes tight restrictions on the pos-
sible forms that the gravitational field equations may take.
∂µ ∂ µ Aν + b∂ ν ∂µ Aµ + aAν = κj ν (9.37)
while κ is the coupling constant of the theory and will be estimated from the
Newtonian limit of the above equation.
• We will call hµν gravitational tensor potential in analogy to EM
potential Aµ .
• Besides the tensor field hµν one may consider extra scalar, vector
or even tensor fields and to construct endlessly complicated theories.
• Jordan (1951) and Brans-Dicke (1961) suggested the least complicated
scalar-tensor theory containing only an extra scalar field. This new scalar field
has the effect of making the gravitational constant dependent on position. The
experimental evidence (up to now) speaks against the scalar-tensor theory
• In EM we got a simplified field equation by taking advantage of the
gauge invariance. The same can be done for gravitation and it can be shown
that the field equations are invariant under the gauge transformation
hµν
(new) = h
µν
+ Qµ,ν + Qν,µ (9.41)
If the Hilbert gauge condition holds; the field equations simplifies to:
5
Show that if Qν is solution of the differential equation Qν ,µ ,µ = −hµν ,µ + 12 h,ν
then hµν
(new) will satisfy the Hilbert condition.
9.11 Interaction of Gravitation and Matter (*) 163
,λ
1
hµν − η µν h = −κT µν (9.43)
2 ,λ
By substituting
1
φµν = hµν − η µν h (9.44)
2
the field equation becomes
If we act on both side with ∂/∂xν we get (because of the Hilbert gauge)
T(m)µν + tµν =0 (9.49)
,ν
then6 Z
κ κp
− hαβ,µ T(m)αβ d3 x = − 1 − u2 mhαβ,µ uα uβ (9.58)
2 2
Thus we finally get (how?)
6
√
ρ0 uα uβ d3 x = muα uβ 1 − u2
R
Show that
9.12 Local Cartesian coordinates(-) 165
d κ
Pµ = − mhαβ,µ uα uβ
dτ 2
which has the right form for an equation of motion
If we assume that :
∂L κ ∂L
= − mhαβ,µ uα uβ and = Pµ (9.59)
∂xµ 2 ∂uµ
we get the Euler -Lagrange form of equations of motion.
Actually, in the absence of forces the Lagrangian will have the form
1
L0 = m ηαβ uα uβ
2
while in the presence of a field becomes:
1
L= m ηαβ + κh̃αβ uα uβ
2
notice that h̃αβ is a quantity related to the spacetime while hαβ is the ”tensor
potential” of our theory. But because of the Euler-Lagrange equations these
two quantities are identical and this suggests that a relation between the
“geometrical” quantity h̃αβ and the “ physical” quantity” hαβ !
NOTE 1 : We have created a relation between physical and geometrical
quantities
Alternatively, if in the geodesic equations one assumes that gµν = ηµν +
κhµν then it comes to the equation
d κ
(uµ + κhµα uα ) − hαβ,µ uα uβ = 0 . (9.60)
dτ 2
NOTE 2 : That the equation of motion of a particle in a gravitational
field is independent of the mass of the particle, in agreement with Galileo
principle.
Fig. 9.2. Construction of local geodesics coordinates near a point P on the surface
of a sphere. A small flat patch is placed tangent to the sphere at P . The rectangular
coordinates give us the local coordinates.
∂xα
• N 2 independent values
∂x0β2 α
P
∂ x
• 0β ∂x0γ N 2 (N + 1)/2 independent values
∂x P
∂ 3 xα
• ∂x0β ∂x0γ ∂x0δ
N 2 (N + 1)(N + 2)/6 independent values
P
0
• gαβ (P ) N (N + 1)/2 independent values
0
∂gαβ
• ∂x0µ N 2 (N + 1)/2 independent values
P
0
∂ 2 gαβ
• ∂x0µ x0ν N 2 (N + 1)2 /4 independent values
P
We will show how one can get s system of local Cartesian coordinates starting
from a general system of coordinate xα via equations
0
gαβ (P ) = δαβ (9.64)
0
∂gαβ
= 0. (9.65)
∂x0µ P
if the first of the two relations is valid (and not the 2nd) in the new coordinate
system then from the definition of the Christoffel symbols they have to vanish
i.e.
Γ 0α βγ (P ) = 0. (9.66)
If the 2nd of the above relations is valid (and not the 1st) then since gµα,β =
Γ ν γβ gνα + Γ ν αβ gµν (remember that gµα;β = 0) this leads again to equation
(9.66).
Coordinates for which equation (9.66) holds are referred to as geodesic
coordinates about P .
If we start with some arbitrary coordinates xα and lets a point P has
0α
coordinates xα P and then we define a new coordinate system x by
1 α µ
x0α = xα − xα ν ν µ
P + Γ νµ (P ) (x − xP ) (x − xP ) (9.67)
2
by differentiation we get
∂x0α
= δνα + Γνµ
α
(P ) (xµ − xµP )
∂xν
thus at the point P we get: ∂x0α /∂xν = δνα (or ∂xα /∂x0ν = δνα ).
One more differentiation will give:
∂ 2 x0 α
= Γ α µν (P )δλµ = Γ α λν (P )
∂xλ ∂xν
and if we substitute these two relations in the formula for the transformation
of connections (Chapter 1):
Thus in the new primed coordinates the Christoffel symbols are zero at P
and thanks to condition (9.66) we constructed a system of geodesic coordinates
at P .
168 9 Physics on Curved Spaces
The metric function at this coordinate system will not satisfy necessarily
the first condition (9.64). Thus one more transformation of the form
x00α = Aα β x0β
00
which is linear and thus the coefficients of Aα β will bring the metric gαβ (P )
in these coordinates into the form (9.64) without affecting the derivatives and
the equation (9.65) will still be satisfied.
Fig. 9.3. (Left) Two neighbouring geodesics. (Right) Converging geodesics on the
surface of a sphere.
d2 ξ α dxµ dxν λ
2
+ Γ α µν,λ ξ =0
dσ dσ dσ
9.14 Tidal forces in a curved spacetime (-) 169
However, in our geodesic coordinates the 2nd order absolute (intrinsic) deriva-
tive of ξ α at P is:
D2 ξ α d dξ α d2 ξ α dxµ dxλ ν
α µ ν α
= + Γ µν ξ x = + Γ µν,λ ξ
Dσ 2 dσ dσ dσ 2 dσ dσ
α
where we have used the fact that Γµν (P ) = 0.
By combining the last two equations we get:
D2 ξ α α α
µ
ν dx dx
λ
+ [Γ µλ,ν − Γ µν,λ ]ξ =0
Dσ 2 dσ dσ
which will give
D2 ξ α α
µ
ν dx dx
ν
+ R µνλ ξ =0 (9.69)
Dσ 2 dσ dσ
because the term in the square brackets is the Riemann tensor in local geodesic
coordinates.
Tidal forces deform the shape of bodies as they freely move in a gravitational
field.
Thus two nearby particles with trajectories xi (t) and x̃i (t) (in Cartesian
coordinates) will be separated by a vector ζ i = xi − x̃i
d2 ζ
2
∂ Φ
= − ζj (9.70)
dt2 ∂xi ∂xj
(why?) where Φ is the Newtonian gravitational potential.
Tidal effects can be also estimated in GR for two particles moving along
timelike geodesics xµ (τ ) and x̃µ (τ ) (τ is the proper time of the 1st particle).
170 9 Physics on Curved Spaces
Fig. 9.5. The basis vectors of the instantaneous rest frame (IRF) at P .
• êα is a set of orthonormal basis vectors at P that define the IRF of the first
particle (observer) with êα · êβ = ηαβ .
• ξ is a general connecting vector with ξ α̂ ≡ êα · ξ = (êα )µ ξ µ
• ζ is the orthogonal connecting vector.
For an observer sitting on the one of the particles it can be shown that in
any orthonormal freely falling frame becomes:
d2 ξ µ̂
= c2 Rµ̂ 0̂0̂ν̂ ξ ν̂ . (9.72)
dτ 2
Newtonian limit (we will discuss the details later)
∂2Φ
Rµ̂ 0̂0̂ν̂ → (9.73)
∂xi ∂xj
10
Einstein’s Theory of Gravity
1 R 02 0 0
Z
U (r) = − r ρ(r )dr for r > R
r 0
1 r 02 0 0
Z Z R
U (r) = − r ρ(r )dr − r0 ρ(r0 )dr0 for r < R
r 0 r
Fig. 10.1.
1 1 X xk x0k 1 XX 0k 0l
k l
02 l x x
= + + 3x x − r δk + ...
|x − x0 | r r3 2 r5
k k l
GM G X k k G X kl xk xl
U (x) = − − 3 x D − Q + ...
r r 2 r5
k kl
Gravitational Multipoles
Z
M = ρ(x0 )d3 x0 Mass
Z
k
D = x0k ρ(x0 )d3 x0 Mass Dipole moment1
Z
Qkl = 3x0k x0l − r02 δkl ρ(x0 )d3 x0
Mass Quadrupole tensor2
The Earth’s polar and equatorial diameters differ by 3/1000. This devia-
tion produces a quadrupole term in the gravitational potential, which causes
perturbations in the elliptical Kepler orbits of satellites. Usually, we define
the dimensionless parameter
Q33
J2 = − (10.1)
2M R2
as a convenient measure of the oblateness of a nearly spherical body. For the
Sun the oblateness due to rotation gives J2 ≈ 10−7 .
The main perturbation is the precession of Kepler’s ellipse and this can
be used for precise determinations of the multipole moments and the mass
distribution in the Earth.
Fig. 10.2.
Fig. 10.3.
d2 xµ ρ
µ dx dx
σ
+ Γ ρσ = 0.
ds2 ds ds
• The 2nd term appeared due to the use of a non-inertial frame, i.e. the
inertial accelerations will be described by the Christoffel symbols.
• But according to Einstein the gravitational accelerations as well will
be described by the Christoffel symbols.
• This leads to the following conclusion: The metric tensor should play
µ
the role of the gravitational potential since Γρσ is a function of the metric
tensor and its derivatives.
2. When gravitational accelerations are present the space can-
not be flat since the Christoffel symbols are non-zero and the Riemann tensor
is not zero as well.
In other words, the presence of the gravitational field forces the space to
be curved, i.e. there is a direct link between the presence of a gravitational
field and the geometry of the space.
3. Consequence: if gravity is present there cannot exist inertial
frames.
If it was possible then one would have been able to discriminate among
inertial and gravitational accelerations which against the “generalized equiv-
alence principle” of Einstein.
The absence of “special” coordinate frames (like the inertial) and their
substitution from “general” (non-inertial) coordinate systems lead in naming
Einstein’s theory for gravity “General Theory of Relativity”.
10.4 Solutions of Einstein’s Equations (*) 175
Since the source of the gravitational field is a tensor (Tµν ) the field should
be also described by a 2nd order tensor e.g. Fµν . Since the role of the gravita-
tional potential is played by the metric tensor then Fµν should be a function of
the metric tensor gµν and its 1st and 2nd order derivatives. Moreover, the law
of energy-momenum conservation implies that T µν ;µ = 0 which suggests that
F µν ;µ = 0 Then since F µν should be a linear function of the 2nd derivative
of gµν we come to the following form of the field equations (how & why?):
F µν = Rµν + ag µν R + bg µν = κT µν (10.2)
8πG
where κ = c4 . Then since F µν ;µ = 0 there should be
which is possible only for a = −1/2. Thus the final form of Einstein’s equations
is:
1
Rµν − g µν R + Λg µν = κT µν . (10.4)
2
8πG
where Λ = c2 ρv is the so called cosmological constant.
In the absence of strong gravitational fields and for small velocities both Ein-
stein & geodesic equations reduce to the Newtonian ones.
• Geodesic equations:
d2 xµ α
µ dx dx
β
d2 t/ds2 dxµ d2 xj
+ Γαβ = − ⇒ ≈ g00,j (10.5)
dt2 dt dt (dt/ds)2 dt dt2
2 k
If g00 ≈ η00 + h00 = 1 + h00 = 1 + 2 cU2 then ddtx2 ≈ ∂x
∂U
k
• Einstein equations
1 1
Rµν = −κ Tµν − gµν T ⇒ ∇2 U = κρ (10.6)
2 2
where κ = 8πG.
Here we have used the following approximations:
j 1 j
Γ00 ≈ g00,j and R00 ≈ Γ00,j ≈ ∇2 U
2
where we have set r̃2 = D(t, r) and we redefined the A, B̃ and C̃. The next
step will be to introduce a new timelike coordinate t̃ as
0 1 0
dt̃ = Φ(t, r̃) A (t, r̃)dt − B (t, r̃)dr̃
2
where Φ(t, r̃) is an integrating factor that makes the right-hand side an exact
differential.
By squaring we obtain
1
dt̃2 = Φ2 A02 dt2 − A0 B 0 dtdr̃ + B 02 dr̃2
4
Thus by defining the new functions  = 1/(A0 Φ)2 and B̂ = C + B 0 /(4A0 ) the
metric 10.8 becomes diagonal
in the absence of matter (outside the source) Rµν = 0 we can prove that
λ(r) = −ν(r), i.e. the solution is independent of time (how and why?).
−1
2GM 2 2 2GM
2
dr2 − r2 dθ2 + sin2 θdφ2
ds = 1 − c dt − 1 −
rc2 rc2
and from gλµ ẋλ ẋµ = 1 (here we assume a massive particle) we get :
Then from equations (10.23) and (10.24) we can easily prove (how?) that:
d 2
r φ̇ = 0 ⇒ r2 φ̇ = L = const : (Angular Momentum) (10.26)
dτ
d ν
e ṫ = 0 ⇒ eν ṫ = E = const : (Energy) (10.27)
dτ
• For a massive particle with unit rest mass, by assuming that τ is the
affine parameter for its motion we get pµ = ẋµ . Thus
————
Finally, by substituting eqns (10.27) and (10.26) into (10.25) we get the
“energy equation” for the r-coordinate
eν 2
ṙ2 +
L + eν = E 2 (10.29)
r2
which suggests that at r → ∞ we get that E = 1.
By combining the 2 integrals of motion and eqn (10.25) we can eliminate
the proper time τ to derive an equation for a 3-d path of the particle (how?)
2
E 2 − 1 2M u
d
u + u2 = + + 2M u3 (10.30)
dφ L2 L2
dr dr dφ L dr
ṙ = = = 2
dτ dφ dτ r dφ
The previous equation can be written in a form similar to Kepler’s equation
of Newtonian mechanics (how?) i.e.
d2 M
u + u = 2 + 3M u2 (10.31)
dφ2 L
For the radial motion φ is constant, which implies that L = 0 and eqn (10.29)
reduces to
ṙ2 = E 2 − eν (10.32)
and by differentiation we get an equation which reminds the equivalent one
of Newtonian gravity i.e.
M
r̈ = 2 . (10.33)
r
• If a particle is dropped from the rest at r = R we get that E 2 =
ν(R)
e = 1 − 2M/R and (10.32) will be written
2 1 1
ṙ = 2M − (10.34)
r R
180 10 Einstein’s Theory of Gravity
which is again similar to the Newtonian formula for the gain of kinetic energy
due to the loss in gravitational potential energy for a particle (of unit mass)
falling from rest at r = R.
• For a particle dropped from the rest at infinity E = 1 and the
geodesic equations are simplified
r
dt −ν dr 2M
=e and =− (10.35)
dτ dτ r
The component of the 4-velocity will be
r !
µ dxµ −ν 2M
u = = e ,− ,0,0 (10.36)
dτ r
Fig. 10.4. Radial fall from rest towards a Schwarschild BH as described by a co-
moving observer (proper time τ ) and by a distant observer (Schwarschild co-
ordinate time t)
r2 M
L2 = (10.41)
r − 3M
if we also put ṙ = 0 in eqn (10.29) we get :
1 − 2M/r
E=p (10.42)
1 − 3M/r
Circular orbits will be bound for E/(m0 c2 ) = E < 1, so the limit on r for an
orbit to be bound is given by E = 1 which leads to
2
(1 − 2M/r) = 1 − 3M/r true when r = 4M or r = ∞ (10.43)
Thus over the range 4M < r < ∞ circular orbits are bound.
From the integral of motion r2 φ̇ = L and eqn (10.41) we get
2
dφ M
= 2 (10.44)
dτ r (r − 3M )
NOTICE : This equation cannot be satisfied for circular orbits with r <
3M . Such orbits cannot be geodesics and cannot be followed by freely falling
particles.
182 10 Einstein’s Theory of Gravity
Fig. 10.5. The variation of E = E/(m0 c2 ) as a function of r/M for a circular orbit
of a massive particle in the Schwarzchild geometry
Fig. 10.6. The shape of a bound orbit outside a spherical star or a black-hole
According to the previous discussion the closest bound orbit around a massive
body is at r = 4M , however we cannot yet determine whether this orbit is
stable.
In Newtonian theory the particle motion in a central potential is described
by:
10.5 Schwarzschild Solution (****) 183
2
1 dr
+ Veff (r) = E 2 (10.46)
2 dt
M L2
Veff (r) = − + 2 (10.47)
r 2r
Fig. 10.7.
The bound orbits have two turning points while the circular orbit corre-
sponds to the special case where the particle sits in the minimum of the of
the effective potential.
In GR the ‘energy’ equation (10.29) is
eν 2
ṙ2 + L + eν = E 2 (10.48)
r2
which leads to an effective potential of the form
eν 2 M L2 M L2
Veff (r) = 2
L + eν = − + 2− 3 (10.49)
r r 2r r
Circular orbits occur where dVeff /dr = 0 that is:
dVeff M L2 3M L2
= 2 − 3 + (10.50)
dr r r r4
so the extrema are located at the solutions of the eqn M r2 − Lr + 3M L2 = 0
which occur at
L p
r= L ± L2 − 12M 2
2M
√ √
Note that if L = 12M = 2 3M then there is only one extremum and no
turning points in the orbit for lower values of L. Thus the innermost stable
circular orbit (ISCO) has
√
rISCO = 6M and L = 2 3M
184 10 Einstein’s Theory of Gravity
Fig. 10.8. The dots indicate the locations of stable circular orbits which occur
at the local minimum of the potential. The local maxima in the potential are the
locations of the unstable circular orbits
and it is unique in satisfying both dVeff /dr = 0 and d2 Veff /dr2 = 0, the latter
is the condition for marginal stability of the orbit.
• The upper shows circular orbits. A stable (outer) and an unstable (in-
ner) one.
• The lower shows bound orbits , the particle moves between two turning
points marked by dotted circles.
• The upper shows a scattering orbit the particle comes in from infinity
passes around the center of attraction and moves out to infinity again.
• The lower shows plunge orbit , in which the particle comes in from
infinity.
Photons as any zero rest mass particle move on null geodesics. In this case
we cannot use the proper time τ as the parameter to characterize the motion
and thus we will use some affine parameter σ.
We will study photon orbits on the equatorial plane and the equation of
motion will be in this case
eν ṫ = E (10.51)
ν 2 −ν 2 2 2
e ṫ − e ṙ − r φ̇ = 0 (10.52)
2
r φ̇ = L (10.53)
eν 2
ṙ2 + L = E2 (10.54)
r2
while the equivalent of eqn (10.31) is
d2
u + u = 3M u2 (10.55)
dφ2
pφ = g φµ pµ = g φt pt + g φφ pφ and pt = g tµ pµ = g tt pt + g tφ pφ
dφ pφ g tφ 2M a
= t = tt = 3 = ω(r)
dt p g r
188 10 Einstein’s Theory of Gravity
Useful constants
Useful Constants in geometrical units
d2 M
u + u = 2 + 3M u2 (10.31)
dφ2 L
M
u= [1 + e cos(φ + φ0 )] (10.60)
L2
where e and φ0 are integration constants.
• We can set φ0 = 0 by rotating the coordinate system by φ0 .
• The other constant e is the eccentricity of the orbit which is an
ellipse if e < 1.
10.7 The Classical Tests 189
Now since the term 3M u2 is small we can use perturbation theory to get
a solution of equation (10.31).
d2 M 3M 3 2 M 3M 3 6eM 3
2
u + u ≈ 2 + 4 [1 + e cos(φ)] ≈ 2 + 4 + cos(φ)
dφ L L L L L4
Because, 3M 3 /L4 M/L2 and can be omitted and its corrections will be
small periodic elongations of the semiaxis of the ellipse.
The term 6eM 3 /L4 cos(φ) is also small but has an accumulative effect
which can be measured. Thus the solution of the relativistic form of Kepler’s
equation (10.61) becomes (k = 3M 2 /L2 )
3eM 2
M M
u = 2 1 + e cos φ + 2
φ sin φ ≈ 2 {1 + e cos[φ(1 − k)]} (10.61)
L L L
Fig. 10.13.
2nπ 2π 6πM 2
φn = ⇒ φn+1 − φn = ≈ 2π(1 + k) = 2π + (10.62)
1−k 1−k L2
6πM 2
δφ ≈ (10.63)
L2
190 10 Einstein’s Theory of Gravity
Fig. 10.14.
Fig. 10.15.
Since the term 3M u2 is very small we can substitute u with the Newtonian
solution (10.66) and we need to solve the non-homogeneous ODE
d2 u 3M
+ u = 2 cos2 φ . (10.67)
dφ2 b
admitting a solution of the form
cos φ M
+ 2 1 + sin2 φ
u= (10.68)
b b
which for distant observers (r → ∞) i.e. u → 0, and we get a relation between
φ, M and the parameter b
cos φ M
+ 2 1 + sin2 φ = 0
(10.69)
b b
and since r → ∞, this means that φ → π/2 + and cos φ → 0 + and
sin φ → 1 − we get:
192 10 Einstein’s Theory of Gravity
2M
≈− (10.70)
b
Since, φ → π/2 + 2M/b for r → ∞ on the one side & φ → 3π/2 − 2M/b on
the other side the total deviation will be the sum of the two i.e.
4M
δφ = . (10.71)
b
For a light ray tracing the surface of the Sun gives a deflection of ∼ 1.7500 .
The deflection of light rays is a quite common phenomenon in Astronomy
and has many applications. We typically observe “crosses” or “rings”
Fig. 10.16. Einstein Cross (G2237+030) is the most characteristic case of gravi-
tational lens where a galaxy at a distance 5 × 108 lys focuses the light from a quasar
who is behind it in a distance of 8×109 lys. The focusing creates 4 symmetric images
of the same quasar. The system has been discovered by John Huchra.
Fig. 10.17. “Einstein rings” are observed when the source, the focusing body and
Earth are on the same line of sight. This ring has been discovered by Hubble space
telescope.
Fig. 10.18. Let’s assume 3 static observers on a Schwarszchild spacetime, one very close to the source of
the field the other in a medium distance from the source and the third at infinity.
1/2
2M
who measures the so called ‘ coordinate time” i.e. dτ1 = 1 − r1 dt and
1/2
dτ2 = 1 − 2M
r2 dt this means that
1/2
2M
dτ2 1− r2 M M
dτ1
= 1/2 ≈ 1 + r − r . (10.72)
2M 1 2
1− r1
A more recent test (late 60s) where the delay of the radar signals caused
by the gravitational field of Sun was measured. This experiment suggested
and performed by I.I. Shapiro and his collaborators. The line element ds2 =
gµν dxµ dxν for the light rays i.e. for ds = 0 on the equatorial plane has the
form
−1 2 2
2M 2M dr dφ
0= 1− − 1− − r2 (10.76)
r r dt dt
194 10 Einstein’s Theory of Gravity
Fig. 10.19.
For the study of the radial motion we should substitute the term dφ/dt from
the integrals of motion (10.26) and (10.27) i.e. we can create the quantity:
−1
L 2M dφ
D= = r2 1 − (10.77)
E r dt
At the point of the closest approach to the Sun, r0 , there should be dr/dt = 0
r02
and thus we get the value of D2 = 1−2M/r 0
. Leading to an equation for the
radial motion:
dr
2M
r 2 1 − 2M/r 1/2
0
= 1− 1− (10.79)
dt r r 1 − 2M/r0
leading to
Z r1
dr
t1 = q 2 1−2M/r
1 − 2M 1 − rr0 1−2M/r
r0
r 0
" p #
2 2
r
r1 r1 − r0 r1 − r0
q
2 2
= r1 − r0 + 2M ln +M
r0 r1 + r0
2r1
≈ r1 + 2M ln +M
r0
For flat space we have t1 = r1 i.e. the term t̃1 = 2M ln(2r1 /r0 ) + M is
the relativistic correction for the first part of the orbit in same way we get
a similar contribution as the signal returns to Earth. Thus the total “extra
time” is:
4r1 r2
∆T = 2 t̃1 + t̃2 = 4M 1 + ln . (10.80)
r02
10.7 The Classical Tests 195
Fig. 10.20. Comparison of the experimental results with the prediction of the
theory. The results are from I.I. Shapiro’s experiment (1970) using Venus as reflector.
Fig. 10.21.
11
Solutions of Einstein’s Equations & Black
Holes
while the
2M 02 2M 4M
2
ds ≡ 0 = 1 − dt − 1 + dr2 ± drdt0
r r r
0 2M 0 2M
(dt + dr) 1− dt − 1 + dr = 0
r r
11.1 Schwarzschild Solution: Black Holes (***) 199
dr dr r − 2M
= −1 and =
dt0 dt0 r + 2M
32M 3 −r/2M
ds2 = dv 2 − du2 − r2 dΩ 2
e
r
du
du − dv 2 = 0 ⇒
2
= ±1
dv
In a maximal spacetime, particles can appear or disappear, but only at sin-
gularities.
200 11 Solutions of Einstein’s Equations & Black Holes
ds2 ≡ 0 = g00 dt2 + grr dr2 + gφφ dφ2 + 2g0φ dφdt at θ = π/2
2
dφ dφ
for constant r i.e. dr = 0 we get 0 = g00 + gφφ dt + 2g0φ dt then at
g00 = 0 we will get
1
If ∆ = r2 − 2M r + a2 + Q2 the we get the so called Kerr-Newman solution which
describes a stationary, axially symmetric and charged spacetime.
202 11 Solutions of Einstein’s Equations & Black Holes
dφ dφ gφφ
=0 and =− (11.5)
dt dt 2g0φ
The light cone in the radial direction:
2
2 2 dr g00 ∆(ρ2 − 2M r)
0 = g00 dt + grr dr ⇒ =− = (11.6)
dt grr ρ4
thus the horizon(s) will be the surfaces:
p
∆ = r2 − 2M r + a2 = 0 ⇒ r± = M ± M 2 − a2 (11.7)
while Hawking (‘70) suggested that the area of the BH horizon cannot be
reduced via the Penrose process. The two statement are equivalent!
∂p µν
T µν ;ν = g + (ρ + p)Γ µ 00 u0 u0 = 0 (11.11)
∂xν
where Γ µ 00 = − 12 g µν g00,ν = − 21 eν ν 0
...multiplying with gµλ we get
∂p 1 ∂ν dp 1 dν
λ
= − (ρ + p) λ ⇒ = − (ρ + p) (11.12)
∂x 2 ∂x dr 2 dr
which is the relativistic version of the equations for hydrodynamical equilib-
rium, since in the Newtonian limit g00 = eν ≈ 1 + 2U and ρ >> p which
leads to:
∂p ∂U
= −ρ (11.13)
∂r ∂r
Still we need to find a way, via Einstein’s equations
1
Rµν = −8π Tµν − gµν Tλλ (11.14)
2
to estimate ν(r) in the same way that we need to solve Poisson equation to
estimate the gravitaional potential U .
h r i
{θθ} : 1 − e−λ 1 + (ν 0 − λ0 ) = −4πr2 (ρ − p) (11.15)
2
ν 00 (ν 0 )2 ν 0 λ0 λ0
{rr} : + − − = 4πeλ (ρ − p) (11.16)
2 4 4 r
ν 00 (ν 0 )2 ν 0 λ0 ν0
{tt} : + − + = −4πeλ (3ρ + p) (11.17)
2 4 4 r
11.4 The Tolman-Oppenheimer-Volkov (TOV) Solution (-) 205
0
which leads to the following ODE rλ0 + eλ − 1 = 8πeλ r2 ρ or re−λ =
1 − 8πρr2 leading to
Z r
−1
eλ(r) = (1 − 2M/r) where M (r) = 4π ρ(r0 )r02 dr0 (11.18)
0
dp M + 4πr3 p dν M + 4πr3 p
= −(ρ + p) or =2 . (11.19)
dr r(r − 2M ) dr r(r − 2M )
For the special case ρ = const there is an analytic solution. For example the
mass function will become:
4 3 4 3
M (r) = πr ρ for r≤R and M (r) = πR ρ for r≥R
3 3
The we get:
p (1 − 2M r2 /R2 )1/2 − (1 − 2M/R)1/2
= . (11.20)
ρ 3(1 − 2M/R)1/2 − (1 − 2M r2 /R3 )1/2
But substituting the above relations in (11.12) we get an analytic solution
for g00 i.e.
1/2 1/2
2M r2
3 2M 1
eν/2 = 1− − 1− . (11.21)
2 R 2 R3
Maximum allowed mass when p(r = 0) → ∞ :
M 4
=
R 9
A
Useful constants in geometrical units