Preface to the second
edition
There has not been any drastic changes in the present edition. Of course
some of the typographical and other smaH mistakes, posted on the internet
page for the book, have been corrected. In addition, we have added OCC8Ssional comments and clarifications at some places, hoping that they would
make the discussions more accessible to a beginner.
The only serious departure from the first edition has been in the notation for spinor solutions of the Dirac equation. In the previous edition,
we put a subscript on the spinors which corresponded to their helicities.
In this edition, we change the notation 'towards what is more conventionaL
This notation has been introduced in 4.3 and explained in detail in 4.6.
We have received many requests for posting the answers to the exercises.
In this edition, we have added answers to some exercises which actually
require an answer. The web page for the errata of the book is now at
http://tnp.saha.ernet.in/~pbpal/books/qft/errata.html
If you find any mistake in the
presen~ edition,
please inform us through this
web page.
It is a pleasure to thank everyone who has contributed to this book
by pointing out errors or asking for clarifications. On the errata page we
have acknowledged all correspondence which have affected some change. In
particular, comments by C. S. Aulakh, Martin Einhorn and Scan Murray
were most helpful.
Amitabha Lahiri
Palash B. Pal
July 2004
Preface to the first
edition
It has been known for more than fifty years that Quantum Field Theory
is necessary for describing precision experiments involving electromagnetic
interactions. Within the last few decades of the twentieth century it has also
become clear that the weak and the strong interactions are well described
by interacting quantum fields. Although it is quite possible that at even
smaller length scales some other kind of theory may be operative, it is clear
today that quantum fields provides the appropriate framework to describe
a wide class of phenomena in the energy range covered by all experiments
to date.
In this book, we wanted to introduce the subject as a beautiful but
essentially simple piece of machinery with a wide range of applications.
This book is meant as a textbook for advanced undergraduate or beginning
postgraduate students. For this reason, we employ canonical quantization
throughout the book. The name of the book is an echo of various children's texts that were popular a long time ago, not a claim to primacy or
originality.
Our approach differs from many otherwise excellent textbooks at the
introductory level which set up the description of electrons and photons
as their goal. For example, decays are rarely discussed, since the electron
and the photon are both stable particles. However l decay processes are in
some sense simpler than scattering processes, since the former has only one
particle in the initial state whereas the latter has two.
We felt that the basic machinery of Feynman diagrams could be introduced through decay processes even before talking about the quantization
of spinl fields. With that in mind , we start with some introductory material in Chapters 1 and 2 and discuss the quantization of scalar fields in
Chapter 3 and ofspin4 fields in Chapter 4. Unlike many other texts at this
level, we use a fermion normalization that should he applicable to massless
as well as massive fermions. After that we discuss the generalities of the
Smatrix theory in Chapter 5, and the methods of calculating Feynman
diagrams, decay rates, scattering cross sections etc. with spinO and spin!
fields, in Chapters 6 and 7.
The quantization of the spinl fields, with special reference to the pho
vii
Vlll
Preface
ton, is taken up next in Chapter 8. After this, we have a detailed introduction to quantum electrodynamics in Chapter 9, where we introduce the
crucial concept of gauge invariance and give detailed derivations of important scattering processes at their lowest orders in perturbation theory.
Discrete symmetries can serve as a good guide in calculating higher order corrections. With this in mind, we discuss parity, time reversal, charge
conjugation, and their combinations, in Chapter ] O. Unlike most other
textbooks on the subject which describe the P, T, C transformations only
in the PauliDirac representation of the Dirac matrices, we present them in
a completely representationindependent way. Since other representations
such as the Majorana or the chiral representations are very useful in some
contexts, we hope that the general formulation will be of use to students
and researchers.
We calculate loop diagrams in Chapter 11, showing how to use symmetries of a problem to parametrize quantum corrections. In this chapter, we
restrict ourselves to finite contributions only. Some basic concepts of renormalization are then described, with detailed calculations, in Chapter 12.
Only the electromagnetic gauge symmetry is used in these two chapters for
illustrative purpose. But this leads to a more general discussion of symmetries, which is done in Chapter 13. This chapter also discusses the general
ideas of symmetry breaking, and the related physics of NambuGoldstone
theorem as well as its evasion through the Higgs mechanism. This is followed by an introduction to the YangMills (or nonAbelian) gauge theories
in Chapter 14. Finally, a basic introduction to standard electroweak theory and electroweak processes is given in Chapter 15. In keeping with our
overall viewpoint, we discuss decays as well as scattering processes in this
chapter.
We have tried to keep the book at the elementary level. In other words,
this is a book for someone with no prior knowledge of the subject, and only
.a reasonable familiarity with special relativity and quantum mechanics. To
set the stage as well as to help the reader, we visit briefly the relevant
parts of classical field theory in Chapter 2. A short but comprehensive
introduction to group theory also appears in Chapter 13, since we did not
want to assume any background in group theory for the reader.
The most important tools to help the reader are the exercise problems
in the book. These problems are not collected at the ends of chapters.
Instead, any problem appears in the place of the text where we felt it
would be most beneficial for the reader to have it worked out. Working it
out at that stage should also prepare the reader for the ensuing parts of
the chapter. Even if for some reason the reader does not want to work out
a problem at that stage, we suggest strongly to at least read carefully the
statement of the problem before proceeding further. Some of the problems
come with notes or hints, some come with a relation that might be useful
later. A few of the problems are marked with a * sign, implying that they
Preface
IX
might be a little hard at that stage of the book, and the reader can leave
it at that point to visit it later. A few have actually been worked out later
in the book, but we have not marked them.
The book was submitted in a cameraready form to the publishers. This
means that we are responsible for all the mistakes in the book, including
typographical ones. We have spared no effort to avoid errors, but if any
has crept in, we would like to hear about it frum the reader. We have set
up a web site at
http://tnp. saha. ernet. in/rvpbpal/qftbk .html
containing errata for the hook, and a way of contacting us.
The book grew out of courses that both of us have taught at several
universities and research institutes in India ami ahroad. We have ben
efited from the enquiries and criticisms of OUl' students and colleagues.
Indrajit Mitra went through the entire manw;cript carefully and offered
numerous suggestions. Many other friends and colleagues also read parts
of the manuscript and made useful comments, in particular Kaushik Bhat
tacharya, Ed Copeland, A. Harindranath, H. S. Mani, Jose Nieves, Saurabh
Rindani. Those who have taught and influenced liS through their lectures,
books and papers are too numerous to name sepa.rately. We thank them
all.
We thank our respective institutes for extending various facilities while
the book was being written.
Amitabha Lahiri
Palash B. Pal
April 2000
Notations
Spacetime indices of a vector or tensor.
JJ., II,
i,i,
Spatial indices of a vector or tensor.
(Components of) metric tensor, diag(l, 1, 1, 1).
pJJ contravariant 4vector.
gpol.J
P,.. covariant 4vectof.
p 3vector.
p Magnitude of the 3vector p, i.e., Ipl. We have used p2 and p2
interchangeably. The magnitude of the coordinate 3vector has
been denoted by r.
a b Scalar product of 3vectors a and b, a . b = aib i == I: aib i .
a b Scalar product of 4vectors a and b, a b = al'b,..
aObo  a b.
= L: aPb,..
;. "'(/Jaw
!L' Lagrangian density, frequently called Lagrangian.
L Total Lagrangian (; f d3 x Z).
PI Action (; f dt L ; f d4~ Z) .
.Ye Hamiltonian density, frequently called Hamiltonian.
H Total Hamiltonian (; f d3 x').
lA, Blp Poisson bracket of A and B.
lA, BI_ Commutator AB  BA.
lA, BI+ Anticommutator AB + BA.
a(p), at (p) Annihilation and creation operators for antiparticles of particles
created by at(p) and annihilated by a(p), first encountered in
3.6.
8(x) Unit step function, defined in Eq. (3.13).
:[000]: Normal ordered product, first defined in 3.4.
5' Ij Time ordered product, first defined in 3.7.
xi
Notations
Xli
AT Transpose of t.he matrix A.
At For any matrix A, At =::. !'oAfl'D .
(Ti Pauli matrices, given in Eq. (4.55).
r i Same as (Ti, hut thought of as generators of some internal SU(2)
symmetry.
le (p, s)) Electron state of 3mornentum p and spin s, first defined in 6.2.
6.F(p) Feynman propagator for scalar field in momentum space.
SF(p) Feynman propagator for fermion field in momentum space.
DJw{p) Feynman propagator for vector hoson field in momentum space.
e Electric charge of proton. Electron carries charge e.
a Fine structure constant, first defined in 1.5.
e Electron.
"'Y Photon.
S/i Smatrix element between initial state Ii) and final state If}.
.4lfi Feynman amplitude between initial sLate
1_4f1 2
,
lit
Ii} and final state If).
Magnitude squared of Feynman amplitude after making spin
and polarization sums and averages.
Polarization vector for a vector boson.
P Parity transformation operator, defined in Ch. 10.
C Charge conjugation operator.
T Time reversal operator.
P Matrix of parity transformation acting on a fermion.
C Matrix of charge conjugation acting on a fermion.
T Matrix of time reversal acting on a fermion.
x Parity transformed system of coordinate::;, i = (t, x).
r It General electromagnetic vertex, first. defined in 11.1.
E
For dimen::;ional regularization in N dimensions,
= 2
!N.
dN Rank of I'rnatrices in Ndimensional spacetime, tr hpl'v) =
d N 9l w
'
Ta Generators of a Lie group.
fabc Structure constants of a Lie group, IT", Tbl = ifnbcTc.
DI Gauge covariant derivative D, =
!
a/. + i9I:,A~.
Contents
Preface to the second edition
Preface to the first edition
vii
Notations
xi
1 Preliminaries
1.1 Why Quantum Field Theory
1.2 Creation and annihilation operators
1.3 Special relativity . . . . . . .
1.4 Space and time in relativistic quantum theory.
1.5
Natural units . . . .
1
1
3
5
8
9
2 Classical Field Theory
2.1 A quick review of particle mechanics
2.1.1 Action principle and EulerLagrange equations
2.1.2 Hamiltonian formalism and Poisson brackets
2.2 EulerLagrange equations in field theory .
2.2.1 Action functional and Lagrangian
2.2.2 EulerLagrange equations
2.3 Hamiltonian formalism.
2.4 Noether's theorem . . . .
12
12
12
14
15
15
17
19
21
3 Quantization of scalar field3
3.1 Equation of motion .
. . . . .
3.2 The field and its canonical quantization
3.3 Fourier decomposition of the field.
3.4 Ground state of the Hamiltonian and Ilormal ordering
3.5 Fock space. ..
3.6 Complex scalar field .
3.6.1 Creation and annihilation operators
3.6.2 Particles and antiparticles . . .
3.6.3 Ground state and Hamiltonian
3.7 Propagator .. . . . . .
28
28
29
30
xiii
34
36
37
37
39
40
41
Contents
XIV
4 Quantization oj Dirac fields
4.1 Dirac Hamiltonian
. . . . .
. . . . .
4.2 Dirac equation ..
4.3
47
47
51
4.5
4.6
Plane wave solutions of Dirac equation.
4.3.1 Positive and negative energy spinor::;
4.3.2 Explicit solutions in DiracPauli representation
Projection operators . ..
4.4.1 Projection operators for positive and negative energy
states
. . . . . . .
4.4.2 Helicity projection operators .
4.4.3 Chirality projection operators.
4.4.4 Spin projection operators .
Lagrangian for a Dirac field . .
Fourier decomposition of the field.
59
60
61
62
63
65
4.7
Propagator
69
4.4
5 The 5 matrix expansion
5.1
5.2
5.3
5.4
Examples of interactions
Evolution operator
5matrix.
Wick\; theorem . .
54
54
56
59
72
73
75
80
82
6 Prom Wick expansion to Feynman diagrams
6.1 Yukawa interaction: decay of a scalar .
6.2 Normalized states. . . . . . . . .
6.3 Sample calculation of a matrix element.
6.4 Another example: fermion scattering
6.5 Feynman amplitude
6.6 Feynman rules
6.7 Virtual particle~
6.8 Amplitudes which are not Smatrix elements
87
87
94
97
101
105
106
110
112
7 Cross sections and decay rates
7.1 Decay rate. . . . . . . . . . . . . . . . . . . . . . . . . .
7.2 Examples of decay rate calculation. . . . . . . . . . . .
7.2.1 Decay of a scalar into a fermionanti fermion pair
7.2.2 Muon decay with 4fermion interaction.
7.3 Scattering cross section
7.4 Generalities of 2to2 scattering
7.4.1 eM frame. . .
7.4.2 Lab frame. . . . . . . .
7.5 Inelastic scattering with 4fermion interaction
7.5.1 Crosssection in CM frame
7.5.2 CroSl:isection in Lab frame
7.6 Mandelstam variahles
115
115
117
117
122
130
133
135
137
140
142
143
144
Contents
xv
8 Quantization of the electromagnetic field
8.1 Classical theory of electromagnetic fields.
8.2 Problems with quantization . . . .
8.3 Modifying the classical Lagrangian
8.4 Propagator . . . . . . . . . . . ..
8.5 Fourier decomposition of the field ".
8.6 Physical states . . . . . . . . .
8.7 Another look at the propagator
8.8 Feynman rules for photons.
146
146
149
150
153
156
158
162
164
9 Quantum electrodynamics
9.1 Local gauge invariance .
9.2 Interaction Hamiltonian
9.3 Lowest order processes .
9.4 Electronelectron scattering
9.5 Electronpositron scattering
9.6 ee+ + ~ ~+
.
9.7 Consequence of gauge invariance
9.8 Compton scattering
.
9.9 Scattering by an external field . .
9.10 Bremsstrahlung
.
166
166
170
172
174
180
182
184
185
194
197
lOP, T, C and their combinations
10.1 Motivations from classical physics .
200
200
201
201
202
204
205
207
207
211
212
212
213
216
217
218
10.2 Parity
.
10.2.1 Free scalar fields
10.2.2 Free Dirac field .
10.2.3 Free photon field
10.2.4 Interacting fields
10.3 Charge conjugation.
10.3.1 Free fields .
10.3.2 Interactions.
10.4 Time reversal . . . .
10.4.1 Antilinearity
10.4.2 Free fields .
10.4.3 Interactions
10.5 CP .
10.6 CPT
.
11 Electromagnetic form factors
11.1 General electromagnetic vertex
11.2 Physical interpretation of form factors
11.2.1 Charge form factor F 1 . . . . .
11.2.2 Anomalous magnetic mOE1ent F2
11.2.3 Electric dipole moment F2 . . . .
222
222
224
224
228
228
xvi
Contents
11.2.4 Anapole moment F3 . . . . . . . . .
11.3 Anomalous magnetic moment of the electron
11.4 Charge form factor . . . .
11.5 Electron proton scattering . . . . .
229
230
239
241
12 Renormalization
12.1 Degree of divergence of a diagram
.
12.1.1 Superficial degree of divergence . . . . .
12.1.2 Superficial vs. real degree of divergence
12.5 General forms for divergent amplitudes .
12.5.1 Fermion selfenergy.
12.5.2 Vacuum polarization
.
12.5.3 Vertex function . . . . . . . . .
12.6 Regularization of selfenergy diagrams
12.6.1 Vacuum polarization diagram
12.6.2 Fermion selfenergy diagram.
12.7 Counterterms . . .. .
.
12.7.1 Vacuum polarization diagram
12.7.2 Fermion selfenergy diagram.
12.7.3 Vertex function
.
245
245
245
248
250
252
253
256
256
257
259
260
260
265
267
267
270
272
12.8 FUll Lagrangian
273
12.2 Specific examples in QED
12.3 Outline of the program
12.4 WardTakahashi identity
12.9 Observable effects of renormalization
12.9.1 Modification of Coulomb interaction
12.9.2 Running coupling constant . . . . .
12.9.3 Cancellation of infrared divergences
13 Symmetries and symmetry breaking
13.1 Classification of symmetries
13.2 Groups and symmetries
.
13.2.1 Symmetry group
.
13.2.2 Examples of continuous symmetry groups
13.2.3 Generators of continuous groups
13.2.4 Representations
.
13.3 Approximate symmetries. . . . . . .
13.4 Spontaneous breaking of symmetries
13.4.1 Discrete symmetry
.
13.4.2 U(I) symmetry
13.4.3 NonAbelian symmetry
13.5 Goldstone's theorem
.
13.5.1 Appearance of massless states.
13.5.2 Examples of NamhuGoldstone hosons.
275
275
276
279
283
283
284
284
286
289
292
293
294
295
299
300
301
301
303
Contents
13.5.3 Interaction of Goldstone bosons .
13.6 Higgs mechanism . . . . . . . . . . . . .
xvii
305
308
14 YangMilis theory oj nonAbelian gauge fields
14.1 Gauge fields of nonAbelian symmetry .
14.2 Pure gauge Lagrangian. . . . . . . . . . . .
14.3 Interactions of nonAbelian gauge fields . .
14.3.1 Gauge interactions of other particles
14.3.2 Selfinteractions of gauge bosons . .
14.4 Equations of motion and conserved currents
14.5 Quantization of nonAbelian gauge fields.
14.6 Quantum Chromodynamics . . . . . . . . .
312
312
315
318
318
319
321
322
323
15 Standard electroweale theory
15.1 Gauge group . . . . . . . .
15.1.1 Choice of gauge group.
15.1.2 Pure gauge Lagrangian
15.2 Spontaneous symmetry breaking
15.2.1 Introducing the Higgs boson multiplet
15.2.2 Gauge boson masses
15.2.3 Scalar modes . . .
15.3 Fermions in the theory.
15.3.1 Gauge interactions
15.3.2 Electron mass. . .
15.3.3 Yukawa couplings.
15.3.4 Other fermions in the model
15.4 Gauge boson decay . . . . . . . . . .
15.5 Scattering processes
15.5.1 Forwardbackward asymmetry
15.5.2 Low energy weak interactions
15.5.3 High energy scattering. .
15.6 Propagator for unstable particles
15.7 Global symmetries of the model.
326
326
326
328
330
330
332
333
335
335
338
339
340
342
346
346
349
351
355
356
A Useful formulas
A.l Representation of 1'matrices
A.2 Traces of 1'matrices . . . .
A.3 The antisymmetric tensor . .
A.4 Useful integration formulas .
A.4.1 Angular integrations in Ndimensional space
A.4.2 Momentum integration in loops.
358
358
360
363
364
364
366
B Answers to selected exercises
368
Index
371
Chapter 1
Preliminaries
The era of quantum mechanics began when Planck postulated that
the modes of radiation of angular frequency w could be counted as
particles with energy E = fu.J, and derived the iaw of blackbody
radiation without running into infinities. The physical implications
of this fundamental postulate became clearer when Einstein showed
that quantization of radiation also explained the frequency dependence of photoelectric effect. This soon led to a proliferation of new
ideas due to Bohr, Dirac, Born, SchrOdinger, Heisenberg and many
others, who appiied the idea of quantization to particles and found
remarkable success in describing subatomic phenomena. Dirac even
constructed a relativistic theory of the electron, which was in excellent agreement with subatomic experiments. Despite its success,
there were some essential shortcomings of quantum mechanics of particles.
1.1
Why Quantum Field Theory
One of these shortcomings was philosophical. Quantum mechanics
started with the description of light in terms of photons, but the classical description of light was in terms of propagating electromagnetic
fields. So a theory of photons required a prescription or how to quantize fields , a missing link between the two descriptions. As we shall
see in the next several chapters, the structure required to quantize
electromagnetic fields can be used to describe all elementary particles as quantum fields. Even the relativistic wave equation of Dirac
was found to have the simplest description ill terms of quantum fields
1
Chapter 1. Prelirninades
representing electrons and positrons.
Another problem of particle quantum mechanics is that it is valid
in the nonrelativistic regime by definition. This is not just because
it uses nonrelativistic Hamiltonians to solve various problems. In
fact, the whole design of nonrelativistic quantum mechanics defies
relativity. For example, it uses the concept of potentials, which is
untenable in any relativistic theory since it assumes the transfer of information at an infinite speed. Moreover, space and time are treated
very differently in nonrelativistic quantum mechanics. The spatial
coordinates are operators, whereas time is a parameter 1 and we typically study the evolution of different operators, including the spatial coordinates, in time. Dirac equation, although covariant, treats
space and time on different footings. In a truly relativistic theory,
space and time should merge into a spacetime, and one cannot make
such fundamental distinction between the spatial part and the temporal part.
There is a practical problem as well. In llature there are processes
in which new particles are created or annihilated. For example {3decay can be thought of as the decay of a neutron into a proton, an
electron and an antineutrino:
(1.1)
The neutron is annihilated in this proce~s, while the proton. the
electron etc are created. Quantum mechallic~ of particles deals with
stable particles and their motions in various potentials. Without
quantization of fields, any calculation involving the creation or annihilation of particles was essentially ad hoc if not downright wrong.
Quantum Field Theory, by incorporating creation aud annihilation
of quanta or particles as its essential feature, allowed meaningful
calculations of experimentally verifiable results.
When one goes to the relativistic regimc, onc cncounters the pos
sibility of creating new particles from ellcrgy. For example, if an
e1ectron and a positron collide with sufficient cllergy, the collision
process can create extra electronpositron pairs:
(1.2)
This of course call happell only if the total killctic energy of the initial
e e+ pair is larger than the rest mass energy of the extra pair to be
1.2. Creation and annihilation operators
created. To allow for such possibilities, one requires some mechanism
to describe particle creation.
Even when the initial and the final states of a problem involve
the same particles, a quantum process can go through intermediate
states involving other particles which are created and destroyed in
the process. Such intermediate states can contribute to the amplitude of the process and affect the result. It is important to consider
these possibilities when one wants to make precision tests of quantum
theory. The creation and annihilation of the intermediate particles
in such cases can be handled only by going over to Quantum Field
Theory.
1.2
Creation and annihilation operators
We now recall one problem in nonrelativistic quantum mechanics
which can be solved by the introduction of operators which create
and annihilate quanta. This is the problem of the simple harmonic
oscillator. Here we review this topic since it will be very useful for
building up Quantum Field Theory in subsequent chapters.
The Hamiltonian of the onedimensional oscillator is given by
H =
2~ (p2 + m 2w 2x 2)
(1.3)
We define the following combinations of x and p:
1
a=J1
(Pimwx),a t =J
(p+imwx),(1.4)
2mliw
2mliw
sa that the Hamiltonian is
H =
~Iiw (ala + aa t )
(1.5)
If p and x were purely classical objects, we could have written H =
aat 1iw = atanw. But they are not. In Quantum Mechanics, they are
operators which satisfy the commutation relation
[x,pj_ == xp  px = ih.
(1.6)
This implies that the objects a and at, defined in Eq. (1.4), are also
operators. Unlike x and p, these are nonhermitian operators, and
they satisfy the relation
[a,atl_
= 1.
(1.7)
Chapter 1. Preliminaries
Moreover, they have the crucial property which can easily be checked
directly from Eqs. (1.3) and (104):
[H,ul = fu.Ju,
IH,atl_ = /twa!.
(1.8)
If we use the commutation relation of Eq. (1.7), we can rewrite
the Hamiltonian as
(1.9)
The relations in Eq. (1.8) implies that, if In) is an eigenstate of H
with eigenvalue En. i.e., if
H In) = En In) ,
(1.10)
then the states a In) and at In) are also eigenstates:
Ha In)
= (En
 fu.J)a In),
= (En + fu.J)a t In)
Hat In)
. (1.11)
In other words, the operator a seems to annihilate a quantum of
energy, of amount hw, from the state. On the other hand, at creates
a quantum of energy. In this sense, they are the annihilation and the
creation operators, respectively. Of course, in any physical process
energy must be conserved, so the operators a or at cannot appear
alone in the expression of any measurable quantity. For example, in
the expression for the Hamiltonian they appear together as we see
in Eq. (1.9).
The ground state can be denoted by 10). Since this is the state
of lowest energy, the annihilation operator a, acting on it l cannot
produce a state of lower energy. Thus, this state must be totally
annihilated by the operation of a:
1
alO) = 0.
(1.12)
Using Eq. (1.9) now, we can easily find its energy eigenvalue:
H 10) =
~fu.J 10)
(1.13)
The first excited state, which we will denote by II), can be defined
by the state whose energy is larger than the ground state by one
quantum, i.e.,
II} = at 10) .
(1.14)
1.3. Special relativity
The normalized nth excited state is defined by
It is easy to see that the energy eigenvalue of the state In} is given
by
E n =(n+
2)1iw.
(U6)
o Exercise 1.1 If the ground state is norma.lized, i.e., if (O 10) = 1,
ShOUl tha.t the excited states defined bll Eq. (1.15) a.re also normalized.
Exercise 1.2 The number opera.tor is de.fined b1J
JV=ata.
(1.17)
Sh.om tha.t its eigen'Uo.lue in the state giuen in Eq. (1.15) is n.
o Exercise 1.3 Construct the Hamiltonian for n fS'ystem of N simple
harmonic oscillators of frequencies Wi. i = 1, ... , N.
Construct the normalized state in which the itll. oscilla.tor is in the
nith excited state.
Construct the number operator and
nbo'lle stnte is 2::i nj.
ShOUl
that its eigen1Julue in the
Exercise 1.4 Suppose in a. system there nre operators which abell
a.nticommuto.tion rela.tions [an a11+ == aral + a~ar = OrlJ and
lar,a.lll+ = O. foT' T.S = 1.,N. Construct the generic norm.o.l
\Zed. excited state. ShOUl that no state can huve two Of' mof'C qua.nta
of the same s'Pecies.
1.3
Special relativity
Since our goal is to discuss relativistic field theories. here we give a
quick summary of the mathematical structure of special relativity,
which will help in setting up the notation.
In special relativity, the three spatial coordinates and time define
the four components of the position 4vector, which we will denote by
x p . In keeping with established practice, we shall use Greek indices
J.l., v, A,
to denote components of a fourvector. and Latin indices
i,l, k, to denote its spatial components. In other words, Greek
Chapter 1. Preliminaries
indices take on values 0,1,2,3, while Latin indices take on values
1,2,3. Thus,
x" == (xO,x i ) == (cl,x) ,
(1.18)
where the factor of c is put in so that all components have dimensions
of length. Here c is the speed of light in the VlUOuum, which is frameindependent according to one of the fundamental axioms of special
relativity. The distance I between two points x and y in spacetime
can be written in terms of the Cartesian coordinates as
[2 =
3
~)Xi
(xo _ yO)2 _
_ yi)2
(1.19)
1=1
We can write this in a compact form by defining the metric tensor
9"v by
(1.20)
Here, the values of the various components of the metric tensor are
9"v=diag(I,I,I,I),
(1.21)
which means that 900 = +1, 911 = 922 = 933 = I, and 9"v = 0 if
J.I. # v. In addition, we have used the summation convention, which
says that any index which appears twice in the same term is summed
over. We can use the metric to lower the indices, e.g.:
(1.22) .
Similarly, one can use the inverse of the matrix gpoV} which will be
denoted by gpov} to raise the indices:
x lt =
glt
xv
(1.23)
= gJJ.\. }
(1.24)
The inverse matrix must satisfy
gPV gv>.
= 6~
where o~ denotes the Kronecker delta which is defined by
o~ =
>
{ 1 if I" = A
0 if I" ;l A.
(1.25)
1.3. Special relativity
Under a Lorentz transformation, the coordinates transform as
(1.26)
where the specific form of the matrix A is not very important for our
purposes. These transformations are defined to be the ones which
leave x~xJl invariant:
(1.27)
Since this must equal 9 pq X P X(1, we obtain the relation
9p.v A ~ p A" (1
 gpa
(1.28)
which the Lorentz transformation matrices must satisfy.
o Exercise 1.5 Using the definition in Eq. (1.22), show that the
Lorentz tra.nsjo1"'ffio.tion rule for
X JJ
is giucn b1J
(1.29)
o Exercise 1.6 [f 0. primed frame of reference moues with respect to
an unprimed onf'. with a. uniform 'llelocitll v along the common xnris, the Lorentz transformation equa.tions between the coordinates
in the two frames a.re gillen by
t' = '"((t  vx/c'),
x' = '"((x  vt),
y' = y,
z' = z,
(1.30)
'Where "t = (1  v 2 /c 2) 1/2. Show that these tra.nsfonna.tions can be
'Written in the joT'Tl\ oj Eq. (1.26) with A/ w = g/UI +WjJ.V for infin.itesimal unlues of v, wheTe WjJ.V = WVjJ.'
Any object that remains invariant like xjJ.xjJ. is a scalar quantity. Any
fourcomponent object that transforms like xl' is called a contravariant vector, whereas a fourcomponent object transforming like x!' is
a covariant vector.
The scalar product of two vectors A~ and B~ will be written as
g~"A~B"
= A~B~ = A~B~
= AoB o + A,B' = AOBo 
A B,
(1.31)
where A B is the usual threedimensional scalar product. If </>(x) is
a scalar function, so is a small change in it:
6</> = aa</>
x~
6x~.
(1.32)
Chapter 1. Preliminaries
Since the right side is a scalar while 6xp.. is a contravariant vector,
must be a covariant vector, which will be denoted. by 8p..rP, with
the identification
It
(1.33)
For some arbitrary vector Ap.. I
(1.34)
The wave operator 0 is a scalar operator I
_l&"
o = c2 &1 2
2_
 a~{)J' .
(1.35)
The momentum 4vector of a particle is given by
(1.36)
where E is the energy of the particle. The invariant scalar product
~P~ is denoted hy
(1.37)
where m is the restmass of the particle, which we will call simply
mass for the sake of brevity.
Exercise 1.7 Write Maxwell's equa.tions and the LorenbjoTce!O'rmula in 4'UectoT nota.tion. Get the signs right!
1.4
Space and time in relativistic quantum
theory
In nonrelativistic quantum mechanics, the status of time and of the
spatial ccrordinates are very different, as we mentioned earlier. In a
relativistic theory, they must be brought to par before proceeding.
This is achieved by treating space on the same footing as time, i.e.,
by treating the spatial coordinates as parameters as well. Thus, the
quantities that we can talk about in relativistic quantum theories are
functions of both space and time, Le., of spacetime. Such functions
1.5. Natural units
are called fields even in classical physics. For example, in classical
physics, when we talk about the electromagnetic field, we talk about
the electric field and the magnetic field which are functions of the
spatial coordinates and time. Similarly, in Quantum Field Theory,
we will talk about various kinds of fields, including the electromagnetic field. In classical or quantum mechanics, we could talk about
objects which were functions of time alone. But the necessity to talk
about space and time on the same footing automatically requires
that we talk about fields.
It is important to realize that the uncertainty relations, which
form the basis of a quantum theory, should be interpreted differently
for this purpose. In nonrelativistic quantum mechanics, the relation
(1.38)
is interpreted. in terms of the uncertainties in measurements of the
ccrordinate x and the corresponding momentum pz. On the other
hand, the timeenergy uncertainty relation,
(1.39)
is interpreted by saying that if one tries to make a measurement of
energy to an accuracy ~E, the measurement process should take a
time ~t which is related to ~E by the above relation. Now that x
is also a parameter, the interpretation of Eq. (1.38) must also follow
the line of that of Eq. (1.39). In other words, we should understand
that the measurement of momentum to an accuracy ~Px requires
that the measurement be performed in a region whose spatial extent
is larger than ~x given by Eq. (1.38).
1.5
Natural units
Elementary courses in physics start with three independent units,
those of length , time and mass. As one proceeds, one introduces
extra units, e.g., those of electric charge, temperature etc.
In discussions on Quantum Field Theory, it is customary to use
a system of units in which there is only one fundamental unit, which
we can take to be the unit of mass. The units of length and time are
defined by declaring that, in this system of units,
1i=I,
c=l.
(1.40)
Chapter 1. Preliminaries
10
Table 1.1: Dimensions of various physical quantities in natural units. To go
from energy units (MeV or GeV) to conventional units, we need to multiply by
the conversion factor. For electric charge, 'conventional' will mean Heaviside.
Lorentz unit for us.
Quantity
Mass
Length
Time
Energy
Momentum
Force
Action
Electric charge
Dimension
Conversion factor
[M]
l/e 2
nc
n
IMt'
[M]'
[M]
[M]
[M]'
l/e
l/ne
[M]D
[M]D
v'nC
The resulting units are called natural units, which will be used in the
rest of this book. The dimensions of various quantities in these units
are presented in Table 1.1. We have given the conversion factors for
quantities expressed in energy units, which is universally preferred
in calculations. Readers should note that most other texts give conversion factors for mass units.
The electric charge is defined in such a way that Coulomb's law
about the force F between two charges q, and q, takes the form
F = q,q, r.
(1.41)
47Tr 3
Thus electric charge is dimensionless, as ~hown In Table 1.1. The
unit of electric charge will be taken to be the charge of the proton, e.
The finestructure constant 0:: is related to this unit by the following
relation:
e > O.
(1.42)
Exercise 1.8 In natural units, the inllerse lifetime of the muon is
gillen b'y
1
T
C'F m 5
= 192"." '
(1.43)
where m is the muon mass, 106 MeV. Whut is the dimension of GF
in natural units'? Put in the factors of rt und c so that the equation
I
I
1.5. Natural units
11
001\ be interpreted in con:\le.nlional. units o.s well.
From th.is, jind
the. lifetime in seconds if G F = 1.166 X 10 11 in MeV units. [Hint:
Remember that MeV is a. unit oj ene'r9ll.1
Chapter 2
Classical Field Theory
Before getting into the question of quantization, we discuss in this
chapter some of the basic techniques involved in the covariant formulation of classical field theories.
2.1
2.1.1
A quick review of particle mechanics
Action principle and EulerLagrange equations
All theories of classical physics can be described by starting from
a wonderful principle called the principle of least action. Suppose
we are dealing with a nonrelativistic system of particles and rigid
bodies. We identify a set of coordinates q,.(t) which, together with
their time derivatives 4,(t), adequately describe the configuration
and the evolution of the system. If we are clever enough we can
use our knowledge of the symmetries of the system to reduce this
set to the smallest number of independent coordinates. On the other
hand, in almost all the interesting cases it is useful not to make the
reduction but instead choose a set of coordillates which have simple
interrelations, making explicit the sym!netric:; of the system.
We now define two math~matical objects mlled the Lagrangian
L and the action .fl1, related by
1
t2
.fl1 =
I,,
dt L(q,(t),4,(t).t).
(2.1)
The Lagrangian L is a function of the coordinates q, and the velocities qrl and tl and t2 are the initial and final times between which
we are studying the evolution of the system. If there is a source or
12
2.1. A quick review of particle mechanics
13
sink of energy in the system under consideration, the Lagrangian can
also depend explicitly on time.
The principle of least action then states that, among all trajectories that join q,(tIl to q,(t2), the system follows the one for which
.PI is stationary.
To see what this implies, consider infinitesimal variations around
a trajectory defined by q,(t), 4,(t):
q,(t)
q,(t)
+ oq,(t) ,
4,(t)
4,(t)
:t
(2.2)
oq,(t).
This introduces the following variation on the action defined in Eq.
(2.1):
(2.3)
where we have used the summation convention for the index T. We
will assume that the system is conservative, so the Lagrangian does
not have any explicit timedependence. Integrating by parts, we can
then write
o.vl =
1.'2
tl
d 8'
8L] oq,
dt [8L
8  d
qr
t qr
8L
1'2 .
+ 8,oq,
qr
(2.4)
tl
Since we are looking at paths connecting given initial and final con
figurations q,(tIl and q,(t2), we shall not vary the end points. So we
set
(2.5)
and the last term of Eq. (2.4) vanishes. The principle of least action
states that, near the classical path, the variation of the action should
be zero. Since this is true for any arbitrary variation of the path,
Le., for arbitrary oq,(t), o.vl = 0 implies that
d (8L)
8L
dt 84, = 8q, .
(2.6)
These are the equations of motion, also known as the EulerLagrange
equations, corresponding to the coordinates (}r.
14
Chapter 2. Classical Field Theory
For a particle of mass m moving in a timeilldependent potential
V(x), a convenient choice for the Lagrangian is
L =
~lTlX2 
V(x),
(2.7)
and the EulerLagrange equation as derived from this Lagrangian is
(2.8)
:tlTlX = V'V,
as expected from Newton's second law.
2.1.2
Hamiltonian formalism and Poisson brackets
The velocityindependent part of the Lagrangian can be thought of
as generalized potential energy. So we can make the EulerLagrange
equations look like Newton's second law by defining generalized me>menta 'conjugate to q/ as
p, =
"a
uqr
L(q, q).
(2.9)
There will be one such equation for each value of the index r. Since
the Lagrangian is a function of the positions and the velocities, the
right hand sides of these equations are in general functions of the
q's and the (j's. Let us assume that these equations are invertible,
Le., the q's can be solved in terms of the positions and the momenta.
Then the Hamiltonian is defined by the Legendre transformation
H(p,q) = Prqr(p,q)  L(q,q(p,q)).
(2.10)
Indeed, by differentiation, we find
dH = q,dp,.
aL. )
+ p,dq,..  (aL
ndq,. + oodq,.
uqr
(2.11 )
U{Jr
The terms with dqr cancel each other owing to the definition of the
momentum in Eq. (2.9). The term with dq,. can be simplified by the
use of the EulerLagrange equation and Eq. (2.9), so that we can
finally write
dH = qrdpr  p,.dq,. ,
(2.12)
2.2. EulerLagrange equations in field theory
15
confirming the fact that the Hamiltonian is indeed a function of the
positions and the momenta. Moreover it directly leads to Hamilton's
equations of motion:
I
aH
Pr=aqr
(2.13)
The Poisson bracket of any two dynamical variables is defined in
the following way:
[II. hlp
alI ah _ af, ah.
aqr apr apr oqr
(2.14)
Using this definition, it is then trivial to check that
(2.15)
and that the Hamilton'5 equations can be rewritten as
qr = [qro H]p. Pr = jpro H]p
More generally, if
f is any dynamical variable. we can write
. df
f= dt
where ~ occurs if
2.2
2.2.1
(2.16)
af
at +lJ,HJp.
(2.17)
f has an explicit timedependence.
EulerLagrange equations in field theory
Action functional and Lagrangian
Going over to a field theory is trivial. Let us consider our system to
be a field or a set of fields. A field is essentially a set of numbers at
eacb point in spacetime. We still want to describe this system by
an action, and we still want to write the action as the time integral
of a Lagrangian. But time is not the only independent variable in
this case. We have to incorporate the information from every point
in space into our action as well. We can do this by writing the
Lagrangian as a spatial integral of some function of the fields. This
has the added advantage of putting space and time on the same
footing in the integral. The integrand has then the dimensions of
Lagrangian density. However, in discussions of field theory, this is
16
Chapter 2. Classical Field Theory
always called the Lagrangian, and the word 'density' is implied for
the sake of brevity. We will always use the word Lagmngian in this
sense from now on. However, we will denote it by a different symbol,
viz, .!'. If, at any point, we need to use the word Lagrangian in the
sense used in particle mechanics, we will call it the total Lagrangian
and will denote it by L.
The Lagrangian should depend on the fields, which we denote
generically by <j>A Here, different values of the index A can denote
completely independent fields, or the different members of a set of
fields which are related by some internal symmetry, or maybe the
components of a field which transforms nontrivially under Lorentz
transformations, e.g., like a vector. In any case, the field or fields
denoted by <j>A take the role of the coordinates qr of Eq. (2.1).
What then replaces the velocities tIr of Eq. (2.1)? These should
be the derivatives of the fields. However, we cannot have only the
time derivative in this case, since spatial coordinates are independent
parameters as well. We therefore should expect the Lagrangian to
depend on the derivatives 8/1 <I> A .
And finally, since we are interested about fundamental fields, we
should not expect any source or sink of energy or momentum in the
system. Thus, the Lagrangian should not depend explicitly on the
spacetime coordinates. So finally we can write the action in the
form
PI =
In
d'x.Y (<j>A(x),
a~<j>A(x))
(2.18)
where the region of integration n is the region of spacetime we are
interested ill. For any physical experiment this region extends far
enough away from the equipment and far enough into the past and
future of the actual experiment so as to make the effect of all events
outside that region negligible on the experiment. Usually n is taken
to be the entire spacetime for convenience.
A few things are worth noticing at this point. The action must be
Poincare invariant in a covariant theory. which means that it should
be invariant under Lorentz transformations as well as spacetime
translations. Since the spacetime volume element d 4 x is Poincareinvariant by itself, the Lagrangian as defined in Eq. (2.18) must also
be invariant.
The Lagrangian density .Y is a function of x through its dependence on the fields <j>A and their derivatives. The action on the other
2.2. EulerLagrange equations in field theory
17
hand is a number, i.e., it is a rule that associates a real number with a
given configuration of the fields. If the field configuration is changed
the number also changes in general. Such objects are called functionalso The action is a functional of the fields. The total Lagrangian L
is a function of t but a functional of the fields at a given t.
2.2.2
EulerLagrange equations
Given the Lagrangian, the classical equations of motion can be derived from the principle of least action. This states that the system
evolves through field configurations which keep the action PI stationary against small variations of the fields q,A which vanish on the
boundary of n.
Consider therefore the variations
<t>A(X) ~ q,A{x)
+ 6q,A{x),
a~<t>A{x) ~ a~<t>A{x)
+ a,,6q,A(x),
(2.19)
such that 6<t>A vanishes on the boundary, which we denote by an.
Using the functional dependence of the action denoted by Eq. (2.18),
we can write
(2.20)
We now invoke Gauss theorem, which, for 4 dimensions, reads
inr d"x aIt F~ =
r dS
Jan
FI'
J1.
'
(2.21)
where F~ is any wellbehaved vector field, dS" is the outward pointing volume element on the boundary an. Using this, we can write
the last term in Eq. (2.20) as an integral over an. However, the integrand now contains 64>A, which vanishes everywhere on af!. Thus,
the last term in Eq. (2.20) vanishes.
18
Chapter 2. Classical Field Theory
As for the other terms, we notice that they must vanish for arbitrary variations of the fields. This can be true only if the expression
multiplying o4>A vanishes, which implies
(2.22)
These equations, one for eoch 4>A, are called the EulerLagrange equations for the system.
o Exercise 2.1 The Lagrangian (density) of an electromagnetic jield
interQcting 'With a c'u:rTent jJl is given by
~F"VFJlll }' /A"
.U)z__4
',
(2.23)
where F'lii = 8j.iA v  8 v AJJ> Treating the All's as the fields, find
the EulerLagro.nge equations and show that they give the inhomo
gcnoous Maxwell's equations.
o Exercise 2.2 For a real scalur field <P, the Lagrangian is !L' =
!(o")(o")  !m''  V(). Find Ute EulerLo.gnmge equo.tions.
o Exercise 2.3 Do the same Jor a complex scalar field <P with Lugro.ngian!L' ~ (o")t(o")  m't  V(<p' ).
It should be realized that the inverse problem cannot be solved.
In other words, given the equation of motion, it is not possible to construct a unique Lagrangian. For example, consider two Lagrangians
.!L' and .!L" which differ by the divergence of some function of the
fields:
(2.24)
When the fields are varied infinitesimally by o4>A, the changes in octions defined from these two Lagrangian. would differ by the amount
(2.25)
Using Gauss theorem of Eq. (2.21), this can be rewritten as an integral over the boundary &0. But since o4>A vanishes on the boundary,
we find that 0$" = aNI, which means that the equations of motion
obtained from .!L" and .!L' would be identical.
2.3. Hamiltonian formalism
2.3
19
Hamiltonian formalism
The momenta canonical to the fields cl>A are defined in analogy to
systems with finite number of degrees of freedom:
(2.26)
Here ciA is the timederivatiye of 4>A. This is a partial derivative,
since I>A depends also on the spatial coordinates. Moreover I L is
a functional, so that the derivative indicated in Eq. (2.26) is really
the derivative of a functional with respect to a function, Le., a functional derivative. We will avoid using the partial derivative sign to
remember this fact and will use the symbol J instead.
In order to perform the functional differentiation in Eq. (2.26),
we need to know the derivatives, with respect to cl> 's, of the fields
cl>1 as well as of 4>A and '17 cl>A, since these are the building blocks of
L. We take the cue from particle mechanics, where the generalized
coordinates qi and their corresponding velocities
qi
are independent
variables in the Lagrangian formalism. In particular, oq;/oqj = Jij ,
whereas oq;/oqj = O. For fields, we replace the discrete index by the
space coordinate x and the index A which characterize the field. So
the fundamental functional derivatives are given by
(2.27)
whereas the functional derivatives of the ficlds and their spatial
derivatives with respect to 4>A vanish identically. In Eq. (2.27), notice that the functional derivative of 4> with respect to itself is not
dimensionless, but has the dimensions of an inverse volume.
We now introduce the Hamiltonian of the system, which also
should be understood to really mean the Hamiltonian density and
will be denoted by the symbol . The volume integral of , which
is usually called the Hamiltonian in the case of particle dynamics, will
be called total Hamiltonian and denoted by H. The Hamiltonian
can be defined in terms of the canonical momenta in the following
way:
(2.28)
Chapter 2. Classical Field Theory
20
Notice that the Lagrangian is a function of the fields and their partial derivatives with respect to all spacetime coordinates. In the
Hamiltonian, the field derivatives with respect to time have been replaced by the canonical momenta. However, the Hamiltonian can
still be a function of the spatial derivatives. The total Hamiltonian
is a functional in the same sense as the total Lagrangian.
o Exercise 2.4 TT'eo.ting the
AI"8
as the fields, find the co.nonico.l
momenta Q.ssocinted with them, u.sing the Lo.grnngio.n of Eq. (2.23).
Ca.n these equations, expressing the momenta in teT1l\S oj AJ."s and
their deri'vatives, be inuerled to 80lue the AJ.I's'? II not, can 1I0U think.
of Q Teason wh'll'?
Exercise 2.5 There is some freedom in defining the electroma.gnetic 'Potentials AJ.'. Usin.9 this freedom, we can make AO = O. This
is an example oj gauge fixing. In this gauge, treating the components of A as the independent fields, find out the momenta. Can
these equations be inuerted to soLve the Ai's, wheT'e the index i runs
oueT' spatial components onlll'?
The Poisson bracket of any two functionals F I and F2 can be
defined as
[F"F2]p =
3 (JF'
JF2
JF,
JF2 )
d X J<I>A(t,x) JITA(t,x)  JIIA(t,x) J<I>A(t,x)
(2.29)
These derivatives can be reduced to the following fundamental ones
which are similar to Eq. (2.27):
(2.30)
A 3
(2.31)
J<I>A(t,x) <I> (t,y)=JAJ (xy),
JITA(t,x) IIa(t,y) = JaJ (x  y),
whereas the functional derivatives of the fields with respect to the
momenta, as well as vice versa, vanish. Thus, for example,
A]
[<I> (t,x),IIa(t,y) p =
d z
d3 z
(J<I>A(t,X) JITa(t,y)
J<I>C(t,z) JITc(t,z)
_ J<I>A(t,x) JIIa(t,y))
JITc(t,z) J<I>C(t,z)
(J~J3(X  z)J~J3(y  z) 
= J~J3(X  y).
0)
(2.32)
2.4. Noether's theorem
21
Exercise 2.6 Find the. Hamillon\Q:n faT the complex srotnT field
whose Lagrangian has been gi:uen in Ex. 2.3 (p 18).
Exercise 2.7 Con.sider 0. rodiation field in 0. onedimensl.onQ,l box
equi'Uo.lenUy displa.cements on a. string of Length I with jUed ends.
(This wo.s the "'11 first attempt at fieLd theo'1l.)
01'
L =
l'
dx [ (::)' _
c' (:~) ']
(2.33)
Let 'U.s e:tpres8 the field u in a. Fourier series:
~
,,(x,t) = Lq.(t)sin
(W:X) ,
W,
= bee/l.
(2.34)
k;;:l
0.) CalcuLate the 'momentum' Pk, canonically conjugate to qk.
b) Write down th.e Ha.miltonia.n as Q function oj q's Qnd p's.
c) Take qk and Pk to be operotoTS with Iqk,Pi] = ihtS kj . Define
opera.loTS ak, a k bll
qk =
J n [a e'wo +
2lwk
<
at~.. iWO ']
(2.35)
at
Find th.e commutators oj ak,
'mith themsdlles Qnd one an
other.
d) CaLculate the Hamiltonian in tenns oJ a's and at's.
2.4
Noether's theorem
Some Lagrangians are known fwm basic physics. But generally a
system may not be as well understood, and we have to construct a
Lagrangian that we expect to describe the system. In such constructions, it is often useful to rely on symmetry, i.e., transformations
which leave the system invariant. It so happens that associated with
each symmetry of a system is a conserved quantity. Thus if we know
some conserved quantities of a system we can work backwards to find
the symmetries of the system and from there make a guess at the
form of a convenient Lagrangian. The result which relates symmetries to conserved quantities is known as Noether's theorem after its
discoverer, while the conserved quantities are called Noether charges
and Noether currents.
Let us consider infinitesimal transformations of the c~ordinate
system
x ll
I>
x'Po
= x ll + dx P
(2.36)
22
Chapter 2. Classical Fjeld Theory
under which the fields transform as
(2.37)
The change in the action resulting from these transformations is given
by
oJil
= in,d"x' ~(4)'A(X'),a;.<I>,A(X'))
in
d 4x
~ (4)A(X), 8"4>A(X))
(2.38)
where Of is the transform of n under the coordinate change. If
= 0, we say that the theory is invariant under the transformation.
In the first integral x' is a dummy variable, so we can replace it
os;(
by x and wri te
oJil =
=
in,
in
d"x
~ (4)'A(x), a;. 4>'A(x))
d4x
[~( 4>'A(x), a;.4>'A(x))
Jrrn
in
d'x
~ (4)A(x), 8"4>A(x))
~ (4)A(x), 8"4>A(x))]
d"x ~ (4),A(x),a;.4>,A(x))
(2.39)
The last term is an integral over the infinitesimal volume fl'  fl, so
we can replace it by an integral over the boundary an,
=
=
Jon
in
dS"
oX''~(4)A,8"4>A)
d"x 8" (ox" ~ (4)A, 8"4>A)). (2.40)
Here we have suppressed the spacetime index x as we no longer need
to distinguish between x and x'. To get the first equation we have
replaced 4>,A by 4>A as the difference is of higher order in Ox". In the
second equation we have used Gauss theorem as shown in Eq. (2.21).
At this point, it is convenient to define the variation for fixed X,
which gives the change in the functional forms only. For any function
f(x) whose functional form changes to j'(x), we can write
bf(x)
=j'(x)  f(x)
= [j'(x')  f(xlJ  1J'(x')  J'(x)J
= of (x)  8"f(x)ox" ,
(2.41)
2.4. Noether's theorem
23
where we have again ignored terms of higher order in 6x"" in writing
the last equality. Note that this variation J commutes with the partial derivatives op. as the variation is taken at the same spacetime
point.
This property of J allows us to simplify the integrals of Eq. (2.39)
since we can write
(2.42)
using EulerLagrange equation, Eq. (2.22), to get the last equality.
Putting this in Eq. (2.39) and using Eq. (2.40), we obtain
6d
in d"
= in d"
=
a~ [ a( :~A) J<I>A + 2'6x~]
a~ [ a( :~A) 6<I>A  T~v 6x v]
(2.43)
where we have used Eq. (2.41) again to bring back 6<I>A, and defined
a2' ,,"... A
T~v = a(a~<I>A)u",

g~V
'"
..L,
(2.44)
which is called the stressenergy tensor. We can now define a current
(2.45)
and write
(2.46)
This relation holds for arbitrary variations of the fields and coordinates provided the equations of motion are satisfied.
Even though the right hand side of Eq. (2.46) is the integral of a
total divergence, it does not necessarily vanish, because the current
need not vanish on the boundary. If however there is a symmetry,
24
Chapter 2. Classical Field Theory
6Jif vanishes for arbitrary n even without the use of the equations
of motion. In that case, we obtain a conservation law
(2.47)
when the equations of motion are satisfied. This result is called
Noether's theorem, and the current J" in this case is called the
Noether current. Note that J" is defined only up to a constant factor.
It may so happen that under the symmetry transformations of
Eqs. (2.36) and (2.37), the action does not remain invariant but
instead changes by the integral of a total divergence,
(2.48)
Then the expression for the corresponding conserved current will be
the same as in Eq. (2.45), plus V".
It is easy to see that Eq. (2.47) implies the conservation of a
charge, called Noether charge. It is defined by an integral over all
space,
(2.49)
provided the current vanishes sufficiently rapidly at spatial infinity.
This is because
(2.50)
The last expression can be converted into a surface integral on the
spatial surface at infinity, on which it should vanish. Thus,
dQ =0
dt
'
(2.51)
i.e., the charge Q is conserved by the dynamics of the system. In
what follows, we will find the specific form of the conserved current
for various kinds of symmetries.
Spacetime translations
The action constructed from relativistic fields is invariant under
Lorentz transformations as well as translations of the coordinates.
2.4. Noether's theorem
25
Let us look at translations first. Suppose we make small changes in
the spacetime coordinates, defined by
(2.52)
where a~ is constant, i.e., independent of X~. The fields do not change
at any point, so
(2.53)
Since Eq. (2.47) is true for arbitrary
a~T~v
a~,
we conclude that
= 0
(2.54)
This equation implies the conservation of the quantity
p~ =
d3 x
T'~ ,
(2.55)
which is the 4momentum of the field. Using the definition of the
stressenergy tensor from Eq. (2.44), it "is easy to see that pO is the
total Hamiltonian defined through Eq. (2.28).
Exercise 2.8 Find the stresseneT91l tensor JOT the following fields:
a) rool scalar, as in "Ex. 2.2 (p 18);
b) complex 'calar, as in Ex. 2.3 (p 18);
c) electromagnetic j\eld, a.s in Eq. (2.23) with f
O.
Lorentz transformations
In this case the infinitesimal transformations are
(2.56)
where the w~/,I are independent of x, and they are also antisymmetric
because x~x~ is invariant. The field variations under this kind of
operation are written in terms of a spinmatrix L by
(2.57)
The factor of appears because otherwise each independent component of w is counted twice because of antisymmetry. Putting these
in Eq. (2.45), we obtain
a.:f 1WAp ("Ap)A>.B
a~ a(a~~A)
2 L. B'" [
T~A WApX p]
 0

(2.58)
Chapter 2. Classical Field Theory
26
Pulling out the w~p which are constants, and also using the antisymmetry property of them, we can write this as
& M"~P  0
"
(2.59)
,
where
(2.60)
The conserved charges for this case are
J~p =
The space components of this,
d 3x MO~p
Jij I
(2.61)
are related to angular momentum
by
J k 
~<ijkJ ..
2
(2.62)
IJ'
The quantities JOi correspond to Lorentz boosts.
Internal symmetries
Internal symmetries are those which relate different fields at the same
spacetime point. In this case ox" = 0, and let us write the infinites.
imal transformations of the fields as
r = 1 ... p
"
(2.63)
which leaves the action invariant. Here the O<r are infinitesimal parameters independent of spacetime, and f:'(x) are specified functions of the fields 4>A and their derivatives. The index r is not
summed over, it indicates the type of symmetry. In other words,
there may be several independent symmetries in a system. We can
treat them separately by defining the conserved current for the roth
symmetry as
"
&.!L' o4>A
r
J = &(&,,4>A) O<r .
Exercise 2.9 Consider the complex scular
The La.gro.ngiclT\ Temoins inua:rio:nt under </>
(2.64)
fieldoJ
+
Ex. 2.3 (p 18).
e 1q 4>, q;,t + e ,q8 </>t.
2.4. Noether's theorem
27
a.) Use Noether'g theorem to Jind the colTesponding conserued current p'. Verifll 0s.j" = O.
b) Ca.lcula.te the conserued N oether turn.',nt for the, sa.me transformation when the electroma.gnetic field is also included, and the
combined La.gra.ngian is
!L'
= _~F"vFvV + [(a"
 iqA,,)tll(a,.
+ iqA v )]
_m't  V(t)
(2.65)
c) The EuleTLa.gT'unge equations for AI' should gi've the Maxwell
equations: OIJ,FJlV = j"'. ShoUl that the 'current' ~ppea.ring in
this equation is the same as the CUTTent obtained in part (b)
abolle. INote: Since the conserued current inuolues A~. the
Lagrangian cannot be written like Eq. (2.23) for this case. I
o Exercise 2.10 Consider the Lagrangian oj a. real sca.lar field gi'lJen
in Ex. 2.2 (p 18), with m = 0 and V() = >.'. Verijy that the "ction
has the 8'Ymmetrll
bx,
(2.66)
Find the conserued CU1Tent corresponding to this 8'ymmetry.
Exercise 2.11 Suppose the action of a certain fl.eld theory is in'llarlant under spacetime translations as well us dilatations
xtbx,
Show that the stTessenergy tensor in this cuse
T " 0 .
"
(2.67)
'LS
traceless, i.e.,
Chapter 3
Quantization of scalar fields
Having outlined the necessary aspects of classical field theory in
Ch. 2, we now turn to quantization of fields. In this chapter, we
take up the quantization of the simplest kind of field which is described classically by one Lorentz invariant quantity at each spacetime point. Such fields are called scalar field". From n0W on, we will
employ the natural units outlined in 1.5.
3.1
Equation of motion
For free particles of mass m, the 4momentum satisfies the relation
pI'p~  m 2 = O.
(3.1)
In quantum theory, momentum is treated as an operator whose coordinate space representation is given by
PIA + ia~
(3.2)
which operates on the wavefunctions. Thus, if we tried to represent the particles by a single wavefunction >(x), we would simply
substitute Eq. (3.2) into Eq. (3.1) and obtain
(0 + m 2 ) >(x) = O.
(3.3)
This is called the KleinGomon equation after the people who suggested it.
It looks simple enough, but there is a problem. The energy eigenvalues satisfy
(3.4)
28
3.2. The field and its canonical quantization
29
Notice that energy can be negative, and its magnitude can be arbitrarily large since the magnitude of p, which we denote by p, is
not bounded. Thus, the system will have no ground state. The system will collapse into larger and larger negative values of energy and
will be totally unstable. Hence, a wavefunction interpretation of the
KleinGordon equation is not possible. We will see that this problem
will be evaded once we interpret (x) as a quantum field, with a
proper prescription for quantization.
3.2
The field and its canonical quantization
So we want to interpret (x) as a field. The classical equation of
motion of this field will be given by Eq. (3.3). According to Ex. 2.2
(p 18), it can be derived from the Lagrangian
.!/ =
i(8")(a~) 
i m2 2
(3.5)
D Exercise 3.1 Use the dejinition oj the Lagrangian in Eq. (2.18) to
find the mass dimensions of .!L' in natuTul units, assuming the spacetime is Ndimensional. Use this to show that the mass dimensions
oj the jield is (N  2)(2.
4>
o Exercise 3:.2 From the Lagrangian in Eq. (3.5), shoUJ that the momentum canonical to <P is
n(x) = 4>(x) ,
(3.6)
and that the Hamiltonia.n is gi1Jen b1:l
' =
~(n' + (V4' + m'4>').
(3.7)
In particle mechanics, the Poisson brackets between the coordinates and the canonical momenta were derived in Eq. (2.15).
In order to go over to Quantum Mechanics, one can just use the
replacement
(38)
whereby one obtains the commutation relation of Eq. (1.6). Similarly,
in the case of the fields, once we obtained the canonical momentum
II, we can use the same prescription to replace Eq. (2.32) by
1(t,x),II(t,y)L = iJ 3 (x  y)
(3.9)
30
Chapter 3. Qual'tization of scalar fields
This is called the canonical commutation relation, which implies that
and II can no more be considered as classical fields. They should
now be treated as operators. The quantization procedure is a direct
consequence of this commutation relation, as we will gradually see.
It might be a little dissatisfying to see that although we are trying to build up a relativistically covariant theory, the commutation
relation of Eq. (3.9) does not seem to be a covariant equation. It
seems that it somehow gives a preferential treatment to the time
component and therefore cannot be valid in a general frame of reference. However, appearances are sometimes deceptive, and Eq. (3.9)
can really be valid in any frame. The commutator is nonzero only if
we consider the field and the canonical momentum II at the same
spacetime point. If two spacetime points coincide in one frame,
they will coincide in any other frame. Thus, at least for this part,
there is no difference if we go to any other frame. Next, consider
the Lorentz transformation properties of the commutator bracket on
the left hand side. Since is a scalar and II = if, = 8o, the nontrivial Lorentz transformation property of the left hand side comes
only from 80. In other words, the left hand side transforms like the
time component of a 4vector. As for the right hand side, notice that
d4 x 53 (x  y) = dt when integrated over a spatial region containing
the point y. By definition, dt transforms like the time component
of a 4vector. Since d"x is a scalar, this means that 53 (x  y) also
transforms like the time component of a 4vector, and therefore Eq.
(3.9) is covariant.
3.3
Fourier decomposition of the field
A classical field, being a function of spacetime coordinates, can be
Fourier decomposed. For the free field, we can write
(3.10)
Here the factor outside the integral sign is just a normalizing factor, whose convenience will be seen later. Inside the integral sign,
apart from the usual Fourier components A(p), we have put a factor
3.3. Fourier decomposition of the field
31
5(p'  m') to ensure that satisfies Eq. (3.3),
(0 + m') (x) =
(2:)'12 J d"p (p'
+ m')5(p' 
m')A(p)e ip .X
= O.
(3.11)
The 5function, in association with the factor (p' +m'), guarantees
that the integral will vanish.
Let us now assume that rI>{x) is a hermitian operator, i.eo, classically speaking the value of the field at any point is a real number. The
generalization to complex scalar fields will be taken up in 3.6. The
hermiticity condition, (x) = t(x), then implies that the Fourier
components satisfy the condition
A(p)
= At(p).
We now introduce the step function
z by
e(z)=
I
O!
e,
(3.12)
defined for any real variable
if z > 0,
ifz=O,
if z < O.
(3.13)
Obviously, e(po) + e( _pO) = I. Inserting this factor in Eq. (3.10)
and changing the sign of the integrated variable p in the second term,
we can write
(x) =
1 Jd"P 5(p' _ m')e(po) (A(p)e'P'x
(2,,) 'j,
+ At(p)e'P'x)
(3.14)
This form, then, involves only the states of positive energy.
In fact, the Fourier decomposition can be written in a somewhat
simplified form if we eliminate pO from Eq. (3.14). We will use the
result that given a function f(z) which vanishes at the points Zn for
n=1,2, ... ,
5 ( f(z) )
"" 5(z  zn)
= ~ Id! /dzl,=,,, '
(3.15)
provided the derivatives themselves do not vanish at the points
Thus we can write
5(p'  m')
= 5 ((pO)' =
21~OI
Zn.
En
[5(pO  E p )
+ 5(l + E p )]
(3.16)
Chapter 3. Quantization of scalar fields
32
where from now
eigenvalue:
Oll,
we reserve the symbol E p for the positive energy
Ep
= +VP 2 +m 2
(3.17)
When Eq. (3.16) is substituted into Eq. (3.14), the second 6function
does not contribute to the integral since this is zero everywhere in
the region where 8(po) i' o. The other 6function contributes to the
integral and gives
(x) =
d3p
( a(p)e'.px
V(21r)32Ep
.)
+ at (p)e'px
(3.18)
where
(3.19)
and we have introduced the notation
a(p)
A(p)
(3.20)
JIE;.
This is the form in which the Fourier decomposition will be most
useful for us. The canonical momentum can also be written in a
similar form involving integration over the 3momentum only:
II(x)
= (x) =
J iV 2(~;)3
3
dp
(_a(p)e iPX
+ at(p)e iPX )
(3.21).
In the discussion about simple harmonic oscillators in L2, we
started with the operators x and p, but switched to another set of
operators, viz, a and at. The situation is quite similar here. We
started with (x) and II(x), hut now we can do everything in terms
of a(p) and at(p). Just as the commutation relation between x and
pin Eq. (1.6) led to the commutation relation of Eg. (1.7) between
a and at here also we can use the canonical commutation relation
of Eg. (3.9) to derive the commutation relation between the a's and
the at's. For this, we need to invert the Fourier transform equations
of Eqs. (3.18) and (3.21) to express a(p) and at(p) in terms of (x)
and II(x). From that, using Eg. (3.9), one can show that
1
[a(p) , at (p')
L = 6 (p 
[a(p), a(p')L
p'),
= 0,
[at (p) ' at (p')L =0
(3.22)
3.3. Fourier decomposition of the field
33
These commutation relations are reminiscent of the corresponding
equations obtained for a linear harmonic oscillator in Eq. (1.7). If
one takes a different prefador in Eq. (3.10) than the one we used, the
right hand side of the commutator of a(p) and al(p) contains some
extra factors.
We can derive Eq. (3.22) from the canonical commutation relation
of Eq. (3.9), but we leave that for Ex. 3.4 (this page). Here, let
us prove the converse. If we assume Eq. (3.22), using the Fourier
decomposition of Eqs. (3.18) and (3.21), we can write
[q,(t,x),IT(t,y)J =
2(2~)3 Jd p Jd p' j~
{[a(p),al(p')L
[al(p),a(p')L
eip.x+ip'.y 
eiP.XiP'.Y}
(3.23)
where xO = yO = t. The commutators give afunctions, and so the
integration over p' can easily be performed. Note that once this is
done, it forces p = pi, which also means E'l = Epl. In the exponents,
the time components in the dot products therefore cancel, and we
are left with
This immediately reduces to Eq. (3.9) if we recall the integral representation of the 6function of the form:
63 (x _ y) =
d p
eip.(,,y)
(2")3
Exercise 3.3 Use the commutntOi5 oj Eq. (3.22) to ShOUl tha.t
[4>(t,x),8i 4>(t,y)!_ = 0
(3.25)
(3.26)
Exercise 3.4 Write a(p) and at (p) us inuerse Fourier tra.nsjonns
of a.nd fi, a.nd deriue the commuta.tion relations oj a, at from the
canonica.l commuta.tors oj and n.
We can now derive the total Hamiltonian of the system. Let us
start with the volum.e integral of ,J'l' given in Eq. (3.7), replace q, and
IT by the expressions obtained in Eqs. (3.18) and (3.21) and perform
34
Chapter 3. Quantization of scalar fields
the integral over all space. Each of the exponential factors will give
us a momentum cS"function. If we now integrate over one of the
momenta, say pI, we get for the different terms in the Hamiltonian
the following results:
J ,flx rr 2 (x)
=J
,flp
r;
i::.
[_u(p)u(_p)e 2iEp , + u(p)at(p)
+at(p)a(p)  at(p)at(_p)e2iEp,] , (3.27)
J d3x (V.p(x))2
=J
d3p
[a(p)u(_p)e 2iEp , + a(p)at(p)
+at(p)a(p) + at (p)at(_p)e 2iEp ,] , (3.28)
m2Jd3xq,2(x) = Jd3p
[a(p)a(_p)e2iEp'+a(p)at(p)
2E.
+at(p)a(p) +at(p)at(_p)e2iEp'] . (3.29)
m
This yields
=~J
d3p E. [at (p)a(p)
+ a(p)at(p)]
(3.30)
Once again, the similarity with Eq. (1.5) is obvious. Moreover, one
can check the following relations:
[H,a(p')I_
[H,at(p')J_
= E.,a(p')
= +E.,at(p') ,
(3.31)
which are similar to the relations in Eq. (1.8). The analogy between
the simple harmonic oscillator and the field is now complete. It is
clear that we can interpret a(p) as the annihilation operator and
at(p) as the creation operator for a field quantum with momentum
p. What was the positive energy component of the classical field
now annihilates the quantum, and the negative energy component
now creates the quantum. This quantum is what we call a particle
of positive energy.
3.4
Ground state of the Hamiltonian and
normal ordering
The ground state of the field, which is called the vacuum in fieldtheoretical parlance, can now be defined in analogy with Eq. (1.12).
3.4. Ground state of the Hamiltonian and normal ordering
35
Here, it reads
a(p) 10) =
for all p.
(3.32)
We will assume that this state is normalized, Le.,
(010) = 1.
(3.33)
However, there is one problem here. which was not present in
the case of the linear harmonic oscillator. To see this, let us use the
commutation relations of Eq. (3.22) to rewrite the expression of Eq.
(3.30) in the form
H =
tPp Ep [at(p)a(p)
+ ~83(0)p]
(3.34)
which would be the analog of Eq. (1.9) for this case. The subscript
p after the 8function in this equation is just to remind us that it is
a 8function for the zero of momentum, not the zero of coordinate.
The distinction is not really relevant here, but the notation will be
useful later. What comes out from this way of writing is that, if we
consider the ground state energy, Le., the expectation value of the
total Hamiltonian in the ground state, we obtain
(3.35)
which is infinite!
The point is that, here we have one oscillator for each value of the
momentum. Thus, we have an infinite number of oscillators. Since
each oscillator has a finite ground state energy, the contributions
from all such oscillators add up to give us an infinite result for the
total ground state energy.
This, by itself, is not a catastrophe. After all, energy differences
are physical quantities, absolute values of energies are not. We can
just redefine the zero of energy such that the ground state energy
vanishes. For this, we need a consistent prescription so that we
do not run into trouble with other variables. This prescription is
called normal ordering. Stated simply, it means that whenever we
encounter a product of creation and annihilation operators, we define
a normalordered product by moving all annihilation operators to the
right of all creation operators as if the commu tators were zero. Once
Chapter 3. Quantization of scalar fields
36
we have done the rearrangement, we treat them again as operators
with the usual commutators. Using this algorithm on the expression
of Eq. (3.30), we find that the normalordered total Hamiltonian,
denoted by :H:, is given by
This expression immediately shows two things. First, for any state
11lI) ,
(1lIIH: I Ill) =
=
J
J
d3 p E p (1lIIat(p)a(p)1 Ill)
3
d p E p Ila(p) 11lI)
(3.37)
which is always nonnegative. Second, using Eq. (3.32), we obtain
(OI:H:IO) =0,
(3.38)
so that the vacuum defined earlier is really the state of lowest energy,
i.e., the ground state.
D Exercise 3.5
a.) For 0. real massiue scalar fleld, calculnte the
4momentum PIJ =
d3 x ']'O1l in terms of creo.tion and a.nnihila.tion operators. Show tha.t nonnal ordering is not required
jor P'.
b) Show that Ie/>, P"I_ = Wee/>
3.5
Fock space
So far, we have defined only the vacuum state, which is a state with
no particles. We shall need other states, in particular states with
specified particle content, when we start describing physical events.
We can define such states in analogy with excited states of oscillators.
For example, we will define a oneparticle state as
Ip)
= at (p) 10)
(3.39)
This state contains one quantum of the field 4> with momentum pi' =
(Ep , p). Such states have positive norm, since
(3.40)
3.6. Complex scalar field
37
which comf'.8 from the commutation relations and the definition of
the vacuum.
Similarly, we can define manyparticle states. If a state has N
particles with all different momenta PI, P2, ... PN, it is defined by
(3.41)
On the other hand, if we want to construct a state with n particles
of momentum P, it will be given by
Ip(n))
1 ( at(p) )" 10),
=v'n!
(3.42)
where the prefactor is needed for proper normalization.
Such multiparticle states distinguish field quantization (also
called second quantization) from singleparticle quantum mechanics
(or first quantization). The vacuum, together with single particle
states and all multiparticle states, constitute a vector space which
is called the Pock space. The creation and annihilation operators act
on this space.
D Exercise 3.6 The numbeT O'PeratoT in lhe Fock RpaCf'. can be delined as
(343)
Show that
1A",at'(k)l_ = a'(k),
1A",a(k)l_ = a(k).
(3.44)
Show that this operator corrccttll counts the number of particles in
the states given in Eqa. (3.41) and (3.42).
3.6
3.6.1
Complex scalar field
Creation and annihilation operators
The Lagrangian for a complex scalar field </>(x) is given by
.:t: = (8"</>t)(o"</ _ m 2</>t</>.
(3.45)
Let us decompose the complex scalar field into its real and imaginary
parts:
</>(x)
= .,j2(</>1 (x) + i</>2(X))
(3.46)
Chapter 3. Quantization of scalar fields
38
Then Eq. (3.45) can be rewritten as
(3.47)
which is just the free Lagrangian of two independent real scalar fields
, and 4>2. From this form, it is straightforward to calculate the
momenta conjugate to , and 2, and impose the canonical commutation relations. We can perform Fourier decomposition of both
, and 2 in the way outlined in 3.3, and define the annihilation
and creation operators corresponding to each of them. The canonical
commutation relations in this case imply
(3.48)
all other commutators being zero. To express these relations in terms
of creation and annihilation operators associated with the complex
field , let us define
1
a(p) = j2(a, (p)
which
+ W2(p)),
alp) =
~(al(p) 
at(p) =
~(at(p)  ia~(p)),
at(p) =
~(at(p) + ia~(p))
(3.49) .
ia2(p)),
mean
(3.50)
We can now start with Eq. (3.18) for both 1 and 2 and use the
definition of Eq. (3.46) to write the Fourier expansion of the field :
(x) =
3
d p
(a(p)e ip .X
)(27[)32Ep
+ at (p)e iP '
(3.51)
which also implies
(3.52)
3.6. Complex scalar field
39
Moreover, the commutation relations of Eq. (3.48) can be cast into
the form
(3.53)
whereas all other commutators vanish. Since
at and a~ create the
quanta of the fields (p, and (/>2 which are different, it is clear that there
are two different particles in the theory. Since Eq. (3.53) presents the
commutation relations in the form of Eq. (3.22), we can interpret at
and at to be the creation operators for these two particles. Of course,
we could have chosen any two orthonormal combinations of at and
a2, even at and a2 themselves, as the annihilation operators for our
two particles. There is a special reason for quantizing and t in
terms of a and a, as we shall now see.
3.6.2
Particles and antiparticles
The Lagrangian of Eq. (3.45) is invariant under the transformation
_ e"o,
t _ e"Ot,
(3.54)
where IJ does not depend on spacetime. Following the procedure
outlined in 2.4, we can calculate the Noether current for this invarianee. For infinitesimal (), we have
o = iqlJ,
ot = iqlJt .
(355)
So,
a~.
J~ = a(a~) (.q)
= iq
a~.
+ a(a~t) (.q
t
)
[(~)t  wt)]
(3.56)
The conserved charge Q is therefore given by
Q=
d3 x
JO
J
J
= q
d3 x iq [(ao)t  (aot)]
d3 p [at(p)a(p)  at(p)a(p)]
(3.57)
The first term here, remembering the definition of Eq. (3.43), is just
the number operator for the quanta created by at. Let us denote it
by N a Similarly, the second term is N. Thus we obtain
Q = q(Na
No.)
(3.58)
40
Chapter 3. Quantization of scalar fields
Notice the negative sign for the second term. If q is called the charye
associated with each quanta created by at, the first term is the total
charge in such quanta. The quanta created by ii t then possess an
opposite charge. These are called the antiparticles for the particles
which are created by at. Eq. (3.58) then tells us that the total
charge in particles and antiparticles is conserved as a consequence of
Noether's theorem.
Exercise 3.1 Write Q in terms oj a,(p),a!(p),a,(p) and a~(p) to
uerif'y that it cannot he 'Written in terms of number overotoT' of the
fields <1>, and <1>,.
Looking back at Eq. (3.51), we thus see that the operator "'(x)
can do two things. The first term describes the annihilation of a
particle, the second one the creation of an antiparticle. In either
case, if "'(x) operates on any state, it will reduce the charge of the
state by an amount q. Similarly, Eq. (3.52) implies that the operator
",t(x) can either create a particle, or annihilate an antiparticle. In
other words, if it operates on a state, the charge will increase by q.
One may wonder at this point as to how the quantity q is defined.
After all, in Eq. (3.54) where it first shows up, q is always multiplied
by B, so we can change the value of q by redefining B with a multiplicative constant. But there is nothing mysterious about this. Aft~r
all, any charge is defined only up to a multiplicative constant. For
example, take the case of electric charge. If we multiply the values
of the charges of all particles by a constant, the law of charge conservation will not be affected at all. The Maxwell's equations will also
be unaffected, since the electromagnetic fields will be multiplied by
the same constant. So, if we consider only one kind of particle in i~
lation, the value of the charge is really arbitrary. Only the ratios of
charges of different particles have some physical meaning, and these
ratios can be determined only when different kinds of particles are
present and there is some interaction among them. Such instances
will be encountered when we will discuss interactions in the later
chapters.
3.6.3
Ground state and Hamiltonian
We can now construct the states of the Hamiltonian, following the
procedure outlined in 3.4 and 3.5. The vacuum, or the ground
3.7. Propagator
41
state, is defined in this case by the relation
a(p) 10} = ii(p} 10} = 0,
for all p.
(3.59)
In other words, the vacuum is the state which contains no particles
and no antiparticles either. The state Ip} defined in Eq. (3.39) will
be a state containing one particle with energy Ep . Similarly, we can
define a state
(3.60)
which will contain one antiparticle with energy Ep . The states with
more than one particles or antiparticles can be constructed similarly.
The normalordered Hamiltonian can now be written as
(3.61 )
Exercise 3.8 Find the Hamiltonian in terms of the field t:P 'Using
the Lagrangian of Eq. (3.45). Use the Fourier decomposition of the
jield giuen in Eq. (3.51) Qnd the 'Prescription of normal ordering to
express it in the Jorm oj Eq. (3.61).
3.7
Propagator
The goal of Quantum Field Theory is to describe particle interactions. Our interest will be in c81culating crosssections, transition
probabilities, decay rates etc. For this, we need to know how parti
cles move in spacetime. We shall use Green's function techniques for
this and define a propagator, which will be useful in later chapters.
Let us start with the KleinGordon equation with a source term:
(0 + m 2 ) (x) = J(x).
(3.62)
Since the KleinGordon equation is the same whether the scalar field
is real or complex, the discussion in this section is equally applicable
for both cases.
In order to solve this equation, we first introdnce a propagator, or
Green's function, denoted by G(x  x'), which satisfies the equation
(ox +m2 ) G(x 
x') =  c5'(x  x')
(3.63)
42
Chapter 3. QUantization of scalar fields
Here the notation Ox indicates that the derivatives in the operator
have to be taken with respect to the coordinates of x, not of x'
Clearly, if this equation can be solved, we can represent the solution
of Eq. (3.62) by
4>(x) = 4>o(x) 
d"x' G(x  X')J(X ' ),
(3.64)
where 4>o(x) is any solution of the free KleinGordon equation, Eq.
(3.3). The boundary conditions are matched by choosing 4>0 appropriately.
Now, to solve for the propagator, we introduce its Fourier transform by the equation
=!
G( x _ x ')
d p
(2,,)4 e
;p.(xx')G()
p ,
(3.65)
where all the components of pi' should be treated as independent
variables, not related by the energymomentum relation of Eq. (3.1).
If we act on this equation by the operator appearing in Eq. (3.63),
we obtain
Remembering that
,j4(
x')
=!
d"p e;p(xx')
(2,,)4
'
(3.67)
and comparing with Eq. (3.63), we obtain
G(p) =
1
2
P m
1
2
(3.68)
where E p was defined in Eq. (3.17).
This expression, however, has an ambiguity. Let us put this expression into Eq. (3.65) and try to perform the integration over pO.
The integrand has poles at pO = Ep , and therefore the integration
cannot be performed over the real axis for pO. This has nothing to
do with quantum theory, since we have not used anything regarding
quantization in this section so far. The problem lies even with the
classical theory.
3.7. Propagator
43
To get around this problem, we use a prescription by Feynman
and complexify the propagator. Instead of the Green's function of
Eq. (3.68), from now on we will use
1
AF(p) = p2 m 2
1
(pO)2 _ (Ep
+ 1.E. ,
i)2 '
(3.69)
where , ' are infinitesimally small positive parameters, obviously
related to each other by,' = 2Ep ' . Of course, has to be taken to
zero at the end of calculations. For this reason, we will ignore the
distinction between ' and E. Introduction of this parameter makes
the propagator complex, and hence inherently nonclassical.
Let us now rewrite Eq. (3.69) in the form
A F (p) = _1_ [
1
2Ep pO  (Ep
i,)
,,,,~I_:;]
pO + (Ep
i,)
(3.70)
This expression can be put into Eq. (3.65), which gives
AF(x  x') =
d3p
(2.. )3
e+'p(zz')
2Ep
[(PO~p)+i' 
/+00 dpo ..p. 0(,,')
00 2..
(pO+L)i]
(3.71)
To perform the integrations over pO I we need the following result
from complex analysis:
+00
lim
_0 /
00
ei(t
de (+ IE. =
21ri8(t),
(3.72)
where the 8function was defined in Eq. (3.13). For the first term in
Eq. (3.70), we can shift to the variable pO' = pO  E p to perform the
pOintegration:
+00 dpot
/ 00 27r
ei(pOI +Ep)(tt')
,.
pO + IE
= i8(t  I').'E,(,,'}
Similarly, for the other term, we can substitute pO'
gives for this term
(3.73)
_po  E PI which
+00 dpoJ ei(pOI +Ep)(tlt)
= i8(t'  I).'E,(,'')
_po'  if
/ 00 27T
(3.74)
Chapter 3. Quantization of scalar fields
44
Combining the two t.erms then, we can write
tJ. (x  x') = i
F
d p
(21f)32Ep
lett _ t')e'Ep(tt')+'P'(""')
+e(t' 
t)eiE,(tt')+il>(,,,,'l] . (3.75)
Changing the sign of the integration variable p in the second term,
this can be written as
[e(t 
t')eip.(xx'l
+8(t' _ t)eiP'(xx')] ,
(3.76)
where now the components of the 4vector pi' are not independent,
but are related by pO = E p .
Exercise 3.9 Show that tJ.p(x  x'), given in Eq. (3.76), .ati.jie.
the equation
(0, + m 2 ) tJ.F(x  x')
=  .'(x  x').
(3.77)
It is therefoTe a. Green's Ju.nction.
This form can be easily related to the quantized fields. For the
sake of illustration, let us use the real scalar field. From Eq. (3.18),
we can write
J
J
(x) 10) =
3
d p
at (p)e'PX 10)
2
J(21f)3 E p
3
d p
e'P'x Ip)
J(21f)32Ep
(3.78)
,
where Ip) is the oneparticle state defined in Eq. (3.39). Notice that
the annihilation operator in (x) does not contribute here owing to
Eq. (3.32). On the other hand,
(Ol(x') =
d p'
e'p'x' (p'l ,
J(21f)32Ep '
(3.79)
where only the annihilation operator part of the Fourier decomposition of (x') is important. Using the normalization of Eq. (3.40), we
thus obtain
3
(0 1(x')(x)1 0) =
d p
e'P'(xx')
(3.80)
(21f)32Ep
3.7. Propagator
45
Thus, we can rewrite Eq. (3.76) in the following form using the field
operators:
itlF(X  x')
= 8(t 
t') (0 I>(x)>(x')i 0)
+8(1'  t) (0 I>(x')>(x) I0) ,
(3.81)
which can be abbreviated to
itlF(X  x') = (01&1 [>(x)>(x')] I0) ,
(3.82)
where &1 [... J implies a timeordered product, which means that the
operator with the later time must be put to the left of the operator
with the earlier time. In general, for any two scalar operators A(x)
and B(x), the timeordered product is defined as
&1 [A(x)B(x
_ { A(x)B(x')
B(x')A(x)
)J =
ift>t',
ift'>t.
(3.83)
Let us now see how the form given in Eq. (3.81) can be interpreted. First, suppose t < t', in which case only the second term
survives. From the expression in Eq. (3.78), we see that the operator
>(x) operating on the vacuum state to the right creates a particle at
time t. On the other hand, for (01 >(x'), as commented earlier, only
the annihilation operator part contributes, so this part describes the
annihilation of a particle at time t'. Thus the matrix element describes the amplitude of a particle being created at a time t, which
propagates in spacetime and is annihilated at a later time t'. Similarly, if t > t', the first term of Eq. (3.81) describes the amplitude of
a particle being created at a time t', which propagates in spacetime
and is annihilated at a later time t. This, in a sense, is the physical meaning of the propagator, which also shows why it should be
important in calculating the amplitudes of various processes involving particle interactions. After all, between interactions, the particles
just propagate in spacetime! The mathematical formulation of these
comments will be the subject of Ch. 6.
Exercise 3.10 'or the comple:>: srotar jield, ShOUl that Eq. (3.76)
can be written as
it.F(X  x') = (Olg [>(x)>t(x')]
I0)
(3.84)
a) Argue that it represents the propngation oj a. partide from t'
to t aT the propugation of an antiparticle from t to t'.
46
Chapter 3. QUaIjtization of scalar fields
b) ShOUl that the pTopagatoT ca.n o1so be 'Written
iLlp(x  x') =
<L8
(013' [tI>(x')tI>t(x)] 10)
(3.85)
Describe this in tenns oj pa.rtiete and a.ntiparticle propaga.tion.
Chapter 4
Quantization of Dirac fields
In Ch. 3, we dealt with the quantization of scalar fields. Such fields,
by definition, are invariant under Lorentz transformations. In this
chapter, we try something more complicated  fields which describe
spin ~ particles.
4.1
Dirac Hamiltonian
The problem with the oneparticle interpretation of the KleinGordon
equation comes from the fact that we encounter negative energies for
free particles. We showed how this problem is solved for scalar fields.
Dirac tried an alternative solution to this problem, one that led to
the correct description of spin ~ particles. He observed that this
problem did not arise in nonrelativistic quantum mechanics because
the Schrodinger equation was linear in the time derivative. So he
tried to construct a Hamiltonian which, in a sense, would represent
the square root of the equation
(4.1)
where H is the Hamiltonian operator and p is the operator for 3momentum. In coordinate space these operators can be written as
derivative operators, H = iBt, P = iV, as we know from their
commutation properties. Dirac wanted an operator linear in the
components of momentum which would square to p2 + m 2 . But such
an operator cannot be constructed using only numbers and functions
as coefficients. So Dirac assumed that the square root of this equation
47
48
Chapter 4. Quantization of Dirac fields
should be of the form
H
,0
=l<rp+m),
(4.2)
where
as well as ~1 are constant matrices. Since H and p are
Hermitian operators, "Yo, and "Yo must be Hermitian matrices.
Let us now square both sides of Eq. (4.2). This gives
I
H2 =
~ ['l')", lyit pipi + [')'O')'i, ')'ot pi m + (')'0) 2m2,
where we have used the fact that the operators
and introduced the anticommutator bracket:
pi
(4.3)
and p1 commute,
[A, BJ+ _ [B, AJ+ = AB + BA.
(4.4)
Since this must agree with Eq. (4.1), we obtain the following relations
for the ')'matrices:
[')'O')'i,')'Oyt
= 26 ij ,
[')'o')",')'ot = 0,
(')'0)2=1.
. (4.5)
Using the third relation, the second relation can be written as
')'O')'i')'O
+ (')'0) 2')'i = ')'0')"')'0 + ')" = O.
Multiplying on the left by
')"')'0
')'0,
we now find that
+ ')'0')"
= [')", ')'ot = O.
(4.6) .
(4.7)
We can now use this to simplify the first of the trio,
')'o')"')'0yi + ')'0yi')'o')', =
_')'iyi _ yi')'i
= _ [')",yit = U ij . (4.8)
All the equations in Eq. (4.5) can therefore be expressed in a more
compact form:
(4.9)
where g~" is the inverse metric, defined in Eq. (1.23). This is a matrix
equation, where an identity matrix is implicit in the right hand side.
4.1. Dirac Hamiltonian
49
Dirac observed that this condition can be satisfied in four dimensions
if the "'(#J's are at least 4 x 4 matrices. Despite appearances, the "IJ.l do
not transform as the components of a 4vector. They are constant
matrices. We will use them to construct vectors and tensors out of
spin ~ fields.
.
The first property to note about the "(matrices is that each of
them is traceless:
for all /1.
(4.10)
For example, Tr(y") = Trh,),I,),I) since h l )2 = I from Eq.
(4.9). Since ,,0 and 1'1 anticommute, we can write the right hand side
as Tr hi ')'0')" ). Using the cyclic property of the trace, Tr (ABC) =
Tr(CAB), this can be rewritten as Trho')',')',) = TrhO), using
the square of the matrix ')'1 once again. This shows that the trace of
1'0 vanishes. One can construct similar proofs for the "t i as well.
Let us define some new matrices constructed from the ')'matrices.
First we define
(4.11)
There arc six of these matrices in four dimensions because of the
antisymmetry in the indices. They are related to the spin angular
momentum of fields obeying the. Dirac eqnation, or particles with
halfinteger spin.
Next we define the matrix
(4.12)
where f.p.v>..p is the completely antisymmetric tensor in four dimensions. The interchange of any two indices introduces a factor of ( 1),
and n interchanges introduce a factor of (_1)n. It follows that f.p.v>..p
has only one independent component, which is usually normalized to
1. We shall choose aUf convention by setting
'0123 =
+1.
(4.13)
Some properties of the antisymmetric tensor are given in the Appendix. Note that for 'Ys, we do not make allY distinction between
the upper and the lower indices.
50
Chapter 4. Quantization of Dirac fields
Using ,,5, we can construct a set of sixteen linearly independent
matrices
(4.14)
These can be thought of as a basis spanning a sixteendimensional
space of matrices. Therefore the matrices must be at least 4 x 4.
Exercise 4.1 Show that "Y 5 is traceless, and that it 8G.tisJies
b",'y']+=o
JOT
all 1".
o Exercise 4.2
(4.15)
Show that the pToduct oj any two oj the
expressed as a linear combination oj all the r r 's.
o Exercise 4.3
r,
can be
Show that the equation
I"
Larrr=O.
(4.16)
r=l
wh.ere a r are complex. nu.mbers and f r , r = (1"",16) aTe membeTs
oj the set r dejined above in Eq. (4.14), can be satisjied if and only
i1 aT = 0 for a.ll r, i.e., the f r UTe linearly independent. [Hint:
Multiply the sum by each of r, in tUTn and take the tTau.1
,,0,
Since
as well as the products
follows that
,,0,,',
are Hermitian matrices, it
i.e., the matrices ,,' are antihermitian. We can include
a more compact relation,
,,0 and write
(4.18)
This, along with the anticommutation relation of Eq. (4.9), are the
basic properties which define the ,,matrices.
It is immediately clear that the specific form of the ,,matrices is
not unique. Suppose we have a set of matrices b"} which satisfy
both Eqs. (4.9) and (4.18). For any unitary matrix U, another set
{'ii"), defined by
(4.19)
4.2. Dirac equation
51
will also satisfy the same equations. The converse of this statement
happens to be true as well. If two sets of matrices "'I" and "I" satisfy
Eq. (4.9), it can be shown that there exists a constant 4 x 4 matrix
M such that
(4.20)
We will not prove this statement. But once it is assumed, it is easy
to see that since in addition the matrices "'I" and "I" both satisfy the
hermiticity condition of Eq. (4.18), the matrix M has to be unitary.
We will not use any explicit representation of the ')'matrices since
one representation is as good as any other. The only exception will
be made in 4.3.2, where a specific form will be assumed just for the
sake of constructing some explicit solutions.
4.2
Dirac equation
Using the Hamiltonian derived in the previous section, we can write
the equation of motion of '" as
i ~ "'(x) = "'10 (i"Y' V
+ m) ",(x).
(4.21)
This is the celebrated Dirac equation. Multiplying from the left by
"'10, this can be put into the form
(i",!"8"  m) "'(x)
= o.
(4.22)
It is convenient (and conventional) to use the Islash' notation 
for
any 4vector aJ.J I we will henceforth write
(4.23)
where "YJJ = gp.,/"t In this notation, the Dirac equation takes the
form
(~
m)"'(x) = O.
(4.24)
Since the 7"'S are 4 x 4 matrioes, it follows that ",(x) must be a
column matrix with 4 entries. Since the ')'matrices are not unique,
the solution "'(x) will also be nonunique. If ;j;(x) satisfies Eq. (4.22)
52
Chapter 4. Quantization of Dirac fields
for a different choice of the 'Ymatrices defined in Eq. (4.19), it follows
that
;[,(x)
= U,p(x).
(4.25)
The Dirac equation should be relativistically covariant. This fact
fixes the Lorentz transformation property of ,p(x). To see this explicitly, let us assume that Eq. (4.22) is satisfied in a different frame of
reference, and denote the objects in this frame with a prime. Thus,
(4.26)
where
x lIJ = AIJ. v x'" .
(4.27)
The relation between ,p'(x') and ,p(x) is assumed to be linear:'
,p'(x')
= S(A),p(x).
(4.28)
Putting this in Eq. (4.26) and using the transformation property of
a~ from Eq. (1.29), we obtain
(4.29)
Multiplying this equation from the left by S1 and comparing with
Eq. (4.22), we conclude that S(A) must satisfy the relation
(4.30)
Let us now consider an infinitesimal Lorentz transformation, given
by
(4.31 )
wherew~v
= wv~, which follows from Eq. (1.28).
Now, for infinitesimal transformations, we want S(A) to be linear in the transformation
parameters wjJ.v and reduce to the identity transformation as wjJ.v = O.
Let us then write
S(A)
= 1  ~}3~vw~v ,
(4.32)
4.2. Dirac equation
53
where fJllll are some 4 x 4 matrices, antisymmetric in their indices
/1, II. It follows that
Sl(A) = 1 + !..{3~"w~".
4
(4.33)
Putting these into Eq. (4.30) and keeping only first order terms in
w~" , we find
(4.34)
a condition that the matrices {3~" must satisfy. It is straightforward
to check that this condition is satisfied by the u~" matrices defined
earlier,
{3~" = u~" =
2 h~,'Y"1 .
(4.35)
o Exercise 4.4 Using tlte "'(matrix algebra 01 Eq. (4.9), ~erily tltat
tlte matrices def\ned in Eq. (4.35) do indeed satis/y Eq. (434).
[Hint: First sltow that lA, BCI_ = [A,BI+C  B[A,C +.1
Thus for infinitesimal transformations, the Dirac equation is c0variant if,p transforms by the !11atrix given in Eq. (4.32), with the
matrices {3~" given by Eq. (4.35). Finite Lorentz transformations
can be obtained by exponentiating Eq. (4.31). The corresponding
transformation rule for ,p is
,p'(x') = S(A)1/J(x) = exp
(~u~"w~") 1/J(x).
(4.36)
Objects defined in spacetime are classified as scalars, vectors, tensors, etc. according to their Lorentz transformation propert~es. Here
we find a new class of objects  column vectors whose components
mix among themselves according to Eq. (4.36). Such objects are
called spinors (even if they do not satisfy the Dirac equation).
The angular momentum operators J~" generate the transformations
(4.37)
corresponding to what we have called "6 in 2.4. Using the form of
the infinitesimal Lorentz transformation matrix from Eq. (4.31), we
can write
1/J'(x') = 1/J'(x)
+ w~"x"a~1/J'(x).
(4.38)
54
Chapter 4.
Quantization of Dirac fields
Using the linearized forms of Eqs. (4.36) and (4.37) for infinitesimal
lI
wlJwe obtain
I
J~" = i(x~a"  x"a~) + ~a~".
(4.39)
The first term is the familiar expression for the orbital angular momentum, but there is also a second term. This shows that the solutions of Dirac equation carries some intrinsic angular momentum, or
spm.
Exercise 4.5 Show that under Lorentz transformations 'I/J,,(Pt/J
lrunsforms as a vector and 1/Ja~lI1/J as u tensor, where
1/Jt,,/o.
INote: This is tne renson "(Ii carries a Lorentz index despite not
being a vector.J
Exercise 4.6 Suppose there exists a matrix C which ha.s the prop
erty tltnt
C  1"(p C = 
T
"tJI
j or a II 11,.
(4.40)
SItO\lJ tltnt tlte object
(4.41)
hus the sa.me Lorentz transformation properties a.s 1/1.
o Exercise 4.7 Using th.e a1\ticommutation retation oj the "(ma.trices, prove the following contraction fOTmulas:
1'1'=4,
= 2"11"
'Y)..'YIJ.'YII"Y>' = 4g jw ,
"'/>''''(1'''(>'
1'1"1"1.1' = 21p1"1,.
4.3
4.3.1
(4.42)
Plane wave solutions of Dirac equation
Positive and negative energy spinors
For a single particle interpretation, the t/J(x) appearing in Eg. (4.22)
would be a wave function. Although Dirac set forth to avoid the
negative energy solutions, the irony of the situation is that the Dirac
equation also has such solutions, just like the KleinGordon equation. To see that, let us assume that we are trying a plane wave
solution in the rest frame of a particle. The time dependence of this
4.3. Plane wave solutions of Dirac equation
55
solution would be of the form e iE'. From Eq. (4.22) we see that the
eigenvalue E should satisfy the equation
(4.43)
However, (,,(0)2 = 1, as we have seen, so the eigenvaiues of the matrix
"(0 can be + 1 or 1. Thus we obtain the solution of the eigenvalue
equation to be
E=m.
(4.44)
There will be two independent eigenvectors for each sign of the energy, as should be expected since 1/J, being a 4component object, can
have four independent solutions.
Thus both positive and negative eigenvalues will occur in a single
particle interpretation of the Dirac equation. For a general value of
the 3momentum p, let us represent solutions in the form
u,(p)e ip ."
1/J(x) ~ { v,(p)e+ ip ., ,
(4.45)
up to normalization factors, where the index s labels the two independent solutions of each kind. The positive energy usolutions correspond to a particle with momentum p and energy E p = + J p 2 + m 2 ,
whereas the negative energy vsolutions correspond to a particle with
momentum p and energy Ep =
p2 + m2
Putting Eg. (4.45) into Eq. (4.22), we can see that the spinors
satisfy the relations
_J
(p m)u,(p) = 0,
(p+m)v,(p) =0.
(4.46)
For spinors, hermitian conjugation turns out to be less useful
than what we may call Dirac conjugation, defined by
Using this definition and Eq. (4.18), it is easy to see that Eg. (4.46)
can also be written as
u,(p) (pm) = 0,
v,(p) (p+m) =
o.
(4.48)
56
Chapter 4. Quantization of Dirac fields
The normalization of the spinors has not been specified yet. Let
us now fix them with the conditions
ut(p)u,(p) = vt(p)v,(p) = 2Ep 8",
vt(p)u,( p) = ut(p)v,( p) =
o.
(4.49)
Exercise 4.8 Multiply tne uequation oj Eq. (4.46) jrom tlte lejt
bllli:y~
and its conjugate equa.tion, Eq. (4.48), by
Add the resulting equa.tions to show that
'YllU
from the right.
(4.50)
mu(p h"u(p) = pI'u(p )u(p) .
DeritJe a. similar equation faT the v8?inoTs. In particular, putting
J1. = 0, show that the normaLizat.ion equations can ulso be written in
tne jorm
u.(p)u,(p) =  v.(p)v,(p) = 2m6".
Note that this is an incorrect normalization
(4.49) "oLd. jor aLI m.
JOT'
(4.51)
=
0, while Eq.
Exercise 4.9 Use the same technique to show a. mol'. general idenlilll. where the spinol'S on the two sides coTiespond to different momentcl. This leads to
u(p'b" u(p) = _1_ u(p') I(p + p')"  ;17''" qvl u(p) ,
(4.52)
2m
wher. q = p  p'. This is colled the Gordon identity. Deriue
equation for the vspinors.
0.
similar
o Exercise 4.10 * Since u(p) is a column matrix and u(p) is
matrix, the 'Product uti must be a. 4 x 4 sQua.re matrix,
Show that, if we sum ouer spins, 'We obtain
I>,(p)u,(p) =
p + m,
L, v,(p)ii,(p) =
P
7Il.
0.
row
and so is vv.
(4.53)
(Hint: Take, for example, the first equation. Apply the matrices on
hoth sides on the fouT' basis spinol'S and show that the T'esults are the
same. If both sides act similo:rtll 01\ aU the basis spinors, thell must
act the same 'Wall on an1l spinor and thuefore must be the same
matrix.]
4.3.2
Explicit solutions in DiracPauli representation
So far, we have done everything in a representationindependent way.
As we remarked earlier, all representations are equivalent, and for
4.3. Plane wave solutions of Dirac equation
57
most purposes we will use the representationindependent properties
of the rmatrices. However, it is useful to derive the plane wave
solutions in a particular representation in order to gain some insight.
For this, we take the DiracPauli representation, which is given by
r
0=
"l. (0
(I0 I0 )
a
_u i 0
(4.54)
'
where I is the 2 x 2 unit matrix and the ai's are the Pauli matrices:
1
q=
(0 1)
10
, ( 72=
(0 i)
iO
a3 =
(1 0)
0 1
(4.55)
To find the solutions, we put the representation of the rmatrices
into Eq. (4.46). First, note that
p
m = roEp  'Y' P  m = (E  m
U'p
'
P ),
pm
(4.56)
where once again each entry of this matrix is really a 2 x 2 matrix,
as in Eq. (4.54). Thus, if we write the uspinors also as
u
_ (tP,)
=
tPb '
(4.57)
where tP, and tPb are the top two and the bottom two components
respectively, we obtain the following equations for them using Eq.
(4.46):
(Epm)tP,u'PtPb = 0
uPtP,(Ep+m)tPb = O.
(4.58)
These are two homogeneous equations with two unknowns, and s0lutions exist since the determinant of the coefficients is zero, which
can be seen by explicit calculation.
Exercise 4.11 VeriJ'y the sta.tement a.boue, tha.t the detenni:nant
of the coefiletenb of Eq. (4.58) \8 zeTo.
We can now use anyone of the equations in Eq. (4.58) to relate tP,
and tPb. Using the second one, we get
up
+ m tPt
tPb = E
(4.59)
58
Chapter 4. Quantization of Dirac fields
Thus, there can be two independent solutions since 4>, is a twocomponent column vector. Let us choose the independent solutions
for 4>, to be proportional to
The elements of x+ and x are numbers, not matrices. We can now
write down the solutions for the uspinors:
u(p) =
JE +
p
m (
CT
.x; ),
Ep+m
(4.61)
where the preflltor has been put in so that our solutions comply
with the normalization condition of Eq. (4.49).
Similarly, we can find the solutions for the vspinors, which are
(4.62)
These solutions also have been normalized llcording to Eq. (4.49).
The mismatch of subscripts on the two sides is a conventional choice,
which we will discuss again in 4.6.
o Exercise 4.12 Another 'Wo:y of finding the explicit solutions is to
solue. first in the rest frame, where pP = (m,O). In this frame, we
0011. chooa the independent solutions to be of th.e following form:
u+{O) = V2m
v+(O) = V2m
(~)
(D
u_(O) = V2m
(D '
v_ (0) = V2m (
~l )
(4.63)
It is easy to check. that these satisfy the normulization conditions
gillen aboue. Find the correctly normalized solutions faT any 3momentum by noling that (p + m)u(O) and {p  m)v (0) are solu
lio"s oj Eq. (4.46).
o Exercise 4.13 Find the matrix "15 in th.e
Hon.
DiTac~
Pauli
TepTe8enta~
4.4. Projection operators
4.4
59
Projection operators
Since we have multiple solutions, a useful concept is that of the
projection operators, which project out the appropriate solutions in
some particular situation. We discuss some such projection operators, which will be quite useful in later parts of the book.
4.4.1
Projection operators for positive and negative
energy states
FO.I example, if we are interested in states of either positive or negative energy, but not both, we can use the energy projection operators
A(p) =
p2m
+m .
(4. 6)
4
For any two vectors a and b, the anticommutation relation of "(matrices imply the relation
~~ =
In particular, when a
= b,
2a . b 
N.
(4.65)
this implies
#=a 2 .
(4.66)
It follows that acting on any solution of the Dirac equation, A
satisfy
(4.67)
Using Eq. (4.46), we see that the plane wave solutions of positive
and negative energy which we found are eigenstates of these energy
projection operators,
A+(p)us(p)
A_(p)vs(p)
= us(p) ,
= vs(p),
(4.68)
On the other hand
A+(p)vs(p)
= A(p)us(p) = O.
(4.69)
Similarly, using Eq. (4.48) we can write
us(p)A+(p)
vs(p)A_(p)
= us(p) ,
= vs(p),
(4.70)
60
Chapter 4. .Quantization of Dirac fields
as well as
(4.71)
4.4.2
Helicity projection operators
Another useful pair are the helicity projection operators. The spin
of a Dirac fermion, when measured along any direction, can be either + or  in units of ~h. A special direction is the direction of
motion of the fermion (except in its rest frame, of course). The spin
measured along the direction of motion is called helicity, and the helicity projection operators project out states of positive and negative
helicities.
We have noted in Eq. (4.39) that ~a~" is the spin operator for
fermions. Let us therefore define a 3vector
(4.72)
Next we project this operator along the direction of motion of the
fermion,
Ep
This has eigenvalues
+ 1 and
Ep
(4.73)
=
1.
Exercise 4.14 Check the eigen'\la.lues oj E p . (You can do this by
going to the Dirac Pauli rC'PTesento.tion, but also try to do it in a.
representationindependent Wo.1I.)
We can now define the helicity projection operators,
II(p) =
21 (1 E p )
(4.74)
These operators satisfy the relations expected of projection operators,
It is also possible to show that for any solution of Dirac equation,
the energy projection operators commute with the helicity projection
operators,
(4.76)
4.4. Projection operators
61
These relations are independent of the representation we may choose
to use, and this last one shows that we are allowed to choose common
eigenstates of A(p) and II(p) as the basis of our representation.
For example, if the 3momentum is in the zdirection, one can easily
verify that the spinors given in Eqs. (4.61) and (4.62) are indeed
helicity eigenstates:
(4.77)
with s = . For any other direction of the momentum, one can
make linear combinations of the two uspinors of Eq. (4.61) and the
two vspinors of Eq. (4.62) such that they satisfy Eq. (4.77). For the
uspinors, therefore, the subscript (+ or ) denotes the helicity. For
the vspinors, it is the opposite of helicity. As we mentioned after
Eq. (4.62), the reason for this choice will be explained in 4.6.
Exercise 4.15 ShOUl that, for arbitral" direction of the 3momentum, the spinoTs which aTC simultaneous eigenstates of A
and I1 are given by Eqs. (1,.61) and (4.62), Ulhere X are given not
by Eq. (4.60), but by
x+  /2p(p + p,)
x
y'2p(P+P,)
( P + 1', )
+ i1'y ,
1', + i1'y )
PI
P+p,
(4.78)
o Exercise 4.16 Prove the relations of Eq. (4.75).
o Exercise 4.17 * Prove the relations of Eq.
4.4.3
(4.76).
Chirality projection operators
In general, any operator that squares to the identity can be used to
construct a projection operator. If Q is an operator such that Q2 = 1,
we can define two operators P+ = ~(I+Q) and P_ = ~(1Q). These
operators obey pl = P, P+ + P_ = 1, and P+P_ = 0, i.e., they
are projection operators projecting onto orthogonal subspaces. The
energy and helicity projection operators A and II are specific examples of this sort of construction. Another obvious operator which
squares to the identity is ')'5. We can use 1'5 to construct another pair
of projection operators,
L=
2(1 
1'5) ,
(4.79)
62
Chapter 4. Quantization of Dirac fields
These are called the chirality projection operators. These turn out
to be closely related to the helicity projection operators, because in
the limit m + 0, the helicity projection operators become chirality
projection operators,
(4.80)
Exercise 4.18 Show that it is imiosSible to find (l linea,. combination oj the uspinors of Eq. (1. 61 th.at would be a.n eigensta.te oj
')'5, unless the jennion is massless. Furth.er, for a massless jennio1\,
ShOUl that the eigensto.tes oj 1'5 are identical to the heLicit'll eigenslales.
o Exercise 4.19 * Proue Eq. (4.80).
4.4.4
Spin projection operators
As mentioned earlier, helicity projection operators project out definite spin projections along the direction of the 3momentum of a
particle. They are therefore useless for a definite spin projection of
a particle at rest. For that one needs different projection operators.
Suppose we want to project the spin states for a certain spatial
direction denoted by the unit 3vector s. For this, first let us define
a 4vector nJl. whose components are given by
n
p' s
= ,
m
(p. s)p
s + m(E + m) ,
(4.81)
where (E, p) denote the 4momentum of the particle of mass m.
Notice that nil satisfies the conditions
(4.82)
Because of the second property in Eq. (4.82), it is easy to check that
the operator,9J. squares to unity. So we can construct the projection
operators
(4.83)
To understand the significance of these operators, let us consider
to be the xdirection, i.e., s = (1,0,0). In the rest frame of the
4.5. Lagrangian for a Dirac field
63
particle, n P = (0, s), so that rj. = /'1. Using the definition of /'s from
Eq. (4.12), we get /'srj. = 1)'0/,2/,3 = /'<1 23 = /'E 1 , where the 3vector E was defined in Eq. (4.72). This commutes with the spin
operator in the xdirection, El, so the eigenstates of this operator
are eigenstates of E' as well. In other words, for this choice of s,
the eigenstates of /'srf. have welldefined spin components along the
xdirection in the rest frame of the particle. The operators given
above project out the two possible values of the spin component.
Exercise 4.20 Consider 0. particle at rest, and take n~ = (a, 1,0,0)
like in the text. ConstiUct P T explicitly in the DiracPauli repnsentation of the 'j'matrices. Take the u and the v spinal'S shown in Eq.
(4.63). Show that Plu(O) and PIV(O) give eigenstates oj E r with
eigenua.lucs + 1 and 1 respecti'uelll.
o Exercise 4.21 Prouc the same thing, without taking a.ny e~Licit
fonn for the Dira.c matrices and spinal'S. [Hint: u(O) and v(O) are
eigenstates oj /'0, which joHows jrom Eq. (446).1
4.5
Lagrangian for a Dirac field
The problem with a single particle interpretation of the solutions of
the Dirac equation is the same as that with a scalar field. It contains
negative energy states. So we want to go over to a field theoretic
interpretation. The first step for that is to construct a Lagrangian.
The Dirac equation can be derived from a Lagrangian
(4.84)
If we put in the spinor component indices explicitly, this will read
.1!' = ,po (ihP)0~8p  mJo~),p~ .
(4.85)
From this, one can derive the EulerLagrange equations. We are
considering the case where ,p is a complex field (see Appendix A.l
for the possibility of a real field). Since components of ,p are linear
combinations of the components of ,pI, we can treat ,p and ,p to be
independent. Thus, the EulerLagrange equation for ,p gives
(4.86)
The left hand side is zero since the Lagrangian contains no terms
with a derivative of ,p. Thus we obtain Eq. (4.22).
1
64
Chapter 4. Quantization of Dirac fields
Exercise 4.22 Deriue the EulerLagra.nge equa.tion fo,.1/; a.nd show
that it is
l/>(il/+m)=O,
(4.87)
'Where the lefto.TToW sign on the den'lJo.ti\lc implies that the deriua.
liue should opera.te on things to its left. The.n ta.king the heT'11\itian
conjugate of this equa.tion and using the properties of the ')'mo.trices,
ShOUl that it is the sa.me equ.a.tion as that gi'ven in Eq. (4.22).
In 2.2 we mentioned that in classical field theories, the action
is a real number, a functional of the fields. The Lagrangian is then
also a real function in classical field theory. In quantum physics,
fields are operators, so the Lagrangian should be hermitian. Indeed,
the Lagrangians for scalar fields, given in Eqs. (3.5) and (3.45), are
hermitian. But the Dirac Lagrangian of Eq. (4.84) is not. The mass
term is hermitian, but the hermitian conjugate of the term involving
derivatives is not equal to itself:
(;1/'''8,,1)t = i(8"1)t/,,,t/'o1
= i(8,,1h"l/>,
(4.88)
using the hermiticity property of the Dirac matrices, Eq. (4.18), in
the last step. If we really want a hermitian Lagrangian, we can
discard the Lagrangian !L' of Eq. (4.84) and use instead
.
.
I
21
!L' = 21/J/,"8,,1/J  2(8,,1/Jh"1/J  m1/J1/J.
(4.89)
However, it is not absolutely essential that we use this hermitian
form of the Lagrangian. The reason is that
!L'  !L" =
~1/J/''' 8"1/J + ~ (8"1/Jh"1/J =
8" G1/J'l'''1/J)
(4.90)
Le., the difference is a total divergence. According to the discussion
of 2.2, the two Lagrangians should therefore be equivalent. We will
therefore keep using Eq. (4.84) for the Lagrangian of a Dirac field.
The Dirac Lagrangian is invariant under the transformations
(4.91)
for which the Noether current is
j" = q1/J'l'"1/J .
(4.92)
4.6. Fourier decomposition of the field
65
The conserved charge is given by
Q= q
d x 1/r'l"", = q
d x ",t",.
(4.93)
o Exercise 4.23 Find the equations of motion and the COf\scrued
CU1'1'ent jor the Lagrangian oj Eq. (4.89).
* The PauliLubansky \lector is dejined by
Exercise 4.24
(4.94)
'Wher. PIJ is the momentum opera.tor. A theorem then states that,
for pa.rticles which are eigenstates of momentum a.nd a.ngula.r momentum,
W"W" = m's(s + t),
(4.95)
where s is called the spin oj the pa.rticle. Using the e:r:pression jar J,..v
from Eq. (4.39), show that the spin of a. particle satisfying the Dira.c
equa.tion is ~. IHint: The contra.ction formulas of the antisllmmetric
tensor are gi"en in the Appendh.]
4.6
Fourier decomposition of the field
We now Fourier decompose the Dirac field as we did with the scalar
field in the last chapter. As with the scalar field, we write the Dirac
field as an integral over momentum space of the plane wave solutions,
with creation and annihilation operators a::; coefficients,
"'(x)
JJ
d p
(27T)32Ep
(fs(p)us(p)e iPX
+ fl(p)vs(p)e iPX ) .
s=I,2
(4.96)
Of course, this also implies
"'(x) =
JJ
3
d p
(27T)32E p
(f!(p)us(p)ei"X
+ f.,(p)vs(p)e iPX )
s=I.2
(4.97)
The total Hamiltonian can be written as
H =
=
3
6L
d x.
Ii"'Q(x)
"'Q(x)  L
d x "'( iy . V
+ m)"',
(4.98)
66
Chapter 4. Quantization of Diroc fields
using the Lagrangian of Eq. (4.84). To apply the operator (i'y. V
m) on 1jJ(x) given in Eq. (4.96), we note that
(y. p + m)u,(p)e ip .x
= '"YoEpu, (p)e ip .x ,
+ m)u,(p)e ip .x =
(i'y . V
(4.99)
using the definition of the uspinors from Eq. (4.46) at the last step.
Similarly,
(4.100)
Substituting these in Eq. (4.98) and using "'"Yo = u l and "'"Yo = vI,
we obtain
J d J P'J
3
H =
x
2(271')3
:L
d3
d3
fK x
PYE;
(J;,(pl)u;,(p')eiP'X
+ f.,(pl)v;,(p')eiP'x)
s,s'=1,2
(f,(p)u,(p)e iPX  fl(p)v,(p)e'P'x) .
(4.101)
Notice that the cross terms , Le., the ones involving utv and vtu!
vanish due to the normalization conditions from Eq. (4.49). The
other two terms give
H =
d3p Ep
:L
[f!(p)f,(p)  f,(p)fl(p)] .
(4.102)
.'1=1,2
This Hamiltonian can give negative energy eigenvalues, which is a
serious problem. And this does not go away by normalordering,
at least if we assume the definition of normalordering employed in
Ch.3. The way to take care of this problem is to assume that the creation and annihilation operators in this case obey anticommutation
rather than commutation relations:
[f,(p),f;,(pl)t
[f,(p), iJ,(pl)t
= O,,'03(p 
p'), (4.103)
while all other anticommutators are zero. And we define normal ordering for fermionic operators by saying that whenever we encounter
products of them, we shall put the annihilation operators to the right
and creation operators to the left as if the anticommutators were zero.
4.6. Fourier decomposition of the field
67
Using this prescription, then, we obtain the Ilormalordered Hamiltonian to be
:H: =
"'[IIs (p)Is(p) + I,rt (p)I,(p)
~ ]
d 3 pEp L.,
(4.104)
5=1.2
Exercise 4.25 Using the anticommutulion relation of Eq. (4.103) I
shoUl that the field operatoT's satisfy lhe reLation
(4.105)
whereas aU other anticommuto.toTS vanish.
o Exercise 4.26 * Den'll. the e::r.prcssion for AlI~1/ for
Show tna.t the
(4.39), i.e.,
COT1.SCMJed
0.
Dira.c j\eld.
charges are the sume as those gi'lJen in Eq.
(4.106)
o Exercise 4.27 There (lTe different con'Uentiolls regarding the normalization faT the spinoTs. One can use 0. quu.ntit'Y N J in place
oj 2Ep on the right ha.nd side oj the norm.atizution equa.tions, Eq.
(4.49). Moreover, one can define the anticommutation relations of
Eq. (4.103) with an extra factor oj N 2 on the right hand side, and
use a jactor N 3 instead of 2Ep under the square root sign in the denominator of the Fourier expansion in Eq. (4.96). Show that these
quantities must be related by
N,N, = N3
(4.107)
in order thut Eq. (4.105) i8 8nli8Jied.
Some consequences of the anticommutation relations and the new
definition of normalordering are worth exploring at this point. First)
notice that the Noether charge} in the normalordered form} becomes
:Q: = q
= q
d3 x :",1",:
d p
[I!(plI,(p) 1!(p)!.(p)]
(4.108)
3=1,2
Just as in the case of complex scalar fields, we see that the Noether
charge for the particles created by II is opposite to the charge of
those created by ft. The latter are therefore antiparticles of the
former ones.
68
Chapter 4.
Quantization of Dirac fields
Secondly, we said that all anticommutators other than the ones
mentioned in Eq. (4.103) are zero. For example, if we take the anticommutator of fl(p) with itself, it will vanish. So will the anticommutator of l1(p) with itself. This means that
fl(p)fl(p)
= l1(p)l1(p) = 0.
(4.109)
In other words J we cannot create two particles or two antiparticles
with the same spin and same momentum. Anticommutators imply
the Pauli exclusion principle.
We can also define the Fock space for fermions by starting with
the definition of the vacuum as
f,(p) 10) = 0,
f,(p) 10) =
for all p and s.
(4.110)
The states containing particles and antiparticles can be constructed
by acting on the vacuum by ft and j!. The Hamiltonian of Eq.
(4.104) and the anticommutation relations of Eq. (4.103) produce
the following relations:
[H, f!(k)l = Ekft(k) ,
[H, j,l(k)J = Ekj!(k)
(4.111)
These imply that both fi(k) and fi(k) create positive energy states,
a particle in the first case and an antiparticle in the second. In
particular, fi(k) 10) is a oneparticle state, and [,1(k) 10) is a oneantiparticle state.
To understand the helicities of these oneparticle states, we first
note that one can use the normalization conditions in Eq. (4.49) to
invert the planewave expansion for the field operator ..p(x), given in
Eq. (4.96), and write
I
j,l(k) =
J
J
d3x eikxu~(k)..p(x),
d 3x
eikXv~(k)..p(x).
(4.112)
Secondly, if we define the operator
(4.113)
r
4.7. Propagator
69
its commutation with the field operator can be easily derived from
Eq. (4.106) to be
(4.114)
since the orbital angular momentum part in Jpv does not contribute
to J k . Here E k is the helicity operator, defined in Eq. (4.73). The
corresponding commutators with j,(k) and J;.(k) can then be obtained by using the two relations in Eq. (4.112). For example, the
second of these relations gives
[J;.(k),J k ]
=~
2 )(2,,)32E.
Jd3x eikxv;(k)Ek"'(x). (4.115)
If we now use the convention of the helicity eigenstates as in Eq.
(4.77) where r = , we find
il]_lil
[h,j,(k) _  'irj,(k).
(4.116)
Operating on the vacuum which has no angular momentum, we find
that the state J;.(k) 10) has a spinprojection
4r in the direction of
the momentum, which implies a helicity
This l in hindsight, is
T.
the rationale for the convention of choosing the subscripts on the vspinors to be opposite to their helicity: v+ 1 for example, has negative
helicity, but the subscript is governed by the fact that the creation
operator that accompanies it in the Fourier expansion of Eq. (4.96)
in fact creates a positivehelicity antiparticle state from the vacuum.
The helicities of particle states can be calculated in the same way.
D Exercise 4.28 ShoUl that f~(k) crea.tes a. particle state 01 helicit'Y
r.
4.7
Propagator
Now we are interested in how the fermionic states move in space
time. We proceed as we did for the scalar field and write the Dirac
equation coupled to a source:
(iy~8~
 m) "'(x)
= J(x) ,
(4.117)
and introduce the Dirac propagator by
(iy~a:; 
m) S(x 
x')
= o'(x  x').
(4.118)
70
Chapter 4. Quantization of Dirac fields
Then the solution of the Dirac equation is given by
.p(x) = 1/Jo(x)
d'x' S(x  x')J(x') ,
(4.119)
where 1/Jo(x) is any solution of the homogeneous Dirac equation, Eq.
(4.22).
Our technique for solving Eq. (4.118) is the same as the one used
for scalar fields. We take a Fourier transform to write
S(x  x') =
d p eip.(xx')S(p)
(21T)4
.
(4.120)
Then the Fourier transform, S(p), satisfies the equation
(p m)S(p) = 1.
Multiplying both sides by
(4.121)
p + m, we obtain
(4.122)
This is a matrix equation, S(p) is a matrix. But p2  m 2 is a number,
so we can take it to the denominator on the right hand side. We
immediately see that there are poles for pO = Ep , which will cause
problems for the Fourier transform. So we use the s prescription as
for the scalar field to get the s<>called Feynman propagator:
p+m
SF(P) _
 p2 _ m 2 + is'
(4.123)
Often, for the sake of brevity, this is written as
1
SF(p) =
p m +.,
2
(4.124)
directly from Eq. (4.121), which should be taken to mean the same
thing as Eq. (4.123).
If we put the Fourier transform obtained in Eq. (4.123) into Eq.
(4.120), we get
d p e ip(xx') 2 P+ 2m
(21T)4
P  m + is
= (~+ m)6F(x  x'),
') =
S FXX
(
(4.125)
4.7. Propagator
71
where 6.F(X  x') is the propagator for the scalar fields derived in
Eq. (3.76). This gives
S (
, F X
') =
X
d p
(21r)32E
feet  t')(p + m)eip.(xx')
8(t'  t)(p  m)e'P(Xx'J]. (4.126)
The matrix elements of SF(X  x') can be written in terms of the
field operators in the form
(4.127)
where the timeordered product for fermionic fields are defined by
ift>t'
ift'>t.
(4.128)
Like the propagator for the complex scalar field, the propagator in
Eq. (4.127) describes either a particle going from (t',x') to (t,x), or
an antiparticle going from (t,x) to (t',x').
o Exercise 4.29 Derive Eq. (4.126) !rom Eq. (4.125).
o Exercise 4.30
Prove Eq. (4.127).
Chapter 5
The Smatrix expansion
So far we have dealt with free fields only. Even though we had a
SOUl ~.~ term for the derivation of the propagator the source was consider~\... nondynamical for that purpose, Le., the source itself did not
have a propagator. A truly free field h"" no experimental consequences, since it would not interact with the detection apparatus at
all. Moreover, if all fields were free fields, they would exist independent of one another and nothing would happen in the world!
Of course, the fields in the real world are not free. They interact
with one another, which gives rise to phenomena we can observe. To'
describe such phenomena, we therefore need a framework to describe
the interactions.
In the Lagrangians of free fields described earlier, we had only
terms which are quadratic in field operators. In terms of creation
and annihilation operators, this is exactly what is needed for describing free fields. At any spacetime point, the free Hamiltonian
can annihilate a particle and create the same. Thus, in effect, the
I
same particle travels on.
It is possible to have terms bilinear in two fields which would
annihilate a particle of one type and create a particle of another type.
These can be brought around to the standard quadratic form by a
redefinition of fields. Alternatively, they can be treated as quadratic
interactions of the original fields.
To describe any other type of interaction, we need some term in
the Lagrangian which has three or more field operators. Moreover 1
the term has to be Poincare invariant since the Lagrangian (density)
is. Also, the action is a real number, so the Lagrangian has to be
72
73
5.1. Examples ofinteractions
hermitian. Of course, as in the case of the Dirac field, the Lagrangian
can differ from a hermitian Lagrangian by total divergence terms.
5.1
Examples of interactions
The possibilities of interaction depend, of course, on the type of fields
in a theory. For example, suppose we have only one real scalar field
q, in our theory. Then in addition to the free Lagrangian we can have
the following interaction terms:
.>!int
_p.q,3 
),q,4 
),(n)4>n .
(5.1 )
n?:5
Here f.J, A etc are some constants which are called coupling constants,
Le.} their values determine how strong the relevant interactions are.
We have lumped all the terms with more than four powers of 4>
together under the summation sign. This is because the coupling
constants in these terms have negative mass dimensions. Such cou
plings introduce some problems in field theory, which we will discuss
later.
Exercise 5.1 Show that the mass dimension oj Ute coupling constant A(n) is 4  n.
If we have a complex scalar field, the interaction terms in Eq. (5.1)
can still be present. In addition, we can also have terms where one or
more factors of 4> is replaced with 4>1. Of course, the Lagrangian must
remain hermitian after these replacements. The free Lagrangian of
a complex scalar field is invariant under the change of the phase of
the field, which was mentioned in Eq. (3.54). If for some reason we
want to have this symmetry for the interaction terms as well, then
the most general form of the interactions would be
.>!int
= 
L
n~2
),(2n)
(4)' q,
(5.2)
If we have only spinor fields in a theory, our choices are much more
restricted. This is because the spinors transform nontrivially under
Lorentz transformations 1 and we must make combinations which are
Lorentz invariant. A product of an odd number of spinor fields cannot be Lorentz invariant. Thus the first nOlltrivial combination will
74
Chapter 5. The Smatrix expansion
have four fermion fields. We can also have more than four if we so
wish.
There can be many types of 4fermion interactions. For example,
we can write an interaction of the form
(5.3)
However, there are other possibilities. For example, we can also write
(5.4)
or
(5.5)
We can also make combinations using ,5, e.g.:
(5.6)
and 50 on. One combination which played a crucial role in the development of weak interaction theory is
(5.7)
The fields denoted by ,p in these equations need not refer to the same
fermion field. For example, suppose we want to describe nuclear beta
decay in which a neutron decays into a proton, an electron and an
electronantineutrino. In this ease, the last equation turns out to be
successful in explaining experimental data provided we replace the
four .,poperators as follows:
where the subscripts in parentheses stand for the particles involved.
The factor of ,J2 in the coupling constant is purely conventional.
Defined this way, Gp is called the iJdecay constant. Notice also
that in Eq. (5.8), we have added the symbol h.c.. This is because
the term written down in detail is not hermitian by itself. Since
the Lagrangian has to be hermitian, this means that the hermitian
conjugate term must also be present in the Lagrangian.
Let us now consider a field theory in which there is one real scalar
field and a spinor field ,p. In this case, apart from the interactions
5.2. Evolution operator
75
that involve only the ~terms or only the ,pterms, we can have other
interaction terms which involve both 4> and 1/;. For example, we can
have
.5!int
= 
h,p,p</> .
(5.9)
In the standard model of particle interactions, this kind of interaction
arises between the fermions and a scalar boson field representing the
Higgs particle. Similarly, one can have
(5.10)
Originally, a term like this was proposed to describe the interaction
between protons and neutrons (which are together grouped as nucleons since they occur in atomic nuclei) with spinless particles called
pions,
Exercise 5.2 ShOUl that, foT' the inte.raction Lagrangia.n to be hermitia.n, the coupling consta.nt h in Eq. (5.9) has to be renl, and hi
in. Eq. (5.10) ha.s to be pu.relll ima.ginary.
In writing all these interactions, we have ignored the free terms
in the Lagrangian since they are already known to us. It is the
nature of the interaction that defines a specific theory, and so theories
are named after their interaction terms. The type of interaction
given in Eq. (5.1O) was first envisaged by Yukawa. In the modern
literature, interactions of this form and also of the form in Eq. (5.9)
are together called Yukawa interactions. Any theory with a scalar
and a fermion field with such interactions is called Yukawa theory.
The interaction with four fermionic field operators was first discussed
by Fermi and are therefore called Fermi interactions, or sometimes
4fermion interactions. With the scalar field, we can have a A</>4
theory where the only interaction is the 4>01 interaction. Discussion
of these and other interactions requires a gClleral framework to deal
with interacting fields, which we now proceed to construct.
5.2
Evolution operator
If for a given system we could find all the quantum states we would
be able to exactly predict the behavior of the system from its initial
conditions. In other words, given an initial state Ii) consisting of
76
Chapter 5. The 5 matrix expansion
some particles with given quantum numbers we would know all the
possible states If) it could evolve into, along with the amplitudes
of transition. Unfortunately, apart from a few and almost always
unrealistic exceptions, it is not possible to find the eigenstates of a
Hamiltonian if the fields in it interact with each other. Therefore,
given a theory, we separate the Hamiltonian into a part which we
can solve exactly and another which we cannot,
H= Ho+HI.
(5.11)
Of course we could have done the separation in the Hamiltonian
density as well. Here H o includes all the bits of H whose eigenstates
are exactly known to us. We shall restrict ourselves where H o is the
free Hamiltonian of the fields concerned, which can be written in
terms of the creation and annihilation operators as described earlier.
The remaining piece, HI, contains all the interactions. We want to
find some way of describing the interactions in terms of the free fields.
We shall have to resort to perturbation expansions in order to achieve
this. There can be situations where H o includes the Hamiltonian
for bound states, describing an atom for example, and HI describes
interactions of particles with the atom. We shall not consider these
cases, but the formalism we shall develop can be applied equally well
to them.
A state evolving under the total Hamiltonian H = Ho+HI obeys
i
:t
IIlI(t)} = (Ho + HI) IIlI(t)) .
(5.12)
On the other hand in the absence of interactions, the evolution of
states is governed by the free Hamiltonian H o, which is constructed
out of free field operators. The states then obey the equation
.d
'dt IllIo(t)) = H o IllIo(t)) .
(5.13)
Let us now define two different evolution operators Uo(t) and U(t)
such that
IllIo(t)) = Uo(t) 1111 0(00))  Uo(t) Ii) ,
IIlI(t)) = Uo(t)U(t)UJ(t) IllIo(t)) .
(5.14)
If the states on both sides are taken to be normalized states, the
operators Uo(t) and U(t) are unitary for any t. The initial state Ii)
5.2. Evolution operator
77
is a member of some Fock space. Then
= Uo(t)U(t) Ii)
II\1(t))
(5.15)
We can write Eq. (5.13), which is Schriidinger's equation for the free
system, as
(5.16)
By considering a complete set of states at t
this into an operator equation,
= 00,
we can convert
(5.17)
which implies
Uo(t) =
e'Hot ,
(5.18)
assuming H o is not explicitly dependent on time.
Let us now consider a system evolving under the full Hamiltonian.
SchrOdingerls equation for this system is
i ~ II\1(t))
= (Ho + HI) II\1(t))
(5.19)
We can simplify the left hand side by using Eq. (5.15),
i :t II\1(t))
= i ~ (Uo(t)U(t))
(i
Ii)
;?) U(t) + Uo(t)id~;t))
dU
Ii)
(HoUo(t)U(t) + Uo(t)id~;t)) Ii) ,
(5.20)
while the right hand side reads
(H o + HIlII!'(t)) = (Ho + HI) Uo(t)U(t) Ii) .
(5.21)
So we can now equate the two sides, and going to a complete set of
initial states (at t = 00) as before, we can write Eq. (5.19) as an
operator equation,
HoUo(t)U(t)
dU(t)
+ Uo(t)i dt
= HoUo(t)U(t)
+ HIUO(t)U(t). (5.22)
78
Chapter 5. The S matrix expansion
The first term on the left cancels the first term on the right, and
multiplying both sides by Ud(t), we get
id~;t) = Ud(t)H/Uo(t)U(t)
= H[(t)U(t),
(5.23)
where
(5.24)
Let us try to understand the meaning of H / (t). The Hilbert space on
which both sides of Eq. (5.23) act is the set of states which we had
denoted by Ii} earlier. Let us denote the basis vectors in this space
by 1,6}. These are then the timeindependent free states of definite
quantum numbers. If we now look at the matrix elements of H/(t}
between these basis states, we find that
(f3I H /(t)1 f3') = (f3I Ud(t)H/Uo(t)1 f3')
= (f3,t IHJif3', t) .
(5.25)
Here we have written
Uo(t) 1f3}
= 1f3, t}
(5.26)
the timedependent free state into which 1f3} would have evolved at
time t had there been no interactions. So the matrix elements of
H/(t) between timeindependent basis states are equal to those of
HI between the corresponding timedependent free states. In other
words, RI(t) is the interaction Hamiltonian Hll now written in terms
of free fields at time t. Sometimes this is referred to as the 'interaction
Hamiltonian in the interaction picture'. Eq. (5.23) is solved by an
integral equation,
t
U(t)
U(to) 
r dtl H/(tIlU(tIl.
Jto
(5.27)
Now we make an assumption, that the states 1f3} are also a complete set of eigenstates for the full Hamiltonian Hast ~ 00. In
other words, HI t 0 as t + 00. We could think of this as saying
that all fields are free fields in the remote past, and the interactions
5.2. Evolution operator
79
are slowly turned on. This makes sense in scattering experiments,
where we have particles moving in as free particles, interacting briefly,
and moving away as free particles again. Our assumption boils down
to U(t) . 1 as t . 00.
Putting to = 00 in Eq. (5.27), we can rewrite it as
(5.28)
This is an exact equation obeyed by U(t), but is useless in this form
because of the presence of U(t]) on the right hand side. We can
obtain a solution iteratively if we replace the U(tIl on the right hand
side by Eq. (5.28). This gives
Replacing U(t2) again by Eq. (5.28) and carrying on this procedure
indefinitely, we obtain
This can be written in a mu~h nicer form if we use the timeordered products of operators. For example, take the term in the
sum with n = 2. Note that
(5.31)
The first term is obtained for tt > t2, and the second for t2 > tl.
Now, by interchanging the dummy variables tl and t2 in the second
integral and adding up the two integrals, this reduces to
(5.32)
80
Chapter 5. The Smatrix expansion
Similarly, the higher terms can also be rewritten in terms of timeordered products, and the perturbative solution for U(I) can be writ! ten as
U(t) = I + ~
00
(i)"
n!
l' l'
00
dt]
_oo
r'
<it, ... .1= dt"
.'Y [H,(t] )H,(I.,) H,(I,,}].
(5.33)
Time ordering of n operators has the ohviO\IS meaning that any operator appears to the right of all other 0IH'ratvl's at later times. One
sometimes writes U{/.) in the morc compact. fonn
U(I.) = 5" [ex p
(i loo Ill' H,(I'))] ,
(5.34)
which is to be illtcrpret(~d to mean Eq. (5.3:3).
The expre"iou for U(I.), E<]. (5.33), would he exact if the series
011 the right ltalld side n>llvcrged. In g(~Il('l'al. for arhitrary HI, the
series docs !lot COli verge. As a result, we arc usually int.erested in
cases where II I is proport.iollal to some 'slIlall' parameter. (Here,
as in qlla,ntullI Illc('hanicti, 'small' implies t.ilat t lu' ('xpcdatioll value
of H, in allY state is SIIIaIl compared to 1.11<' typical f'llcrgy scale of
the systelIl.) However, it turns out that <,Vt'li t.hell. for 1II0st I.heories
the expression for U(t) does !lot cOllverg<'. Ills\.('ad of a convergent
series, we get all (l~!Jm]Jt()tic series, one ill which c:tcli term is smaller
than the precedillg OIW, hut the slim docs llOt. lIecessarily cOllverge.
One could therefore question the validity of the formalism derived
above, fl.'"; well a.,; physical results which W(' shall COlllplltc by Ilsing
it. This is not a t.rivial issue, and aL'; a matk!" of fact. tlJis qllestion
survives all known ways of qualItizillg field t1worics. 1I0!. just the
callonical quantization that we have dotH'. TIJ(' H'sIHHISC to this is
that. the test of allY formalislIl is ill its agf('{'IIIPlit. with experimcnts.
It is trllly f0.llla.rkahlc~ that qualltlllli field tl)('ory. dt'spitf' somc severc
mathematical proiJlclIls ill defining it, pa..'is('s tlw tl'St. of ('Xperinlental
verihcatioll with flying: colors.
5.3
Smatrix
TiJ(' SIllatrix is
s=
df~fil1t'd a~
,~~~ U(I)
the lilllit
='7
["Xl' (if: dl 11,(/))] .
(5.35)
5.3. Smat6x
81
Since
(5.36)
where xO = t on the right hand side, we can also write the definition
of the Smatrix in a neater form:
S = g [ex p
(i Jd'x J1)(X))] .
(5.37)
We have already shown in Eq. (5.14) that U(t) i, unitary for any t.
Thus the Smatrix is unitary:
SSI = SIS = 1.
(5.38)
Why is the Smatrix useful? Particle interactions usually occur
for small intervals of time. Long before thb time, one can consider the
particles to be essentially free. Similarly, long after the interaction
period, the particles are again essentially free. A typical problem of
interaction can then be reduced to finding out whether a given state
of free particles, through interactions, can evolve to another given
state of free particles long after all interactions have cea."icd.
Consider a state at t = 00. Let us denote this initial state by Ii).
This is a state containing noninteracting fields, with HI = o. Now
let us slowly turn on the interaction, let tlw state evolve under the
full Hamiltonian H, and again slowly turn off the interaction. After
a long time the system must again be free and we call describe it by a
superposition of states of the form Uo(t) If!) where li'f) are our timeindependent basis states of free particles (free particle eigenstatcs
of H o with definite quantum numbers). Using Eq. (5.15), we find
that the amplitude for transition from Ii) to another specific state
If) (Ii), If) E (II')}) at late times t ~ 00 is
t~ (1 IUd(tjUo(t)U(tll i)
t~~ U IU(t)i i)
U lSI i)
, (5.39)
which is just a matrix element of the operator S in the vector space
of free states.
We have been a bit sloppy over the busiBcs~ of slowly turtling
the interaction on or off. It is physically understaudable <lli well as
justified from an experimentalist's point of view, hut we had already
assumed in our derivation that the Hamiltoniau did Hot depend explicitly on time, so we should not really illdllde it time dependent
82
Chapter 5. The
, Smatrix expansion
part. An alternative derivation involves defining 'in' and 'out' states,
/lJi in (",)) and IlJiout("'ll which are timeindependent eigenstates of the
full Hamiltonian H with quantum numbers collectively denoted by
"', but consisting of particles which are too far apart to interact, at
t + 00 and t + 00, respectively. The 'in' and 'out' states are
separately taken to be orthonormal and complete. The Smatrix is
defined as the infinite matrix with elements
(5.40)
While a derivation of the Smatrix using 'in and 'out' states is more
rigorous, it turns out that our free states, the eigenstates 1,8) of the
free Hamiltonian H o, are related to the 'in' and 'out' states by
1
IlJi in (,8)) = 11(00)1,6),
IlJi out (,8)) = 11(00) 1,8) ,
(5.41 )
where l1(t) is related to the evolution operator by U(t)
I1 I (t)I1(oo). Then the elements of the Smatrix are
(lJiout (,8)l lJi in("')) =
.~~ (,8 Il1t (t)l1( 00)1 "')
= lim (,8IU(t)I"') ,
'00
(5.42)
which are identical to the matrix elements of the Soperator defined
above. There are certain assumptions involved in taking these limits,
but for all purposes we can ignore such subtleties and work with the
Soperator.
5.4
Wick's theorem
Each term in the perturbative expansion of the Smatrix contains a
time')rdered product of a number of factors of ~, each of which is
normal ordered. The normal ordering procedure involved putting all
the annihilation operators to the right of all the creation operators
so that it annihilates the vacuum. But the time ordering raises complications because in it all operators at earlier times must be further
to the right. So creation operators at earlier times would be to the
right of annihilation operators at later times, contrary to what we
need for normal ordering. The advantage of normal ordered products
is that their expectation values vanish in the vacuum. So we would
5.4. Wick's theorem
83
like to get bock from time ordered products to normal ordered products. For this, we need a relation between the two. This relation is
provided by Wick's theorem.
Consider first the simple example of the timeordered product of
two foctors of a scalar field. Remember the definition given in Eq.
(3.83), which can be written as
g [4>(x)4>(x')]
= e(t 
t')4>(x)4>(x')
+ e(t' 
t)4>(x')4>(x). (5.43)
Our task reduces to writing the simple products on the right hand
side in terms of normal ordered products. To this end, let us write
4>(x)
= 4>+ (x) + 4>_(x) ,
(5.44)
where the 4>+ contains the annihilation operator and the 4> contains
the creation operator. They are thus shorthands for the two terms
appearing in Eq. (3.18). Because of the definition of the vocuum
state in Eq. (3.32), we can write
(014)_ (x)
=0
(5.45)
Now,
4>(x)4>(x') = 4>+(x)4>+(x') + 4>+(x)4>_ (x')
+4>_(x)4>+(x') + 4>(x)4>_(x').
(5.46)
On the other hand, if we write the normal ordered operator
:4>(x)4>(x'):, the only difference in the right side of Eq. (5.46) will
be in the second term, where the order of the two operators mllst be
reversed. In other words,
:4>(x)4>(x'): = 4>(x)4>(x')  4>+(x)4>(x') + 4>_(x')4>+(x)
= 4>(x)4>(x')  [4>+(x),4>(x')L .
(S 47)
The commutator on the right side can be put into a more convenient form. Because of Eq. (5.45), we have
(0 14>(x)4>(x') I0) = (0 14>+ (x)4>_ (x')1 0) ,
(5.48)
since the other terms vanish. We can also write it as
(0 14>(x)4>(x')1 0)
= (01
[4>+(x),4>(x')L
10) ,
(5.49)
Chapter 5. The Smatrix expansion
84
since the extra term introduced in the process vanishes anyway owing
to Eq. (5.45). However, since the commutator appearing on the right
side is a number, its vacuum expectation value is the number itself.
Thus we can write
(0 1(x)(x')1 0)
(5.50)
= [+(x), _(x')]_
Putting all this together, we have
(x)(x')
+ (0 1(x)(x')1 0)
:(x)(x'):
(5.51)
Putting this in the definition of the timeordered product, Eq. (5.43),
we obtain
.'7 [(x)(x')]
since 8(t  t')
+ 8(t' 
:(x)(x'):
+ (01.'7
[(x)(x')]IO),
(5.52)
t) = 1, and for scalar fields :(x)(x'):
:(x')(x): .
On the right side here, we have the vacuum expectation value of
the timeordered product of two field operators. Since such quantities
will appear often in the ensuing discussion, it is useful to develop a
compact notation for them. Such quantities are usually called Wick
contractions and are denoted by the following symbol:
(01.'7 [(x)(x')] I0)
= (x)(x')
~
(5.53)
Thus, Eq. (5.52) can be rewritten with the help of this notation in
the form
.'7 [(x)(x')] = :(x)(x'):
+ (x)(x')
(5.54)
More generally when the two fields appearing in the prod net are
not necessarily the same, we will write
.'7 [<I>(x)<I>'(x')]
:<I>(x)<I>'(x'):
+ <I>(x)<I>'(x')
(5.55)
o Exercise 5.3 Show that for two jermionic. operators 1/J(X) and
1J;'(x')' although Ute definition oj the timeordered pro<luct has a negati\le sign Jar one oj the tfTms, one still obtains Eq. (5.55) since the
commutators also get replaced by anticommutnt01'!:I.
Since the contraction is a vacuum expectation value, it will vanish
unless one operator (to the right) creates a particle which the other
5.4.
Wick's theorem
85
operator (to the left) annihilates. So if they correspond to different
fields, this expectation value vanishes. If they correspond to the
same field (or a field and its adjoint) we get i times the respective
Feynman propagators,
(x!l(X2)
,
,
="iC>F(X,
 X2) ,
(xtlt
(X2) = t(x2)(xIl
= iC>F(XI  X2),
,
,
"
'fo(xIl'f~(x2)
= 'f~(X2)'fo(x!l
= iSFo~(XI  X2) (5.56)
Wick's theorem can be generalized to any number of field operators by multiplying on the right by a field and considering the
interchanges required to change from a time ordered product to a
normal ordered product. The theorem is then proven by induction.
The result is
g [<I>, (xtl . <l>n(x n)]
:<1>, ... <l>n:
+ [:tj'2 <1>3'"
+ [:tj'2
+
.
+
+ perm]
t14 <1>5'"
<I>, <1>2
LJ
{ 4> 1ef>2
l.
<l>n:
<l>n:
... <l>n_1 <l>n
+ penn]
+ perm]
L.J
... 4>n2<1>n1
I
4>n
if n is even,
perm] if n is odd.
(5.57)
A few words about the notation used in writing up this result. All
possible pairs are contracted, first one pair, then two pairs, and so on
until we run out of pairs to contract. The word 'perm I occurring here
means permutations on all fields present. The formula is long. So,
in order to save space, we have omitted the ~pacetime points corresponding to the field operators. It is to be understood that the field
4>1 is a function of the spacetime point XlI and so on. The contractions are shown within the normalordered product~. However, since
the contractions are not field operators but rat.her the vacuum expectation values of some combinations of them, they should be treated
as numbers and can therefore be written outside the normalordering
symbol as well.
86
Chapter 5. The Smatrix expansion
Finally, in the Smatrix expansion, we had timeordered products
of normalordered objects:
[~(xtl ~(xn)J
=g
[:AB . (Xl): ... :AB . (x n ):]. (5.58)
We need to avoid commutators or anticommutators at the same point
in spacetime because they are infinite. For this, we change the times
of creation and annihilation operators in the rth factor by some
small amount 'r > 0 so that creation operators act after annihilation
operators,
~+(Xr)
>
~+(x~  ,,,Xr ) ,
~_(xr)
>
~_(x~+,,,Xr).
(5.59)
Then within each normal ordered group we have automatic time
ordering so there is no contraction within a group. Then we take
> 0 after we have done all other contractions, and we have the
result
'r
g [:AB(XI): ... :AB(xn ):]
= g [(AB (Xl)) (AB (Xn))]no etc.'
(5.60)
where the right hand side is expanded according to Wick's theorem,
but avoiding all equaltime contractions.
Wick's theorem looks formidable, but actually its implementation
is not very difficult. In calculating Smatrix elements for specific
initial and final states, we need to take the matrix elements of the
timeordered product given above between those states. Thus in
the Wick expansion, we need only those terms where, apart from
the contractions, we have just the right number of field operators to
annihilate the initial state and create the final one. These comments
will probably be clearer as we look at specific examples.
o Exercise 5.4 * Assuming tha.t Wick's theorem holds JOT the 'PToduct of n jidda as in Eq. (5.57), snow tno.l it nolds for tne product of
n + 1 fl.elds. The theorem is then 'P1'o13ed b'Y induction, starting from
Eq. (5.55).
Chapter 6
From Wick expansion to
Feynman diagrams
In this chapter, we show how to use the Wick expansion to calculate
Smatrix elements involving scalars and spinors. As a concrete example, we take a theory with a fermion field and a scalar field, which
interact via the Yukawa interaction:
2lnt
= 
ht/J1JJ1> .
(6.1)
We will consider specific initial and final states, and in each case
see which terms in the Smatrix expansion can give nonzero contribution to the relevant Smatrix element. An alternative approach
could involve considering a specific order in the Smatrix expansion
and investigate which processes it can contribute to. This approach
will be taken later in the book for Quantum Electrodynamics.
6.1
Yukawa interaction : decay of a scalar
Let the quantum of the field 1> be denoted by B, since the particle is
a boson. The quanta of the fermionic field 1JJ will be called electrons.
Of course, the mathematical results that follow will not depend on
whether the quanta of t/J are electrons or some other spin ~ particle.
We give the names only to avoid cumbersome sentences when we
explain what is going on.
Let us denote the mass of the particle B by M and the mass of
the electron by m. Suppose M > 2m, so that kinematically it is
possible to have the B particle decay into all electronpositron pair.
87
88
Chapter 6. From Wick expansion to Feynman diagrams
Let us denote this process by
(6.2)
where k, p, p' are the 4momenta of the particles. Our objective is to
calculate the Smatrix elements for this process.
The Hamiltonian derived from the Lagrangian will also have the
usual free terms, minus the interaction Lagrangian. This is obviously
true for all theories without derivative interactions. It also happens
to be true if derivative interactions are present. For the fermionscalar interaction,
.n} = h :1/np: .
(6.3)
Let us now look at the term linear in the interaction Hamiltonian in
the Smatrix. It is
S(I)
= g
[i! a'x .n}(X)]
= ih
a'x :1/njJ: .
(6.4)
In the last step, we have omitted the timeordering sign, since in this
case, there is only one spacetime point involved and so there is just
one time. Consequently, Wick's theorem is not needed here  we
already have the normal ordered product.
It will be convenient at this point if we use the notation of Eq.
(5.44) and a similar one fot the "'field, where the creation and the
annihilation parts of the field operator are denoted separately. Thus
for the "'field, the 1/J+ part will contain the annihilation operator for
the electron and the "'_ part will contain the creation operator for
the positron. Similarly, in "', the part containing the annihilation
operator for the positron will be denoted by 1/J+, and the part containing the creation operator for the electron is 1/J_. The subscript is
what we get for the sign of E when we act on the Fourier transform
by the energy operator ia/at. A good mnemonic for this notation is
that the subscripted sign on the field operator is the negative of the
sign in the exponential factor appearing in the Fourier decomposition
of the field operator. Using these notations, we can write
6.1.
Yukawa interaction: decay of a scalar
89
Consider now the matrix element of this operator between the
initial and the final states given in Eq, (6.2). If we use the terms in
S(I) which involve the 1> and consider 1> IB), it will create another
Bparticle in the initial state, and this can never match the final
state. Thus such terms will vanish. On the other hand, if we consider
<P+ IB), the particle B of the initial state will be annihilated by the
operator, and we will get the vacuum state. From the vacuum state,
the operator 1/1_1/1 can create an electron and a positron. Since this
matches the final state, we will get a nontrivial result. In other
words, among the eight terms occurring in the notation of Eq. (6.5),
only one term will contribute to the matrix element of the process in
question, which is
ih
dx1/l_1/I_1>+.
(6.6)
In writing this form, we have put the operators in the normal ordered form anyway, so we have omitted the normalordering symbol.
All the field operators pertain to the spacetime point x. Thus the
annihilation of B occurs at the same point in spacetime where the
electronpositron pair is created. Pictorially, this can be represented
by the diagram in Fig. 6.1.
B(k) <e(p)
........
e+( p')
Figure 6.1: Feynman diagram for Smatrix element coming from Eq. (6.6),
This diagram is a trivial example of what are called Feynman
diagrams. This one has three lines, representing the initial and final
particles. They meet at a point, called a vertex, where the fields
interact. In order to distinguish the scalar field from the fermions,
we have used a dashed line for the scalar field and solid lines for the
ferrrions.
Let us describe our convention of putting the arrows on the lines
in these diagrams. Time will be taken to flow from left to right in all
diagrams. Initial particlcs enter a diagram from the left, and final
particles leave a diagram to the right.
90
Chapter 6. From Wick expansion to Feynman diagrams
The arrow on an electron line will always point to the future, Le.,
to the right. The arrow on a positron line will always point to the
past, I.e., to the left. Since momentum always points from the past to
the future, it follows that for the electron, momentum flows along the
arrow, whereas for the positron, momentum flows against the arrow.
In addition, we have put a minus sign in the positron momentum.
So, in the diagram, the positron appears to be going into the vertex
with a momentum P/, which means it is in fact leaving the vertex
with a momentum p'.
The convention on time's direction varies from one book to another. In fact, if the lines are labeled as e+ or e, the arrows are
not needed to decide which one is electron and which one is positron.
Then one can dispense with time's direction altogether and put the
arrow in the direction of the momentum. However, an additional
advantage of fixing a direction for the flow of time is that charge
conservation is automatically ensured at each vertex if the arrow is
made continuous. Therefore, and for ease of understanding, we shall
allow a little redundancy and do both. We shall label the lines as e+
or e, as well as take time to flow from left to right.
The value of the matrix element obtained from this term will be
discussed in 6.3. For now, let us just try to see which other terms
have nonzero contributions to the process of Eq. (6.2). Let us look at
the second order term in the Smatrix expansion. Here we will have
two factors of the interaction Hamiltonian. These factors will contain
six field operators. Of them, we need three to annihilate the initial
particle and create the final ones. That leaves us wi th three more.
Since this is an odd number, all these remaining operators cannot
be combined into contractions. In the Wick expansion, we will have
terms which have one I or all three of these remaining operators sitting
outside the contractions. When we take the matrix element I these
terms will vanish. Extending this argument, one can show that the
matrix elements for this process will vanish for all even powers of
this interaction Hamiltonian.
The first nontrivial corrections to the amplitude obtained with
S(1) appear in S(3), the term with three factors of the interaction
6.1. Yukawa interaction: decay of a scalar
91
Hamiltonian. This term is
S(3)
(_;~)3
J Jd" Jd"
d"xl
x2
x3
[:(I/"/I</XI: :(,p,p</,,: :(,p,p</,,:],
(6.7)
where we have indicated the spacetime dependence of each normal
ordered term by a subscript. This is done just to save some writing.
The Wick expansion of this term will now involve a lot of terms.
Let us therefore start with some initial observations to help our way
through the maze. First, we know from Eq. (5.57) that in the Wick
expansion, we must take all permutations. However, all such terms
are related by a renaming of the spacetime points. Since all the
interaction points are integrated over in Eq. (6.7), these terms are
equal to one another. We can get rid of these repetitions by considering terms in the Wick expansion which are different irrespective of
the labels assigned to the different spacetime points. The permutation of the spacetime points would then provide a factor of n! for the
nth order term in the Smatrix expansion, which precisely cancels
the lin! coming from the exponential of Eq. (5.37).
This is an observation which 1:1Olds in general, for any interaction
Hamiltonian and any process in question. Now let us turn to the
process we have been discussing. Remember that we need one </>
operator to annihilate the particle in the initial state, and a ,p and a
,p to create the particles in the final state. All the other fields must be
contracted, otherwise the matrix element will vanish. Thus, in the
Wick expansion of S(3), the only terms which will give a nonzero
matrix element for the process of our interest are the ones which
have three pairs of contracted fields. The contraction of fields of
different types, e.g., </> with ,p or ,p, or ,p with ,p, vanish. Let us
denote the relevant terms with a superscript indicating the number
of contractions and write them out:
92
Chapter 6. From Wick expansion to Feynman diagrams
+ :(V"/Jr/J)XI
(V"/Jr/J)x,
(1/J1/Jr/J)X3:
I
I I
' I
I
+ (1/Jtr/J)"
(rtf)x,
(1/J1/Jf)x3
+ :(1/J1/Jr/J)XI (1/J1/Jr/J)x,
(1/J1/Jr/J)X3:
II
I' I
(6.8)
We can represent all such terms pictorially by Feynman diagrams.
Consider the first term on the right hand side of Eq. (6.8). Let us
rewrite it by separating out the contractions. This time, we explicitly
write the spacetime point corresponding to each operator, and also
the spinor indices:
The minus sign appears because of the rearrangement of the fermion
field operators. It is an odd permutation of the original order, hence
the sign. We can interchange the order of another pair of fermion
field operators to rewrite it as
In the normalordered factor, the only term which gives a nOnzero matrix element for the process in question is given by
(6.11)
as we argued just before Eq. (6.6). The spinor indices have been
raised only for convenience, and are equivalent to lower indices. Thus
the boson B is annihilated at the point x" the electron is created
at X2. and the positron is created at X3. Also, from Eqs. (6.10) and
(5.56), we find that there is a scalar propagator between X2 and X3,
and two fermion propagators, one from x I to X2, and another from
X3 to x ,. All these statements can be pictorially represented by the
Feynman diagram of Fig. 6.2a.
Notice that the contractions now appear as internal lines of this
diagram. By this, we mean lines which do not represent the particles
in the ini tial state or the final state. For example, the contraction of
r/J(X2)r/J(X3) is now represented by a dashed line which runs between X2
6.1. Yukawa interaction : decay of a scalar
B(k) xv~C(p)
.... ..
_~
_.~
(a)
X3
(,I
e+( _pi)
'i"'~; ,(,I
X2
._~
B(k)
Figure 6.2:
Xl
93
o <
e(p)
1!_(kJ_(b)
X3
e+(p')
(dl t:/>h>'1
X2
.. _..
B(k)
~._.
e+( _pi)
Xl
e(p)
Feynman diagrams representing the first four terms
of Eq. (6.8).
and X3. Similarly, there are two internal fermion lines, one between
Xl and X2, and another between X3 and Xl' These internal lines
appear as propagators in the evaluation of the Smatrix element.
An internal scalar line represents the contraction between two
operators at two spacetime points. For an internal fermion line,
however, the situation is slightly different, because here the contraction is between the ,poperator at one point and the ,poperator at
another. As we know, the propagator represents a particle being
created at one spacetime point.and annihilated at another. We can
indicate the points of creation and annihilation in the diagram itself,
by putting an arrow on the internal line which points towards the end
where the ,poperator occurs. For example, from Eq. (6.10), we see
that for the fermion line between Xl and X2, we have the ,poperator
at X2 and the ,poperator at Xl. Consequently, we have put an arrow
pointing from Xl to X2 on the fermion line ill Fig. 6.2a. Similarly,
for the other internal fermion line in this diagram, the direction of
the arrow is from X3 to Xl. This rule automatically maintains the
continuity of arrows on the fermion lines.
Let us now move on to the next term in Eq. (6.8). Here we have
an internal fermion line going from Xl to X2, but there is also one
coming back from X2 to Xl. Therefore, the Feynman diagram has
a fermion loop as shown in Fig. 6.2b. We also find a B propagator
between X2 and X3. Note that the arrow convention is valid only for
the initial and final particles  internal particles may be neither in
the past nor in the future of one another.
94
Chapter 6. From Wick expansion to Feynman diagrams
In the third term of Eq. (6.8), the initial B is annihilated at Xl,
which is also the point where the final e+ has been created. The
Feynman diagram for this case has been given in Fig. 6.2c. The
fourth term of Eq. (6.8) is equivalent to the diagram in Fig. 6.2d,
since here the final C has been created at the point where the initial
B has been annihilated.
Figure 6.3: Feynman diagram representing the last term of Eq. (6.8).
The last term remains to be discussed. It is pictorially represented in Fig. 6.3. As is clear from the diagram, this term represents
the amplitude of Fig. 6.1 times an amplitude for a vacuum to vacuum transition. This latter part comes from the loop shown, since
it has no external legs. [f the vacuum is stable, as it is assumed to
be for all physical systems, all such vacuum to vacuum amplitudes
must add up to an overall phase factor. Since such diagrams can
be added to any nontrivial diagram, they provide the same overall
phase factor to all diagrams. Therefore, we will happily ignore all
vacuum to vacuum processes.
o Exercise 6.1 Dro:u) the Fe\lnmo.n diagram( M) at lowest order at
which the 8calmsoolar 8co.tle1'ing BB
I
BB takes pluce.
Exercise 6.2 Write down the hermitiun conjugate term in Eq.
(5.8), a.nd show that it can mediate processes like p + e I> n + V e .
6.2
Normalized states
So far, we have shown how only a few terms in the Wick expansion
contribute to the Smatrix elements for specified initial and final
states. We have not calculated any matrix clement explicitly.
To perform such a calculation, we face a small problem. We have
defined the states in the Fock space by acting on the vacuum by
the creation operator, as for example in Eg. (3.39). However, this
95
6.2. Normalized states
definition is not the best for calculations, for two reasons. First,
the creation operators have a mass dimension of 3/2, as can be
checked from the Fourier decomposition of Eq. (3.18) for the scalar
fields, and of Eq. (4.96) for the spinor fields. Thus with the definition
of Eq. (3.39), the oneparticle states have different dimensions from
the vacuum state. Continuing on this argument} the dimension of
the twoparticle states will be more different, and so on for other
many particle states. The other problem is that the norm of any
oneparticle state diverges, as seen from Eq. (3.40) if we put p = p'.
To avoid these problems} we can define the states within a finite
region of volume V. Physically, this does not create any problem
provided the volume V is large enough so that the boundaries of
this region are far removed. from the region of interaction. We can
calculate quantities of physical interest using such states, and then
take the limit V ~ 00 at the end of the calculations. With this
in mind, we continue to use the Fourier expansions of fields in the
infinite volume limit.
The vacuum state is assumed to be dimensionless by virtue of the
normalization condition in Eq. (3.33). We want to define the other
states to be dimensionless as well. Let us now define the oneparticle
states as
(6.12)
for the scalar field whose quanta are denoted by B. Note that the ket
is now dimensionless. Similarly, for the electrons and the positrons,
we define the oneparticle states with momentum p and spin s as
Ie (p,s)
 jt27r)3
le+(p, s
=jt2~)3 J1(p) 10) .
f,(p) 10) ,
(6.13)
Using the commutation relations, our states are then normalized as
(B(p) IB(p'
(2~)3 03(p _
p'),
(e(p, s) le(p', s' =
(2~)3 o",o3(p 
(e+(p,slle+(p',s')
(2~)
p'),
o,,'03(p_p')
(6.14)
96
Chapter 6. From Wick expansion to Feynman diagrams
In all these relations, we have only used the 3momentum of the
particles concerned, without making any reference to the energy. The
reason is that these are physical states which must obey the relation
p2 = m 2 I so the energy is determined by the 3momentum.
We have solved the problem of the dimension of the states, but
the other problem still remains. For example,
(6.15)
which is meaningless. However, this is not unexpected. This 03(0)p
appears in all problems involving normalization of plane waves. The
reason is that plane waves are spread. over infinite volumes and time
intervals. This is not a problem specifically with qnantum field theory, but with plane waves in any theory. We need to carefully eliminate such infinities from all physical quantities.
The way to solve this problem is to restrict the plane waves to a
box and then to let the box become infinite in size. Then,
(6.16)
where the integration is done over the volume V as indicated by
the subscript after the integration sign. Before taking the limit of V
going to infini ty, we obtain
I
o (O)p =
(271")3 .
(6.17)
With this definition, we see that the oneparticle states of Eqs. (6.12)
and (6.13) are normalized to unity, provided we take the limit of
V > 00 at the end of the calculation of any physical quantity.
Now we can write down the action of various field operators on
different oneparticle states:
<p+(x) IB(k))
(6.18)
97
6.3. Sample calculation of a matrix element
where Wk and E p represent the energies of the scalar and the electron
for the 3momenta in the subscripts.
Similarly, for the adjoint operators
(B(k)1 4>(x) =
1 eib (01
2w
J kV
'
1
_ ( ) ipx (01
J2EpVus p e
,
( ') ip'x (01
J2Ep'Vvs' p e
(6.19)
These results have to be used quite often when we try to evaluate
the Smatrix elements, as we now show,
Exercise 6.3 Proue the results oj Eq. (6.18), using the Fourier
decomposition of the .fl.elds and the commutation (or nnticommutation) relations obe'Yed by the crea.tion a.nd annihila.tion opel"o.t01'8.
6.3
Sample calculation of a matrix element
Consider first the process of Eq. (6.2), where a Bparticle decays
to an electronp')sitron pair. We have already argued that in the
lowest order in the expansion of the Smatrix the only term which
contributes to this matrix element is the term shown in Eq. (6.6).
The matrix element of this terril between the initial and the final
state is then
(6.20)
Using the relations in Eqs. (6.18) and (6.19), we obtain
sj:)
= (ih)us(p)vs'(p')
d1
i(p+p'k)x
xe
1
1
1
(6.21)
J 2w k V J2EpV J2E p'V'
assuming that the spin of the final electron is s and that of the
positron is s'! and also using the fact that the vacuum ~tate is normalized to unity.
Consider now the integration over x. In the infinite volume limit
we can write
l
d'x ei(p+p'k)x = (27T)'O'(k  p  p').
(6.22)
98
Chapter 6. From Wick expansion to Feynman diagrams
Inserting this in the expression obtained before, we obtain
(6.23)
The afunction thus obtained merely gives us the constraint from 4momentum conservation. In other words it tells us that the initial
4momentum must be equal to the final 4momentum.
To get a feeling for a more nontrivial case, let us now move
on to the diagram given in Fig. 6.2a. The term in the Wick expansion that corresponds to this diagram was written down in Eq.
(6.10). The contractions given in that formula are nothing but the
vacuum expectation values of the timeordered products of the operators involved, and therefore they should be replaced by i times the
propagator of the particle involved. In other words, we should make
the replacements
l
4>(X2)(X3) = ill.F(X2  X3)
(6.24)
,p(xIly(x2) = iSF(Xj  X2)
etc, where the propagators have been defined for the scalar fields in
3.7 and for spinor fields in 4.7. The contribution of the Smatrix
element coming from this diagram can be written as
SJ':") = (_ih)3
X
J J J
d'xI
ill.F(X2  X3)
d' x 2
d' x 3
iSF~a(X2
xIliSFa~(xI
 X3)
x (e(p)e+(pl) 1,p~(x2),p:r.(x3)+(xIlIB(k)) . (6.25)
On the left, we have put the superscript '3' to indicate that it is a
term from the 3rd order expansion of the Smatrix, and the superscript 'a' to denote the particular diagram of Fig. 6.2.
We can use the Fourier transform of the propagators, and also
6.3. Sample calculation of a matrix element
99
use the matrix elements derived in Eqs. (6.19) and (6.18) to write
(6.26)
As in the case with S(I), we can perform the integrations on the
spacetime points. For example, the integral over all the factors
which depend on Xl gives
Similarly, integration over X2 gives (211")44 (ql + q2  p), and integra.tion over X3 gives (211")4 04(ql + q3 + p').
If the arguments of all these functions vanish, it automatically
implies k = p + rI, which is the condition for overall momentum
conservation. Thus we can rewrite the product of the dfunctions as
4(k  q2
+ q3)4(ql + q2  p)4(qj + q3 + p')
= 4(k  q2 + q3)4(qj + q2  p)4(k 
P  p'),
(6.28)
since both expressions imply the vanishing of the same quantities.
Inserting the expression in Eq. (6.26) and arranging the terms in a
manner that the spinor indices need not be written down, we obtain
(3.)
Sf'
= (,h)
J J J
a'ql
(211")4
a'q2
(211")4
(211")124(k  q2
4
d q3
(211")4
+ q3)4(ql + q2 
p)4(k  P  p')
x i~F(qIl [u,(p)iSF(q2)iSF(q3)V,,(p')]
[v'~k V J2~p V J2~p.v]
(6.29)
At this point, we can perform the integrations over q2 and q3 easily,
and write q instead of the remaining unknown momentum ql for the
100
Chapter 6. From Wick expansion to Feynman diagrams
sake of brevity. This gives
S~~a)
J(::~4
1] .
[J2w1V J:2E1V J2EI"V
= (_ih)3 (2rr)4J4(k  p  pi)
x iC,F(q) [u.(p)iSF(P  q) iSp(p  q  k)v"(pi)]
X
(6.30)
This is the final expression. Notice that it involves the integration
over an unknown momentum q. This is because the momenta of all
the internal lines are not determined by the external momenta. In
this diagram, there is one loop of internal lines, which admits one
unknown momentum which must be integrated over. In general, if
there are n loops, there should be n such unknown momenta which
have to be integrated over. The diagrams without any loops are
called tree diagrams, whose example is the first order contribution
to the Smatrix discussed earlier. Such diagrams do not have any
integration over unknown momenta, as seen in Eq. (6.23) above.
To appreciate another subtlety, let us try to find out the contribntion to diagram Fig. 6.2b to the Smatrix element. The contractions
were shown in Eq. (6.8). Written out with explicit indices for spacetime points and spinor components, this term reads
Let us now concentrate on the fermion contractions. We can exchange the order of the fermion fields in one of the contractions to
get
= iSFlio(:r2  XI) iSFo{i(X,  X2)
=
Tr [iSd:r2  X,)iSF(XI  X2)].
(6.32)
This shows that if the diagram has a fermion line which closes on
itself, the corresponding momentumspace amplitude should have a
minus sign for the loop 'and a trace over the Dirac indices running in
the loop.
101
6.4. Another example: fermion scattering
o Exercise 6.4 Find the contributions of diagrams Fig. 6.2c a.nd d
to the SmatriJ: element. Leave llour expressions at the stnge where
they in\Jol'Uc integration oller an unknown momentum. (In other
words, do not integrate, or write the exphcit forma for the propa.ga.lors. )
6.4
Another example: fermion scattering
We now consider some nontrivialities which might arise if the initial
or the final state (or both) contain some identical particles. As a
concrete example, let us think of the proces.':i
(6.33)
governed by the Yukawa interaction that we have been discussing.
This process will have no contribution at the first order in the
Smatrix expansion, but will have contributions at the second order.
To find the amplitude of these second order contributions, let us start
by writing the relevant term of the Smatrix, which is:
S (2) = (ih)2
2!
J J
d4 Xl
d4 X2:!7
[: ('l/nl)),.
]
: :('ljJ'ljJ)X2:
. (6.34)
After applying Wick's theorem, we will obtain a number of terms.
For the process at hand, we want terms where four fermionic operators are not contracted, corresponding to the particles in the initial
and the final states. There is only one such term, which can be
written as
(6.35)
where Q , {3 are spinor indices as usual. We have only electrons in the
initial and the final states. Therefore, if we decompose 'ljJ into 'ljJ+
and 'ljJ_, as in Eq. (6.5), the part which will contribute to tbe present
process IS
(6.36)
102
Chapter 6. From Wick expansion to Feynman diagrams
where now we have put the field operators in the normal order and
omitted the normal ordering symbol, picking up a minus sign in the
process. Putting this back in Eq. (6.34), we see that the Smatrix
element between the initial and the final state is given by
(2)
(ih)2
Sf; = 2!
. .
j d'Xl j d' ,t>.F(Xl
X2
X2)
(e (pDe (p;) 1~(XJl"'~(X2)"'~(xJl"'~(X2)Ie (pJle (P2)) .
(6.37)
Exercise 6.5 Argue tha.t odd order tenns in the Sma.triz ezp<1.nsion cannot contribute to the process of Eq. (6.33).
Consider now the initial state. Using Eq. (6.13), we can write it
as
Therefore,
We now use the identity
[AB,CDJ_ = A[B,CJ+D  [A,CI+BD
+CA[B, DJ+  CIA, DJ+B,
(6.40)
which holds for any four operators A, B, C and D. In the present
case, using the anticommutation relations of Eq. (4.103), we obtain
[f(k)f(k'), fl (p2)fl (pJl]_
= <53(k'  P2)f(k)ft(pJl <53(k  P2)f(k')ft(pJl
+<53(k'  Pl)fl(p2)f(k)  63(k  Pl)f l (P2)f(k'). (6.41)
The important point is that the combination of creation and annihilation operators operating on the vacuum state in Eq. (6.39) can
be replaced by the left hand side of Eq. (6.41), since the extra term
6.4. Another example: fermion scattering
103
in the commutator annihilates the vacuum anyway. Now, looking at
the right hand side of Eq. (6.41), we find that the last two terms
annihilate the vacuum. As for the other two, we can use the anticommutation relations once again and conclude that
!(k)!(k')!'(P2)!'(pJlI0)
3
= (6 (k'  P2)6 3 (k  pJl 6 (k  P2)6 (k'  pJl) 10). (6.42)
Putting this bad< in Eq. (6.39) and performing the integrations over
k and k', we obtain
where EI = Jp~
+ m 2 etc.
We now look at the final state, which is
(6.44)
Following similar steps, we find
(e (p~ )e (p;)1 ~(xJl"'~(X2)
1
J2EJV J2E2V
(01 [u"(p~)u/l(p;)eiP""eiP'.Z'
_u"(p;)u/l(p~)e''''Zle'P;.Z'] . (6.45)
Going back to Eq. (6.37)' we now rewrite the Smatrix element in
the form
S(2)
/'
= _ (ih)2
2!
Ja' x, Ja'x J (2.".)4q i~
d
x [UO(p~ )u,13 (p;)eiP~ 'Xl eip~'X2
(q)eiq.(zlz'l
ua(p~)u.B (pll )eiP2'XI eip't'X2 ]
x [u" (pJlu/l (P2)e iPI " eip,z,  u" (p2)u/l (pJle i".,.ZI e ipI z, ]
x
[~ ~ ~ ~]
(6.46)
Chapter 6. From Wick expansion to Feynman diagrams
104
It seems that there are four different terms coming from multiplying the spinor factors. But since Xl and X2 are integrated over,
we can interchange their labels and rewrite the expression as
S};l
= (_ih)2
J Ja'x~ J(~:~4
a'XI
x [UO(p;)u.o (pit )eiP2it eip'I,x 2
iLlp(q)e,q(XlX,)
t.r=t (pit )u.8 (p;)eiP~ XI eiP2'%2 ]
x u Q (PI )u13 (P2)e ipt'XI e iP2:'X2
[~ J2~2V ~ ~].
(6.47)
Here we have used L\p(q) = LlF( q).
We can now perform the integrations over Xl and X2, which will
give two momentum conserving afunctions in each term. We can
rearrange the arguments of these <Ifunctions as we did in Eq. (6.28)
such that one of the arguments becomes independent of the internal
momentum q. With the other one, we can perform the qintegration
trivially. After these steps, we obtain
S};)
(ih)2(21r)4<1 4(PI
X
+ P2  P~
[U"(P2)jjil(p~)iLlF(PI
 P2)
 P2) 
U"(p~)jjil(P2)iLlF(PI
p~)]
xu a (pd uil (p2)
x[~~~~].
(6.48)
We can make two observations. First, although we started from only
one term in the Wick expansion, we ended up with two terms in the
e (pJl
Xl
! B(PI
c(P2)
X2
(a)
e (p,)
e (pJl
! B(PI
 p,)
e (PI)
Xl
e (P2)
X2
e(pi)
 pi)
e (1"2)
(b)
Figure 6.4: Lowest order diagrams for electronelectron scattering in the
Yukawa theory.
6.5. Feynman amplitude
105
Smatrix element 1 because the initial and the final states now have
two identical particles. These two terms correspond to the Feynman
diagrams of Fig. 6.4.
The second point is that the second term in Eq. (6.48) can be
obtained by interchanging p\ and P2 in the first term. But this term
comes with the opposite sign. This is expected since if two identical fermions are interchanged, the quantum mechanical amplitude
changes sign.
6.5
Feynman amplitude
In the expression for various Smatrix elements calculated in 6.3
and 6.4, we can identify three types of factors. One of these is a &function for the conservation of 4momentum in the overall process,
which is multiplied by (271")4. Then there is a factor of (2EV)1/2 for
each particle in the initial or final state with energy E. These factors
do not depend on the nature of the interaction. The remaining part
in the expression for SIi I which depends on the exact nature of the
interaction, is called the Feynman amplitude. We will denote it by
iAtfi. In other words, we are writing the Smatrix elements in the
form:
Sfi = &fi
.
44'"
+ '(271")
& (L;Pi
'"
7
II J2EY
1
IIf .j2E1 VJI(f i ,
Pf ) i
(6.49)
which serves as a definition of the Feynman amplitude. Again. Pi and
Pi are the 4momenta of initial and final particles, not components
of some p.
Notice the first term on the right hand side of Eq. (6.49). This
term was not present in our calculations earlier. But that is because
we did not do the calculations for a case where the initial and the
final state was the same. and so this term was zero for the cases
we considered. But it is easy to see that this term has to be there.
Consider a world where there are no interactions. In this world,
the Feynman amplitude would be zero. But there should still be 5matrix elements. Since there are no interactions, the only nonzero
Smatrix elements would be those corresponding to the same states
initially and finally. This is the &fiterm of Eq. (6.49). Interesting
106
Chapter 6. From Wick expansion to Feynman diagrams
physics is of course embedded in the other term.
Let us now look at Feynman amplitudes for some of the diagrams
discussed earlier. For the Smatrix element given in Eq. (6.23), the
Feynman amplitude is given by
iAlX l
= (ih)u,(p)V,.(p'),
(6.50)
where pi = k  p due to the afunction.
For the diagram in Fig. 6.2a, the contribution to the Smatrix
element was given in Eq. (6.30). From this expression, we can read
off the Feynman amplitude for this diagram to be:
....A3a)
l./fl;
Ii
(_ih)3
X
d4
(2".~4
it>F(q)
[u,(p)iSF(p  q) iSp(p  q  k)V,.(p')] . (6.51)
Similarly, for the electronelectron scattering process, the Feynman
amplitude at the second order expansion in the Smatrix is given by
iAlj~l = (ihf [UO(P2)UIl (P\)iL'>F(Pl  P2)
UO (pi, )ull (P2)iL'>P(Pl  p',) ] un (p,)u ll (P2), (6.52)
which can be read off from Eq. (6.48).
6.6
Feynman rules
The nontrivial part of writing down the Smatrix element is the
determination of the Feynman amplitude. In the examples above,
we have determined it by applying Wick's theorem.
However, we can draw the Feynman diagrams for a process di
rectly by looking at the interaction Lagrangian. To do this, we first
draw the initial and final particles and join them by the vertices
which are allowed by the interaction Lagrangian. The number of
vertices is the same as the order of perturbation theory. All vertices
are labeled such that momentum and all charge conserva.tion laws
are obeyed at each vertex. Therefore, our task would be a lot easier
if we had some rules to calculate Alfi directly from the Feynman di
agram of a process. It turns out that a consistent set of rules, called
Feynman rules, can be constructed from our results above. Let us
try to summarize these rules here, using our experience in calculating
the amplitudes.
6.6. Feynman rules
107
Internal lines
Looking at the examples of the Feynman amplitudes above, it is clear
how the internal lines contribute to the amplitude. If we have an in
terna! scalar line of momentum p, there will be a factor of itJ.F(P)
in the Feynman amplitude. Similarly, if we have an internal fermion
line, the appropriate factor is iSF(p). These statements are pictorially summarized as below:
P
............... = itJ.F(P),
P
. , ..._ _
, = iSP(p).
(6.53)
Of course, the fermion propagator is a matrix, so we should really
put the matrix elements, as in Eq. (6.26). We will comment later on
how to order the matrix indices.
External lines
For the external lines, we can use the results in Eqs. (6.18) and
(6.19). In these equations, the exponential factors on the right hand
side go into constituting the momentum conserving bfunction. The
factors under the square root are state factors which are outside the
Feynman amplitude. The rest or'the factors on the right hand side,
excluding the vacuum state, go into the Feynman rule for the external
particles. For scalar particles, nothing else remains after we throw
out all the factors mentioned above. So we need not put anything in
the Feynman rule for external scalar particles. For spin1 particles,
we have the following factors depending on whether it is in the initial
state or in the final state:
, e(p,s)
u,(p) ,
e(p,",)
(6.54)
= u.,(p) .
Here, a small filled circle put at one end of any line implies the rest of
the Feynman diagram, including all other internal and external lines.
Thus the upper line shows that a particle is leaving the diagram, i.e.,
it is produced in the final state. In the lower diagram, the particle is
annihilated.
108
Cbapter 6. From Wick expansion to Feynman diagrams
For antiparticles in the external lines, similarly, the rules are:
e+(p,s)
= v.(p),
(6.55)
e+( p, s)
= v.(p).
In interpreting these rules, remember the arrow convention for antiparticles. The upper diagram now represents an antiparticle in the
initial state, whereas the lower one represents an antiparticle in the
final state.
Numerical factors
We have already seen that the integration over the c<>ordinate of
each vertex gives a factor of (2,,)4 times the momentumconserving
6function. If the number of vertices in a diagram be v, we will thus
obtain a factor of (2,,)4v, and v different 6functions.
But there are other ways in which factors of 2" would come in the
amplitude. When we express the coordinate space propagators in
terms of their Fourier transforms, it involves an integration over the
momentum space, i.e., a factor of J d4q/(2,,)4. Thus, if there are n
internal lines in a diagram, we will obtain a factor of (2,,) 4n. So, the
overall power of 2" is 4(vn). While defining the Feynman amplitude
from the Smatrix element, we keep a factor of (2,,)4 64 (L;p, L;Pf)
outside. This takes care of four powers of 2". All the other powers
are left in the Feynman amplitude, which therefore has a factor of
(2,,)4(vnI). However, it can be shown that
v  n  1 = f
(6.56)
where e is the number of loops in the diagram. Thus, essentially we
obtain a factor (2,,) 4t.
o Exercise 6.6 * Show that Eq. (6.56) holds in
an~
connected dia
gram irrespecti'Ue of how many Lines enter UTl:Y 'Vertex.
Momentum integrations
Let us now count the number of momentum integrations. We have
one momentum for each propagator from the Fourier transform, and
there are n of them. But as we said before, we obtain v different 6functions from the coordinate integrations at the v different vertices.
6.6. Feynman rules
109
Of these, one combination can be interpreted as the momentum con
servation in the overall process where no reference is made to the
internal lines, as shown for example in Eq. (6.28). There will thus be
vI different 6functions which will involve the internal momenta.
We can integrate over these <Ifunctions, which will determine some
of the internal momenta in terms of the external ones. Then we will
have n  v + 1 momenta left, and using Eq. (6.56) once again, this
number is seen to be the same as the number of loops, l.
We can then start by writing the momenta in Feynman diagram
in a way that takes care of as many <Ifunctions as possible. In other
words, we start with the momenta of the external lines, and then
start putting the momenta of the internal lines. At any stage, if the
momentum of an internal line is determined by the momenta of the
external lines and of the internal lines whose momenta have already
been assigned, we will write it in terms of them. This way, we will
have only l momentum integrations in the Feynman amplitude. And
since the power of 211" calculated earlier is 4, we can combine the
two and say that for each loop, we need to put
d4q
(211")4
(6.57)
for a momentum circulating in the loop. If we do this, we need not
write down any extra numerical factor.
Dirac indices on fermion lines
In course of describing the Feynman rules so far, we have mentioned
that the rules for fermions should come with appropriate Dirac indices. In case of any confusion, it is always safe to work with these
indices. However,our earlier experience shows that if we follow a
fermion line in the direction opposite to the arrow and put the Feynman rules for the elements encountered this way, the Dirac indices
appear exactly in order, and we can do away with all the Dirac indices and treat the entire expression as a matrix formula. For fermion
lines which do not close on themselves, such formulas will start with
a 11: or a til and end with a u or v spinor. In between, there can be
matrices corresponding to propagators and vertices.
For fermion loops, we can start from anywhere and follow the line
in the direction opposite to the arrow. If we keep the Dirac indices,
110
Chapter 6. From Wick expansion to Feynman diagrams
at the end of the loop we will come back to the same index that we
started with. Summing over all the Dirac indices will then imply a
trace over the string of factors obtained around the loop.
Antisymmetry of fermions
The fermions anticommute among themselves. This fact manifests
itself in various ways. For each fermion loop, we need to put a factor
of 1, as explained at the end of 6.3.
The other consequence of this antisymmetry is a relative minus
sign between diagrams differing from one another by an exchange
of two identical external fermion lines. This was discussed with an
example in 6.4.
Vertices
The Feynman rules for the vertices cannot be specified in such general terms, since they depend on the interaction Lagrangian which
gives rise to them. Roughly speaking, if we take the term in the
interaction Lagrangian, take out all the field operators, and multiply
the remaining expression by a factor of i, we obtain the Feynman
rule for the vertex. If there arc n identical operators being taken
out, we should multiply the result by a factor of nL For example,.
for the interaction Lagrangian given in Eq. (5.9), the Feynman rule
for the vertex is ih. We have noticed that this is the factor coming
in the Smatrix expansion for each occurrence of the vertex. On the
other hand, if we had a >..4 interaction term, the Feynman rule for
the corresponding vertex will be 4!iA. For other interactions, the
Feynman rule can be obtained similarly. We will, of course, discuss
many examples later.
6.7
Virtual particles
We observed that there may be some lines in a Feynman diagram
which belong to neither the initial nor the final state. These are
internal lines. There is some property of these lines which leads to
an important concept in Quantum Field Theory.
To understand this with a specific example, consider now the
top vertex of Fig. 6.4a. Momentum flowing through the scalar line is
6.7. Virtual particles
111
Using p~ = p? = m 2 where m is the mass of the electron,
we obtain q2 = 2pI .q. Suppose we use the energymomentum relation
to conclude that q2 = M 2 where M is the mass of the internal boson.
In a frame where the initial electron with momentum PI is at rest,
we would then obtain qo = M 2 /2m, where qO is the energy of the
intermediate particle.
Let us stick with the case of M > 2m which was used in 6.1.
In that case, qo > M. On the other hand, since the energy of the
initial electron in its rest frame was only m, momentum conservation
cannot allow an energy larger than m for the outgoing B line.
How can both these results be correct? In quantum physics, we
have to analyze carefully what can be measured. For any measurement of energy made in a finite time, there is an uncertainty. In
this case, if the uncertainty is larger than the difference of the two
answers obtained in two ways, we cannot really tell which answer is
q
= PI  Po.
correct.
The internal boson was created at some time during the entire
process and annihilated at some other time. If the difference between
these two times is short enough, the energy uncertainty can be larger
than the debated amount. Thus, this internal particle is bound to
be shortlived, the extent of which is to be determined by the timeenergy uncertainty relation.
We have, of course, obtained 4momentum conservation at each
vertex for any Feynman diagram, as illustrated in Ch. 6. So, for the
calculation of the Feynman diagrams, we should use 4momentum
conservation, which will mean that the internal line need not satisfy
the energymomentum relation q2 = M 2. This is true in general for
internal lines in any Feynman diagram.
There are various words to say this. The relation PPP~ = m 2 is
sometimes called the onshell condition for a particle of mass m, since
it is satisfied on a sphere if we could draw the 4dimensional graph
corresponding to the components of the momentum vector. Particles
on external lines, which satisfy this condition, are accordingly called
onshell particles. The internal particles are called offshell particles. Equivalently, one uses the terms physical particles and virtual
particles respectively for the same purpose. Thus, we are led to the
conclusion that internal lines in a Feynman diagram can correspond
to virtual particles.
112
Chapter 6. From Wick expansion to Feynman diagrams
This is in fact the reason that we can use propagators. Notice that
the denominators of all the propagators vanish if the corresponding
particle is onshell. However, propagators are to be used for internal
lines only, and if the particles in those lines are virtual, it makes
the propagator meaningful. If it happens that the internal line of a
Feynman diagram corresponds to a physical particle, we will have to
discuss it on a different footing.
One example of this kind of diagrams is shown in Fig. 6.2b. If we
cut the line going from X2 to X3 in this diagram, the diagram falls
apart in two pieces. Such diagrams are called Iparticle reducible (or
IPR for short) diagrams. Other examples of IPR diagrams appear
in Fig. 6.2c<l. In all of these examples, the line being cut carries
the same momentum as an external physical particle. For this, we
cannot use the propagator directly. The technique for calculating
such diagrams will be taken up later, in Ch. 12.
Of course not all IPR diagrams have this property. For example,
the diagrams in Fig. 6.4 are also IPR, but the internal lines represent virtual particles. So we can use the propagator to calculate the
amplitude of such diagrams. In addition, we can of course discuss
diagrams which do not have any such internal line which, when cut,
can break the diagram into two pieces. Such diagrams are called
Iparticle irreducible (or IPI) diagrams.
6.8
Amplitudes which are not Smatrix elements
So far, we have been talking about Smatrix elements only. These are
diagrams in which all external particles are physical and observable.
In practical calculations however sometimes one has to evaluate amplitudes of diagrams where this is not the case. For example, suppose
we have to evaluate the next order correction to the scattering diagrams given in Fig. 6.4. There will be no diagrams at the third order
in the Smatrix expansion, but there will be a number of diagrams
at the fourth order. Some examples are shown in Fig. 6.5.
There are, of course, other diagrams at this order. But it is clear
that the calculation of the four displayed diagrams will be much
easier if we decide to do it in two steps. In the first step, one can
calculate the diagram shown in Fig. 6.6, with onshell fermions of
1
6.8. Amplitudes which are not Smatrix elements
e (pil
e (P2)
B(PI
e (pil
c(p;)
e (P2)
c(Po)
..
e (P2)
P2)
e(p;)
(bl)
B(PI 
B(PI .
I
(al)
e (pil
.. ..
pJl
C(P2)
113
c(p;)
C (PI)
B(p!
p;)
e(p,)
e (P2)
. .
e(P2)
P2)
e (pJl
(b2)
(a2)
Figure 6.5: Example of diagrams whose calculation is conveniently done by
evaluating the amplitude which is not an Smatrix element.
arbitrary momenta. At this stage, we do not assume anything about
whether the boson line showing as an external leg of this diagram is
a physical particle l and put no factor in the amplitude for this line.
Once this amplitude is ohtained, one can use it to find the Feynman
amplitude for all the four possible diagrams of the type shown in
Fig. 6.5.
Let us explain these comments in some detail. We can represent
the amplitude of the diagram in Fig. 6.6 by
 u(p'W (p, p')u(p) ,
(6.58)
where V(p,p') stands for the effective vertex. Using the Feynman
e(p)
pk:p'k
=..,
,
. 
~.~:
=..~:
,,
,,
Figure 6.6:
'.
.
.. .
e(p')
. ...
Amplitude of this diagram, calculated without assuming
whether the vertical scalar line is physical or virtual, aids the calculation of
the diagrams in Fig. 6.5.
Chapter 6. From Wick expansion to Feynman diagrams
114
rules described in 6.6, we can write the expression for this effective
vertex as follows:
iV(p,p') = (_ih)3
d4k
(271")4 iSp(p'  k) iSp(p  k) i~F(k) , (6.59)
where the factor (ih)3 comes from the three vertices of this diagram.
Once this is evaluated, we can write the amplitudes of the four
diagrams appearing in Fig. 6.5 in the following way:
i.H'a' = [u(p',){ iV (PI, p; )}u(pilJ i~F(P'  p;) [u(p;){ ih }U(p2)]
i.H'a2 = [u(p',){ih}u(pilJ i~p(p,  p',) [u(p;){iV(p"p;)}U(p2)]
i.H'b' = [u(p;){ iV(p"p;)}u(pilJ i~F(P'  p;) [u(p;){ ih}u(P2l]
i.H'b2 = [u(p;){ ih}u(pilJ i~F(P'  p;) [u(p'tl{ iV(P2,p;)}u(P2)],
(6.60)
where the subscripts of .H' indicate the diagram.
o Exercise 6.7 Find aU other diagrams thut wilt contribute to e +
e+ 
e
+ e+
o.t fourth order in the Smutrix expansion.
Chapter 7
Cross sections and decay rates
The processes of highest interest are those for which the initial state
has either one or two particles. In the case of oneparticle initial
state, we typically study its decay to a number of particles in the
final state. In this case, the quantity of experimental interest is the
decay rote of the initial particle. For twoparticle initial states, we
study their scattering. The final state in this case might contain the
same two particles, in which case the scattering is called elastic. Alternatively, one can also study cases where the particle content of the
initial and the final state are different, in which case the scattering is
called inelastic. In either case, the important physical quantity is the
scattering cross section. In this c,hapter we define these quantities,
along with some examples of calculations.
I
7.1
Decay rate
Suppose some particle decays into a number of particles in the final
state. The initial and the final states are obviously not the same, so
we can forget about the <lfiterm. Then the Smatrix element for the
process can be written as
.
44
'"
Sfi = ,(2,,) <I (pi  LJPJ) y'2EV
y'2E V.4tfi'
f
'f
f
(7.1 )
where E, is the energy of the initial particle and Ef's are the energies
of various particles in the final state. The transition probability from
the initial to the final state is then given by ISf,12.
115
Chapter 7. Cross sections and decay rates
116
However, when we try to square the matrix element we face a
small problem, viz., that we need to square a 6function. Can we
assign a meaning to that? For any function f{p} of momentum, we
can write
{7.2}
under an integration sign. If the function f(p} happens to be another
6function, we will write
{7.3}
It is now necessary to assign a meaning to 6' (O}p. This can be done
if we perform our calculations within a large volume V and a large
time T. Repeating the argument which led to Eq. (6.17), we can now
say
,
VT
6 {O}p = (211")' .
{7.4}
Using this, we can now square the Smatrix element given in Eq.
{7.1} and obtain
15 1,1 2 =
1
{211"}"
6 (pi' "
L.,PI}VT 2EV
2E1 V
I
'I
I
I.4Iil 2 ,
{7.5}
which is the transition probability. The transition probability per
unit time is obtained by dividing this expression by T, and it gives
ISId 2 IT =
l n 2E V
(211")"
6 {pi'"
L.,PI}2E
1'1
I
I.4Id 2 .
{7.6}
This is the probability per unit time of obtaining a specific final
state with specific values of momenta. When we go to the infinite
volume limit, the momentum values arc continuous, and so we do not
look for specific values of the final momenta. Rather, we ask whether
the final momenta are in some specific range. If for example we are
interested about whether the final momentum of some particle is in
a region d3 p in the momentum space, we must multiply the above
expression by the number of states in that region. This number is
calculated by discretizing the phase space into cells of volume {211"h}3
7.2. Examples of decay rate calculation
117
and putting one state into each cell. Therefore the number of singleparticle states in a momentumspace volume of d 3 p is given by
Vd 3 p
(21T)3
(7.7)
in natural units where n = 1. Multiplying by this factor for all final
state particles and integrating over the final momenta, we thus obtain
the decay rate r to be
r=
I
2.
JII
PI
. (2 d)32E
(21T) 4 64 (Pi I
1T
I
'L..,
"
2
PI) 1Af'/d
(7.8)
The phase space factor has to be multiplied by I/n! if there are n
identical particles in the final state. The lifetime of a particle is
the inverse of this decay rate. In the next section, we show some
examples of calculations of lifetimes with different interactions.
7.2
7.2.1
Examples of decay rate calculation
Decay of a scalar into a fermionanti fermion pair
In 6.1, we discussed the possibility of a scalar particle B decaying
to e+e through the Yukawa interaction term. In 6.5, we also calculated the Feynman amplitude for this process to the lowest order
in the coupling constant h, which was presented in Eq. (6.50). We
now use that result, and the definition of the decay rate in Eq. (7.8),
to find the decay rate of the scalar B in its rest frame.
We start from the general formula of Eq. (7.8). Since in this case
there are two particles in the final state, it would read
r=
I
2M
dp
(21T)32E
d p'
4 4
,
2
(21T)32E' (21T) 6 (k  P  p) 1Af'lil . (7.9)
In the factor outside the integral sign, we have replaced the energy
Ei by the mass M of the decaying scalar since we are considering the
decay in the rest frame of the scalar particle.
To the lowest order in the coupling constant h, the Feynman
amplitude was given in Eq. (6.50). Putting that result in, we obtain
h
r = 2M
d p
d p'
(21T)32E
(21T)32E'
2
(21T)46 4(k  p  p') lu,(p)v,,(p') 1
(7.10)
Chapter 7. Cross sections and decay rates
118
This in fact will be the decay rate if we look for specific spin values
s for the electron and s' for the positron. However, let us say we do
not really care for the spin projections of the final particles. In that
case, we should really sum over all the possible spins. Since final
states with different spin projection values are incoherent, we should
add the probabilities rather than the amplitudes. This gives
d3p
(2rr)32E
3
d p'
(2rr)32E'
x(2rr)4 04(k  p  p')
L lu,(p)v,,(p')
2
1 .
(7.11)
s,s'
Let us now proceed to evaluate this expressioD, marking various
stages for the sake of clarity.
Spin sum
Let us consider first the spin sum appearing in Eq. (7.11), which we
denote by Esp,". Thus,
Esp," =
=
L
L
lu,(p)v,,(p')1
[u,(p)v,,(p')] [u,(p)v,,(p')]' .
(7.12)
S,s'
Since the quantities within square brackets are numbers, we can replace the complex conjugation operation by hermitian conjugation.
Thus,
[u,(p)v,,(p')]' = [u,(p)v,,(p')J t
[u!(Ph'ov,,(p')r
[v1, (p'h'ou, (p) ]
 [v,,(p')u,(p)] .
(7.13)
Exercise 7.1 Proue the. more generot case of complex conjugation.
when the. two spinoTS sa.ndwich a.ny 4 x 4 matTi.x F:
[u.(p)Fv.,(p')j" = [v.,(P')Flu.(p)] ,
(7.14)
'Where
(7.15)
7.2. Examples of decay rate calculation
119
Let us now go back to Eq. (7.12) and rewrite it using Eq. (7.13)
and putting the spinor indices explicitly:
E,p;n
= L [(u'(Pn (v"
(pl))n] [CV,,(p'fJ(U,(piJ]
~,.!'
[L
u,(p)u, (P)]
po.
[L
v,, (p')V"(pi)]
s'
(7.16)
n{3
Notice that we have now separated the sums over s and S', and these
sums over single spin variables were given in Eq. (4.53). Using the
results, we can now write
E,p'n =
f1! + m]fJn [P' 
m]nfJ
= Tr [(p+ m) (pi 
m)l
(7.17)
o Exercise 7.2 FOT the more gene1'o.l case wh.en the matrix element
is oj the jonn u,(p)Fv,,(p') UJith any 4 x 4 matrix F, show thoI the
s'Pin sum oj the absolute squaTe oj the matrix element can be 'Written
as
L lu,(p)Fv,,(p')I' = 1'1" [(p + m) F (p'  m)Fl] ,
(7.18)
1,1'
where Fl was defined in Eq. (1.15).
Traces of 1'matrices
We now have to evaluate the trace appearing in Eq. (7.17). We can
decompose it into four terms:
Tr [(p + m) (pi  m)] = Tr (PP'
 mp + mp'  m2 ).
(7.19)
Here, m is a number which can be taken outside of the trace operation. The last term is then really m 2 times the trace of the unit
matrix, which gives 4m 2 The second term can be written as
(7.20)
since the trace of any l'matrix vanishes, as was proved in Eq. (4.10).
For the same reason, the third term also vanishes. In fact, one can
prove a more general result, viz., that the trace of any odd number
of l'matrices vanishes. This and some other useful trace formulas
are given in Appendix A.2.
Chapter 7. Cross sections and decay rates
120
We are now left with the first trace appearing in Eq. (7.19). This
can be rewritten as
pl'p'VTr h~'Yv).
(721)
However I since the trace operation is cyclic, we can also write it as
(7.22)
or equivalently, as
(7.23)
using the anticommutation relation of the ,matrices in the last step.
Now I gJ.w is just a number so far as the spinorial indices are concerned,
so the trace is really g~v times the trace of the unit matrix. Thus
Eq. (7.23) gives us 4p p'. Putting it in Eq. (7.19), we obtain
Tr [(p+m) (pm)] = 4(pp' _m 2 ).
(7.24)
Inserting the expression for the spin sum in Eq. (7.11), we can write
h2
f=2M
(7.25)
First we evaluate the final factor, which came from the spin sum,
subject to the constraints of the momentumconserving ISfunction.
The afunction enforces k = p + p', and squaring both sides, we get
2p . p' = M2  2m 2 . Therefore,
4(p. p'  m 2 ) = 2(M 2
4m 2 )
(7.26)
when multiplied by the afunction.
Phase space
Thus the factor coming from the Feynman amplitude becomes independent of the magnitude or direction of the 3momcnta, and can be
pulled out of the integral in Eq. (7.25). So we can write
h 2 (M 2 _ 4m 2 )
f=
M
p,
(7.27)
7.2. Examples of decay rate calculation
121
where
(7.28)
This p is usually called the phase space factor. It contains all kinematical factors, Le., factors not involving the Feynman amplitude
and the initial state factor. It also includes the afunction for the
4momentum conservation. Of course, it is not always possible to
make such a separation into a phase space factor and the Feynman
amplitude factor. It works here and in other problems where the
factors other than p do not depend on the magnitude or direction of
the 3momenta. Later, we will also encounter examples where this
separation cannot be done. However in this case, the separation
works , and it is very convenient to do things piecewise. Let us now
evaluate the phase space factor.
Starting from Eq. (7.28), we can immediately integrate over one
of the final momenta, say p'. This gives
I
P = (211")'
d p
,
4EE' a(ke  Po  Po)'
(7.29)
Since we have decided to perform the calculations in the rest frame
of the decaying particle,
k~
= (M,O,O,O).
(7.30)
In this frarne, let us denote the Ocomponents of p and p' by E and
E' respectively. Since the mass of the fermion is m, we must have
E'  p' = m' as well as E"  pi' = m'. But we have already used
the spatial afunctions, so p' = p in this frame. It follows that
E = E', so that
1
P = (211")'
=
.!.
11"
d p
4E' a(M  2E)
dp p' . .!.a(E  ~M).
4E' 2
2
(7.31)
In the last step, we have integrated over the angular variables of the
3vector p, noting the fact that the integrand does not depend on
them. This angular integration gives a factor 411".
122
Chapter 7. Cross sections and decay rates
Because of the energymomentum relation, we can write
dpp=dEE.
(7.32)
The 8function forces E to take the value M /2, and therefore p =
JEfl  m 2 = ~M Jl (4m 2/M2). From Eq. (7.31), we can now
write
p=
28~
JE
dE p . J(E 
~M)
2
1~
=8~VIM'i.
(7.33)
Putting everything together, we find from Eq. (7.27) that
= Mh
8~
(1 _
4m
M2
)%
(7.34)
The lifetime is the inverse of r, and therefore is inversely proportional
to h2. In other words, if h is smaller, the lifetime will be longer. This
is expected intuitively  if the coupling is smaller, it will take more
time for the process to take place. Secondly, for r to be real, one
must have M > 2m. This is also expected, since if M < 2m, the
decay cannot take place because of energy conservation.
Exercise 7.3 * Ca.lculate the deco.ll Tate for the process B + ee+
in Q frame where the initial B po.rticle moues with a. _speed v"in
the zdiTeCtion, i.e., its 4momentuffi is given blJ kP = ')'M(l,O,O,v)
where "f = (1  V2 )1/2. You mall neglect the mass of the e1ect,.o11.
faT this calculation.. Is yOUT final result justifia.ble by a. timedilation
argument?
o Exercise 7.4 Suppose the intera.ction between the spinle88 fl.eLd
and the fermions is given by Eq. (5.10), instead of Eq. (5.9) which
W0.8 used in the calcuLa.tion abo"e. ShOUl that in this case one obtains the same fOTTnula Jo,. the deco.y Tate, i.e., Eq. (7.34), with. h
substituted b~ 1h'1.
7.2.2
Muon decay with 4fermion interaction
We now calculate the decay rate of an unstable fermion, the muon.
It decays predominantly via the process
(7.35)
7.2. Examples of decay rate calculation
where
v~
is called the muonneutrino,
Ve
123
is the electronneutrino and
the hat indicates the antiparticle. The 4momenta of the particles
involved are given in parentheses.
The interaction Lagrangian governing the decay of the muon is
of the 4 fermion type, which can be writ ten as
Here GF is called the Fermi constant, whose value is 1.166 x
lO s GeV 2 Of course, the hermitian conjugate term must also
be present in order that the Lagrangian is hermitian, but we will not
need that term for describing this decay. The interaction Hamiltonian arising from this term is simply
The lowest order Feynman amplitude for the process is then given
by
.,Itj' =
~ U(e) (p')y'(1 
'Yslv<v.)(k') u(vv)(k)Y,(l 'Ys)U(,,)(p).
(7.38)
Let us find the decay rate of \.mpolarized muons. For this we sum
over initial muon spin and divide by 2. Summing over the final spins,
we get the decay rate of the muon,
f=_1_
2m"
3
3
d3p'
d k
d k'
(21T)32Po
(21T)32ko
(21T)32ko
X(21T)46 4(p  p'  k  k') x ~
l.,Itj,I 2
(7.39)
spm
Here m" is the mass of the muon, which appears in the denominator
since we are calculating the decay rate in the rest frame of the muon.
Since the subscript J1. here stands for the muon, we will not use the
same subscript for Lorentz indices in this section, in order to avoid
any possible confusion.
124
Chapter 7. Cross sections and decay rates
Spin sum
The first task, as before, is to calculate the square of the Feynman
amplitude. From the expression of the amplitude in Eq. (7.38), we
can write
~2 L.
l.A'd = G}
4spin
.
Spin
[U(e)(p'h.l(1  I's)v(v,)(k')]
I's)U(~)(p)]
[U(v")(kh.l(l 
[U(e)(P'hP(l  I's)V(v,) (k')]'
[U(v")(khp(l
I's)U(~)(p)]'.
(7.40)
We shall take the neutrinos to be massl~, because current experimental data shows that even if the neutrinos are massive, they are
much lighter than all charged particles. Then the spin sum rules
described in 7.2.1 yield
f
G2
2
1.A'1 =
Tr
X
[(p' + meh.l(1 
I's)fI'P(1  I's)
Tr [h.l(l  "Is)(p
+ m"hp(l "Is)] .
(7.41)
From now on, we will write 1.A'1 2 for the absolute square of the
Feynman amplitude, suitably summed or averaged over various spins
as the problem demands. Now, "Is anticommutes with any of the ,.I,
and so it commutes with a string of an even number of ')'matrices.,
Using this, we can write
(I  "Is)f"lP(1  "Is) = ~'''IP(I  "Is)2
=
2f"lP(I"Is),
(7.42)
using the property ("(s)2 = I in the last step. Then we can write
1.A'1 2 = G~, Tr [(p' + meh.lf"l"(l  "Is)]
X
Tr Ih.l(p + m~hp(l  "Is)] .
(7.43)
In either trace, the term proportional to the ma::;s vanishes, since it
involves an odd number of ,matrices. For the remaining terms, we
can use the trace lvrmulas given in Appendix A.2 to write
1.A'!2 = 16G} [p'.Ik'P
X
[p.lkp
+ p'Pk'.1 
+ Ppk.l 
g.lpp, . k'  iE.I.pTP' kIT]
g.lPp . k  iE.lVI"I kvp,,] .
(7.44)
7.2. Examples of decay rate calculation
125
Notice that in each trace factor, the first three terms are symmetric
in the Lorentz indices >., P, whereas the last term is antisymmetric.
Thus, the contraction of the first three terms with the last one of the
other trace vanishes, and we are left with
1.1 2
= 16G} [(P"k'P
+ p'Pk"
 g'Pp' . k') (p,k p + Ppk,  g,pp. k)
]
""pT" >'YPfI P1Dk'Tk vp,
(745)
.
The result of the contraction among the symmetric parts of the two
traces is given by
2 (p . p' k . k'
+ P . k'
k . p') .
(7.46)
As for the other parts involving the etensors, we use the identity of
Eq. (A.33),
>'UpT
>'1.'1"1_
 
2( 9(1 v 97" 1}  g.,. v gO" ")
(7.47)
which gives for the last contraction the result
"',pT,,'v'"p" kIT kvp, = 2 (p. k' k . p'  p. p' k . k')
(7.48)
Combining the two terms and noting the negative sign between the
two in Eq. (7.45), we obtain the final result for the square of the
Feynman amplitude in the following form:
1.1 2 = ~ L 1./d 2 =
2 spm
.
64G} p . k' k . p' .
(7.49)
Inserting this expression into Eg. (7.39), we can write
3
3
3
_
G}
d
p'
d
k
d
k' <4( pp,  k  k') p. k' k .p.
'
s
,   ,0
r
IT ffi p
2po
2ko
2ko
(7.50)
J J J
Integration over k and kJ
The integration is substantially more complicated than in the example shown in 7.2.1 because the final state here has three particles.
So, we perform the integration in steps. First, wc write Eq. (7.50) as
r=
G2
sF
11"
mp
J 2~
d3
Po
p'p'PI,p(pp'),
(7.51)
Chapter 7. Cross sections and decay rates
126
where, writing q = p  p', we can dump all the terms depending on
k and k' into the definition of l>.p:
l>.p(q) =
J
J
d3k
2k
o
d k'
2k~ J4(q  k  k')k~kp.
(7.52)
Note that the integration measures over the! final state momenta can
be written as
(7.53)
where m is the mass of the relevant particle. The form on the right
hand side makes it obvious that the integration measure is a Lorentz
invariant quantity. Thus. the entire thing under the integration sign
in Eq. (7.51) is Lorentz invariant.
Therefore !>.p must be a rank2 tensor, and can depend only on
the 4vector q. So the most general form for !>..p can be written as
(7.54)
where A and B are Lorentz invariants which need to be determined.
From dimensional analysis, we see that both A and B have to be
dimensionless.
We have now two different expressions for l)"p, which are given
in Eqs. (7.52) and (7.54). Contracting hoth these expressions by g>'P
and equating the results, we obtain
(4A+B)q2=
J
J
3
d k' 4
 J (qkk')k'k.
d3k
2ko
2k~
(7.55)
Similarly, contracting the two equations by (/," qP l we get
+ il),!,
(A
=
=
'l"k
d 3'"
';'4
J J
J~:: J~:~'
2"0
2k'o
J ('I  k  k " ) q k q. k
J4(q  k  k') (k'. k)2 , (7.56)
where in the last step, we have replaced q by k + k' outside the
isfunction and put k 2 = k,2 = 0 for the neutrinos.
Eqs. (7.55) and (7.56) have Lorentz invariant quantities on both
sides, so we CoUll choose a.ny frame for evalua.ting thc~c integrals. A
l
7.2. Examples of decay rate calculation
127
convenient frame is one in which k + k' = 0, i.e., the neutrinos move
with equal and opposite 3momenta. Since both the neutrinos are
massless, this also implies ko = k~ = k. And therefore, denoting the
results of this frame by a subscript 'CM', we obtain
(k . k') eM
= kok~ 
k . k'
= kok~ + k 2 = 2k5 .
(7.57)
Inserting this into Eq. (7.55) and performing the k' integration, we
obtain
(7.58)
The angular integrations give a factor of 47T, and the integration over
the magnitude of k is easily performed by using the afunction. This
gives "q5/4 for the right hand side. This result is obtained in the
CM frame where q = 0, i.e., q2 = q5. So
4A+ B
= "4'
(7.59)
Similarly, performing the integration of Eq. (7.56) in the same frame,
we obtain
(7.60)
Solving these two equations, we find A and B. Putting it in Eq.
(7.54), we get
(7.61)
Putting this back into Eq. (7.51), we can write
f= 24
C 2F
4
7l"
JP
J
3
mlJ
C 2F
4
24" m
d , ~ Ip 2
2
' PP
gA" + 2QAQp)
Po
3
P,
2
I
,
d
 , (Q p'p +2T}'Qp .q)
2po
"
where q = p  p' by definition.
(q
(762)
Chapter 7. Cross sections and decay rates
128
Differential decay rate
Before performing the remaining integrations, let us observe a few
things. We are calculating the decay rate in the rest frame of the
muon, with
(7.63)
In this frame, let us write the electron momentum as
p'A = (E" p').
(7.64)
This means
(7.65)
in this frame. Moreover,
q2
= (p 
p ')2
2 miJ. E e
= rnlJ.2 + me2 
p' q = p2 _ p' pI = m~  mlJ.Ee
,
P .q = p .q  q
rnlJ. E e
(7.66)
(7.67)
2
 me .
(7.68)
Plugging all this into Eq. (7.62), we get
_ G}
r
d p'
24".4
2E,
[(m~ + m;  2m E,) E,
p
+2(mp  E,) (mpE,  m;)].
(7.69)
In the integrand, nothing depends on the angular variables, so they
can be integrated trivially. Using Eq. (7.32) on the electron momentum, we can then write
1~:3
dE, p' [( m~ +
m;  2m pE,) E,
+2 (mp  E,) (mpE,  m;) ].
(7.70)
We can write this as
r=
dr
dE'dE,'
(7.71 )
7.2. Examples of decay rate calculation
129
where
[ (m~ + m;  2rn
JJ
Ee) E e
+2 (m~  E,) (m"E, 
mn ].
(7.72)
This quantity then represents the transition rate per unit energy
into final states with electron energy between Ee and Ee + dEe. In
other words, jf we observe electrons coming out of the decay of a
large number of muons, the probability that the electron energy is
between E, and E, + dE, is given by
1 dr
dE, dE,.
(7.73)
Often, this kind of distributions are very useful in checking how
theoretical predictions agree with experimental results. Since it involves the derivative of the decay rate with respect to some final
state parameter, such quantities are called differential decay rates.
In this case, the derivative is with respect to the electron energy, but
we could have considered other kinematical variables as well. For
example, we could take the derivative with respect to the energy of
the "w But experimentally, that quantity would not have been of
any interest since the neutrino energies are not directly measured in
an experiment. Another possible variable would be the direction of
the electron momentum, but in this case 1.It!2 does not depend on
this direction. So the derivative would be uninteresting.
Total decay rate
The total decay rate can be obtained by integrating Eq. (7.72) with
respect to E e . In order to perform this integration, we will make
one simplifying assumption, viz., that the mass of the electron is
zero. This is a good assumption, since m~ = 106 MeV, whereas
m, = 0.511 MeV, so that the electron mass is negligible compared to
the muon mass. With this assumption, Eq. (7.72) can be written as
dr
G}m~ 2
dE, = 12,,3 E, (3m"  4E,)
(7.74)
To integrate this, we need the limits of the integration. The lower
limit of the electron energy can certainly be zero if the neutrinos carry
Chapter 7. Cross sections and decay rates
130
the entire energy. The upper limit is obtained when the two neutrinos
are emitted in the same direction, opposite to the direction of the
electron. In this case, p' = k + k', which also means E e = k o + k o
since all the decay products are now considered. massless. However
E e + ko + k o = m~ due to energy conservation. So we obtain that
in this case, the energy of the electron is m~/2, which should be the
upper limit of integration. Therefore, the total decay rate is given
by
1
r=
!om./2 dEe E; (3m~  4Ee)
G2 m
F
12;or
0
2
5
GFm ~
192;or3 .
(7.75)
This lifetime was quoted earlier in Eq. (1.43).
Exercise 7.5 Start from the expression faT the differential deco.'Y
T'nte in Eq. (1.72). Do not completely neglect the electron ma.ss.
Find the corrections to the total decay Tate at the lowcst order oj
me/rnw Using me = 0.511 MeV and rnlJ = 106 MeV. estimate the
fractional error in assuming me = O.
o Exercise 7.6 Two of the possible decay modes, or channels, oj the
charged pion 1r+ are 1T+ + p.+ + vlJ and 1r+ + e+ + V e . In either
case, the interaction Lagrangian is
(7.76)
f 1f is a. C01\stant (called the pion deco.y c01\stant) which is
the Sllme for both processes, ,p, is the jield for the chllrged lepton
(electron or muon) a.nd VJI.J is the field fOT the cOTTesponding neu.trino. Find the ratio oj the decay rates in these two channels. IHint:
Den\lati'l1e interactions are treated like any other interaction in pertu.rbation theoT1J, 8>. gi\ling a. factor oj k>. in the Feynmo.n rule.]
'Where
7.3
Scattering cross section
At the beginning of this chapter, we mentioned that the concept of
scattering cross section is important when the initial state in the
Smatrix contains two particles. We now quantify the concept.
The name "crosssection" derives from a paradigm scattering pro
cess involving hard spheres. Consider, for example, a hard sphere of
radius a, located somewhere within a total area of A. Another sphere
is thrown towards it. To this sphere, the target sphere shows an area
of 7Ta 2 . Thus the probability that the incoming sphere would scatter
7.3. Scattering cross section
131
from this target sphere is given by rra 2 I A. Calling this the probability of a scattering event and denoting it by Ps, we can then write
rra 2 = PsA.
(7.77)
This equation gives the cross sectional area of the sphere in terms of
the beam area.
All objects are of course not hard spheres. But in a scattering
process, we can still define the effective crotis section by the same
formula. The cross section is usually denoted by the symbol Cf. Thus,
by definition,
a = PsA.
(7.78)
Now suppose we have a parallel beam with a density p and velocity v
towards the target. In time t, this beam fills a volume pvtA, where A
is now the area normal to the beam which fully contains it. Choosing
t such that the volume contains just one particle, we can write
1= pvtA,
(7.79)
or,
I
A=.
pvt
(7.80)
Therefore, Eq. (7.78) can be rewritten as
Pslt
a=.
pv
(7.81)
The quantity Pslt appearing in the numerator is called the transition
rate, Le., the probability of scattering per unit time. The quantity
in the denominator is the flux of particles. Thus the cross section
can be defined as the transition rate per unit of incident flux.
Although we have written these formulas for classical particles,
we will now carry everything over to quantum mechanical scattering.
Ps will now mean the quantum mechanica.l transition probability.
Consider a scattering process which takes an initial state Ii) at t 00 to a final state If) at t ~ +00. The SlIlatrix element between
Chapter 7. Cross sections and decay rates
132
these two states is given by
Sf' = (J 151 i)
6f,+ i(27[)'6' (~p, 2(Pf) I;I /2k,v 9J2~fV~f'.
(7.82)
The 6,.term stands for the case when nothing happened, and is of
no interest to us. Throwing it out and squaring the matrix element,
we obtain
ISf;1
= (27[)'6' (
~p;  ~Pf) VT I;I 2~, V92~f Vl~fd2 ,
(7.83)
where the square of the momentumconserving afunction has given
us
according to 7.1. As in there, we have assumed that the
scattering process occurs within a large volume V and a large time
T, both of which will be taken to infinity at the end.
This gives the transition probability. The transition rate, or the
transition probability per unit time, is therefore given by
VT,
f,1
T
2
,
1 II 1
L.:P'7Pf VII, 2EY
f 2EfV I~fd 2 ,
,(",
= (27[) 6
",)
(7.84)
This is the transition rate to a final state of specified momentum.
For a range of momenta, we should integrate this, including a factor
of V d3p/(27[)3 for the number of states. This gives
u =
P:V4E:E,J(g (2:~i;Ef)
(27[)'6'(
~P'  2(Pf)l~f'12,
(7.85)
where we have written the initial state factors explicitly.
As for the flux, we have taken the incident state normalized to
one particle in the entire volume, Le., the density of particles in
the initial state to be I/V. When we considered one beam particle
7.4.
Generalities of 2t02 scattering
133
hitting one target particle at rest, we had to multiply the density
by the velocity of the beam particle in order to obtain the flux. In
general, both particles may be moving, and for v we should use
mIm~
J(PI . P2)2 Vrel
E12
(7.86)
'
where Pi and E i denote, for i = 1,2, the 3momenta and the energies
of the initial particles whose masses are ml and m2. In a collinear
frame, i.e., if PI and P2 are along the same line, this reduces to
Vrel
1~: 
~ I
(7.87)
This coincides with relative velocity for small velocities, but not in
general. Then we can write the definition of the scattering cross
section as
<7=
Vr e4E
E
l12
Jn
f
d pf
(211")32E (211") ~
4 4 (",
",)
LPi  LPf
if
lAi'fil .
(7.88)
7.4
Generalities of 2to2 scattering
A scattering process has two particles in the initial state. But there
can be any number of particles in the final state. However, the calculations of cross sections become particularly simple if the final state
also contains two particles. In this ease, it is also easier to analyze
experimental data. Such processes are called 2 to2 scattering processes. In this section, we show some generalities of the calculation
of scattering crosssection in such cases.
Let us symbolically denote the scattering process in the following
way:
(7.89)
where PI and P2 are the 4momenta for the initial particles 1 and 2,
and m2. For the final particles l' and 2', we
which have masses
denote the momenta and masses with a prime.
m,
134
Chapter 7. Cross sections and decay rates
From the general formula given in Eq. (7.88), we can write down
the expression for the crosssection for the present case:
1
(1
V<el
1
4E, E,
d p'
d p'
(211" )3~El
(211" )3;E,
(211")48 4(Pl + p,  p;  p;)
Of course, I.AI' depends on the
interaction Hamiltonian we have
edge of that, we cannot find out
the kinematics can give us some
I.AI' .
(7.90)
process we are considering and the
for that. Without a detailed knowlthe cross section. However, already
simplification in the formula.
p;=(E"p')
e
p, = (E"p)  _ .  
p; =
Figure 7.1:
'~p, =
(E" p)
(E" p')
Kinematics of two particle scattering in the centerafmass
frame.
The analysis is usually done in two types of frames:
1. The CenterofMass (or CM) frame, which is defined by the
condition that the total 3momentum of the initial particles is
zero in this frame.
2. The Lab frame, in which one of the initial particles is at rest.
We should mention that the phrase "Lab frame" is somewhat con
fusing. These days many experiments are performed in which none
of the initial particles is at rest. To make matters more confusing,
in many modern experiments the physical frame of the laboratory is
really the centeroimass frame. Nevertheless, the name "Lab frame ll
has stuck from the days when all experiments were done by directing
a beam of particles onto a fixed target. Maybe a name like the "fixed
target frame" would have been more appropriate for what we want
to say, but we will stick to the conventionaL
7.4. Generalities of 2to2 scattering
7.4.1
135
eM frame
The ofunction in Eq. (7.90) implies
Pi =  P2,
(7.91)
PI =  P2
(7.92)
since
in the CM frame by definition. Then we can perform the integration
over p~ t say, to write
(J
= 641I"
2E1 E
1
2 Vrel
p;
d E' o(E I
E'
1
2
+ E2  E'1  E'}I1
2 ..# .
7.93 )
The Feynman amplitude factor has also been evaluated subject to
Eqs. (7.91) and (7.92).
The remaining integration is over d 3 p'Jo Noting that p~ = P21 we
can denote the common magnitude by pI, The energies of the two
final particles, however, can be different if their masses are different,
so we keep distinguishing them by the symbols E; and E~. Then,
(7.94)
where dO is the integration measure for the angular variables 0 and
'P, and we have used Eq. (7.32)' to exchange the other integration
variable for the energy of one of the final state particles. The general
kinematical variables in the CM frame have been shown in Fig. 7.l.
We now show that the magnitudes of the energies are determined
by the energy in the initial particles, so we can perform the integration over it. The angle 8, however, is not determined by the initial
momenta, and remains a free variable. It is called the scattering
angle.
Squaring both sides of Eq. (7.91) and expressing the squares of
the 3momenta in terms of energy and ffia.iS, we obtain Ei 2  m'l2 =
E22  m2 2 . We can thus express E 2 in terms of E 1:
I
E'2 JE 1
2 m'2
+2
m /2'
1
(7.95)
It is customary to represent the total initial energy in the CM frame
by the notation VS:
VS = E I + E 2
(7.96)
136
Chapter 7. Cross sections and decay rates
So the dfunction appearing in Eq. (7.93) can be written as
6(VSE~JE~2m~2+m22).
(7.97)
The argument of this dfunction is a function of the integration variable E;, and so we must use Eq. (3.15) to reduce it. The argument
vanishes when
s+m /2 _m/2
E', 
1
2.,fS
== WI,
(7.98)
= w.
(7.99)
which implies, through Eq. (7.95),
/2
+ m 2/2
E'  s  m2.,fS
1
Evaluating the derivative of the argument of the dfunction of Eq.
(7.97) and using Eq. (3.15), we finally obtain
d (.,fS  E;  JE;. 
mj2 + m,.) = d(E;  wtl ~.
(7.100)
We now go bac), to Eq. (7.93), which can be written as
drY
1
d n~, = 6 471" .E1E 2 .,fSSVreJ
J"'"
dLI
P d(E l
wd 1.
oLl

P
6411" E, E .,fSv,el
loLl .
(7.101)
where now p' as well as the Feynman amplitude part have to be
evaluated subject to the momentum conservation equations. Using
Eq. (7.98), we find
p' =
([s 
(m,
+ m')~ls
 (m, m,).]) y,
(7.102)
The relative velocity of the initial particles can also be expressed in
terms of the variable s. Using the general definition of Eq. (7.87) and
substituting the eM frame relation from Eq. (7.92), we can write
V,e' =
P,
U, + ~J '
(7.103)
7.4. Generalities of 2to2 scattering
137
which implies that in the eM frame
(7.104)
where P = PI
we can write
= P2'
However, from the definition of sin Eq. (7.96),
(7.105)
The solution to P is an expression similar to Eq. (7.102), with the
masses of initial particles instead of the final ones.
Putting all the factors together, we can rewrite Eq. (7.101) in the
final form:
do
dfl
=c;I.64rr 2 s
[{s  (mj + m;)2) {s 
(mj m;)2)] Y, 1..A'1 2
{s  (ml + m2)2) {s  (m,  m2)2)
.
(7.106)
We emphasize again that in this expression, the Feynman amplitude
factor has to be evaluated with 4momentUITl conservation. For elastic scattering, this formula simplifies to
du
dfl
1
..A'2
64,..2 s I I
(7.107)
The quantity duJdfl is called the differential crosssection, since
it involves derivatives of the total crosssection. In a specific problem,
we need. to integrate it over the angular variables to obtain the total
crosssection. Notice that the azimuthal variable <p does not enter
the kinematics, so the Feynman amplitude cannot depend on it. The
integration over this variable will simply give a factor 211'". However in
general, the Feynman amplitude square will depend on the scattering
angle 0, and therefore this dependence will have to be taken into
account while performing the integration over ().
1
7.4.2
Lab frame
We now turn to the other frame of interest, in which one of the initial
particles is at rest. Without loss of generality, we can take this to
be the first particle in the general notation for the scattering process
138
Chapter 7. Cross sections and decay rates
P; = (E;, V;)
82
P2
(E2,V2) .81
Figure 7.2: Kinematics of two particle scattering in the Lab frame. Particle
1 is at rest in this frame, shown as a blob.
given in Eq. (7.89). The general picture of the scattering process will
then look like Fig. 7.2.
The most general formulas in this frame look quite cumbersome,
so we are presenting the formulas for the case in which one of the
particles in the initial as well as in the final state is massless. Since
we have chosen the initial state as the rest frame for the particle 1,
this particle cannot be massless. We will take m2 = 0. This also
means that Vrel = 1, since one of these initial particles is massless
and the other is at rest. In the final state, without further loss of
generality, we can put m~ = o.
We can still start from Eq. (7.93), except that with our assumption of m2 = 0, it is more convenient to integrate over p~ first and
leave the integration over V; for the future. The only change in Eq.
without any other change at
(7.93) would be to replace d 3 by d3
this stage. In this frame, the components of the momenta of different
particles can be chosen as follows:
p;
p;,
PI = (mhO,O,O),
P2 = (E2, 0, 0, E 2 ),
pIt = (Ei
sin 81 , 0, P/l
I
p;
COS
OJ) ,
P; = (E;,E~sin82,0,E~cos82).
(7.108)
We have chosen the zaxis in the direction of the 3momentum of the
second particle in the initial state. We certainly have the freedom
to do that. And then the xaxis has been chosen so that V; lies in
7.4. Generalities of 2t02 scattering
139
the xz plane. This implies that the other scattered particle must
also be in the xz plane, since no other particle has any momentum
component in the ydirection and hence momentum conservation in
this direction forces this component to be zero.
Applying momentum conservation conditions on the x and z components, we obtain the relations
pit
sinDt  E;sinB2 = O!
p; cos 01 + E2cos O2 =
E2
(7.109)
These relations enable us to evaluate PI in terms of
E;.
p;2 = (E2  E2COS 02)2 + (E2sin 02)2
= E? + E 22  2E2 E 2cos O2 .
(7.110)
This means
(7.111)
Writing
(7.112)
in analogy with Eq. (7.94), we can now express the differential cross
section in the following form:
M
dO 2
E JdE 2 EE~ <I(ml
64" mt 2
t .
/
+ E2 
E;  E 2) 1'*'1
(7.113)
where Ei is related to E 2 by Eq. (7.111).
The argument of the <Ifunction is a function of the integration
variable E 2, so we must again use Eq. (3.15) to simplify it. The
argument vanishes when
(mt
+ E2 
E 2)2 = E?
+ E 22 
2E2E 2cos O2 +m;2, (7.114)
Le.) when
(7.115)
Thus,
<I(m!
+ E 2  E;  E 2) =
<I(E2
W2)
I1 +
dE'IdE~
2 E;=lol2
(7.116)
Chapter 7. Cross sections and decay rates
140
Using the expression for E; from Eq. (7.111), we obtain
dE'
1+_'
dE;
I
E;=W2
= 1
E2  E,coso'l
E'
1 +
I
E~=W2
m, + E,  E,cosO,
w,
(7.117)
where we have used the 8function to replace E~ + E 2 by ml + E 2 ,
and written WI for the value of E~ when 2 = W2. Substituting the
<I. function in Eq. (7.113) and performing the integration over E 2, we
obtain
1
mj + 2m,E,  m',z
(m,
+ Ez 
E, cos O,)Z
1.4fl z.
(7.118)
Once again, of course, it has to be remembered that the quantity
1.4f1' has to be evaluated subject to the restrictions imposed by momentum conservation.
o Exercise 7.7 Consider the deoo1J oj n ]l(trtide. oj TTLO.SS A1 into two
particles 01 musse,s 1n\ and 1112. FoUOUlin!J un unn1llsis simila.l to
what was done in this section, ShOUl that the differential d.ecay rate
in the rest Intme of the decaying pnrticll' il'l given b'Y
df
.(7.119)
dfl
Use this directly to obtain the decay
7.5
l'{tte.
ll.eriucd in Eq. (7.34).
Inelastic scattering with 4fermion interaction
In this section, we present an example of calculatioll of scattering
cross sections, viz, that of neutrinoele<.:troll scattering. Many more
examples of scatlerillg processes will be discusscd in the later chapters, particularly in CiJ. 9 and Ch. 15.
MallY kinds of ncutrinoelectron scattering processes call be discussed with 4fcrmioll interactions. For examplc, one can discuss
Vee ; Vf,C I or vpe v,1p. In such ca...;cs, t.he illitial aBel the final
states have the ::;amc pa.rticles, alld thcse processes are instances of
elastic scattering. Here, we discuss another process, where the scattering is incla..'itic.
7.5. Inelastic scattering with 4fermion interaction
141
The process we have in mind is:
(7.120)
where, as usual, p, k etc. are the momenta of various particles. The
interaction Lagrangian describing the process is
~nl
"
=G
J2 F
"'(vo)"I (I 
"15)"'(,) "'(~)"I,,(1  "Is)"'(vv) '
(7.121)
which is really the hermitian conjugate of the interaction term given
in Eq. (7.36) to describe muon decay. The interaction Hamiltonian
is the negative of the interaction Lagrangian, and so the Feynman
amplitude can be written as
.4li
=~
U(vol(k'Jy"(1 "Is)U(,)(p)
il(~)(p'Jy,,(1
 "Is)u(vv)(k).
(7.122)
We will square this amplitude, with a sum over final spins and
an average over initial spins. For the averaging 1 one peculiarity of
neutrinos has to be noted. Although they are spin~ particles, they
appear to come only in one helicity state, with helicity 1. Thus,
averaging over initial spins will involve a division by 2 because of
the electron spin only. There will be no corresponding factor from
neutrino spin states. So, in the general formulas for cross sections
derived in 7.4, we should put
1.41 2 = ~2 L:. 1.41 2
Spin
G:
Tr
[(p + m,JyP(1 
x Tr [hp(1  "Is)(p'
"Isl'''I'(1  "15) 1
+ m~h,(1
 "15)), (7.123)
using the techniques described in 7.2.2. Apart from some minor
changes in notation , this is the same as the expression as in Eq.
(7.41). Since we discussed the traces in detail there, we will skip the
calculations here and simply write down the final result, which is:
1.41 2 = 64G} p . k p' . k' .
(7.124)
We can now use this to evaluate the cr=section in either the eM
frame or the Lab frame. Let us discuss them one by one. We can put
ml = me and mil = mil' whereas m2 = m2 = 0 since the neutrinos
are ma.c;sless.
7.5.1
Cross sections and decay rates
Chapter 7.
142
Crosssection in CM frame
In the CM frame where p = k, we can use Eq. (7.106) for the
differential crosssection to obtain
do
 dO
(sm;)2
14(1
., .
1
64;or2 s
S 
(7.125)
m~
To complete the calculation, we need to express the square of the
Feynman amplitude in terms of the kinematical variahles in the CM
frame. We first note that p = k. The energy of the initial neutrino
is then p as well, since it is massless. The energy of the incoming
electron will be denoted by E. Adapting the general formula of Eq.
(7.105) for the present purpose, we can write
s  m2
= ":2:;..jSrs",e .
(7.126)
Therefore,
E 
Jp
2 _
+ me
s +m e
(7.127)
2..jS .
Then
(7.128)
The quantity p' . k', which also appears in the expression of Eq.
(7.124), can be evaluated by starting from the momentum conservation equation p + k = pI + k'. and squaring hoth sides. This gives,
since k2 = k'2 = 0,
II
P .k
= p. k 
22
2(m"  me)
1
= 2(8

Tn").
(7.129)
Putting these back into Eq. (7.124) and u,ing Eq. (7.125), we obtain
do
C} (s 
  dO  4;or2
m;)2
s
(7.130)
which is independent of the scattering angle in this case. Thus,
integration over the angular variables is trivial, and it yields a factor
47T. The total crosssection is given by
0=
For s
m;,
C 2 (s  m 2 )2
.E."
;or
s
the total crosssection is proportional to s.
(7.131)
7.5. Inelastic scattering with 4fermion interaction
7.5.2
143
Crosssection in Lab frame
To evaluate the same crosssection in the Lab frame, we can use the
general formulas of 7.4.2 since we have one massless particle in both
the initial and the final states. In this frame,
(7.132)
pk=mw 1
w = k being the energy of the incoming neutrino. Then} using the
first part of Eq. (7.129), we obtain
2+ 2mew  milo2)
'k' = 2"1 ( m
p'
(7.133)
Putting these in the expression for IAfl 2 appearing in Eq. (7.124) and
using the expression for the differential crosssection in Eq. (7.118),
we obtain
G}
da
dfh = 411'"2
(m; ++
me
2mew  m~)2
w  w cos 82
(7.134)
Unlike in the eM frame} the differential crosssection depends on the
scattering angle in this frame.
To obtain the total crosssection} we can integrate over the angular variables. The integration over the azimuthal angle gives a factor
of 21r} so we obtain
<7=
G2jl
'(m2+2m w_m 2 )2
..L
d(cosB J
'
,
"
21T'
1
G} (m; + 2m,w 
;
me
+w 
w cos 82
m~)2
m; + 2mew
This is the same as the total crosssection obtained in the
for
(7.135)
eM frame
(7.136)
This is not a coincidence. It will be discussed in more detail in the
next section.
o Exercise 7.8 Show that the CM frame for the
vJ,e scattering
moues with a speed v = w/(m~ + w) with rcs'Pect to the rest frame
of the electron. Calculo.te the energy oj both, particles in this frame.
Ij thell are denoted b1J Ell o.nd Wv , show thut s = (Ell + w v )2 =
+2mew.
m;
144
Chapter 7. Cross sections and decay rates
o Exercise 7.9 In the Lab frome in 'Which. the initia.l eLectron is at
rest, ca.teula.te the minimum (01' threshold) energy that th.e incoming
neutrino mu.st p088es8 in orde,. to initiate the process described here.
Use me = 0.511 MeV a.nd rn" = 106 MeV. IHint: The energy E2,
gi1Jen in Eq. (7.115), must be posihue.)
7.6
Mandelstam variables
Let us go back to the general definition of the scattering crosssection
in Eq. (7.88). The absolute square of the Feynman amplitude is a
Lorentz invariant quantity. So is the energymomentum conserving
6function. The integration measures are also Lorentz invariant, as
argued in Eq. (7.53). With Vee' as defined in Eq. (7.86), the product
v ee ,E,E2 is also Lorentz invariant. It then follows that the total cross
section is Lorentz invariant. Of course, differential crosssections
need not be.
Since the total crosssection is invariant, it would be elegant to
present the formulas in a notation which is explicitly Lorentz invariant. Instead. of individual energies and momenta, we can write
the crosssection in terms of their Lorentz invariant combinations.
For a 2to2 scattering, the invariant combinations that one uses are
called Mandelstam variables. Let us first define them, in the general
notation of scattering variables introduced in Eq. (7.89):
s
= (PI + P2)2 ,
t = (PI U
pD 2 ,
= (PI  P2)2
(7.137)
Of these three, the quantity s was introduced before in specific
frames. The general definition given here reduces to the definition of
Eq. (7.96) for the eM frame since PI + P2 = 0 in that frame. Also,
in the Lab frame in which one of the initial particles is at rest, it
reduces to the expression of Eq. (7.136).
The definition of the other two Mandelstam variables, viz., t and
ti, has some ambiguity, since there is no universal way of assigning
which one of the final particles is the first one, and which one is
the second. Thus, two persons may disagree about what is t and
what is u in a given process, but it is not important. If we state our
assignments clearly, there would be no ambiguity.
7.6. Mandelstam variables
145
Let us, however. try to convince ourselves that the three variables
are sufficient to describe the crosssection for 2t02 scattering. There
are fouf particles, but because of 4momentum conservation, the 4momenta of three of them are independent. From threr momenta,
we can make six invariants, viz. we can contract each momenta with
itself, and with one another. Thus we will have six different scalar
products. However, fOUf combinations of these will give the masses
of the four particles involved. Thus, we are left with only two scalar
parameters. And we have defined three Mandelstam variables. This
shows that the Mandelstam variables are not only sufficient, in fact
there is some redundancy in their definition. In fact, it is straight
forward to verify that
(7.138)
Exercise 7.10 If the deco.lI ra.te of a. pa.rticle of ma.ss M is r in its
rest frame a.nd f' in u fra.me 'WheTe its ener9"Y is E, Q.Tgue from Eq.
(1.8) t"at
(7139)
irrespectiue of the interoctions.
Chapter 8
Quantization of the
electromagnetic field
We have already discussed how to quantize fields with spinO and
spin!. In this chapter, we describe how to quantize spinl fields.
A familiar example of spinI particles is the photon, which is the
quantum of the electromagnetic field. Let us now see how the photon
appears through quantization of this field. This will help us see the
connection between the classical theory and its quantized version. To
set up the notation, we will begin with a brief summary of classical
electromagnetic theory.
8.1
Classical theory of electromagnetic fields
The classical theory of electromagnetic fields is based on the Maxwell
equations. In a notation which is not manifestly covariant, these
equations can be written as
"'1 E = p,
"'1XBa:=j,
(8.1)
in HeavisideLorentz units with c = 1. Here p is called the charge
density and j the current density, and the components of the electric
field E and the magnetic field B are constrained by two further
equations:
"'1B=O,
8B
"'1xE=8t
146
(8.2)
8.1. Classical theory of electromagnetic fields
147
These last two equations involve six quantities, viz., the components
of the 3vectors E and B. They can be expressed in terms of four
quantities, viz., the components of a 3vector A and a scalar quantity
<po
B = V x A,
E = V<p 
aA
at .
(8.3)
In fact, these four quantities transform like the components of a 4vector A, Le.,
A"
= (Ao, A) = (<p, A).
(8.4)
In terms of these quantities, one can now write Eq. (8.3) in a manifestly covariant form:
(8.5)
where the components of the fieldstrength tensor Fv " are the components of the electric and magnetic fields:
FIlil
E'
E'
0
B3
_E2
_E3 _B 2
E 2 E3
_B 3 B 2
0
B
'
B'
0
(8.6)
Fv" is called the field strength tensor, while Av is called the potential.
The components of Fv" can be obtained by replacing EO by _EO in
Eq. (8.6). The two homogeneous Maxwell equations given in Eq.
(8.2) can now be written in the form
(8.7)
On the other hand, the inhomogeneous Maxwell equations, given in
Eq. (8.1), can be written as
(8.8)
where l' is a 4vector which incorporates the sources, Le., the charge
density and the current density:
j"
= (j0,j) =
(p,j).
(8.9)
148
Chapter 8. Quantization of the electromagnetic field
The Lagrangian formulation of this theory can be performed if
we start from
(8.10)
The EulerLagrange equations obtained from this Lagrangian by
varying A~ are precisely those given in Eq. (8.8). The current density
y\ which comes from external sourc.es, depends on the field operators of these source fields. For example, if the sources are electrons,
we should have j~ = eVry~1/J. In field theoretic language, we can
say that this is an interaction term. We will discuss these terms in
Ch.9. In this chapter, we discuss only the pure electromagnetic field,
which can be described by
!t' = 
~F~vPv
(8.11)
For future purposes, let us also note that
the redefinitions of A~ which are of the form
~LV
is invariant under
(8.12)
where (} is any differentiable function of spacetime. Such transformations can affect the values of A~ at each spacetime point differently, and are called local tmnsformations. In contrast, for the phase
rotations of fermion fields or of complex scalar fields, which we discussed in Eqs. (3.54) and (4.91), the transformation was the same for
all spacetime points. This type of transformations are called global
transformations.
Under the transformation (8.12), the Lagrangian of Eq. (8.10)
changes by
(8.13)
If the current jlJ. is a conserved current, 8/1YJ = 0, we find that
the Lagrangian changes by a total divergence. It follows that the
dynamics of the electromagnetic field is invariant under (8.12) even
when it is coupled to an external current, as long as the current is
conserved. This derivation assumes that jI' is independent of AJl
This assumption may not be true in general , as we found for scalar
electrodynamics in Ex. 2.9 (p26). Even thm the Lagrangian may be
invariant, as it was there, but the proof is a little more complicated.
8.2. Problems witb quantization
8.2
149
Problems with quantization
There are many ways to see that the classical theory of electromagnetic fields, outlined above, cannot be directly quantized. One way
is to convince oneself that the propagator of such a field does not
exist. For this, let us go back to the equation of motion. Eq. (8.8).
This can be rewritten, using the definition of the field tensor from
Eq. (8.5), in the following form:
a~(~A~a~A~)
=i'.
(8.14)
or
(g'~D  a~~) A~ =
i'
(8.15)
The equation for the corresponding Green's function will be
(8.16)
where all the derivatives are with respect to the coordinate x. The
Fourier transform of this equation will read
(8.17)
The Fourier transform of the proposed propagator. D~" (k). must be
a rank2 tensor which depends only on the 4vector k. Thus, its most
general form can be written as
(8.18)
where a and b are Lorentzinvariant quantitie..<i. Putting this form
back into Eq. (8.17), we see that the terms involving b cancel out.
This is the first indication that the propagator does not exist, because
b is indeterminable. Moreover, we are left with the following equation
involving a:
a k 9 ~ v
2
+ a k~k v
9~ v
(8.19)
If we now want to equate the coefficients of g>' v on both sides, we
would get a = l/k 2 , but the k~k" term gives a = O. In short, this
equation cannot be solved. The propagator does not exist.
150
Chapter 8. Quantization of the electromagnetic field
The basic reason for the nonexistence of the propagator is the
following. The momenta conjugate to the A~ are given by
= JL =
JL
M~
J(80A~)
= F~o.
(8.20)
It follows that nO = O. One of the canonical momenta does not exist. The equations defining the canonical momenta cannot therefore
be inverted to express the quantities Alt in terms of the momenta.
Accordingly, the Hamiltonian formulation does not exist.
The deeper reason for this fact can be understood by looking at
the Maxwell equations in Eq. (8.1). The field variables are the six
components of the 3vectors Band E, a..c; mentioned earlier. However, Eq. (8.2) introduces four constraints on them, so that there
are really two independent degrees of freedom in the electromagnetic
field. We are trying to represent these by the 4vector A~. So, all the
components of AJ.' are not independent. We have never encountered
such a situation while talking about scalar fields or Dirac fields. Indeed, a Dirac field also had four components, but all of them were
independent. ~or AIL I only two of the fOUf components are independent, and this is the root of the problem. We will have to somehow.
deal with this problem of redundancy before we can quantize the theory. Finally, although the field A~ is called the potential in classical
physics, we shall refer to it as the photon field.
o Exercise 8.1 SUPllose, instead of the photon field, 'We a.re considering the theory of a. massitJe \lector field, given bll the Proen La~
grangUln
if ~41 FlAo!' F""
+2"1 M' A" A1' A~
IAoJ
(8.21 )
Find the equ.ation of motion. Show that in th.is case, there is no
problem with defining the propagator I and one obtains
D~"(k) = k' _1 M' ( g,," + k"k")
M'
.
8.3
(8.22)
Modifying the classical Lagrangian
The photon is known to be massless to a very high degree of accuracy. So we will not try to add a mass term as in Eq. (8.21) just to
8.3. Modifying the classical Lagrangian
151
define the propagator. Such a mass term is also not invariant UDder the local transformation noted in Eq. (8.12). !nvariance under
this transformation means that given any electric and magnetic field,
there is some freedom in defining what the A~ should be. Therefore,
if we try to find A~(x) at any spacetime point x by operating on the
sources jlJ.(x / ) by a Green's function, the Green's function needs to
be able to find all the A~(x) which are related by Eq. (8.12). Such
a Green's function cannot exist, as we have seen in the previous section. This is a classical problem , not a quantum one. A first step
towards a resolution is to look for a representative A~(x) from each
(infinite) set of A~(x) connected by Eq. (8.12). This representative
A~(x) can be chosen by imposing a condition on A~(x), and the procedure is called gaugefixing. Examples of gaugefixing relations are
the Lorenz gauge, a~A~ = 0, the Coulomb gauge, V . A = 0, the
axial gange, A 3 = 0, the temporal gauge, Ao = 0. Different choices
of the gaugefixing relation (also called gauge choices) are useful in
different situations. We are interested in a Lagrangian formulation,
so we shall incorporate the gaugefixing procedure directly into the
Lagrangian. We shall use a generalization of the Lorenz gauge, which
is the most commonly used manifestly Lorentzcovariant gaugefixing
relation.
Let us start this procedure by rewriting the action arising from
the Lagrangian of Eq. (8.11).
J
~ Jri'x [(a~A")(~
(a~A")(O" A~)I
J [(a~A")(a"A~)l J [a~(A"i)"A~) A"(a~i)" A~)l
Jri'x [a~(A"i)" A~) a"(A"a~A~)
JJi' = 
~ ri' x F~"F~"
A") 
(8.23)
The second term can be rewritten as
d'x
d'x
+ (0" A")(a~A~)].
(8.24)
The total divergence terms, on integration, will produce surface
terms which are assumed to vanish on the boundary of the 4dimensional spacetime, which is at infinity. Thus we are left with
JJi' =
~
d'x
[(a~A")(a~A")  (a~A~)2]
(8.25)
152
Chapter 8. Quantization of the electromagnetic field
Of course, this is still the same action. We have not made any changes
whatsoever. So all the problems that we mentioned earlier still remain with us.
As we mentioned, the problem originated because we had too
much freedom in defining the A~. We nee,Ho sacrifice some of this
freedom. We do this by introducing another term in the Lagrangian,
which is
(8.26)
This is called the gaugefixing term, which is why we have put a
subscript "GF" on this term. The total action, after the inclusion of
this term, is
d = ~ Jd'x
[(a~A")(8"A") 
(1 D(a~A~)2].
(8.27)
This is for the free photon field. In presence of sources, we can
simply augment it by the j~A~ term as in Eq. (8.10). The equations
of motion that would then follow from this action is:
DA" 
(1  Da"(a~A~)
j" ,
(8.28)
instead of Eq. (8.8) which can be obtained from this equation by
letting ~ go to infinity.
o Exercise 8.2 Ij
A~
+ {}~(}
a~A"(x) =
f(x), construct O(x) such th"t
satisfies OpA'#' = O. [Hint: Use 8enIn,. propagator. I
A~ =
It is a fair question to ask who gave us the right to change
Maxwell's equations, which has been verified so well in experiments.
We can argue, however, that we have not really changed the Maxwell
equations as they would appear in terms of the field tensor F~".
What we are changing, or rather restricting, is the definition of F pv
in terms of A w Previously, given any FIJVl we could have taken any
A~ satisfying Eq. (8.5). We can no longer do that. In fact, if we use
the freedom of the local transformation of Eq. (8.12) to define the
A~ in such a way that
a A ~O
,
~
(8.29)
the gaugefixing term vanishes, and we have done no offense to the
Maxwell equations. And this can be done, at least classically, which
8.4. Propagator
153
can be seen from Eq. (8.28). Since the current j~ is conserved, i.e.,
a~j~ = 0, taking the 4divergence of Eq. (8.28) we obtain
(8.30)
We can now argue that appropriate boundary conditions can be imposed on A~ so that Eq. (8.29) is satisfied everywhere. In other
words, we can always transform A~ according to Eq. (8.12) so that
the new A~ satisfy Eq. (8.29). Thus, in order not to disturb Maxwell
equations, we are now restricted to use only those A~IS which satisfy
Eq. (8.29).
In passing, we note that the canonical momenta are now given
by
(8.31)
Obviously, lIo no longer vanishes identically. But we still have a
problem if we try to canonically quantize the theory. If we try to
mimic the technique employed for scalar fields and write
[A~(t,,,,),lI"(t,y)L = i&~&3(",  y),
(8.32)
it will imply
(8.33)
Thus, if a~A~ vanishes everywhere as we argued a little while ago,
this commutation relation cannot be imposed. However, we shall
worry about this later. Our first priority now is to get a sensible
propagator.
8.4
Propagator
Before introducing the gauge fixing term, we showed that the propa,gator could not be defined. Let us revisit this issue now, aided with
the gauge fixing term.
154
Chapter 8. Quantization of the electromagnetic field
The equation of motion, including the gauge fixing term, was
given in Eq. (8.28). We can rewrite it as
(8.34)
The equation for the Green's function will be
Taking the Fourier transform of this equation, we obtain
The general tensorial form of the propagator is still given by Eq.
(8.18). Putting this form back into Eq. (8.36), we obtain
2
g"v =  [9""k = _ok
2g" v +
(1 Dk"k"] (og"v+bk"kv)
[0 (1  D~bk2] k"kv .
(8.37)
This can now be solved, and the solution is
1(
b=/T
(8.38)
The propagator is then given by
k ]
D,..(k) =  k12 [ g"V  (1  {) k ~/
(8.39)
Like the propagator for the scalar and fermion fields, there is a
problem if we try to find the propagator in coordinate space. The
problem can be solved in the same way, so that we get the Feynman
propagator:
(8.40)
Although the problem of nonexistence of a gaugeinvariant propagator is a classical one, this particular propagator lies outside the
purview of classical physics. One way to see this is to note that this
propagator is complex , due to the ie term in the denominator. That
8.4. Propagator
155
would not be allowed in classical physics, where a real source leads to
a real field. On the other hand in quantum physics only observables
need be Hermitian (with real eigenvalues), and we have a certain
leeway as a result.
It is natural to ask at tbis point if the propagator of Eq. (8.40)
really has any meaning at all. It depends 011 an arbitrary parameter
which was put in to modify the action. This parameter labels
a class of theories which are all classically equivalent to MaxweIrs
electrodynamics in the gauge a~A~ = O. Have we then found a class
of propagators for these theories which will g"ive different results that
agree only in that specific gauge? More specifically, if we calculate a
Feynman diagram involving an internal photon line, will the use of
this propagator lead to a result that depends 011 and can therefore
have (almost) any value?
But there is no need to worry_ In fact, ill the calculation of S
matrix elements, the ~dependence always cancels out. The result
will be independent of . We will show thi, explicitly with some
examples in eh. 9. In practical calculations, this fact can be utilized
in two ways. Either we can evaluate diagrams with this general
propagator and check whether the parameter ~ rcally cancels, which
will give a good cross check for our calculations. Or we can make the
calculations simpler by choosing any value of ~ that is convenient.
Some common choices are:
~<
=]
= 0
=}
tDJ1V (k)
.
'* ,D~u(k)
=  . k.2'9~u
+ 'lE. ,
i
=  k2 +
if:
(8.41 )
9,," 
k,.k
0
v]
(8.42)
The first one is usually referred to as the FeYIlIllan 't Hooft gauge,
whose advantage is that the propagator is v~ry simple, contains just
one term, The second one is usually called the Laudau gauge, which
satisfies the relation k~ D~u(k) = 0, which ,illlplifies the calculations
III some cases.
We shall usually work in the = 1 gauge and explicitly indicate
if we change to another gauge choice. In this gauge, the action reads
.w = ~
d'x (o"A u)(&" AU),
(8.43)
and the equations of motion are
DAJt
= jJ1 ,
(8.44)
Chapter 8. Quantization of the electromagnetic field
156
which look exactly like the equations for four massless scalar fields.
We can therefore utilize our knowledge of scalar fields and decompose
A".
8.5
Fourier decomposition of the field
Mimicking the Fourier expansions in the cases of scalar and fermionic
fields, we can write for the photon field
A"(x) =
d3k
[,I'(k)a (k)e ik .x
J(21f )32wk r=O
L..J"
+ ""(k)at(k)e ik ' x ]
"
(8.45)
where
Wk =
k,
(8.46)
and f~ constitute a set of four linearly independent 4vectors which
are called the polarization vectors. The Lorentz transformation prop
erties of A" are carried by these polarization vectors since the other
factors, as in the scalar case, transform like an overall Lorentz scalar.
The polarization vectors
,~
satisfy the following orthonormality
relations:
(8.47)
where sum on the repeated index r will not be assumed unless explicitly indicated, and the quantities
are defined as
<,
<0 =
(8.48)
1,
These vectors also satisfy a completeness relation, given by
3
L (rf~f;&I =
_gPV .
(8.49)
r=O
In order to understand the physical content of these relations, let
us consider a definite choice of the polarization vectors. Let n" be an
arbitrary timelike vector satisfying n"n" = 1, nO > O. We shall set
f~, called the scalar polarization vector, to equal nJ1.. Next we choose
8.5. Fourier decomposition of the field
157
f~ I called the longitudinal polarization vector, in the nk plane such
that '3~n~ = 0, '3. '3 = 1. It follows that
,~(k) = _k~7TF~("'1k",.n~)~n","
J(k . n)2
(8.50)
k2
o Exercise 8.3 Show, without assuming k2 = 0, that Eq. (8.50)
gi'Ues the unique vecloT' in the n~k plant' sutisfying the conditions
i3~nJt = 0 a.nd f3 3 =1.
Now we choose the remaining pair of polarization vectors fr(k)
and ,~(k) to be in the plane orthogonal to the nk plane (which is
also the fO(3 plane by OUf construction L such that
T,S
= 1,2.
(8.51)
These last two are called the transverse polarization vectors. In the
frame where the third axis is chosen along the direction of propagation (i.e. along k) and n~ = (1,0,0,0), we find from Eq. (8.50) that
the longitudinal polarization vector fj is also along the third axis,
and we can write
'b =
,r
(1,0,0,0),
= (0,1,0,0),
,~ =
(D, 0, 1,0) ,
,~ =
(0,0,0,1).
(8.52)
The polarization vector f~ is timelike, and is usually called the
scalar polarization since it is a scalar so far as the 3dimensional
space is concerned. The other three are spacelike. Of them, there
is one linear combination which would be ill the direction of the 3momentum k. This is the longitudinal polarization mode. The other
two are the transverse modes. Since k is chosen along the zdirection,
f~ is the longitudinal mode, whereas f~ and i~ are transverse modes.
Of course, these choices are not unique. In general, one can take
any four vectors satisfying Eqs. (8.47) and (8.49). For example, in
some situations it is useful to replace
and i~ by
if
,j = (0,1, i, 0)/)2,
''R =
(O,l,i,O)/)2,
(8.53)
Chapter 8. Quantization of the electromagnetic field
158
which represent left and right circularly polarized radiation for propagation along the zdirection. The choice of (It and (~, on the other
hand, represents plane polarized radiatioll. One can also choose
something more general than those given in Eq. (8.53), e.g.:
(0, cosO. isin 0,0),
(O,sinO, i cosO, 0) ,
(8.54)
for an arbitrary parameter fJ, which represent elliptical polarizati?1l
states.
Quantization is based on the commutation relation given in Eq.
(8.32), along with the relations
[A,,(t,x),Av(t,y)!_ = n,
[n"(t,x),nv(t,y)L =
n.
(8.55)
On the operators ar(kfs and their hermitian conjugates, this implies
the relations
(8.56)
Of course 1 in addition, one also obtains
[a,(k), a,(k')L = O.
[a((k), a!(k')
L= 0
(8.57)
It follows that the commutator of aD with a~ has the wrong sign
compared to the commutators for scalar fields. This is of course
related to the wrong normalization of fO shown in Eqs. (8.47) and
(8.48). We will discllss its physical implicatiolls in ~8.6.
Exercise 8.4 Show tiLe cquitJalenc.c of Ell!;,
8.6
(8,~j2)
and (8.56).
Physical states
We introduced four polariuttion vectors ill Eq. (ti.52). We also found
that the creation and annihilation operators a.<;'''iociated with the polarization vector !(J docs lIot have the correct COlllllllltation rclation::.
Let us now try to 1Il1der~talld what t.hey illlpl'y.
8.6. Physical states
159
As with scalar and fermionic fields, we define the vacuum state
by
a,(k) 10) =
for all k and all r,
(8.58)
and assume that this state is normalized, i.e.,
(8.59)
(010) = 1 .
The state containing one photon with momentum k and polarization
vector r is given by
Ik, r) '" a;(k) 10) .
(8.60)
In the case of scalar fields, states defined 111 a similar way in Eq.
(3.39) have positive norm. This is however not guaranteed in this
case. Indeed, if we consider a state with polarization vector fO, we
find that
(k,Olk',O) =
=
(0 lao(k)a6(k') I0)
(01 [ao(k),a6W)L 0)
1
(8.61)
since the extra term included in the last step vanishes anyway due
to the definition of the vacuum state in Eq. (8.58). Using the commutation relation from Eq. (8.56) and recalling that (0 = 1, we
get
(k,O Ik',O) =  83 (k  k').
(8.62)
This norm is negative, which goes against all we know about quantum
medlanics.
The number of states also seem to pose a pU:lzle. Classical electromagnetic fields have only two independent degrees of freedom, as
mentioned earlier. However, in the quanti7,cd version, we seem to
have four polarization states which are linearly independent. Somehow there must be some redundancy in the definition of these polarization states.
These two problems are related, and in order to resolve them we
have to take into account one crucial fact which we swept under the
160
Chapter 8. Quantization of tpe electromagnetic field
rug a little while ago. We are not quantizing Maxwell's electrodynamics, but a theory which agrees with it if and only if a~A~ = o.
But we cannot impose 8IJAIJ = 0 on the quantum operator Aw
However, Gupta and Bleuler realized that this is not really necessary in order to make a correspondence with the classical theory.
All that is needed is that for any two physical states 1>11) and 1>1I'},
the matrix element of a~A~ between the states vanishes:
(8.63)
Let us now define a physical state as any state which is annihilated
by the annihilation operator part of a~A~. We denote this part by
a~A~, in keeping with the notation introduced for scalar fields in Eq.
(5.44), and write this condition as
(8.64)
Of course, this will also imply
(>111 a A~ = 0 ,
(8.65)
where A~ is the part of the field operator which contains the creation
operator. Since a~A~ = a~A~ + a~A~, Eq. (8.63) is automatically
satisfied.
Let us now see what this means. From Eq. (8.45), we can write
3
L <~ (k la, (k)e ik%
r=O
3
k~<~(k)a,(k)eik.z
(8.66)
r=O
This equation is Lorentz invariant, so we can determine the physical
content in any frame and gain insight from that. Let us consider a
frame in which k is in the zdirection, i.e.,
(8.67)
where we have used Eq. (8.46). In this frame, Eq. (8.66) will take
the form
a~A~(x)
= i
Jj::)~~k
[ao(k)  a3(k)] eik%,
(8.68)
8.6. Physical states
161
using the polarization vectors given in 8.5. Now, Eq. (8.64) will be
satisfied if
ao(k) I'IJ)
a3(k) I'IJ) ,
(8.69)
and Eq. (8.65) will be satisfied if
(8.70)
This fact !las very interesting consequences on the number of independent degrees of freedom. To see this, let us employ the Feynman't Hoaft gauge mentioned earlier, Le., put the gauge parameter ~ to
be equal to unity. In this case, using Eq. (8.43), we see that the
canonical momenta are given by
n~
= 
A~.
(8.71)
We can now construct the total Hamiltonian and use normal ordering
to avoid zeropoint catastrophes. Using the Fourier decomposition
of the fields A~ from Eq. (8.45), we obtain
:H:
J
=J
=
d k Wk
L (ca;(k)ac(k)
r=O
3
d k Wk
[~a;(k)ac(k) 
ab(k)ao(k)]
(8.72)
However, if we take the matrix element of the total Hamiltonian
operator between any two physical states Wand $', the contributions
of the a~a3 and the abao terms cancel because of Eqs. (8.69) and
(8.70), and we are left with
('11'1 :H: I 'IJ) =
3
d k Wk
('IJ'
~ a;(k)ac(k)
'IJ)
(8.73)
Hence, for the matrix element of the Hamiltonian between two physical states, we find that only two polarization states really contribute.
And in fact, this is true for all other observables as well. Thus, we
seem to have cured the problem of counting of states mentioned earlier. Physical quantities do depend on two independent polarization
vectors only. Moreover, none of these two is the scalar polarization
state, so we need not bother about the problem of negative norm
states.
Chapter 8. Quantization of ehe electromagnetic field
162
Exercise 8.5 Among the
ones are ph1lsica.l jar
Q.
JOUT sto.tes dr.jinrA in Eq. (8.60), which
ph.oton moving in the zdirection'?
o Exercise 8.6 Find the total momentum oj the free photon jletd
PI~,
normal ordered and for nn arbitrary uo.lue of the gouge parumeter
~, in terms oj the creation and annihilation operators. ShoUl that
jar ph1jsical stales it depends only on the. two transuerse modes oj
pola.rization.
8.7
Another look at the propagator
The propagator is not a physically measnrahle quantity, so it depends
on all polarization states. Let us take a cia",,, look at the propagator,
trying to separate parts of it that describe different kinds of interactions. We shall work in the Feynman't Hooft gauge (~ = 1), in
which the propagator reads
D""(k) = _
;'w .
(8.74)
k + tE
Using the completeness relation for the polarization vectors, we can
rewrite this as
3
D,w(k) =
1 .
L (,,::;".
+ If r=O
(8.75)
It will prove useful to go to a definite frame, so let us go to the
previously chosen frame with
,j.
n" = (1,0,0,0),
k"  (k . n)n"
v'(k . n)2
k2 '
(8.76)
In this frame the propagator takes the form
D""(k) =
k2
1. [ ' "
+"
L.,
r=I.2
,~(k),~(k)
(k"  (k . n)n") (k"  (k . n)n")
""]
(k . n}2 _ k2
 n n . (8.77)
The first term is exclusively for transverse polarizations, and we shall
interpret it as the exchange of transverse photons. The last two terms
can be rearranged ru; a sum of two terms,
(8.78)
8.7. Another look at the propagator
163
and
v"
kvk"  (k . n)(nvk" + kvn")
DR (k) =
k2[(k . n)2 _ k2j
In our chosen frame, nJ.t = gp.o and k . n
D'W( _ ') =
c x X
_
J
J
kO so we can write
4
d k DV"(k) ik(xx')
(2rr)4 C
e
d3k 9 vO"O
9
ik(xx')
(2rr)3
= 
k2
1 9 v0 9""
4rr Ix  x'i
(8.79)
J e
dk o iko(xox lO )
2rr
o(x o  x'D) .
(8.80)
Exercise 8.7 Show that
I
q 2'
(8.81 )
IHint:
This can be done only as n limiting procedure. For exampLe, 'YOU ca.n luke the Fourier tra.nsform of the junction eXP~~;/A)
a.nd then tnke the limit .,\  00. Alternatiuely, think of the Green's
function for La.place's equation. I
The significance of these different parts can be understood if we
consider a scattering process involving the photon a..., the intermediate
particle. Since the interaction with other fields is given by j" A v ' the
matrix element for this process can be written as
J J
d4 x
d 4 x' jj(x)Dv"(x  x')j2(x') ,
(8.82)
where ji(x) and j(x') are two current densities interacting via the
photon field. The
part of the propagator contribotes
Dr
J
J
d4X
d 4x' j?(x)jg(x') o(x"  x"')
4rrlx  x'i
(8.83)
to this matrix element. This looks like the instantaneous Coulomb
interaction between the two charge densities j?(x) and j~(x'). Consequently we can interpret D't;" as the Coulomb part of the propagator.
In order to understand the remainder term D~V 1 we Bote that the
matrix element of Eq. (8.82) can be writtell in momentum space as
d4k ."
."
(2rr)411 (k)D,w(k)), (k).
(884)
164
Chapter 8. Quantization of the electromagnetic fleld
Both the currents are conserved currents and therefore satisfy
8p jP(x) = O. In momentum space, this conservation law reads
kpjP(k) = 0, again for both currents. Therefore all terms proportional to k p or k" in D p" are in fact irrelevant. So D~"(k) does not
contribute to the interaction matrix element.
This is also the reason why we can get away with the Feynman't Hooft gauge. If we had not set { = 1, the extra terms in the
propagator would have been proportional to kpk", which vanish when
the propagator couples to a conserved current.
8.8
Feynman rules for photons
We can now summarize the Feynman rules for photons. For an internal photon, the rule is
k
~ =iDP"(k)
(8.85)
IJ.
v
irrespective of the direction of the momentum since the photon carries no charge. Sometimes we will put arrows on internal photon
lines to indicate the direction of momentum. For external photon
lines! the rule is
Ap(k)
~ =<p(k),
(8.86)
Ap(k)
~
=<~(k).
In calculating cross sections involving external photons, we will
need the sum over transverse polarizations only. This is the first
term in the square brackets in Eq. (8.77). Using Eq. (8.74), we can
write it as
<~(k)<~(k) = gP"  k' [D6"(k)
+ D\i(k))
(8.87)
,.=1,2
For physical photons, k' = 0, which implies k'D6"(k} = O. However,
k' D~" (k) need not vanish since D\i (k) has a factor of 1/k'. But the
photon couples to conserved currents, so D\i(k) has no contribution
to physical amplitudes. So we can safely write
L
T=I,2
<~(k}<~(k) = gP"
(8.88)
8.8. Feynman rules for photons
165
for the polarization sum of physical photons in any scattering amplitude.
Current conservation has played a crucial role in our arguments so
far. Any current appearing in Maxwell's equation, Eq. (8.8), must
satisfy 8"j" = 0, and the dynamics of the fields constituting the
current has to be chosen according to this. We shall now see that
there is a simple principle which fixes the interaction between the
photon field and fields which act as its sources. These interactions
of the photon field are the subject of the next chapter.
Chapter 9
Quantum electrodynamics
9.1
Local gauge invariance
As we know from our discussion on scalar fields in Ch. 3, the Lagrangian of the free complex scalar field is invariant under a global
(i.e., spacetime independent) phase rotation,
(9.1)
This is because in the Lagrangian,
(9.2)
a <pI appears in any term where <p appears. So tbe phases cancel
between <p and <pl. In Eq. (9.1), e is the unit of electric charge, which
is taken to be the charge of the proton. And Q is the charge of the
particle in this unit.
Such invariance exists in the Lagrangian of the free Dirac field:
(9.3)
Since in both the terms we have both 1/J and 1/J, this Lagrangian is
invariant under the transformation
(9.4)
As in the case of complex scalars 1 8 is a global parameter, i.e., the
value of is the same at all spacetime points. The invariance of the
166
9.1. Local gauge in variance
167
action under a global transformation implies a conserved current by
Noether's theorem, as shown in 2.4. In this case, the current is
(9.5)
j" = eQ"rr"1/J .
The conserved charge corresponding to this was shown in Eq. (4.108)
to be
q = eQ
d3 x 1/J11/J
f:
= eQ
d3 p
L,
(j; (p)j, (p) 
fl (p)j;.(p)) , (96)
fl
where the In I in
arc the creation and annihilation operatc~s for
fermions and antifennions. In principle, this could be any charr;e that
comes as both positive and negative. For example it could be electron
number minus positron number, or proton number minus antiproton
number, or more generally lepton number or baryon number. We
shall consider a special case, where it is the electric charge. Any
electric charge must couple to the electromagnetic field. There is a
remarkable principle called the gauge principle which allows us to
determine this coupling.
Suppose we decide to view the phase rotation symmetry as a local
symmetry. Le., have the Lagrangian invariant under the transforma,.tions even when B is a function of spacetime. If we are talking about
the Lagrangians of Eqs. (9.2) and (9:3), we can see that we cannot
do this. Concentrating for the moment on the case of fermions, we
find that if we substitute 1/J' from Eq. (9.4), the Lagrangian becomes
:foP' =
1/J' (iJ!  m) 1/J'
= :foP
+ eQ(8"O)"rY"1/J.
(9.7)
As we can. see, the fermion Lagrangian fails to remain invariant
under a local phase rotation. We can work around this failure by
introducing new fields so that the modified Lagrangian has the loce.!
symmetry. In the transformed Lagrangian in Eq. (9.7), the extra
term on the right side involves a~8, which transforms like a 4vector.
To compensate for this term, we need to introduce another 4vector
in the theory. Let us call it A" and write a new Lagrangian:
(9.8)
If we now make a local transformation defined by
.,p + "p'
= e ieQ9 (x)7/J
AJl
+
A~,
(9.9)
168
Chapter 9. Quantum electrodynamics
where 8(x) is a local function on spacetime, we can calculate what
A~ should be so that the Lagrangian of Eq. (9.8) remains invariant
under this transformation. The transformen Lagrangian is
z'
1/: (ii!  m) 1/J'  eQ1/J'"I"1/J' A~
1/J (ii!  m) 1/J 
eQ(A~  8"0)1/J"I"1/J
(9.10)
This will be the same Lagrangian as the one of Eq. (9.8) if
(9.11)
But this transformation is familiar to us. This is the same local
transformation which keeps the free Lagrangian of the electromagnetic field invariant (see Eq. (8.12. So let us identify this field A"
with the photon field and introduce in the Lagrangian the term for
the free A" field, Eq. (8.11). Adding the free term, we can write
down the full Lagrangian for the fields 1/J and A",
(9.12)
If we look at only the terms containing photon field, we find exactly
the terms given in Eq. (8.l0L since the conserved current is given by
(9.13)
in this case, which was derived in Eq. (4.92). This confirms our
guess that A" can be thought of as the electromagnetic field. The
field theory with the Lagrangian of Eq. (9.12) is called quantum electrodynamics, or QED for short.
Let us summarize what we have done. \Ve had a global symmetry
in the free Lagrangian of the fermion field. We made this symmetry
local and demanded that the action remain invariant under this local symmetry. This required the introduction of the photon field A"
through a 1/J"I"A"1/J term. This procedure is known as gauging the
global symmetry, and the hypothesis that the action can be made
invariant by introducing new local fields is called the gauge principle. The field A" is called the gauge boson corresponding to the local
symmetry. Classically (which is how we implemented the gauge principle) the coefficient of this term, known as the coupling constant, is
arbitrary up to the normalization of fields. However, it so happens
9.1. Local gauge in variance
169
that the electric charge of all fermions are integral multiples of some
fundamental charge. The problem of finding a suitable explanation
for charge quantization remains open to this day.
It may seem from OUf presentation that the procedure works
specifically for fermion fields. But that is not so. It works for any
other field with similar global symmetry. To appreciate that, let us
write Eq. (9.12) in the following form:
(9.14)
where Dp. is called the gauge covariant derivative:
D~
= a,. + ieQA~.
(9.15)
For any complex field transforming as ~ e<Q8, its gauge
covariant derivative transforms as DJJ. + c ieQB D~ if we also im
plement the gauge transformation A~ ~ A~ + a~8. Thus all terms
that were invariant under the global phase rotation remain invariant
under the local phase rotation provided we make the minimal substitution a~ ~ D w The Lagrangian of Eg. (9.12) is constructed by
making the minimal substitution in the free Lagrangian and adding
in the Lagrangian for the free electromagnetic field. This prescription
can also be used for constructing the intera.ctions of a charged scalar
field with the electromagnetic field. This will give us the Lagrangian
of scalar electrodynamic.,;, which is
Of course, any nonderivative interaction term such as V(I) remains untouched by minimal substitution.
o Exercise 9.1 ShOUl tha.t the Lngrnnginn of Eq. (9.16) is immrio.nt
under lorol phase transformation!\, whe.re transforms b'Y Eq. (9.1)
with (J being a. function of spacetime, and AI! tra.nsjorms according
to Eq. (9.11).
Exercise 9.2 Show that, ij 1/J tTOnsjOTm by Eq. (9.4) with a spacetime de.pendf.nt 9, a.nd A~ tru1tsjorms b'Y Eq. (9.11), the tra.n8jormaHan of DJ11P is the same o.s that oj 1/;, i.e.,
D'IJ'P
~/.'
where D~
= 8IJ + ieQA;l
= e  ieQ8 D JI'P,
.1.
(9.17)
170
9.2
Chapter 9. Quantum electrodynamics
Interaction Hamiltonian
Quantum electrodynamics, the theory of photons ano fermions, is
described by the Lagrangian given in Eg. (9.12). In addition to the
free terms for the fields t/J and A Il1 it contains an interaction term,
(9.18)
which represents an interaction between the two kjnd~ of fields. We
can write down the Feynman rule for the vertex by stripping ~nt of
all the field operatnrs and multiplying by i:
ieQ 'Y" .
(9.19)
Here and in general, we will ignore time directions when writing the
Feynman rule for vertices. The external arrows of the vertex will be
positioned as appropriate to the process being considered.
Let us consider the specific case of the electron field. The electron
charge is e, which means Q = ~1. So for the interaction of the electron field with photons, the normal~ordered interaction Hamiltonian
is given by
(9.20)
To understand the nature of this interaction, let. liS use the nota
tion introduced in Eg. (5.44), where the creation and the annihilation
operator parts of the field operator where shown separately. If we do
a similar thing for the electron and the photon fields here, we will
obtain eight different terms. This will give rise to eight different kind
of events occurring at an interaction vertex, All eight are shown in
Fig, 9,1. Let us try to describe these terms in words and pictures.
First, consider the term :'I/J+ ')'~1P+A~;. The operator '1./)+ anni
hilates an electron, 7./J+ annihilates a positron, and A~ annihilates a
photon. Thus, this term describes a photon being annihilated along
with an electronpositron pair, 3.<; in Fig. 9.1a. Obviously, this cannot
be a physical process since the final state has no particles to carry
the energies of the particles in the initial stat.e.
Let us consider anothp.r term, e.g., :1j)_"'iJ.'I./I+A!~:. In this case, an
electron is annihilated, and an electron and a photon are created, as
9.2. Interaction Hamiltonian
171
(a) :VJ+ ,),"VJ+A~:
(b) :VJ ,),"VJ+A~;
~
(c) :VJ+ ')'"VJ_A~:
(d) :VJ ')'"VJ_A~:
~
(e) :VJ+ ,),"VJ+A~;
(f) :VJ ,),"VJ+A~:
(h) :VJ_ ')'"VJ_A~;
(9) :VJ+ ,),"VJ_A~:
Figure 9.1: The different events that can occur at
vertex with the QED
Hamiltonian.
shown in Fig. 9.lf. Carrying on in a similar way, we can summarize
the possibilities as follows. At each vertex, there will be two fermion
lines and one photon line. The fermion line corresponding to the
operator 1/J can be either annihilation of an electron or creation of
a positron. The other fermion line corresponds to the operator VJ,
which can either create an electron or annihilate a positron. The
photon line can correspond to the creation or the annihilation of a
photon. This gives eight combinations, which are shown in Fig. 9.1.
For one of these eight cases, we have already shown that the event
occurring at a vertex cannot occur as a physical process. This is the
case for all eight vertices. Let us denote the 4momenta of the two
fermion lines by p and p', and that of the photon line by k. Then
for all these vertices momentum conservation will yield an equation
of the form
p'
pk,
(9.21 )
for some combination of signs on the right side. Squaring both sides,
172
Chapter 9. Quantum electrodynamics
we obtain
p'2=p2+e2p.k.
(9.22)
If all of these particles are onshell, p2 = p'2 = m 2 and k 2 = 0, where
m is the mass of the electron. Thus we obtain
p' k = O.
(9.23)
Since this is a Lorentzinvariant equation , it is valid in any inertial
frame. In particular it is valid in the frame where one of the fermions
is at rest, p = (m,O,O,O). Then in that frame, k is spacelike, k =
(0, k). But since k 2 = 0, it follows that k = 0, i.e., the photon does
not exist , and therefore the process does not exist. Of course, the
vertex can be inside a bigger diagram describing a physical process,
because virtual particles inside a diagram do not have to obey p'2 =
p/2 = m 2, or k 2 = O.
Exercise 9.3 In a.n arbitrary fTa.me, ShOUl that Eq. (9.23) implies
Ew
cos8 = p k .
(9.24)
wh.en 0 is the angle between P Q.nd k.
sa.tisfied JOT' physical particles.
Argue that this cannot be
9.3
Lowest order processes
We have already seen that it is not possible to obtain a physical pre>cess with only one QED vertex. In other words, the first order term
in the Smatrix expansion has no matrix element between physical
states. So in order to look at some physical processes we have to go
at least to the second order in the Smatrix expansion, Le., we need
to consider Feynman diagrams with two vertices. Let us see what
the poSsibilities are in that case.
As we mentioned, not all the lines occurring at any single vertex
can be external lines. So, let us first consider the possibility that
only one line at each vertex is internal. If the internal line happens
to be a photon, we will have only the four fermions as external lines.
Since charge conservation has to be maintained, there can be only
9.3. Lowest order processes
173
the following processes in this category:
e + e
+ e+
e+ + e+
e
.,. e
+ e+ e+
e+
+ e+
+
+
e
,
(9.25)
These processes will be discussed in detail in 9.4 and 9.5.
Of course, there are other charged particles in nature. The photon
field couples to them just as well. If we consider charged fermions,
the Feynman rule for the interaction will be given by Eq. (9.19). So in
the lowest order sca.ttering process, some external lines may be some
other cha.rged fermion. We consider one such process, e + e+ 1" + 1"+, in 9.6.
If there is only one internal line, it can also be a fermion line. In
this case, two of the external lines will be fermion lines and the other
two, photon lines. So we can have only the following processes:
+ 'Y
e+ + 'Y
e
e +e+
+
~
+
')'+')' +
+ "y
e+ + 'Y,
e
'"'(+"
e + e+
(9.26)
Here the first one is electronphoton scattering, often called Compton
scattering. This will be discussed i'n 9.8. The second one is positronphoton scattering. The last two are inelastic processes where the
initial and final particles are different. The process e + e+ > 'Y + 'Y
is called pair annihilation. The reverse process is called pair creation.
(a)
(b)
Figure 9.2: Second order diagrams with two internal lines.
Let us now consider the diagrams with two internal lines. Here
there are two possibilities. First, both internal lines could be fermion
Chapter 9. Quantum electrodynamics
174
lines, in which case we get the diagram of Fig. 9.2a. On the other
hand, one of the internal lines can be a photon , in which case we
get Fig. 9.2b. These are not diagrams for scattering, since the initial
state does not have two particles. These cannot be decay processes
either, since one particle cannot decay to one particle in the final
state. These are called selfenergy diagrams. The first one, where
the external lines are photon lines, is called the se1f~energy diagram
for the photon, or sometimes vacuum polarization. The second one
is the selfenergy diagram for the fermion. The interpretation of
these diagrams, along with the eva.luation of these amplitudes, will
be discussed in Ch. )2.
Figure 9.3: Second order diagr<lm with three internal lines.
Finally, let us consider the possibility that all lines appearing in
both vertices are internal lines. This diagram has been shown in
Fig. 9.3. This represents a vacuum to vacuum amplitude since there
is no line left to be an external line. Since the vacuum of QED is
stable, we shall henceforth ignore such diagrams for reasons discussed
in 6.J.
o Exercise 9.4 The second order term in the Smo.trn expansi.on is:
Sl') =  e'
J J
d'xI
d'x, g
[:(~Y"1/>A"lz,:
:(1/>yv1/>A v lz,:]
(9.27)
Use Wick's theorem to write the time ordered product appearing here
in tenns oj normalordered products. ldentiJ1l aU the processes dis
cussed aoo'lle o.s different tenns in the Wick. e7;pa1'\s'ion.
9.4
Electronelectron scattering
The first example of a QED scattering process we shall consider is
electronelectron scattering:
e (PI)
+ e (1'2)
~ e (p',)
+ e (p;) ,
(9.28)
9.4. Electronelectron scattering
175
where PI, P2, etc. are the 4momenta of the different particles. This is
one of the processes which occur at the second order in the Smatrix
expansion. There are two Feynman diagrams corresponding to this
process, shown in Fig. 9.4. The two diagrams come from the same
Smatrix term, as in Fig. 6.4.
'(PI)
(a)
Figure 9.4: lowest order
(b)
diagrams for electronelectron scattering
in QED.
The Feynman amplitude
We can write down the Feynman amplitude for these diagrams using
the Feynman rules of QED. For the diagram marked (a), we obtain
where ka = PI  pi : ;:;: P2  1'2, which comes from the conservation of
4momentum at the two vertices. For the other diagram, we obtain
where kb = PI  P, = PI  P2. Since the two diagrams are related by
an exchange of the final particles, we should add them with a relative
minus sign between the amplitudes ofthe two diagrams, as we found
at the end of 6.4.
Now we have to substitute the expression for the photon propagator. In order to show the gaug..independence of the Feynman
amplitude, let us use the expression in Eq. (8.40),
(9.31)
176
Chapter 9.
Quantum eJectrodyn8111;cs
From the definition of k a above, we have
(9.32)
Now yre use the relations jYU(p) = mu(p) and u(p)p = mu(p), which
weregiven in Eqs. (4.46) and (4.48), and find that this expression
vanishes. As a result, the terms involving k~ in the propagator do
not contribute to the amplitude. Similarly, for the other diagr8111
the terms involving k~ do not contribute. Since these are the terms
which contain the gaugefixing parameter . we have shown that the
amplitude does not depend on~. We are thus left with only the
g~vterm in the propagators, and this gives the following expression
for the total amplitude:
(9.33)
where we have written tt, for u(p'tl etc. This expression is clearly
antisymmetric if we interchange the two electrons in the final state.
Since the denominator in the second term can also be written 88
(PI  p2)2, it is straightforward to see that the expression is also antisymmetric under the interchange of the two electrons in the initial
state. This is of course how it should be, since the electrons are
fermions.
OUf aim is to calculate the scattering cross section. For this, we
will need the absolute square of the Feynman amplitude. If we do
not care about the spin of the electrons in either the initial or the
final state, we will need to sum this over the final spins and average
over the initial spins. The techniques for doing this have already
been described in 7.2.1. In this case, averaging over the initial spins
would involve a division by 4, since each electron can have two spin
states. Thus we obtain
9.4. Electronelectron scattering
177
The sum over spins can be turned into troces, as we discussed in
various examples of Ch. 7. Here we obtain
where
TIl =
Tr[(P, +mhv(P',
+mh~I'lr[(p,+mhV(p~+mh~]'
T" = Tr[(p,+mhv(p; +mh~ITr[(PI +mhv(p~+mh~]'
T i , = Tr [(p, + m)(p; + mh~(p, + mhAp', + mh~l,
T" = Tr [(p, + mhv<P'i + mh~(p, + mhV(p~ + mh~l. (9.36)
We will now calculate these traces.
Calculation of the traces
Evaluation of the traces is straightforward, using the trace formulas.
For example, for any two vectors a and b,
which takes care of the traces occurring in
obtain
TIl
Til
and T22 . Thus we
= 16IP,vp;~ + Pl~P"v  g~v(PI .1",  m')]
x 1P2P;~ + ~p;v  g~V(P2. P; _ m')]
= 32[(p, . P2)' + (PI p;)' + 2m'(rn'  P, . p'd) ,
(9.38)
where in the last step we have used
, ,
(9.39)
PI . P' = P, . P,
and other similar equations, which follow from momentum conservation, PI + P2 = P; + P2' Similarly, the second term can be obtained
by interchanging P'J and P2. which gives
T"
= 32[(1"
. P2)'
+ (Pi 1';)' + 2m'(m' 
PI .1';)).
(9.40)
Chapter 9. Quantum electrodynamics
178
Let us now look at the other two terms. Since an odd number of
"'(matrices have vanishing trace, we can write
TI2
[PI"!VP2"!"P2"!VP; "!,,]
+m 2Tr [PI"!Vp2,,!""!V"!V + PI"!v"!vp2"!v"!v + PI"!v"!""!vP;"!,,
+"! v.J.'
l' 2"!".J.1'2"!v"!,, + "!v.J.'
I""!""!vl'.J. 0" + "! v"!v.J.1""!VI'.J.'I"!" ]
+m 4Tr [,,!V"!""!V,,!,,] .
(9.41)
= Tr
Using the contraction formulas of Eq. (4.42), this can be rewritten
as
T I,
[PIP,"!vP2P;"!,,]
+4m' Tr [P,P2 + P,P, + PIP; + P,P2 + P'IP2 + PIP,]
_2m 4 Tr [,,!v,,!,,]
32pI . P2 P2 . p, + 32m' [PI . P2 + PI . P2 + PI' PI]  32m 4 .
2 Tr
(9.42)
Note that this contribution is unaffected under the interchange
P2, so T,I must be the same as this. Thus we obtain
p',
(9.43)
using Eq. (9.39) and similar equations to simplify the expression.
Differential crosssection in the eM frame
We now need to insert these expressions into the general formula for
elastic crosssections in the eM frame, Eq. (7.107):
do
d!l =
1 ,
(9.44)
641T'S!.4f1 ,
We have expressed the traces involved in 1.4fI' in terms of the dot
products of various momentum vectors. We can express the denominators of Eq. (9.35) in terms of dot products as well:
(P I (PI 
PII ) ' =
PI2 + PI"
2PI' PII = 2('
m
P2)' = 2(m'  PI .P2)'
PI .PI ,
I )
(9.45)
9.4. Electronelectron scattering
179
All these dot products can be expressed in terms of the Mandelstarn variables, of which only twa are independent. Here we chOOBe
not to use the Mandelstam variables, but the scattering angle II, as
shown in Fig. 7.1, as one of the parameters. The energy of any of the
four external particles will be the other parameter. Note that each
particle has the same energy E in the eM frame. We then obtain,
for the dot products appearing in 111 2, the following expressions:
PI . P2
= E2 + p2 ,
PI . PI,
= E 2  P 2 cos II ,
PI . P2I = E2 + P 2 cos II ,
where p2 = E2 _ m 2. This gives (PI  p;)2
Collecting all the terms then, we obtain
4
111 2 = 2e
p4
(9.46)
= _2 p 2(1 
006(1) etc.
(E 2 + p2)2 + (E2 + p2cosll)2 _ 2m 2p 2(l 006(1)
(1  cos (1)2
(E2 + p2)2 + (E 2 _ p2 cos (1)2 _ 2m2p 2(1 + 006(1)
+ ,=',=,c.....:.'='T.(1"+"'::':'cos::":""11)"2='"''''''''::':':'':''!+2
(E2 + p2)2 _ 2m2(E2 + p2)}
2
sin II
(9.47)
The differential cross section in this frame is then given by Eq. (9.44),
with s = 4E2. The total cross section is obtained by integrating over
the angular variables. But since the final particles are identical,
only half the phase space is available, which means that II should be
integrated from 0 to 1r /2.
o Exercise 9.5 Argue th.at the. croS8 section for the positronpoaitTOT\
8CQ.ttering should be. the same.
o Exercise 9.6 Show tha.t in the. ultra.relo.ti\lutic limit E ~ m , the.
differential croBSsection becomu
(9.48)
180
9.5
Chapter 9. Quantum electrodynamics
Electronpositron scattering
Next we consider electronpositron scattering,
e(PI) + e+(1'2) ~ e(p;)
+ e+(p;).
(9.49)
This process also occurs at the second order in the Smatrix expansion. There are two diagrams corresponding to this process, shown
in Fig. 9.5.
e(ptl
e(p;)
A(kb )
e+(1'2)
e+(p2
(b)
(a)
Figure 9.5: Lowest order diagrams for electronpositron scattering in QED.
Just as for the e e scattering, we shaH use the Feynman rules
for QED to write down the. Feynman amplitudes for these diagrams.
The amplitude for the diagram marked (a) is
LII. = IU(P',)(iey~)u(pI)]iD~V(k.)[v(p2)(ie'Yv)v(p;)).
(9.50)
Here ka = PI  PI = P2  1>2, since momentum is conserved at each
vertex. For diagram (b), we obtain
iJlb = IV(P2)(iey~)u(ptl]iD~V(kb)[U(p;)(ieyv)V(P2)]'
(9.51)
P;
where kb = PI + 1'2 =
+ P2' The two diagrams are related by
a fermion exchange  the initial e+ is exchanged with the final eta get diagram (b) from diagram (a). Therefore the two amplitudes
should be added with a relative minus sign between them.
As 'before, the k~kv terms in the propagator do not contribute
because the spinors satisfy Eqs. (4.46) and (4.48), and we are left
with the following expression for the total amplitude:
JI
= JI. = e2 {
JIb
IU;'Y~udlv2'Y~V21 _ [V2'Y~Ul][U"'Y~V2J},
(PI  p;)2
(PI
+ P2)2
(9.52)
9.5. Electronpositron scattering
181
We shall consider the cross section of un polarized particles, so we
sum over the final spins and average over the initial spins. We get
Here again, we can use the techniques of Ch. 7 to write
of various traces,
T"
T 22
T2
'
T 21
IAfl 2 in terms
= Tr [(PI
+ mh"(p; + mh"] Tr rep;  mh"(p2  mh"] ,
= Tr[(P1
= Tr [(PI
= Tr[(P1
+ mh"(p2  mh"]Tr [(P;  mh"(p', + mh"],
+ mh"(p2  mh"(p;  mh"(p', + m)')'"J,
+ m)')'"(p'1 + m)')'(p;  mh"(P2  mh_l. (9.54)
We have seen how to calculate traces like these in 9.4. The results
can be obtained directly by interchanging P2 and Po in Eq. (9.36):
T" = 32[(PI . P2)2 + (PI' p;)2 + 2m 2(m 2  PI . p',)] ,
T 22 = 32[(PI . p;)2 + (PI' p;)2 + 2m 2(m 2 + PI . P2)),
T I2 = 32(PI . p;)2  32m 2[PI . P; + PI . P2  PI . p;] 32m 4 ,
T 21 = T 12
(9.55)
We note that T22 can be derived from Til by exchanging p\ with
P2. Similarly, the equality of T I2 and T21 follows from the fact that
T l2 is invariant under the exchange of pit with P2. Of course, these
traces can be derived hy direct calculations as well.
As before, we will express the dot products of the momenta in
terms of the scattering angle 8 and the energy E of each particle in
the CM frame. In this frame, these will he the same as in Eq. (9.46),
182
Chapter 9. Quantum electrodynamics
and we can write
1...K1
2e'{
(E 2 + p2)2
+ (E2 + p2cosB)2 _
2m 2p 2(1_ co. B)
p'(l  cos B)2
E' + p' cos 2 B + 2m2E 2
2E'
_ (E 2 + p2 cosB)2 + 2m 2(E2 + p2 cos B) }
E2 p 2(1 _ cosO)
.
(9.56)
Using s = 4E 2 in this frame, we can calculate the differential cross
section from Eq. (9.44). Using", = e 2/4rr, we can write this in the
high energy limit E m as
,0
du
dO
",2
1 + cos
8E2 (
. ,0
sm 
2
+
1 +cos 2 0
2
~)
2 cos'
. B
sm 2 2
(9.57)
Earlier we discussed the elastic scattering process for the electronpositron pair. If the energy of the incoming electronpositron pair is
high enough, the scattering need not be elastic. In the final state,
we can obtain a particleantiparticle pair corresponding to a ,higher
mass. Among the known elementary charged particles, the electron
is the lightest, and the next lightest is the muon. Here, we calculate.
the crosssection for muonantimuon production, although the analysis can be used with minor modification for the production of any
fermionanti fermion pair.
e(kIl
1l(pIl
A(k)
Figure 9.6: lowest order diagrams for
the process
e e+  jJ.Il+ in QED.
183
There is only one diagram at the lowest order, shown in Fig. 9.6.
The muon has t.he same charge as the electron, and so its vertex
with the photon would also be the same as that for the electron.
The Feynman amplitude can then be written as
.4 =
(9.58)
.; [v(k 2)y'u(kd] [i:i(Plh"v(P2)],
where s is the Mandelstam variable, which comes from the photon
propagator. We have denoted the initial state momenta by k , and
k 2 , and the final state momenta by PI and p,.
In calculating the crosssection, th~re is 110 point in keeping the
electron mass l since the energies involved must be higher than the
muon mass. We will however keep the muon mass m w Then
1.41 2 = 4~ L. 1..#1 2
spIn
e
4;2 Tr
[~n"~l1'P] Tr [(PI + 71l"hA(P, 
m"hp]
4e
[k 1Ak p + kPI k 2A  9 APk I ' k2 ]
.2
2
X
[PIAP2P + PlpP2A  g"p(PI . P2 +
m~)]
8e2 [ (k , pd 2
S
+ (k .l
P2) 2
2
+ m"k,
. k2 ] .
(9.59)
We have used kinematical identities like k 1 . PI = k2 . P2 in writing
the last step.
In the CM frame, if 0 is the angle between the incoming electron
and the outgoing muon, i.e., between k l and Pl, we obtain
k, . PI = E(E  pcosO),
k l P2 = E(E + pcosO).
k , . k2 = 2E 2 .
(9.60)
Here E is the CM eJr gy of the incoming electron, i.e.,
p = Ipll = Ip21 = ~ .<  4m~. Thus
1....1112 = e
= 4E2 , and
4[1+ ;2 (p2COS20+71l~,)]
=.' [ 1 + cos' 0 + m2
E~ (1 
cos 0)
(9.61)
184
Chapter 9. Quantum electrodynamics
Using the general scattering formula of Eq. (7.106), we obtain
o2~m~
4m~
2]
1   [ l+cos 28+{Ic05
8)
da= dfl4s
(9.62)
which leads to a total crosssection of
2
4"",2~m2
[
2m
0=1~ l+~]
38
(9.63)
Earlier we calculated the cross section for elastic e e+ scattering. If
the initial energy is sufficiently high, s > 4m~, that cross section will
be augmented by this inelastic contribution. Thus the total cross
section will increase. Creation of heavier particles increase the cross
section at even higher energies. This provides a tool for searching
for new particles.
9.7
Consequence of gauge invariance
So far, we have performed calculations involving only fermions in
the initial and final states. Now we discuss processes which involve
photons either in the initial Dr in the final states, Dr both.
For any physical process involving a photon with momentum k
and polarization vector f~(k) in the final state, the Feynman amplitude may be written in the form Jt = M~f~(k). We now refer back
to the calculation of the Smatrix elements in Ch. 6 and recall how
this amplitude arises. Let us suppose that the photon was created at
the spacetime point x. Then in the terms in the Smatrix, we must
have made use of a term J d 4 x .Jt'}{x), apart from any other terms at
any other spacetime points that may have entered the calculation.
In other words, the Smatrix element must have arisen from
(9.64)
where ,( k) denotes the photon and f' denotes any other particle that
may be present in the final state, whereas the dots denote any other
factors of the interaction Hamiltonian that might be present. Once
we use the analog of the oneparticle states shown in 6.2, we will
obtain
if'~(k) Jd'x ~ (I' IMx) 1 i) .
(9.65)
9.8. Compton scattering
185
Thus the quantity we called M ~ must have arisen from the quantity
inside the integral sign, after factoring out the initial a:nd final state
factors.
Now imagine replacing "~(k) with k~ in the expression above.
This will give
ik~
rrx ; ; ; , (I'
= 
1J~(x)1 i)
(8"e;"%)
rrx
J2WV (I' IMx) 1 i)
rrx ; ; ; , (I'
l8"j~(x) ... \ i) .
(9.66)
The last step is obtained by partial integration. Since the other factors, represented by the dots, are not at the point x, the derivative
does not act on them. This shows that the quantity under consideration vanishes since 8"j~(x) = O. Thus if we replace the polarization
vector in an Smatrix element by the corresponding momentum vec~
tor, the amplitude goes to zero. In other words,
(9.67)
Similarly, we can prove that the same relation would hold if there is
a photon in the initial state. Said in another way, if we make the
substitution
(9.68)
in the amplitude, the coefficient of t') shouid be zero.
For physical photons, the consequence of using the Lorenz gauge
condition translates in momentum space to the relation
k. =
o.
(9.69)
So the substitution of Eq. (9.68) does not affect the normalization of
the polarization vectors for physical photons, given in Eq. (8.51).
9.8
Compton scattering
Compton scattering is the name given to the elastic scattering of
electron and photon:
e (P, 8)
+ 1'(k, r)
~ e (P', 8')
+ 1'(k' , r ' ) .
(9.70)
Chapter 9. Quantum electrodynamics
186
In the parentheses after the electrons, we have put the notations
for their momenta and spins. After the photons, we have indicated
their momenta and polarizations. The lowest order diagrams for this
process are given in Fig. 9.7. There are two diagrams, marked (a)
and (b).
..,(k)
..,(k)
c(p)
..,(k ' )
..,(k' )
(b)
(a)
Figure 9.7: Lowest order ditlgrams for Compton scattering in QED.
The Feynman amplitude
The amplitudes for the two diagrams can be written as
U(. =
+;_
[U(pl)(ie.,~) P
LK'b = [U(pl)(ie.,V) P_ ~~
m (ie.,V)U(P)]
<;,~(k')<rv(k),
_ m (ie.,~)u(P)] <;,~(k'l<rv(k).
(9.71)
Summing up the two contributions, we obtain
2
.A = _e u(p')
[I" p +:_ml +I P _ ~~ _ ml"] u(p)
P+ ~ + m J J P f + m J,.] up
()
F (p+kj2_m2F+F(p_kl)2_m2F
2( ') [J ,.
= e up
(9.72)
for the total Feynman amplitude, where we have used the shorthands
<v
=frv(k) ,
<~
fr,~(k').
(9.73)
We note an interesting point here. If we make the substitutions
k _ k' ,
f+(
1
I
(9.74)
in the amplitude of Eq. (9.72), it is invariant. Physically, this means
that if we interchange the photons in the outer lines, the amplitude is
unaffected. This is the result of the indistinguishability of photons,
and such symmetries are called crossing symmetries.
9.8. Compton scattering
187
Gauge invariance
The consequence of gauge invariance has been discussed in 9.7. Let
us first verify it for the present amplitude. We rewrite Eq. (9.72),
shifting the polarization vectors by a term proportional to the corr~
sponding momentum. Denoting the resulting amplitude by ~', we
can write
~' =
e 2u(p') [U"
+ '11")
p+m
U + 11)
+U H19) P_ ~ _ m U" + '19")] u(p).
(9.75)
Thus,
Let us look, for example, at the coefficient of 19. In the first term,
we can write
If.
(p+m)  (pm).
(9.77)
Since this expression operates on' u(p) to the right, the second parenthesis gives zero while acting on it. The other part cancels the propagator, and so we are left with just 1". Similarly, in the second term,
we can write
If. = (p' m)  (p' If. m).
(9.78)
Here also, the first parenthesis vanishes because of the spinar on its
left. The other part cancels the propagator but for a sign, using
p + k = p' + k' from momentum conservation. Thus, this cancels the
first te~m. One can similarly see that the coefficient of {j'. vanishes,
and so does the coefficient of
a{)'.
Choice of gauge and frame
Having established gauge invariance, we can use it to our advantage.
For example, given a frame, we can choose anyone component of the
Chapter 9. Quantum electrodynamics
188
polarization vectors to be zero by using this freedom. We will choose
f"
<'"=(0,.')
= (O,f),
(9.79)
in the Lab frame in which the initia.l electron is at rest. The advantage of this choice is that
pt=O=pt
l
,
(9.80)
which make the expressions particularly simple.
For example, the first term in Eq. (9.72) contains the expression
(j + m)/u(p) = (P"fv'Y"'Yv + mfv'Yv)u(p)
= [P"f (29"v  "Iv')") + mil u(p)
= [2p <+ I( p + m)1 u(p)
V
= 2p fU(p).
(9.81)
This derivation makes no assumption about the choice of the polarization vector or frame. If however we stick to the choice of Eq.
(9.80), this expression vanishes. The argument is no different if the
polarization vector on the right is f'. So the amplitude of Eq. (9.72)
can be written as
.A =
e 2 u(p')Fu(p) ,
(9.82)
F=
nl Iln' .
+
2pk
2p'k'
(9.83)
where
We have used the onshell conditions
(9.84)
to simplify the denominators, and assumed the polarization vectors
to have real components for the sake of simplicity.
W, will derive the crosssection in the Lab frame for specific polarizations for the initial and the final photons. The averaging over
the initial electron spin will be done by summing over the spins and
dividing by 2. Thus,
1.A1 2 = ! L I.AI'
2 spin
~e4 Tr [(j' + m)F(p + m)FI]
(9.85)
9.8. Compton scattering
189
and in this case,
(9.86)
Since in the Lab frame
pk=mw,
p. k' = rnw',
(9.87)
where wand w' are the initial and final photoll energics. we can write
2
1.411
8m2
[Til
w2
22 + T l2 + T 21 ]
+ Tw'2
ww'
(9.88)
In this formula, Til through T2\ arc various traces, which we evaluate
now.
Calculation of the traces
We start with Til, which is given by
Til = Tr [tp' + mll'Mp + m)I~I']
2
= Tr If' I'm)I~I'] + m Tr [l'm~1'J
(9.89)
We first show that the term proportional to .",2 is zero. For physical
states of polarization, we can ~se II = (2 = 1. no... given in Eq.
(8.51). This enables os to write
Tr [l'm~l'] =  Tr !I'm']
=  k 2 Tr lI'l'l
= 0,
(9.90)
using Eq. (9.84).
As for the other trace in Eq. (lI.89). we Gill lise the relation of Eq.
(4.65) repeatedly to reduce it, "nn then usc <1" (9.80) and (9.84).
This gives
Til = 2p f Tr [p'nm'J  Tr [P'I'PlI~1'J
= Tr If'np~l']
= 2p k Tr [p'I'~1'J  Tr [P'I'#~I'] .
(9.91)
Again, 11/1 = k 2 = 0, so the second trace vanishes, and we are left
with only the first term, which gives
(9.92)
190
Chapter 9. Quantum electrodynamics
UsingEqs. (9.69) and (9.80), we can writep',' = (p+kk')., =
kf'. Also. using momentum conservation, we find p'k = p.k'. Thus ,
finally we can write
Til
= 8mw
[2(k.' ')2
+ mw']
(9.93)
using Eq. (9.87).
It is now easy to evaluate T221 which is given by
T 22
Tr [(p'
+ m)/f"(p + rn)/T/]
(9.94)
This is related to Til by crossing symmetry mentioned in Eq. (9.74).
Thus we can immediately write down the result of the trace, using
crossing symmetry OIl the final result for Til. This gives
Tn
= 8mw'
We are now left with
T l2
[2(k'. ,)2 
and
T21'
mw] .
(9.95)
which are defined as
T l2 = Tr [(p'+m)l'l(p+rn)"W/]
T21 = Tr [(p' + m)/f' I'(p + m)/fl'J
(9.96)
A useful result, shown in E'l. (A.28) of Appendix A, is that the trace
of a string of ')'matrices is unaffected if we take the string in reverse
order. Thus
T21
Tr
[1'II1(p + m)I''/(p' + m)] ,
(9.97)
which is the same as T I2 because of cyclicity of trace. So we can
evaluate either one of them. For example, starting with the expression for T l2 and using momentum c.onservation. pi = P + k  k'. we
can write
T l2
= Tr [(p + m)1 'M + m)"f I]
+Tr W"N(p + m)I''I]  Tr W' I 'f/(p + mll'f I]
(9.98)
In the last two traces, we can drop the [Jla.~5 term in the parcnthesp~<;
since it involves all odd !lumber of ...,matriccs. Then lIsing t.he cyclic
property of trace and the relations in E'ls. (n.6n), (n.SO) and (9.84),
9.8. Compton scattering
191
we can reduce the second trace to the following form:
2nd term in T 12 = 'IT [(2k t' 1'1f.)NPi'l/'il
= 2k <' 'IT [If.hli'If.'il
=  2k . <' 'IT [iIf.Pi' If.'il
=  2k . <' 'IT [If.Pi' I/' iil
= 2k . ' 'IT [/f.# 'If.']
= 8(k. ')2 p' k'
= llmw'(k. ')2 .
(9.99)
Following exactly similar steps, we can find that
3rd term in T 12 = 8mw(k' . )2 .
(9.100)
As for the first term in T 12 , we can reduce it by writing
(j + m)l'lf.i('p + m) =
1'( p + m)lf.(p + m)i
= i'[2p k +If.(p + m)J (p + m)i,
= 2pkj'(jm)i,
(9.101)
where we have used Eqs. (9.80) and (4.65). Putting this back and
noting that the term proportional to m has an odd number of "(matrices, we obtain
1st term in T I2
= 2pk'IT
[i'PiI'If.'i]
= 2p k 'IT [i' #1/' tiJ
= 2p k'IT [(2 t'  ii')PI/.'til
= 4p' k . t' 'IT [PI/.' tiJ  2p . k 'IT [pIf. ')
= 16pkp.k'(.<,)28p.kp.k'
= 8m2 ww'[2(. <,)2 _ 1] .
(9.102)
Adding all the contributions, we obtain
T I2 = T 21 = 8m 2ww' [2(' ')2 1]  8mw'(k t')' + 8mw(k' . )2.
(9.103)
Putting the results for
Til
through T21 into Eq. (9.88), we get
['ww
12
I"H'I 2 = e 4  + W + 4( . )
 2 .
w'
(9.104)
192
Chapter 9. Quantum electrodynamics
The differential crosssection
Substituting this in the general formula of Eq. (7.118) for 2t02
scattering obtained in the lab frame, we obtain
1~12
du
1
dn=641T 2(m+wwcosO)2
(9.105)
'
where 0 is the angle at which the photon is scattered, i.e., the angle
between the directions of the incoming and the outgoing photons.
Noting that the energy of the scattered photon is given by
,
w
w = 1 + (w/m)(1 _ cosO)'
(9.106)
which follows from the general expression given in Eq. (7.115), we
can write the differential crosssection as
~ (W')2 [W' ~ 4.
2] .
du =
+ +
<')2 _
(9.107)
dO
4m 2
w
w
w'
This is called the KleinNishina formula, after the people who derived
it.
The total crosssection can be obtained by integrating over the
angular variables after we substitute w' from Eq. (9.106). But there
is a word of caution. In the expression for the differential crosssection, the dot product of the initial and final polarization vectors
appear. This may also depend on the angle O. Below, we calculate
this dependence explicitly in a specific case.
Differential crosssection for un polarized photons
If our initial photon is unpolarized and we do not measure the polarization of the final photon, we should average over initial polarizations and sum over final ones. Since there are two physical states
of polarization, this means that we should sum over initial and final
polarizations, and divide the result by 2. For all terms which are
independent of the polarization vectors, this procedure will give an
overall factor of 2. For the polarization dependent expression .')2,
it is better to revert back to the longer notation which was given up
in Eq. (9.73). We need to evaluate
~L
r,r'
[<,(k) . <,,(k')j' =
~ [L <,;(k)<,j(kl]
r
[L <",(k')<"j(k')] .
r'
(9.108)
9.8. Compton scattering
193
For either of these sums, we can take a look at the various choices of
polarization vectors given in 8.5, which give
(9.109)
where the sum is over transverse, i.e., the physical polarization states
only. Using this for the real polarization vectors that we are considering here, we obtain finally
~ L [<,( k) <,' (k')
r,r'
~ (I + cos 2 0) ,
(9.110)
where 0 is the angle between k and k'. So we obtain
d"l
dO
unpol
Ot
2 (W')2 [w'
= 2m2
:;
+ w'
2 ]
 sin 8
(9.111)
o Exercise 9.7 Den'lle the diffe.rential cross section 101' unpotarized
photons using th.e 'Polarizo.tion sum gillen in Eq. (8.8B). INote: You
cannot use Eq. (9.19) at the. same. time.. I
Total crosssection for unpolarized photons
The integration over angular variables is now straightforward, and it
gives
ounpol
21fOt
2
[
(
+r
. 2
2 )2 + "2
1+ r
T
where' = w/m. If
2(1
+ r)
2 .,
ro
,2
In(1
+ 2,)
. (9.112)
I, i.e., if the incident photon has very low
energy, then this reduces to
87l'"Q'2
ounpol
= 3m 2 = 0.665
10
24
cm.
(9.113)
The crosssection in this limit is called the Thomson crosssection.
This is often taken as a benchmark value for electromagnetic crosssections. Eq. (9.113) can be deduced using the methods of classical
electromagnetic theory, assuming that the frequency of the incident
radiation does not change in the process of scattering. As seen from
Eq. (9.106), this is a good approximation if w m.
Chapter 9. Quantum electrodynamics
194
9.9
Scattering by an external field
So far we have talked about the electromagnetic field as a dynamical
entity which can be quantized in terms of photon creation and annihilation operators. What ha.ppens if we have an electron moving
through a field such that the back reaction of the electron on the field
can be neglected? In such cases the 'external' field can be described
safely by classical functions.
Suppose we have a heavy nucleus which scatters an incoming
electron beam. We can treat the electromagnetic field of the nucleus
as a static Coulomb field in the rest frame of the nucleus. If the
nucleus is sufficiently heavy, we can ignore the momentum imparted
to it by the scattered electron. Let us then split the vector potential
into an 'external' part Ae~ and a dynamical part A~, and write the
interaction Lagrangian as
(9.114)
Nothing new has been done yet.
p'
+Ze
Figure 9.8:
Scattering
of
electron
by
nucleus
of charge +Ze.
However, neglecting the change of momentum of the nucleus
means that we can get a nonzero first order process out of this.
Let us he specific and consider the situation where an electron of 4momentum p is scattered by a static Coulomb field. The first order
term in the Smatrix in this case is
(9.115)
We know from 9.3 that the dynamical part does not contribute
because of energymomentum conservation. So we are left with only
9.9. Scattering by an external field
195
the contribution from the static texternaP part ,
(9.116)
Since the external field is independent of time, we can write it as
d q e ;q.ZA" ( q ) .
(211")3
A" = A"(
x ) 
(9.117)
So the transition amplitude is
( e (p')
15(1)1
e (p))
= ie
x
1
(1
d"x
j2E'V
d3q
(211")3 e
u(p')e;P''
.qz 4 ( ) )
1
'p., (P)
", q
j2EVe
u
. 211"6(EE') 1~ 63(
,
 ')4 )
j2EVj2E'V
q
q  p + p)u(p ",(q u(P)
te
_ 211"6( E  E') il( .
 ,j2EVj2E'V
/,
9.118
where
1(/. = eu(p') 4,(q = p'  p)u(p).
(9.119)
The Smatrix element can be viewed as in Ch. 6 provided we
make the following adjustments:
1. In the overall kinematic factors appearing in the Smatrix, re
place (211")46 4(p.  PI) by. (211")6(E;  E/). This implies that
the 4momentum is not conserved at the vertex. This is not a
failure of the theory. Rather, it means that we are ignoring the
3momentum transferred to the nucleus.
2. In the Feynman amplitude part, for each interaction vertex
with an external static field write a factor
q
~
(9.120)
o Exercise 9.8 The nucleus is at rest in the Lab frame. ]f the energy
E of the initial electron is negligible compared to the mo.s! M oj the
nucleus, ShOUl from the kinematics that the energy ' of the final
electron is a.pproximateLy equa.l to the initial energy.
196
Chapter 9. Quantum electrodynamics
Putting this back into Eq. (9.118) and squaring the Smatrix
element, we obtain the transition rate to be
ISld
a(E  E')
IT
2
= 2E'V 2EV 14f1 ,
(9.121)
where we have used the arguments leading to Eq. (6.17) to put
la(E 
Ell 2 = ~ a(E 
E'),
(9.122)
T being the time in which we perform all the calculations. The cross
section can then be written, in analogy with Eq. (7.88), as
1
dn
~~
= 2p (211')32E'
2
Ell4f1 ,
(211')a(E 
(9.123)
where the factor of 2p sitting in front appears as a product of 2E
which comes from the initial state factor, and the velocity of the
incoming particle, which is piE. The energy conserving afunction
ensures p = pi, Integration over the magnitude of p' gives
dn
dO
2
1611'2 14f1
(9.124)
Let us calculate the crosssection of scattering of an electron from
a nucleus of charge Ze using these rules. The external field corresponding to the nucleus is the fourvector
A~(x) = (Ze
,0,0,0)
411'r
(9.125)
The Fourier transform of this is
A~(q) = (~~,o,o,o)
(9.126)
Then,
(9.127)
9.10. Bremsstrahlung
197
Denoting the angle between p' and p by B, we have
Therefore,
1~12 = 27r2(Za~2
p4 sin4 _
47r 2(Za)2
p' sin'
(9.129)
Pntting this back in Eq. (9.124), we obtain
da
dO =
(Za)2
4p' sin'
~2
(9.130)
Since the velocity v is given by v = pIE, we can rewrite the
differential cross section as
(9.131)
In the nonrelativistic limit, this reduces to the Rutherford scattering
formula:
(9.132)
9.10
Bremsstrahlung
Bremsstrahlung, or braking radiation, is the radiation produced by
the deceleration of a charge by an electric field, such as that of a
heavy nucleus. There are two diagrams at the lowest order of QED
which contribute to Bremsstrahlung of an electron in the field of a
static nucleus, shown in Fig. 9.9. Both the quantized field and the
external field now play a role. However I there is a serious problem
with the scattering amplitude as calculated for the process. We run
198
Chapter 9. QUl1J1tum electrodynamics
pi
pi
+k
pk
~q=p'+kp
~q=p'+kp
+Ze
+Ze
Figure 9.9: The two lowest order diagrams contributing to Bremsstrahlung.
into infmred divergence  the crosssection becomes infinite as the
energy of the emitted photon goes to zero. It is as if an electron
passing by a heavy nucleus produces a very large number of very
lowfrequency photons.
The Feynman amplitude for the emission of a photon is
iJ( = ie 2 u(p') [/(kl ; : : m 4e (q) +4e(q)P
=1 \m
p
I(k)] u(p).
(9.133)
The matrix element for the transition is then
(f lSI i) = 2rrb(E  E '  w) ~
2EV
The final density of states is now
is
:v
2E'V
Vd3 p'Vd3 k
(211")
6'
~ J(.
2wV
(9.134)
and the incident flux
so that the differential crosssection is
(9.135)
da =
In the soft photon limit w '" 0, using Eq. (9.81) and similar
equations, we can write the Feynman amplitude as
2
p.,
p., ]
iJ( '" ie u(p')4e(q)u(p) [  I
 p'k
pk
= teLa ,    .
p .,
p.,
p'k
pk
(9.136)
9.10. Bremsstrahlung
199
where Ao is the Feynman amplitude for scattering without emission,
Eq. (9.119). This gives
du
dO =
(du)
dO
[p'"
p.
<>
f d k
]2
(2,,)2 1m ~ p'. k  p' k
+ .. , , (9.137)
where the integration is limited to small values of w, since this is the
region we are interested in. This is indicated by the subscript 'IR'
on the integral, and the dots stand for the contribution from larger
w. Tbe differential crosssection (as well as the Feynman amplitude)
clearly diverges as w ~ O. This is called the infrared divergence.
o Exercise 9.9 ShOUl tha.t the crosssection, Eq. (9.13'7L wh.en
summed OtleT fill pola.rizo.tions, is
One way this divergence can be understood is by noting that
when k "" 0, the momentum of the internal electron is k + p' "" p'.
But since p" = m 2 , the internal propagator is divergent. Physically,
what we find here is that it is possible to fit in an infinite number of
zero energy photons in the radiation.
This is not a problem specific to this process.
In fact,
Bremsstrahlung can take place from a charged external line in any
process. For example in electronelectron scattering, a photon can
be emitted from any of the external lines. Such processes also suffer
from the same infrared divergence problem. The infrared problem
has its roots in the classical theory, but it is quantum field theory
that comes to the rescue. Although the divergence shows up at the
lowest order quantum calculation, the removal of the divergence requires higher order corrections. This will be discussed in Ch. 12.
1
Chapter 10
P, T, C and their combinations
As we mentioned before, the symmetries of a physical system provide
important clues to the structure of the Lagrangian. So far, whenever
we talked about symmetries, we meant contiuuotls symmetries. In
Ch. 2, for example, we discllssed the consequences of invariancc of a
Lagrangian under such continuous symmetries and derived Noether's
theorem. We applied Noether's theorem in various circumstances in
subsequent chapters. Finally, in Ch. 9, we discnssed how a continuous
symmetry can be realized as a local symmetry if some gauge fields,
like the electromagnetic field, is present.
There is another clac;s of symmetries, called discrete symmetries.
The name implies that the transformation relating to these symmetries cannot be viewed as the continuous change of a variable.
Rather, a discrete operation is involved. Some sHch symmetries are
extremely important in understanding the physical content of a theory and restricting the form of interactions in the theory. In this.
chapter, we discuss some symmetries of this sort and discuss their
physical implications.
ID.1
Motivations from classical physics
We begin our discussion by noting how some discrete symmetries
are inherent in the law~ of classical physics. For this, consider the
equation of motion of a nonrelativistic t~st particle of mass m and
charge q in the Coulomb field of a particle of charge q':
d2x
qq'
m dt 2 = 4rrr' x ,
200
(10.1)
10.2. Parity
201
where r = IxL and we took the origin of our coordinate system at
the position of the charge q'. Note that if we make the substitution
xx,
(10.2)
this equation remains invariant. This operation is called parity operation, and therefore parity is a symmetry of Eq. (10.1). Another
discrete symmetry which keeps the same equation invariant is time
reversal l which is defined to be the transformation
+ ~
t.
(10.3)
Since Eq. (10.1) involves a double rlerivatiVl' of lime on the left hand
side, it is unaffected by the change of .':iign of tillle.
Apart frolll havillg parity and time rev('r~al ~ymmctrics, Eq.
(10.1) is also invariant jf we change til(' sign of the charges of each
particle:
q ;  q,
q 
(10.4)
 If
This symmetry is callC1:! charge conjuglltifln sylllH1ptry. In the rest
of this chapter, we will discuss how tlJes(' S.vIIlIJH~tries c.an he implemented into quantum field theory,
Exercise 10.1 Verily that llllTity rtnd
lTnnJ:'llormationa.
10.2
Parity
10.2.1
Free scalar fields
til1ll'
l'l'll('l'RUl <In'.
Lonntz
Let us start with free Lagrangians and sec wbd.lwr they rcspect the
parity symmetry, COil sider first the free Lag:rang:irtll of a. real scalar
field, Eq. (3,5). \Vc write it more pxplir.itl.y IIl'n~, showillg the time
and the space derivatives separately:
(10.5)
The parity transfonna.tion is just
coordillates
lI1ea."llrill~ t.lliugs
i=(t.x).
!Ising a system of
( 10.6)
Chapter 10. P, T, C and their combinations
202
Since
is related to c<:rordinates x by a special kind of Lorentz
transformation, let us try to represent the parity transformation also
as a constant
linear transformation on the fields,
,
p(x)
= P(X)p1
= ryp(i) ,
(10.7)
where P is the parity operator and 'IP is independent of spacetime.
If we write the Lagrangian in terms of this paritytransformed field,
but in the original coordinates x, the result should be the same
as writing the Lagrangian in terms of the original fields, but coordinates i. In other words, parity invariance means
P~(X)pl = ~(x).
(10.8)
So the action remains invariant. Now,
P~(X)pl
[(8,p(x))'  (V',,p(x)f 
m'~(x)],
(10.9)
and Eq. (10.8) holds provided rll' = l, i.e.,
p(x) _ P(X)pl = (x).
(1010)
The factor ryp is called the intrinsic parity of the particle corresponding to the field , whose values can be 1. For a complex scalar field,.
we obtain in the same way that ryp can be allY arbitrary phase. But
parity operation done twice is the identity operation, which restricts
ryp to the values 1. From the free Lagrangian, there is no way to
choose between these two possibilities for intrinsic parity. We will see
later that this need not be the case when interactions are introduced.
o Exercise 10.2 Cnlcula.te the effect of the parity o"pt:Ta.tor on the
cT'wtion and o.nnihilo.tion operators of <1>. As:mming that the uacuum
is parity inuo.riant, show that the onepm'tide stutes tro.nsfonn as
Plk) = ryplk).
10.2.2
(1011)
Free Dirac field
For fermions the situation is a bit more involved because a Dirac
fermion has four components. We start from the Dirac Lagrangian
of Eg. (4.84), written in a more explicit notation:
2'(x) = ,pix)
(ho8, + h V'"  m) ,p(x).
(10.12)
10.2. Parity
203
As in the case of the scalar field, let us write Pljl(x)pl as a constant
linear transform of ,p(x). Since parity is a special Lorentz transformation, it will mix the components of ,p like the usual Lorentz
transformations,
,pp(X) '" P,p(x)pI = P1/J(x) ,
(10.13)
where P is a 4 x 4 matrix, independent of spacetime.
Our task is now to find out whether there exists a matrix P which
will make the Lagrangian parityinvariant, i.e., will ensure Eq. (10.8)
for the Dirac Lagrangian. Since ,pp(x) = 1/J),(xho = 1/J t (x)pt')'o =
,p(xhopt')'o, we get
P!L'(x)pl = v'(xhopt')'o (hoo, + i,), V z  m) P,p(x)
= ,p(xhopt')'o (hoo,  i")' V z  m) P1/J(X). (10.14)
This will be the same as the expression for !L'(x) provided the matrix
P satisfies the following conditions:
ptp = 1,
')'opt')'o')',P = ')'"
')'opt')'oP = 1.
(10.15)
Of course there is a matrix for which these conditions are satisfied.
In fact, if we put P = ')'0, all these equations will be satisfied. A more
general solution can be written as
p=
'IP')'o,
(10.16)
where 'IP is called the intrinsic parity of the fermion. The parity
transformation properties of free fermion fields are then
1/Jp(x)
= P,p(x)pl
= 'IPI'O,p(x).
(10.17)
Applying parity transformation again, we find 'IP = l.
Exercise 10.3 Use the Dirac matric.es and the planewa.lle solution
in the. DirQ.cPa.uli 're'PTf'.sentation to uerify that
1'ou.(p) = u.( p),
1'ov.(p) =  v.( p).
(10.18)
Note tha.t these relations must be obeyed in anlj representa.tion of
the Dira.c matrices.
o Exercise 10.4
A~~ly
the
~arity
tTaltsjormatiolt rule, Eq. (IO.I?},
on the Fourier f'.z'Po.nsion of a. Iree fermion. field giuen in Eq. (496 .
Using Eq. (10.18), show that the single pa.rticle and a.ntipnrtic e
sta.tes transform oP'Positety under parity.
204
Chapter 10. P, T, C and their combinations
10.2.3
Free photon field
We now turn to the Lagrangian of the free photon field and examine
whether it is invariant under the parity symmetry. For this purpose,
let us rewritf' the Lagrangian,
y = 
~F;wFIW
4
~
= 
~(o"Av) [(0"
AV
[(doAiHaOAi  a i AO)
(tJ" A")]
+ (a,.1o)(tJ'A"
+(aiAj)(d'Aj  (PA i )]
 aO Ai)
(10.19)
In the parity tra.nsformed Lagrangian, AO and Ai should be replaced
by A~ and A~ ff'A..;;pectivl'ly. The result shows t.hat the Lagrangian is
parity invariant if we dC'finc the parity trallsforlllation on the photon
field by
= AO(i).
= A(i),
(10.20)
= A(\(i) ,
= AU).
(10.21)
A'!,(x)
PA"(:r)P'
AI>(J:)
PA(J:)pl
or by
Ay,(x)
PAO(x)p
Ap(J:)
PA(x)P1
From the poiut of view of the free field t.h('ory, we call1lot choose
between tile two. HOW{'VN, once we int.rodll(,{~ the sources allfJ find'
solutions for the cl('ctromagnetic field ill some specific ca...;es. the sit.~
uation change::;. For exa.mple,
at the origin, the snllltit)ll for
if the sourr.(~ is n point charge q
E j:. the Cuulomh solin,ion:
'I
E = Jx.
47l"r
at rc::;t
( 10.22)
Since the ri~ht band side is odd uncler pari ly tr,ul:;formation, we
conclude that. the elect.ric field E must 1)(' odd Huder parity. Thi:;
force:; us to choose Eq. (10.20) a.s the ('orn~('t. parH.y transformation
property of the photoll held, becau,,, E' = u"A,  D;A".
SIWUl lhut th.e VU1'ity t1'(t1I~ful'lTlution ruleR deJine.d
in boUt ECPL (10.20) cmd (lO.'2L) ubotH' kt'('P the.. free :\1 ltJ:lIlf'H t:qlW
lions (i.e,. r...1fLT.we.ll c'lunt.i.nTl.!\ in t.h(~ lI.b:;('lH'(' of !,;Olll'(,C~) inlJaT'ilL1l.t.
o Exercise 10.5
10.2. Parity
10.2.4
205
Interacting fields
We have thus shown that for all kinds of fields discussed earlier, the
free Lagrangians obey parity invariance. Of course, free Lagrangians
are no good for describing any physical phenomena, since physical
phenomena are interactions. The question therefore is whether parity
remains a good symmetry, i.e., Eq. (10.8) holds in t.he presence of
interactions. The answer depends on what the interactions arc. Here
we will discuss a few examples to gain some experience in this matter.
Let us st.art with a tht.'Ory of fcrmions amI real scalars, with an
interaction term
2iot =  h>/J>/J"',
( 10.23)
which was int.rodnced in Eq. (5.9). Under pmit.y t.ransformat.ion, t.he
combination 1/nj; remaius invariant, as is evident [rom Eq. (10.16).
Thus the above interaction term is invariant only if we have 'l1J = +1
for the field J. In other words, among the' twu choices available for
parity transformation of free scalar fields, we ;\rc forced to accept
onlyone.
If instead we have an interaction of th~ form
(LJ
..Lint
, , '
=  h
(10.24)
tjJ"(s1/)(j>.
parity invariam:e is obtained only if p(J") =  'Mi) by the same
arguments (1.<:; above. Scalar fielcls with Ilq~a.t.ive inl.rinsic parity or
odd parity arc called pseudoscalar field.". U:;illM this tcrlIlillo!ogy, we
can rephrase the earlier statement by sa.yillf.!,: tha.t in a theory wlwre
the term in Eq. (10.24) is the only interactiull t.enll, we have parity
invariance in the theory provided the field 'b is ~seudoscalar.
If we now include the selfinterac.tion terms of sca.lar fields in the
interaction Lagrangia.n, we find that pseudoscalar fields c:an only have
even order interaction terms, e.g., a q'J"1.t.CrIU, if parity is a symmetry
of the Lagrangian. If we have a. theory with t.ilt, int.eraction
'"
Lillt
=  h'
~
I <
tp"(0~)4) 
'1 IHIi
, ..I
AlJ')
( 10.25)
\ve find tha.t parity is not a symmetry of the interaction Lagrangian.
This is hecause with d>p(:r) =  (), til(' r/,:Jt.erlfl is not invariant
under parity. On the other haml, if we c1JoOSt' wp(:c) = J(i), both
J3 anel (/)4 terms arc invariant.. bllt. the illteract.ioB with t.he fcrmiol1s
206
Chapter 10. P, T, C and their combinations
is not. Thus there is no definition of the intrinsic parity of the scalar
field which makes this Lagrangian parity invariant.
Such an impasse can also occur without selfinteractions of the
scalar. For example, suppose we have a theory in which the interaction terms arc of the form
(1O.26)
No definition of intrinsic parity of <p can make this interaction parity
invariant. This is because the term with
would demand the parity
of the scalar field to be 1 in order to be parity invariant, whereas
the other term would demand the same quantity to be +1 in order
to be parity invariant.
1'5
Exercise 10.6 In. the absence oj a mass term. Jor th.e jermion, the
third equ.tion of Eq. (10.15) need not be .ati,jied.
a.) Show that with this choice 1'01'5 is a solution, i.e., P
= 1]P")'0'1'5
will satish! the other two equations.
b) Shom th.at in this ease, the intera.ction in Eq. (10.23) implies
th.t q, i' p..udo,calar, wherea' that of Eq. (10.24) lorce' q,
to be 0. scalar.
c) Sh.om th.a.t f'.:uen in this rose, the interactions in Eq. (10.25) and
Eq. (10.26) oonnot be made po.rityinuannnt by any choice oj
intrinsic parity of .
We now consider the Lagrangian of QED. Here, the interaction
term is
.'in' =
eQl/J1'~l/JA~
= eQ [l/J1'ol/JAo 
l/J1'il/JAi]
(10.27)
as discussed in 9.1. For the fermion bilinears, we find
l/J p(xlJ'ol/Jp(x) = ('Ipl/Jt (:;:) )1'o('IP1'ol/J(x)) = l/J(xlJ'ol/J(x), (10.28)
and
l/Jp(xhl/Jp(x) = ('Ipl/Jt (x)
)1'i('IP1'ol/J(x)) =
1/J(xhl/J(x). (10.29)
Using these, we find that the interaction Lagrangian is in fact invariant under parity if the parity transformation rules for Ao and Ai are
given by Eq. (10.20). Fields transforming like the photon field under
parity are called vector fields. If on the other hand there are spinI
fieids which transform like Eq. (10.21) under parity, they would be
called axial vector fields.
10.3. Charge conjugation
207
o Exercise 10.7 Shom that, if JOT some spin.l field Ell> the interaction with. jermions is gillen by
(10.30)
'Where a is 0. constant, the La.gra.ngian is parity inuarianl prouided
B~ is an o.xio.llJeetor fleld.
Exercise 10.8 Show tha.t, ij fOT some 8pinl jield Zj.l, the interaction with jermions \s giuen by
(10.31)
when a a.nd b aTe consta.nts, the La.grangia.n cannot be parity in
ua.ria.nt.
10.3
Charge conjugation
Charge conjugation was defined in 1O.1 as the operation of changing
the electric charge of any particle involved. In a more general setting
where we will be discussing not only electromagnetic interactions but
other types of interactions as well, the charge conjugation operation
would mean changing the sign of not only the electric charge, but
of all types of charges that a particle may possess. Since we know
that any charge of a particle should be equal and opposite to that
of the antiparticle, in effect this means interchanging particles and
antiparticles.
10.3.1
Free fields
For scalar fields, this just means replacing 4> by 4>t, and it is easy
to see that the free Lagrangian of a scalar field is invariant under
this operation. More generally, we can define the effect of the charge
conjugation operation to be
4>c(x) '" C4>(x)c' = 1)c4>! (x),
(10.32)
where C is the charge conjugation operator, and rye can be called
the intrinsic charge conjugation phase of the field. And) of course,
the free Lagrangian for a complex scalar field is invariant under this
symmetry for any phase 1)C. For real scalar fields, since both 4> and
4>c have to be hermitian, 4> should be an eigenstate of the charge
conjugation operator with eigenvalue + 1 or 1.
208
Chapter 10. P, T, C and their combinations
For the photon field which is also a real field, the free Lagrangian
again demands that charge conjugation is a symmetry provided the
intrinsic charge conjuga.tion factor is either t 1 or .. 1. When we bring
in interactions, we find that the Maxwell equations are invariant only
if we choose
A6(x)
= CA~(x)Cl
=  A~(x)
(10.33)
under charge conjugation operation.
This is because the current
changes sign if all the charges are reversed, Cj~C1 = j~, and j~
appears in the Lagrangian in the combination j~ A~.
For the fermion fields, the charge conjugation operation is a bit
more complicated since the fermion fields are spinor fields. We cannot
just replace '" by",t in the free Dirac Lagrangian, since ",t is thought
of as a row matrix whereas '" is a column matrix. This problem
can be easily avoided by considering, instead of t/Jt, its transpose,
which we can call "". But even this cannot. replace'" because "'.
does not have the same Lorentz transformation property as ,p. This
can be seen as follows. In 4.2, we showed that under a Lorentz
transformation
(10.34)
the spinor fields transform like
""(x') = exp ( ~O'~"W~") 1/)(x).
(10.35)
Thus, ,p'(x) would transform like
1/,"'(x') = exp ( +~O';"w~u) 1/)(x).
(10.36)
f:.  ofl.l/' Thus, if we
were to make the simpleminded replacement of 1/) by ,p', we would
This is not the same transformation since
a~v
end up with a Lagrangian that is not Lorentz invariant.
This, however, is not an impasse. Even for the case of parity, ,pp
and ,p were not proportional. Rather they were related by a matrix
P, which was defined in Eg. (10.13). In the same manner, here we can
ask whether the charge conjugation transform of the field ,p, which
will be called 1/)C, can be related to ,p' by a matrix. III other words,
we will look for a unitary matrix C such that 1/Jc can be defined as
T
"c
opC =
'10 '" ,
(10.37)
209
10.3. Charge conjugation
with the condition that ,pc transforms the same way as ,p under
Lorentz transformations. The appearance of "'YO in this equation is
purely conventional. We could have called the combination (1J by
some name C' and proceeded to look for C'.
The essential property of the desired matrix ( was already suggested in Ex. 4.6 (p 54). We now derive this property for the reader
who has not already solved that exercise. We want >!Jc, as defined in
Eq. (10.37), to transform according to Eq. (10.35). ThiS means
(1J >!J'(x') = exp ( ~a""w"") (1ci >!J(x).
We know from Eq. (10.36) how
ten as
>!J. transforms.
(10.38)
So this can be rewrit
(1ci exp ( +~<T;"w"") ,p(x) = exp ( ~<TIWW"") (1ci >!J(x). (10.39)
ff this has to be satisfied for arbitrary WI'" and arbitrary,p, the matrix
( must satisfy the relation
( T
( 1oT d 1J.J,I
= oJ.lV 'Yo
(1040)
or
.
T
( 1 a~1I ( =  "YoT
alJv"Yo = 
0Jw
(1041)
The last equation is obtained by using the definition of the amatrices
and the hermiticity of the 1matrices given in Eq. (4.18).
The question now is, does there exist a matrix C which satisfies
Eq. (1041) for all six of the matrices a""'1 The first step towards an
answer is to realize that if we can find a matrix ( which satisfies the
relation
( I
1"
= 1" '
(10.42)
this ( will definitely satisfy Eq. (1041), hecause
[1",1"]~ =
[1!,1:L = 
h~,1!L
(10.43)
Secondly, if the matrices 1 satisfy the anticornmutation relation of
Eq. (4.9), the matrices 1"t also do so. [n other words, since
(10.44)
Chapter 10. P, T, C and their combinations
210
we can take a transpose of both sides of this equation and obtain
(10.45)
By the theorem mentioned in connection with Eq. (4.20), we can
therefore say that there must exist a unitary matrix C such that Eq.
(10.42) is satisfied. The precise form of this matrix will depend on
the representation of the ,matrices.
Exercise 10.9 ShOUl thnt under chnrge conjugntion the jennion
current tmns!onns ns 1/,y.", ~  ",..,.",. Show thnt the cUTTent JOT
complex 8oola.rs change sign a.s welt.
Exercise 10.10 Show thnt
C 1y5 C
o
o
Exercise 10.11 Using ("'c)e
=..,i,
= "', show thnt the mntri< C must
be antisymmetric.
Exercise 10.12 Consider two different representntions oj ..,.
mntrius re!nted b~ 'Y. = U..,.U t jor some unitnTy mntri< U. Show
that the charge conjugation ma.trices in the two representations are
relnted b~

C = UCU .
(10.46)
(10.47)
Exercise 10.13 Show that, in th.e Diro.cPauli representa.tion of
the ymntrius, n mntri< C thnt snti.jie. Eq, (10.42) is
In the Ma.jorana representa.tion of ymatrices gwen in
pendiz A.1, ShOUl th.at one c.o.n dej\ne
C = iy0.
Ap
(10.49)
Exercise 10.14 U.e the conjugntion mntrix dejined in Eq. (1048)
to uerij~ thnl our choiu oj the .pinoT solutions in Eqs. (4.61) nnd
(4.62) snti.j~ the relntion.
C..,ri u;(p)
= v.(p) ,
Cyriv:(p) = u.(p).
(10.50)
10.3. Charge conjugation
10.3.2
211
Interactions
Real scalar fields are invariant under charge conjugation, so any interactions containing only real scalar fields will of course be invariant
under the same operation. For theories with complex scalar fields ,
the interactions are invariant if they contain only the combination
4>f 4>. Let us now check a nontrivial case, viz., that of the interactions
involving fermion fields.
Because of Lorentz invariance the fermion fields always appear in
pairs, in bilinears of the form
(10.51)
with some constant matrix F sandwiched between them. We ha'le
kept the bilinear in a general form in which the fermion fields ..pI
and !/J2 need not be the same. In order to find the charge conjugation properties of fermion interactions, we need to know how such a
bilinear behaves under charge conjugation.
The charge conjugated form for the bilinear above is
(10.52)
For any fermion field !/J,
.1.
'+'C
.,,1
'+'c1'o
'+' 1'0.CI 1'0,
.J.T
(10.53)
using Ct = CI. Recalling now that ')'0 = 1'J and using the definition
of the matrix C from Eq. (10.42),' we obtain
(10.54)
Thus for the expression in Eq. (10.52), we can write
(..p,)cF(!/J2)c
= ..pic'PC1'ci!/Ji
(10.55)
This looks very different from the bilinears that we have seen SO far,
where the field at the left has a bar on it. But it can be put into that
form. To see that let us start with a bilinear of the form !/Ji M')'ci ..pi,
and rewrite it by explicitly putting the spinor indices:
..pi M')'ci!/Ji
(..ptl<>M<>~ (1'ci)~, (!/J2);
=  (..p2);
(')'o),~ (M T ) ~<> (!/JdQ'
(10.56)
Chapter 10. P, T, C and their combinations
212
In the last step, we have merely written the objects in a different
order. In the process, we had to interchange the positions of two
fermionic operators, which brought in a minus sign because of the
anticommuting nature of the fermion fields. Now we can revert to
the matrix notation to write the expression as
(10.57)
This is an identity. Using it on Eq. (10.55), we can write
(,pIlc
F(lh)c = ,p2 (e'Fe) T ,pl.
(10.58)
We can now use it to find the charge conjugation properties of
various fermion jnteractions. As an example, consider the interaction
with a pseudoscalar field, given by
(10.59)
Notice that the second term is the hermitian conjugate of the first
one, and both aTe necessary in order that the interaction Lagrangian
is hermitian. If the charge conjugation phase of the scalar field is
taken to be 1JcP' the charge conjugated interaction would he
(Jiint)c = h (vJIlc)s(,p2)c 1J~t  h (VJ2)c'YS(lh)c1J;
= h1J~VJ2'YSlht  h1J;,pI'YS1/'2,
(10.60)
using Eq. (10.58) in the last step. We see that this is the same as
the original interaction if hW = h. Which means that even for a
complex coupling constant h, this Lagrangian can be made charge
conjugation invariant by appropriately choosing 1JiP'
Exercise 10.15 ShOUl that the intf'.Tndion term in QED is in.ua:rianl under charge conjugation prouidcd A~ transjorms us in. Eq.
(10.33).
IDA
Time reversal
10.4.1
Antilinearity
Time reversal symmetry was defined in Eg. (10.3) as the reversal of
the arrow of time. Discussion of this symmetry is somewhat more
10.4. Time reversal
213
involved than the earlier ones. Let us explain why. Suppose we are
considering a physical process where some initial state J>II} turns into
some final state I>II'}. In the time reversed picture, the final state
would become the initial one, and vice versa. In other words, if
we denote the time reversal operator by T, time reversal symmetry
would imply
(H' IH) = (>II 1>1I')
(10.61)
An operator U satisfying the relation
(U>II' IU>II) = (>11'1>11)
(10.62)
is called a unitary operator. In analogy, operators which satisfy Eq.
(10.61) are called antiunitary.
There is a very powerful theorem due to Wigner which states
that any symmetry operation, Le" any transformation on states of
a Hilbert space which leaves probabilities of all physical processes
invariant, can be represented either by a unitary operator which is
linear, or by an antiunitary operator which is antilinear. We will
not prove the theorem here, but let us explain the words l'linear"
and ltantilinear".
{j
Consider two arbitrary states 1>11 1 ) and 1>11 2 ). Suppose an operator
has the property that
(10.63)
for all complex numbers al and a2. Then the operator is linear. If
on the other hand the operator is such that
( 10.64)
for all al and a2, the operator is antilinear. The time reversal operator, since it is antiunitary, must be anti linear, as the theorem tells
us.
10.4.2
Free fields
With this background, let us discuss whether the free Lagrangians
of different kinds of fields are invariant under the time reversal operation. For spinO as well as the electromagnetic fields, there is
214
Chapter 10. P, T, C alld their comhinations
not much to discuss. The arguments are very similar to what we
used for showing parity invariance. The time reversed system has
coordinates
XT
= (t,x) = x,
(10.65)
where .T. wa.~ introduced ill Eq. (10.6). As ill the ca,:se of parity, the
intrinsic phase associated with time reversal can be arhitrary for both
spinO and spinl fields as long as we consider the free Lagrangiall
only. For the photon field. the interactions dictal(' that under time
reversal one should have
T A(:r)T 1 =  A( x). (10.66)
This is because under time revcrsal l
T J.o( 3; )T1 = J.o( :r')
Tj(1:)T 1 = j(i).
since j represents the flow of charge
wherciL"'; jU
(10.67)
represents the static
charge density.
In order to discuss time reversal propt'ftics of Dirac fields, we
write the free Lagrangian in the form
2'(x) = 1/,1 (:r) (iD, +hoY . V x  111/'11) 1/)(x).
(10.68)
The time rever:soo form of the Lagrangian is
T 2'(x)T 1 = tPi(x) (W,  h,iy' . V"  m/o) tP'l'(x)
= !/'hx) ( WI
+iyciyT. V x
milT) 1/Jr(x) . (10.69)
The complex conjugates of thf! l'rnatrices, a..':i well a.s i, have appeared in the first step becallse of the antilinearity of the operation.
In the following ~tCPI we have used the hermiticity properties of the
Imatrices to turn the complex conjugates iHto transpo:;es. In kp.p.ping with the notatiolis used for parity alld charge conjugation, we
will define
(10. 70)
for some matrix T which is independent of spacetime. Then we call
write Eg. (10.69) as
T2'(x)T 1 = tPt(i)TI
(iO_, +hciyT. V x my,T) TtP(i)
(10.71)
10.4.
Time
215
rl~versa.J
Time rever:;al invarianc.c will reqUIre
T Y(x)T 1 = Y(i).
This can be ar;hievcd if the matrix T
satisfif~s
TtT = J.
T t 1'0T'YiTT = 'YU)'I
Tt yJ"T = Yo.
The first of these
trix  both unitary
unitary matrices. In
po,es by Eg. (10.42)
then gives
(10.72)
th' relatiolls
'
(10.73)
equations says that T has to he a unitary maand antiunitary operat.ors are represented by
the other two equatioll:), we replace the tra.nswhich define, the matrix C. The ia,,;t equation
(10.74)
This equation ca.n be rewritten a.s,
(10.75)
[CT, Yol. = U.
Using this, the second
~quatioll
of Eq. (10.7:.1)
Cilll
1)/~
ca.st into the
fOfm
[CT, yd. =
o.
( 10.76)
Thus the ma.trix CT anticomUlutes with a.1I t!It' )'lllatr1CeS, Therefore
it has to he a. mult.iple of 1'5, TIl(' I!lOst f.,!;(~JH'r;ll snillt.io!l for T is then
( 10.77)
where 11'1' is a pha.,,;p. factor.
Dirac field is given by
Thll~.
t.he
tillH'
[{'versal Iuolwrty of a
(10.78)
Exercise 10.16
COllliidt'T lUlO
diITI:nnl
ITJlnslllt.(lt:i.(Jll~
oj the
,~
tfLatric,r.s rdnh~cl h1J )j' = U'j,U t . Sh.ow t.h II I tit.' li.mc'n,lllT.'U'.cl DiraC'
.fld<l8 in the lUlo rqllT:'lf'.llto.lioll::l llT'f~ n1Clfl'lt II'!I Hit, l'.lru.ntlO1l.
(10.70)
216
10.4.3
Chapter 10. P, T, C and their combinations
Interactions
Since the free Lagrangians are invariant under timereversal, we can
now ask whether the int~ractions are. The task is easy if we have
only scalar and spinl fields. For spinor fields which always appear
as bilinears, we can start addressing this question by finding out how
a bilinear behaves under time reversal.
Consider then a general form of a bilinear given by h1/J)FW21
where 1/J1 and 'l/J2 are in general two different spinor fields, h is a
coupling constant and F is a constant 4 x 4 matrix. Under time
reversal, this becomes
T (h1/>,(x)F,p,(x)) y' = hOT (1/>I(xhuF1/>,(x)) y'
= hO 1/>: (x)Tt'YoFoT'!J2( x)
= ItO 1/> I ( xho'Ys C'YJ F" C ' "Is 1/>,( x).
(10.80)
Using the definition of the matrix C from Eq. (10.42) to write C'YJ
')'0(, we can put it in the form
T (h 1/>, (x)F1/>,(x)) T l = hO
.p, (i:) Fr1/>,( x),
(10.81)
where
Fr = 'YsCF"CI'Ys .
We now list the form for
F
Fr
FT for various
"Yi
po.~sibilities for
')'0')'5
(10.82)
F:
(10.83)
Using this table, we can now discuss whether specific interactions
involving fermions are timereversal invariant. As an example, let us
take the interaction Lagrangian of QED, which was given in Eq.
(9.18). For our purpose, we write it as
(10.84)
The chargp. eQ is a real number, so complex conjugation does not affect it. For the field operators, we can use the table in Eq. (10.83) and
the transformation properties of the photon field from Eq. (10.66) to
see that this interaction is indeed invariant under timereversal.
10.5. CP
217
Exercise 10.11 Consider thf'. inlt'.md.ioll introduc:ed in Eq.
(1 O.:U). ShOUl that Uti)! inten.Lction is imJnl'icmt un.der timc.l'eueT'8ul
pro1,ided ZJJ transforms the same. 'WUY Ululr.r tinu:re.Uc.Tsal as the
photon jl.eld. IHint: F'ir!lot ShOUl from the lu:rmitic.it:y argument tlta.t
a a.nd b haue to be re..nl.!
10.5
CP
We have already discussed the discrete opprations C, P and T separately. Sometimes it is useful to talk abollt combinations of these
operations. One such combination which is very useful is CP, i.e.,
the product of charge conjugation and parity.
The transformation of different fields under CP need not be discussed in detail. They can be obtained by combining the results of
P and C operations, both of which have hflP.1l discussed earlier. For
example, for a fermion field, we would obtain
CP >jJ(x) (Cp)l = CP >jJ(x) P'C I = C7IVyOljJ(i)C1
= ryP"(oCVJ(i)C '
= 71J>IIC1'IlC1'J' >jJ'(i).
(10.85)
Using the fundamental property of the matrix C from Eq. (10.42),
we can rewri te th is as
CP >jJ(x) (Cp)I = ryCpCv,'(i:) ,
(10.86)
where ryCP is an intrinsic CPphase of the field, defined hy
ryCP
= '1pryc
(10.87)
o Exercise 10.18 Use the CPt'ransjonnulion rule for
0. fermion
jietd to show tho.t, if two fermion fields 1/JI and th ha.ue the same
intrinsic CP phasc, the general fermion biliTtt'O.r c.o1tstrucled from
them lrn.nslorms the following wo.'Y under CP:
C1'~,(x)P,p,(x) (C1')' = wi(i)C'YoPCt/';(x)
= t/>,(xhoCpTC'Yl)>p'(x).
(10.88)
lUling Eq. (10.58) in the lnst step.
Exercise 10.19 Suppose we define. Fcp
oy the: relation
C1' t/>,(x)Pt/>,(x) (C1')' = ~,(:i')PcPV', (x).
(10.89)
Using the results of the pre:uious problem, verily th.e foLlowing table
giuing Fcp corresponding ua.rio'Us possibilitie8jor F:
PCP
o
1',
'Yo
"Ii
"Ii
(10.90)
Chapter 10. P, T, C and their combinations
218
For the photon field, using the transformation properties under
parity and charge conjugation given in Eqs. (10.20) and (10.33), we
can write
CPAO(X)(Cp)1 = Ao(i),
CPA(x)(CP)1 = A(i).
(10.91)
With this and Eq. (10.88), we can show that the QED interaction is
CP invariant. But it is somewhat of an unnecessary exercise since we
have already known that the QED Lagrangian is invariant separately
under C and P.
The more interesting point to note is that if there is a spin1
field which has both vector and axial vector type interactions with
fermions, as for example was given in Eq. (10.31), the interaction is
not separately invariant under C and P, but is invariant under the
combined operator CPo
10.6
CPT
We now consider the product of all the three independent discrete
symmetries discussed so far. For the sake of convenience, we use the
shorthand
8=CPT.
(10.92)
In this case, let us first see how the photon field transforms under
this operation. Using the transformation property of the photon field
under C, P and T separately, we obtain
(10.93)
Moving over now to Dirac fields, we find
e 1jJ(x) 8 1 = CPT 1jJ(x) T~' P1C 1
= '/TC 1')'5 CP 1jJ( i) p 1C'
=
'/T'ICPCI')'5C1jJ'( x),
(10.94)
using the CP transformation property from Eq. (10.86) in the last
step. Using the definition of the matrix C, this can be rewritten as
(10.95)
10.6. CPT
219
where
1)6
= 1)cprn' = 1)C1)P1JT
(10.96)
If we change the order of the three operators in the definition of e,
the phases for operators appearing to the right of T will have to be
replaced by their complex conjugates since Tact. antilinearly. Of
course, that changes only the definition of ".. , but not a~y of the
conclusions below.
For bilinears of the general form "'I F1/'2, the CPT transform....
tion property can be deduced from here. For this, we will have to
remember that since time reversal is an antilinear operation whereas
parity and charge conjugation are linear ones, the product CPT is
antilinear. Therefore,
e "'(xl e I = e "'(x)t'Yo e I =
=
(S1/J(xj eI)t 'Yo
,.T( x ) 'YsT'YoT ,
1)60/
(10.97)
using the fact that 'Yo and 'Ys are hermitian matrices. Assuming for
the moment that we can assign the same CPTphase to all fermions,
we obtain
using the identity of Eq. (10.57) at the last step. Let us write this as
(10.99)
where
( 10.100)
using the notation FI = 'YoFt'Yo that was introduced in Eq. (7.15).
The significance of this symbol is that the hermitian conjugate of the
general bilinear discussed above is given by
(10.101)
If there is a term in the Lagrangian containing the bilinear WI F"'2,
there must also be a term containing the bilinear ""FI"'I since the
220
Chapter 10. P, T, C and their combinations
Lagrangian is hermitian. In fact, the CPTconjugate of any term
involving 1/;, F1/J2 must equal the term involving 1/J2FI1/;1 if the Lagrangian has to be CPT invariant.
Since 1'5 anticommutes with I'p, we find from Eq. (10.100) that
if we assign the same intrinsic CPT phase for all fermions, FI and
Fa are equal for a.ll fermion bilinears involving an even number of
vector indices. On the other hand, for fermion bilinears carrying an
odd number of vector indices, Fl and Fe differ by a sign.
The observation can be extended by looking back at the CPT
transformation property of the photon field. It carries one vector
index, and it differs from its CPT conjugate by a sign. Let us now
carry it further and say that for any spilll field, we will define the
intrinsic CPT phase to be the Same il.. that of the photon field, i.e.,
any spinl field will transform under CPT as in Eq. (10.93). Moreover, for all spinO fields, we will define the intrinsic CPT phase so
that
(10.102)
and will also take the CPT phase of all feTInion fields to be + 1. In
that case, we can extend the statement made a little earlier. For
these assignments of the intrinsic CPT pIll""" of the fields, the CPT
transformation property of any fermion bilinear or tensor (which in
cludes scalar and vector) field carrying n lIl11nber of Lorentz indices
is related to the hermitian conjugate by a factor (_1)n.
Since any term in the Lagra.ngian must be Lorentz invariant, it
must have an even number of Lorentz indices! suitably contracted.
So the CPT transform of any term would be the same as its complex
(hermitian) conjugate, and we conclude that. CPT is a good symme .
try of the Lagrangian. Our argument makcs no reference to whether
we are talking of a free Lagrangian or one with interactions.
What have wc assumed in arriving at this strong result? First, we
have assumed that the Lagrangian is Lorentz invariant. There is an
implicit second assumption, viz. that the .spinor fields anticommute,
a fact that we used in deriving Eg. (10.57), which has played a crucial
role in the derivation of thp. transformation of fermion bilinears under
CPT. Similarly, we have also assumed that fields with integral spin
are quantized by commutation relations. Thus in any theory with
these constraints, CPT must be a good symmetry of the Lagrangian.
This is the CPT theorem. Although we have demonstrated it with
10.6. CPT
221
fields of spin 0, ~ and 1 only, it is thought to be valid for fields with
higher spins as well.
Chapter 11
Electromagnetic form factors
11.1
General electromagnetic vertex
Suppose we want the matrix element of the electromagnetic current operator between two physical or onshell states of a fermion
of charge eQ. Using the expression for the electromagnetic current
from Eq. (9.13), we obtain
(p',s'lj,,(xllp,s) = eQ (P',s'10(.rh,,4{rllp,s)
e lqZ
,
.j2EpV.j2Ep,Vu"(p)eQ/,,,u.,(p).
(11.1)
Here we have used the free field normalized states of 6.2, and written
q=pp',
(11.2)
which ~tands for the transferred momentum.
However, WhCll we include interactions, we call no longer use the
free fields to describe the current. Let. us find out. the general form of
the matrix element of jfl in this case. SincE' the momentum operator
&p generates spacetime translations, we can write
j,.(x) = ei."x j,,(O)e d
".
(11.3)
So
(p', s' Ij,.(x)1 p. s) = e iqx (p', "U,.(Oll p, 8) .
The matrix element of
J"
(0) is parametrized
(11.4)
",eI, that
iqz
(p', s'U,,(x)1 p, s) =
e
u,,(p') ef,.(/I,p') u.,(p) (11.5)
.j2" V .j2p' V
222
11.1. General electromagnetic vertex
223
We have kept a factor of e outside, since this is a constant which
will appear in any electromagnetic vertex. All the rest, which might
depend on the specific particle states used here, have been dumped
into an object called r", We can think of this object as appearing
from an effective interaction term (j")A" in the Lagrangian. Then
the corresponding Feynman rule for the effective vertex is ief IJ"
For this reason, r jJ. is called the vertex function, and we will now
investigate some of its general features.
First of all, r Ii must contain only one free vector index. This
means that the most general form for r ~ is
r"
= "I"(F, + Fns) + (iF, + F2'Ys)o,wq"
+F3 q",ns + q"(F, + Fns).
(116)
The vertex function does not contain terms dependent on (p + p/)jJ.
because the matrix element of (p + p')" can be expressed in terms of
those of "I" and o""q". This is the result of Gordon identity, which
was shown in Eq. (4.52). Also, lIotice that although the terms seem
to come in pairs, one with a ')'5 factor and one without, there is no
such pair for the F3 term. This is because u(p')fiu(p) = O.
The above form given for r" can he simplified by noting that
the electromagnetic current is conserved. This is a consequence of
Noether's theorem, which gives
0= (p',s'IO"Mxllp,s) = iq"(p',s'lj"(x)lp,s).
( 11.7)
Using Eq. (11.5), we can write this as
q" u,,(p')
r" u,(p) ~ O.
( 11.8)
Remembering that u(p')fiu(p) = 0, this gives the following condition:
( 11.9)
Since this condition has to hold for arbitrary va.lues of p and p', we
conclude that
( 11.10)
Thus the most general parametrization consistent with gauge invariance is given by
224
Chapter 11. Electromagnetic form factors
The objects F I1 F21 F2 and F.1 appearing in this expression are called
form factors.
Let us now discuss what these [arm factors can depend on. They
are Lorentz invariant quantities, so they can depend only on the dot
products of various 4vectors in the problem. There are really only
two independent 4vectors, p and p'. With these we can construct
three dot produc.ts, p2, p/2 and P . pi, The first two of these are
not variables, they just give the mass of the physical particle whose
vertex we are examining. Thus we are left with only one variable
which is Lorentz invariant. Instead of taking it to be p  pi, we can
also choose it to be q2 = (p  p')2 The form [actors appearing in
Eq. (11.11) are to be understood to be [unctions of q2
Exercise 11.1 Consider the ffia.tri:r; element oj the electromagnetic
current betmeen o.n initial state oj momentu.m p and a. final one oj
momen.tum pi oj (l complex 8calnr field. Using Lorentt inua.ria.nce
a.nd current conservation, shaUl that th.e most genera.t vertex is of
the form
(11.12)
where F(q2) is a. form fa.ctor.
11.2
Physical interpretation of form factors
In some limits, the form factors defined above correspond to concepts
such as charge and dipole moments with which we are familiar from
I
classical electrodynamics. Here we examine the physics of these form
factors.
11.2.1
Charge form factor PI
To take the first example, let us consider the situation where p = p'
and s = s', i.e., the initial and the final slates of the fermion are the
same. The momentum trl\nsfer 4vector q vanishes in this ease, and
therefore q2 = O. One now obtains
.
ePI (0)
2 V u,(p) 'Y~ lL.(p)
p
eP, (O)P"
EpV
(11.13)
11.2. Physical interpretation of form facto,"
225
In deducing the last step, we have used the normalization of the
spinors from Eq. (4.51). We have also used Gordon identity, whose
restricted form was given in Eq. (4.50).
This expectation value is like a classical current, Let 11S now suppose that this current couples to an electronHtg:llctic field for which
A = 0, and only AO := 'P is nonvanishing. This could arise from the
following effective interaction term in the Lagrangian:
_ ( . ) A" = _ eFI (0)1"
V'
Jp
(11.14)
using pO = Ep' Since A = 0, we expect the electromagnetic field in
this case to be purely electric, a.nd the Lagrangian term to be p<p,
where p is the charge density. Thus here p = eFdO)/V, and since
this is the result of the calculation in a volume V, we conclude that
eFI (0) = charge of the parricle.
(11.15)
Indeed, this agrees with the treelevel reslIlt, for which F I = Q, as
can be seen from Eq. (ll.l). In other worns, FI(O) is the charge,
in the fundamental unit of e, of the particle \vho:;e vertex is being
considered. For this reason, F I (q2) is often called the charge form
factor.
Let us consider a more general form for tlIe photon field AP. The
Ftterm in the matrix element of Eq. (11.5) call be rewritten as
F ( 2)
iq.x
eel q
.j2E V .j2E ' V
p
2m
(' [
U,' p) (p
+ P, )"
.
"]
)
 la,wq u.(p,
(11.16)
using the Gordon identity given in Eq. (4.52). Now consider the
nonrelatlvistic ca...,;e, i.e., when Ep ~ Ep ' ::::::: 71/.. The dominant contribution from the (p+p')p in the square brackets renuces to the electric
charge, which has been discussed above. So let Wi now concentrate
on the other term. In the Lagrangian, the contribution of this term
can be obtaillt..d if we couple this term to a photon field AJ.I. The
resultant effective term is
e iqx eF (q2)
 (jp) AP = 2mV
~rn u,.(p')ia,,"q"u.,(p)A"(q). (11.17)
In the convention that qu is positive, the photon is being created at
the vertex, so in the Fourier expansion for the phot.on field, it is the
Chapter 11. Electromagnetic form factors
226
A~ term, Le., the term containing the factor eiq,x, which is relevant
here. With this term, the Fourier transform of
F$J.v
would read
(11.18)
Noting the antisymmetry of UfLlh we can thus rewrite the expression
of Eq. (11.17) in the following form:
. )A~
 (JIJ.
= 
e,qx eF,(q')_ ( ')
( )F~V( )
2 V
4
u~, p a~wus p
q . (11.19)
nt
TrI.
At this point, it is instructive to look at the DiracPauli representation of the ')'matriccs to gain further insight. In the nonrelativistic
limit, the solution of the lLspillors given in Eq. (4.61) have negligible
lower components, and upper components that are jUtit Xs defined in
Eq. (4.60). In other words, in this representatiun we can write
u,(p) = J2m (
~' )
+ small
terms,
(11.20)
using the normalization of Ch. 4 and using E ~ 171. This form of
the spinoe solutions shows that in this representation, the dominant
contribution to the spinae bilinear given above will come from the
components of (JjJl) which have a nonzero eutry in the upperleft
corner. These are
O'ii
= 'iik
( O'Ok
~k)
(11.21)
whereas aOi's have a zero in the upperlp.ft corner. Putting this into
Eq. (11.19), we ubtain the dominant contribution tu the matrix element to be
e'' X eF,(q') t k
i'
 V
4m X"O' X'<'ikF l(q).
(11.22)
Looking back at the tensor F~" in Eq. (8.6), we can see that
'iikFii(q) = 2BdQ), so that the last expres.ion can be written as
(11.23)
using the fact that ok B k
= OkBk = a' B.
11.2. Physical interpretation of form factors
227
Since we do not know the explicit form of the form factor F1(q'),
it is impossible to tell how this interaction should look like in the
coordinate space. However, let us expand F I (q') in a Taylor series
around q' = O. Then the first term is a constant, and at least for this
term it is easy to find the inverse Fourier transform of the expression
in Eq. (11.23). In fact, inverse Fourier transform would be the matrix
element of the operator
(11.24)
between X~, and x., where now B is the magnetic field in coordinate
space. Once again, remember that this calculation is done within a
volume V. For the total Lagrangian, we need to multiply by V,
assuming a constant Bfield. The corresponding term in the total
Hamiltonian will differ by a sign, i.e., will be
eQ
2m
uB.
(11.25)
This is exactly the interaction of a particle with a magnetic moment
ILD =
eQ
2m
eQ
m
00 = S,
(11.26)
where S = 100 is the spin vector for the particle. We have put a
subscript D on the magnetic moment since this contribution to the
magnetic moment, coming from the charge form factor, is usually
called the Dirac magnetic moment.
Usually, the magnetic moment is expressed in terlTlS of the Lande
gfactor, which is defined by
eQ
j.L=? gS,
.m
(11.27)
where eQ is the charge of the particle and Tn its mass. Comparing,
we see that the charge form factor gives the following contribution
to the gfactor:
go = 2.
(11.28)
Chapter 11. Electromagnetic form factors
228
11.2.2
Anomalous magnetic moment F 2
Let us now move 011 to the interpretation of the form factor F2 . The
task is easy here, since this term looks exactly like the term we had
been discussing so far, Going through the same steps, we can now
conclude that there is another contribution to the magnetic moment
comi ng from the form factor F2 . This is called the anomalous magne.tic moment, which we will denote by JLA It is given by
(11.29)
For a particle with a charge eQ, the Lande gfactor is thus obtained by summing the two contributions:
(11.30)
But the interesting point is that even if a particle does not have a
charge, there is no fundamental reason why it cannot have a nonzero
value of the form factor F2. Thus even uncharged particles may have
magnetic moments. For such particles, of course, the entire magnetic
moment is anomalous.
11.2.3
Electric dipole moment F2
The analysis of this form factor is very tiimilar to that of the form
factor F2. The only difference is that there is HOW an extra 1'5. Thus
when we go to the DiracPauli representation. we now will have to ask
which of the matrices a J,w"Y5 have nonzero entries in the upperleft
corner. These are
(11.31)
So now, instead of the components ptj of the field tensor, we have the
components pOi which give the dominant contrihution. And these
are the "Components of the electric field. Thus instead of B in the
effective interaction, we now will have E. The interaction is therefore
that of an electric dipole interacting with the electric field, and the
dipole moment is given hy
(11.32)
11.2.
Physical interpretation of form factors
229
The form factor F2(q2) is therefore called the electric dipole form
factor. Just like the anomalous magnetic moment, this can be nonzero even for an uncharged particle.
11.2.4
Anapole moment F3
The matrix element for this term can be written as
iqx
,j2E:V,j2E ,v eF3(q2) [u,,(p'hv"Y5U,(p)] (qpqv  gpvq2). (11.33)
p
Using the momentumspace version of the electromagnetic field equation given in Eq. (8.8), we find the contrihution of this term to the
 jP A p term in the Lagrangian to be
In the nonrelativistic limit, using the DiracPauli representation of
the spinors given in Eq. (11.20), this can be written as
(11.35)
As with the other form factors, we can take an inverse Fourier transform of the constant term in the form factor, and in the coordinate
space it gives a total Lagrangian of the form
(11.36)
which is obtained after multiplying the Lagrangian (density) by the
volume V. This is thus an interaction of the spin of the particle with
the current density, something which was a.bsent in classical physics.
The quantity F3 (0) is given a name in analogy with the multipole
moments  it is called the anapole moment.
We have used concepts from classical physics in interpreting these
form factors. It should be remembered however that classically,
dipole moments can exist only for extended charge distributions. A
point particle in classical physics can have only the total charge,
which of course does not depend on the momentum transfer q. Every other form factor, as well as the momentum dependence of all of
them, is a ha.lImark of quantum physics.
230
Chapter 11. Electromagnetic form factors
Exercise 11.2 Verily tha.t the electric dipole Qnd the. nno.pole mo
me:nt \nte:ra.ctions uiola.te pa.rity inua.rinnce..
o Exercise 11.3 Using the genera.lJonn of the electroma.tJnetic V1't.e:t oj n complez scolnr jield dejined in Ez. 11.1 (p 224), jind the
relntion betwe", F(O) nnd the chnrge oj the pnrticle.
11.3
Anomalous magnetic moment of the
electron
p'
+k
Figure 11.1: Lowest order correction to the vertex function in QED.
We now try to calculate the lowest order corrections to the vertex
function of QED. At the Iloop level, the contribution to the vertex
comes from the Feynrnan diagram of Fig. 11.1. Notice that this
diagram does not represent an Smatrix element since the external
photon line cannot be onshell if tbe electron lines are onshell, as
dis<!ussed in 9.2. Nevertheless, we can calculate its amplitude using
the Feynman rules of QED and obtain the effective term (j~)A~ in
the Lagrangian. Omitting the external spinors from both sides, we
can write the one loop contribution to r ~ as
J
ie~11) = / (~~, ie"y~ iSp(p' + k) iey~ iSF(P + k)ieyp iD~P(k),
(11.37)
or
r(l)
_.2/
~ ..
d'k
y~('" + + mh~(" + + mh~ (11 38)
(27f)' [(P' + k)2  m 2 J I(p + k}2  m 2 J k 2 .
.
11.3. Anomalous magnetic moment of the electron
231
The task is now to evaluate this integral. We know from Gh. 10
that QED by itself does not violate parity. and from Ex. 11.2 (this
page) that nonzero F2 or F3 indicate parity violation. SO F2 and
F3 can appear only if we include other interactions which violate
parity. In pure QED, we have only the form factors F , and F2.
In the evaluation, we will concentrate on extracting the anomalous
magnetic moment form factor F2, the one associated with ia~iJqv.
The evaluation of F, has some intricacies which will be pointed out
in 11.4.
Feynman parameters for the denominator
This is the first time in this book that we are trying to evaluate the
amplitude of a loop diagram. Therefore IH us discuss some general
techniques which will be useful for evaluation of any loop diagram.
One of these is the introduction of Feynman pa.rameters.
For this we need the identity
_1_ =
a, a2
r' d( [(a, + (1I  ()n21
in
(11.39)
2'
which can easily be checked by evaluating the integral on the right.
Alternatively, by introducing a ofunction on the right hand side, we
can write
_1_ = r ' de, r ' <1(2 0(1  (,  (2; .
a,a2
in
in
[(In, + (2a21
(11.40)
More generally, for any number of factors on the left, we can write
= (n  1)' fol de, fo' d(2' . . fo' d(" 0(1["''' L 'II"'
n
ala2 .. . an
()
L...i=l (jai
(11.41)
The parameters (i are called Feynman parameters. We now demonstrate their use.
D Exercise 11.4 Prove Eq. (11.41).
For the integral in Eq. (11.38), we have three factors in the d<>nominator, so we should use Eq. (11.41) with n = 3 to write
N,.(k)
[(pi + k)2  m 2) [(p + k)2 _ m 2] k 2
r'
= 2 in d(]
inr' d(2 inr' d(3 0 (1 
(,  (2  (3)
N (k)
~3 '
(11.42)
Chapter 11. Electromagnetic form factors
232
where N p is the numerator in Eq. (11.38), and the denominator is
given by
D = (d(p' + k)2  m 2 J + (2[(p
= k
+ 2k (IP' + (2P)
+ k)2
 m 2 J + (3k2
(11.43)
In the second step, we have used that (I + (2 + (3 = I because of the
6function appearing in Eq. (11.42), and that p 2 = p'2 = m 2 .
At this stage, it seems that we have actually made our task more
complicated. Instead of performing the integral that we had in Eq.
(11.38), we have introduced three more integrations over the Feynman parameters. The advantage of this procedure will be obvious if
we introduce a new momentum variable k' defined by
(11.44)
In terms of this variable, we can write
(11.45)
so that the integral of Eq. (11.38) can be rewritten in the form
(11.46) .
This shows the first advantage of using the Fcynman parameters.
The denominator is now an even function of the new momentum
variable k'. Thus terms in the numerator which are odd in k' will
vanish on integration. Moreover, for the remaining terms, the integration over k' can be done very easily, as we will see shortly.
Let us first look at the numerator. Since k' is a dummy variable,
the final result will not depend on it. So let us write it simply as k,
because the old variable k will never be llsed again. Integration over
the Feynman parameter (3 gives us the step function, so we obtain
(11.47)
11.3. Anomalous magnetic moment of the electron
233
Here we have used p2 = p" = m 2 in the denominator, which also
implies that q' = 2m 2  2p p', so that 2p p' = 2m'  q2. We have
also used the finestructure constant 0 in the prefactoT, which, in our
units, is given bye 2 /41<.
The numerator
In the numerator of Eq. (11.42), we must replace k by k  (11"  (2P,
as indicated in Eq. (11.47). This gives
where for the sake of brevity we have written
a~ = (1  (tlp~  (2P~,
b~ = (1  (,)p~  (IP~'
(11.49)
Thus we can write
N~
Y"h~~Y" + Y"I!Y~h"
+my" (I!y~
+ y~~) y" + m'y>.'Y~Y" ,
(11.50)
apart from terms linear in k, which integrate to zero when we perform
the k integration, as mentioned earlier,
We now use the contraction relations of ymatrices given in Eq.
(4.42) to simplify the numerator further. Take for example the last
term of Eq. (11.50). This can be rewritten as
(11.51)
by using the contraction formula with three y.matrices. This term
therefore does not contribute to the anomalous magnetic moment,
and we will ignore this term for our present purpose. Similarly, the
first term on the right side of Eq. (11.50), which contains two factors
of the loop momentum, can be shown to contribute only to H. We
will show this explicitly in 11.4 wben we discuss the contribution
to Fl. The other two terms can be simplified by the use of the
contraction formulas in Eq. (4.42). This gives
(11.52)
234
Chapter 11. Electromagnetic form factors
where the dots, for the rest of this section, stand for any term which
we know do not contain any contribution to the anomalous magnetic
moment.
To extract the terms contributing to the form factor F2 from
these remaining terms in the numerator, we note two things. First,
(I and (2 can be interchanged since they are dummy variables. Also,
it is only the numerator which changes under this exchange. Thus,
for example, the second term in Eq. (11.52) can be written as
Remembering that this appears in the expression for r~, which is
sandwiched hetween the spinors l we can use the Gordon identity to
write this term as
(11.54)
Only the second term in this contributes to F2 .
The second thing to note is that, since the whole expression has
u(p') to the left and u(p) to the right, we can use the definitions of
the spinors to write u(p')p' = mu(p') and pu(p) = mu(p). In other
words, if we obtain a factor of '1/ at the extreme left of the expression
for r~} we can replace it by the mass m. Similarly, if we obtain a
factor of p at the extreme right, we can replace it by m as well. With
this observation} we see that in the first term on the right side of Eq..
(11.52), we can write
~= (1(2)p(,p'
= (1  (,  (2)P'
+ (1 
(2)1f
= (1  (1  (2)m
+ (1 
(2)'
(11.55)
As we already said, the last step does not follow in general, but is
valid when the entire expression has the u(p') to the left. Similarly,
we can write
(11.56)
In the combination hp~ which appears in Eq. (11.52), we now obtain
four terms. Among these, the term containing both factors of m
from the expressions for ~ and ~ contains only a "(p, and therefore
11.3. Anomalous magnetic moment of the electron
235
contributes only to the form factor F" Then there is another term
which contains ;'Yp;, which also does not contribute to F,. We are
thus left with the terms with one factor of m and one factor of ,.
Again interchanging (1 and (, in one of these terms, we can write
2m(1  (,  (,)(1  (d ('Ypfl ''Yp) + ...
= 4m(1  (,  (,)(1  (Ilia,wq" + ....
(11.57)
2hp~ =
Adding the two contributions from Eqs. (11.54) and (11.57), we can
extract the general expression for F,(q'). For the anomalous magnetic moment, we need just F2(O), which is
where we have used the step function to adjust the limits of the (2
integration.
Wick rotation and momentum integration
We now want to perform the integrations. As we mentioned earlier I
the whole point of using the Feynman parameters is that the me>mentum integration can be done first, and done easily. So let us try
that.
Let us first recall that the denominator in the expression for F2
came from the denominator of various propagators. While writing
the propagators, we consistently ignored the i term. Partly this was
done to save writing, knowing well that we could bring it back whenever necessary. The need arises now because we have to integrate
over all values of the loop momentum. Looking at Eq. (11.38), we
notice that since p and p' are onshell momenta, the internal fermion
propagators would diverge near k = O. With the redefined loop me>mentum, we have an expression of the form
I.(A)
d4k
(211")4 Ik2 _ A' + i)' ,
(11.59)
for a suitably defined quantity A 2 The power s equals 3 for the
problem at hand, but our argument will hold for other positive integers, as discussed in A.4 of the Appendix. The integrand has poles
236
Chapter 11. Electromagnetic form factors
for certain values of k Z . Since A Z is real, if we did not have the little
imaginary part in the denominator, these poles would have been on
the real axis, and would pose problems while we tried to perform the
integration. This predicament was discussed while we found out the
propagators for various fields in earlier chapters of this book.
lmko
r~tI':::0'
Re ko
Figure 11.2: The contour for performing tne ko integration is shown withthick lines. The crosses indicate the poles of the integrand.
Once we have the imaginary part, the poles are not on the real
axis and we have no problem. Suppose first we are trying to integrate
over ko. The denominator in this case can be written as
B 2 + iE,
where B Z = k Z + A 2 If we consider complex values of ko, the poles
are at B  it and  B + it, as shown by the little crosses in Fig. 11.2.
We have been careless about the factors going with the t term in
the denominator. This is because c is a very small parameter \ which
should be taken to zero at the end of the calculation anyway. The
only thing to remember is that it has to be taken to ~ero from the
positive side. Thus, we have to be careful that the quantity which
may multiply E is positive, and we have been c.areful about that.
In any easel let us turn to the integration over ko. Imagine that
we are performing it for the integrand which appears in Eq. (11.59),
but over the contour in the complex kowplallc. The contour we have
k5 
11.3. Anomalous magnetic moment of the electron
237
in mind is shown in Fig. 1l.2. Since this is a closed contour and
it does not contain any of the poles of the integrand, the integral
should vanish, irrespective of the radius of the curved regions of the
contour.
Consider this statement when this radius is infinitely large. In
that case, the contribution to the integral from the curved regions is
infinitesimally small. Thus, the vanishing of the integral would mean
that the contributions from the two straight lines add up to zero, i.e.,
+00 dkn(k2
00
0 
B2
.).
+ If
1~;00
.
+'00
dko(k'_B2
0
. ). =0. (11.60)
+ 1.
In the second term, we can change the integration variable from kn
to a new variable knE, defined by ko = iknE. This allows us to write
] +00 dkn (2ko 00
B2 + t
.).=i
]+00 dknE (
00
1

k2DE
B2 + 1
.). (11.61)
Reinstating the integrations over the spatial components of k, we can
then write Eq. (11.59) as
.J
I. (A) = ,
d kE
(211") 4
(1l.62)
+'i',leo. ,
'[,k:O.l,,A"""2
where now
(11.63)
kl
The advantage of this form over the earlier one is that
is always
nonnegative, so we can apply techniques which are familiar to us
from our experience of spatial integration. This method of using the
complex analysis to bring in k~ in place of k 2 is called Wick rotation.
Of course, it does not refer to any physical rotation. It means that, in
the complex plane, we have chosen a new path of integration which
is rotated with respect to the original one along the real axis.
The method for performing this integral is described in detail
in Appendix A.4.2. Using Eq. (A.54) here, we can write down the
result:
(11.64)
Chapter 11.
238
Electromagnetic form [utors
Integration over Feynman parameters
We now put the result of the momentum integration into Eq. (11.58).
Since A' = (, + (.)'rn' in our case, we obtain
(11.65)
The integration is straight forward, and yields the result
(11.66)
Plugging this back into Eq. (11.30) and remembering that the charge
of the electron is e, i.e., Q = 1 for the electron, we obtain the
Lande gfactor to be
0<
(11.67)
g=2+.
"
The second term is the anomalous contribution of the gfactor, which
was first calculated by Schwinger. Of course there are further corrections to the gfactor, which comes from higher order diagrams, and
they contain higher powers of 0<.
Calculation of the anomalous magnetic moment of the electron'
has been one of the greatest triumphs of QED. To date, the calculations have been done up to the order n'. The results give the
theoretical prediction to be
~(g 
2) = 1159652460(127)(75)
10 1.
(11.68)
The uncertainties for the last digits are shown in parentheses  the
first one due to the experimental uncertainty in 0< and the second
one due to computational limitations. The experimentally measured
value for this quantity is
~(g 
2) = 1159652193(10) x 10 1.
(11.69)
Thus we see that the two agree up to seven significant digits! The
anomalous moment of the muon agrees with experiment up to eight
significant digits.
11.4.
11.4
Charge form factor
239
Charge form factor
Since QED is parity invariant, the only other form factor that appears
in pure QED interactions is the charge form factor Fl. Let us now
consider the contributions to Fl' These are terms in Eq. (11.50)
which are proportions.! to "/~. The last term reduces to the expression
of Eq. (11.51) and so contributes only to Fl. The contribution of the
third term can be read from Eq. (11.54), and it is 8m 2(1 (I  (2l"1...
As for the second term, i.e., the term 2h~~, we have indicated
which terms do not contribute to F2. These can now be assembled:
2h~~ = 2m 2(1  (I  (2)21'"
+ 2(1
 (!l(1  (2);"/,,;
+ ... ,
(11.70)
where in this section, the dots mean the terms which are irrelevant
for Fl. Using the anticommutation relation of the l'matrices, we can
write ;'I"i = (2q"  'I"i); = 2q,,;  l'"q2. Of these, the first term
vanishes between the two spinors, and the second term contributes
to Fl' Thus we can write
2h"~ = 2m 2(1  (,  (2)21'" 
2q2(1
 (1)(1  (2l"1"
+ ....
(11.71)
We now have to discuss the first term in Eq. (11.50). This can
be written as
using the contraction formula from Eq. (4.42). Thus for this term
the integration over k has the following generic form:
(11.73)
where f(k 2 ) is some function of k 2 . The integrs.! must be symmetric
in the indices v and p and obviously cannot depend on the 4vector
k which has been integrated over. Thus it must be proportions.! to
the metric tensor glJP and we can write
I
[VP = g"P J.
(11. 74)
240
Chapter 11. Electromagnetic form factors
Contracting both sides of this equation with 9vp and using the fact
that 9vp9 vP = 4, we obtain
(11.75)
using the definition of I"P from Eq. (11.73). Thus we can write
(II. 76)
a result which is extremely useful for performing loop integrations.
a
Exercise n.5 Verih Eq. (11.76) expli<itly by <hanging integration uariabtes 8uitabt'Y. For this, first check that the integral on the
lejt sh.ould \Janish if v I p. Then, bll change oj llnriabtes, ShOUl that
(11. 77)
for any i = 1,2,3. FinaUy. add these jour equal objects and diuide
by 4 to obtain Eq. (11.76).
a Exercise 11.6 Show that
j d'kkvkVk'kPj(k') =
2.
(9""9'P+ 9"'9
24
x
vP
+9"P 9 "')
j d'k k'f(k').
(11.78)
Going back to Eq. (11.72), we now see that in the integrand, we
can write the numerator for this term as
(11.79)
using the contraction formula from Eq. (4.42). Thus this term also
contributes only to the charge form factor, as was claimed in 11.3.
Collecting all the terms, we now find that the contribution to
Fl(q2~coming from the diagram of Fig. 11.1 is:
11.5. Electronproton scattering
241
where
N 1 = k 2  2q2(1  (Il(l  (2)  2m 2 [(I
+ (2)2
 2(1  (I  (2)] .
(11.81)
Notice that there is a term in N , which is just k 2 Consider the
momentum integration for this term. Apart from an overall factor
and the integrations over the Feynman pa.rameters , we ohtain an
expression of the form
(11.82)
This is a function of q2 Consider now the contribution to this integral coming from large values of k 2 , i.e., for 1,,21 > M 2 , where M is
a mass scale much larger than both m and Jjq1j. For this part, we
can neglect m 2 and q2 and write this part a.,;
d'k
J1k21>M2 k 4
(11.83)
The indefinite integral will go like the logarithm of Ik 2 1. But the
upper limit of this integration is, of course, infinite. Thus, onCe
we perform the integration, we will obtain an infinite result. Such
infinities are called ultraviolet divergence.s since they are caused by
the targek behavior of the integrand. Ultraviolet (UV) divergences
appear in other diagrams as well. A careful and extended treatment
of such divergences will be presented in Ch. l2.
11.5
Electronproton scattering
Form factors are also useful in considering treelevel processes, when
there are vertices involving fermions which are not elementary particles. As an example, we can consider the clastic scattering of electrons a"nd protons. Protons are known to he c.omposite objects, com
posed of quarks. Thus, protons are not simple Dirac particles. The
only way to write down the electromagnetic vertex involving them is
to use form factors.
To be more specific, let us look at Fig. 11.3, which gives the
lowest order diagram for the scattering. The Feynman amplitude
242
Chapter 11. Electromagnetic form factors
k
k'
A(q)
p
Figure 11.3: Lowest order diagr.am for electronproton elastic scattering.
The lighter lines denote electrons, the heavier lines protons. The momentum
for each line is shown.
can be written as
[U(P) (p') (ier
V
)
U(p) (p)]
(11.B4)
where the subscripts on the spinors denote the particle concerned.
Here r v stands for the vertex function of the proton. Had the proton
been a Dirac particle, we would have written r v = '"Iv just as we
have done for the electron. But as we said, the proton is not a Dirac
particle. So we will have to use the general expression for the vertex
function. Assuming parity invariance, the vertex function contains
two form factors, FI and F 2 :
(11.85)
As explained in 11.2, F I (0) = 1 which gives the proton charge in
units of e, and F2 (0) is related to the anomalous magnetic moment of
the proton. Indeed, proton has a large anomalous magnetic moment,
indicated by the fact that its Lande gfactor is given by
9 = 5.58,
whereas the Dirac contribution is only 2.
So we can write the Feynman amplitude as
(11.86)
11.5. Electronproton scattering
243
where r ~ is given by Eq. (11.85). For the sake of simplicity, let us
neglect the electron mass and denote the proton mass by m p Then,
averaging over initial spins and summing over final ones, we obtain
(11.88)
where
Tr [f'Y"h V ]
f~V =
= 4 [k'"k
+ k~k'v + ~'J'g"v]
(11.89)
and
(11.90)
Using the form for
The final result is
r"
W~v = 4 [WI ( 9p.v 
given in Eq. (11.85). one can take the trace.
7'J"'Jv) + W,
m~
( PJ.l
pq2. 'J QJ.l. ) ( Pu
P . q )]
 qrqv
(11.91)
where
(11.92)
The calculation of the differential cross section is now straightforward. In the Lab frame where the initial proton is at rest, it comes
out to be
. ,0
2W, sm :2
do
+ W'COS
,0
:2
. OJ
0
[1 + (2E/rn p ) sin':2 sin':2
dO
'
(11.93)
where E is the energy of the incident electron.
There is a big advantage of writing the differential cross section
in terms of W, and W, rather than F, and F,. Suppose we now
consider the inelastic process
e
+p ~
e
+X,
(11.94)
244
Chapter 11. Electromagnetic form factors
where X represents any number of particles. In this case, we cannot
characterize the vertex in terms of a vertex function of the form of
Eq. (11.85). However, even in this case, Lorentz invariance implies
that the square of the Feynman amplitude must be expressible in the
form given in Eq. (11.88), with lPv given by Eq. (11.89). Moreover,
due to current conservation, WltV must satisfy the conditions
(11.95)
In addition, Wf'&/ ca.n depend only on the vectors k and p, or equivalently on q and p. The most general tensor constructed from two
vectors and satisfying Eq. (11.95) has the form
Wpv = 4 [WI (9 pv q;~v ) + ~i (p"  P/qp) (Pv  P/qv)
3
AP]
W
+2m~pVAPP
q
(11.96)
If parity invariance holds, W 3 = 0, so we are left with the form of
Eq. (11.91). The differential cross section is still of the form of Eq.
(11.93), although WI and W 2 cannot be expressed as in E'l. (11.92).
o Exercise 11. 7 The pions an
8pin~O
purticle. wh.ich, like 'Protons,
are com'Posite objects. Use th.e pa.rOometriz.a.tion oj the 'Uerte:x
~iuen
'in
E~.
11.1 (1)224) to c.a.lcuto.te the differential cross section oj ela.stic
sca.ttel"ing oj electrons off cha.rged pions. Sh.OUl that
do
d!l
F2 cos 2 ~
42
0] 0.
[1+ (2E/m.) sin' 2 sin' 2
(11.97)
Chapter 12
Renormalization
In calculating various amplitudes, we have
two kinds
of divergences. In 9.10, we found an infrared divergence, i.e., an
amplitude which becomes infinite for vanishingly sillall values of some
momentum. On the other hand, in 11.4 we J;;'lW an example of
ultraviolet divergence, which is caused hy arbitrarily large values of
momenta in a loop integration. In this chapter, we take up the issue
of how these infinities ca.n be interpreted. As a paradigm, we will
often use QED as the interacting theory. bllt SOffle of the general
arguments will be valid for any field theory.
Our primary concern will be the eliminat.ioll of the ultraviolet
divergences. In fact, unless otherwise specified, ';divergcnc.e" will
mean ult.raviolet divergence in this chaptPr. \Ve will see that the
structure needed to interpret ultraviolet div('rg(~llces will also allow
the cancellation of infrared divergence::;.
CIlCOUlIlcred
12.1
Degree of divergence of a diagram
12.1.1
Superficial degree of divergence
Let us find out which diagrams arc divergl'IlL i.(~.! make infinite contributions to the amplitllCie. As we already saw in ~fl1.3, the infinities
ill an amplitude come from momentum integrat.ioll.
So let u::; consider a generic. diagram wit.h f loops. ill which the
number of internal fermion lines is Ttf and tht: 1l111!lhcr of internal
boson lines is nb. Let us also say tlJat ill this g{'lleric. diagram there
are different kinds of vertices and the itil kinel apr~a.rs Vi times.
245
246
Chapter 12. Renormalization
In a generic field theory, the interaction terms in the Lagrangian
which give rise to these vertices may also involve derivatives. Let the
number of derivatives in the ith type of vertex be d;.
Now suppose we have written down the Feynman amplitude of
this diagram and we are trying to see how it behaves when the internal momenta are in the ultraviolet region, i.e. , are much larger than
any of the masses of the particles in the loops. Each internal fermion
line contributes a propagator which behaves like lip. For internal
scalar lines, the propagators behave like I/p 2 at large momentum.
Let us assume that the internal vector boson propagators also behave the same way, as was the case for thc photon propagator given
in Eq. (8.40). Integration over each momentum loop has four powers
of momentum due to the d 4p, and each derivative in the interaction
terms contribute one power of p. We can therefore count the overall
power of momentum in the integral,
D = 4e  2nb  n I
+ L Vi d; .
(12.1 )
This quantity D is called the superficial degree of divergence of the
diagram. If this turns out to be negative, the corresponding amplitude is expected to be convergent. If on the other hand D is positive,
the amplitude apparently diverges when we consider the integration
up to infinitely large values of the loop momenta. Even if D is zero,
the integral can be logarithmically divergent. This was the case of
the diagram for the electronphoton vertex in Fig. ILl, which had
only one kind of vertex with d = 0, and the diagram had e = I,
nb = I, nl = 2, so that D = O. Indeed, we observed in 1l.4 that
the form factor F, in this diagram turned out to be logarithmically
divergent.
The expression for the superficial degree of divergence given in
Eq. (12.1) can be written in a more convenient form if we try to
eliminate the number of internal lines from the expression and use
the number of external lines instead. For this, we use the relation
mentioned earlier in Eq. (6.56):
(12.2)
with a trivial change of notation. Let us also suppose tbat there are
f; fermion lines and bi boson lines at a vertex of the ith type. If
12.1. Degree of divergence of a diagram
247
there are E b external boson lines and EI external fermion lines, we
must have the relations
EI
+ 2n l
= Lviii,
(12.3)
because each internal line must connect two vertices and each exter
nalline attaches to only one vertex. Using these we can eliminate
nb and nl from Eq. (12.1) to get
D = 4 Eb
~EI + LVi
(d' +b, + ~f; 4)
,
e,
(12.4)
The quantity multiplying Vi in the last term has a simple interpretation. The mass dimension of the field operators, including the
derivatives, at any vertex of the ith type is di + b; + ~ /;, and thus
the coupling constant associated with this interaction must have mass
dimension
Oi = 4  d i  bi 
21, .
(12.5)
Putting everything together we can write the superficial degree of
divergence in the form
(12.6)
Let us now see what this implies. Suppose we have a theory in
which all the interactions have 0; > o. Consider a diagram involving
Eb external bosonic lines and E I external fermionie lines. If Eb,
or EI' or both, are large enough, the degree of divergence becomes
negative and the Feynman amplitude for this diagram is superficially
convergent. The degree of divergence can be positive or zero at most
for a finite number of choices of E b and E I . This means that the
theory has only a finite number of divergent amplitudes. If we can
figure out how to make sense of these few divergent amplitudes, we
can expect to get meaningful physical quantities from a theory in
which Oi ~ O. There is a consistent procedure for doing this, called
,..normalization, which we will describe in the rest of the chapter.
248
Chapter 12. Renormaljzatjon
Theories in which the coupling constants have zero or positive mass
dimension are called powercounting renormalizable.
On the other hand, consider a theory where there is at least one
interaction for which 0, < O. In this case, no matter what values of
Eb and E f we consider, we can always find a diagram, with large
enough value of v" which will make the dcgree of divergence nonnegative. In other words, there will be an infinite number of infinite
amplitudes. And since the superficial degree of divergence is different
for diagrams with different number of external lines, the divergent
integrals in these amplitudes are all different. In such a case, there
is no consistent way to deal with the divergcnces, and the theory is
called nonrenormalizable. Such theories would either be unacceptable physical theories, or may be valid as an approximate theory only
in a. specified momentum range, so that the momentum integrations
do not go to large values and the problem of infinities does not arise.
o Exercise 12.1 Consider (l th.eory with a massiue ueetor boson
whose 'PTopagntoT' is 0(11 for lnrge morn.entQ, as shown in Eq. (8. 22).
ShOUl that the 8uperjida degree of di,,{';rgence, for this theoT1l is giuen
by
D = 4E,, ;;E,2EA
+ LlJ,(a,
0;),
(12.7)
wh.ere E and E A are the. number of eJ:teMtal f\calar and 'Vector lines,
is th.e number of uector boson lines at n tlcrte:t of the ith. t"pe,
and all other 81lmbols hnue the meaning yiuen in the text. Prom thi.s,
argue that the thwry would not be powercounting renorrna.lha.bte.
Qj
12.1.2
Superficial vs. real degree of divergence
In the entire discussion above we have IlSed the word "superficial"
several times. Consider a diagram for which the value of D, calculated from Eq. (12.6), is positive or zero. Does it necessarily imply
that the amplitude of this diagram is divergent? Not really, because
in finding D, we have counted the powers of momentum in the integrand and in the measure of the integral. If all of these momenta are
internal, then of course the amplitude would be infinite. But we do
not know if that is the case. If for some reason the amplitude has to
contain one or more powers of external momenta, the number of powers of internal momenta may be negative, and the amplitude would
be finite. We will see such a case in QED later, where the amplitude
1
12,1. Degree of divergence of a diagram
.,
,,
249
"
"
_____L ........... '_____
(a)
Figure 12.1:
(b)
An example of a superficially convergent diagram with a
divergent subdiagram.
is finite despite the fact that the superficial degree of divergence is
zero.
There are other ways in which a superficially divergent diagram
can be convergent. Consider, for example, supplementing the QED
Lagrangian with a photon mass term M 2 A"A". It would appear
from Eq. (12.7) that this theory is nonrenormalizable. However, the
part of the propagator which behaves as O( 1) at large mOmenta is
proportional to k"k v ' Since the photon always couples to conserved
currents, this part does not contribute to any a.mplitude. So effectively the propagator still behaves as 1/ k 2 in the ultraviolet region,
and the theory is powercounting renormalizable.
There is a flip side of "superficiality". Consider a diagram now
for which D is negative, so that the diagram is superficially finite.
But the quantity D counts the overall powers of momentum. In
a diagram with more than one loops, there are more than one internal 4momenta which have to be integrated. Even if the total
degree of divergence is negative; it is certainly possible that if we
consider only the powers of one particular internal 4momentum, the
net power of this momentum in the integral is zero or positive. In
that case, integration over that particular 4momentum will diverge.
For example, consider the 2loop diagram of Fig. 12.1a in the Yukawa
theory of scalars and fermions discussed in 6.1. The diagram has
D = 1 so that it is superficially convergent. However, the loop
shown separately in Fig. 12.lb is a subdiagram with D = +1, and
the momentum integration over it diverges. Clearly, a diagram containing a divergent subdiagram can be superficially convergent even
if the number of loops is the same for both. For example, the Iloop
diagrams in Fig. 6.5 have D = 2 and are superficially convergent.
But the loop itself, shown in Fig. 6.6, is a subdiagram with D = 0
250
Chapter 12. Renormalization
and is therefore divergent.
However, we can ignore such hidden divergences when we try to
apply the renormalization procedure order by order in perturbation
theory. The reason is simple. If there is a divergent subdiagram
in a superficially convergent diagram, that subdiagram will be simpler than the overall diagram in the sense that it will contain either
smaller number of loops, or smaller number of external legs, or both.
If at the level of discussing these simpler diagrams, we have already
decided how to interpret the infinities occurring there, we can just
use that wisdom in this larger diagram and obtain a result that is
physically meaningful.
12.2
Specific examples in QED
We now discuss the implications of our discussion on QED, which is
the theory with a spinor and the photon field, with the interaction
term given by
(12.8)
The coupling constant associated with this interaction is dimensionless. Therefore, 8 = O. The formula for the superficial degree of
divergence is given simply by
(12.9)
where Eb is now the number of external photon lines and Ef, as
before, is the number of external fermion lines. Because of charge
conservation, EJ must be even. In additioll charge conjugation invariance of QED implies that any amplitude with Ef = 0 and odd
Eb has to vanish. Therefore we can immediately conclude that only
the following amplitudes have nonnegative superficial degrees of di1
vergence:
Eb
0
2
1
Ef
2
0
2
0
D
1
2
0
0
(12.10)
12.2. Specific examples in QED
251
Of these, the last one is not really divergent. The reason is the gauge
invariance of QED, which demands that if all external lines of a diar
gram are photon lines, the amplitude should vanish when contracted
with the momentum of any of those lines. We will illustrate later
that this implies that the amplitude of a diagram with n external
photon lines must contain at least n powers of external momenta.
Thus for the amplitude with four external photons, the real degree
of divergence cannot be larger than 4. Hence this amplitude must
be convergent. For the amplitude with two external photon lines, we
can also say that there must be at least two external momenta, but
this reduces the degree of divergence from 2 to zero, and therefore
the amplitude can still be divergent. Thus we have found that there
are three kinds of amplitude which can be divergent in QED, viz.,
those corresponding to the first three rows of the table in Eq. (12.10).
(A)
(B)
(C)
Figure 12.2: Schem.tic represent.tions of divergent .mplitudes in QED.
The blobs stand for .ny number of intern.1 lines.
In Fig. 12.2, we show these amplitudes in a schematic way. The
diagram marked (A) with two external fermion lines is called the
selfenergy diagram for the fermion. By the same token, diagram
(B) should be called the selfenergy diagram for the photon. This
name is used sometimes, but the name vacuum polarization diagram
is more popular.
The last diagram, which has two fermion lines and a photon line,
is called the vertex diagram because it contributes to the interaction
vertex of QED. In Ch. 11 we have discussed the Iloop contribution
to such amplitudes and found that the amplitude indeed turns out
to be infinite. This infinity showed up in the form factor F 1 , which
252
Chapter 12. Renormalization
we discussed in 11.4. In the rest of this chapter, we will try to figure
out a way to consistently interpret the infinite amplitudes.
12.3
Outline of the program
The infinite amplitudes are tackled in two steps. In the first step, the
goal is to somehow perform the loop integrations and write it in a
closed form by introducing; some extra parameter in the theory. The
parameter is introduced in a way that the integrals call be performed
for a range of its values. However l if we were to take the values
appropriate for the physica.l limit, the result would b~ infinite. This
part of the process is c.alled 1egularization.
There are many ways ill which regularization can be performed.
For example, we c.an c.hoose to integrate all loop momenta up to a
certain maximum value of A. Then we call perform the integrations
and would obtain results which depend on A. This procedure is
called a cutoff regularization. Alternatively. we can use sontc auxiliary fields which do not actually appear in the physical theory. These
are introduced in slich it way that the infinities occurring from physical particles in the loop arc canceled by the infinities arising from
these auxiliary particle::, in the loop. The integrations can then be
performed allu the results depend on the lTl~ses of these auxiliary
particles. The infinities reappear in the limit when the masses of
these auxiliary pnrtic1cs are set to infinity. This procedure is called
Pauli Villm.s H~gulaT'izlLf.ion. A thirrl procedure is to formally make.
the nilmhcr of spacetime dimensions an arbitrary real number. The
results of integratiolls now would depend on this number. When it is
set equal to four, the infinities show up. This process is called the dimensional regularization. In all cases, the parameter dependent part
whic} becomes infinite in the physical limit is called the divergent
part of the amplitude.
Once we have rcgulari:;;ed a. divergent integral to a parameter
dependent finite number, we can introducc' what arc called CDuntertenns into the Lagrangian. These are products of the fields, with
coefficients chosen such that the divcrgcllt part of each regularized
amplitude is canceled by the contribution from the corresponding
COllntertenn. This procedure is called Tenormalization. If all the
CDuntcrtcrms are of the same form as some or all of the terms present
12.4. WardTakahashi identity
253
in the original Lagrangian, this is equivalent to only a rescaling of
masses, coupling constants and fields by (parameter dependent, divergent) constant numbers. Theories for which this can be done are
called renormalizable theories.
We will not discuss the cutoff regulari7.ation procedure in any
detail because it is not explicitly Lorentz invariant. But we will use
the other two types of regulari7.ation. And we will abo show that the
physical results are independent of the regularization procedure.
12.4
WardTakahashi identity
We already identified three types of diver~pnt amplitudes for QED.
There is in fact a relation hetween them which makes it easier to
tackle them. This can be guessed from the fact that the interaction
between fermions and photon~ wa..o:; lntroduced by the minimal substitution method summari7.ed in Eq. (9.15), which means 11 ~ l1+ieQ/l.,
using the gauge principle.
The relation can be explicitly stated now. The minimal substitution method implies that the vertex function introduced in 11.1
is given at the tn..'C level by
r(ll)
( P,
jl
P _q) 
Q1 J
(12.11)
whrrc the superscript on r denotes the JllIlnber of loops\ and we
have indicated the momenta of the incoming a,nd outgoing fermions
in parenthesis. Thus,
(12.12)
This relation turns out to be valid at all orders in perturbation
theory. In other words, we can write
q~r~(p,p  q) = Q[Spl(p)  Sp'(p  q)]
(12.13)
as an exact result. This equality is called the Ward Takahashi identity.
On the left hand side we have r~, whidl is the general 3point
function involving two fermions and one photon, as we have seen in
Chapter 12. Renormalization
254
ll:h~.11.
The usefulness of this identity can be seen by interpreting
tHe J'ight hand side as the difference between 2point functions, i.e.,
Feynman amplitudes of diagrams with two external legs. Let us first
draw the total 2point amplitude as the sum of the treelevel and the
'I
... :..._.
;)1
.. ,:::
';.,'
......
+
.'
(12.14)
The treelevel amplitude, which is the undecorated line on the right
htmd, .side 0{.Eq.(12.14), can be read off from the Lagrangian by
trell!.,\llg .the.quadratic term just like an interaction and using the
procoolJ.f;e,for writing Feynman rules described in 6.6. This will
g).ve,' .... m., which is just the inverse of the tr"",level propagator.
~he Feynman amplitude for the remainder, which is drawn as the
gray box, is conventionally denoted by E(p). Thus Eq. (12.14) can
!:la.written algebraically as
! .!
i.
:"
Sp'(p) =]ImE(p).
(12.15)
l~
is t~is full 2point function that goes into the WardTakahashi
Identity of Eq. (12.13).
," '.
; I '
"
.' ,
",
Fi~ure
.C\.
12.3:.
'.
lIo~p diagram for fermion selfenergy in
QED .
To show that the WardTakahashi identity is at least valid for the
Iloop
contributions, .let us
first
evaluate
the Iloop corrections to the
.", I ' ! .
.
.
..
2~'poiilt functiOli'for the fermlons, i.e., the selfenergy of the fermions.
This
comes
from.the.diagram.of
Fig. 12.3. Using
the Feynman rules
. "
I
.
" ,", .. , " . .
.
,
'
,
'.',..
.
of QED, we can write the Iloop contribution to it as
~
,,; ." . '(II"
.
:::. ~iE. (P) = (ieQ)
2J (271"d'k)' "'("]I +11i :....'m)"
'
.
.
iD"~ (k):,
(12.16)
12.4.
Ward Takahashi identity
255
where iDPV(k) is the photon propagator.
As for the vertex function, we have already seen the Iloop contribution in Eq. (11.37). Although that was written for onshell electrons, the expression remains the same except e is now replaced by
eQ:
where p' = p  q. Thus
P
q f
(1) _ ( .
P
zeQ)
J
Q
d k
(27l")'Y~ tJ
i"
+ _ m
"
p + _ m "Ip ,D
~p( )
k.
(12.18)
Since p' = p  q, we can write
Ii as (p +  m)  (p' + 
m). Thus,
p'+~m Ii p+:m = i[p'+~_m  p+:m]'
(12.19)
Putting this back into Eq. (12.18) and comparing with the expression
for the selfenergy in Eq. (12.16), we obtain
(12.20)
Adding this to the tree level relation gives Eq. (12.13) up to Iloop
contributions.
It is true that our proof takes only the Iloop contribution into
account and considers only the fermionpllOtoll interaction. But one
can construct a more general proof from Cllrrent conservation, In
fact the identity holds true to all orders, even jf the fermion has other
interactions which do not violate gauge illvariancc. Also, we know
from Eq. (11.8) that qPfp must vanish between thespinors. However,
if we consider the effective electromagnetic vertex of a neutral fermion
[or which Q = 0, we see from Eq. (12.13) that. for neut.ral fermions a
more stringent. relation, qJJT J1. = 0, is satisfied irrespective of whether
we consider it between spinors or not. Thes" facts are very useful in
considering electromagnetic interactions of partic1('~ in the presence
of other int.eractions a.<; well.
256
Chapter 12. Renormalization
Here we will stick to the electromagnetic interactions only. In
view of the general identity of Eq. (12.13), we will consiner helow
in detail only the loop contributions for selfenergies of fermions and
photons.
a Exercise 12.2
SU'PPose electrons also intera.ct with a scalar field
\lin a. Yuka:wo. interaction term h1JJl/J. ShOUl th.at at the 1loop Level,
Eq. (12.20) still holds fOT the electTonpholon ueTlex.
12.5
General forms for divergent amplitudes
12.5.1
Fermion selfenergy
Let us start with the fermion selfenergy in QED. We will take the
fermion to be the electron, with Q = 1. The expression for the
fermion selfenergy can be written from Eq. (12.16) as
J
J
d'k 1',,(p + iI + mhl'
(271")' [(1' + k)2  m21k2
E(p) = ie 2
d' k 2(p + ill  4m
(271")' [(1'+ kj2 _ m2]k2'
. 2
= ,e
(12.21)
In the last slep, we have used the conwu:tion formulas for the 1'matrices, Eq. (4.42).
We can now use the Feynman parameters for the denominator,
which were introduced in 11.3. Using Eq. (11.41), we write
E(p)
= ie 2
= ie 2
io
d(
J
J
2(p
d'k
(27r)' [({(p+ k)'
r d(
l
io
711 2 }
4m
+ (1 ()k 2!,
2(p + iI)  4m
2' (12.22)
+ 2(k . p + ((1'"  m 2)]
integration variahle k = k + (1', so that we
d'k
(271")' [k 2
Let us now define a new
can write this as
"'()
. 2
d
L. l' = ,e in
(
o
+ iI) 
d'k
(271")
()p + 271  4m
2 . (12.23)
+ ((1 _ ()p2 _ (m2]
, .2(1 
[k'
At this point, the denominator is evell in k. so we can throw out the
terms in the numerator which arc odd in k. Thus we find that 2:(1')
has the form:
( 12.24)
12.5. General forms for divergent amplitudet>
where
r d Jd k
(1  0
io ( (21f)4Ik2 + (1 _ ()p2 _ (m2f '
2 r
Jd k
1
4mie in d( (21f)4 [k2 + ((1 _ Op2 _ (m
2' 2
a p ) = ,e
( 2
b(p2) =
257
2J2 .
(12.25)
Wc have dropped the bar on k. Of course, the form of these integrals
show that both a and b are divergent. And this is the problem that
we will have to deal with.
What we have done so far is to find the form of the fermion selfenergy from the Iloop diagram. However, a simple argument shows
that no matter which diagram we consider, the general form of the
selfenergy will be given by Eq. (12.24) with suitably defined a and b.
The selfenergy function E(p) will be a 4 x 4 matrix which can depend
only on the 4vector p. Now, any 4 x 4 matrix can be expanded in
terms of the 16 basis matrices introduced in Eq. (4.14), which are
the unit matrix, the four ')'matrices, the six O'p&.Imatrices, t.he four
matrices "I P"l5, and finally "15. Of these, the combinations involving
"15 can never appear in QED because of parity invariance. With a p ",
the only Lorentz covariant combination that one can make using the
vector pP is a pvpppv, which is zero because of the antisymmetry of
the amatrices. Thus we are left with the unit matrix and tbe ~'s
only. With the "IP'S, the Lorentz covariant combination will be "IPpp,
or p, which can appear with a coefficient a, and the unit matrix can
appear with a coefficient b. Both these coefficients must be Lorentz
invariant, and can therefore depend only on p2. This leads us to the
form of Eq. (12.24).
12.5.2
Vacuum polarization
Let us denote the vacuum polarization amplitude by 1f pv (k), where
the external photon momentum is k. The general form for the dia.gram has been shown in Fig. 12.2B. Consider the incoming photon
line at the left. In any diagram, it must first encounter a vertex with
the fermion. Thus, we can schematically represent the diagrams 88
(12.26)
258
Chapter 12. Renormalization
rhe blobs can be all possible combination of internal lines. The
"mion line with the blob is the full fermion propagator, and the
ertex with the blob is the full vertex function.
Therefore, the amplitude can be written as
i1r,,"(k) = 
f (~~4
Tr [ie Q')'" i5F(P) ier"(k
+ P, p)
i5p(k
+ p) J.
(12.27)
If we contract the left hand side of this equation by k" and use the
WardTakahashi identity of Eq. (12.13), we obtain
There are two terms here. In the second one, we can shift the loop
integration variable from p to k + p, so that the two terms cancel
each other and we obtain
( 12.29)
Similarly, starting from the diagrammatic identity
(12.30)
we can show that
(12.31 )
A small remark is in order. We have represented the vacuum
polarization diagram in two ways, viz., in Eqs. (12.26) and (12.30). It
must be understood that these are the most general characterizations
of the vacuum polarization diagrams. For example, we cannot put
a blob at both vertices, because this will double count diagrams like
the one in Fig. 12.4.
The function 1r,,"(k) must satisfy the conditions of Eqs. (12.29)
and (12.31) and can depend only on the 4momentum k. The only
tensor which has these properties is g,,"k 2  k"k". Therefore, the
12.5. General forms for divergent amplitudes
259
'\
/
Figure 12.4: 21oop v~cuum polarization di,;agram which would be double
counted if we put blobs at both vertices of Eq. (12.26) or Eq. (12.30).
most general form for the vacuum polarization tensor will be given
by
(12.32)
where IT(k 2 ) is a Lorentz invariant object. and must therefore be
a function only of k 2 . Eq. (12.32) viudic~tes the comment made
carlier because the amplitude of the 2poiut function with external
photon hnes is proportional to at least two powers of the external
momentum.
o Exercise 12.3 Find the tensor slruct.uff' for the effective .Itph.oton
vertex 7r 11I .\f.{ k 1 , k 2 , k;\. k 4 ). Assume. JlQ.rity in.1JtLrinnce, and use rela.tiona like k\'1f/,v>,p = 0, and the indisttllgui:'lhubility oj ph.clons which
implies lha.t the intnchange between all.J.I t.wo photons should nol
change the l.lertex. Do not a8aUffie. thf' photons to be on~hdl.
12.5.3
Vertex function
Finally, there is the divergence a.':isociated with the vertex function.
The general form of the vertex function, dellotcd by fJ.t! was discussed
in eh. 11. We found that in pure QED. the vert.ex can in general
have two form factors. Of these, the anolllalous magnetic moment
term has an extra power of the external photon momentum in it,
so the degree of divergellce associated with this term should be 1.
In other words, this term should be cOllvergellt, as we saw in the
explicit Iloop calculation of 11.3. The divergellce. therefore, must
he associated with the form factor F l , i.e., with the charge form
factor. This was explicitly shown in 1l.4.
260
Chapter 12. Renormalization
12.6
Regularization of selfenergy diagrams
12.6.1
Vacuum polarization diagram
So far, we have not performed any regularization on the divergent
amplitudes. In this section, let liS show how regularization can be
performed, taking the vacuum polarization diagram as an example.
The lowest order loop diagram for vacuum polarization is shown in
Fig. 12.5.
p
.. ~~
.... k
p
Figure 12.5: Iloop diagram contributing to the vacuum polarization in
QED.
As we mentioned in 12.3, we must introduce a regularization
parameter in the theory in order to perform the integrals. We also
mentioned some of the pos::;ibilitics for the rq~l1larization parameter.
In this section, let us use dimensional regularization. This means
that all the integrals should be performed assuming the number of
spacetime dimensions to be an arbitrary real number N. We will
also assume that the fermion in the loop is electron} so that its charge
is e.
Calculating in N dimensions poses a problem. The Lagrangian
must now have mass dimension N. Using the Dirac Lagrangian, we
find that the mass dimension of a fermion Held is Iv'>J = (N  1)/2.
Similarly, from the Maxwell Lagrangian we find that [A~] = N/21.
Since the interaction term must also have mass dimension N, the
coupling constant of the interaction term seems to have a nonzero
mass dimension. We can take ca.re of this by introducing an arbitrary
constant mass Jl and writing the interaction term as
2l",
= eJ.l'
v'>ltv'>
where e is dimensionless as usual, and we have defined
1
<=2"2 N .
(12.33)
(12.34)
12.6. Regularization of selfenergy diagrams
261
This Jl has the interpretation of a mass scalc, and is usually called
the subtraction point. We shall see that physical amplitudes and
couplings are independent of this parameter.
Using the Feynman rules of QED, we can now write the amplitude
of the vacuum polarization diagram of Fi~. 12.5 in the form
"'~v(k) = 
dNp
[.,
i(p+m}.
i(p+;!+m)]
+ k)2 _ m 2
(27l')N tr 2eJl "I~ p2 _ m 2 teJl "Iv (p
'
(12.35)
where the minus sign in front of the integra! sign comes because of
the closed fermion loop. Also, we now have eJl' instead of e at each
vertex. This gives
introducing the Feynman parameter ( in the last step. Changing the
integration variable p to p + (k, we can write this in the following
form:
where
(12.38)
Let us now look at the trace in the numerator. At this point, the
denominator is already even in the integration variable p, so we need
only to keep the even terms in the numerator. Thus the numerator,
which we will call Nj.lJll can be written as
262
Chapter 12. RenormaIization
Such traces have been performed before in various calculations, but
one should be careful here. Tbe reason is that we are now supposed
to perform the traces for Ndimensional spacetime. So, let us say
that, for Ndimensional spacetime
(12.40)
where dN = 4 when N = 4. Once this is assumed, we can find the
trace of a string of four ,,matrices in the way described in Eq. (A.20)
of tbe Appendix, and obtain
(12.41)
Applying it to the present case, we get
N~"
= dN
[(2p~p"  g~"p2)  (I  ()(2k~k"  g~"k2) + m2g~"]
= 2dN(1  ()(k~k"  g~"k2)
+dN [2p~p" + g~" (_p2
+ a2)]
(12.42)
using the definition of the quantity a 2 from Eq. (12.38).
For four dimensional spacetime, we showed in Eq. (11.76) that
the integral of p~p" is the same as the integral of ~g~"p2. In the same
way, we can argue that in Ndimensional spacetime, the integral of
p~p" is the same as the integral of ~ g~"p2. Therefore, the numerator
can be rewritten as
N~"
= 2dN(1 
()(g~"k2  k~k") + dNg~" [(~ 
1) p2 + a2] .
(12.43)
We can now perform the momentum integration. The general
technique for this has been described in A.4 of the Appendix. Let
us first apply the results on the second term in the expression for the
numerator. The coefficient of dNg~" in this integral is
(12.44)
where
(12.45)
12.6. Regularization of selfenergy diagrams
263
as defined in Eq. (A.45). Borrowing the result of this integration
from Eq. (A.53), we can write the expression of Eq. (12.44) as
i
r'
(4,,)N/2f(~N)f(2) Jo
d(
[(2
)f(I+~N)f(1~N)
N  1
(a2)' N/2
 a
2f(~N)f(2  ~N)]
. (12.46)
(a 2)2N/2
Using now the property f(1 + z) = zf(z) which is valid ror any z,
we can write it as
i
(4,,)N/2
/01 d(
0
:(a02)"11_N
"/M2
[~ (~
 1) r(1 
~N) 
f(2 
~N)]
(12.47)
But the first term inside the square brackets is (1 ~N)r(I ~N) =
f(2  ~N). So the integral vanishes.
Actually, this was only expected. From the general grounds of
gauge invariance, we have already argued in 12.5 that the tensor
structure or ,,"V should be given by Eq. (12.32). And in writing the 1loop expression ror the numerator N"v in Eq. (12.43), we have already
separated out the terms which correspond to thi::; tensor structure.
The remaining tenns must then vanish, which is what we have shown
explicitly. We can think of this as a crosscheck on our implicit
belier that our procedure of regularization oo,'s not o,,,,troy the gauge
invariance of QED. We now go back to the expression in Eq. (12.43)
and put it in the rormula obtained in Eq. (12.37) to write
Np
2 kklJd
d
22'(
k
(k)
""V = 2, Ne I' 9"v   "V (2)N
"
/o
fI
d( 1()2
jp2  a 2 J
(12.48)
The momentum integration can be performed with the help or Eq.
(A.53). This gives
r'
_ 2dNe 21'2'
2
f(E)
1r,w(k)   (41r)N/2 (9"v k  k"k v ) Jo d( ((I  () (a 2)' (12.49)
We can now see why the result diverges for N = 4. For small e,
1
f(e) =   'IE
E
+ O(e),
(12.50)
Chapter 12. Renormalizat;on
264
where "IE is the EulerMascheroni constant which is defined by
"IE = 
foOD dx e% In x =
0.57721 ... ,
(12.51)
and the symbol 0() in Eq. (12.50) represent terms which have at
least one power of , and therefore vanish for = O. Using this
expansion of the rfunction with d N = 4, we can write the dimensiondependent terms of Eq. (12.49) in the form:
1'2<2 rt)
41r
(a
41r
)< =
=
[! _ + exp (In (~))
4~2 [~  [1 dn (4;:2)] +
4"2 [! _ In (~)] +
_1
41r 2
0()]
"IE
41r1'2
0()
"IE]
= _1_
"IE
41rI,2
0().
(12.52)
Putting this back in Eq. (12.49) and using a = e 2/41r, we can now
write
(12.53)
where we have used the shorthands
(12.54)
reinstating the definition of a 2 from Eq. (12.38), and
1
'
 =   "IE
+ In(41r).
(12.55)
In the notation of Eq. (12.32),
n(k 2 )
= _ 2a
7r
[_1 _ ](k 2 )]
6'
[1
a   In
= 3rr ['
= n(O)
(m

/.12
..
2 + ... ]
+ (fimte)k
+ k2 n'(0) +... .
(12.56)
12.6. Regularizatjon of selfenergy diagrams
265
This is as far as we can go with the regularization procedure. We
introduced a regularization parameter Nt or equivalently , as well
as a mass scale J.l, in order to have some control over the infinities.
At the end, we see that the amplitude that we calculated depends
on and 1". This is certainly unphysical, since these are unphysical
parameters which we introduced into the theory. So we need to have
some way of making the amplitude independent of these parameters.
This is the process of renormalization, which will be discussed in
12.8. Before that, we want to take a look at the regularization of
other divergent amplitudes of QED.
o Exercise 12.4 Show that in
N~dimen.sion8,
th.e conlrQ.ctio1\ for
mulas for the Dirac matrices take. the forms
"I' y"
"In""I'
12.6.2
= N = 4  2 ,
= (2
+ 2)"(".
(12.57)
Fermion selfenergy diagram
For the vacuum polarization diagram, we used the dimensional regularization. We could use the same technique here. However, to give
some idea of different regularization techniques, let us employ the
PauliVillars regularization for this case. In this method, we introd uce some extra particle in the theory whose mass is very heavy, to
be denoted by M. This is the regularization parameter, with the
physical limit corresponding to M ~ 00. We suppose that there is
an extra diagram where this heavy particle can replace the photon.
Also, the couplings of this heavy particle is such that the sign of this
extra contribution is opposite to that of the photon diagram. Effectively this means that the denominator of the photon propagator
should be modified. Instead of just k 2 , we should now have
(12.58)
Therefore, instead of Eq. (12.21), we should now write
.2
2/
,e M
(2,,)4 P(k2 _ M2)[(p + k)2 _ m2j .
4
d k
E(p) =
2(p+~)+4m
(12.59)
266
Chapter 12. Renormalization
We can now introduce Feynman parameters for the denominator to
write
d4k
J(k)
(2".)4 k 2(k 2  M2)[(p + k)2 _ m 2 j
= 2!
J(~:~4 fa' fa' fa'
d(,
d(2
d(J J(k)o(1 
~J 
(2  (J) .(12.60)
Here
D = (, Up + k)2  m 2 ) + (2(k 2  M 2 ) + (Jk 2
= k 2 + 2(,k. P + (1(p2  m 2 )  (2 M2
= (k + (,p)2 _ C 2 ,
(12.61)
with
c2 =
(,m 2 + (2M2  (dl  (,)p2,
(12.62)
and we have used (J = 1 (I  (2. Putting this back into Eq. (12:59)
and changing the integration variable from k to k+(tp, we can write
l
H ,
. 2
2
~k
2(1  (!l]l + 4m
E(p) = 2.. M
(2".)4 1 d(, 1
d(2
(k2 _ C 2 )J
.
(12.63)
The parameter (J has been integrated over, and the resulting 8function has been used to change the limits of integration.
The momentum integration is now convergent, and we can eval
uate it using Eq. (A.54). In the notation of Eq. (12.24), we find
2
r'
oM
a =  2;; 1 d(, (1  (!lJ,) ,
0
b = oM m
".
where
J,) =
'(,
10o
10' d(,
0
J,) ,
(12.64)
1
d(2
(1m2
+ (2M2
 (,(I  (,)p2
(12.65)
The integral over (2 in Eq. (12.65) can be performed to give
) 2]
J,) = _1_ In (1  (!lM + (1 m  (,(I  (, p
M2
(, m 2  (, (1 _ (,)p2
'" _1 In [
M2
(1  (!lM
]
(,m 2  (,(I  (,)p2 .
(12.66)
12.7. Counterterms
267
In the last step, we have used the fact that M' is much larger than
all the other mass or momentum scales in the problem. Putting this
back in Eq. (12.64) and writing ( for (I, we obtain
r
[(m'(1_(1()M'
_ ()P']
J d( (1  () In
l
a
a = 2"
b = _ am
"
~, In [(m'  (1  ()P
Jor' ....
(1 _ ()M2
2
] .
(12.67)
We see that the selfenergy E(p) depends on the regularization parameter M through a and b.
o Exercise 12.5
Starting jrom ttl< lir.t expression oj Eq. (12.21),
use dimension.al re.gutariZo.tiofl to f'..\laluo.t.e t.h.e coeJjl.cients a and b
and obtain the results
a =
[ __
I
211'"
b= _
2t:'
~
1r
r' d(I_()ln{(m' (I()P'}]
.!. + .!. + r'd( In {(TIl'  (I  ()P
(12.68)
2 io
+.!.+
2
Jo
J1.2
[_
}] .
(;,
Jl2
[Hint: You must use the contra.ction IornluLnfl of Eq. (12.57).1
12.7
Counterterms
12.7.1
Vacuum polarization diagram
In I2.6.I, we calculated the amplitude for the Iloop diagram for
the vacuum polarization, and found the amplitllde to be dependent
on the regularization parameter. Other divergent amplitudes were
also found to be dependent on the regularization parameter. All this
was obtained by starting from the Lagrangian
1
.sf = 4F~vF~v
+ 1/J
(i"ta~
 m) 1/, + e1/J'Y~1/,A~,
(12.69)
which is the Lagrangian of QED. There is al,o a gauge fixing term,
as discussed in eh. 8, but that is unimportant for the following discus..~;]on.
Let us consider for example the 2photoll amplitude. We can
decompose it into a treelevel amplitude alld the rest, as we did for
the fermion in Eq. (12.14). The treelevel mllplitllde is given by
'VVVVVV'o
= 
(.q~vk'  k"kv) ,
(12.70)
268
Chapter 12. Renormalization
taking into account a symmetry factor of 2 which comes because A",
e.g_, can refer to either of the two external lines. In addition, we
have found the Iloop contribution to the 2photon amplitude in Eq.
(12.53). Adding these two contributions, we can write
'VVVVVV\.
+ ~
 (g~vIc2  Ic~/c") { 1  TI(k 2 )}

(g~"1c2  Ic~/c") {I + 2; [6~' I(k 2)]}
(12.71)
This is the correct expression for the 2photon amplitude at order Q
that comes from the Lagrangian of Eq. (12.69).
The Lagrangian of Eq. (12.69) is the classical Lagrangian. This
was useful in figuring out what the terms in the perturbation series
would look like. But we cannot rule out quantum corrections to
the Lagrangian which vanish as h ~ O. So far, we have ignored h
because of our choice of units. However, h can be thought of as a
loopcounting parameter for the following reason.
The evolution operator given in Eq. (5.33) is dimensionless.
When we restore units, we find a factor of h 1 with every HI. So
in calculating the amplitude of a loop, we will find one power of
h I for each vertex. The free Hamiltonian also picks up a factor of
h I . Since the propagator is the inverse of the differential operator
occurring in the quadratic terms in the Lagrangian) there will be a
power of h for each propagator. So the total power of h for an iloop
amplitude with n internal lines and v vertices is
nv=el ,
(12.72)
where we have used Eq. (6.56). In other words, k.JfeK has a contribution from iloop diagrams which is O(h l ).
Therefore, when we talk about loop amplitudes, we are in fact
talking about terms of higher order in h which vanish as h ~ O. The
full 'quantum' Lagrangian is therefore of the form
(12.73)
where o>f(n) will affect the results at nIoop order. For example,
when we calculated the propagator to 1.100p order in Eq. (12.71),
269
we ought to have included some unspecified 02'(1) in Eq. (12.69).
Let us include such a term, and add its contribution to the result
of Eq. (12.71). This will give the full amplitude at order h. Let us
denote the contribution of 02'(1) diagrammatically by a cross on the
photon line. All we know about 02'(1) is that M2'(I) vanishes as
h ~ O. Other than that we are free to choose its form to be whatever
is convenient for us. Let us suppose its contribution is given by
(12.74)
If we now include this contribution, the total contribution to the
2photon amplitude up to order" is given by
= 
(g~"k2  k~k")
x {1
d( (1()ln (1((1()
~:) }.
(12.75)
There are two things to note here. First, the dependence on the
regularization parameter has vanished. For this reason, this new
term is called a counterterm, i.e., something which counters the dependence of a divergent amplitude on the regularization parameter.
Second, for a physical photon, k 2 = 0, the logarithm term in the integrand vanishes, so that the contribution for the 2photon amplitude
is exactly what we obtained from the tree diagram. These are the two
criteria which led us to choose the precise form of the counterterm
contribution of Eq. (12.74). Formally, we define the renormalized
vacuum polarization through
(12.76)
and fix
nCT by the onshell renormalization prescription
(12.77)
For arbitrary k, the 2point function now becomes
(12.78)
270
Chapter 12. Renormalization
What is the counterterm which will give the contribution of Eq.
(12.74)? This is easy. In fact, we can use a counterterm Lagrangian
which is
.:,(8)
CT
~ !!...
4 3".
(2.
'
1 m
n Jl2
~v
F~v
(12.79)
This, of course, is the proper form for the counterterm only at the
order n. If more complicated diagrams for vacuum polarization are
discussed, including terms of higher order in n, the precise form
for the counterterm Lagrangian will be different. However, since the
divergent contributions to 7rJJV always have the same tensor structure
as in Eq. (12.32), the counterterm will in general be of the form
.st;}P =  ~
(Z3 
I)F~vF~V ,
(12.80)
for some Z3. At Iloop, our choice corresponds to
(1
12.7.2
  In m
'
Jl2
Z3 = 1  n
3".
(12.81)
Fermion selfenergy diagram
In the fermion selfenergyl we found that there is a. divergent part
which goes like p, and there is another one which goes like the unit
matrix. Thus, the divergences in these can be canceled by a counterterm Lagrangian of the form
(12.82)
for some Z2 and Jm. We want this term to counter the divergences
coming from fermion selfenergy. The total inverse propagator would
now be
p m
ER(P) ,
(12.83)
where ER(P) is the sum of the divergent loop contributions and the
counterterms l
(12.84)
Comparing this equation with Eq. (12.82), we see that Z2 I = aCT
and (Z2  I)(m  Jm) = bCT.
12.7. Counterterms
271
We now fix aCT and bCT by the renormalization prescription
(12.85)
This gives
aCT = a(m 2)  2m 2a'(m 2 )  2mb'(m 2 ) ,
bCT = _b(m 2) + 2m 3a'(m 2) + 2m 2b'(m 2) ,
(12.86)
where a', b' are derivatives with respect to p2. After some algebra,
we get
aCT
fa'd( (I() [In CI ~2~~:12) + 2(1 :()] ,
"';' fa'd( [In ((I ~2~~2) + 1 ~ (2] .
(12.87)
=  2:
bCT =
If we now substitute a(p2), b(p2) from Eq. (12.67) and aCT, bCT
from Eq. (12.87), we find that En is independent of the regularization parameter M. Since the M ~ 00 limit gives the ultraviolet
divergences, we can say that after adding tILe countertenns, the ultraviolet divergence has vanished in the 2point. function of Eq. (12.83).
This is not to say that Eq. (12.87) is free from divergences altogether. It involves the integral J~ d(/(, which clearly diverges at the
lower limit. This divergence c"n be traced hack to Eq. (12.23). Near
( = 0, the momentum integral is J d 4 k/k 4 , which diverges at both
the infrared and ultraviolet regions. The COllnterterm has removed
the ultraviolet divergence, but the infrared divergence remains. We
will show in 12.9.3 that this infrared divergeu"e cancels in physical
amplitudes.
The onshell criterion for choosing our counter terms guarantees
that divergent decorations of the extern,,1 (onshell) lines in a IPR
diagram can be safely ignored, because they cancel with the COUI1terterm.
o Exercise 12.6
Supvl~
Ute missing ulgebru b.hu.en Eqs. (12.85)
a.nd (12.87). For arbitrury c:rterfiul m01ltf'nf.um p, fin.d the 2point
function till to order 0 b1J adding the, f.n'('li'.uel l'esult, the. Iloop
result, and the c.ounte.T1.erm.
272
Chapter 12. Renormalization
Exercise 12.7 Use the dimensional regulariza.tion technique, loT'
which the iloop results for the a a.nd the b coefikients no.'Ue been
given in Eq. (12.68). Now choose the counterterms b~ the same
prescription as in Eq. (12.85). Show that alter adding the countertenns. the expression obta.ined 101' the 2point Junction with fermions
\s the. same. as the one obta.ined with the PnuLi ViLla.rs regula.rizo.tion
scheme.
12.7.3
Vertex function
The remaining divergence is in the vertex amplitude. It can be canceled by a counterterm of the form
for some Z!. Notice that although the vertex can have anomalous
magnetic moment kind of terms, those terms are not divergent, so
one does not need any counterterm for them.
The counterterms are related by the WardTakahashi identity
derived in 12.4. To show this, let us ""rite the vertex function for
the electron as
(12.89)
where A~(p, p') is the loop contribution. Taking an infinitesimal q in
Eq. (12.13), we can write the identity in the form
A~(p,p) = 
8E(p)
8~
(12.90)
which is the original form in which the identity was derived by Ward.
In principle, we can introduce counterterms so that this relation is
violated at higher loops. But suppose we design the renormalization prescriptions in such a way that Eq. (12.90) is obeyed for the
corresponding counterterms as well. From Eq. (12.82), we see that
ECT = (Z2  l)p + terms independent of p,
(12.91)
so that
8EcT
8~
= (Z2  Ih~
(12.92)
On the other hand, Eq. (12.88) gives
A;;r = (ZI 
Ih~,
(12.93)
12.8. Full Lagrangian
273
so that we obtain
(12.94)
12.8
Full Lagrangian
We have seen that if we start from the classical Lagrangian and
calculate amplitudes for various physical processes, we encounter infinities. So we need to add counterterms to the classical Lagrangian
of Eq. (12.69). The general form of these counterterms were given
in Eqs. (12.80), (12.82) and (12.88). The full Lagrangian would be
obtained by adding these counterterms to the original Lagrangian.
This gives
where
mB = m  (1 Zi')6m.
(12.96)
Suppose now we define a new set of fields, represented by a script
index B (for 'bare'), by the relations
Ai; =
v% A~,
. 1/IB =
v% 1/1.
(12.97)
What we find is that in terms of these new fields, the full or bare
Lagrangian can be written in the form
'zB =
~FB~vF~v + 1/IB (h~a~ 
mB) 1/IB
+ eB1/IB'Y~1/IBAi;,
(12.98)
where
eB =
Z\
Z,v'Z3
e.
(12.99)
The interesting thing is that the Lagrangian of Eq. (12.98) looks
exactly like the Lagrangian of Eq. (12.69), except with a different
274
Chapter 12. Renormalization
set of fields and a different set of parameters, like rna and ea in
place of rn and e. All we have done is to change the normalization of
the fields, a process which can aptly be called renormalization. We
have not explicitly written down the gauge fixing term, but the same
comment applies to it as well. The parameters rna and ea contain
infinite contributions, but physical quantities like amplitudes depend
on the finite physical parameters rn and e.
Another way of saying the same thing is that, all the counterterms
can be hidden in the definitions of the fields and the parameters in the
full Lagrangian. This is the essence of the renormalization procedure.
We could ask what would happen if we perform all calculations
starting from the bare Lagrangian of Eq. (12.98), or equivalently of
Eq. (12.95). In this case, we will not have any counterterm. The tree
amplitudes for, say, the 2point function of the photon or the fermion
would be infinite in this case, since the bare parameters and bare
fields are infinite. However, once we add the loop diagrams, there
will be divergences coming from them as well. These divergences
will precisely cancel those present in the tree amplitude and we will
cbtain a finite result. These are results which are compared with
experiments.
Since the physical results can be obtained from the full Lagrangian, the bare parameters cannot depend on the renormalization
prescriptions. This may sound odd since we have constructed the full
Lagrangian by adding counterterms to the classical one. However,
the division of the full Lagrangian into a classical part and the counterterms is no more than a choice. For example, in the vacuum
polarization amplitude, we chose the counterterm to cancel the loop
contribution at k 2 = O. We could have chosen some nonzero value of
k 2 to calculate the counterterm without affecting the physical results.
Coming back to Eq. (12.98) we see that after adding counterterms, the operator corresponding to the gauge covariant derivative
is
. Zl
(a).
D"
= 8" 
waAa" = 8"  'Z2 eA" '
(12.100)
where we have used Eqs. (12.97) and (12.99). So we see that as
long as the counterterms are chosen to satisfy the WardTakahashi
identity, which implies Zl = ZJ' the gauge covariant derivative is
unaffected by quantization,
=
In other words, although
D1 ) D".
12.9. Observable effects of renorma/jzatioll
275
we have qlJantized the theory by adding gauge fixing terms, all other
terms obey the gauge symmetry even in the full quantum theory.
We have gone through the procedure for the specific theory
of QED. But the general philosophy gives some insight into all
field theories. For example, we understand why theories in which
some coupling con~tants have negative ma.'SS dimensions are nonrenormalizablc, a. statement that we made earlier. As we discussed,
in such theories, infinite number of amplitudes become divergent.
Thus, to tame these divergences, we would need an infinite number
of counterterms. But the original Lagrangian does not have infinite
number of fields or parameters. Thus there is no way that we could
hide the infinitely many countcrterms in the definitions of fields and
parameters of the theory, and this is equivalent to the statement that
such theories are non~renormalizable.
12.9
Observable effects of renormalization
The renormalization procedure descrihed above might seem only like
a mathematical trick to hide the infinities. It is not so. In fact, it
has observable consequences, some of which we now discuss.
12.9.1
Modification of Coulomb interaction
Classically, the interaction between two point charges is governed
by the inverse square law. A small test charge would scatter off a
fixed point charge according to the Rutherford scattering formula.
We have already seen that treating the electron and the photon field
as quantum objects leads to a modification of the scattering formula
at the lowest order. However, there are also highn order terms in
the Smatrix which contribute to this process, and they have to be
included for highprecision results. The hil\her order diagrams will
require renormalization , i,c" introduction uf c.ountcrterms. But the
numerical value of the counterterm depends on how many orders we
include, and the energy of the particle involved.
The corrections to the photon propagator are shown schematically in Fig. 12.6. We have already calculated this correction at first
order in Q. For small momentum k 2 m.:l, we ca.n take the inverse
276
Chapter 12. ReuormaJizatioD
Figure 12.6: Correction to the static field in the scattering of electron by
.l
nucleus of charge
+Ze.
of Eq. (12.7S) to obtain
(k) = _g~v 1   '" k2 )
~v
k2 (
IS" m
+ gauge
terms.
(12.101)
This will affect the scattering of an electron off a static charge. Sel>arating this propagator into its transver.se, Coulomb and 'remainder'
(gaugedependent) parts as in 8.7, we find that the correction corresponds to replacing the static field of the nucleus by
Ze + Zm 83 (x).
47fT
IS"m 2
(12.102)
Because of the isfunction, this extra term contributes only to the
1 = 0 states of an atom. As a result. it contributes to the energy
difference between the 281 and 2p! levels, called Lamb shift.
12.9.2
Running coupling constant
Let us go back to Eq. (12.99). Since we now know that Ward identity
gives Zl = Z2, we can rewrite it as
e
eB =
(12.103)
,fZJ'
In Eq. (12.81), the expression for Z3 w""' derived assuming that
the counterterms to the photon propagators should vanish onshell.
While defining the electric charge, however we usually use the interaction strength between onshell fermions. As shown in Cb. 9, the
photon cannot be onshell at a QED vertex if the fermions are.
So one needs to put fresh conditions to define the physical charge.
The physical quantities appear in the Lagrangian of Eq. (12.69).
1
12.9. Observable effects of renormalization
277
Suppose someone decides that the quantity e appearing in this Lagrangian is the value of the coupling constant at some specific value
of k 2 , where k is the momentum Aowing through the photon line.
For the interaction term, this person would then choose Z3 in such ~
way that the counterterm cancels the divergent contributions at that
value of k 2
As for the possible values of k 2 , we note one thing. Suppose the
two fermion lines connected with the photon line carry momenta p
and p'. Then k 2 = (p  p')2 = 2(m 2  p . p'). If we consider it in
the rest frame of the initial particle, this gives k 2 = 2m(m  E'),
where E' is the time component of pl. Since E' > m, this quantity
is always negative. And since k 2 is a Lorentz invariant quantity, it
must be negative in every frame.
So let us say we decided that the quantity e appearing in Eq.
(12.69) is the valne of the coupling constant at k 2 = 8, where now
8 is a positive quantity. For this choice, let us write e as e(s), and the
resulting choice of Z3 will be denoted by Z3(S). Looking back at the
sum of the tree contribution and the Iloop divergent contribution
in Eq. (12.71), we now see that we should choose
Z3(8) = 1 20 [  1  1(8) ] .
11'
(12.104)
6/
Thus to order 0 in which these counterterms have been calculated,
this exercise would yield
OB
O(8)
0
= Z3(S)
= o(s) ( 1 +. 3".
[1
"  6/( s)
]) ,
(12.105)
ignoring terms which are higher order in o.
This equation has very interesting consequences. The quantity
OB is defined through eB, which appears in the bare Lagrangian.
SO 0B does not depend on the scale at which we want to define our
physical coupling constant e. However, since Z3(S) depends on s, the
conclusion is that 0(8) must also depend on s. The physical coupling
constant depends on the momentum scale at which the interaction is
taking place, and changes when the scale changes. Usually, this phenomenon is described by saying that the physical coupling constant
.
.
18
runn'lng.
To see this running explicitly, we eliminate the bare coupling OB
from Eq. (12.105). For this, let us consider another person who wants
278
Chapter 12. Renormalization
to define the physical coupling constant appearing in Eq. (12.69) by
the value of the coupling for k 2 = .'. Then this person would obtain
an equation which is obtained by replacing. by.' in Eq. (12.105).
Equating now the two expressions for 0.8, we obtain
a(.) = o(s')
+ 20
[I(s)  I(.')J
(12.106)
"
It is to be noted that, although we are distinguishing between 0(')
and o.(s'), we do not maintain the distinctioH while we write the
2 term. This is because OUf expressions are correct only so far
0.
as the 0(0) corrections are concerned.. If we want to distinguish
the 0(0 2) terms, we need the expressions correct to 0(0 2), which
can be obtained only from a 2loop calculation. As far as our 1loop calculation goes, Eq. (12.106) shows the correct accuracy of our
result. It already shows that a(.) varies with " but to make the
nature of the variation explicit, let us consider two interesting limits.
Case 1: s m 2 , Sf m 2 : In this case, we can neglect m 2 in the
definition of I(.) in Eq. (12.54) and get
r'
I(s)"'Jod(I()ln
=
((I()S)
1'2
~ In (;2) + 10' d(1 
(12.107)
() In ((1  ())
Putting this back into Eq. (12.106), we obt"in
I
0:
0(') = 0(' ) + 3"2 In
()
s'
(12.108)
Notice that the effective (k increases with increasing values of s. This
is why higher order diagrams are needed at higher energies in order
to obtain the same precision.
Case 2:
. m2, .' =
0:
In this case we need the value of I(O),
which is
I(O) = Jor' d(1 
(mIl'
2
() In
(mIl'
2
= il1 ln
(12.109)
12.9. Observable effects of renormalization
279
Therefore,
I( s)  1(0)
= ~ In (~2) +
=
d(
~ [In (~2)  ~]
W  () In (((1 
())
(12.110)
Thus,
a(3) = 0(0)
+ ;;
[In
(~2) ~]
(12.111)
At s = (SOGeV)2, corresponding to the mass of the Wb06On,
taking m to be the mass of the electron, and using a(O) = 1/137.03,
we find a(3) = 1/134.7 from this formula. In our calculations we
included only the contribution for electron loops. But there are other
charged particles with masses below 80 GeV, which include the muon
(106 MeV) and the tau (1.78 GeV). There are also the quarks u
(5MeV), d (9 MeV), 3 (170 MeV), c (1.4 GeV) and b (4.4 GeV) below
80GeV. The masses shown here are upper limits of their estimates.
Loops involving them will also contribute to the calculation. The
effective coupling constant at 80 GeV turns out to be about 1/128.
This is the value that should be used in calculations of QED at
energies around 80 GeV.
o Exercise 12.8
Use the mQ.8ses of tne lermions gi1Jen aboue to calcul"te a "t s = (BOGeV)'.
.
12.9.3
Cancellation of infrared divergences
In 9.1O, we discussed the cross section for the Bremsstrahlung prer
cess in the field of a heavy static charge. We showed that the differential cross section calculated from the lowest order diagram suffers
from an infrared divergence. This would mean that we obtain an
infinite number of very low energy photons in the radiation.
The resolution of this problem is a vindication of quantum field
theory. Quantum physics, roughly speaking, is all about measurements. When making experimental observations on Bremsstrahlung,
we can set up an experiment that records the final electron as well as
the final photon. However, no matter how good our detector is, some
very low energy photons will always go through without detection.
280
Chapter 12. Renormalization
In other words, a process with no photon emission is experimentally
indistinguishable from a process which emits a very low energy photon. Therefore in our calculations of low energy photon emission, we
should include processes where no photon is emitted.
Such a process was already shown in Fig. 9.8, and its Feynman
amplitude was denoted by .A'o in 9.1O. However, it cannot cancel
the infrared divergence of the process with soft photon emission
by itself for two reasons. First,.A'o does not have any infrared
divergence. Second, it has one less power of the coupling constant
e compared to the Feynman amplitude for Bremsstrahlung, as seen
from Eq. (9.136).
p'
(a)
tq=p'_p
+Ze
pi
(b)
+Ze
Figure 12.7: These processes cannot be distinguished from Bremsstrahlung
with a soft final photon. so they must be included. The first one is a loop cor
reetien to the vertex (unction, the second one involves the vertex counterterm.
We now consider the next higher order term for the scattering
process with no photon emission, shown in Fig. 12.7. The amplitude
of these diagrams are O(e'.A'o). When we add this with the lowest
order amplitude .A'o and square it, the cross term will have the same
power of e as the square of the Bremsstrahlung amplitude. We now
show that this cross term wiJr also be infrared divergent, and the
two divergences exactly cancel each other.
We start with Fig. 12.7a. The expression for the vertex function
is given in Eq. (11.38). For soft photons k" "" 0, so we neglect all
occurrences of k in the numerator. Simplifying the denominator with
the onshell conditions p' = pi' = m' and putting in the if terms in
12.9. Observable effects of renormaJization
281
the propagators we can write it as
j
rfl) = ie2
d'k
'Y,,(i/ + mh~(p + mh"
(2rr)' [21'" k + if] [2p k + if] [k 2 + iEJ
JIR
+ ...
.
(12.112)
In this formula, the integral is not over all values of k, but rather only
on small values, indicated by the subscript 'rR' on the integration
sign. As in 9.1O, the dots stand for the contribution from other
values of k.
Using the contraction formulas of Eq. (4.42) and remembering
that this entire expression is sandwiched between u(p') and u(p), the
numerator can be written as 4p pl"Yw So we obtain
rfl)
~
= ie 2 p. P''Y
1
IR
d'k
1
1
(2rr)4 Poko  p . k + if Pok o  P . k + iE
x
k5 
k2
+ if
+ ...
(12.113)
There are four poles of k o in the integrand. Let us choose the contour
of integration to enclose the upper half plane, so that it will cbntain
only one of them, viz., ko = w + if, where w = k. The result of the
kointegration will be 2rri times the residue at this pole, so
fl)
r~ = e 'Y~
1
IR
d k
p' p'
(2)3
rr (I" . k)(p k)( 2w)
+ ... ,
(12.114)
where k~ = (w, k). Changing now the integration variables from
k i to _k i , we obtain
rfl) _ a "I
~  (2rr)2 ~
where now
then
k~
rdk
J w
3
IR
p' p'
(I'" k)(p . k)
+ ...
,
(12.115)
= (w, k). The Feynman amplitude for Fig. 12.7" is
(')[_'rfll]
()A~ (2rr)2A<O
a
,;
up
te ~ up e 
rdk
3
JIR ;;;
p.p'
(p'. k)(p k)
+ ....
(12.116)
We now need to add the counterterm diagram given in Fig. 12.7b.
The vertex counterterm is simply r~ for q = 0, i.e., p = p'. Putting
282
Chapter 12. Renormalization
p = p' in Eq. (12.116), we obtain the infrared divergent part of the
Feynman amplitude of this diagram to be
d k
p2
,;;;o.A
.,.!'"'"'"
{2,,)2 o IR w (p. k)2 .
o
(12.117)
Performing the angular integrations, it is easy to see that the result
does not depend on p, so we can write it equivalently as
3
o .Ao
_d_k [ p2
2{2,,)2
1lR w (p.k)2
+ .,....;p'...,.2;oJ
(12.118)
{p'k)2
Adding the contributions of Eqs. (12.1I6) and (12.1I8) with that
of the lowest order scattering diagram, we obtain the amplitude for
diagrams with no photon emission to be
.A,=[1
o
2{2,,)2
1lR
3
d k (p
w
pk
 p')2
 + ... J.Ao.
p'k
(12.119)
When we square this, the cross term exactly cancels the infrared
divergent contribution in the Bremsstrahlung amplitude given in Eq.
(9.138). Of course the 0(02) term appearing in l.A l2 is infrared
divergent as well. But to see the cancellation of that, we need to
consider terms of higher order in 0 for the Bremsstrahlung amplitude..
The part of the integral written as dots will contribute to finite 1loop corrections, and to cancellation of ultraviolet divergences, both
of which we have already seen.
The counterterms were introduced to eliminate the ultraviolet
divergences. But we now see that they are required to cancel the
infrared divergences. We have used only the vertex counterterm for
this calculation. This is because the fermions are onshell, therefore
fermion selfenergy counterterms cancel exactly with the selfenergy
loop diagrams. The photon is not onshell here, but its counterterm
has no infrared divergence and so is irrelevant for this calculation.
We have done the analysis for scattering from an external source
and the associated Bremsstrahlung. But a similar analysis can be
performed for the general situation where the infrared divergence of
soft photon emission from external charged particles in any process is
canceled by the infrared divergence of soft virtual photon exchange
between external lines. Further it can be shown that the cancellation
is effective at all orders of perturbation theory.
Chapter 13
Symmetries and symmetry
breaking
At various points in this book, we have talkcd about different kinds
of symmetries in field theories. Symmetries have important consequences on any quantum theory. They can rd"tc masses or energies
of different states, or scattering cross sections of different processes.
For field theories in general, we discussed an important consequence
of continuous symmetries in Ch. 2, viz., til<' Noethcr theorem. In
this chapter, we will discuss some other general is...m es pertaining to
symmetries in quantum field theories.
13.1
Classification of symmetries
Symmetrif'~~
can be spacetime symmetries or interna.l symmetries.
In the first case, the symmetry transformations connect fields at different spacetime points in general, as for example in the case of
Lorentz trallsformatioJls given in Cil. 2. In t.he other, the transformations connect differcnt ficlds, or diffcrcnt. componcnt.s of the same
fip.lrl, at the same spacet.ime point.. An example is t.lle gA,l1gP. t.ransformation of QED given in Ch. 9.
Symmetries can be discrete, where the parameter for the symmetry transformation can take only discrete valllcs. Examples of such
symmetries are parity, time reversal 1 charge conjugation 1 as discussed
in Ch. 10. On the other hand, symmetries call also be continuous 1
where the relevant parameter (or paramcters) ("\11 take continuous
values. Example of such symmetries arc invariauces of various free
283
284
Chapter 13. Symmetries and symmetry breaking
Lagrangians under phase rotations of the fields involved.
Among continuous symmetries, we have discussed two impor
tant classes. Symmetries tan be global, where the transformation
parameters are independent of spacetime. The phase symmetries
mentioned above are of this type. On the other hand, the transforITlation parameters can also depend on spacetime, as we saw in the
case of QED. Such symmetries are called local or gauge symmetries.
As discussed in Ch. 9, such symmetries require the presence of some
new particles in the theory, called gauge bosons in general.
13.2
Groups and symmetries
Associated with every symmetry is a mathematical object known as
a symmetry group. In order to gain an understanding of symmetries ,
we will need to learn some group theory. Our discussion will by
no means be complete or even rigorous. Interested readers should
consult the any textbook on group theory and its applications in
physics.
13.2.1
Symmetry group
A group is a set of 'elements' in which two clements can be combined.
by some operation satisfying four properties described below. The
operation in general is called group multiplication, although it may
not have anything to do with ordinary multiplication of numbers.
We will denote this operation by a small circle (0) and the elements
of the group by x, y, etc. The operation should have the following
properties:
I. The 'product' x a y must belong to the group for all elements
x,y.
2. There must bc an clement e in the group such that e 0 x x 0 e = x for each element x. This e is called the identity.
3. Each element x must possess an inverse in the group, i.e., there
must exist another element called XI such that x 0 xI =
xlox = e.
4. The group multiplication must be associative, i.e., x
(x 0 y) 0 z for all elements x, y, z.
(y 0 z)
13.2.
Groups and symmetries
285
Exercise 13.1 Sh.OUl tha.t thf, jotlowin!) Ret, of f'lf'.ments tann a
grou'P untll':.l' th.e 8peci.ficd l:group multivliclltiOll' opnation. IdentiJy the ide.nlily
demenl.
ch',HH'.nl in. cILch
case.,
(lwt
tlu:
in:IJI'.r:'\f,
oj u typicul
It) Inlegt'.l':'\ undt'.r Itrldiboll oj nUmbl'.l':;;
b) ;\oTtl.no reaL lLltmbH8 tinneT m,ultiphcul'i.llll. of ltumbcn;
e) CompLex numbcrl:i
pkr n.Hmbcr3;
(J!
the
f0l'111 C
ifl
HIl/ln
lTHLlt.i.vhr,ution. of r,om,
71rl:iln('.n~ionnl SqUfLl'('. mo.tricf'i' with 'l\.fJlI;:cro (letl'rlni1'lllnl under ll\rtlrix 1l1,llltipLieutioll;
f') ndilll.f..l\~ionul unilcLrll mnlric.('~ Illl.Ol'l" lTwtrl:X: multiplication.
d)
Exercise 13.2 Show thnl the folhnl1iHg ~wt of d(,lTtt'ltlli do nnt form
u group lilt/le.,.. the ll)l('cifir.d "yrou'P lrl1l1til'linllion" Opt'.Tution.
0.) POMitiur intege.rs lLlllLe.T nrlrLilion of
1lItlrtb('l'H;
b) All inlt'.gt'xs unlit'.r mulliphcutioll of lll11n.!lns;
c.) Alt
1ldimt:n~ionnl
squarc. tn.alriel's "tllult.'.1" mnlri:t multipliCll
lion.
Shl)~ll t.hut the. Lorentz t.rnn;;foTTlllLtion llUlt.rices ~(Ll~
isfying lh.e COflilition ill Eq. (1.28) for1ll n ~jl'OU1) 1I1l(lt.r Hlnh'ix. multiplic.o.tion.
o Exercise 13.3
The symmetries of a phy::dcal s.Y~telll provide lIseful constraints
on the possible forms of its Lagrangian. A s.vnllnctry operation is
something that leaves thp. action ullchanged. It. is lIot very difficult
to see that all sYlIlmetry operations form a group. The combination of two such operations will'also not. dli\IJge the action and will
therefore he a symmetry operation. Sec.ondl.y, there is an identitYI
which is the operation of simply doing nothillg r.o the action. The
inverse clement of any operation is just undoing tht~ operatioll. Anu
finally, t.lw product of three such operations means performing the
oper;t.tions in a given order. Since each of them leaves t.he action unchanged. the result cannot depenrl 011 how they may be perceived to
be combined together. III ot.her worrl:>, the action remains uIlchanged
irrespective of how tlwy are combilled together. Thus. symmetry operations satisfy all the conditions for a group. This group is called
the symmel7'V group of the theory. The sYlJIlJletry grollp includes the
Poincare group in a relativistically covarilwt theory. This invariallce
leads to the conservation of 4momcntUTTl and angular momentum,
as rliscussed in 2.4. in addition, the sYlIlmetry group call include
internal symmetries, on which we shall focus ill ellis chapter.
286
Chapter 13. Symmetries and .symmetry breaking
13.2.2
Examples of continuous symmetry groups
As an example. let us go back to the phase freedom of the free Dirac
Lagrangian. The symmdry operations are given by
(13.1)
as mentioned in Eq. (4.91). Thus the element.s of tI", symmetry group
in this case are complex numbers of the form e iq8 , and two such
elements combine by the law of multiplication of complex numbers.
Alternatively, these elements can be viewed H.'i Idimensional unitary
matrices, which combine by the law of multiplication of matrices. For
this reason, this symmetry group is denotcd by U(I) . the letter
IU' for unitary matrices, and the number ill the parenthesis for the
dimension of the matrices. For the free Lagrangian, this is a global
internal symmetry. We ~an thus say that the free Dirac Lagrangian
in invariant under a global U(I) symmetry.
In QED, we still have t.he same symmetry, except that the parameters f) can depend on spacetime. This is lIsna.lly encoded in
the statement that the QED Lagrangian ha.s a local U(I) symmetry.
This is also an internal symmetry.
o Exercise 13.4 Ttle orttlogonnl group in two dim.r:nsions is denoted
by 0(2). The trnnsjormo.tions are given b1)
yl
( X')
(C?sO .inO)
smB
cosB
(.r)
.
y
(132)
ShoUl that the.se transformations ure Qlll\)/tll'nt to U (1) tro.nsforma.tiona on the object z =
iy.
One can think of larger symmetries as well. For example, consider
the free Lagrangian of two Dirac fields:
(13.3)
Of course this is invariant under independent pha.'ic rotations of the
two fields, i.e., two independent U(l) transformations. This symmetry is called U(I) x U(I). However, in the special case of Tn, = m2,
the symmetry is even larger. This is best seen by constructing the
object
(13.4)
13.2. Groups and symmetries
Here,
VJl
and
1/J2
287
are independent spinors, meaning that the compo
nents of,p\ do not mix with those of ,p2 under Lorentz transformations. However. here we will be considering an internal symmetry
transformation in which the elements of two different spinors will
mix with each other. For this purpose, it is sufficient to suppress the
component structure of each spinoT and treat each of them as one
single object. The object V can then justifiably be called a doublet.
Using this doublet, we can cast the Lagrangian of Eq. (13.3) into the
form
!L' =
I{J
(i{.! m) V,
(13.5)
where the thing in the parenthesis is to be understood to be a multiple
of a 2 )( 2 unit matrix, and m = ml = m2.
Now consider the following transformation on 'if which changes
I{J to V', gi ven by
V' = Uv ,
(13.6)
where U is a 2 x 2 unitary matrix. This will keep the Lagrangian of
Eq. (13.5) invariant. We have already introduced the group of n x n
unitary matrices in Ex. 13.1 (p285). For a general n, this group is
called the Urn) group. The Lagrangian of Eq. (13.5) thus has a U(2)
global symmetry.
Exercise 13.5 Argue that the free Lagrangia.n of n Dira.c fields ha.s
a. U (n) 81:lmmetry ij the mQ.sses .oj nll the. JcnTLions nre equal.
It is convenient to think of the group U(2) as a product of two
subgroups. Any unitary matrix can be written as a product of two
objects, 8. unitary matrix with determinant 1 and an overall phase.
Both these parts form a group among thcmselves. The second of
these is the group of phase transformations discussed earlier. which
is called U( 1). The first group is called SU(2), where the'S' stands for
'special" indicating the unit determinant. Thus U(2) can be written
as SU(2) x U(I), and in general U(n) as SU(n) x U(I).
If we encounter the situation when m = 0 in Eq. (13.5), the
symmetry is even larger. Because now the U(2) symmetry described
above is still there, but in addition one can make the transformations
defined by
V' =
V'Y51{J ,
(13.7)
Chapter 13. Symmetries and symmetry breaking
288
where V is any 2 x 2 unitary matrix, independent of the matrix U
defined in Eq. (13.6). Thus, the symmetry group now is U(2) x U(2),
i.e., SU(2) x SU(2) x U(I) x U(I).
Such larger symmetries can exist even in the presence of interactions. As an example, we can consider the original theory of Yukawa,
which deals with the interaction of nucleons (i.e., neutrons and protons) with pions. The nucleons can be thought of as a doublet, much
like the one introduced in Eq. (13.4):
(13.8)
"'n
"'p
where
and
denote the proton and the neutron fields respectively. There are three pion fields, with charges +1, 0 and I in
units of proton charge. To include them, consider a triplet of real
scalar fields:
(13.9)
Now consider the following Lagrangian, which includes the free Lagrangian of these fields, as well as their interactions:
~ =
IlJ (il!
m) IlJ
I
1 2
2(8.4. (8"4  2/" 4>.4>
I
2
4,,(4)4>) +ig'lJl'sTallJ4'a,
(13.10)
where T a are the usual Pauli matrices. The last term includes interactions like ig'''pI'S'''n(4), + ;4>2). Looking at this term, we see that
(4), +i4>2) should annihilate a positively charged particle. We identify
this with ,,+. The properly normalized pion fields arc
I
,,+ = 112(4)1 + i4>2) ,
,, = )2(4)1  i4>2) ,
,,0 = 4>3 (13.11)
The Lagrangian is invariant under the U(I) symmetry of electromagnetism
(13.12)
In addition, this Lagrangian is invariant under a global U(I) symmetry whose transformations are defined by
(13.13)
13.2.
289
Groups and symmetries
Using a more compact
notation~
+
one can
writ~
eiLJO X,
(13.14)
where X is any field. and B = 1 for the nucleons 'Uld ",ern for the pions. This B is usually called baryon numlw,., ",,,I the U(l) symmetry
the baryon number symmetry.
In addition to this symmetry, OIlt' call ch('ck that the Lagrangian
of Eq. (13.10) is also invariant IInder til(' J\lohal SU(2) transformations defined by
where the
by
Tn ':;
(iii" ~ )
IjI
exp
1>
exp(i(J"T,,)1>.
are the Pa.uli matrices. and
T2 =
i)
0 0
0 00
( i 0 0
IjI ,
(13.15)
tht~ lIlat.ri~f~s
T.=
To
Rre
OiO)
00 .
(o 0
i
given
(13.16)
This 5U(2) symmetry is called isospin .<>ymmd.ry. Notice tha.t this
symmetry requires that the mn..'iSCS of the lH'utrlHl ,llle! the proton be
equal, and the': rnu..'iSCS of the three pion:.; To b<' equal a~ well. This
is only roughly true in rea.lity, since the llt.'lILrOIl alld proton masses
differ by only about 0.14%, while the mass of ,,0 differs from that
of Jr by about 3.5%. This makes the syfTlll;dry all approximate
symmetry, which will be discussed in 13.3.
Exercise 13.6 Show tha.t the matrix f'Xp (iI3" 1) \ ulhich induce~
the transformation on the nucleon jietds, hax ctf':lerminant equal to
unity for a.nll ua.luc oj Uu'. purumeters (:J".
Exercise 13.7 Write. the. inji.nitf.!\imal fonns of the lso!lpin trans
jormulion, oj Eq. (13.15). Show thut the Logrcmgiun oj Eq. (13.10)
if! indeEd inuuTiunl under these inji.nitf'.Rimut tra.nsjollnations. IHint:
You musl disrc.go.rd all tenns oj Ordl'.T 1:3 2 .J
Construe.t the current genera.ted by the il'\lIf'pin IT(m~fornt(ltion a.nd
\lcrifll tha.t lhelJ nre conserued by using t.he equations oj motion.
13:2.3
Generators of continuous groups
The number of elements in any continuous group is infinite. But they
can be represented by a finite number of continuous JJararneters. For
290
Chapter 13. Symmetries and symmetry breaking
example, if we consider the U(I) transformation mentioned in Eq.
(13.1), the parameter is 0, which can take continuous values. For the
SU(2) transformation shown in Eq. (13.15), we have three parameters
f3.
Exercise 13.8 A general n x n comple,x ma.trix has 2n 2 Tool puTameteT8. Show that if this matm is unitary, th.e n.umbe_t' of inde'Pendent real "pa.ra.meter~ is n 2 . If th.e determil\.o:nt is 1.mity, the numbe,..
1.8n 2 1.
For any group G, we can write a general element as
U(f3" (3" ... ,f3,.), where r is the total number of parameters needed
to specify a general group element. One call always choose these
parameters in such a wa.y that all of them are zero for the identity
element of the group. The generators of the group are then defined
by
(13.17)
This rneans that the infinitesimal elements of the group can be writ
ten as
(13.18)
using the summation convention for the index ll.. For 1\ class of groups
called Lie groups which includes the unitary groups that we have been
discussing, this implies that the general group clement is given by
U = exp (if3.T.) ,
(13.19)
where the f3o's need Ilot be small. Obviously, the choices for T.
are not unique. Que can always define some linearly independent
combinations of the /3a IS to be the independent parameters, in which
case the generators will also be linear combinations of the Ta's given
above.
Exercise 13.9 Sh.ow that the generators Rl>an
(l
11edor apa.ce.
A group is defined by the multiplication of all pairs of elements.
Thus in order to reproduce the correct multiplication, the generators
T. must satisfy certain properties. For SU(2), for example, the To's
do not commute, as seen from E'l. (13.16). So the multiplication of
13.2. Groups and symmetries
291
two group elements will be given by the BakerCampbellHausdorff
formula:
eAe
= exp
(A + B + ~[A,BJ1
+ 12 (fA, [A, BJJ  [B, [A, BJI) + ... ).
(13.20)
Therefore it is clear that once the commutator of the generators
is known, one can evaluate the multiplication of any pair of group
elements. For the SU(2) group, the generators can be chosen in such
a way that the commutators are given by
(13.21)
where falx denotes the completely antisymmctric symbol with <123 =
+1.
If we want to discuss other groups, e.g., any SU(n) group, most
of the previous discussion remains unchanged. Only the number of
generators is different from 3. For SU(n) it is n 2  1, as indicated
in Ex. 13.8 (p290). Otherwise, Eq. (13.19) is still valid, with the
implied sum over the index a running from a = 1 to n 2  1. The
commutator of the generators also have a rnore general form than
that given in Eq. (13.21), viz.:
(13.22)
where Jabe are called the structure constants of the group. Obviously,
Jabe =  Jba,' Moreover, the generators can be chosen in a way that
the structure constants are completely antisymmetric in the indices,
but the values depend on the specific group lluder consideration. For
the group SU(2), Jab< = <abe as shown in Eq. (13.21). The collection
of all commutation relations of the generator; is called the algebm of
the group. If all structure constants vanish for a group, it implies that
group multiplication is commutative. In this case, the group is called
a commutative groupl or more commonly, an Abelian group. If group
multiplication does not commute in general, at least some structure
constants will be nODz,ero, and the group is noncommutative or
non A be/ian.
202
13.2.4
Chapter 13. Symmetries find symmetry breaking
Representations
A representation of a group is a set of linear transformations or matrices R(x) defined for all elements x belonging t.o the group and
having the property
R{l:)R(y) = R(x
y).
(13.23)
where the left hand side is obtained hy uSllftl matrix multiplication.
The representation is said to be of the same dimellsion as these matrices. In a given representation! the generators can be chosen to be
matrices of the same dimensionality such that Eq. (13.22) is satisfied.
The group SU(n), for example, forms a subspace of the space
of all n x n matrices. In an abstract way. the group SU(n) can be
defined to be any group whose elements have a one to one correspondence with the n x n matrices of unit determinant in such a way
that the group multiplication law is satbfied. However, this definition itself implies that we can have a representation of the abstract
group SU(n) in terms of Tt x n matrices. This representation is called
the fundamental representation of the SU(n) group. In this representation, the generators are also n x n matrices and they satisfy the
algebra of the group. Usually, they are chosen to satisfy the relation
(13.24)
It can be easily checked that for SU(2), tile matrices T a = T a /2 satisfy
this condition.
The fundamental representation is by lIO means the only representation for SU(n). For example, for any group there is always a
idimensional representation in which all tile elements of the group
are represented by the number (or the I x 1 matrix) 1. Obviously
this satisfies Eq. (13.23). The generators ill this representation are
given by T a = 0 for all a. This is called the singlet representation.
Another important representation ean be defined through the
structure constants of the group. To see t.his, let. us define the matrices Ta by the following matrix elements:
(Tc)ab
= 
ifabe'
(13.25)
Obviously, the dimensionality of these matrices is equal to the number of generators in the group, Le., n 2  1 for SU(n). One can now
13.3. Approximate symmetries
293
check that these matrices indeed satisfy the algehra of the group. The
representation so defined is called the adjoint rep7'sentation of the
group. There are, of course, many other representations of a group.
But in this hook, we will need only the one~ mentioned. above.
Representations determine how fields trallsform under the symmetry groop. For example, in Eq. (13.15), the SU(2) doublet W transforms in the fundamental representation and the triplet cf> transforms
in the adjoint representation. In general, the tniIlsformation of an
nplet is obtained by using; the ndimellsiutlal representation of the
group.
o Exercise 13.10 Vel'ijy the .luc.obi inentit1j,
[Ta [T,,1:tL + [T,.. iT" T"LL + [T" [T", T"I_L
= 0,
(13.26)
which is sulisped beCnlLf\e the multiplication. of the generators is (LSsoc.iatiue. U lie. this to ueT'i!'y th,a.t the matrl.CI'.S dc.jinen in Eq. (13.25)
indeed ~ntisfy th'o nlgebra giuen in Eq. (1 ~1. 22).
13.3
Approximate symmetries
Symmetries have importa.nt consequences 011 field theories, some of
which were mentioned at the beginning of this chapter. For example,
if a number of fields transform like a multiplet under a given symmetry, the quanta of those field::; should havl~ the same mass, as we
showed in our examples.
Sometimes real data St.'effiS to'show the contrary. For example, in
13.2.3, we introduced the iso.>pin symmetry uuder which the proton
and the neutron form a doublet. This should have implied that the
proton and neutron have the same mass, a,s we saw in Eq. (13.10).
But in reality, their masses are not the same.
The contradiction is resolved if we realize tha.t isospin is a symmetry of the Lagrangian written in Eq. (13.10), but that is not the
complete Lagrangian to describe the physical properties of proton
and neutron. We have left out other interactions of these particles.
For example, we know that the proton has an electric charge. Therefore, it should interact with the photon field. This means that if we
want to write down a Lagrangian that indudeti all physical propcr~
ties of the proton and the neutron, we should at IC:l.:':it augment that
Lagrangian of Eq. (13.10) by the following t"rm:
.'L" = e1/Jpf ~ljJpA~,
(13.27)
294
Chapter 13. Symmetries and symmetry breaking
which is the effective electromagnetic interaction of the proton. This
involves the charge form factor as well as the anomalous magnetic
moment, as we saw in Eq. (11.85). The neutron, on the other hand,
is electrically neutral and therefore does not have any charge form
factor. Although there can be other interactions involving the neutron and the proton, Eq. (13.27) is sufficient for the purpose of the
present argument. This term in the Lagrangian is not invariant under the isospin symmetry introduced in Eq. (13.15). In other words,
the isospin symmetry is not a symmetry of the complete Lagrangian
that can describe interactions of protons and neutrons. What is the
point of talking about it then?
The point is that the coupling 9 that appears in Eq. (13.1O) is
much larger than the value of e that appears in Eq. (13.27). So,
if we consider a process which gets contributions from both types
of coupling, the contribution from the coupling 9 will be larger in
general. In that sense, we can think of the Lagrangian of Eq. (13.27)
to be a perturbation on top of that of Eq. (13.1O). Isospin symmetry
is a symmetry of Eq. (13.10). The term in Eq. (13.27) then breaks
this symmetry. However, since the effect of breaking is "small" in
the sense described above, isospin symmet.ry can still be called an
l'approxirnate symmetri1 of the Lagrangian.
The label "approximate symmetry" can also refer to some discrete symmet.ries. For example, parity is a symmetry of t.he QED
Lagrangian, as we discussed in Ch. 10. However, if we introduce effects of weak intera.ction, the Lagrangian contains terms which do not
respect parity invariance. But weak interactions, as the name indicates, are weak compared to electromagnetic interactions. So parity
violating effects are small compared to parity conserving ones. In
this sense, parity is an approximate symmetry. The same is true of
charge conjugation and time reversal.
13.4
Spontaneous breaking of symmetries
In the last section, we discussed one rea."iOIl why the consequences
of a symmetry may not be realized in t.he physical world, viz., that
there might be some terms in the Lagrangian which do not obey this
symmetry. There is another interesting phenomenon which can lead
to :;imilar consequences even though the Lagrangian in fact obeys
13.4. Spontaneous breaking of symmetries
295
the symmetry. This phenomenon is called spontaneous symmetry
breaking, which we will discuss in this section with some examples.
13.4.1
Discrete symmetry
Consider a theory with only one real scalar field, governed by the
Lagrangian
(13.28)
Clearly, this is symmetric under the following transformation
4>~
4>,
(13.29)
which produces a discrete symmetry group. This has some interesting consequences. For example, if we treat 4> as a quantum field, the
symmetry says that processes involving odd number of quanta of this
field should not take place. This statement has a loophole which we
will now investigate.
Using the usual procedure elaborated in connection with Eq.
(3.7), we obtain
' =
~ (TI 2 + (V4>f + Jl24>2) + ~>'4>4.
( 13.30)
where TI is the canonical momentum, given by TI = 4>. For the
moment, let us forget about quantum physics and think of this as
the Hamiltonian of a classical field. We can now ask what is the
value of the field 4> for which the field has minimum energy.
Clearly, the terms involving derivatives of 4> are nonnegative, so
the minimum value for them is zero. In other words, in the ground
state, the classical field must be a constant over all spac&time. Thus,
finding the minimum of energy reduces to the problem of finding the
minimum of the nonderivative terms, which are given by
V(4)) =
~Jl24>2 + ~>'4>4.
(13.31 )
This function is usually called the potential in field theory parlance.
Since in Ch. 1, we argued that the concept of potentials is untenable
in relativistic field theories, the introduction of this name may cause
some concern. However, the potential defined in Eq. (13.31) is not
296
Chapter 13. Symmetries and symmetry breaking
the potential in the sense of nonrelativistic mechanics. For example,
the gradient of the function in Eq. (13.31) does not give the force. It
is a different kind of physical quantity for which the same name has
become conventional.
In any case, at the extrema of the function V (4)), the classical
field 4>cl satisfies the condition
(13.32)
There are three solutions to this equation. To determine which one
really corresponds to the minimum, we need some more information
about the parameters involved.
Let us first discuss the parameter oX. Of course oX has to be real
because the Hamiltonian has to be hermitian (or real, if we insist
on a purely classical vocabulary). For very large values of the field,
the potential, and therefore the Hamiltonian, is dominated by 'the
oX4>~1 term. Thus, if oX < 0, the value of V(4)) becomes smaller and
smaller (more and more negative) as 4>cl increases, and the classical
potential does not have any minimum. This will be a system without
a ground state, which indicates an unstable system. In other words,
the existence of a ground state demands that.
oX > 0.
(13.33)
Now, what about p.2? Again, it has to be real. Normally, we
would think of p.2 as mass squared when we quantize. But classically
p.2 can be either positive or negative. If p.2 > 0, the pot.ential function
looks like Fig. 13.la ann 4>d = gives its minimum. This is not the
interesting case for us in the present context.
(a)
Figure 13.1: The shape of V(<i for (a)
(b)
1"
> O. (b) J." < O.
13.4. Spontaneous breaking of symmetries
297
If /12 < 0, the potential function looks like Fig. 13.1b and the real
minima are obtained at
1>c1
hii
VT
(13.34)
Now we have two minima, and they are degenerate, as can be seen
by plugging these solutions into Eq. (13.31). The classical system
is now as likely to be in one of these minima as in the other. If we
have a classical system, it is either in the minimum with the positive
square root, or in the one with the negative square root.
Let us now turn to quantum field theory. The classical field now
should be interpreted as the expectation value of the field 1> in the
ground state, or the vacuum. This is caJled the vacuum expectation
value of the field, or VEV for short. Everything that we have said
so far about the classical field will now he valid about the VEV of
the field 1>. Thus if /12 < 0, the VEV of the field 1> can be either of
the two solutions given in Eq. (13.34). It does not matter which one,
but let us call this value v:
(011)10)
= v '" o.
(13.35)
But this creates a problem. So far we have expanded the quantum
fields in creation and annihilation operators, c.g. in Eq. (3.18). Since
the vacuum state is annihilated by the annihilation opemtors, we also
have (01 at(p) = 0 for any p. Therefore, if we use the expansion of
Eq. (3.18), the VEV of the field must vanish, contrary to Eq. (13.35).
In short, the field 1> cannot be treated as a qnantum field if /12 < o.
There is, of course, a very easy way out of this problem. We can
define ry(x) by the relation
1>(x) = v + ry(x) ,
(13.36)
where this T/(x) is a quantum field in the sense of Eq. (3.18), i.e.,
which has zero VEV and therefore can be expanded in terms of creation and annihilation operators. Then,
(0 I'll 0) = 0,
(13.37)
and so Eq. (13.35) is satisfied. The field 11(X) can therefore be interpreted as the fluctuations around the classical value of the field,
which was denoted hy v.
Chapter 13. Symmetries and symmetry breaking
298
However l in this case, in order to obtain the quantum behavior
of the field quanta, we should rewrite the Lagrangian of Eq. (13.28)
in terms of the field I)(x). This can be easily done by using the
definition of I) in Eq. (13.36), and we obtain
1
1
.! = 2(0"1))(8"1))  2(Jl'
+ 3AV')I)' A
AVI)3  41)4
= 2(8"'1)(8"1))  AV'I)'  AVI)3  "41)4.
(13.38)
In writing these forms, we have used the relation Jl' = AV' which
follows from Eq. (13.34), and also omitted a constant term in the
Lagrangian as it has no effect on the physics of the system. What
we see from the final form is that the quantum of the field I) has a
given by
mass
m.
( 13.39)
And, because of the presence of the 1)3 term, the Lagrangian is not
invariant under the operation of a sign change of the quantum field.
So we can have processes with an odd number of I) quanta in the
external lines.
The Lagrangian of Eq. (13.28) had the symmetry. But the values
of the parameters may be such that the ground state of the Hamiltonian does not obey this symmetry. Whenever the ground state is
not invariant under a symmetry of the Lagrangian, the phenomenon
is called spontaneous symmetry breaking. The name derives from
the fact that in this case, symmetry breaking is not driven by any
external agent, but rather the Lagrangian itself leads to a situation
where the symmetry is not obeyed in physical processes. In particular, the fluctuations around this ground state will not also manifest
the symmetry.
One can argue at this point that the Lagrangian of Eq. (13.38)
does not have the symmetry of Eq. (13.29) anyway, so this amounts
to explicit symmetry breaking in the Lagrangian written in terms
of the quantum field. But there is an important difference between
the two cases. To appreciate it, suppose we wanted to write down a
Lagrangian in terms of a scalar field I), without paying any attention
to the discrete symmetry of Eq. (13.29). We would have obtained
a mass term, an 1)3 term and an 1)4 term in that case. But the c0efficients of these terms would have been completely independent of
13.4. Spontaneous breaking of symmetries
299
one another. On the other hand, in the case of spontaneous symmetry breaking, the coupling constants with these three terms are
determined by two parameters, A and v. This means that there is
a relation between these parameters, which is the remnant of the
original symmetry.
13.4.2
U(I) symmetry
We now discuss the spontaneous breaking of a continuous symmetry,
where some new features emerge. For this, we consider a. theory
involving a complex scalar field, with the Lagrangian given by
(13.40)
This Lagrangian has a global U(1) symmetry corresponding to the
transformations
(13.41)
As in 13.4.1, we can argue that the vacuum expectation value
of the field should be independent of spacetime, so we need to
minimize the potential, which in this case is
(13.42)
Spontaneous symmetry breaking can occur if !J.2
minimum is obtained if
1(0110)1 =
~,
< 0,
for which the
(13.43)
where
(13.44)
Only the magnitude of the VEV is determined by the minimization
condition. The phase can be arbitrary. The minima corresponding to
all values of this phase have the same energy, Le., they are degenerate.
The system can be in any of these minima.
Let us assume that the vacuum is where the phase value is zero,
i.e., the VEV of is v /.;2. Following the same arguments as in
Chapter /3. Symmetries and symmetry breaking
300
13.4.1, we can say that 4>(x) cannot be the quantum field in this
case. We can write
4>(x) =
~(v+')(X)+i(X)),
(13.45)
where 1J(x) and (x) have zero VEVs, and can therefore be expanded
in creation and annihilation operators. Rewriting the original L....
grangian in terms of these quantum fields, we obtain
1
!f = 2(8"'))(a~'))
+ 2(8"()(a~()
>'v,)(1J'
 >.vV
>.
+ (')  (')'
+ (')' .
4
(13.46)
This Lagrangian describes fluctuations around a minimum which
is not U(I) symmetric. The symmetry in this case is spontaneously
broken. This part is no different from the example in 13.4.1. The
field ')(x) has obtained a mass just as it did there, and the mass is
given again by J2>.v'.
What seems to be different now is the appearance of a massless
field. The field (x) does not have any mass term in Eq. (13.46). This
is a consequence of what is known as Goldstone's theorem, whose
general form will be discussed in 13.5. But before that, we want to
discuss another example of spontaneous symmetry breaking with a
larger group.
13.4.3
NonAbelian symmetry
We now take up the case of an 8U(2) symmetric Lagrangian, as in
Eq. (13.10). Fields carrying spin cannot obtain any VEV in a Lorentz
invariant theory because spin always picks out a preferred direction.
80 we focus on the terms involving scalars in that Lagrangian, which
are:
(13.47)
where <1> is a triplet of scalars defined in Eq. (13.9).
Following now the same kind of arguments as in the previous
examples, we can conclude that the symmetry will be broken if Ji' <
O. The VEV of the multiplet <1> will satisfy the relation
(13.48)
13.5. Goldstone's theorem
301
where v is given by Eq. (13.44). Just as in the previous example,
many vacuum states are possible with this constraint. Let us assume
that our system is in the vacuum sta.te where
4>cl
= (014)10) =
(~)
(13.49)
One can now check that although the quantum field corresponding
to 4>3 is massive, the fields 4>. and 4>2 are massless. This constitutes
another example of Goldstone1s theorem, which we now discuss.
Exercise 13.11 Writf'. down the Lngru.nginn u~ing the Quantum
field. around tlte vacuum cle,fined in EQ. (1 ~. 49). Sltou, tltat tlti. Lagra.ngian stilt has 0. U(I) 811mmelry unneT tllhic:h $+ = (rJ.ll +irJ>2)/J2
change.s to e lB + an.d. 3 remains unchct1lgf'.cl.
13.5
Goldstone's theorem
13.5.1
Appearance of massless states
In this section we give a modelindependent proof o( the statement
that the spontaneous breaking of a continuous symmetry leads to the
existence of a massless scalar. When we say modelindependent, we
mean that we a...;;sume no specific mechanislll for symmetry breaking,
such as some potential of special form or the nonvanishing VEV
of some field. By spontaneous symmetry breaking, we only mean
that the vacuum docs not have the sym!ll~try of the Lagrangian.
We shall stick to internal symmetries only, and shall not deal with
the breaking of Poincare invariaIl'ce. Th('Sc invariunccs have to be
maintained by the Smatrix if the fundamental processes obey the
postulates of special relativity.
A continuous symmetry implies a con~crved current jll through
Noether's theorem. The usual interpretation of it is the existence of
a charge Q, defined in Eq. (2.49), which is timeindependent. In the
c~e of spontaneous symmetry breaking however, this illterpretation
is untenable, because tIle spatial integral over jO may diverge due to
poor convergence properties of some field operator,,; at spatial infinity_
For example, if we consider the vacUUITI expectation value of the
operator Q2(t), we obtain
(1350)
302
Chapter 13, Symmetries and symmetry breaking
Using the generators !]I of translational symmetry which are the momenta, we can write jo(x) = ei!J'., jo(O)ei::i'., etc, Since the vlll:uum
has to be translationally invariant, ei!J"'IO) = 10), so that
(13,51 )
The integrand is independent of x since it contains operators at the
spacetime origin onlYl and is nonvanishing since
Q(O) 10) f:. 0
(13,52)
when a symmetry is spontaneously broken, Therefore, the integral
diverges,
The problem can be reformulated in terms of the field operators,
Since Q is the generator of the corresponding symmetry, the vacuum
state transforms as
10) 
IQ)
= ei'Q 10)
(13,53)
under the symmetry transformation, But Q 10) f:. 0 as mentioned
above, SO IQ) is not the vacuum, Sincc the vacuum was defined
as the state annihilated by all annihilation operators, there must
be some quantum field A whose expectation value in IQ) IS nonvanishing, Using the definition of IQ), we can then write
(13,54)
Since (0 IAI 0) = 0 for a quantum field, this gives
(OIlQ(t), Al_IO)
= a '" 0,
(13,55)
Inserting a complete set of momentum eigenstates as intermediate
states and using the translational symmetry as indicated above, we
obtain
13.5. Goldstone's theorem
=
'f J
303
d x ( (0 IjO(O)1 n) (n
 (OIAln)
IAI 0) e ipn '
(nIj'I(Oll0) eipn , )
= 'f(21f)3 03(Pn) ( (OljO(O)H (nIAIO)e
iEnl
 (0 IAI n) (n 1/(0) 10) eip.n,),
(13.56)
performing the x integration in the last step. On the other hand, we
notice that l since
~;
=
=
aJ.Lj~
0,
J (0
At 0)
Jd (0 [al.j~, AI  IV j.AI 0)
J x(01 [Vj,AIIO).
= 
[:tjO(x),
d x
3
(13.57)
The last. term involves an integral over all space and can be written
as a surface integral at infinity. The surface illtegral vanishes because
the commutator is between A somewhere interior to the volume and
j on the surface, operators separated by very large spacelike interval.
Therefore, it shows that a mnst be timeindependent.
Looking back at Eq. (13.56) now, we try to argue how the right
hand side may be timeindependent and IlOIlZCro. The positive and
llegative energy parts each are time dependent, and they cannot cancel ill general. The time independence can be obtained if, for a state
which satisfies the condition (0 IAI n) (n Ull(O) I0) # 0, we also have
En = 0 for Pn = O. This is therefore a millisless state, with the
same quantum numhers as jo and A. This massless state is called
a NambuGoldstone boson associated with the symmetry breaking.
This boson need not bc an observable st.ate. We will encounter unphysical states of this kind in 13.6.
13.5.2
Examples of NambuGoldstone bosons
We have already ""countered some massless scalars which arise when
symmctries are spontaneously broken by scalar potelltials. We also
304
Chapter 13. Symmetries and symmetry breaking
mentioned that the occurrences of such particles are examples of
Goldstone's theorem. Let us now examine these examples in the
light of the general proof of the theorem given above.
In 13.4.1, the symmetry was discrete, and we did not find any
NambuGoldstone boson. In the other two eases discussed in 13.4,
the symmetries were continuous l and we found some massless fields
which were NambuGoldstone bosons. In 13.4.2, we discussed the
spontaneous breaking of a theory with a U(I) symmetry. The Lagrangian was written down in Eq. (13.40). Since the interaction
terms contain no derivatives, the conserved charge is given by that
obtained from the free Lagrangian, Eq. (3.57). Let us use the field
4> in terms of its real and imaginary parts (or, in quantum language,
its hermitian lind antihermitian parts) as shown in Eq. (3.46). In
terms of these, the charge can be written a,
(13.58)
It is now easy to calculate the commutators of this charge with 4>1
and 4>2, which give
[Q,4>d = iqr/>2,
[Q,4>21 = +iq<f>, .
(13.59)
We chose the VEV of the field 4> in the direction of its real part,
which means that 4>2 has a zero VEV, whereas 4>, has a nonzero VEV
denoted by v. Thus Eq. (13.59) shows the existence of an operator,
viz. 4>2, whose commutator with the charge has a nonzero VEV. In
this case, 4>2 plays the role of the operator A introduced in 13.5.J.
According to the general proof, the NambuGoldstone boson will be
a state In) such that (014)21 n) ~ O. This is certainly satisfied if In}
is the quantum of the field 4>2 itself. Indeed, that is what we found
in 13.4.2, where we had denoted the imaginary part of the field 4>
by (.
In the other example of 13.4.3, we started with an SU(2) symmetry, which had three generators. After symmetry breaking, a U(I)
group remains unbroken, as indicated in Ex. 13.11 (p301). This
means that the Lagrangian after the symmetry breaking does not
change if acted upon by the elements of this group. From the generators given in Eq. (13.16) and the vacuum expectation value in Eq.
13.5. Goldstone's theorem
305
(13.49), it is dear that
( 13.60)
so that
( 13.61)
The intuitive meaning of this statement is the following. The original
symmetry was 8U(2). which is locally the san,e as 80(3), the orthogonal group with three variables. Under t.I)(' SO(3) transformations,
the quantity q, . cP remains invariallt. Once we have spontaneous
symmetry breaking, this symmetry cannot h~ Iwtllifest. However if
we choose the VEV to be "long the 4>" Jim:t.ion, we still h"ve a 1'0tatiotlal syrrullctry in the tPlc!>2 plane, which is exactly the remnant
U(I) mentioned in Ex. 13.11 (p30l). W.' thell "'" t.lmt two of the
three original gCIlp.rators have bP.e1l hrokl'lI. And th~r(' .UP. also two
Goldstone bOSOH:; 4>1 and 4>2o
Exercise 13.12 C()n~ider UlC L(tnT<Ul!ji.Il.U of Eq. (I :L41) and 'Write
the. ("..()1U~('lUl'.d ch,arge.R of th.e. SL1(2) ~l.Illllltdrll. Show I.h.nt, Ulh.f.lt
the ::lljmmetrl.l i~ brokp.n b1) Eq. (1:3.49) lhe nlH\Tll.uhll.nn of brok(~n
ehnrge.~ wit.h GOtdSt01ll' h01:'on~ l).Cll)(' lwnrll.lI.i.!oihilt!J VEV~.
Exercise 13.13 Con.sider un SU(2)illl1nri/lllt Lfl!jrnn!11UTt, similar
to that in Eq. (1 :L4 7), but this lilllt' ll~i tl!1 (l ltoubkt oJ (',omple.x.
scalar jicl.d:;. If S1J1ll.11wt.ry i~ sllnnt(lnf'_(jlJ.."l~1 hrohn hll (\ VEV of this
Tnulti.plf'.t, ShOll'; tha.t. no r.on.ti.nuou~ 1'>1Jlll111l'tl'\1 i.s left. mlrr, nnd three
GOltlRt.OTlt'. b080TlS fTl1.t'.rgl.
13.5.3
Interaction of Goldstone bosons
Let us go hack to the simplest. example of till' II(TlIrWIlCC of Goldstone
bOSOllS which wa..'i presp'llt,ed in 13.4.2. Afler Sp()lIt.atH~OUs tiymmetry
breaklug 1 the Lagrangian was written ill WrlllS of t1w quantum fields
'1 a.nd ( a.nd pn's('llt.(~d ill Eq. (13.46). \V(\ lIOW wallt t.o show how
Goldstone bosom; illt.eract. For this, let liS first t.ake t.he exa.lIIple of
a scattering process
(13.62)
At the tree level. t.h(~ diagm.ll1s for t.his process arr. given in
Fig. 13.2. The Fe.ynmau amplitude call 1)(' (~lk1l1at.ed from the Lagrangian of Eq. (13.46) to he
.
jj
1.. 4<
2"
'"
,(2i.\1ij2
(I'
+ k)
i(2;.\I))'
(I'  k')
i(Gi.\,')(2;.\v)
+ Ik' 
.,
k) 
2'
111."
13.63
Chapter 13. Symmetrie. and 'yIIJmetry breaking
306
..
.
'
((k)'"
/((k')
((ki..
./((k')
(
... 
1)(p)./ .....1)(p' )
11(P)./
.!I(p')
..
((ki
((ki.
,,(p')
.(W)
!(,
:1)
,
71(P)/ .....((k')
11(P)/ .... !/(p' )
..
Figure 13.2: Treelevel di.lgrams for 1/( elastic scattering.
A c.omment is in order fnr the Ilumf!rical factors 011 the right hand
side. For example. the fir:::;! term comes frolll tlw 1]2(2 interaction
term in Eg. (13.46). The coefficient of this t.erm in the Lagrangian
is >../2. Howp.ver, eat]} of the operators 11 CH.1l either allIlihilatc the
initial TJ particle. or create the final one, This introduces a permutation factor of 2, as mentioned in ~j6.6. Similarly. we get a factor
of 2 for the two factors of (. This leans t.o t.1", nllmerical coefficient
2.\ of t.he fir~t term. The coefficients on t.h~ ot.her t.erms appear for
similar rea.')olls.
Now, using k'2 = k''2 = 0 and p2 = pf'2 = m~ = 2>...,,"2, we call write
it Il..<;
.4'( =  2>' _
(2).,,)'1
(2).,,)2
m;  2p . k'
711; + 2p . k
2>' 1 +
7tI
2
1}
m.~
+ 2)) . k
+ HI 2
11
(6)',,)(2>'v)
',;~c;+:
711; + 2k . k '
3ut;,]
m,~

2), . k'
1/!.
2"
'I
..
2fl: . ,.;.'
(13.64)
Remembering the kinematical relations p' k = p'. k' iLlld p' k' = p'. k,
we see that this amplitnde vanbhes if eiUH'r of t.1H~ Colcbtone boson:;
ha...' zero 41lI0IlHmtlllIl. i,p.. if either k l ' = (] or ]..;'1 = u.
1
o Exercise 13.14
SUlnf:
morld.
Tltl'
COHHill(T t.Iu' (.( H{'.nt.ll'l'l.ll.~l flt t.he bTl'. ll'lld in the
'II101ll./'nt.n. (tn' k 1 . k',! 111111 t.ilf' jlllut lTl.Ol1H'lltU
in.itiol
13.5. Goldstone's theorem
307
k 1, k;. Show that the um:plitude is giuen b11
(1365)
which uan'ishes
Tnomentu1n.
if
CLn1J
one oj th.e Goldstone. hogons has :tero 4
This is a genera] feature of any interaction of Goldstone bosons.
In the limit that the 4momentum of any external Goldstone boson
is zero , the amplitude for any process involving such bosans vanishes.
There is a general way to see this directly. Consider, e.g., the model
of 13.4.2. We commented that since the VEV of the field 1> was nonzero, it cannot be treated as a qnantum field which can be expanded
in creation and annihilation operators. Consequently, we wrote 1>(x)
;n the form given in Eq. (13.45), where " ann ( have zero VEV so
that they can be treated as quantum fields. But this is not the only
way that 1> can be written in terms of quantum fields. Alternatively,
we could have written
</>(x) = v
+ 1)(x) e,(xl/"
J2
(13.66)
If we keep only up to linear terms in the quantum fields, this representation coincides with that of Eq. (13.45). However, this form
is illuminating for a different reason. If ((x) were a constant in
spacetime, it would have been a 'constant phase for the field 1>. And
the phase of </> does not appear in the Lagrangian  that is in fact
the U(I) symmetry that the Lagrangian has. Thus, only spacetime
derivatives of ( can appear in the Lagrangian. Indeed, substituting
Eq. (13.66) in the Lagrangian of Eq. (13.40). we obtain
!L' =
~(8~11)(8~1)) + ~ (1 + 1)~)) 2 (8P()(8,,()
1 2
2
2J1(V+1))
4
41 A(v+1)).
( 13.67)
This clearly shows that the field ( enters into the Lagrangian only
through its derivatives. Thus, the Feynman rule for all the vertices
involving this field will have at least one power of 4momentum of
Goldstone bosons, and should vanish if the 4momentum is taken to
zero.
308
Chapter 13. Symmetrj(s and symmetry breaking
Exercise 13.15 Usc lhe. Lagrangian oj Eq. (13.61) to calculate lhe
Feynmo.n amplitude. oj the. TJ( elastic 8(:uU{Ting
lho.t the Te8utt coincides with that in Eq. (13.6.4).
13.6
pr(l('.t'.88
o.nd show
Higgs mechanism
So far, we have lalked about global symmetries only. We have seen
in 13.4 that if such symmetries are spontancously broken, massless
bosonic states appear in the theory. If the broken symmetry is \ocal 1
however, this is not the case. We show this with an example.
We start with a Lagrangian with a local U(I) symmetry:
where J.1.2 < O. The gauge covariant derivative D Jj is defined by
(13.69)
The potentia] of lhis model is ident,ical with tl,at of the model of
13.4.2 with a global U(I) symmetry. Thus, t.he solution for the
ground state remains the Hame, and we call define the quantum fields
as in Eq. (13.45). When we write down the Lagrangian in terms of
the quantum fields, the part coming from t.he potential is exactly the
same as in Eq. (13.46). The interesting part is the modified 'kinetic'
term (D"<It(D"<I. Using Eq. (13.45), we find
(D"<It(D"q,) =
~(iJI'rJ)(a"'I) + ~(8"()(a,,()
1
+evA"a,,( + '2,2,,2 A" A" + ..
(13.70)
where the dots stand for terms which cOIlt.ain at lca.:.;;t three fields
and are therefore interaction terms. Comparing with the Proca Lagrangian of Eq. (8.21), we see that the U(I) gauge b",on has a mass
term after the symmetry is broken, and the ma.."iS is given by
(13.71)
But before we conclude that the gauge bo....on is massive, there is
a problem to settle. There is a term A"D"( in Eg. (13.70). Notice
that it is qnadratic in the fields, so it should be part of the free
13.6. Higgs mechanism
309
Lagrangian of the system. However, it contains two different fields,
AP and (, involving a derivative. If written in terms of creation and
annihilation operators, this would imply that we can have Feynman
diagrams in which an AP changes into a (, without any other particle
interacting with them. For example, external AP can become (, and
Vice versa.
However, this situation can be easily avoided. We know that
we have to add a gauge fixing term in order to quantize the theory.
Suppose we choose
(13.72)
The cross term present here would pair up with the problematic
term of Eq. (13.70) to make up a total divergence, which can be
disregarded in the Lagrangian. Including this gauge fixing term, the
terms quadratic in the gauge boson field can be summarized as below:
2:rA)=_~F pV_~(aAP)2+~M2APA
A
o
2~P
4Pv
P'
(13.73)
where MA is given by Eq. (13.71). On the other hand, the quadratic
terms involving the (.field are given by
(13.74)
From these, we can calculate the propagators of both A and (, and
find
(13.75)
,
iLl.(k) = k 2
 ~
M2'
(13.76)
Exercise 13.16 VeTilY that tile propagators 01 the U(I) gauge jield
and the jir.ld ( that 101l0Ul lrom Eq. (13.1:1) and (13.14) are giuen
by Eq. (13.15) and (13.16).
In quantum field theory, the pole of the propagator gives the mass
of the corresponding particle. However, here we seem to have some
problem with the poles. Of course the gauge boson propagator has
a pole at k 2 = M~, which corresponds to a pole at its mass. But it
310
Chapter 13.
Symmetries and symmetry breaking
also has another pole at k 2 = ~M~. Remember that for an unbroken
U(I) gauge theory, we argued in 8.4 that ~ is an arbitrary unphysical
parameter which was introduced in the gange fixing term and should
not appear in any Smatrix element. The same should be true here.
Thus the gauge boson propagator has a pole at an unphysical point
as well! And for the (field, this is in fact the only pole.
These are not catastrophes. These facts show that the field ( is
an unphysical field in this theory. The easy way to see that is to use,
instead of the representation of the field given in Eq. (13.45), the
alternative one given in Eq. (13.66). In that case, (x)/v is the phase
of the field (x). And any phase, even if it is a spacetime dependent
one, is irrelevant because of the local U(I) symmetry. More explicitly,
the Lagrangian of Eq. (13.68) is invariant under the transformations
(x) ~ '(x) = e ie8 (I)(x),
A~(x) ~ A~(x) = A~(x)
+ a~O(x).
(13.77)
Therefore, given the representation of the field (x) as in Eq. (13.66),
we can always choose a new gauge through Eq. (13.77) without
changing any physical implication. In particular, if we choose
O(x) = ((x)/ev, the gauge transformed fields are given by
'(x) = v+r/(x)
A~(x)
J2
A~(x) + ~a~((x).
ev
(13.78)
If we substitute these primed fields instead of the unprimed fields,
the ((x) field disappears altogether from the Lagrangian. This Lagrangian then contains physical fields only, and the gauge in which
this takes place is called the unitary gauge. In the language of the
general ~gauges introduced in Eq. (13.72), this can be identified
as making the choice ~ ~ 00. In this case, the propagator for the
(field has an infinite contribution in the denominator, and therefore any Feynman amplitude involving a (propagator vanishes. For
finite values of the gauge parameter ~, the (field appears in the Lagrangian and therefore in the Feynman rules l but it corresponds to
an unphysical degree of freedom nevertheless.
If the symmetry were global, i,e., the gauge field A~ were not
present, the field ((x) would have been the Goldstone boson after
13.6. Higgs mechanism
311
symmetry breaking. This is what we saw in 13.4.2. In the case
of local symmetry, we find that not only does the mass term of the
field disappear from the Lagrangian, the entire field disappears in the
unitary gauge. This is a new feature for local symmetries. Another
feature is that the gauge field, which would have been massless if
the gauge symmetry were unbroken, develops a mass after symmetry
breaking.
The two features are related. In Ch. 8, we remarked that a mll8Sless gauge boson has two independent degrees of freedom. In the
unbroken Lagrangian of Eq. (13.68), there are thus four independent
degrees of freedom in total  two for the gauge field and two for
the complex scalar field. In the symmetry broken Lagrangian, the
gauge boson is massive. Massive spinl particles can ha.ve a longitudinal mode of polarization in addition to the two transverse modes.
The polarization vector will be explicitly constructed in 15.4. This
increase from two to three in the gauge degree of freedom can occur only at the expense of one degree of freedom from the scalar
sector. Indeed,7)(x), being a hermitian field, accounts for only one
scalar degree of freedom left over. The other one disappears from the
physical degrees of freedom. Metaphorically, one says that the gauge
boson has Ueaten Upll a scalar degree of freedom to get its mass. This
phenomenon was noticed by several authors, but is usually referred
to as the Higgs mechanism after one of them. In Ch. 15, we will
see how this phenomenon has played a central role in understanding
weak and electromagnetic interactions.
Chapter 14
YangMills theory of
nonAbelian gauge fields
In Gh. 9, we constructed QED, which is a gauge theory based on
a U(I) symmetry. In this chapter, we will see how to write down
the Lagrangian. of gauge theories based on nonAbelian symmetries.
Such theories are often called YangMills theories after the people
who discovered their structure. We will see that they have some
novel fcatures which were absent in the Abelian gauge theory of
QED.
14.1
Gauge fields of nonAbelian symmetry
Let us recall the crux of the argument of 59.1 where we introduced
the U(I) gauge theory. Suppose we have a free Lagrangian with some
global symmetries. However, the free Lagrangian is not invariant if
these symmetries are considered to be local. [f we intTod uce some
spinI fields in the theory, we can make the theory locally invariant.
These spinI fields are the gauge fields.
Let us try to follow the same path for the case of a larger symmetry. We will consider an SU(n) symmetry. So let us start with
a Lagrangian which has a global SU(n) symmetry. We encountered
such symmetries in 13.2.2. Making a trivial generalization of Eq.
(13.5), we can write thc free Lagrangian of n Dirac fields in the form
2'01<
= IIi (if; 
01) IIi,
where the masses of all fields arc equal.
312
(14.1)
Here IIi
IS
an nplet of
14.1. Gauge fields of nonAbelian symmetry
313
Dirac fields which transforms in the fundamental representation of
SV(n). This Lagrangian is invariant under global SV(n) transforma.tions which can be written as
w~ w' = Uw,
(14.2)
where U is any n x n unitary matrix of unit determinant, and is
independent of spacetime.
However, if U is a function of spacetime, the invariance is lost,
because the Lagrangian in terms of 11" reads
2''1<' = iii' (i41 m) '1"
= 2''1< + >VU'i')'"(8"U)W.
(14.3)
As in the case of the V(I) gauge theory, we can try to ensure
local invariance by introducing new fields into the theory. These
fields must be vector fields, as in the case of QED. Since the number
of independent parameters in a general SV(n) group element is n'I,
we need to introduce
I vector fields to cancel the derivatives of
these parameters. Let us denote these vector fields by A~ and write
the new Lagrangian in the form
n' 
2'=IIi(i.Qm)W,
(14.4)
where, in analogy with the case of electromagnetism, we define
(14.5)
where 9 is the gauge coupling constant, analogous to the factor e
appearing in Eq. (9.15), and Ta are the generators of SV(n) which
replace the factor Q appearing in the same equation. The gauge
index can be written up or down , whichever is convenient.
If we now make a local transformation defined by
W ~ >V' = U(x)>V ,
(14.6)
where U(x) are functions on spacetime. we can calculate what A~
should be so that the Lagrangian of Eq. (14.4) remains invariant
under this transformation. The transformed Lagrangian is
2" = W' (~ m) W'  g>V'1'"TaW' A~
= W (~ m) W + WU'i')'"(8"U)>V
 gWU'1'"TaUWA~.
(14.7)
314
Chapter 14.
YangMills theory of nonAhdiall gauge fields
This will be the same Lagrangian
provided
ll.'l
the one given in Eq. (14.4)
1
:(ap U)U I + UTn A"UJI
.
(14.8)
.9
We have thus gauged the global SU(n) synHnetry, and the fields
are the gange fields that have emerged.
A~
Exercise 14.1 ShOUl that, under a local SU(ll) trnnsformntion l
D~31 transforms the. Snme way ns IIJ, i.f' ..
(D" 11')' = Vex) (D" 11') ,
(14.9)
o Exercise 14.2 ShOUl that, if we slarted from tht'. La.~ra.ngia.n oj
scala.r ftelds in the funda.mental re'PTesentn.tion obeying a.n SU(n)
global ~r\lmme.try and tried to gauge it, the. gangf'. [lelds 'Would haue
tra.nsformed a.s in Eq. (14.8) in that case us 'Well.
Although we started with the definition of the covariant derivative D~ which mimics the U(I) gauge theory, we notice that some
new features have already emerged in the HonAbelian case. To see
them, let us write down the transformation properties of the fermion
and the gauge ficlds for infinitesimal transformations. In this case,
we can write
U(x)
= exp ( 
igJ3a(x)T,,) '" I  igJ3"(x)T,,.
(14.10) .
Then for infinitesimal SU(n) transformations,
iii' = iii  i9J3" (x) Taiii ,
A~ = A~
+ a~(3" + gfabc(3b(T)A~ .
(14.11)
In the transformation equation of the gauge fields, the a~J3" term
resembles the U(1) gauge transformation. But the term containing
the structure constants is something new. This could not have appeared in a U(l) theory because in an Abelian group, the structure
constants vanish.
This is a new feature, but to appreciate its full importance, let
us rewrite Eq. (14.11) with all group indicf'S for the case of global
transformations, i.e., 8~J3" = O. The transformations of the fermion
fields, whieh are in the fundamental reprcscntation, now read
(14.12)
14.2.
Pure gauge Lagrangian
315
For the gauge fields, remembering the definition of the arljoint reprcscntlHion given in Eq. (13.25), we can writi'
A '"
j.J
= A"Jt
. (ih
1.g
[1:(b "'I)]
A'II
(14.13)
'"
where T~at:I) denotes the generators in the adjoillt. represent.ation. We
conclude that the gauge firlns trrlllsform ill ,he 1l<ljoint reprcsentatiun
of the gauge group.
In the case of QED l we f01111d that thl' photoll field remains inva.ria.nt nnder a glohal transformation. Thi:; implies t.h~tt the photon
does not carry any electric' charge. The 1101lAhdiall gauge fields, on
the contrary, transform nontrivially und<'1" tilt' gaugf' group, which
rneam:i that they carry charges co:rcspulldillg t.o the gauge currents.
This is the big difference, and it h,l.:j other itllplicatiot\s. WP. can see
them if we try to complete the Lagrangia!l !lOW hy adding the pure
f';allge Lagrangian, i.e., the part of the Lagrangian which contains
the gauge fields only.
14.2
Pure gauge Lagrangian
\Ve have learned that if we start frum a. glohaJly illvariant La.gran~iall
and try to make the invariance local, W(~ {wed 1.0 replace all partial
oeriva.tives by covariant derivat.ives. \Ve 11a\,(' <lone it for t.he fcrmions.
Now we nc(xi to do it for the gallgf~ b0S0llS ;t." \\dl.
This is all issue which we ov~~rJook('d \\'hil(~ cOllstructing U(l)
gauge theory ill Ch. 9. hecause the COtlS('C!lI('TlC('S arc trivial. To
confirm that and also to pa.ve the way fol' IlOIlAlwliaJ1 tlwories, it i1"
best to go back to the U(1) tiwury.
COllsidcr a local U(l) trausfor11latioll 011 a ferlllion ficleJ, a.s given
in Eq. (9.4). For infinitesimal U(1) rotatiolls. Wt~ call writc it as
<Ii = <I'  ie(JO(:r}t/' .
( 14.14)
On the other hand, the ucfillitiotl of the covariant deriva.tive for U(l)
is given in Eq. (9.15). which is
D,,<I! =
o"lj; + ieQA,,(:r)d'.
(14.15)
Looking at these two equations~ we observe t.he following. Eq. (14.14)
h& the original field il.=i t.he first. term. which is illdt'p~llri~llt of the'
316
Chapter 14.
YangMWs theory of nonAbe/ian
~auge
fields
transformation parameter 8(x). Similarly, the covariant derivative
also contains the partial derivative, which is independent of the gauge
fields. As for the other term, we see that if we take the infinitesimal transformation equation and replace 8(x) by A"(x), we obtain
the corresponding term in the covariant derivative. This is not a
coincidence. This is the property which ensures that the effects of
the local phase rotation can be compensated by the gauge fields. So
the difference between the covariant and the ordinary derivatives for
a general field is obtained by taking the infinitesimal change of the
multiplet and replacing by  A w
If we now try the same thing for the photon fields, we can start
from the transformation property of the photon field given in Eq.
(9.11). viz.,
(14.16)
Using the same prescription then, we will obtain
(14.17)
In other words, for the U(1) gauge theory, we could have defined the
electromagnetic field tensor as
(14.18)
and written the pure gauge Lagrangian as
(14.19)
This might have been more elegant in some sense, viz" that the
Lagrangian of the photon field resembled more closely the gaugecovariant kinetic term for scalar field. Ell t as we see from Eq. (14.17),
it would not have made any difference in the content.
Let us now return to nonAbelian gauge theorie!':i. Hen\ the infinitesimal transformation law for a field W in the fundamental representation is giver, ill Eq. (14.11), whereas the covariant derivative
appears in Eq. (14.5). Again, we see that the gauge covariant derivative is obtained by replacing the gauge tran~formation parameters (3a
of Eq. (14.11) by A~ ill the nontrivial terlll of the cov"ri"nt derivative. Going back now to the infiuitesimal transforlllatiun property of
14.2. Pure gauge Lagrangian
317
the gauge fields in Eq. (14.11), we can write the covariant derivative
of the gauge fields as
(14.20)
Let us now try to see why this definition is llseful. If we consider
the gauge transformation property of the covariant derivative acting
on any field, the transformed quantity does not involve any derivative
of the transformation parameters. This was shown for the covariant
derivatives of fermion fields in Eq. (9.17) for the U(I) theory and in
Eq. (14.9) for an SU(n) theory. We can try the same thing on the
covariant derivative of the gauge field. It is convenient to transform
Eq. (14.20) to Ii matrix equation by multiplying both sides by the
generators Ta in the fundamental representation and summing over
the gauge index a, After making a gauge transformation now lone
obtains
(TaF:v )' = (D~TaA~)' = 8~(TaA~)'  8v(TaA~)'  gIaocTaAt' A~'
= 8~(TaA~)'  8v(TaA~)' + ig [TbAt', TcA~'L, (14.21)
using in the last step the definition of the structure constants of the
group to write IaocTa = i[Tb, T,I_. If we now use the transformation
properties of the gauge fields given in E'l. (14.8), we will obtain
(14.22)
As anticipated, like the covariant derivatives acting on fields in the
fundamental representation, here also we do not obtain any term
involving 8~U.
o Exercise 14.3 Verify Eq. (14.22).
This shows that if
of F~v:
We
form the following term involving two factors
(14.23)
it will be gange invariant and Lorentz invariant. Using tr (TaTb) =
~5ab from E'l. (13.24), we finally arrive at the pure YangMills LagrangIan:
""
1 FaIW F''"
...z.YM
_  4"
I'l
(14.24)
Chapter 14.
318
Yml",Mills theory of lJollAbelialJ galJ",e fields
Again, this looks very much like the U(I) i(auge Lagrangian, but it
has to be remembered that
v cont.ains a term which is CJuadratic
in the gauge fields. It is easy to see that a mass term of the form
~M2A~A~ would not respect the gauge symmetry, just a, in the U(I)
case.
F::
Exercise 14.4 Show th.at if we lake the (,.ollurinnl def'i\)nti'Ue~ in
a.ny rCVrf.scnlall.on tllhe.TC the gene.ralor lTl.utl'iCeH (H"C denoted blJ Tal
the .field strength. ten~or can also bl? defined by t.h.l'. relation
(14.25 )
where D J, L~ now considered to be an opel'nf.oT th.at 0pf',Tales on any
th.ing to its right.
o Exercise 14.5 Show that the .f\e.ld shf'.ngth tensors ~a.ti8jy the
ide,ntii"
(1426)
These are tile anologs of the homogeneous MU:tUlt'.ll c.quations\ called
tnc Bia1lchi ide1ltlty. IHint: It is easy if yau use Eq. (14.25).1
14.3
Interactions of nonAbelian gauge fields
We have encouutered two different kinds of interactions of nonAbelian gange. fields so far. First, as in the c",,", of QED, the gauge
fields interact with other particles in the theory. And second, because
of the presence of the nOlllinear terms in the definition of F~'.d the
pure gauge Lagra.ngian involves interaction t.erms among the gauge'
fields. Let us discuss these one by one.
14.3.1
Gauge interactions of other particles
The interaction term between fermions ano gauge bosons is
(14.27)
The interaction vertex and the corresponding Feynman rule is given
by
Ae.a
(14.28)
14.3. Interactions of nonAbelian gauge fields
319
where we have to use the generators of the appropriate representation. For a particle in the singlet representation , Tr, = 0 for all a.
so there is no gauge interaction. And for all particles in nontrivial
representations, there will be some gauge illteractioll.
Exercise 14.6 Consider n gauge lheol'1l bn~t>.rl on UtI' grOll'll SU(2).
Instea.d oj th.e ~tQ.nd<Lrd hermiticm gencrut.nrs, tuk~ the gc,n.e:rnLoTs La
be T
(T, ,T,)/.j2 and To = T,. Tn, gHUgC jida. couple. to two
fermlons whieh tra.nsfonn like one noubtl'l. De,rille the Fellnmnn
rules for all 9<L1J.ge intera.clions oj fermions.
For scalars in a gauge theory. the interactions can be similarly
calculated. In this ease, we should start with the term containing
covariant derivatives acting on the scalar fields, i.e.!:
This contains the kinetic terms for the scalar fields in the multiplet
~, but in addition contains the interaction terms
~nl = ig(a,,4>j)(T"A~4  ig(TnA~,4t(i:I"4
+/(T"A~4t(nAf.4 .
(14.30)
The vertices involve two scalar particles, with p.ither one or two gauge
hosons.
Exercise 14.7 Find th.e. Fellnmun rules foT' lite, interactions of
gauge bosons with sc.alur p.etds,
14.3.2
Selfinteractions of gauge bosons
Using the definition of F;v from Eq. (14.20), t.he pure gange Lagrangian of Eq. (14.24) can be written as
UL
~YM
= _ 4 (aJJ A"v  a lJ A")
 DV A")
JJ (D"A"
It
0
+~9f"bcArA~ (a"A~  DvA~)
1 2fabc f ode A JJb ACA"A
49
d
lJ
V
t"
(14.31)
The terms involving only the derivatives of the gauge fields are t.he
kinetic terms! but in addition this Lagrangian contains interactions
among the gauge bosons. There arc cubic and quartic interactions.
The Feynman rules for t.hese vertices are given in Fig. 14.1.
320
Chapter 14. YangMills theory of nonAbelian gauge fields
A~(p)
: 9Iobe[(q  T)"g",
+ (T 
P)"9'" + (p  q)>.g,,"]
: _ig 2 [/abelCde(g,,>.g"P  g"pg",)
+ laeelbdAg"pg>."  g,,"g>.p)
+ ladelbce(g,,"gp>.  9,,>.9p")]
Figure 14.1: Feynman rules for cubic and quartic interactions among gauge
bosons. We have taken the convention that all momenta are coming into the
vertices.
As an instructive exercise, let us trace the steps to determining
the Feynman rule for the cubic inter""tion vertex. Obviously, this
will come from the term in Eq. (14.31) containing one f""tor of structure constant. Using the antisymmetry of the structure constants,
this term can be rewritten as
.>fcubic =
gla'b'c,Af:;A~, (8oA~) ,
(14.32)
where we have also changed the notations for the indices appearing
in the expression. Let us now re[~r to the vertex in Fig. 14.1. Suppose all the momenta are pointed towards the vertex. There will be
several contributions to this vertex, depending on which particles are
annihilated by which o[ the
A~.
There will be thus six terms, like:
a' = a, b' = b, c' = c => glabc g;:g~ (ipolI"fJ) = iglabcP"9", ,
a' = a, b' = c, c' = b => g!acb g~g~ (ip~lI"fJ) = ig!acbP>.9,," ,
a' = b, b' = a, c' = C => glOOe g~g~ (iq,,9"fJ) = ;9IOOcll,,9'"
(14.33)
Evaluating the other terms similarly and noting the antisymmetry o[
the structure constants, we finally obtain the Feynman rule [or the
cubic vertex given in Fig. 14.1. The rule for the quartic vertex can
be derived similarly.
Exercise 14.8 VerihJ the Feynmnn rule. Jor the. quartic uerte:r
giuen in Fig_ 14.1.
14.4. Equations of motion and conserved currents
14.4
321
Equations of motion and conserved currents
The gauge invariant kinetic terms of the basic fields of a YangMills
theory are already known to us. The Lagrangian is
=  ~F:J:" + ~i)'~D~w + (D~~)t (D~~)  V(W,~),
2.'
where V includes the mass terms for W and
~,
(14.34)
and may also include
interaction terms, but no derivatives. As we mentioned earlier, a
mass term like A~A~ would not be gauge invariant.
The Noether currents corresponding to the gauge symmetry can
be derived using the definition in Eq. (2.64). We obtain
~_
Jo
82.' OAt
82.' ow
[82.' o~
]
8(8~At) 0(30 + 8(8~w) 0(30 + 8(8~~) 0(30 + h.c. ,(14.35)
where ow etc are changes of the fields under infinitesimal gauge transformations characterized by transformation parameters (30' This
gives
J:: = fabcFt" A~ + ~y~Taw + [i(~~)Ta~ + h.c.].
(14.36)
Notice that the gauge current now includes contributions from the
gauge fields. This is because of the fact that the gauge fields transform nontrivially under gauge transformations. As a result, gauge
fields carry nonAbelian charge, as was mentioned before.
Using the Lagrangian, we can derive the classical equations of
motion for the fields. For the YangMills fields, these are
(14.37)
This looks similar to the classical equation of motion for the electromagnetic field, Eq. (8.8). The difference, however, is that the
currents contain YangMills fields as well. An alternative way of
writing Eq. (14.37) is
D
where
j~
~Fa
j.!V)Vl
(14.38)
contains all contributions to the current except those from
the gauge fields.
o Exercise 14.9 ShOUl the ""uivalence oj Eq. (14.37) and (1438).
322
14.5
Chapter 14. YangMills theory of nonAbelian gauge fields
Quantization
fields
of
nonAbelian
gauge
In Eq. (14.31), we wrote down the Lagrangian of YangMills fields.
It contained interaction terms which are new features of YangMills
theories. But the quadratic part of the Lagrangian is the same as
the Lagrangian for a free electromagnetic field. Thus the problems
in quantizing the Lagrangian of gauge fields discussed in 8.2 will
appear here as well.
The way to get rid of the problem is same as that described in
8.3, viz., we need to introduce a gauge fixing term. We can choose
a covariant gauge fixing term similar to the one for the photon field:
(14.39)
Following the same steps as in 8.4, we can now write down the
momentum space propagator as
(14.40)
This means that in diagrams containing virtual gauge boson lines,
we should use the following Feynman rule:
(14.41)
where D~t(k) is given by Eq. (14.40).
Unlike the electromagnetic case, this is not the end of the story.
For reasons which remain outside the scope of the book, gauge invariance cannot be maintained unless one introduces extra unphysical fields in the theory. These are called FadeevPopov ghost fields,
which will be denoted by calx) and antighost fields denoted by ba(x).
As the index a implies, there is one such pair corresponding to each
gauge boson. These are scalar fields transforming in the adjoint
representation, but nevertheless obey anticommutation relations like
fermions. For the gauge choice of Eq. (14.39), the Lagrangian involving these fields is given by
(14.42)
14.6. Quantum Chromodynamics
323
The fields ba and Ca are real in this Lagrangian. so this term is
hermitian by itself. They can replace gaugc fields in any diagram
where there is a dosed gauge field loop. As internal lines, their
propagators can he read off from Eg. (14.42). This is
(14.43)
The only interaction of these ghost fields C'U1 itlso be read off from
Eg. (14.42). This is an interaction vertcx involving a gauge boson.
The Feynman rule for this vertcx is
(14.44)
This Feyuman rule show:; why we could a.voici discussing the ghost
fields for the Abelian case. The structurc constants an' all zero for
an Abelian group, so the ghosts have no int.eraction with a.ny other
field at all. For nonAbelian gauge groups. this is not the case, and
one must use thC'sc fields in order to obtain correct results. However,
it is possihle to consistently project out states cOlltaining ghost fields
from the Hilbert spaces of initial and filial st.ates. Thcn ghostfree
physical states evolve only into other ghost.free states. As a result,
the ghost fields appear only in joops ..., virtnal I""tides. For tree
level calculations, they are irrelevant..
Con.~idfT the ItoolJ !{f'lJC.1Lf'r!l'!I fliuqn'1m~ Jor non
Abelian ~o.uge fidd~. Th.f'_Te will be loop~ iH\lnluing other particles
that coupte with guuge hOSOll~ 0.8 fOT QED (Jermio1t~, sc:ula.nJ etc.).
But nnw the.re. 'will bt' diugraml'5 inuoh1inu !IClll!II' h01WltR ill the tOO'lll
as in Fig. 14.2n 1b. Calculate. theRe 1.11:0 clill!jl"mll!'\ in the. Ft'.'ynmun't. Hooft gClUgt', awl Rhou: tho.t the l1UClill.1ll 1IOlflnz.ut.ion i~ not gnuge
int:uTiant, i.~., ib t.t',lllwr structure i:'l not. k'2.'h'lI ~ kl,k,,, (18 urgued in
Ch.. 12. jnr Abelian theories. Nom udd th/" ('lllltriblLtlOll of Fig. 14.2c
inuoluillq u ghost loop a.nd ucrifu lIluL Ofl(' l'l"'~luill}\ tIl{'. nece88urll U>;nSal' struct.urt'.. IHint: U 8t'. dime.nJoll0nul rt'!}ulnriz.utiolt.1
o Exercise 14.10
14.6
Quantum Chromodynamics
NonAbelian gallge thcorie:; are importallt !H'caUS(' according to our
present level of under:;tanding, all (ulldalIlf'llt.al interactiuns except
324
Chapter 14. YangMills theory of nonAbelian gauge fields
(b)
(a)
,
, ....
.'
(c)
Figure 14.2:
Selfenergy diagrams for gauge fields in
pure nonAbelian
gauge theory.
gravity are described by such theories. The gauge bosons of various
nonAbelian symmetries mediate different interoctions. Even electromagnetic interactions form part of a bigger nonAbelian gauge
theory. The detailed nature of these symmetries for weak and electromagnetic interactions will be discussed in Ch. 15. Here, we briefly
look at the strong interactions, which is described by a SU(3) gauge'
theory.
All known fundamental fermion fields arc divided into two broad
classes. One class, called leptons, such as the electron, muon and
neutrinos, do not have any strong interactions. In gauge theory parlance. we can say that these are singlets under the strong interaction
gauge group. The other class, called quarks, can be described by
SU(3) multiplets, and therefore have strong interactions. They are
the constituents of strongly interacting particles like the proton and
the neutron.
Quarks come in three 'colors'. Denoting these colors by an index
a, we can write the free Lagrangian involving the quarks as
!L' = LilAa (i9
m q. ) iJAa,
(14.45)
where the sum over A runs over different quarks. Obviously this free
Lagrangian is invariant under a global SU(3) symmetry under which
three colors transform like a triplet. If we try to gauge this color
14.6. Quantum Chromodynamics
325
symmetry, we obtain a theory called Quantum Chro1nodynamics, or
QeD for short. This theory will contain eight gauge fields G~, which
are called gluons.
Chapter 15
Standard electroweak theory
To the best of our knowledge, there are four different kinds of interactions  strong, electromagnetic, weak anrl gravitational. Description of gravitational interactions requires general coordinate invarianee in curved spaces, as in Einstein '8 general theory of relativity.
We have been discussing field theories based on Lorentz invariance,
which are inadequate for describing gravitational interactions. There
have been attempts to construct quantum theories of gravity using
general coordinate invarianee !IS the gauge symmetry, bnt so far all
such attempts have led to nonrenormalizable theories. Among the
other three, electromagnetic interactions arc described by the gauge
theory of QED, which was discnssed ill detail in Ch. 9. Strong interactions are described by the nonAbelian gauge theory of QCD, as we'
mentioned in Ch. 14. In this chapter, we show how weak interactions
are also described by a gauge theory. The gauge gronp required for
this theory also includes the gauge group of QED, so this theory is
called eleetroweak theory.
15.1
Gauge group
15.1.1
Choice of gauge group
The earliest known example of a weak interaction process is that of
beta decay: n _ p + (~ + ve. Among the four particles appearing in
the initial and finnl states of this process, the. neutron and the proton
are hadrons, i.e., they have strong interactio~lS
H..,)
well. In analogy
with QED, let ns try 1.0 think of this proce'" as being mediated by
326
327
15.1. Gauge group
a gauge boson. Suppose we try to keep the neutron and proton in
the same multiplet, as we did for the isospin symmetry in Eq. (13.8).
Then there will be a vertex of neutron and proton with a gauge boson.
The other end of the gauge boson line has to end on a vertex involving
the electron and the neutrino, which therefore should also belong to
another multiplet of the same group. The resulting diagram would
look like Fig. 15.1.
v,
Figure 15.1: (3dec.y medi.ted by. gauge boson.
The simplest possibility is that the members of a pair are parts
of a doublet. Among the unitary groups, only SU(2) has a doublet
representation, and so it indicates that the gauge group to describe
weak interactions should either be SU(2) itself, or should contain
SU(2) as a subgroup.
Let us consider the first alternative first, since it is more economical than the other. An SU(2) group has three generators. Corresponding to them, there would be three gauge bosons. The interesting point to note is that in the betadecay process, the electric charge
in the hadronic sector changes by one unit. This indicates that there
must be gauge bosons which carry one unit of electric charge. These
are termed W+ and its antiparticle W. If neutron and proton are
taken as the two components of a doublet in an SU(2) gauge theory
and one writes down the gauge interactions of this doublet, one finds
that the third gauge boson must be uncharged, and has to couple to
the neutron at the tree level. Similarly the gauge boson must also
couple to the neutrino, which appears in a doublet along with the
electron. The only uncharged spinI particle which was known at
the time when these theories were being formulated was the photon.
But the photon cannot couple to uncharged particles.
o Exercise 15.1 Consid" lln SU(2) glllige theo11j. For
II
Jennion
Chapter 15. Standard electroweak theory
328
t/J whose TJ.eigenl.lutue is tJ under the go.u~e group, show that the
intera.ctions with the neutral gauge boson "' ::1 i8 given by
(15.1)
One can consider various possibilities at this point. One is to
put either the electron or the neutrino in the same multiplet as the
neutron. The diagram for i1decay would be different in this case.
Another alternative is t.o enlarge the fermion spectrum, including
new fermions, such that the neutron and the proton are parts of an
8U(2) triplet, with the neutron transforming as the zeroth comp<>nent. According to Eq. (15.1), the neutroll will not have fundamental
couplings with the uncharged gauge bosoll ill this case, and the gauge
boson can then be identified with the photoll. These possibilities are
not phenomenologically successful.
Alternatively, we could keep the fermions in doublets, but acknowledge that the neutral gauge boson of tl", 8U(2) we have been
considering is not the photon. In this ca,e. if we want to construct
a theory that includes the photon as well. we need a larger gauge
group. The smallest gauge group which contains one more gauge
boson apart from the 8U(2) ones is 5U(2) )( U(I), or equivalently
U(2). This leads to the standard electroweak theory, which has been
tremendously sucC<'.ssful ill describing weak and electromagnetic interactions. This is the theory we want to outline in this chapter.
15.1.2
Pure gauge Lagrangian
The gauge group, as we said, is 8U(2) )( U(1). 8U(2) has three
generators, and U(1) has one. In tlie douhlet representation of 8U (2),
we can write the generators as r/2, wlwT(, the T'S a.re the Pauli
matrices. It is convenient to consider thes(\ lIlatrice~ ill the following
combinations:
1 x
T+  J2(T
Tj
J2(T x
T,
. y
(OJ2)
+ 2T
0
} =

iTy )
C() )
0 1
( J20
() 0)
(15.2)
15.1. Gauge group
329
The only nonvanishing structure constants corresponding to this
choice are
/+3
/+3+
13
1+3
= i ,
=  h++ = i ,
=
h
(15.3)
= i.
Note that the structure constants are not completely antisymmetric
in this basis. The U(I) generator will have to commute with all SU(2)
generators, and therefore must be a multiple of the unit matrix.
Corresponding to the four generators, there are four gauge hosans
in this theory. We will call the SU(2) gauge bosons
and
W~. The U(I) gauge boson will be called B w The generator of this
U(I) is not the electric charge (it is something called hypercharge)
and so B~ is r:ot the photon field. The photon will be identified later.
The pure gauge Lagrangian, containing the E?;auge fields only, can be
written as
W:' W;
<p
Lg suge 
'4I W.~lJ WIa ~v 
I B
4"
Bl'v
(15.4)
wbw,
J1
(15.5)
IHI
where
W JW
VJi
W"v
iJv W
iJ.  9 1alK'
II
as defined in Eq. (14.20) for general YangMills fields. We have put
a dagger in one of the factors of W~v in Eq. (15.4). This is because
the index a, b, c take the values +, ,3, and fabc are not real in this
basis. For the U(I) part of the theory, the definition of B~v is similar
to that of the field tensor for electromagnetism:
(15.6)
As was observed for a general YangMills theory and for QED, the
pure gauge Lagrangian cannot contain allY mass term for the gauge
bosons since such a term would not be gauge inva.riant, However,
experiments show that the gauge hosons nwdialing weak interactions
are massive.
Chapter 15. Standard electroweak theory
330
15.2
Spontaneous symmetry breaking
15.2.1
Introducing the Higgs boson multiplet
The way to solve this problem was indicat.ed in 13.6, where we
showed how spontaneous symmetry breaking in a gauge theory leads
to the generation of mass for gauge bosons. To implement the same
idea in the present cont.ext, we need some sca.lar fields in the theory.
Let us introduce a .calar multiplet which is a doublet under the
SU(2) part of the gauge group, and write this as
(15.7)
On the right, we have ::;l1ow11 the gauge transformation properties of
this multiplet. The '2' indicates that it i. a doublet of SU(2), and
we have normalized the U(I) charge such t.hat its value is ~ for the
multiplet .
The Lagrangian now contains terms involving ,1."\ well. These
are
(15.8)
Here Dp.. as before, is the gauge covariant derivative. Since the gauge
group l.lOW contains two factors, we should have the gauge bosons of
both factors ill the definition of D JJ, and these two sets can corne
with two different coupling constants. In general, when a multiplet
transforms like an ndimensional representation of SU(2) and has a
U(I) quantum number Y, we should write
DJ.l =
JJ
. T(n)w
. 'YB Ii
+ 1.9
a
I.l + 19
a
(15.9)
where T!") deuote the generators of SU(2) in the ndimensional rej>resentation, and Y is the identity matrix times the hypercharge. For
the doublet , the SU(2) generators are Ta /2, and we can write
(::)
(15.10)
15.2.
SpolJtaneous sym1IJetry breaking
331
Let us now suppose, as in various cxamples 011 spoutallcous symmetry breaking in Ch. 13, that 1,2 < 0 in Eq. (15.8). In this case, the
gauge sYlllIllctry will bc spontaneollsly brok~l\. The minimuJII for the
scalar potential will be ohtained for ~calar ricld COil figurations given
by
v
rt>"
(15.11)
= /2 X ,
where X is a douhlet sati,fying XIX = I, and
,,=/_,,2/>,.
(15.12)
vVithout a.ny loss of generality, we will consider our system to be in
the vacuum sta.te
(15.13)
\.e., X
= (~).
Any other vacu II rn ,atisfvinJl E'1' (15.11) can be
reachcd from this by a Jllobll.1 8U(2) x
vacuum state, the generators T 1 and
symmctry, since Tx 1>" # 0, T y 1>c1 # O.
and the U(I) generator Y abo do not
U(I) t.ran,furlllatiou. In this
T'). are no more part of the
Tlw dial(ollal gencrat.ors T3
annihilate the VaCUUm state.
However I notice that
(15.14)
Thus there is one diagonal gencrator which annihilates tile vacuum.
This is a linear combinat,ioll of T3 and Y 1 givell b.y
(15.15)
The original gauge ,yrnmetry is therefore hroken down to a U(I)
sYIIlTlletry gencrated hy Q. Thi, is not the originaJ U(I) part of the
symlllctry group. As we have shown, its generator j::; actually a combination of an 8U(2) generator and the original U(I) generator. To
make this distinction, we will write the origillal sYlllmetry froUl
Oil
flOW
;c, 8U(2) x U(I)y. wherea., thc rellillall\' s.Vlllllletry will be called
lI(l)Q. TIll' lat.ter is in fact the electrOlU<lj!,'!H:tic gallgc syrnmctry,
Chapter 15. Standard electroweak theory
332
as we will see fro!)l the couplings of its gauge boson with fer!)lions.
For now, notice that the superscripts on the cO!)lponents of '" which
appeared in Eq. (15.7) are indeed the electric charges of the corresponding fields.
15.2.2
Gauge boson masses
Since 4>0 cannot be described by creation anel annihilation operators,
we can use the technique employed in Ch. 13 and write
(15.16)
where 4>+, Hand (are all quantum fields with vanishing expectation
values in the vacuum state. Using this decomposition. we can write
the Lagrangian terms of Eq. (15.8) in tenns of the quantum fields.
Using the expression for D,,4> from Eq. (15.10) and ",hstituting Eq.
(15.16) in it, we find various kinds of terms. Oue of thcse pick up
only the VEV part of the. scalar field 4>. The.e quadratic terms in
the gauge fields are
where
2'2 =
~ (gW,~+9IB"
IIxI1 2 =
XIX for any column matrix ,t. Explicitly. this gives
2'2 
J2gW;
J2gW,:
gW~ + g' fl"
) (
~ )
72
1 '.!
,:\
2
41 g 2 v 2 w,.+W" _ + 8"
(gU ,. + 9 fl,,) .
I
(15.17)
(15.18)
For any charged spiul field V;, of mass M. the. milSs term in the
Lagrangian is A,f2~rVJ'. Thus, the IlHl.5S of the charged W' boson is
1
MW=2 gv .
(15.19)
gW; + g' Bj.J of neutral gauge bosuns appearing in Eq. (15.18), we first define the combin"tions
As for the combination
J
Z" = cos Ow Wj.J  sin8w DIJ
A"
= sin Ow
w;: + ern; Ow B
I, .
(15.20)
15.2. Spontaneous symmetry breaking
333
where Ow is called the Weinberg angle, defi ned by
g'
tanOw=.
(15.21)
Then the kinetic terms for Z~ and A~ are properly normalized because this is an orthogonal transformation, and the last term of Eq.
(15.18) can be written as kv2(g2 + gfl)Z~Z~. Since the mass term
for a real spin1 field v~ is ~M2V~ V~, we conclude that the mass of
the Z particle is
(2
1
M z=g+g
2
'2) y, v=
Mw .
cos Ow
(15.22)
We see that A~ remains massless even after the symmetry breaking process. As we mentioned, only a part of the gauge symmetry is
broken. There is "remaining V(l) symmetry. A~ is the gauge boson
corresponding to this symmetry. In 15.3, we will see that this gauge
boson is in fact the photon, and so the remnant symmetry is the
V(l) symmetry of QED.
15.2.3
Scalar modes
In 13.6, we discussed that a gauge boson can become massive in
a spontaneously broken theory by eating up a scalar degree of freedom. This scalar disappears altogether from the theory in the unitary
gauge. In other gauges, it appears with a propagator containing an
unphysical pole and therefore represents an unphysical mode. In the
electroweak theory so far, we have seen that three gauge bosans, the
W+ I W and the Z I become masSive. There must he three unphysical modes corresponding to them.
One way to identify these modes is to look at the 4>Lagrangian
in Eq. (15.8) and substitute the expression for </> in terms of the
quantum fields which appears in Eq. (15.16). Among other things,
we will find terms like
(15.23)
As argued in 13.6, this shows that the W+ can be annihilated at a
point with the creation of a +, without any ot.her particle interacting
with them. This shows t.hat</>+ is indeed the unphysical mode eaten
up by the W in the process of symmetry breakin!?;. Similarly, W
334
Chapter 15. SUlIulard electroweak theory
eats up </>, and Z eats up the field (. We can see this clearly if
we go to the unitary gauge by writing t.he doublet cb in a manner
similar to what wa., done in Eq. (13.66). In sneh a gange, only the
observable degrees of freedom appear in the Lagrangian. and the
particle interpretation is straight forward.
o Exercise 15.2 ShOUl that Eq.
(15.8) tuhich inuolve n
~:S'i.nglt'.
(15.2~l) ("()'fllnllls
uLl te.rnt3 in Eq.
rleri\Jutille.
o Exercise 15.3 * Construct thr, unitury gn.uge. explir.itly blJ writing th~ doublet 1> in. tc.rms oj quantum, tichb l:\O t1lut after a. ga.uge
tTansforTllation the un:physir,ul de.grt',E'.s oj jTerdam disappe.ur jrom the
Lagrangian.
For calculational purposes, however, it is more cOllvenient to usc a
gauge fixing term snch tilllt the mixed quadratic terms of Eq. (15.23)
disappear from the Lagrangian. As we discussed in 13.6, this can
be achieved if we choose the gauge fixing term to be
(15.24)
The terms of Eq. (15.23) now pair witli the cm'" terms from this
equation to become tota.l divergence terms. awl we are left with the
gauge boson propagators in the form
W)
(
iD,,"
(10)
~
( Z)
iD,,~
r
I.
10 2 
Ma,
+ iE
i
(10) = 102 _ Ml+
''" 
[
iE
,w 
(1  010,,10"]
1: 2  t:Ma,
(15.25)
Ok",:,,]
'2
(15.26)
(I k"  t: M
The unphYRical ~calar propagators a.re similar t.o that ohtained in Eq.
(13.76):
i~($'l(k) =
i~(()(k) =
102
. t:,\["\I' .
I
10 2 
t:AI'j, .
(15.27)
(15.28)
This choice of gallg" is called t.he R, gaug". Th" 011 bscript. t: in the
name refers to the gauge parameter: and tht, [t'tter H indicates that
the powercounting rC'lIonnali:tauility of the theory i::; obvious ill this
gauge since all propa~at.ors seale like 11k'}. for largE" vallles of k.
15.3. Fermions in the theory
335
There is also a physical scalar mode called the Higgs boson, which
corresponds to the qUi\ntum field H in Eq. (15.16). It is an uncharged
scalar boson with mass given by
M~ = 2.\v 2 .
(15.29)
Existence of this Higgs boson can be counted ",nong the definitive
predictions of the standard model.
15.3
Fermions in the theory
When the electroweak theory was first formulated. it. appeared only
as a theory of leptons. Although subsequently the quarks were included in the framework and it is now known that. the theory suffers from some inconsistencies without the quarks. it is simplest to
discuss first the leptonic sector of the theory without bringing. the
quarks in. Moreover, we will start with only the electron and the
electronneutrino, leaving the other leptons and quarks for 15.3.4.
15.3.1
Gauge interactions
The electron and the electronneutrino form what is called the first
generation of leptons,. Parity is known to be violated in weak interactions, so it is expected that the different chiral components will
have diffcrcnt interactions. In the standard c1ectroweak model, these
fermions are supposed to transform like the following gauge multiplets:
WeI. 
(~~,)
e/l
1
(2. 2)'
(1,1).
(15.30)
It should be noted that we have written dowli both the left and right
chiral projc'Ctions of the electron. but olily the left chiml projection
for the neutrino. At the time of the formulation of the theory and
for quite a while afterwards, all laboratory experiments showed that
neut.rinos always appeared to be left halld"d. and t.hat is why t.he
right chiral neutrino .state was assumed not to exist ill the standard
model. As we will see, a consequence of thi:> assumption is that
the neutrinos turn out to be ma.."islcS';, III !lI0[P' recent timeoS there
Chapter 15. Standard electroweak theory
336
has been some indication that neutrinos may have mass. If these
observations are confirmed, some modification of the standard model
would be necessary. It is not clear what the correct modification
would be. Howeverl neutrino masses, even if nonzero, must be small.
For most processes which do not depend crucially on a nonzero
neutrino mass, one can still obtain a good estimate by taking the
neutrinos to be massless. For this reason, we present here the original
version of the standard model, without any possible modifications
which might be prompted by nonzero neutrino masses.
We can write down the gauge interactions for tne leptons. The
gauge covariant kineti<: terms for the leptons are
it' = W. Li'Y~ D~oJ!'L
+ eRiy~ D"c/i.
(15.31)
D,.
The general form for
was given in Eq. (15.9). Since eR is an SU(2)
singlet, the SU(2) generators Tn = 0 for it. Using Q = T 3 + Y, we see
that both eL and eR have electric charge 1, while V,L is uncharged.
This justifies the hypcrcharge assignments for these fields.
o Exercise 15.4 ShOUl that the. ITec. Lugrun.yinn oj a Dirac jield can.
be written in tcrm~ of the c.hirol projt'..etioTt~ ILS
.!L' = 'hh"a",pt
+ ,pRh"a~,pR  m('h,pR + ,pR,pd (15.32)
The derivative tcrllls in Eq. (15.31) give the usual kinetic terms
for the elect rOll and the neutrino fields. The other terms in Eq.
(15.31) are the interaction terms of the leptons with the gauge
bosons:
~nl
\IIcL'Y /1 (g7 (1
2
W,~  ~dBJ.i)
9 (" w+
J2
veL'YeL
I!
1JJt'f, +eR'YJ1.g'BJ1.eR
+ el,'Y " I/r:f"'l
\1')
t
1_
J1.
3
I
:iVd,'! v,dgW~  9 B,,)
1
3
+:ien~Cf,(9W~
+ 9' B") + '1I)"g' B"ell.
(15.33)
The interactions involving W are cal1~d r;harged current interactions because these gauge hosons arc elect.rically charged. These
lead to the following Feynmall rules:
W"
34
JLL
';2'Y ,
(15.34)
15.3. Fermions in the theory
337
!(1
'Ys), the projection operator for left chirality defined
where L =
in 4.7. The incoming fermion line is V e and the outgoing one electron
if the gauge boson going out is W+, and the converse if the gauge
boson going out is W.
Let us now look at the interactions of neutral gauge bosons in
Eq. (15.33). These are called neutml current intemctions, and we
should rewrite them using the neutral boson eigenstates Z" and the
photon. Looking back at the definition of the Z boson in Eq. (15.20),
we see that neutrinos have neutral current interactions with only the
Z boson. This should have been expected. Neutrinos do not have
electric charge l and therefore cannot have any treelevel interaction
with the photon. The neutral current interaction of neutrinos is given
by
..>f.(v,)
nc
(15.35)
The Feynman rule for the neutrino vertex with the Zhoson is thus
given by
Z"
(15.36)
lie
As for the electron, we ca.n rewrite the terms containing the electron field in Eq. (15.33) as
,,>(e) _
'xnc
1_ "L e(W
2"e"'(
9 ~  9 '8)
iJ.
3
+ 9,"
F:"Y e 8 /i.
(15.37)
using L + R = 1. In terms of the gauge boson eigenstates, this can
be written as
2
go
cas w
ey"LeZ"+g'e'Y"e(sinOwZ"+cosOwA")
2CO~ OW ey" [L 
2sin Ow]eZ" + g' cosOwe'Y"eA" '
(15.38)
usmg the definition of the Weinberg angle from Eg. (15.21). The
338
Chapter 15. StaJldard electroweak theory
Feynman rule for the Zinteraction of the electron is thus given by
Z,.
(15.39)
The interaction betwccn the electron and A" agrees with QED if we
identify the magnitude of the electron charge as
e = g' cos Ow = gsinBw.
~
(15.40)
Exercise 15.5 Shorn that , for any fermion. oj charge eQ whose leftchiraL component has a T3 eigenvaluf'. oj t3 and thf'. rightchirnl com
panent is a.n 5U(2) singlet, th.e. Fe.ynmnn rule lor the ZinteTa.clion
1S
'~
co::> W
)'"(t3L _ Q sm 2 Ow).
(15.41)
Exercise 15.6 Show tha.t the ga.uge intera.ction~ oj th.c fermions
uiola.te purity and CimJnria.nce, but not CP.
15.3.2
Electron mass
So far. we have discussed the gauge covariant derivative terms involving the fermions, which include the kinetic term for free fermions and
the gauge interactions. We now discuss the mass of the fermions.
For any fermion field ,p. the mass term can be written as
(15.42)
Take the electron field, for example. The eL transforms like part
of a doublet under the SU(2) part of the gauge group, and the eR
transforms like a singlet. The combination of the two cannot be
gauge invariant.. Thus, in the fundamental Lagrangian of the standard model, there is no mass term for the electron.
However, the Lagrangiall has spontaneolls symmetry breaking,
and to drive this process, we had to introduce a doublet of Higgs
bosans. Leptons can have gauge invariallt interactions involving the
Higgs bosans. These arc Yukawatypc interac.tions, given by
YY"k = h, (I!!,Le R + eR,ptl!!cl.)
where he is
<\
coupling c.onstant.
(15.43)
15.3. Fermions in the theory
339
Exercise 15.7 Ch.eck that th~ intunction
te.rTni'l
of Eq. (15.43)
(lTe
inuaria.nt undt'.T the. gauge group SU(2)xU(1).
After spontaneous symmetry breaking, if we rewrite the Higgs field
> in terms of the quantum fields defined in Eq. (15.16), we obtain
2'Yuk
= he
[;2
(eLeR
+ eNeLl + 'D,LeR> + + eRlIeL>
+ ;2 (eeH + i "'Ys e()]
(15.44)
The terms involving the vacuum expectation value v of the Higgs field
are the mass terms for the electron field, which have been generated
by the spontaneous symmetry breaking process. Comparing with Eq.
(15.42), we find th"t the electron mass is given in this model in terms
of the Yukawa coupling constant he and the vacuum expectation
value of the Higgs boson field:
(15.45)
The neutrino, on the other hand, remains ma.."islcss even after spontaneous symmetry breaking because of the absence of the lIe R.
15.3.3
Yukawa couplings
The Yukawa couplings, i.e.,
fields of the theory, appear
using Eq. (15.45). For the
gives the following Feynman
the couplings of fermions with scalar
in Eq. (15.44). We can rewrite them
unphysical charged sc"lar modes, this
rules:
i J2m e R ~R
v
'.j2Mw
'
(15.46)
_ . J2m e L  ' ~me
t
VI
v'2Mw L
where in both cases we have used Eq. (15.19) in writing the latter
form. Similarly, the neutral scalar modes have the following cou
340
Chapter 15. Standard eJectroweak theory
plings:
>'
>
H
. me _
1 V
. :::,9<"._
1.
2cosew
Mz '
(15.47)
mc
11
_
'"15 
[J
2 co", IJw
Tn
iFz
The neutrino has no coupling with the nelltra.l scalar modes since
the model has no rightchiral neutrino.
15.3.4
Other fermions in the model
It is now known that there are three generations of leptons. In the
standard model, the second and the third generations of leptons look
exactly the same as the first one. The charged leptons in the second
and the third generations arc the muon and the tau. We can write
all the couplings involving these particles hy looking at what we have
done for the electron and merely changing the electron mass to the
relevant lepton ma..c;s. There is also a neutrino corresponding to each
charged lepton.
If we try to incorporate qua.rks in the model, we find similarities
and differences with leptons. Like lepton~. quarks also come in three
generations. In anyone g:encration, the lcftchiral components of
quarks come in 5U(2) dOllblets and the right.ehiral ones in singlets.
The representations are
1
(2, 6)'
UAll
dAR
2
(1'3)'
1
(1, 3)'
(15.48)
Here A is a generation index that fUllS frolll 1 to 3. Collectively, we
will call all the UA'S a" u[>type quarks and <LA's as dowutype quarks.
Since all generations have the same group transformation property,
the Yukawa couplings c.an connect any two generations. So for the
quarks,
~Yuk
= 
L
A.B
(hAuiiH<i>dBR
+ h~BiiAI.J,llHR + h.c.),
(15.49)
15.3. Fermions in the theory
341
where </> = iTo</>'. We now see the differences with the leptonic sector,
which we discuss in some detail.
Here we have two kinds of terms. In the leptonic sector, we had
only the first kind since there are no right handed neutrinos in the
model. The right chiralities of the uptype quarks ,::ow give rise to
the second term. In constructing this, we have used <I> defined above,
which contains the complex conjugates of the fields appearing in the
multiplet <1>:
1
: (2'2)
o Exercise 15.8 Ij </> is
lln~ ~U(2)
(15.50)
doublet, ShOUl thllt </>' does not
transform like a doublet, but = iT2cP dOE'_l:I.
So far, there does not seem to be any problem. We can of course
have Yukawa couplings involving two different fermions. The problem arises when we replace </> by the quantum fields using Eq. (15.16).
This will now give mass terms as in Eq. (15.44), but they will involve
two different fermions in general.
We can, of course, partly avoid the problem by choosing the basis
states of Eq. (15.48) in such a way that the 3 x 3 matrix h' is diagonal. This implies choosing a basis in which Ut, UO, 113 are really the
physical particles, the upquark 1l, the charm quark c and the top
quark t. However, the matrix h need not be diagonal in this basis.
This is a new feature, not encountered for leptons, since there was
only one kind of coupling in the leptonic case which could always be
taken as diagonal. In the present case, we obtain a mass matrix for
the downtype quarks, given by
(15.51)
To obtain the eigenstates of the downtype quarks, we then need
to diagonalize this mass matrix. This ca.n he donp. in general by
llsing a biunitary transformation. It means that for any matrix
m, it is possible to find two unitary matrices K and K' such that
m = K'mKt is diagonal, with nonnegative entries. These entries
can then be interpreted as the masses of the physical downtype
quarks, which we identify with the down quark d, the strange quark 5
and the bottom quark b. We will denote these by '?A with A = 1,2,3.
342
Chapter 15. Standard eJectroweak theory
The mass terms of these quarks coming from Eq. (15.49) can then
be written as
L(KmK't)AB dALdBR.
A.B
(15.52)
This shows that the eigenstates can be defined as
{15.53)
This has an interesting consequence on the gauge interactions of
quarks. The interaction with the Wbosons can be written as in
Eq. (15.33), where one must use the components of the same gauge
multiplets. But since the physical downtype quarks are different
from those in the multiplets, we get
,,,,(IV)
oLio<
=
9 "(J2
'"AL'Y "d AL W+
"+ h .c. )
=  ~2 L
(UAL"Y"KAB9BLW: + h.c.)
(15.54)
A.B
using Eq. (15.53) to express everytbing in terms of the eigenstates.
The matrix K is called the quark mixing matrix, or the CabibboKobayashiMaskawa matrix. The Feynman rule for the W coupling
for the quarks is therefore
(15.55)
Exercise 15.9 Find the couplings of the Z bOSOT\s 'With the qUQTk
eigensta.les. Show tl\o.t these do not inuolue the mixing ma.trix K I
and i. giuen by Eq. (15.41).
15.4
Gauge boson decay
We have discussed processes involving photons in Ch. 9. Photons
are massless, and hence cannot decay. In the electroweak theory, we
have massive gauge bosons Wand Z, which can decay. This is a new
feature of the electroweak theory, which we discuss now.
To be specific, let us consider the decay of the Zboson to a
fermionanti fermion pair. The coupling depends on which fermion
15.4, Gauge boson decay
343
we are talking about. In order to be general, let us write the coupling
as
9
2int =  2 cos 8W
h~(af  bns)! Z~,
(15.56)
where
(15,57)
which follow from Eq, (15.41), The Feynman amplitude obtained
from this would be
At = 
'98
2cos
,,(ph~(af  bns)v(p') (~(k),
where p and p' denote the momenta
tifermion in the final state, and (~(k)
the decaying Zboson,
We want the decay of unpolarized
to average over the initial polarizations
This gives
(15,58)
of the fermion and the anis the polarization vector of
Z bosons, For this we need
and sum over the final spins.
(15,59)
In writing this expression, we have neglected the rermion masses since
the Zboson is much heavier than all known fermions which it can
decay to, Thus we obtain
1..... 1
9
(~(k)(~(k))
 cos28w
x (aJ + bJ)(P"p'" + p"p'~  9~vp' p'),
/, 2 _
(15.60)
There is also an antisymmetric contribution to the trace, but it does
not contribute in the final count since the polarization average is
symmetric in the Lorentz indices, as we will now see.
Polarization sum
In Ch, 8, we showed that the photon has only two physical degrees
of freedom and calculated the polarization sum that can be used for
344
Chapter 15. Standard electIOweak theory
external photon legs in Eq. (8.88). For a massive gauge boson like
the Z, this expression cannot be used for two reasons. First, k 2 is not
zero for a massive onshell particle with momentum k. Second, an
arbitrary massive gauge boson need not couple to conserved currents.
Since a spinI boson callnot have more than three internal degrees
of freedom, we can always choose the physical polarization vectors
so that each of them satisfies the. relation
(15.61)
and are normalized by
f~(k)f.~(k)' =
0".
(15.62)
In the case of the photon, only two such vectors are allowed for
reasons described in Ch. 8. But in the case of a massive gauge boson,
there can be three of them. For the 3momentum k and energy
E = Vk2 + M2, these are the two unit spatial vectors orthogonal to
k, and a third one given by
(15.63)
where k is the unit vector along k. This is usually called the longitudinal polarization vector since its spatial components are in the
same direction as k, and so we have denote.d it with the subscript
'I'. The transverse polarization vectors, on the other hand, can be
chosen just as for the photons. Combining, we find that the three
physical polarization vectors satisfy the relation
(15.64)
To obtain the polarization average, we need to divide this by the
number of polarization states, i.e., by 3. We see that the polarization
average is symmetric in the Lorentz indices.
15.4. Gauge boson decay
345
Decay rate
Going back to Eq. (15.60) now, we can write the matrix element
square as
. (15.65)
The rest frame of the Zboson is defined by
k"=(Mz,O).
(15.66)
In this frame, we can write
(15.67)
where p = ~ M z since the masses of the fermions are being neglected.
Using these, we obtain
1~12 = 3
2
'2
')
20 (af+bj).
9 2MZ
cos
(15.68)
Plugging this into Eq. (7.119) and performing the angular integrations, we find the decay rate to be
qz > f
f) =
g2Mz
(02+b 2 )
4811" cos 2 Ow f
f
12sin 2
aMz
(112 +b2 ).
2
Ow cos Ow f
f
(15.69)
using Eq. (15.40) in the last step. Note thiit this is the decay rate
into a specific pair of decay products. To find the total decay rate of
the Zboson, one needs to sum the rate.s over all possible final states.
o Exercise 15.10 Using Mz
= 91 GeV. shaUl that the decay mte
into a neulrinoantineutrino pair is about 165 MeV. Us. sin 2 8 w =
0.23.
o Exercise 15.11 Using Mw
= BOGeV, ShOUl that the mtc lor the
decay W+ . e+lIe is abou.t 220 MeV. You CUll negLect the mass of
the. electron.
346
Chapter 15. Standard electroweak theory
o Exercise 15.12 Con8ideT the decay of the Higg8 bo80n H into"
lepton"ntilepton pniT t+ t (where t = e, /' etc). Show th"l lhe
decay rote. i8 gi"en by
r(H t+r) =
<>m)MH
8sin 2 Bwcos2 8wMi
(1 _
4m))~'
M'h
(15.70)
IHint: Ha\le you sun this done ea,,lieT in th.e book in 80me jonn?1
15.5
Scattering processes
15.5.1
Forwardbackward asymmetry
In 9.6, we discussed lhe process e e+ . /l /l+ in QED. We can
now calculate how the cross section is modified by weak corrections.
"(, Z
H,(
Figure 15.2: Lowest order dioagrams (or the process ee+ 1JJ.1.+ in the
electrowea k theory.
There are four diagrams in the lowest order, as shown in Fig. 15.2.
Of these, we have already calculated the amplitude for the photon
mediated diagram in 9.6 and found
(15.71)
where 8 is the Mandelstam variable. Notice also that the diagrams
mediated by the Hand ( contain couplings which are proportional
to the fermion masses. It was already argued in 9.6 that it is perfectly justifiable to neglect the electron mass in studying this process.
Hence these two diagrams do not contribute to the amplitude in this
approximation. The coupling of the electron and the muon with the
Zboson are the same, given in Eq. (15.39). Using Eq. (15.40), we
15.5. Scattering processes
347
can write the Feynman rule in the form
2e
. 28
sm
'Y,,(gV  gA'YS) ,
(15.72)
where
(15.73)
Then we can write the Zmediated amplitude as
2
e .fz
Jltz =  [jj(k2h~(gv  gA'Y5)U(kt} ]
by choosing the
[u(ptl'Y~(9V  9A'YS)V(P2)] ,
(15.74)
= 1 gauge, and defining
fz =
sin 2 28
w (s 
M~)
(15.75)
So the square of the total amplitude, averaged over initial spins
and summed over final ones, is given by
IJltl 2= ~ L
IJIt, + JltZ!2 .
(15.76)
SP1~
There are four terms, one of which has been calculated in 9.6 and
found to be
(15.77)
where now we neglect the mnon mass as well, anticipating that the
corrections from the Zboson can be important only if s is comparable to M~, and therefore much higher than m~. The square of the
Zmediated amplitude is given by
IJltzl2 = e4fH(g~ + g~)2(1 + cos 20) + 8g~g~cosOl.
(15.78)
As for the cross terms, we find
JIt;Jltz = e:{ZTr
[jln~(gv 
9A'YoJjlnP]
xTr [Pl'Y~(gV  9A'YS)P2'YP] .
(15.79)
Chapter 15. Standard electroweak theory
348
The 9A9Y terms vanish because they multiply one trace involving "15
which is antisymmetric in the Lorentz indices A, p, and another trace
which is symmetric in these indices. We are then left with
.41;.41.
+ h.c.
2e4fZ[9~(l + cos 2 0) + 29~ cosO]. (15.80)
Adding up all the contributions, we can write
(15.81)
where
at = 2Jz9~
2
a2 = 4 f Z9A
+ fi(9~ + 9~f ,
2 2
+ 8f2Z9Y9A'
(15.82)
The at contriblltion has the same anglllar dependence as the
pure QED result. The important qualitative differellce from the pure
QED result is the appearance of the a2term. which is proportional
to cos O. Without this term, the differential cross section would be invariant under the trallsformation (j + 1f  e. In other words, it would
have been forwardbackward symmetric. The a2tcrm introduces a
forwardbackward asymmetry.
The forwardbackw"rd asymmetry is ill general defined by the
ratio
.A
"/2
dC!;"
dC!
dOsinO dOsinOu
dn
"/2
dn
_ 1
In
dC!
(15.83)
dOsinO dn
with the integration over the azimuthal alll4lc implied ror each illtegral. For this process. Eq. (15.81) gives
A=~
0'2
8 1 + al
(1584)
o Exercise 15.13 Eunluf'ltc the amount oj JOl"wurdbuck1..llurd u.81Jm.
mctTy in lhil:l proC('.~~ ul ~ =
IlHsumi.ll!1 sin 2 Ow = 0.23.
tA11,
349
15.5. Scattering processes
1I,,(k)
Z
e(p)
e(p')
Lowest order diagram for the process v~e
electroweak theory.
Figure 15.3:
15.5.2
+
vpe
in the
Low energy weak interactions
The process we discussed above can be mediated by electromagnetic
as well as by weak interaction. Now we consider processes which can
go only via the mediation of the W or the Z. These will be purely
weak processes.
As an example, we take the elastic scattpring of muonneutrinos
(II,,) by electrons. This can be mediated by the Zboson exchange, as
shown in Fig. 15.3. There is no diagram with neutral scalar exchange
because, as we saw in 15.3, the neutrinos do not couple to neutral
scalars.
The Feynman amplitude of this proceso call he directly written
using the Feynman rules given in Eqs. (15.36) and (15.39):
,.J{' =
Ap
(i
)2
_ig
9

2cosOw
q  Mz
x [U(k')YALu(k)] [u(p')yp(9V  9A'Y.;)U(pJ] ,
(15.85)
where we have used the Feynman 't Houft gaUR" and introduced the
notation q = k  k' = pi  p.
Let us suppose now that the energies involved in the process are
much smaller than M z. Then q2 M~. and we call neglect the
momentum dependence of the Z propagator to write
.4 = ( 2M z
~osew
x [U(k')YALlL(kJ] [u(p')yA('I\'  '1nr,}1L(f/)].
(15.86)
But this is exactly the n.lnplitude that we would have' ohtained if we
350
Chapter 15. Standard eJeetroweak theory
had started from a 4fermion interaction Lagrangian
2lnt =
~ [,p(""n~(1 
"YS)V>hl]
[,p(,)"Y~CqV
 9A"YS),pk)] . (15.87)
provided we had identified the Fermi constant as
g2
Cp 
g2
 4v'2M~ eos2 Ow  4v'2Ma, ,
(15.88)
using Eq. (15.22) in the last step. Thus at low energies, weak interaction is given by 4fermion interactions.
In this limit therefore, the calculation of the cross section is very
similar to what was shown in 7.5. We omit the details and write
directly
1A'l12 =
16C}[(gV
+ gA)2(k. p)2 + (gV
_ gA)2(k' . p)2
(g~  g~)m;k. k'] (15.89)
for this case. Then, in the Lab frame in which the initial electron is
at rest, the differential cross section can be written down using Eq.
(7.118):
da
dO
C2p m 2,
4,,2
(m, +w wcasO)2
x [(9V
+ 9A)2w2 + (gV
 gA)2 w'2  (9~  g~)ww'(1  cosO)].
(15.90)
Here 0 is the angle between the initial and the final neutrino momen ta, and
(15.91)
is the energy of the final neutrino. The total cross section can be
found by integrating over the angular variables. For w m, one
obtains
a =
C}m.w [
2,,'
(gv
1
+ gAl 2+ 3(gV
o Exercise 15.14 The, muon deco.'Y,
tlin u W excha.nge gra.ph.
jJ.
1
 gAl
e
2] .
+ Vc + V/l.'
(15.92)
VTOCe.eds
Draw the diagram and write down the
amplitude. NoUJ URI'. Eq. (15.88) to show that ij th.e energies a.re
~ma.ll C'.ompurtd to the ",' ~m(l~~, the tOHlf'.Rt order amplitude is the
sa.me. flR that obta.ined Irom the 4jermiort Lagra.ngian of Eq. (7.36).
15.5. Scattering processes
351
Exercise 15.15 For the elastic 8Cilttering between the Lie Q,nd the
electron, there is a1\ additional diagram with a. W bo80n exchange
which contributea at this order. Add th.e a.mplitude of this diagmm
with that oj the Zmedio.ted diagrom, a.nd show that IAfj 2 is giuen
by the same form 0.8 in Eq. (15.89) prouide.d we define
1 2sm
' 8w,
9V=2+
15.5.3
9A =
'2.
(15.93)
High energy scattering
As a last example of scattering processes in the standard electroweak
theory, we consider the pair creation of W bosnns:
(15.94)
The initial center of mass energy has to be high enough to produce
the W's in the final state. Therefore we can neglect the electron
mass.
1, Z
(a)
(b)
Figure 15.4: Lowest order diagrams for ee+  WW+.
There are five diagrams for this process at the tree level. One of
them, shown in Fig. 15.4a, has an intermediate neutrino line. Two
others are collectively shown in Fig. 15Ab, where the internal line
can be either a photon or a Zboson. In addition, there are diagrams
involving exchanges of neutral scalars Hand (.
The calculation is very lengthy for unpolarized particles in the
initial as well as in the final states. So we will demonstrate this
calculation in a very specific easel characterized by the following
conditions:
1. The electrons have positive helicity.
Chapter 15. Standard electroweak theory
352
2. Only the production of longitudinal polarization states of the
W's is considered.
3. The center of mass energy is much higher than Mw, i.e.,
Ma.
Since we are neglecting the electron mass, the diagrams involving
neutral scalar exchange vanish. In addit.ion, the first assumption
means that the electrons arc right chira!' In this case, Fig. 15.4a
does not contribute either. This can be underst.ood from the coupling
given in Eq. (15.34), which shows t.hat it. vanishes for a rightchiral
electron. Since the couplings involve a ,,matrix, it also implies that
the positron h"" to he rightchiral as wei!.
The second assumption means that the polarization vector for the
final state particles can he chosen as in Eq. (15.63). Moreover, since
E = k due to the third "",,umption, we can write the longitudinal
polarization vector simply a.s
(15.95)
These considerations allow us to write down the amplitudes in
the Feynman't Hooft gauge ""
where f are the longitudinal polarization vectors for the W, and
V~v~(L, k+) =
(L 
k+hg~v
+ (2k+ + L)".'!v;  (2L + k+)vq~~.
(15.97)
Exercise 15.16 C:u: the FCllumnn rule for cuhic coupling from
Fi{J. 14.1 and the. fLt'.ji.nition:; oj UH~ Z boson lLlut th.e. photon in. Eq.
(15.20) to 1Je.rif1J lhf' ~VWZ una WH/A toll.phn.~p! UlH'rt (~houc.
Since un = Rll where R = ~(I + ",,), th,' kftd,ira! project.or in
the Zcouphng docs not contribute. TheIl. using Eq. (15.10). we can
353
15.5. Scattering processes
combine the two amplitudes to write
0S_lM1)
2
vf(= vf(,+vf(z = e [VR(p+h'UR(P_)]
x V~",(L,k+),~(L)<~(k+).
(15.98)
H we now use Eq. (15.95), we obtain
1
+ Mw)(k+
M;t (k+ L
s
M2 (k+  L)"
(15.99)
2 w
llsing s
Mev in the la.~t step to write k+ . k_
 L),
= !s in the center of
mass frame. The factor in Eq. (15.98) coming from the propagators
can also be simplified as
1
s
s M~
M1
M1 .
;("M::e,C7
'"  s2
ss z)
Putting all of these together, we finally obtain the
(15.100)
~mplitude to
be
(15.101)
using the relation b<!tween the masses of the Wand the Z bosons
from Eq. (15.22).
Exercise 15.17 * Show that one obtains c:mctllJ th.e same a.mplitude as in Eq. (t5.101) if the external Ht'linea aTe imagined to be
repln('.ed by Ute tines. You need to deriue titf'. Fe:ynman ruLes jar
the 4> coupLings to cnleulate this dia.grum. INote: Thi.~ cqui\lUlence
of the amplitude holds only for the lonfjitudinul polnrizn.tion stale a.t
energies much high.er than thf'. gClU!Je hOMn
tnc eqtLivalenCf~ theorem. I
In.U~l(,
and is known as
There is now no average over initial spin ~illce we have taken the
initial fermions in specific spin states. So wp cannot directly use
the spin sum formulas while calculatinR the ;jquare of this matrix
element. Instead, we use the chirality projec.tic)J] operat.ors to write
(15.102)
354
Chapter 15. Standard eJectroweak theory
It then follows that
[VR(P+ hPUR(P)] t [VR(P+ h>'UR(P)]
= [u(p_hPRv(p+)] [v(p+)Y>Ru(p_)]
[u(p_ h P Rv(p+)] [v(p+ h>' Ru(p_) 1.
(15.103)
spin
The last equality works because of the projector appearing in the
bilinears. Even if we sum over all spins, the projector will pick out
only the corresponding projection. This allows us to use the spin
sum technique to write the above expression as
Tr [p+ Y>Rp_ "'(PRJ = 2(p~P".
+ p~rI;.
 g>'Pp+. p_)
+ <term.
(15.104)
The quantity denoted by IE_term' contains tl1e antisymmetric tensor,
and is irrelevant since the expression obtained here is contracted with
something which is symmetric in >., p. So we get
1.41'1
4
(
>.
P
= 28 2 cos4 Ow P+I'
>. p i >
. )
+ pp+
 28g P (k+
 L)>.(k+  k_)p
e4
sin 2 O.
(15.105)
4cos 4 Ow
Here 0 is the scattering angle, which we have taken to be the angle
between p_ and k_. We have also used kinematical relations like
p+ . p_ = k+ . k_ = ~8 which are valid with our assumptions.
The scattering cross section can now easily be obtained using Eq.
(7.106):
7,",,2
a = 68 cos4 Ow .
(15.106)
o Exercise 15.18 Consider the decay oj muons at Test with. spins
aligned aLong the direction s. With the interuction of Eq. (7.36)
2
and using spin 'Projection operators, ShO'Ul that l.lr1 is obtained by
replacing p>' b'Y p>"  Ullin>' in Eq. (7.49L where n>' is defined by Eq.
(4.81). Assuming me = OJ ShOUl that thf': differential aecay is giuen
by
<if
C'm' (
1
dfl = 7:8"~ 1;) coso).
(15.107)
where 8 is the angle between s and the, electron momentum. {N ote:
Under 'Parity 8 I' IT  B since the spin remains invariant while the
momenta are reuensed. The cos B term then 8ignals parity uiotation1
15.6. Propagator for unstable particles
15.6
355
Propagator for unstable particles
Clearly, the scattering formulas derived in 15.5 do not hold if the
incident particle energies are such that s = Mi. For example, the
quantity !z, defined in Eq. (15.75), blows up. This is because the
Zboson is an unstable particle. In writing the propagator of any
unstable particle, we must take its nonzero decay width into account.
To see how this can be done, let us consider a scalar field for
the sake of simplicity. For a stable particle, the solutions of the free
equation of motion are plane waves. If however the particle has a
decay width r in its rest frame, the free particle solutions "'(x) should
decay with time as
(15.108)
up to normahzation factors. The factor /H/ E occurs in a general
frame to account for the time dilation factor for its lifetime l/r.
Thus we can write
"'(x)
~ cxp (iEt + ip' x
;~rt)
(15.109)
In order to identify a particle by its energy, the uncertainty in
energy measurements must be much less than its energy, i.e.\ .6.E
E. In the re.st frame of the particle, this implies l:J.E m. Due
to the timeenergy uncertainty relation, the time taken to make the
measurement will satisfy l:J.t l/m. Unless this time is much less
than the lifetime l/r of the particle, no such measurement would be
possible. Thus for a particle interpretation of the quanta of the field,
it is necessary that the condition r m and therefore r E be
satisfied. In that case, ignoring O(r' /m') corrections, the equation
of motion for the field "'(x) can be writteu from Eq. (15.109) as
(0 + Tn'  imr) "'(x) = O.
(15.110)
We can now use the methods of 3.7 to fiud the propagator of this
field. We will get
l:J.p(p) =
p' 
.'
Tn' + unr
(15.1ll)
Chapter 15. Standard electroweak theory
356
A similar analysis for vector bosons would give the following propagator for an unstable vector boson in the Feynman 't HODft gauge:
D"v(p) = 
2
g"v.
.
p  rn 2 + Mar
(15.112)
Since r Tn by OUT as~umptjon, the extra term added is immaterial
unless p2 is very close to ,n'l, i.c., unless the particle is onshell, or
very nearly so. But for a nearly onshell particle in the intermediate
state, one should use this propltgator.
Exercise 15.19 Find thf", jorwnrdbackulO.nl. o.slJmme.trll in e e+ Ali. Take sin 2 8w = 0.23 UR hf'jore, and r z = 2.49 Ge V.
p. Ji+ o.t s =
15.7
Global symmetries of the model
The standard clectroweak model is ba.'.)cd 011 a. local symmetry with
the gauge group SU(2) x U(I). Thi. of r.our.c entails the same
global symmetry. But in addition, the ,tandard model has some
other glohnl symmetries as well.
One of these is the ba.ryon number symllletry, which is defined ~
(15.113)
where q herc stands for all quark fields, and B i. the baryon number
of quarks, usually taken to be
Antiquarks havc B =
All
other particles are assumed to have zero baryon number and hence
do not transform under this symmetry operation. One can go back
and check easily that no iuteraction of the standard model changes
a quark into a lepton, hence this symmetry.
In the leptonic sector also there are symmetries of this kind. For
example consider
1.
1.
(15.114)
where e is any negatively charged lepton or it.s corresponding neutrino. Conventionally, the quantity (" called it'plon number, is taken
to be 1 for all of them and I for their l1.nt.iparticlco. All other particles have zero lepton numher. In fact. unlike the quarks, there is.
no intergcnerational mixing in the lcptollic spctor of the :::;tundard
model, so even the lepton numbers corrcspondiug t.o each generation
15.7. Global symmetries of the model
357
is conserved. In other words, if we assign a global quantum number
I to the electron and the electronneutrino, I for their antiparticles, and zero to all other particles, the quantum number will be
conserved. The same applies if we assign the number to any other
charged lepton and its associated neutrino only.
These symmetries have a very different character than the gauge
symmetry. First of all, these are global symmetries, as opposed to the
gauge symmetry which is local. Secondly, these are not symmetries
that we imposed on the model. We built up the theory with the gauge
symmetry and with the specific fermion content, and after doing that
we found these extra symmetries in the theory. Such symmetries are
called accidental symmetries.
There is another difference. In this book, we have always relied
on perturbation theory to derive the interactions. However, if nonperturbative effects are allowed, it can be shown that a global U(I)
quantum number ' cannot be a symmetry of the model unless it
satisfies the relations
The two sums in this equation are over the leftchiral and the rightchiral fermions, and !Z is any diagonal local charge. The general
condition is somewhat more involved, but this is enough to show
that baryon number and lepton number are not symmetries if we
include nonperturbative effects. However, the difference B  . is
still a global symmetry in the model, sinc.e it satisfies all the tests
like the ones given in Eq. (15.115).
Exercise 15.20 Verih th"t
(~=~) (Bf2 = O.
(~= ~) (B )..'11' =
0,
where. 12 co.n be the dectric chnrge. or liu'. h'Yperchuige Y.
Remember tha.t each quul'k is uctua.U'Y Cl C:010T triplelo[
(15.116)
IHint:
Appendix A
Useful formulas
A.I
Representation of ')'matrices
The 'Ymatrices are defined by the anticommutation relation
(A.I)
and their hermiticity properties:
('Y~)t = 'Y0'Y~'Y0.
(A.2)
We mentioned tbat these two properties do not uniquely define the
specific form of the matrices. In particular, if two sets of matrices
h~) and h~} both satisfy Eqs. (A.I) and (A.2), they will be related
by
(A.3)
for some unitary matrix U. In view of this fact, we tried to do everything in a way that is manifestly independent of the representation
of the 'Ymatrices.
In order to gain some insight into the solutions, sometimes a
specific representation is helpful. For example, in 4.3.2, we used the
DiracPauli representation, in which the matrices are given by
'Y
0=
(I 0)
0 I
. (0
"'(I =
0")
_a i 0
(A.4)
'
where I is the 2 x 2 unit matrix and the O"'s are the Pauli matrices:
1
(0 I)
0'=10'
0'
359
(I 0)
0 I
(A.5)
360
Appendix A. Useful formulas
The advantage of this representation is understood by looking at the
solutions for the spinors derived in 4.3.2. For a nonrelativistic particle, the lower two components of thenspinors are negligible. Hence
the nonrelativistic spinor is effectively a twocomponent one, which
is easier to use. Thus the DiracPauli representation is particularly
helpful for the nonrelativistic reduction of a problem. It was used
for this purpose in 11.2. In this representation, "15 and o~v take the
forms
~5 = (JooJ),
I
0' = i (0,. 00') ,
aij :=::: [ijk
(ok0 )
uk
(A.6)
where "'jk is completely antisymmetric symbol with "123 = + 1.
Another useful representation is called the chiral representation,
in which
J)
(A.7)
"I = ( J 0
Notice that the matrices "I' have the same form as in the DiracPauli
is different. In this representation,
representation, but
,0
(A.B)
Thus the chirality projection operators in this representation have
the form
~(l+"I5)= (;~),
(A.9)
So the chiral eigenstates will have zeros for either the upper two components, or the lower two. If one has to deal with chiral eigenstates
explicitly, this is a more convenient representation to use. It is used
extensively, for example, in discussing supersymmetric field theories,
a subject which is beyond the scope of the present book. In this representation, Oij arc the same as in the DiracPauli representation,
whereas
_ .
a 0, 1
(0' )
0
0
'
(A.IO)
A.2. Traces of 'Ymatrices
361
A third useful representation
Here,
IS
the Majorana representation.
;u
'Y = (
iU
0 )
iu 3
1
_;u 1
(A.Il)
from which we can calculate
2
02)
'Y5 = ( uO u
(A.12)
Notice that all the 'Ymatrices are purely imaginary. For one thing,
it assures that the u~v's are also purely imaginary. In the Dirac
equation, the operator iy~a~  m is now completely real. As a result,
if we have a solution where all the components are real at one time,
they will remain real for all subsequent times. This is very useful if
there are fermionic particles which are antiparticles of themselves.
o Exercise A.I Find
A.2
(1~./I18
in. the Mo.joro.na representation.
Traces of ,,matrices
For the sake of notational simplicity, we will use a number of 4vectors which we will denote by a" with r = 1,2,3 , " and consider
traces of the form
(A.13)
We first show that Tn vanishes if n is odd. For this, we use the
properties of the matrix 'Y5. Since its square is the unit matrix, we
can write
Tn(al,02,'''On) = Tr ['Y5'Y5~1~2'''~nJ
= Tr 1'Y5~1~2'" ~n'Y5J ,
(A.14)
using the cyclic property of traces. We now take the 'Y5 appearing
at the end through the ~k 's. Since 'Y5 anticommutes with all the
'Y" 's, we will obtain a minus sign each time we go through one of the
slashed vectors. To go through n of them, we will obtain a factor
362
Appendix A, Useful formulas
(_I)n, at which point the two factors of ", will be sitting next to
each other and we will forget them since tilo:;e two factors will give
1
the unit matrix. Thus we obtnin
Tn = (I) nTn '
(AI5)
This equation says nothing if n is even, but if n is odd, it proves that
Tn
vanishes.
The nontrivial traces thus involve an p.vetl 1l1llnber of Imatrices.
For the ease of n = 2, we obtained the trace in ~7,2,1 at some length.
Here we repeat tile derivation ill a. different Hotation, which will be
lIseful for the trac.es with higher number of l'matrices. For any two
vectors a ano b,
1# = 2a b  N,
(A.16)
which follows from the anticommutation r<'iation of the ,'matrices,
Therefore,
T1 (a"a2) = Tr 1~'~2]
= Tr 12a, 'a11  Tr [~2~,1
= Tr [2a, . a2] Tr [~,hJ
where we have lIsed the c.yclic property of traces.
terms, we obtain
T 2(01,a2) = Tr [a, a2] =
'
(A.17)
By rearranging the
0, '"1Tr [IJ,
(A.18)
where we have explicitly written the unit matrix implied in Eq.
(A.16). The trace of the IInit matrix is 4, so filially we get
T2 = 4o, . "1.
(A.19)
Once this is done, all higher traces can 1)(> expressed in terms of
T2 . For example. starting now with the definit.ion of T, and \Ising
this relation repeatedly, we obtain
T,(a" a2, a3, a,) = Tr [~'~2~3~,J
= Tr [(2a, . a2  ~2~tl~3~,J
= 2o, . 02 T 2(a3, a,)  Tr [~2~,h~.d
= 2al . a2 T2 (03. a,)  Tr 1~2(2a, . n:j  ~3"1 )~'1J
= 2al' a2 T 2 (a3.(/.,)  2al' "3 T2("2,''l) + Tr [~2~3~1~,J
= 2a, a2 'f.,(a3' a,)  2"1' a3 T2(1t2."")
+2a, . a., T,(02, a:l)  Tr l~d:l~,~,1 .
(A.20)
A.2. nares of "Imatrices
363
The remaining trace is equal to the original trace due to the cyclic
property of traces. So we obtain
T4 (01,02,03,0.)
= 0, . 02 T2(03, 0.)  0, . 03 T2(a2. a,) + 0, . a4 T2(a2,a3)
= 4(a, . a2"3' a,  a, a3a2' a. + a, . "4"2' a,,)
(A.21)
In a similar way, we can show that
T2n (al
a2,a2n)
= at a 2 T2n2(a3,Q4,a2n)
al . a3 T2n 
2 (a2, f11,'"
a2n)
+al' a4 T2n 2(a2! a:t,Us ... a:l 71 )
+ ...
(A.22)
The traces can also be expressed in their bare form, without using
the 4vcctors. E'or ex;\mple, from Eq. (A.19). we c;\n extract the coefficients of afa2 from both sidC'B"i to write
(A.23)
Simil;\rly. from Eq. (A.21), we can take t.h,' c(>efficient" of "'t"2a5a~
to obtain
(A.24)
Since "(5 contains four ,,matrices, traces involving 1'.") can bp. nOI1vallishing only if there is atl even Humber of ,matric~s lIlultiplying
1'5. If this numht:'r happens to be zero, tlH' lracl' is zero, since we
ha.ve shown ill eh. 4 that 15 is tra<:eless. The trace also vanishes if
1'5 is ar.eolll panicd by two 1 matrices. Til is is hecause
Tr 1"1,,)""15)   Tr 1"1",,',,,]
  Tr h,,"1,,),,].
(A 25)
At the fir~t equality sign, we lH1ve used the a.llticOllllllllt.ing: property
of 151 and at the sc<.~ond, we have llsed t.IH' ('.velie propert.y of traces.
\Vhat \...e have outaincd iti that the trace sholllJ be alltisyrnmctric
in the indices IL, v. There is no sHch rnllk2 t.ensor in the structure of spacetime. and hence tlH' tra.ce must vanish. Tlw smallest
nonvanishing trace involving "/.5 thlls cOllla.ills fonr lI10re l'lIlat.rkcs.
364
Appendix A. Useful formulas
These four have to be all different, because if any two are the same,
the trace can be reduced to the form in Eq. (A.25). Because difference "Imatrices anticommute, the trace has to be proportional to the
antisymmetric tensor. The proportionality constant can be fixed by
calculating Tr hO"lI1'2'Y3'Y51,
(A.26)
To obtain higher traces involving "15, it is useful to use the identity
"Ip"l""I' = gp""I'  gp'''I"
+ g",gp 
i<P"'p"lP"l5,
(A.27)
which can be checked by substitution.
One interesting property of the "Imatrix traces can be proved
using the matrix C introduced in 1O.3. This is
Tr [~Hj2'" ~2nl~2n] = Tr [~2n~2nl .. '~2~11
(A.28)
Exercise A.2 Proue Eq. (A.28).
Exercise A.3 ShOUl that a.n" ma.tN 'Which ant\commutes with all
the ~JJ mU8t be a. multiple oj "(5. [Hint: Use the ba.sis matrices gi'Uen
in Eq. (.\.14).1
A.3
The antisymmetric tensor
The antisymmetric, or the LeviCivita tensor\ is a rank4 tensor
which is antisymmetric in the interchange of any pair of indices.
Thus, the tensor can have nonzero components only when all the
four indices are different, and in this case, if we define its value for
any order of the indices, the antisymmetry fixes the values for all
others. We have chosen
'0123
+1.
(A.29)
Following the usual rules for raising and lowering of indices, this
would also imply
,0123
= 1.
(A.30)
When there is a pair of antisymmetric tensors, we can express
them in terms of the metric tensor:
(A.31)
A.4. Useful ;ntegration formulas
365
where the square brackets indicate that all permutations of the lower
indices are to be added, even permutations with a factor of + 1 and
odd permutations with a factor of 1. This can be verified by direct
substitution. When one index is contracted, we obtain
f
IJV>'P
E~II'),.!p'
(A.32)
We can now go further and contract one more index. This will
give
(A.33)
Another contraction would lead to the identity
~v>..p
fiJv>.(I 
69pl'
p
(A.34)
and finally, if all the indices are contracted, we obtain
f iJV>'P fp.v>.p
24 ,
= 
(A.35)
A.4
Useful integration formulas
A.4.1
Angular integrations in Ndimensional space
Consider an Ndimem:iional space with Cartesian coordinates
XI,X2XN. Let us denote the magnitude ofx by r, i.e.,
N
LX; = r
(A.36)
i=l
If we integrate a function of T over the entire space, we should be
able to express the integral in the form
d N x f(T) = eN
faOC dr TN 1f(T)
(A.37)
for some number eN. Essentially, this eN b obtained by integrating
over the angular variables in a spherical coordinate system. Our
ta.c;k is to determine this eN.
366
Appendix A. Useful formulas
Since the angular integrations do not depend on the form of the
function f(rL we can use any convenient function to obtain eN. Let
us consider
(A.38)
fir) = exp (_r2/a2) .
In this case, the integral appearing on the right side of Eq. (A.37)
can be rewritten as
fo~ drr N  1 exp(_ r2 / a2 )
=a N fo~ dppN1
In mathematical analysis, the
is defined by the relation
r function
cxp(/). (A.39)
for 'lny complex variable z
(AAO)
where we have written (
p2 in the la.~t step. Thus we ohtain the
result:
Right hand side of Eq. (A.37) =
~CN(/"f(NI2).
(AAl)
On the other hand. using Eq. (A.36), we can write the left side
in the form
JdNxex[l(~x:la2)
= [[:dTex p
=
(x 2/ a2
aN 1r(1/2)]N .
)r
(AA2)
For N = 2, Eqs. (AAl) and (AA2) give f(1/2) =.fii. For arbitrary
N, we then obtain
eN
o
27r N /'l
= r(NI2) .
(AA3)
Exercise A.4 IJ u:t: considererl
f(r)={~
ifr
if,. >
R,
n.
(A4~)
the integTation clt'.jill('.(l in Eq. (A.3"1) stw'ld,d. hl1.n. gi,llf'lt 1I~ tlH' votume
of n sphere in N~di1rl('1L:flon. UI\f> the Jormtlln for (',\,' !)il:ell. ClhOl~f' tn
ucrifll thllt. one ohtlLi.ll~ the ('orn'cl "uol'U.1lL(," for N = 2.3.
A.4. Useful integration formulas
A.4.2
367
Momentum integration in loops
In loop diagrams, after introducing the Feynman parameters, the
momentum integrations can be reduced to the form
,
_JdNk
(k')'
I. (a, N) (21f)N (k' _ a2 ).
(A.45)
Here, N is the dimension of spacetime. For convergent integrals}
it can be taken as 4. In order to tackle divergent integrals as well,
a subject which has been discussed in Ch. 12, we keep N arbitrary
here.
After performing the Wick rotation, the integral becomes
'( N)
1,a,
.(
=,
I
)'H
d kE
(k'jy
(21f)N (k~ + a2)' '
(A.46)
At this point, we can use the result obtained above for the integration
over the angular variables to write
(A.47)
where
T
roo
Jo
J,(N) 
y"l+~N
dy (y + I)'
(A.48)
with y = k1/a'. This remaining integral can be expressed in a
standard form if we substitute
z=
y+1
(A.49)
In that case,
Y
I  z = "..,y
+1'
dy
dZ=(y+I)2'
(A.50)
So we can write
J;(N)
= 10"0 (Y~YI)2
=
fal dz
(1 
(y~Jl+jN (y~J'ljN
zrl+ jN Z''IjN
=B(r+~N,sr~N),
(A.51)
368
Appendix A. Useful formulas
In the last step, we have introduced the betafunction of mathematical analysis, which can be given in terms of the ffunction defined
above:
B(
z,
z') = f(z)f(z')
(A.52)
r(z+z').
Substituting this into Eq. (A.47), we finally obtain
i(1) r +o ..,._';;;f(r + !N)f(s  r  !N) (
)
I. (a, N) = (41T)N(2 (a 2 ).r1 N
f(~N)f(s)
. A.53
r
For convergent integrals, we can put N
suppress the dimension. This gives
1
(a2).
="
f(r
r
4 as mentioned earlier and
+ 2)fts f(s)
r  2)
(A.54)
Appendix B
Answers to selected exercises
Ex. 1.3.
N
1
"(ala.
+ )"",.
2
.,
H = L
Hamiltonian :
i=l
state :
N
number operator:
.%=
Lara..
i=l
Ex. 1.1. Maxwell equations are given in Ch. 8. The Lorentz force law on
a particle with charge q is
dpl' = qF~. dx.
dr
dr
where
x~
are the coordinates of the worldline of the particle, and r is
proper time, dr'l = g",.udx""dx v .
Ex. 1.8. r = 2 x lO6 s .
8V
Ex. .. (0 + m 2 ) =  ().
8V
Ex. .3. (0 + m 2 )' =  () .
Ex.
.~
The answer appears in Ch. 8.
369
370
Answers to selected exerdses
Ex. fl.7.
= Iq.
a) P.
00
b)
H= ;1 L
(p~ + z2w~q~).
k=1
c) 11 a!n] d) H
= o'm , 11., amJ = [al, a!nl = o.
~ 2(1'1.
""".
1 + 1.1.).
1
= L.
k=l
Ex. 2.8.
a) T"V = 8"</>8"</>  9"v!f,
b) T"" = 8"</>18"</> + 8"</>8"</>1  9""!f.
c) TIJI/ = _F~P8tJ A p _ gJ.JII,:LJ.
Ex. fl.9.
a) j" = ;q(</>18"</>  </>8"</>1).
b) j" = iq(</>18"</>  <I>8"</>1 + 2iqA"</>I</.
Ex. fl.10. j" =8"(x"</8"</>x"!f.
Ex. 3.5. P"
Ex. 3.7. Q
J
J
=~
= iq
d3 p P" (1 1(p)a(p) +a(p)af(p).
3
d p H(p)a,(p) 
Ex. 4.8. mv,(p)y"v,(p)
Ex.
4.9.
a~(p)adp)].
= p"v,(p)v,(p).
v(p'h" v(p) =  ,:. v(p') [(p + p')" 
i""" qvl v(p) .
Ex. 4.1fl. Same as in Eq. (4.61) and Eq. (4.62).
Ex.
4.26e
>.
.,;f(AJlIJ
. >.
!1/J'Y>'aJ.tIll/J
(T\x u
T\,xj.J),
where
>.
T " = 1/"'Y 0"'"  9 "!f.
Ex. 6.2.
Ex.
G" v'2 "'(".)'1"(1  '15)"'(.) "'(n)'1"(1

'15)"'(p) .
6.4.
S(3') = (_ih)3
/
(2rr)'o'(k _ p _ p') [1
J(~:~,
1]
v'2WkV )2 Ep V .j2Ep ,v
i6p(q) [u.(p)iSF(p  q) iSF(p)V,(p')1 '
Answers to selected exercises
Sf(3,'")
= (
,
(2
h)'
x
"
)'6'(k
J(~:~,
371
1'1'
')
[1
)
2W k
I]
V J2E p V J2E p'V
iilF(q) lu.(p)iSF(I') iSp(p  q)v.,(p')] .
Ex. 7.3. Yes.
Ex. 7.5.
Ex. 7.6
r=
G'FffiJ's
r(1f+ . e+ve )
r(7r+
8:~ +)
192,,3 (I 
m'
mP
=2
J.L+v j , )
<+
,
(mm;.m')'
m; .
2
Ex. 7.9. ]] GoVapproximately.
Ex. 8.1 01,FIW
+ A,f2 All
Ex. 8.2. B(x) = Bo(x)
jV.
d'x' Go(x  x')f(x'), where Go is the mass
less scalar propagator (Green's function.for the wave equation) and 80
satisfies the wave equation.
Ex. 8.5. Physical states: a!(k) 10) for,. ~ 1,2.
Ex. 8.6. :PI': =
d3 k k P
L, a;(k)a,(k), wherp kO =
Wk
This expres
sion is valid on physical states selected by the GuptaBleuler condition,
so that only the transverse modes contrihute for cac:h kf' in the integral.
(Physical quantities like Hamiltonian and momentum are independent
of ~ when restricted to physical states.)
Ex. 11 ..1. eF(O).
Ex. 12.3. Define EJ.'olvt3 = 9p,1I9ot3  gJ.<l.Bgvo Then
akfk~ kj k~ (EJ.<IQlyjJE~"lpo
Ep,ClI>...,Eyf3l p/J
1f JI1I
:>..p(k t , k2 , k3 , kot ) =
E}.laIJld ElIt3I~")
bkf k~ kj k~ ( Ep,QljJ'v,E~~>'" E>".,I>..'p' E~P.' + EIJ011 "1I' Er~>..' E ym >..' p' E:~P.'
+
+
EJ.'QIJ."v,E~i'Ep<SI~'p,E~~'), where a and b are two form factors which
depend on all possible Lorentz invariant combinations of the momenta.
Ex. 12.8. 1/128 approximately.
Ex. 13.12. Q. = Eab'
Ex. 15.13. 17%.
Ex. 15.19. +1.9%
d x tPbtP,.
Index
IPI diagrams, see irreducible
anticommutator, see anticommutation
diagrams
IPR diagrams, see reducible diagrama
relation
antilinear operator, 212214, 219
antineutrino, 2, 74
2point function, 254, 259, 267269,
271,272,274
antiparticle, 3941, 45, 67, 71, lOS,
123, 182, 207, 327, 361
antisymmetric symbol, 291, 360
antisymmetric tensor, 49, 65, 354,
Abelian group, 291
accidental symmetries, see
symmetries, accidental
364365
antisymmetry of fermions, lOS, lID,
action, 1527, 72, 151, 152, 155, 167,
176
168, 285
antiunitary operator, 213, 215
approximate symmetries, see
symmetries, approximate
asymptotic series, 80
axial gauge, 151
adjoint representation, see under
group representation
amplitude, see Feynman amplitude
anapole moment, 229, 230
angular momentum, 26
eigenstates, 65
of a spinar field, 49, 53, 54
orbital, 54
annihilation and creation operators,
35, 3439, 44, 45, 65, 66,
72, 76, 82, 83, 86, 88, 94,
97, 102, 158, 160, 162, 167,
170, 194, 202, 297, 300, 302,
307, 309, 332
axial vector, 206, 207, 218
BakerCampbellHausdorff formula.
291
bare
fields, 273, 274
Lagrangian, 273, 274, 277
parameters, 274, 277
baryon number, 167, 289, 35&357
l1decay, 2, 74, 32&328
dimension of, 95
anomalous magnetic moment, see
magnetic moment,
constant, 74
biunitary transformation, 341
Bianchi identity, 318
bilinear covariants of fermion fields,
anomalous
anticommutation relation, 5, 48, 86,
124, 220
of Dirac matrices, 4850, 54, 59,
120, 209, 215, 239
of fermionic fields, 6668, 84, 97,
102, 103, 212, 220, see also
antisymmetry of fermions
206,211, 216, 217, 219, 220,
226,354
BremBStrahlung, 197199, 27~282
broken symmetries, see spontaneous
symmetry breaking
CabibboKobayashiMaskawa matrix,
342
and exclusion principle, 68
373
374
Index
canonical
commutation relations, 30, 32,
33,38
momentum, 1921,29,30,32,
38,150,153,161,295
qU8J\tization. 29, 80, 153
centerofm8.9S frame. see frame,
eM
charge conjuga.tion, 201. 207212,
214,217219,250,283,294
charge quantization, 169
chiral representation, see
representation of gamma
matrices
chirality, 335, 336, 338, 340, 341, 352,
357, 360
projection operators, 6162, 337,
353, 360
color, see qua..rk color
Compton scattering, 173, 185193
Coulomb gauge, l51
Coulomb interaction, to, 163, 194,
200, 204
modification at higher order,
275276
counterterm, 252, 267277, 280, 282
CP symmetry, 217218, 338
CPT symmetry, 218221
CPT theorem, 220
creation operators, see annihilation
and creation operators
cr088 section, 41, 115
for 2to2 scattering, 133140
for individual processes, see
under the proau name
general formula, 130133
Lorentz invariance of, 144
crossing symmetry, 186, 190
decay rate, 41
general formula, 115117, 140
of a scalar, 8794, 97101, 106,
112,117122
of Higgs boson, 346
of muon, su muon decay
of pion, .lI~e pion decay
of W boson, 345
of Zboson, 342345
degenerate vacua, 297, 299
derivative interactions, 88, 130, 246,
307
differential
cross section, 137, 139, 142144,
179,182, 192,193,197199,
243, 244, 348, 350
decay rate, 129, 130, 140
dilatation, 27
dipole moment, se~ electric dipole
moment &. magnetic
moment
Dirac equation, 1,2,49,5158,361
covEU"iance 0(, 5254
plane wave 8OIutions of, 5458
Dira.c Hamiltonian, 48, 65
Dirac matrices
contraction formul8.9, 54, 178
in Ndimensiona, 265
general properties, 4850, 359
hermiticity properties, 50
nonuniqueness of, 5051, 359
representations, see
representation of gamma
matrices
trace formulas, 49, 119120,
361364
DiTfw:Pauli representation, .tee
representation of gamma
matrices
discrete symmetries, see under
symmetries
electric dipole moment, 228230
electron mass, 144, 339
electronelectron scattering, 199
in QED, 173179
with sceJ.ar exchange, 101105
electronphoton scattering, see
Compton scattering
electronpositron annihilation
to muon pairs, 173, 1821&4,
346348
to Wpairs, 351354
electronpositron scattering, 173,
180182
electronproton scattering, see und~r
proton
electrowcak theory, 326357
Index
375
energymomentum tensor, see
stressenergy tensor
equivalence theorem, 353
EulerLagrange equations, 1214,
1718, 27,63,64, 14B
EulerMascheroni constant, 264
evolution operator, 7580, 82, 268
exclusion principle, see Pauli
exclusion principle
FadeevPopov ghosts, 322
Fermi constant, 123, 350
Feynman amplitude, 105 J10, 113,
117, 120, 121, 123125,
135137,141,142,144,175,
176, lBO, IB3, IB4, IB6, 195,
19B, 199, 241, 242, 244, 246,
247,254, 2BO, 305, 30B, 310,
343, 349
Feynman diagram, 87112, 155,172,
175, 197, 230, 309
Feynman parameters, 231233, 235,
23B, 241, 256, 261, 266, 367
Fcynman propagator
as Wick contraction, 85
(or Dirac fields, 6971
for ghost fields, 323
for massive vector hosans, 150,
309
for nonAbelian gauge bosons,
322, 334
for scalar fields, 4146
for the photon field, 154
for unphysical scalars, 309, 334
for unstable particles, 355356
!'eynman rule, 106110, 114, 170,254
for charged current interactions,
336, 342
for derivative coupling, 130
for effective vertex, 223
for electroweak theory, 339, 353
for external field, 195, 196
for external lines, 107, 164
for ghos"', 323
for Goldstone hosans, 307
for internal lines, 107, 164
for loop integration, 108
for neutral current interactions,
337, 33B, 347, 349
for QED, 170, 173, 175, lBO, 230,
254,261
for vertices, 110
for YM theories, 31B32O, 322,
352
Feynman't Hoeft gauge, 155, t61,
162, 164, 323, 349, 352, 356
finestructure consta.nt, 10, 179, 233
Fock space, 36, 37, 6B, 77, 94
form factor, 224244, 246, 251, 259
Lorentz invariance of, 224
4fermion interaction, 7475
as lowenergy electroweak limit,
350
inelastic scattering with, 140144
muon decay with, 122130
(rame
eM, 127, 134137, 142144,
17B179, IBl, IB3
Lab, 134, 137140, 143, 18B, 189,
195, 243
full Lagrangian, 273275
functional, 17, 19,20,64
derivative, 19,20
Poisson bra.cket of, .'lee Poisson
bracket
fundamental representation, jee under
group representation
'Ymatrices,
.'lee
Dirac matrices
gauge boson, 168, 2B4, 301>311, 315,
31B, 319, 322324, 327330,
332334, 336, 337, 342, 344,
353
selfinteractions, 319320
gauge covariant derivative, 169, 274,
30B, 314319, 330, 336, 33B
gauge fixing, 20, 151154, 176, 267,
274, 275, 309, 310, 322, 334
gauge pri"nciple, 167, 168, see auo
gauge b080ns & gauge
covariant derivative
gauge theory
Abelian, jee QED
nonAbelian, see nonAbelian
gauge theory
Index
376
GaU88 theorem, 17, 18, 22
generation, 335, 340
generators, 289293, 302, 304, 305,
313,315,317319,327331,
336
ghost fields, au FadeevPopov ghosts
global transformations, su under
symmetries
gluOlI8, 325
Goldstone bosons, see
internal symmetries, see symmetries,
internal
intrinsic
charge conjugation, 207, 208
CP,217
CPT,22O
parity, 202, 203, 205, 206
time reversa.!, 214
ineducible diagr8l1lS, 112
"","pin, 289, 293, 294, 327
NambuGoldstone bosons
Goldstone theorem, 300304
Gordon identity, 56, 223, 225, 234
gravity, 324, 326
Green's function, 41, 43, 44, 149, 151,
154, 163, see a130 Feynman
propagator
group
algebra, 291, see also structure
constants
general properties, 284
representation, 292293
adjoint, 293, 315, 322
fundamental, 292, 313, 314,
316, 317
singlet, 292, 319
Cupta.Bleulcr formalism, 160162
hadron, 326, 327
Heavisid~Lorentz
units, 10, J46
helicity
eigenstates, 61, 62, 351
of neutrinos, .fee neutrino helicity
projection operators, 6062
Higgs booon, 75, 335, 339, 340, 346
field, 339
interactions, 338
multiplet, 330, 33B
neutral, 349
Higgs mechanism, 308311
hypercharge, 329, 330, 336, 357
'in' and 'out I states, 82
inelastic processes, 115, 140144, )73,
184, 243
infrarecl. divergence, 198199, 245,
271, 279282
interaction picture, 78
Jacobi identity, 293
KleinGordon equation, 28, 29, 41, 42,
47,54
with a source, 41
KleinNishina formula, 192
Kronecker delta, 6"
Lab frame, see frame, Lab
Lamb shift, 276
Landau gauge, 155
Lande gfactor, 227, 228
for the electron, 238
for the proton, 242
lepton number, 167,356357
leptons, 130,324,335,336,340, 341,
346, 356
LeviCivita tensor, see antisymmetric
tensor
Lie grou ps, 290
lifetime, 10, 11, 117, 122, 130,355
loop diagrams, 93, 94, 100, lOS, 230,
245, 249, 250, 252, 254, 257,
259, 260, 267, 274, 280, 323
evaluation of, 231241
Feynman rule, 109, 110
loop expansion, 268
Lorentz force formula, 8
Lorentz invariance. 126, 144, 149, 160,
172, 2OS, 220, 224, 253, 257,
259,277, 317, 326
and VEV, 300
of action, 24, 73
of cross section, see under cross
section
of Feynman amplitude, 144, 244
377
Index
Lorentz transformation, 7, 16, 2526,
28, 30, 47, 52, 53, 156,
201203.283.285
indices, 54, 123, 125, 220, 343,
344
of coordinates, 7
of fermion bilinears, 54
of spinoTs, 5254, 73, 208, 209,
211, 257, 287
of vector fields, 156
Lorenz gauge, 151, 185
magnetic moment
anomalous. 228238, 242, 259,
272. 294
Dirac, 227
Majora"a representation, see
representation of gamma
matrices
Mandelstam variables. 144145, 179,
183, 346
mass matrix. 341
Maxwell equations, 8, 18, 40, 146,
147, 150. 152. 153, 155. 160.
165, 318
discrete symmetries, 204. 208
minimal substitution, 169, 253
momentum
conjugate, 14, 150
muon, 279, 324, 340
coupling with the Zb06On, 346
decay, 10, 122130. 141.350.354
magnetic moment, 238
mass, 10, 144
pair production . .see e e+
annihilation
NambuGoJdstone boson, 303305
interactions. 305308
natural units, 911. 28, 29,117
neutrino. 123, 126, 127, 129, 130.324.
327,335337,340,351.356,
357
coupling with the Higgs booon,
340
helicity, 141,335
mass. 124,336
masslessness of, 335, 336, 339
neutral current interactions, 337
scattering off electron
elastic, 349350
inelastic, 140144
neutron, 2, 74, 75, 288, 289, 293, 294,
324. 326328.
aUo
I.'
Pdecay
mass, 289, 293
Noether
charge, 24, 67
current, 24, 39, 64, 321
theorem, 2121, 40, 167,200,
223, 283, 30 I
nonAbelian
gauge theory, 312326. 329
group, 291
symmetry,3()()'301
non.perturbative effects, 357
normal ordered
charge, 67
Hamiltonian, 36, 41, 67,161,110
momenLum, 36
product. 89. 91. 92.102.174
and time ordered product, see
Wick theorem
of bosonic fields, 35
of fermionic fields, 66, 67
normalization
of fields. 30, 288
of flux, 132
of hypercharge, 330
of polarization vectors, 185, 344
of spinors, 56, 58, 66, 61, 225,
226
of stales. 5, 35, 37, 44. 76. 9497,
159, 222
nucleons, 75, 288
number operator, 5, 37, 39, 40
offshell particle, )) 1, 5U 000 virtual
particles
onshell
condition, 111, )88,235,280
particle, 111, 112, 172, 222, 230,
255,271,276,344,356
renormaliution, 269, 271, 276
pair annihilation, 173
378
Index
pair creation. 173
p.... ity, 201208, 214, 217219,231,
242, 244, 257, 283, 294
intrinsic, see intrinsic parity
violation, 230, 231, 338, 354
Pauli exclusion principle, 68
Pauli matrices. 57, 288, 289. 328, 359
PauliLubansky vector, 65
perturbation theory, 76, 80, 82, 106,
130, 250, 253, 268, 282, 357
phase space, 116
(or 2body decay, 120122
rOT identical particles, 117. 179
photon
charge conjugation, 208
in electroweak theory, 328, 332,
333
masslessness of, 150, 333
parity t.ransformation, 204, 206
polarization, see under
polarization
propagator, f49150, 153155,
162164
scattering off electrons, see
Compton scattering
under time reversal, 214217
pion, 75, 288, 289
decay, 130
decay constant, 130
m....... ,289
scattering off electrons, 244
Poincare invariance, 16, 72, 285, 301
Poisson bracket, 15, 20, 29
polarization atates
circular. 158
elliptical, 158
number of, 159, 161, 162
physical, 192, 193
transverse, 162
polarization Bum
(or massive vector bosons, 344
for photons, 164165, 192, 193,
199
polarization vectors, ~ee abo
polarization states
for massive vector bosons, 344
longitudinal, 344, 352, see also
equivalence theorem
transverse, 344
for photons, 156159, 161,
184189, 192, 193
and gauge invariance, 184185,
187
longitudinal, 157
scalar, 156. 157, 161
transverse, 157
pole of a propagator 309, 310, 333
principle of least aelion, 12, 13, 17
Proca Lagrangian, ISO, 308
projection operators, 5962
for chirality, see under chirality
for energy. 5960
for helicity, see under helicity
I
for spin, 6263, 354
propaga.tor. see Feynman propagator
proton, 2, 74, 75, 167, 288, 293, 294,
324, 326328
electric charge as unit, 10. 166,
288
magnetic moment, 242
mass, 289, 293
scattering with electrons,
241244
pseudoscalar., 205, 206, 212
QeD, 323325
QED, 166199, 245, 248, 283, 284,
286, 312, 313, 315, 318, 323,
326, 329, 333, 338, 346, 348
divergent amplitudes, 2:)0252
invariance under discrete
symmetries. 206, 212, 216,
218, 294
renormaJiza.tion, 253282
vertex function, 230241
quantiza.tion
canonical, see canonical
quantization
first, 37
second, 37
Quantum chromodynamics, su QeD
Quantum electrodynamics, see QED
quark
color, 324
masses, 279
Index
379
mixing matrix, 342
quarks, 241, 324, 335, 34ll342, 356,
357
redudble diagrams, 1l2, 271
regularization, 252, 26~267
and gauge invariance, 263
cutoff, 252, 253
dimensional, 252, 26ll265, 267,
272, 323
Pauli Villars, 252, 265267, 272
relativity
general theory, 326
special theory, 58, 301
renormalizability
nonrenormalizable theories, 248,
275, 326
of electroweak theory, 334
renormalizable theories, 248, 253
representation of gamma matrices
chirsl, 360
DiracPauli, 60, 203, 210, 226,
228, 229, 359360
spinoT solutions in, 5658, 226
Majorana, 210, 361
running coupling constant, 27&279
Rutherford scattering formula, 197,
275
R( gauge, 334
Smatrix, 8ll86, liS, 116, 130, 131,
155, 172, 174, 175, 180, 184,
185, 194, 230, 275
scalar electrodynamics, 169
scattering angle, 135, 142, 143, 181,
354
selfenergy
for fermions, 174.251,254157,
265267, 27ll272
for nonAbelian gauge fields, 323,
324
for photon,
see. vacuum
polarization
simple harmonic oscillator, 35, 32 36
singlet representation, see under
group representat.ion
spin of a Dirac particle, 65
spin sum, 56,118120, 176,177,181,
343, 353, 354
spont.aneous symmetry breaking,
294311,330,331,333,338,
339
standard electroweak theory, see
clectroweak theory
step function, 31
stress energy tensor, 23, 25, 21
structure constants, 291, 292, 314,
317,320,323,329
SU(2)
definition, 287
doublet, 327, 328
generators, 290292
isospin, 289
nplct, 330
transformation, 289, 290
SU(3), 324 325
superficial degree of divergence,
245251
symmetries
accidental, 351
and conservation laws, su
Noether theorem
approximate, 289, 293294
continuous, 200, 283, 284,
286291
breaking of, 299301, se~ also
Goldstone theorem,
NambuGoldstone bosons
discrete, 20ll221, 283, 294, 295,
298, 304
global, 148, 166169, 284,
286289, 299, 308, 310,
312315,324,331, :156357,
see also Noether theorem
internal, 16,2627,283,285281,
301
sracetimp., 2426, 283
tau, 279, :l40
temporal gauge, 151
Thomson crosssection, 193
t.hreshold energy, 144
time dilaLiOll, 122, 355
time reversal, 'lOI, 212217, 219,283,
294
380
Index
timeordered product, 79, BO, 174
and normalordered product, see
Wick theorem
for fermion fields, 11
for scal", fields, 45
total divergence, 18, 23, 24, 64. 73,
148, 15], 309, 334
tree level, 100,253255, 267, 269, 271,
274, 277, 305, 306, 323, 327,
337, 351
interaction, 225, 241
ultraviolet divergences, 241, 245282
uncertainty rela.tions, 9, Ill, 355
unitary gauge, 310, 311, 333, 334
We;nberg angle, 333, 337, 345
Wick
contractions, 84
expansion, 86, 87, 90, 91, 94, 98,
104, 174
theorem, 8286, 88, 101, 106, 174
Wick rotation, 235237, 367
YangMills theories, ~ee nonAbelian
gauge theory
YukaW8.
coupling, 33934l
interaction, 75,87,101,117,256,
338
theory, 75, 104, 288
unitary groups, 327
unitary matrices, 50, 51, 210, 215,
286288,290,313,341, 359
as a group, 285, 290
unitary operator, 76, 81, 213. 215
units,
8U
natural units.
HeavisideLorentz units
unphys;cal mode, 303, 310, 322, 333,
334, 339
vacuum expectation value, 35, 84, 85,
98, 297, 299, 301, 304, 332,
339
of normal ordered products, 82
of timeordered product, 84
vacuum polarization, 174,251,
257265, 267, 270
vacuum stat.e. 34, 36, 37, 40, 41, 45,
68, 82, 83, 89, 94, 95, 97,
102, 103, 107, 159, 297, 299,
301,302,331,332
vacuum to vacuum transition, 94, 174
vertex function, 223224, 230, 242,
244, 253, 255, 258, 259, 272,
280
virtual particles, 1U}113, 172, 282.
322, 323
VVboson, 279, 327354
decay of. sa decay of W boson
mass, 332, 345
VV",dTakahashi identity, 253256,
258, 272, 274, 276
Zboson, 207, 2l7, 332353, 355
decay of. see decay of Zboson
decay width, 356
mass, 345