Beruflich Dokumente
Kultur Dokumente
E GROUP 61
2.1 Inertial observers . . . . . . . . . . . . . . . . . . . . . . . . . 62
2.1.1 Principle of relativity . . . . . . . . . . . . . . . . . . . 62
2.1.2 Inertial transformations . . . . . . . . . . . . . . . . . 64
2.2 Galilei group . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
2.2.1 Multiplication law of the Galilei group . . . . . . . . . 66
2.2.2 Lie algebra of the Galilei group . . . . . . . . . . . . . 67
2.2.3 Transformations of generators under rotations . . . . . 70
2.2.4 Space inversions . . . . . . . . . . . . . . . . . . . . . . 73
2.3 Poincare group . . . . . . . . . . . . . . . . . . . . . . . . . . 74
2.3.1 Lie algebra of the Poincare group . . . . . . . . . . . . 75
2.3.2 Transformations of translation generators under boosts 80
3 QUANTUM MECHANICS AND RELATIVITY 83
3.1 Inertial transformations in quantum mechanics . . . . . . . . . 83
3.1.1 Wigners theorem . . . . . . . . . . . . . . . . . . . . . 84
3.1.2 Inertial transformations of states . . . . . . . . . . . . 87
3.1.3 Heisenberg and Schr odinger pictures . . . . . . . . . . 88
3.2 Unitary representations of the Poincare group . . . . . . . . . 89
3.2.1 Projective representations of groups . . . . . . . . . . . 90
CONTENTS ix
3.2.2 Elimination of central charges in the Poincare algebra . 91
3.2.3 Single-valued and double-valued representations . . . . 99
3.2.4 Fundamental statement of relativistic quantum theory 100
4 OPERATORS OF OBSERVABLES 103
4.1 Basic observables . . . . . . . . . . . . . . . . . . . . . . . . . 104
4.1.1 Energy, momentum and angular momentum . . . . . . 104
4.1.2 Operator of velocity . . . . . . . . . . . . . . . . . . . 106
4.2 Casimir operators . . . . . . . . . . . . . . . . . . . . . . . . . 106
4.2.1 4-vectors . . . . . . . . . . . . . . . . . . . . . . . . . . 107
4.2.2 Operator of mass . . . . . . . . . . . . . . . . . . . . . 108
4.2.3 Pauli-Lubanski 4-vector . . . . . . . . . . . . . . . . . 109
4.3 Operators of spin and position . . . . . . . . . . . . . . . . . . 111
4.3.1 Physical requirements . . . . . . . . . . . . . . . . . . 111
4.3.2 Spin operator . . . . . . . . . . . . . . . . . . . . . . . 113
4.3.3 Position operator . . . . . . . . . . . . . . . . . . . . . 115
4.3.4 Alternative set of basic operators . . . . . . . . . . . . 119
4.3.5 Canonical form and power of operators . . . . . . . . 120
4.3.6 Uniqueness of the spin operator . . . . . . . . . . . . . 123
4.3.7 Uniqueness of the position operator . . . . . . . . . . . 125
4.3.8 Boost transformations of the position operator . . . . . 126
5 SINGLE PARTICLES 129
5.1 Massive particles . . . . . . . . . . . . . . . . . . . . . . . . . 131
5.1.1 Irreducible representations of the Poincare group . . . 131
5.1.2 Momentum-spin basis . . . . . . . . . . . . . . . . . . 134
5.1.3 Action of Poincare transformations . . . . . . . . . . . 137
5.2 Momentum and position representations . . . . . . . . . . . . 141
5.2.1 Spectral decomposition of the identity operator . . . . 141
5.2.2 Wave function in the momentum representation . . . . 145
5.2.3 Position representation . . . . . . . . . . . . . . . . . . 147
5.2.4 Inertial transformations of observables and states . . . 150
5.3 Massless particles . . . . . . . . . . . . . . . . . . . . . . . . . 154
5.3.1 Spectra of momentum, energy and velocity . . . . . . . 154
5.3.2 Representations of the little group . . . . . . . . . . . . 156
5.3.3 Massless representations of the Poincare group . . . . . 159
5.3.4 Doppler eect and aberration . . . . . . . . . . . . . . 162
x CONTENTS
6 INTERACTION 167
6.1 Hilbert space of a many-particle system . . . . . . . . . . . . . 167
6.1.1 Tensor product theorem . . . . . . . . . . . . . . . . . 168
6.1.2 Particle observables in a multiparticle system . . . . . 170
6.1.3 Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . 171
6.2 Relativistic Hamiltonian dynamics . . . . . . . . . . . . . . . . 173
6.2.1 Non-interacting representation of the Poincare group . 174
6.2.2 Diracs forms of dynamics . . . . . . . . . . . . . . . . 175
6.2.3 Total observables in a multiparticle system . . . . . . . 177
6.3 Instant form of dynamics . . . . . . . . . . . . . . . . . . . . . 177
6.3.1 General instant form interaction . . . . . . . . . . . . . 178
6.3.2 Bakamjian-Thomas construction . . . . . . . . . . . . . 179
6.3.3 Non-Bakamjian-Thomas instant forms of dynamics . . 181
6.3.4 Cluster separability . . . . . . . . . . . . . . . . . . . . 184
6.3.5 Non-separability of the Bakamjian-Thomas dynamics . 186
6.3.6 Cluster separable 3-particle interaction . . . . . . . . . 188
6.4 Bound states and time evolution . . . . . . . . . . . . . . . . . 192
6.4.1 Mass and energy spectra . . . . . . . . . . . . . . . . . 193
6.4.2 Doppler eect revisited . . . . . . . . . . . . . . . . . . 194
6.4.3 Time evolution . . . . . . . . . . . . . . . . . . . . . . 198
6.5 Classical Hamiltonian dynamics . . . . . . . . . . . . . . . . . 200
6.5.1 Quasiclassical states . . . . . . . . . . . . . . . . . . . 200
6.5.2 Heisenberg uncertainty relation . . . . . . . . . . . . . 202
6.5.3 Spreading of quasiclassical wave packets . . . . . . . . 203
6.5.4 Phase space . . . . . . . . . . . . . . . . . . . . . . . . 204
6.5.5 Poisson brackets . . . . . . . . . . . . . . . . . . . . . 206
6.5.6 Time evolution of wave packets . . . . . . . . . . . . . 210
7 SCATTERING 215
7.1 Scattering operators . . . . . . . . . . . . . . . . . . . . . . . 216
7.1.1 S-operator . . . . . . . . . . . . . . . . . . . . . . . . . 216
7.1.2 S-operator in perturbation theory . . . . . . . . . . . . 219
7.1.3 Adiabatic switching of interaction . . . . . . . . . . . . 223
7.1.4 T-matrix . . . . . . . . . . . . . . . . . . . . . . . . . . 225
7.1.5 S-matrix and bound states . . . . . . . . . . . . . . . . 227
7.2 Scattering equivalence . . . . . . . . . . . . . . . . . . . . . . 228
7.2.1 Scattering equivalence of Hamiltonians . . . . . . . . . 228
7.2.2 Bakamjians construction of the point form dynamics . 230
CONTENTS xi
7.2.3 Scattering equivalence of forms of dynamics . . . . . . 231
8 THE FOCK SPACE 237
8.1 Annihilation and creation operators . . . . . . . . . . . . . . . 238
8.1.1 Sectors with xed numbers of particles . . . . . . . . . 238
8.1.2 Non-interacting representation of the Poincare group . 240
8.1.3 Creation and annihilation operators. Fermions . . . . . 241
8.1.4 Anticommutators of particle operators . . . . . . . . . 243
8.1.5 Creation and annihilation operators. Photons . . . . . 245
8.1.6 Particle number operators . . . . . . . . . . . . . . . . 245
8.1.7 Continuous spectrum of momentum . . . . . . . . . . . 246
8.1.8 Generators of the non-interacting representation . . . . 248
8.1.9 Poincare transformations of particle operators . . . . . 250
8.2 Interaction potentials . . . . . . . . . . . . . . . . . . . . . . . 251
8.2.1 Conservation laws . . . . . . . . . . . . . . . . . . . . . 252
8.2.2 Normal ordering . . . . . . . . . . . . . . . . . . . . . 254
8.2.3 General form of interaction operators . . . . . . . . . . 255
8.2.4 Five types of regular potentials . . . . . . . . . . . . . 257
8.2.5 Products and commutators of potentials . . . . . . . . 261
8.2.6 More about t-integrals . . . . . . . . . . . . . . . . . . 264
8.2.7 Solution of one commutator equation . . . . . . . . . . 266
8.2.8 Two-particle potentials . . . . . . . . . . . . . . . . . . 267
8.2.9 Cluster separability in the Fock space . . . . . . . . . . 271
8.3 A toy model theory . . . . . . . . . . . . . . . . . . . . . . . . 274
8.3.1 Fock space and Hamiltonian . . . . . . . . . . . . . . . 274
8.3.2 Drawing a diagram in the toy model . . . . . . . . . . 276
8.3.3 Reading a diagram in the toy model . . . . . . . . . . 279
8.3.4 Electron-electron scattering . . . . . . . . . . . . . . . 280
8.3.5 Eective potential . . . . . . . . . . . . . . . . . . . . 282
8.4 Diagrams in a general theory . . . . . . . . . . . . . . . . . . . 283
8.4.1 Properties of products and commutators . . . . . . . . 283
8.4.2 Cluster separability of the S-operator . . . . . . . . . . 289
8.4.3 Divergence of loop integrals . . . . . . . . . . . . . . . 291
9 QUANTUM ELECTRODYNAMICS 295
9.1 Interaction in QED . . . . . . . . . . . . . . . . . . . . . . . . 296
9.1.1 Construction of simple quantum eld theories . . . . . 297
9.1.2 Interaction operators in QED . . . . . . . . . . . . . . 300
xii CONTENTS
9.2 S-operator in QED . . . . . . . . . . . . . . . . . . . . . . . . 302
9.2.1 S-operator in the second order . . . . . . . . . . . . . 302
9.2.2 Lorentz invariance of the S-operator . . . . . . . . . . 308
9.2.3 S
2
in Feynman-Dyson perturbation theory . . . . . . . 310
9.2.4 Feynman diagrams . . . . . . . . . . . . . . . . . . . . 314
9.2.5 Virtual particles? . . . . . . . . . . . . . . . . . . . . . 318
10 RENORMALIZATION 321
10.1 Renormalization conditions . . . . . . . . . . . . . . . . . . . . 322
10.1.1 Regularization . . . . . . . . . . . . . . . . . . . . . . . 322
10.1.2 No-self-scattering renormalization condition . . . . . . 324
10.1.3 Charge renormalization condition . . . . . . . . . . . . 327
10.1.4 Renormalization in Feynman-Dyson theory . . . . . . . 328
10.2 Counterterms . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
10.2.1 Electron self-scattering . . . . . . . . . . . . . . . . . . 329
10.2.2 Electron self-energy counterterm . . . . . . . . . . . . . 331
10.2.3 Photon self-scattering . . . . . . . . . . . . . . . . . . . 334
10.2.4 Photon self-energy counterterm . . . . . . . . . . . . . 335
10.2.5 Charge renormalization . . . . . . . . . . . . . . . . . . 337
10.2.6 Vertex renormalization . . . . . . . . . . . . . . . . . . 339
10.3 Renormalized S-matrix . . . . . . . . . . . . . . . . . . . . . . 341
10.3.1 Diagrams 10.1(d) + (k) . . . . . . . . . . . . . . . . . 342
10.3.2 Diagrams 10.1(e) + (h) . . . . . . . . . . . . . . . . . . 342
10.3.3 Diagram 10.1(f) . . . . . . . . . . . . . . . . . . . . . . 345
10.3.4 Diagram 10.1(g) . . . . . . . . . . . . . . . . . . . . . . 350
10.3.5 Renormalizability . . . . . . . . . . . . . . . . . . . . . 353
10.3.6 On the origins of QED interaction . . . . . . . . . . . . 354
II QUANTUM THEORY OF PARTICLES 357
11 DRESSED PARTICLE APPROACH 361
11.1 Troubles with renormalized QED . . . . . . . . . . . . . . . . 362
11.1.1 Renormalization in QED revisited . . . . . . . . . . . . 362
11.1.2 Time evolution in QED . . . . . . . . . . . . . . . . . 364
11.1.3 Unphys and renorm operators in QED . . . . . . . . . 365
11.2 Dressing transformation . . . . . . . . . . . . . . . . . . . . . 367
11.2.1 No-self-interaction condition . . . . . . . . . . . . . . . 367
CONTENTS xiii
11.2.2 Main idea of the dressed particle approach . . . . . . . 370
11.2.3 Unitary dressing transformation . . . . . . . . . . . . . 371
11.2.4 Dressing in the rst perturbation order . . . . . . . . . 373
11.2.5 Dressing in the second perturbation order . . . . . . . 374
11.2.6 Dressing in arbitrary order . . . . . . . . . . . . . . . . 376
11.2.7 Innite momentum cuto limit . . . . . . . . . . . . . 377
11.2.8 Poincare invariance of the dressed particle approach . . 379
11.3 Dressed interactions between particles . . . . . . . . . . . . . . 380
11.3.1 General properties of dressed potentials . . . . . . . . . 380
11.3.2 Energy spectrum of the dressed theory . . . . . . . . . 384
11.3.3 Comparison with other dressed particle approaches . . 385
12 COULOMB POTENTIAL AND BEYOND 387
12.1 Darwin-Breit Hamiltonian . . . . . . . . . . . . . . . . . . . . 388
12.1.1 Electron-proton potential in the momentum space . . . 388
12.1.2 Position representation . . . . . . . . . . . . . . . . . . 390
12.2 Hydrogen atom . . . . . . . . . . . . . . . . . . . . . . . . . . 393
12.2.1 Non-relativistic Schr odinger equation . . . . . . . . . . 393
12.2.2 Relativistic energy corrections (orbital) . . . . . . . . . 396
12.2.3 Relativistic energy corrections (spin-orbital) . . . . . . 400
13 DECAYS AND RADIATION 403
13.1 Unstable system at rest . . . . . . . . . . . . . . . . . . . . . . 404
13.1.1 Quantum mechanics of particle decays . . . . . . . . . 404
13.1.2 Non-interacting representation of the Poincare group . 408
13.1.3 Normalized eigenvectors of momentum . . . . . . . . . 409
13.1.4 Interacting representation of the Poincare group . . . . 410
13.1.5 Decay law . . . . . . . . . . . . . . . . . . . . . . . . . 414
13.2 Breit-Wigner formula . . . . . . . . . . . . . . . . . . . . . . . 416
13.2.1 Schr odinger equation . . . . . . . . . . . . . . . . . . . 416
13.2.2 Finding function (m) . . . . . . . . . . . . . . . . . . 421
13.2.3 Exponential decay law . . . . . . . . . . . . . . . . . . 427
13.2.4 Wave function of decay products . . . . . . . . . . . . 429
13.3 Spontaneous radiative transitions . . . . . . . . . . . . . . . . 432
13.3.1 Instability of excited atomic states . . . . . . . . . . . 432
13.3.2 Bremsstrahlung scattering amplitude . . . . . . . . . . 434
13.3.3 Perturbation Hamiltonian . . . . . . . . . . . . . . . . 438
13.3.4 Transition rate . . . . . . . . . . . . . . . . . . . . . . 441
xiv CONTENTS
13.3.5 Energy correction due to level instability . . . . . . . . 442
13.4 Decay law for moving particles . . . . . . . . . . . . . . . . . . 447
13.4.1 General formula for the decay law . . . . . . . . . . . . 447
13.4.2 Decays of states with denite momentum . . . . . . . . 449
13.4.3 Decay law in the moving reference frame . . . . . . . . 450
13.4.4 Decays of states with denite velocity . . . . . . . . . . 452
13.5 Time dilation in decays . . . . . . . . . . . . . . . . . . . . 453
13.5.1 Numerical results . . . . . . . . . . . . . . . . . . . . . 453
13.5.2 Decays caused by boosts . . . . . . . . . . . . . . . . . 455
13.5.3 Particle decays in dierent forms of dynamics . . . . . 458
13.6 Radiative corrections . . . . . . . . . . . . . . . . . . . . . . . 459
13.6.1 Fitting particle potentials to the S-matrix . . . . . . . 460
13.6.2 Product term in (13.111) . . . . . . . . . . . . . . . . . 460
13.6.3 Radiative corrections to the Coulomb potential . . . . 463
13.6.4 Lamb shift . . . . . . . . . . . . . . . . . . . . . . . . . 464
13.6.5 Electrons anomalous magnetic moment . . . . . . . . 466
14 CLASSICAL ELECTRODYNAMICS 469
14.1 Hamiltonian formulation . . . . . . . . . . . . . . . . . . . . . 469
14.1.1 Darwin-Breit Hamiltonian . . . . . . . . . . . . . . . . 471
14.1.2 Two charges . . . . . . . . . . . . . . . . . . . . . . . . 472
14.1.3 Denition of force . . . . . . . . . . . . . . . . . . . . . 474
14.1.4 Wire with current . . . . . . . . . . . . . . . . . . . . . 475
14.1.5 Charge and current loop . . . . . . . . . . . . . . . . . 479
14.1.6 Charge and spins magnetic moment . . . . . . . . . . 482
14.1.7 Two types of magnets . . . . . . . . . . . . . . . . . . 483
14.2 Experiments and paradoxes . . . . . . . . . . . . . . . . . . . 486
14.2.1 Conservation laws in Maxwells theory . . . . . . . . . 486
14.2.2 Kislev-Vaidman paradox . . . . . . . . . . . . . . . . 489
14.2.3 Trouton-Noble paradox . . . . . . . . . . . . . . . . 492
14.2.4 Longitudinal forces in conductors . . . . . . . . . . . . 494
14.3 Electromagnetic induction . . . . . . . . . . . . . . . . . . . . 494
14.3.1 Moving magnets . . . . . . . . . . . . . . . . . . . . . 495
14.3.2 Homopolar induction: non-conservative forces . . . . . 497
14.3.3 Homopolar induction: conservative forces . . . . . . . . 499
14.4 Aharonov-Bohm eect . . . . . . . . . . . . . . . . . . . . . . 502
14.4.1 Innitely long solenoids or magnets . . . . . . . . . . . 503
14.4.2 Aharonov-Bohm experiment . . . . . . . . . . . . . . . 504
CONTENTS xv
14.4.3 Toroidal magnet and moving charge . . . . . . . . . . . 507
14.5 Fast moving charge . . . . . . . . . . . . . . . . . . . . . . . . 513
14.5.1 Fast moving charge in RQD . . . . . . . . . . . . . . . 513
14.5.2 Fast moving charge in Maxwells electrodynamics . . . 517
14.5.3 Experiment at Frascati . . . . . . . . . . . . . . . . . . 519
14.5.4 Proposal for modied experiment . . . . . . . . . . . . 520
14.6 RQD vs. Maxwells electrodynamics . . . . . . . . . . . . . . . 522
14.6.1 Electromagnetic elds and interactions . . . . . . . . . 522
14.6.2 Electromagnetic elds and photons . . . . . . . . . . . 523
15 PARTICLES AND RELATIVITY 525
15.1 Localizability of particles . . . . . . . . . . . . . . . . . . . . . 526
15.1.1 Measurements of position . . . . . . . . . . . . . . . . 527
15.1.2 Localized states in a moving reference frame . . . . . . 528
15.1.3 Spreading of well-localized states . . . . . . . . . . . . 529
15.1.4 Superluminal spreading and causality . . . . . . . . . . 530
15.2 Inertial transformations in multiparticle systems . . . . . . . . 534
15.2.1 Events and observables . . . . . . . . . . . . . . . . . . 534
15.2.2 Non-interacting particles . . . . . . . . . . . . . . . . . 536
15.2.3 Lorentz transformations for non-interacting particles . 538
15.2.4 Interacting particles . . . . . . . . . . . . . . . . . . . 539
15.2.5 Time translations in interacting systems . . . . . . . . 540
15.2.6 Boost transformations in interacting systems . . . . . . 542
15.2.7 Spatial translations and rotations . . . . . . . . . . . . 543
15.2.8 Physical inequivalence of forms of dynamics . . . . . . 546
15.2.9 No interaction theorem . . . . . . . . . . . . . . . . 547
15.3 Comparison with special relativity . . . . . . . . . . . . . . . . 553
15.3.1 On derivations of Lorentz transformations . . . . . . 553
15.3.2 On experimental tests of special relativity . . . . . . . 555
15.3.3 Poincare invariance vs. manifest covariance . . . . . . . 557
15.3.4 Is time an observable? . . . . . . . . . . . . . . . . . . 558
15.3.5 Is geometry 4-dimensional? . . . . . . . . . . . . . . . 561
15.3.6 Dynamical relativity . . . . . . . . . . . . . . . . . . 562
15.4 Action-at-a-distance and causality . . . . . . . . . . . . . . . . 564
15.4.1 Retarded interactions in Maxwells theory . . . . . . . 565
15.4.2 Interaction of particles in RQD . . . . . . . . . . . . . 567
15.4.3 Does action-at-a-distance violate causality? . . . . . . . 568
15.4.4 Superluminal propagation of evanescent waves . . . . . 569
xvi CONTENTS
15.5 Are quantum elds necessary? . . . . . . . . . . . . . . . . . 572
15.5.1 Dressing transformation in a nutshell . . . . . . . . . . 572
15.5.2 What is the reason for having quantum elds? . . . . 575
15.5.3 Fields and space-time . . . . . . . . . . . . . . . . . . . 576
16 QUANTUM THEORY OF GRAVITY 579
16.1 Two-body problem . . . . . . . . . . . . . . . . . . . . . . . . 580
16.1.1 General relativity vs. quantum gravity . . . . . . . . . 580
16.1.2 Two-particle gravitational Hamiltonian . . . . . . . . . 582
16.1.3 Precession of the Mercurys perihelion . . . . . . . . . 584
16.1.4 Photons and gravity . . . . . . . . . . . . . . . . . . . 590
16.2 Principle of equivalence . . . . . . . . . . . . . . . . . . . . . . 592
16.2.1 Free fall universality . . . . . . . . . . . . . . . . . . . 592
16.2.2 Composition invariance of gravity . . . . . . . . . . . . 593
16.2.3 n-particle gravitational potentials . . . . . . . . . . . . 595
16.2.4 Gravitational red shift . . . . . . . . . . . . . . . . . . 598
16.2.5 Gravitational time dilation . . . . . . . . . . . . . . 600
16.2.6 RQD vs. general relativity . . . . . . . . . . . . . . . . 601
17 CONCLUSIONS 603
III MATHEMATICAL APPENDICES 607
A Sets, groups and vector spaces 609
A.1 Sets and mappings . . . . . . . . . . . . . . . . . . . . . . . . 609
A.2 Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 609
A.3 Vector spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 611
B The delta function and useful integrals 615
C Some lemmas for orthocomplemented lattices. 619
D Rotation group 621
D.1 Basics of the 3D space . . . . . . . . . . . . . . . . . . . . . . 621
D.2 Scalars and vectors . . . . . . . . . . . . . . . . . . . . . . . . 623
D.3 Orthogonal matrices . . . . . . . . . . . . . . . . . . . . . . . 624
D.4 Invariant tensors . . . . . . . . . . . . . . . . . . . . . . . . . 627
D.5 Vector parameterization of rotations . . . . . . . . . . . . . . 629
CONTENTS xvii
D.6 Group properties of rotations . . . . . . . . . . . . . . . . . . 632
D.7 Generators of rotations . . . . . . . . . . . . . . . . . . . . . . 634
E Lie groups and Lie algebras 637
E.1 Lie groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 637
E.2 Lie algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . 641
F Hilbert space 645
F.1 Inner product . . . . . . . . . . . . . . . . . . . . . . . . . . . 645
F.2 Orthonormal bases . . . . . . . . . . . . . . . . . . . . . . . . 646
F.3 Bra and ket vectors . . . . . . . . . . . . . . . . . . . . . . . . 647
F.4 Tensor product of Hilbert spaces . . . . . . . . . . . . . . . . . 649
F.5 Linear operators . . . . . . . . . . . . . . . . . . . . . . . . . . 649
F.6 Matrices and operators . . . . . . . . . . . . . . . . . . . . . . 651
F.7 Functions of operators . . . . . . . . . . . . . . . . . . . . . . 654
F.8 Linear operators in dierent orthonormal bases . . . . . . . . 658
F.9 Diagonalization of Hermitian and unitary matrices . . . . . . . 661
G Subspaces and projection operators 665
G.1 Projections . . . . . . . . . . . . . . . . . . . . . . . . . . . . 665
G.2 Commuting operators . . . . . . . . . . . . . . . . . . . . . . . 667
H Representations of groups 675
H.1 Unitary representations of groups . . . . . . . . . . . . . . . . 675
H.2 Stones theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 677
H.3 Heisenberg Lie algebra . . . . . . . . . . . . . . . . . . . . . . 678
H.4 Double-valued representations of the rotation group . . . . . . 679
H.5 Unitary irreducible representations of the rotation group . . . 681
H.6 Pauli matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . 683
I Special relativity 685
I.1 4-vector representation of the Lorentz group . . . . . . . . . . 685
I.2 Lorentz transformations for time and position . . . . . . . . . 690
I.3 Ban on superluminal signaling . . . . . . . . . . . . . . . . . . 691
I.4 Minkowski space-time and manifest covariance . . . . . . . . . 693
I.5 Decay of moving particles in special relativity . . . . . . . . . 695
xviii CONTENTS
J Quantum elds for fermions 697
J.1 Diracs gamma matrices . . . . . . . . . . . . . . . . . . . . . 697
J.2 Bispinor representation of the Lorentz group . . . . . . . . . . 698
J.3 Construction of the Dirac eld . . . . . . . . . . . . . . . . . . 701
J.4 Properties of factors u and v . . . . . . . . . . . . . . . . . . . 703
J.5 Explicit formulas for u and v . . . . . . . . . . . . . . . . . . . 706
J.6 Convenient notation . . . . . . . . . . . . . . . . . . . . . . . 709
J.7 Transformation laws . . . . . . . . . . . . . . . . . . . . . . . 710
J.8 Functions U
and W
. . . . . . . . . . . . . . . . . . . . . . . 713
J.9 (v/c)
2
approximation . . . . . . . . . . . . . . . . . . . . . . . 713
J.10 Anticommutation relations . . . . . . . . . . . . . . . . . . . . 716
J.11 Dirac equation . . . . . . . . . . . . . . . . . . . . . . . . . . 717
J.12 Fermion propagator . . . . . . . . . . . . . . . . . . . . . . . . 720
K Quantum eld for photons 723
K.1 Construction of the photons quantum eld . . . . . . . . . . . 723
K.2 Explicit formula for e
(p, ) . . . . . . . . . . . . . . . . . . . 724
K.3 Useful commutator . . . . . . . . . . . . . . . . . . . . . . . . 725
K.4 Equal time commutator of photon elds . . . . . . . . . . . . 727
K.5 Photon propagator . . . . . . . . . . . . . . . . . . . . . . . . 728
K.6 Poincare transformations of the photon eld . . . . . . . . . . 730
L QED interaction in terms of particle operators 735
L.1 Current density . . . . . . . . . . . . . . . . . . . . . . . . . 735
L.2 First-order interaction in QED . . . . . . . . . . . . . . . . . 739
L.3 Second-order interaction in QED . . . . . . . . . . . . . . . . 739
M Loop integrals in QED 755
M.1 4-dimensional delta function . . . . . . . . . . . . . . . . . . . 755
M.2 Feynmans trick . . . . . . . . . . . . . . . . . . . . . . . . . . 755
M.3 Some basic 4D integrals . . . . . . . . . . . . . . . . . . . . . 757
M.4 Electron self-energy integral . . . . . . . . . . . . . . . . . . . 761
M.5 Integral for the vertex renormalization . . . . . . . . . . . . . 765
M.6 Integral for the ladder diagram . . . . . . . . . . . . . . . . . 775
M.7 Integral in (13.114) . . . . . . . . . . . . . . . . . . . . . . . . 782
CONTENTS xix
N Relativistic invariance of RQD 785
N.1 Relativistic invariance of simple QFT . . . . . . . . . . . . . . 785
N.2 Relativistic invariance of QED . . . . . . . . . . . . . . . . . . 787
N.3 Relativistic invariance of classical electrodynamics . . . . . . . 794
N.4 Relativistic invariance of gravity theory . . . . . . . . . . . . . 798
O Dimensionality checks 803
xx CONTENTS
PREFACE
Looking back at theoretical physics of the 20th century, we see two monu-
mental achievements that radically changed the way we understand space,
time and matter the special theory of relativity and quantum mechanics.
These theories covered two sides of the natural world that are not normally
accessible to human senses and experience. Special relativity was designed
to work with observers and objects moving with extremely high speeds (and
high energies). Quantum mechanics was essential to describe properties of
matter at microscopic scale: nuclei, atoms, molecules, etc. The challenge
remains in the unication of these two theories, i.e., in the theoretical de-
scription of energetic microscopic particles and their interactions.
It is commonly accepted that the most promising candidate for such
an unication is the local quantum eld theory (QFT). Indeed, this theory
achieved astonishing accuracy in calculations of certain physical observables,
such as scattering cross-sections and energy spectra. In some instances, the
discrepancies between experiments and predictions of quantum electrody-
namics (QED) are less than 0.000000001%. It is dicult to nd such accu-
racy anywhere else in science! However, in spite of its success, quantum eld
theory cannot be regarded as the ultimate unication of relativity and quan-
tum mechanics. Just too many fundamental questions remain unanswered
and too many serious problems are left unsolved.
It is fair to say that everyone trying to learn QFT is struck by its detach-
ment from physically intuitive ideas and enormous complexity. A successful
physical theory is expected to have, as much as possible, real-life counter-
parts for its mathematical constructs. This is often not the case in QFT,
where such physically transparent concepts of quantum mechanics as the
Hilbert space, wave functions, particle observables, and Hamiltonian were
substituted (though not completely discarded) by more formal and obscure
notions of quantum elds, ghosts, propagators and Lagrangians. It was even
xxi
xxii PREFACE
declared that the concept of a particle is not fundamental anymore and must
be abandoned in favor of the eld description of nature:
In its mature form, the idea of quantum eld theory is that quan-
tum elds are the basic ingredients of the universe and particles
are just bundles of energy and momentum of the elds. S. Wein-
berg [Wei]
The most notorious failure of QFT is the problem of ultraviolet diver-
gences: To obtain sensible results from QFT calculations one must drop cer-
tain innite terms. Although rules for doing such tricks are well-established,
they cannot be considered a part of a mathematically sound theory. As Dirac
remarked
This is just not sensible mathematics. Sensible mathematics in-
volves neglecting a quantity when it turns out to be small not
neglecting it because it is innitely large and you do not want it!
P. A. M. Dirac
In modern QFT the problem of ultraviolet innities is not solved, it is swept
under the rug. Even if the innities in scattering amplitudes are renor-
malized, one ends up with an ill-dened Hamiltonian, which is not suitable
for describing the time evolution of states. The prevailing opinion is that
ultraviolet divergences are related to our lack of understanding of physics at
short distances. It is often argued that QFT is a low energy approximation
to some yet unknown truly fundamental theory and that in this nal theory
the small distance or high energy (ultraviolet) behaviors will be tamed some-
how. There are various guesses about what this ultimate theory may be.
Some think that future theory will reveal a non-trivial, probably discrete, or
non-commutative structure of space at distances comparable to the Planck
scale of 10
33
cm. Others hope that paradoxes will go away if we substitute
point-like particles with tiny extended objects, like strings.
Many researchers agree that the most fundamental obstacle on the way
forward is the deep contradiction between quantum theory and Einsteins
relativity theory (both special and general). In a more general sense, the
basic question is what is space and time? The answers given by Einsteins
theory of relativity and by quantum mechanics are quite dierent. In special
relativity, position and time are treated on an equal footing, both of them
xxiii
being coordinates in the 4-dimensional Minkowski space-time. However in
quantum mechanics position and time play very dierent roles. Position (as
any other physical observable) is an observable described by an Hermitian
operator, whereas time is a numerical parameter, which cannot be cast into
the operator form without contradictions.
In our book we would like to take a fresh look at these issues. Two basic
postulates of our approach are completely non-controversial. They are the
principle of relativity (= the equivalence of all inertial frames of reference)
and the laws of quantum mechanics. From the mathematical perspective,
the former postulate is embodied in the notion of the Poincare group and
the latter postulate leads to the algebra of operators in the Hilbert space.
When combined, these two statements inevitably imply the idea of unitary
representations of the Poincare group in the Hilbert space as the major math-
ematical tool for the description of any isolated physical system. One of our
goals is to demonstrate that observable physics ts nicely into this math-
ematical framework. We will also see that traditional theories sometimes
deviate from these postulates, which often leads to unphysical conclusions
and paradoxes. Our goal is to nd, analyze and correct these deviations.
Although the ideas presented here have rather general nature, most cal-
culations will be performed for systems of charged particles and photons and
electromagnetic forces acting between them.
2
Traditionally, these systems
were described by quantum electrodynamics (QED). However, our approach
will lead us to a dierent theory, which we call relativistic quantum dynamics
or RQD. Our approach is exactly equivalent to the renormalized QED as
long as properties related to the S-matrix (scattering cross-sections, lifetimes,
energies of bound states, etc.) are concerned. However, dierent results are
expected for the time evolution and boost transformations in interacting sys-
tems.
RQD diers from the traditional approach in two important aspects: the
recognition of the dynamical character of boosts and the primary role of
particles rather than elds.
The dynamical character of boosts. Lorentz transformations for
space-time coordinates of events are the most fundamental relationships in
Einsteins special relativity. These formulas are usually derived for simple
events associated either with light beams or with free (non-interacting) par-
ticles. Nevertheless, special relativity tacitly assumes that these Lorentz
2
Gravitational forces will be discussed in chapter 16.
xxiv PREFACE
formulas can be extended to all events with interacting particles regardless
of the interaction strength. We will show that this assumption is actually
wrong. We will derive boost transformations of particle observables by using
Wigners theory of unitary representations of the Poincare group [Wig39]
and Diracs approach to relativistic interactions [Dir49]. It will then follow
that boost transformations should be interaction-dependent. Usual universal
Lorentz transformations of special relativity are thus only approximations.
The Minkowski 4-dimensional space-time is an approximate concept as well.
Particles rather than elds. Presently accepted quantum eld theo-
ries (e.g., the renormalized QED) have serious diculties in describing the
time evolution of even simplest systems, such as vacuum and single-particle
states. Direct application of the QED time evolution operator to these states
leads to spontaneous creation of extra (virtual) particles, which have not been
observed in experiments. The problem is that bare particles of QED have
rather remote relationship to physically observed electrons, positrons, etc.
and the rules connecting bare and physical particles are not well established.
We solve this problem by using the dressed particle formalism, which is the
cornerstone of our RQD approach. The dressed Hamiltonian of RQD is ob-
tained by applying a unitary dressing transformation to the traditional QED
Hamiltonian. This transformation does not change the S-operator of QED,
therefore the perfect agreement with experimental data is preserved. The
RQD Hamiltonian describes electromagnetic phenomena in terms of directly
interacting physical particles (electrons, photons, etc.) without reference to
spurious bare and virtual particles. Quantum elds play only an auxiliary
role. In addition to accurate scattering amplitudes, our approach allows us
to obtain the time evolution of interacting particles and oers a rigorous way
to nd both energies and wave functions of bound states. All calculations
with the RQD Hamiltonian can be done by using standard recipes of quan-
tum mechanics without encountering embarrassing ultraviolet divergences
and without the need for articial cutos, regularization, renormalization,
etc.
Of course, the idea of particles with action-at-a-distance forces is not new.
The original Newtonian theory of gravity had this form and (quasi-)particle
approaches are often used in modern theories. However, the consensus opin-
ion is that such approaches can be only approximate, in particular, because
instantaneous interactions are believed to violate important principles of rel-
ativistic invariance and causality. Textbooks tell us that these important
xxv
principles can be reconciled with quantum postulates only in a theory based
on local (quantum) elds.
In this book we are going to challenge this consensus and demonstrate
that the particle picture and action-at-a-distance do not contradict relativity
and causality.
Our central message can be summarized in few sentences
The physical world is composed of point-like particles.
They obey laws of quantum mechanics and interact with
each other via instantaneous action-at-a-distance po-
tentials, which depend on distances between the par-
ticles and on their momenta. These potentials may
lead to the creation and annihilation of particles as
well. This picture is in full agreement with princi-
ples of relativity and causality. In order to establish
this agreement one should recognize that boost transfor-
mations of particle observables depend on interactions
acting in the system. Thus special-relativistic formu-
las for Lorentz transformations are approximate. Ex-
act relativistic theories of interacting particles should
be formulated without reference to the unphysical 4D
Minkowski space-time.
This book is divided into three parts. Part I: QUANTUM ELEC-
TRODYNAMICS comprises ten chapters 1 - 10. In this part we avoid
controversial issues and stick to traditionally accepted views on relativistic
quantum theories, such as QFT. We specify our basic assumptions, notation
and terminology while trying to follow a logical path starting from basic
postulates of relativity and probability and culminating in calculation of the
renormalized S-matrix in QED. The purpose of Part I is to set the stage for
introducing our non-traditional particle-based approach in the second part
of the book.
In chapter 1 Quantum mechanics the basic laws of quantum mechanics
are derived from simple axioms of measurements (quantum logic). In chapter
2 Poincare group we introduce the Poincare group as a set of transforma-
tions that relate dierent (but equivalent) inertial reference frames. Chapter
3 Quantum mechanics and relativity unies the two above pieces and
xxvi PREFACE
establishes unitary representations of the Poincare group in the Hilbert space
of states as the most general mathematical description of any isolated phys-
ical system. In the next two chapters 4 and 5 we explore some immediate
consequences of this formalism. In chapter 4 Operators of observables we
nd the correspondence between known physical observables (such as mass,
energy, momentum, spin, position, etc.) and concrete Hermitian operators in
the Hilbert space. In chapter 5 Single particles we show how Wigners the-
ory of irreducible representations of the Poincare group provides a complete
description of basic properties and dynamics of isolated stable elementary
particles. The next important step is to consider multi-particle systems. In
chapter 6 Interaction we discuss major properties of relativistically invari-
ant interactions in such systems. In particular, chapter 7 Scattering is
devoted to quantum-mechanical description of particle collisions. In chapter
8 Fock space we extend these discussions to the general class of systems
in which particles can be created and annihilated and their numbers are
not conserved. In chapter 9 Quantum electrodynamics we apply all the
above ideas to the description of systems of charged particles and photons
in the formalism of QED. Chapter 10 Renormalization concludes this rst
traditional part of the book. This chapter discusses the appearance of ul-
traviolet divergences in QED and oers a way of their elimination by means
of counterterms in the Hamiltonian.
Part II of the book QUANTUM THEORY OF PARTICLES (chap-
ters 11 - 17) examines the new particle-based RQD approach, its connection
to the basic QED from part I and its advantages. Our goal is to dispel
the traditional prejudice against using particle interpretation in relativistic
quantum theories. We show that the view of the world as consisting of point
particles interacting via instantaneous direct potentials is not contradictory
and is capable to describe and explain physical phenomena just as well - or
even better - as the mainstream eld-based view.
This non-traditional part of the book begins with chapter 11 Dressed
particle approach, which provides a deeper analysis of renormalization
and the bare particle picture in quantum eld theories. The main ideas of
our particle-based approach are formulated here and QED is being rewritten
in terms of creation and annihilation operators of physical particles, rather
than bare quantum elds. In chapter 12 Coulomb potential and be-
yond we derive the dressed interaction between charged particles and use it
to calculate the spectrum of the hydrogen atom. In chapter 15 Particles
and relativity we discuss real and imaginary paradoxes usually associated
xxvii
with quantum relativistic descriptions in terms of particles. In particular, we
discuss the superluminal spreading of localized wave packets and the Currie-
Jordan-Sudarshan no interaction theorem. We show that superluminality
and action-at-a-distance can coexist with causality if Lorentz transforma-
tions are properly interpreted. Having established the fundamentals of our
theory, we are now ready to consider some concrete examples and compare
them with experiments. In chapter 14 Classical electrodynamics we show
that classical electromagnetic theory can be reformulated as a Hamiltonian
theory of charged particles with action-at-a-distance forces. These forces de-
pend not only on the distance between the charges, but also on their velocities
and spins. In this formulation, electromagnetic elds and potentials are not
present at all and Maxwells equations do not play any role. This allows us
to resolve a number of theoretical paradoxes and, at the same time, remain
in agreement with experimental data. Even the famous Aharonov-Bohm ex-
periment gets its explanation as an eect of inter-particle interactions on the
phases of quantum wave packets - i.e., without any involvement of electro-
magnetic potentials and non-trivial space topology. In chapter 13 Decays
and radiation we are interested in a rigorous description of unstable quan-
tum systems. The special focus is on decays of fast moving particles. Here
we show that the usual Einsteins time dilation formula cannot be rigorously
applied to such phenomena. In principle, it should be possible to observe
deviations from this formula in experiments, but, unfortunately, the required
precision cannot be reached with the currently available technology. In this
chapter we also discuss infrared divergences (and their cancelation) in inter-
actions of dressed particles at higher perturbation orders. In particular, we
calculate the electrons anomalous magnetic moment and the Lamb shifts of
atomic energy levels.
The last chapter 16 (Quantum theory of gravity) of this book is not
directly related to our main theme - quantum relativistic description of elec-
tromagnetic interactions. In this chapter we intend to demonstrate that RQD
ideas can be successfully used in gravitational physics as well. We describe
gravity in terms of instantaneous particle potentials, rejecting the idea of
4-dimensional space-time and its curvature. Perhaps the most important
feature of this approach is its perfect compatibility with postulates of quan-
tum mechanics. The nal small chapter 17 Conclusions summarizes major
results and conclusions of this work and briey mentions possible directions
for future investigations.
Some useful mathematical facts and more technical derivations are col-
xxviii PREFACE
lected in the Part III: MATHEMATICAL APPENDICES.
Remarkably, the development of the new particle-based RQD approach
did not require introduction of radically new physical ideas. Actually, all
key ingredients of this study were formulated a long time ago, but for some
reason they have not attracted the attention they deserve. For example, the
fact that either translations or rotations or boosts must have dynamical de-
pendence on interactions was rst established in Diracs work [Dir49]. These
ideas were further developed in direct interaction theories by Bakamjian
and Thomas [BT53], Foldy [Fol61], Sokolov [Sok75, SS78], Coester and Poly-
zou [CP82] and many others. The primary role of particles in formulation
of quantum eld theories was emphasized in an excellent book by Weinberg
[Wei95]. The dressed particle approach was advocated by Greenberg and
Schweber [GS58]. First indications that this approach can solve the prob-
lem of ultraviolet divergences in QFT are contained in papers by Ruijgrok
[Rui59], Shirokov and Visinesku [Shi72, VS74]. The formulation of RQD
presented in this book just combined all these good ideas into one compre-
hensive approach, which, we believe, is a step toward a consistent unication
of quantum mechanics and the principle of relativity.
In this book we are using the Heaviside-Lorentz system of units
3
in which
the Coulomb law has the form V = q
1
q
2
/(4r) and the proton charge has
the value of e = 2
4.803 10
10
statcoulomb. The speed of light is
c = 2.998 10
10
cm/s; the Planck constant is = 1.054 10
27
erg s, so the
ne structure constant is e
2
/(4c) 1/137.
The new material contained in this book was partially covered in nine
articles [Ste01, Ste96, Ste02, Steb, Ste06b, Stea, Ste06a].
I would like to express my gratitude to Peter Enders, Theo Ruijgrok and
Boris Zapol for reading this book and making valuable critical comments
and suggestions. I also would like to thank Harvey R. Brown, Rainer Grobe,
William Klink, Vladimir Korda, Chris Oakley, Federico Piazza, Wayne Poly-
zou, Alexander Shebeko and Mikhail Shirokov for enlightening conversa-
tions as well as Bilge, Bernard Chaverondier, Wolfgang Engelhardt, Juan R.
Gonzalez-
by ten parameters
, v, r, t.
These parameters include the time dierence t between the clocks of O and
O
, the position vector r that connects the origin of O with the origin of O
,
the rotation angle
10
that relates orientations of axes in O
to orientations
of axes in O and the velocity v of O
relative to O.
It is convenient to introduce the notion of inertial transformations of
observers and laboratories. Transformations of this kind include
rotations,
9
Of course, one can decide to consider the laboratory clock as a physical system and
perform physical measurements on it. For example, one can investigate the quantum
uncertainty in the clocks arm position. However, then this particular clock is no longer
suitable for measuring time. Some other device must be used for time-keeping purposes.
This is similar to our insistence on the separation between the physical system and the
measuring apparatus. See footnote on page xxxiii.
10
The vector parameterization of rotations is discussed in Appendix D.5.
xxxvi INTRODUCTION
space translations,
time translations,
changes of velocity or boosts.
There are three independent rotations (around x, y and z axes), three inde-
pendent translations and three independent boosts. So, along with the time
translations that makes 10 basic types of inertial transformations. More gen-
eral inertial transformations can be made by performing two (or more) basic
transformations in succession. We will postulate that for any pair of inertial
observers O and O
= gO.
In chapter 2 we will make an important observation that transformations g
form a group.
An important comment should be made about the denition of observer
used in this book. Usually, an observer is understood as a person (or a mea-
suring apparatus) that exists and performs measurements for innitely long
time. For example, it is common to discus the time evolution of a physi-
cal system from the point of view of this or that observer. However, this
colloquial denition does not t our purposes. The problem with this def-
inition is that it singles out time translations as being dierent from space
translations, rotations and boosts. In this approach time translations be-
come associated with the observer herself rather than being treated equally
with other inertial transformations between observers. The central idea of
our approach to relativity is to treat all ten types of inertial transformations
on equal footing. Therefore, we will use a slightly dierent denition of ob-
server. In our denition observers are short-living. They exist and perform
measurements in a short time interval and they can see only a snapshot of
the world around them. Individual observers cannot see the time evolution
of a physical system. In our approach the time evolution is described as a
succession of measurements performed by a series of instantaneous observers
related to each other by time translations. Then the colloquial observer is
actually a continuous sequence of our short-living observers displaced in
time with respect to each other.
The goals of physics. It is important to underscore that in our quest
we are not looking for some master equation or theory of everything.
xxxvii
We are interested in a more modest task: description of observations made
by observers on real physical systems. So, as much as possible, we will try to
avoid speculations about features that cannot be directly observed (such as,
the microscopic structure of the space-time, vacuum energy, virtual particles,
etc.) Even if these features had good theoretical sense, their non-observability
would make it impossible to verify their properties in experiment and, there-
fore, places them outside good physical theory.
One of the most important tasks of physics is to establish the relation-
ship between measurements performed by two dierent observers on the same
physical system. These relationships will be referred to as inertial transfor-
mations of observables. In particular, if values of some observables measured
by O are known, and the inertial transformation connecting O with O
is
known as well, then we should be able to gure out the values of those ob-
servables from the point of view of O
on the photographic
plate is created only by light rays emitted from point A and passed straight
through the hole.
essence of this centuries-old debate on an example of a thought experiment
with pinhole camera.
1.1.1 Corpuscular theory of light
You probably saw or heard about a simple device called camera obscura or
pinhole camera. You can easily make this device yourself: Take a light-tight
box, put a photographic plate inside the box and make a small hole in the
center of the side opposite to the photographic plate (see Fig. 1.1). The light
passing through the hole inside the box creates a sharp inverted image on the
photographic plate. You will get even sharper image by decreasing the size
of the hole, though the brightness of the image will become lower, of course.
This behavior of light was well known for centuries (a drawing of the camera
obscura is present in Leonardo da Vincis papers). One of the earliest scien-
tic explanations of this and other properties of light (reection, refraction,
etc.) was suggested by Newton. In modern language, his corpuscular theory
would explain the formation of the image like this:
Corpuscular theory: Light is a ow of tiny particles (photons)
propagating along straight classical trajectories (light rays). Each
particle in the ray carries a certain amount of energy which gets
6 CHAPTER 1. QUANTUM MECHANICS
AA
BB
AA BB
(a)
(b)
Figure 1.2: (a) Image in the pinhole camera with a very small aperture; (b)
the density of the image along the line AB
released upon impact in a very small volume corresponding to
one grain of the photographic emulsion and produces a small dot
image. When intensity of the source is high, there are so many
particles, that we cannot distinguish their individual dots. All
these dots merge into one continuous image and the intensity of
the image is proportional to the number of particles hitting the
photographic plate during the exposure time.
Let us continue our experiment with the pinhole camera and decrease the
size of the hole even further. The corpuscular theory would insist that the
smaller size of the hole must result in a sharper image. However this is not
what experiment shows! For a very small hole the image on the photographic
plate will be blurred. If we further decrease the size of the hole, the detailed
picture will completely disappear and the image will look like one large diuse
spot (see Fig. 1.2), independent on the form and shape of the light source
outside the camera. It appears as if light rays scatter in all directions when
they pass through a small aperture or near a small object. This eect of the
light spreading is called diraction and it was discovered by Grimaldi in the
middle of the 17th century.
Diraction is rather dicult to reconcile with the corpuscular theory. For
example, we can try to save this theory by assuming that light rays deviate
from their straight paths due to some interaction with the box material sur-
1.1. WHY DO WE NEED QUANTUM MECHANICS? 7
(a)
(b)
LL RR
L+R
Figure 1.3: (a) The density of the image in a two-hole camera according
to nave corpuscular theory is a superposition of images created by the left
(L) and right (R) holes; (b) Actual interference picture: In some places the
density of the image is higher than L+R (constructive interference); in other
places the density is lower than L+R (destructive interference).
rounding the hole. However this is not a satisfactory explanation, because
one can easily establish by experiment that the shape of the diraction pic-
ture is completely independent on the type of material used to make the
walls of the pinhole camera. The most striking evidence of the fallacy of
the nave corpuscular theory is the eect of light interference discovered by
Young in 1802 [You04]. To observe the interference we can slightly modify
our pinhole camera by making two small holes close to each other, so that
their diraction spots on the photographic plate overlap. We already know
that when we leave the left hole open and close the right hole we get a diuse
spot L (see Fig. 1.3(a)). When we leave open the right hole and close the
left hole we get another spot R. Let us try to predict what kind of image we
will get if both holes are opened.
Following the corpuscular theory and simple logic we might conclude
that particles reaching the photographic plate are of two sorts: those passed
through the left hole and those passed through the right hole. When the two
holes are opened at the same time, the density of the left hole particles
should add to the density of the right hole particles and the resulting
image should be a superposition L+R of the two images (full line in Fig.
8 CHAPTER 1. QUANTUM MECHANICS
1.3(a)). Right? Wrong! This seemingly reasonable explanation disagrees
with experiment. The actual image has new features (brighter and darker
regions) called the interference picture (full line in Fig. 1.3(b)).
Can the corpuscular theory explain this strange interference pattern? We
could assume, for example, that some kind of interaction between light cor-
puscles is responsible for the interference, so that passages of dierent par-
ticles through left and right holes are not independent events and the law of
addition of probabilities does not hold for them. However, this idea must be
rejected because, as we will see later, the interference picture persists even
if photons are released one-by-one, so that they cannot interact with each
other.
1.1.2 Wave theory of light
The inability to explain such basic eects of light propagation as diraction
and interference was a major embarrassment for the Newtonian corpuscular
theory. These eects as well as all other properties of light known before
quantum era (reection, refraction, polarization, etc.) were brilliantly ex-
plained by the wave theory of light advanced by Grimaldi, Huygens, Young,
Fresnel and others. The wave theory gradually replaced Newtonian corpus-
cles in the course of the 19th century. The idea of light as a wave found its
strongest support from Maxwells electromagnetic theory which unied op-
tics with electric and magnetic phenomena. Maxwell explained that the light
wave is actually an oscillating eld of electric E(x, t) and magnetic B(x, t)
vectors a sinusoidal wave propagating with the speed of light. According to
the Maxwells theory, the energy of the wave and consequently the intensity
of light I, is proportional to the square of the amplitude of the eld vector
oscillations, e.g., I E
2
. Then formation of the photographic image can be
explained as follows:
Wave theory: Light is a continuous wave or eld propagat-
ing in space in an undulatory fashion. When the light wave
meets molecules of the photo-emulsion, the charged parts of the
molecules start to oscillate under the inuence of the lights elec-
tric and magnetic eld vectors. The portions of the photographic
plate with higher eld amplitudes have more violent molecular
oscillations and higher image densities.
1.1. WHY DO WE NEED QUANTUM MECHANICS? 9
This provides a natural explanation for both diraction and interference:
Diraction simply means that light waves can deviate from straight paths and
go around corners, just like sound waves do.
1
To explain the interference, we
just need to note that when two portions of the wave pass through dierent
holes and meet on the photographic plate, their electric vectors add up.
However intensities of the waves are not additive: I (E
1
+E
2
)
2
= E
2
1
+2E
1
E
2
+E
2
2
,= E
2
1
+E
2
2
I
1
+I
2
. It follows from simple geometric considerations
that in the two-hole conguration there are places on the photographic plate
where the two waves always come in phase (E
1
(t) E
2
(t) and E
1
E
2
> 0,
which means constructive interference) and there are other places where the
two waves always come with opposite phases (E
1
(t) E
2
(t) and E
1
E
2
< 0,
i.e., destructive interference).
1.1.3 Low intensity light and other experiments
In the 19th century physics, the wave-particle debate was decided in favor
of the wave theory. However, further experimental evidence showed that the
victory was declared prematurely. To see what goes wrong with the wave
theory, let us continue our thought experiment with the interference picture
created by two holes and gradually tune down the intensity of the light source.
At rst, nothing interesting will happen: we will see that the density of the
image predictably decreases. However, after some point we will recognize that
the photographic image is not uniform and continuous as before. It consists
of small blackened dots as if some grains of photo-emulsion were exposed to
light and some not. This observation is very dicult to reconcile with the
wave theory. How a continuous wave can produce this dotty image? However
this is exactly what the corpuscular theory would predict. Apparently the
dots are created by particles hitting the photographic plate one-at-a-time.
A number of other eects were discovered at the end of the 19th century
and in the beginning of the 20th century, which could not be explained by
the wave theory of light. One of them was the photo-electric eect: It was
observed that when the light is shined on a piece of metal, electrons can
escape from the metal into the vacuum. This observation was not surprising
by itself. However it was rather puzzling how the number of emitted electrons
depended on the frequency and intensity of the incident light. It was found
1
Wavelengths corresponding to the visible light are between 0.4 micron for the violet
light and 0.7 micron for the red light. So for large obstacles or holes, the deviations from
the straight path are very small and the corpuscular theory of light works reasonably well.
10 CHAPTER 1. QUANTUM MECHANICS
that only light waves with frequencies above some threshold
0
were capable
of knocking out electrons from the metal. Radiation with frequency below
0
could not produce the electron emission even if the light intensity was
high. According to the wave theory explanation above, one could assume
that the electrons are knocked out of the metal due to some kind of force
exerted on them by electromagnetic vectors E, B in the wave. A larger light
intensity (= larger wave amplitude = higher values of E and B) naturally
means a larger force and a larger chance of the electron emission. Then why
the low frequency but high intensity light could not do the job?
In 1905 Einstein explained the photo-electric eect by bringing back New-
tonian corpuscles in the form of light quanta later called photons. He
described the light as consisting of nite number of energy quanta which
are localized at points in space, which move without dividing and which can
only be produced and absorbed as complete units [AP65]. According to the
Einsteins explanation, each photon carries the energy of , where is the
frequency
2
of the light wave and is the Planck constant. Each photon has
a chance to collide with and pass its energy to just one electron in the metal.
Only high energy photons (those corresponding to high frequency light) are
capable of passing enough energy to the electron to overcome certain energy
barrier
3
E
b
between the metal and the vacuum. Low-frequency light has
photons with low energy < E
b
. Then, no matter what is the amplitude
(= the number of photons) of such light, its photons are just too weak
to kick the electrons with sucient energy.
4
In the Comptons experiment
(1923) the interaction of light with electrons could be studied with much
greater detail. And indeed, this interaction more resembled a collision of two
particles rather than shaking of the electron by a periodic electromagnetic
wave.
These observations clearly lead us to the conclusion that light is a ow of
particles after all. But what about the interference? We already agreed that
the corpuscular theory had no logical explanation of this eect.
For example, in an interference experiment conducted by Taylor in 1909
[Tay09], the intensity of light was so low that no more than one photon was
2
is the so-called circular frequency (measured in radians per second) which is related
to the usual frequency (measured in oscillations per second) by the formula = 2.
3
The barriers energy is roughly proportional to the threshold frequency E
b
0
.
4
Actually, the low-frequency light may lead to the electron emission when two or more
low-energy photons collide simultaneously with the same electron, but such events have
very low probability and become observable only at very high light intensities.
1.2. PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 11
present at any time instant, thus eliminating any possibility of the photon-
photon interaction and its eect on the interference picture. Another ex-
planation that the photon somehow splits and passes through both holes
and then rejoins again at the point of collision with the photographic plate
does not stand criticism as well: One photon never creates two dots on the
photographic plate, so it is unlikely that the photon can split during prop-
agation. Finally, can we assume that the particle passing through the right
hole somehow knows whether the left hole is open or closed and adjusts its
trajectory accordingly? Of course, there is some eect on the photon near
the left hole depending on whether the right hole is open or not. However
by all estimates this eect is negligibly small.
So, young quantum theory had an almost impossible task to reconcile two
apparently contradicting classes of experiments with light: Some experiments
(diraction, interference) were easily explained with the wave theory while
the corpuscular theory had serious diculties. Other experiments (photo-
electric eect, Compton scattering) could not be explained from the wave
properties and clearly showed that light consists of particles. Just adding to
the confusion, de Broglie in 1924 advanced a hypothesis that such particle-
wave duality is not specic to photons. He proposed that all particles of
matter like electrons have wave-like properties. This crazy idea was
soon conrmed by Davisson and Germer who observed the diraction and
interference of electron beams in 1927.
Certainly, in the rst quarter of the 20th century, physics faced the great-
est challenge in its history. This is how Heisenberg described the situation:
I remember discussions with Bohr which went through many hours
till very late at night and ended almost in despair; and when at
the end of the discussion I went alone for a walk in the neighbor-
ing park I repeated to myself again and again the question: Can
nature possibly be as absurd as it seemed to us in those atomic
experiments? W. Heisenberg [Hei58]
1.2 Physical foundations of quantum mechan-
ics
In this section we will try to explain the main dierence between classical
and quantum views of the world. To understand quantum mechanics, we
12 CHAPTER 1. QUANTUM MECHANICS
must accept that certain concepts, which were taken for granted in classical
physics, cannot be applied to micro-objects like photons and electrons. To see
what is dierent, we must revisit basic ideas about what is physical system,
how its states are prepared and how its observables are measured.
1.2.1 Single-hole experiment
The best way to understand the main idea of quantum mechanics is to ana-
lyze the single-hole experiment discussed in the preceding section. We have
established that in the low-intensity regime, when the source emits individ-
ual photons one-by-one, the image on the screen consists of separate dots.
We now accept this fact as a sucient evidence that light is made of small
countable localizable particles, called photons.
However, the behavior of these particles is quite dierent from the one
expected in classical physics. Classical physics is based on one tacit axiom,
which we formulate here as an Assertion
5
Assertion 1.1 (determinism) If we prepare a physical system repeatedly
in the same state and measure the same observable, then each time we will
get the same measurement result.
This obvious Assertion is violated in the single-hole experiment. Indeed,
suppose that the light source is monochromatic, so that all photons reaching
the hole have the same momentum and energy. At the moment of passing
through the hole the photons have rather well-dened x, y and z-components
of position too. This guarantees that at this moment all photons are prepared
in (almost) the same state, as required by the Assertion 1.1. We can make
this state to be dened even better by reducing the size of the aperture. Then
according to the Assertion, each photon should produce the same measure-
ment result, i.e., each photon should land at exactly the same point on the
photographic plate. However, this is not what happens in reality! The dots
made by photons are scattered all over the photographic plate. Moreover,
the smaller is the aperture the wider is the distribution of the dots. Results
5
In this book we will distinguish Postulates, Statements and Assertions. Postulates
form a foundation of our theory. In most cases they follow undoubtedly from experiments.
Statements follow logically from Postulates and we believe them to be true. Assertions are
commonly accepted in the literature, but they do not have a place in the RQD approach
developed in this book.
1.2. PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 13
of measurements are not reproducible even though preparation conditions
are tightly controlled!
Remarkably, it is not possible to nd an ordinary explanation of this
extraordinary fact. For example, one can assume that dierent photons pass-
ing through the hole are not exactly in the same conditions. What if they
interact with atoms surrounding the hole and for each passing photon the
conguration of the nearby atoms is dierent? This explanation does not
seem plausible, because one can repeat the single-hole experiment with dif-
ferent materials (paper, steel, etc.) without any visible dierence. Moreover,
the same diraction picture is observed if other particles (electrons, neutrons,
C
60
molecules, etc) are used instead of photons. It appears that there are
only two parameters, which determine the shape of the diraction spot - the
size of the aperture and the particles momentum. So, the explanation of this
shape must be rather general and should not depend on the nature of par-
ticles and the material surrounding the hole. Then we must accept a rather
striking conclusion: all these carefully prepared particles behave randomly.
It is impossible to predict which point on the screen will be hit by the next
released particle.
1.2.2 Ensembles and measurements in quantum me-
chanics
The main lesson of the single-hole experiment is that classical Assertion 1.1
is not true. Even if the system is prepared each time in the same state, we are
not going to get reproducible results in repeated measurements. Why does
this happen? The honest answer is that nobody knows. This is one of great-
est mysteries of nature. Quantum theory does not even attempt to explain
the physical origin of randomness in microsystems. This theory assumes the
randomness as given
6
and simply tries to formulate its mathematical descrip-
tion. In order to move forward, we should go beyond simple constatation of
randomness in microscopic events and introduce more precise statements and
new denitions.
We will call experiment the preparation of an ensemble (= a set of iden-
tical copies of the system prepared in the same conditions) and performing
6
In this book we claim that this randomness is absolute and fundamental; that it cannot
be explained and does not even require an explanation. In section 1.7 we will briey discuss
other approaches to this deep question.
14 CHAPTER 1. QUANTUM MECHANICS
measurements of the same observable in each member of the ensemble.
7
Suppose that we prepared an ensemble of N identical copies of the system
and measured the same observable N times. As we have established above,
we cannot say a priori that all these measurements will produce the same
results. However, it seems reasonable to assume the existence of ensembles in
which measurements of one observable can be repeated with the same result
innite number of times. Indeed, there is no point to talk about a physical
quantity, if there are no ensembles in which this quantity can be measured
with certainty. So, we begin our construction of the mathematical formalism
of quantum mechanics by introducing the following Postulate
Postulate 1.2 (partial determinism) For any observable F and any value
f from its spectrum, one can prepare an ensemble in such a state that mea-
surements of this observable are reproducible, i.e., always yield the same value
f.
Note that in classical mechanics this Postulate follows immediately from
Assertion 1.1. The Postulate itself allows us to talk only about the repro-
ducibility of just one observable in the ensemble, while classical Assertion
1.1 referred to all observables. As we will see later, quantum theory predicts
that measurements of certain groups of (compatible) observables
8
can be re-
producible in a given ensemble, but, unlike in the classical case, this cannot
be true for all observables. For example, quantum mechanics says that with
certain eorts we can prepare an ensemble of particles in such a state that
measurements of all 3 components of position yield the same result each
time, but then results for the components of momentum would be dierent
all the time. We can also prepare (another) ensemble in a state with certain
momentum, then the position will be all over the place. We cannot prepare
an ensemble in which the uncertainties of both position r and momentum
p are zero.
9
7
It is worth noting here that in this book we are not considering repeated measurements
performed on the same copy of the physical system. We will work under assumption that
after the measurement has been performed, the copy of the system is discarded. Each
measurement requires a fresh copy of the physical system. This means, in particular,
that we are not interested in the state to which the system may have collapsed after
the measurement. The description of repetitive quantum measurements is an interesting
subject, but it is beyond the scope of this book.
8
For example, three components of particles momentum (p
x
, p
y
, p
z
) are compatible
observables.
9
See discussion of the Heisenberg uncertainty relation in subsection 6.5.2.
1.3. LATTICE OF PROPOSITIONS 15
1.3 Lattice of propositions
Having described the fundamental Postulate 1.2 of quantum mechanics in
the preceding section, we now need to turn it into a working mathematical
formalism. This is the goal of the present section and next two sections.
When we say that measurements of observables in the quantum world can
be irreproducible we mean that this irreproducibility or randomness is of ba-
sic, fundamental nature. It is dierent from the randomness often observed
in the classical world,
10
which is related merely to our inability to prepare
well-controlled ensembles, incomplete knowledge of initial conditions and in-
ability to solve dynamical equations even if the initial conditions are known.
So, classical randomness is of technical rather than fundamental nature. On
the other hand, the ever-present quantum randomness cannot be reduced by
a more thorough control of preparation conditions, lowering the temperature,
etc. This means that quantum mechanics is bound to be a statistical theory
based on postulates of probability. However, this does not mean that QM
is a subset of classical statistical mechanics. Actually, the statistical theory
underlying QM is of a dierent more general non-classical kind.
We know that classical (Boolean) logic is at the core of classical proba-
bility theory. The latter theory assign probabilities (real numbers between 0
and 1) to logical propositions and tells us how these numbers are aected by
logical operations (AND, OR, NOT, etc.) with propositions. Quite similarly,
quantum probability theory is based on logic, but this time the logic is not
Boolean. In QM we are dealing with quantum logic whose postulates dier
slightly from the classical ones. So, at the most fundamental level, quan-
tum mechanics is built on two powerful ideas: the prevalence of randomness
in nature and the non-classical logical relationships between experimental
propositions.
The initial idea that the fundamental dierence between classical and
quantum mechanics lies in their dierent logical structures belongs to Birkho
and von Neumann. They suggested to substitute classical logic of Aristotle
and Boole by the quantum logic. The formalism presented in this section
summarizes their seminal work [BvN36] as well as further research most no-
tably by Mackey [Mac63] and Piron [Pir76, Pir64]. Even for those who do not
accept the necessity of such radical change in our views on logic, the study
of quantum logic may provide a desirable bridge between intuitive concepts
10
e.g., when we roll a die
16 CHAPTER 1. QUANTUM MECHANICS
of classical mechanics and abstract formalism of quantum theory.
In introductory quantum physics classes (especially in the United
States), students are informed ex cathedra that the state of a
physical system is represented by a complex-valued wave function
, that observables correspond to self-adjoint operators, that the
temporal evolution of the system is governed by a Schrodinger
equation and so on. Students are expected to accept all this uncrit-
ically, as their professors probably did before them. Any question
of why is dismissed with an appeal to authority and an injunc-
tion to wait and see how well it all works. Those students whose
curiosity precludes blind compliance with the gospel according to
Dirac and von Neumann are told that they have no feeling for
physics and that they would be better o studying mathematics or
philosophy. A happy alternative to teaching by dogma is provided
by basic quantum logic, which furnishes a sound and intellectually
satisfying background for the introduction of the standard notions
of elementary quantum mechanics. D. J. Foulis [Fou]
The main purpose of our sections 1.3 - 1.5 is to demonstrate that pos-
tulates of quantum mechanics are robust and inevitable consequences of the
laws of probability and simple properties of measurements. Basic axioms of
quantum logic which are common for both classical and quantum mechan-
ics are presented in section 1.3. The close connection between the classical
logic and the phase space formalism of classical mechanics is discussed in
section 1.4. In section 1.5 we will note a remarkable fact that the only dier-
ence between classical and quantum logics (and, thus, between classical and
quantum physics in general) is in a single obscure distributivity postulate.
This classical postulate must be replaced by the orthomodularity postulate
of quantum theory. We will also demonstrate (see section 1.6) how postulates
of quantum logic lead us (via Pirons theorem) to the standard formalism of
quantum mechanics with Hilbert spaces, Hermitian operators, wave func-
tions, etc. In section 1.7 we will briey mention some philosophical issues
surrounding the formalism and interpretation of quantum mechanics.
1.3.1 Propositions and states
There are dierent types of observables in physics: position, momentum,
spin, mass, etc. As we discussed in Introduction, we are not interested in
1.3. LATTICE OF PROPOSITIONS 17
the design of the apparatus measuring each particular observable F. We can
imagine such an apparatus as a black box that produces a real number f R
(the measured value of the observable) each time it interacts with the physical
system. So, the act of measurement can be simply described by words the
value of observable F was found to be f. However, for our purposes a
slightly dierent description of measurements will be more convenient.
With each subset
11
X of the real line R we can associate a proposition x of
the type the value of observable F is inside the subset X. If the measured
value f has been found inside the subset X, then we say that the proposition
x is true. Otherwise we say that the proposition x is false. At the rst
sight it seems that we have not gained much by this reformulation. However,
the true advantage is that propositions introduced above are key elements of
mathematical logic and probability theory. The powerful machinery of these
theories will be found very helpful in our analysis of quantum measurements.
It is also useful to note that propositions
12
can be also regarded as special
types of observables whose spectrum consists of just two points: 1 (if the
proposition is true; the answer is yes) and 0 (if the proposition is false; the
answer is no).
Propositions are not necessarily related to single observables. As we will
see later, there are sets of (compatible) observables F
1
, F
2
, . . . , F
n
that are
simultaneously measurable with arbitrary precision. For such sets of ob-
servables one can dene propositions corresponding to subsets in the (n-
dimensional) common spectrum of these observables. For example, the propo-
sition particles position is inside volume V is a statement about simulta-
neous measurements of three observables - the x, y and z components of the
position vector. Experimentally, this proposition can be realized using a
Geiger counter occupying the volume V . The counter clicks (the proposition
is true) if the particle passes through the counters chamber and does not
click (the proposition is false) if the particle is outside of V .
In what follows we will denote by / the set of all propositions
13
about the
physical system. The set of all possible states of the system will be denoted
by . Our goal in this chapter is to study the mathematical relationships
between elements x / and in these two sets.
The above discussion referred to a single measurement performed on one
11
Subsets X are not necessarily limited to contiguous intervals in R. All results remain
valid for more complex subsets of R, such as unions of any number of disjoint intervals.
12
sometimes they are also called yes-no experiments.
13
/ also called the propositional system.
18 CHAPTER 1. QUANTUM MECHANICS
copy of the physical system. Let us now prepare multiple copies (an ensem-
ble) of the system and perform measurements of the same proposition in all
copies. As we discussed earlier, there is no guarantee that the results of all
these measurements will be the same. So, for some members in the ensemble
the proposition x will be found true, while for other members it will be
false, even if every eort is made to ensure that the state of the system is
exactly the same in all cases.
Using results of measurements as described above, we can introduce a
function ([x) called the probability measure which assigns to each state
and to each proposition x the probability of x being true in the state . The
value of this function (a real number between 0 and 1) is obtained by the
following recipe:
(i) prepare a copy of the system in the state ;
(ii) determine whether x is true or false;
(iii) repeat steps (i) and (ii) N times, then
([x) = lim
N
M
N
where M is the number of times when the proposition x was found to be
true.
Two states and of the same system are said to be equal ( = ) if
for any proposition x we have
([x) = ([x)
Indeed, there is no physical dierence between these two states as all experi-
ments yield the same results (probabilities). For the same reason we will say
that two propositions x and y are identical (x = y) if for all states of the
system
([x) = ([y) (1.1)
It follows from the above discussion that the probability measure ([x) con-
sidered as a function on the set of all states is a unique representative of
1.3. LATTICE OF PROPOSITIONS 19
proposition x (dierent propositions have dierent representatives). So, we
can gain some insight into the properties of dierent propositions by studying
properties of corresponding probability measures.
There are propositions which are always true independent on the state
of the system. For example, the proposition the value of the observable F
1
is somewhere on the real line is always true.
14
For any other observable
F
2
, the proposition the value of the observable F
2
is somewhere on the real
line is also true for all states. Therefore, according to (1.1), we will say that
these two propositions are identical. So, we can dene a unique maximal
proposition J / which is always true. Inversely, the proposition the value
of observable is not on the real line is always false and will be called the
minimal proposition. There is just one minimal proposition in the set /,
and for each state we can write
([J) = 1 (1.2)
([) = 0 (1.3)
1.3.2 Partial ordering
Suppose that we found two propositions x and y, such that their measures
satisfy ([x) ([y) for all states in the set . Then we will say that
proposition x is less than or equal to proposition y and denote this relation by
x y. The meaning of this relation is obvious when x and y are propositions
about the same observable and X and Y are their corresponding subsets in
R. Then x y if the subset X is inside the subset Y , i.e., X Y . In this
case, the relation x y is associated with logical implication, i.e., if x is true
then y is denitely true as well; x IMPLIES y; or IF x THEN y. If x y
and x ,= y we will say that x is less than y and denote this relationship by
x < y.
The implication relation has three obvious properties.
Lemma 1.3 (reectivity) Any proposition implies itself: x x.
Proof. This follows from the fact that for any it is true that ([x) ([x).
14
Measurements of observables always yield some value, since we agreed in Introduction
that an ideal measuring apparatus never misres.
20 CHAPTER 1. QUANTUM MECHANICS
Lemma 1.4 (symmetry) If two propositions imply each other, then they
are equal: If x y and y x then x = y.
Proof. If two propositions x and y are less than or equal to each other, then
([x) ([y) and ([y) ([x) for each state , which implies ([x) = ([y)
and, according to (1.1), x = y.
Lemma 1.5 (transitivity) If x y and y z, then x z.
Proof. If x y and y z, then ([x) ([y) ([z) for every state .
Consequently, ([x) ([z) for each state and x z.
Properties 1.3, 1.4 and 1.5 tell us that is a partial ordering relation.
It is ordering because it tells which proposition is smaller and which is
larger. The ordering is partial, because it doesnt apply to all pairs of
propositions.
15
Thus, the set / of all propositions is a partially ordered set.
From equations (1.2) and (1.3) we also conclude that
Postulate 1.6 (denition of J) x J for any x /.
Postulate 1.7 (denition of ) x for any x /.
1.3.3 Meet
For two propositions x and y, suppose that we found a third proposition z
such that
z x (1.4)
z y (1.5)
There could be more than one proposition satisfying these properties. We
will assume that we can always nd one maximal proposition z in this set.
This maximal proposition will be called a meet of x and y and denoted by
x y.
The existence of a unique meet is obvious in the case when x and y are
propositions about the same observable, such that they correspond to two
15
There could be propositions x and y, such that for some states ([x) > ([y), while
for other states ([x) < ([y).
1.3. LATTICE OF PROPOSITIONS 21
subsets of the real line R: X and Y , respectively. Then the meet z = x y
is a proposition corresponding to the intersection of these two subsets Z =
X Y .
16
In this one-dimensional case the operation meet can be identied
with the logical operation AND: the proposition xy is true only when both
x AND y are true.
The above denition of meet can be formalized as
Postulate 1.8 (denition of ) x y x and x y y for all x and y.
Postulate 1.9 (denition of ) If z x and z y then z x y.
It seems reasonable to assume that the order in which meet operations are
performed is not relevant
Postulate 1.10 (commutativity of ) x y = y x.
Postulate 1.11 (associativity of ) (x y) z = x (y z).
1.3.4 Join
Similar to our discussion of meet, we can assume that for any two propositions
x and y there always exists a unique join x y, such that both x and y are
less or equal than x y, and x y is the minimal proposition with such a
property.
In the case of propositions about the same observable, the join of x and
y is a proposition z = x y whose associated subset Z of the real line is a
union of the subsets corresponding to x and y: Z = X Y . The proposition
z is true when either x OR y is true. So, the join can be identied with the
logical operation OR.
17
The formal version of the above denition of join is
Postulate 1.12 (denition of ) x x y and y x y.
Postulate 1.13 (denition of ) If x z and y z then x y z.
16
If X and Y do not intersect, then z = .
17
It is important to note that from x y being true it does not necessarily follow that
either x or y are denitely true. There is a signicant dierence between interpretations
of the logical operation OR in quantum and classical logics. We will discuss this dierence
in subsection 1.6.2.
22 CHAPTER 1. QUANTUM MECHANICS
Similar to Lemmas 1.10 and 1.11 we see that the order of join operations
is irrelevant
Postulate 1.14 (commutativity of ) x y = y x.
Postulate 1.15 (associativity of ) (x y) z = x (y z).
The properties of propositions listed so far (partial ordering, meet and
join) mean that the set of propositions / is what mathematicians call a
complete lattice.
1.3.5 Orthocomplement
There is one more operation on the set of propositions that we need to con-
sider. This operation is called orthocomplement. has the meaning of the
logical negation (operation NOT). For any proposition x its orthocomple-
ment is denoted by x
is zero.
Inversely, if x is false then x
is necessarily true.
More formally, the orthocomplement x
) = 1 ([x) (1.6)
Lemma 1.16 (non-contradiction) x x
= .
Proof. Let us prove this Lemma in the case when x is a proposition about
one observable F. Suppose that xx
1 = ([y) ([x)
1 = ([y) ([x
)
1.3. LATTICE OF PROPOSITIONS 23
It then follows that ([x) = 1 and ([x
= .
Lemma 1.17 (double negation) (x
= x.
Proof. From equation (1.6) we can write for any state
([(x
) = 1 ([x
) = 1 (1 ([x)) = ([x)
Lemma 1.18 (contraposition) If x y then y
.
Proof. If x y then ([x) ([y) and (1 ([x)) (1 ([y)) for all
states . But according to our denition (1.6), the two sides of the latter
inequality are probability measures for propositions x
and y
, respectively,
which proves the Lemma.
Propositions x and y are said to be disjoint if x y
or, equivalently,
y x
.
When x and y are disjoint propositions about the same observable, their
corresponding subsets do not intersect: X Y = . For such mutually
exclusive propositions the probability of either x OR y being true (i.e., the
probability corresponding to the proposition xy) is the sum of probabilities
for x and y. It seems natural to generalize this property to all pairs of disjoint
propositions
Postulate 1.19 (probabilities for mutually exclusive propositions) If
x and y are disjoint, then for any state
([x y) = ([x) + ([y)
The following Lemma establishes that the join of any proposition x with its
orthocomplement x
= J.
24 CHAPTER 1. QUANTUM MECHANICS
Proof. From Lemmas 1.3 and 1.17 it follows that x x = (x
and that
propositions x and x
) = ([x) + ([x
) = ([x) + (1 ([x)) = 1
which proves the Lemma.
Adding the orthocomplement to the properties of the propositional sys-
tem (complete lattice) /, we conclude that / is an orthocomplemented lattice.
Axioms of orthocomplemented lattices are collected in the upper part of Ta-
ble 1.1 for easy reference.
1.3. LATTICE OF PROPOSITIONS 25
Table 1.1: Axioms of quantum logic
Property Postulate/Lemma
Axioms of orthocomplemented lattices
Reectivity 1.3 x x
Symmetry 1.4 (x y) & (y x) x = y
Transitivity 1.5 (x y) & (y z) x z
Denition of J 1.6 x J
Denition of 1.7 x
Denition of 1.8 x y x
Denition of 1.9 (z x) & (z y) z (x y)
Denition of 1.12 x x y
Denition of 1.13 (x z) & (y z) (x y) z
Commutativity 1.10 x y = y x
Commutativity 1.14 x y = y x
Associativity 1.11 (x y) z = x (y z)
Associativity 1.15 (x y) z = x (y z)
Non-contradiction 1.16 x x
=
Non-contradiction 1.20 x x
= J
Double negation 1.17 (x
= x
Contraposition 1.18 x y y
and y
.
18
The rst row in table 1.3 refers to all classical pure states in
which both propositions x and y are false. The second row refer to states in
which x is false and y is true, etc.
Table 1.3: Truth table for basic logical operations
x y x y x y x
0 0 0 0 1 1
0 1 0 1 1 0
1 0 0 1 0 1
1 1 1 1 0 0
Table 1.4: Demonstration of the distributive law using truth table
x y z y z x (y z) x y x z (x y) (x z)
0 0 0 0 0 0 0 0
0 0 1 0 0 0 1 0
0 1 0 0 0 1 0 0
1 0 0 0 1 1 1 1
0 1 1 1 1 1 1 1
1 0 1 0 1 1 1 1
1 1 0 0 1 1 1 1
1 1 1 1 1 1 1 1
Another example is shown in Table 1.4. It demonstrates the validity of the
classical distributive law.
19
As this law involves three dierent propositions,
we need to consider 8 = 2
3
dierent groups of classical states that have
dierent values of their truth functions on the propositions x, y and z. In
all these cases truth functions in columns 5 and 8 are identical which means
that
x (y z) = (x y) (x z)
18
Here we assumed that all these propositions are non-empty.
19
Assertion 1.25
30 CHAPTER 1. QUANTUM MECHANICS
Assertion 1.26 can be derived in a similar way.
Thus we have shown that in the certain world of classical mechanics the
set of propositions / is an orthocomplemented atomic lattice with distribu-
tive laws 1.25 and 1.26. Such a lattice will be called a classical propositional
system or, shortly, classical logic. Study of classical logics and its relationship
to classical mechanics is the topic of the present section.
1.4.2 Atomic propositions in classical logic
Our next step is to demonstrate that classical logic provides the entire math-
ematical framework of classical mechanics, i.e., the description of observables
and states in the phase space. First, we prove four Lemmas.
Lemma 1.27 In classical logic, if x < y, then there exists an atom p such
that p x = and p y.
Proof. Clearly, y x
) = (y x) (y x
) = (y x) = y x x
and, by Lemma 1.4, x = y in contradiction with the condition of the present
Lemma. Since y x
and by Lemma
C.3 p x x
x = .
Lemma 1.28 In classical logic, the orthocomplement x
of a proposition x
(where x ,= J) is a join of all atoms not contained in x.
Proof. First, it is clear that there should exist al least one atom p that
is not contained in x. If it were not true, then we would have x = J in
contradiction to the condition of the Lemma. Let us now prove that the
atom p is contained in x
) = (p x) (p x
)
According to Lemma 1.22 we now have four possibilities:
1.4. CLASSICAL LOGIC 31
1. p x = and p x
= ; then p x x
= , which is impossible;
2. p x = p and p x
= p; then p = p p = (p x) (p x
) =
p (x x
) = p = , which is impossible;
3. p x = p and p x
, i.e., p is contained
in x
.
This shows that all atoms not contained in x are contained in x
. Further,
Lemmas 1.16 and 1.24 imply that all atoms contained in x
.
Proof. By Lemma 1.28, p
.
Lemma 1.30 In classical logic, the join x y of two propositions x and y
is a join of atoms contained in either x or y.
Proof. If p x or p y then p xy. Conversely, suppose that p xy
and p x = , p y = , then
p = p (x y) = (p x) (p y) = =
which is absurd.
Now we are ready to prove the important fact that in classical mechanics
(or in classical logic) propositions can be interpreted as subsets of a set S,
which is called the phase space.
32 CHAPTER 1. QUANTUM MECHANICS
Theorem 1.31 For any classical logic /, there exists a set S and an iso-
morphism f(x) between elements x of / and subsets of the set S such that
x y f(x) f(y) (1.7)
f(x y) = f(x) f(y) (1.8)
f(x y) = f(x) f(y) (1.9)
f(x
) = S f(x) (1.10)
where , , and are usual set-theoretical operations of inclusion, inter-
section, union and relative complement.
Proof. The statement of the theorem follows immediately if we choose
S to be the set of all atoms. Then property (1.7) follows from Lemma
1.23, equation (1.8) follows from Lemma 1.24. Lemmas 1.30 and 1.28 imply
equations (1.9) and (1.10), respectively.
1.4.3 Atoms and pure classical states
Lemma 1.32 In classical logic, if p is an atom and is a pure state such
that ([p) = 1,
20
then for any other atom q ,= p we have ([q) = 0.
Proof. According to Lemma 1.29, q p
) = 1 ([p) = 0
Lemma 1.33 In classical logic, if p is an atom and and are two pure
states such that ([p) = ([p) = 1, then = .
Proof. There are propositions of two kinds: those containing the atom p
and those not containing the atom p. For any proposition x containing the
atom p we denote by q the atoms contained in x and obtain using Postulate
1.21(2), Lemma 1.29, Postulate 1.19 and Lemma 1.32
20
such a state always exists due to Postulate 1.2.
1.4. CLASSICAL LOGIC 33
([x) = ([
qx
q) =
qx
([q) = ([p) = 1
The same equation holds for the state . Similarly we can show that for any
proposition y not containing the atom p
([y) = ([y) = 0
Since probability measures of and are the same for all propositions, these
two states are equal.
Theorem 1.34 In classical logic, there is an isomorphism between atoms p
and pure states
p
such that
(
p
[p) = 1 (1.11)
Proof. From Postulate 1.2 we know that for each atom p there is a state
p
in which equation (1.11) is valid. From Lemma 1.32 this state is unique. To
prove the reverse statement we just need to show that for each pure state
p
there is a unique atom p such that (
p
[p) = 1. Suppose that for each atom p
we have (
p
[p) = 0. Then, taking into account that J is a join of all atoms,
that all atoms are mutually disjoint and using (1.2) and Postulate 1.19, we
obtain
1 = (
p
[J) = (
p
[
pI
p) =
pI
(
p
[p) = 0
which is absurd. Therefore, for each state
p
one can always nd at least
one atom p such that equation (1.11) is valid. Finally, we need to show that
if p and q are two such atoms, then p = q. This follows from the fact that
for each pure classical state the probability measures (or the truth functions)
corresponding to propositions p and q are exactly the same. For the state
p
the truth function is equal to 1, for all other pure states the truth function
is equal to 0.
34 CHAPTER 1. QUANTUM MECHANICS
1.4.4 Phase space of classical mechanics
Now we are fully equipped to discuss the phase space representation in clas-
sical mechanics. Suppose that the physical system under consideration has
observables A, B, C, . . . with corresponding spectra S
A
, S
B
, S
C
, ... Accord-
ing to Theorem 1.34, for each atom p of the propositional system we can
nd its corresponding pure state
p
. All observables A, B, C, . . . have de-
nite values in this state.
21
Therefore the state
p
is characterized by a set
of real numbers A
p
, B
p
, C
p
, . . . - the values of observables. Let us suppose
that the full set of observables A, B, C, . . . contains a minimal subset of
observables X, Y, Z, . . . whose values X
p
, Y
p
, Z
p
, . . . uniquely enumerate
all pure states
p
and therefore all atoms p. So, there is a one-to-one corre-
spondence between groups of numbers X
p
, Y
p
, Z
p
, . . . and atoms p. Then
the set of all atoms can be identied with the direct product
22
of spectra of
this minimal set of observables S = S
X
S
Y
S
Z
. . .. This direct product
is called the phase space of the system. The values X
s
, Y
s
, Z
s
, . . . of the
independent observables X, Y, Z, . . . in each point s S provide the phase
space with coordinates. Other (dependent) observables A, B, C, . . . can
be represented as real functions A(s), B(s), C(s), . . . on S or as functions of
independent observables X, Y, Z, . . ..
In this representation, propositions can be viewed as subsets of the phase
space. Another way is to consider propositions as special cases of observables
(= real functions on S): The function corresponding to the proposition about
the subset T of the phase space is the characteristic function of this subset
(s) =
_
1, if s T
0, if s / T
(1.12)
Atomic propositions correspond to single-point subsets of the phase space S.
In subsection 6.5.4 we will consider one massive particle and build an explicit
and realistic example of the phase space for this physical system.
1.4.5 Classical probability measures
Probability measures have a simple interpretation in the classical phase space.
Each state (not necessarily a pure state) denes probabilities ([p) for all
21
See Assertion 1.1
22
See Appendix A.1 for the denition of the direct product of sets.
1.4. CLASSICAL LOGIC 35
atoms p (= all points s in the phase space). Each proposition x is a join of
disjoint atoms contained in x.
x =
qx
q
Then, by Postulate 1.21, Lemma 1.29 and Postulate 1.19 the probability of
the proposition x being true in the state is
([x) = ([
qx
q) =
qx
([q) (1.13)
So, the value of the probability measure for all propositions x is uniquely
determined by its values on atoms. In many important cases, the phase
space is continuous and instead of considering probabilities ([q) at points in
the phase space (= atoms) it is convenient to consider probability densities
which are functions (s) on the phase space such that
1) (s) 0;
2)
_
S
(s)ds = 1.
Then the value of the probability measure ([x) is obtained by the integral
([x) =
_
X
(s)ds
over the subset X corresponding to the proposition x.
For a pure classical state , the probability density is represented by the
delta function (s) = (s s
0
) localized at one point s
0
in the phase space.
For such states, the value of the probability measure in each proposition x
can be either 0 or 1: ([x) = 0 if the point s
0
does not belong to the subset
X corresponding to the proposition x and ([x) = 1 otherwise
23
23
States whose probability densities are nonzero at more than one point in the phase
space (i.e., the probability density is dierent from the delta function) are called classical
mixed states. We will not discuss them in this book.
36 CHAPTER 1. QUANTUM MECHANICS
([x) =
_
X
(s s
0
)ds =
_
1, if s
0
X
0, if s
0
/ X
This shows that for pure classical states the probability measure degenerates
into a two-valued truth function. This is in agreement with our discussion in
subsection 1.4.1.
As we discussed earlier, in classical pure states all observables have well
dened values. So, classical mechanics is a fully deterministic theory in which
one can, in principle, obtain a full information about the system at any given
time and knowing the rules of dynamics predict exactly its development in
the future. This belief was best expressed by P.S. Laplace:
An intelligence that would know at a certain moment all the forces
existing in nature and the situations of the bodies that compose
nature and if it would be powerful enough to analyze all these
data, would be able to grasp in one formula the movements of the
biggest bodies of the Universe as well as of the lightest atom.
1.5 Quantum logic
The above discussion of classical logic and phase spaces relied heavily on
the determinism (Assertion 1.1) of classical mechanics and on the validity of
distributive laws (Assertions 1.25 and 1.26). In quantum mechanics we are
not allowed to use these Assertions. In this section we will build quantum
logic, in which the determinism is not assumed a priori and the distributive
laws are not necessarily valid. Quantum logic is a foundation of the entire
mathematical formalism of quantum theory, as we will see in the rest of this
chapter.
1.5.1 Compatibility of propositions
Propositions x and y are said to be compatible (denoted x y) if
x = (x y) (x y
) (1.14)
y = (x y) (x
y) (1.15)
1.5. QUANTUM LOGIC 37
The notion of compatibility has a great importance for quantum theory. In
subsection 1.6.3 we will see that two propositions can be measured simulta-
neously if and only if they are compatible.
Theorem 1.35 In an orthocomplemented lattice all propositions are com-
patible if and only if the lattice is distributive.
Proof. If the lattice is distributive then for any two propositions x and y
(x y) (x y
) = x (y y
) = x J = x
and, changing places of x and y
(x y) (x
y) = y
These formulas coincide with our denitions of compatibility (1.14) and
(1.15), which proves the direct statement of the theorem.
The proof of the inverse statement (compatibility distributivity) is
more lengthy. We assume that all propositions in our lattice are compatible
with each other and choose three arbitrary propositions x, y and z. Now we
are going to prove that the distributive laws
24
(x z) (y z) = (x y) z (1.16)
(x z) (y z) = (x y) z (1.17)
are valid. First we prove that the following 7 propositions (some of them
may be empty) are mutually disjoint (see Fig. 1.4)
q
1
= x y z
q
2
= x
y z
q
3
= x y
z
q
4
= x y z
q
5
= x y
q
6
= x
y z
q
7
= x
z
38 CHAPTER 1. QUANTUM MECHANICS
xx yy
zz
qq
1 1
qq
5 5
qq
4 4
qq
6 6
qq
3 3
qq
2 2
qq
7 7
Figure 1.4: To the proof of Theorem 1.35.
For example, to show that propositions q
3
and q
5
are disjoint we notice that
q
3
z and q
5
z
5
and
q
3
z q
5
. Therefore by Lemma 1.5 q
3
q
5
.
Since by our assumption both xz and xz
) = q
1
q
3
x z
= (x z
y) (x z
) = q
4
q
5
x = (x z) (x z
) = q
1
q
3
q
4
q
5
Similarly we show
y z = q
1
q
2
y = q
1
q
2
q
4
q
6
z = q
1
q
2
q
3
q
7
Then denoting Q = q
1
q
2
q
3
we obtain
(x z) (y z) = (q
1
q
3
) (q
1
q
2
) = q
1
q
2
q
3
= Q (1.18)
24
Assertions 1.25 and 1.26
1.5. QUANTUM LOGIC 39
From Postulate 1.9 and y x = Q q
4
q
5
q
6
it follows that
Q (Q q
7
) (Q q
4
q
5
q
6
) = (x y) z (1.19)
On the other hand, from q
4
q
5
q
6
q
7
, Lemma C.3 and the denition of
compatibility it follows that
(x y) z = (Q q
4
q
5
q
6
) (Q q
7
) (Q q
7
) (Q q
7
) = Q(1.20)
Therefore, applying the symmetry Lemma 1.4 to equation (1.19) and (1.20),
we obtain
(x y) z = Q (1.21)
Comparing equations (1.18) and (1.21) we see that the distributive law (1.16)
is valid. The other distributive law (1.17) is obtained from equation (1.16)
by duality (see Appendix C).
This theorem tells us that in classical mechanics all propositions are compat-
ible. The presence of incompatible propositions is a characteristic feature of
quantum theories.
1.5.2 Logic of quantum mechanics
In quantum mechanics we are not allowed to use classical Assertion 1.1 and
we must abandon the distributive laws. However, in order to get a non-trivial
theory we need some substitute for these two properties. This additional pos-
tulate should be specic enough to yield sensible physics and general enough
to be non-empty and to include the distributive law as a particular case. The
latter requirement is justied by our desire to have classical mechanics as a
particular case of more general quantum mechanics.
To nd such a generalization we will use the following arguments. From
Theorem 1.35 we know that the compatibility of all propositions is a char-
acteristic property of classical Boolean lattices. We also mentioned that this
property is equivalent to simultaneous measurability of all propositions. We
know that in quantum mechanics not all propositions are simultaneously
40 CHAPTER 1. QUANTUM MECHANICS
measurable, therefore they cannot be compatible as well. This suggests that
we may try to nd a generalization of classical theory by limiting the set of
propositions that are mutually compatible. More specically, we will postu-
late that two propositions are denitely compatible if one implies the other
and leave it to mathematics to tell us about the compatibility of other pairs.
Postulate 1.36 (orthomodularity) Propositions about physical systems obey
the orthomodular law: If a implies b, then these two propositions are com-
patible
a b a b. (1.22)
Orthocomplemented lattices with additional orthomodular Postulate 1.36 are
called orthomodular lattices.
Is there any deeper physical justication for the above postulate? As
far as I know, there is none. The only justication is that the orthomodu-
larity postulate really works, i.e., it results in the well-known mathematical
structure of quantum mechanics, which has been thoroughly tested in ex-
periments. In principle, one can try to introduce a dierent postulate to
replace the classical distributivity relationships. If the resulting set of postu-
lates turned out to be self-consistent, then one would obtain a non-classical
theory that is also dierent from quantum mechanics. We will not explore
this possibility. So, in this book we will stick to orthomodular lattices and
to traditional laws of quantum mechanics that follow from them.
Before proceeding further, we need to introduce important notions of the
irreducibility of lattices and their rank. The center of a lattice is the set of
elements compatible with all others. Obviously and J are in the center.
A propositional system in which there are only two elements in the center
( and J) is called irreducible. Otherwise it is called reducible. Any
Boolean lattice having more than two elements ( and J are present in any
lattice, of course) is reducible and its center coincides with the entire lattice.
Orthomodular atomic irreducible lattices are called quantum propositional
systems or quantum logics. The rank of a propositional system is dened as
the maximum number of mutually disjoint atoms. For example, the rank of
the classical propositional system of one massive spinless particle described
in subsection 6.5.4 is the number of points in the phase space R
6
.
1.5. QUANTUM LOGIC 41
The most fundamental conclusion from our discussion in this section is
the following
Statement 1.37 (quantum logic) Experimental propositions form a quan-
tum propositional system (=orthomodular atomic irreducible lattice).
In principle, it should be possible to perform all constructions and calcu-
lations in quantum theory by using the formalism of orthomodular lattices
based on just described postulates. Such an approach would have certain
advantages because all its components have clear physical meaning: exper-
imental propositions x are realizable in laboratories and probabilities ([x)
can be directly measured in experiments. However, this approach meets
tremendous diculties mainly because lattices are rather exotic mathemati-
cal objects and we lack intuition when dealing with lattice operations.
We saw that in classical mechanics the happy alternative to obscure lattice
theory is provided by Theorem 1.31 which proves the isomorphism between
the language of classical logic and the physically transparent language of
phase spaces. Is there a similar equivalence theorem in the quantum case? To
answer this question, we may notice that there is a striking similarity between
algebras of projections on closed subspaces in a complex Hilbert space 1 (see
Appendices F and G) and quantum propositional systems discussed above.
In particular, if operations between projections (or subspaces) in the Hilbert
space are translated to the lattice operations according to Table 1.5,
25
then all
axioms of quantum logic can be directly veried. For example, the validity
of the Postulate 1.36 follows from Lemmas G.4 and G.5. Atoms can be
identied with one-dimensional subspaces or rays in 1. The irreducibility
follows from Lemma G.6.
1.5.3 Example: 3-dimensional Hilbert space
One can verify directly that distributive laws 1.25 and 1.26 are generally not
valid for subspaces in the Hilbert space 1. Consider, for example the system
of basis vectors and subspaces in a 3-dimensional Hilbert space 1 shown in
Fig. 1.5. The triples of vectors (a
1
, a
2
, a
3
) and (a
1
, b
2
, b
3
) form two orthogonal
25
We denote Sp(A, B) the linear span of two subspaces A and B in the Hilbert space.
(See Appendix A.3.) AB denotes the intersection of these subspaces. A
is the subspace
orthogonal to A.
42 CHAPTER 1. QUANTUM MECHANICS
Table 1.5: Translation of terms, symbols and operations used for subspaces in
the Hilbert space, projections on these subspaces and propositions in quan-
tum logics.
Subspaces Projections Propositions
X Y P
X
P
Y
= P
Y
P
X
= P
X
x y
X Y P
XY
x y
Sp(X, Y ) P
Sp(X,Y )
x y
X
1 P
X
x
0 0
1 1 J
ray x [xx[ x is an atom
sets. They correspond to 1-dimensional subspaces X
1
, X
2
, X
3
, Y
2
, Y
3
. In addi-
tion, two 2-dimensional subspaces Z and Z
1
can be formed as Z = Sp(X
1
, X
2
)
and Z
1
= Sp(X
2
, X
3
) = Sp(Y
2
, X
3
). These subspaces satisfy obvious rela-
tionships
Sp(X
2
, X
3
) = Sp(Y
2
, X
3
) = Sp(X
2
, Y
2
) = Z
1
, Sp(X
1
, X
2
) = Z, X
3
Y
2
= 0
X
2
Y
2
= 0, X
1
= Z
1
, Z Z
1
= X
2
Then one can nd a triple of subspaces for which the distributive laws in
Assertions 1.25 and 1.26 are not satised
Sp(Y
2
, (X
3
X
2
)) = Sp(Y
2
, 0) = Y
2
,= Z
1
= Z
1
Z
1
= Sp(Y
2
, X
3
) Sp(Y
2
, X
2
)
Y
2
Sp(X
3
, X
2
) = Y
2
Z
1
= Y
2
,= 0 = Sp(0, 0) = Sp((Y
2
X
3
), (Y
2
X
2
))
This means that the logic represented by subspaces in the Hilbert space
is dierent from the classical Boolean logic. However, the orthomodularity
postulate is valid there. For example, X
1
Z so the condition in (1.22) is
satised and these two subspaces are compatible according to (1.14) - (1.15)
Sp((Z X
1
), (Z X
1
)) = Sp(X
1
, (Z Z
1
)) = Sp(X
1
, X
2
) = Z
Sp((Z X
1
), (Z
X
1
)) = Sp(X
1
, (X
3
X
1
)) = Sp(X
1
, 0) = X
1
1.5. QUANTUM LOGIC 43
XX
2 2
XX
3 3
YY
2 2
HH
aa
1 1
aa
2 2
aa
3 3
bb
2 2
bb
3 3
00
Y Y
3 3
XX
1 1
ZZ
ZZ
1 1
Figure 1.5: Subspaces in a 3-dimensional Hilbert space 1.
1.5.4 Pirons theorem
Thus we have established that the set of closed subspaces (or projections
on these subspaces) in any complex Hilbert space 1 is a representation of
some quantum propositional system. The next question is: can we nd a
Hilbert space representation for each quantum propositional system? The
positive answer to this question is given by the important Pirons theorem
[Pir76, Pir64]
Theorem 1.38 (Piron) Any irreducible quantum propositional system (=
orthomodular atomic irreducible lattice) / of rank 4 or higher is isomorphic
to the lattice of closed subspaces in a Hilbert space 1 such that the corre-
spondences shown in Table 1.5 are true.
The proof of this theorem is beyond the scope of our book. Two further
remarks can be made regarding this theorems statement. First, all proposi-
tional systems of interest to physics have innite (even uncountable) rank, so
the condition rank 4 is not a signicant restriction. Second, the original
Pirons theorem does not specify the nature of scalars in the Hilbert space.
This theorem leaves the freedom of choosing any division ring with involutive
antiautomorphism as the set of scalars in 1. We can greatly reduce this free-
dom if we remember the important role played by real numbers in physics.
26
Therefore, it makes physical sense to consider only those rings which include
26
e.g., values of observables are always in R
44 CHAPTER 1. QUANTUM MECHANICS
R as a subring. In 1877 Frobenius proved that there are only three such
rings. They are real numbers R, complex numbers C and quaternions H. Al-
though there is vast literature on real and, especially, quaternionic quantum
mechanics [Stu60, Jau71], the relevance of these theories to physics remains
uncertain. Therefore, we will stick with complex numbers from now on.
Pirons theorem forms the foundation of the mathematical formalism of
quantum physics. In particular, it allows us to express the important notions
of observables and states in the new language of Hilbert spaces. In this lan-
guage pure quantum states are described by unit vectors in the Hilbert space.
Observables are described by Hermitian operators in the same Hilbert space.
These correspondences will be explained in the next section. Orthomodular
lattices of quantum logic, phase spaces of classical mechanics and Hilbert
spaces of quantum mechanics are just dierent languages for describing re-
lationships between states, observables and their measured values. Table 1.6
can be helpful for translation between these three languages.
1.5.5 Should we abandon classical logic?
In this section we have reached a seemingly paradoxical conclusion that one
cannot use classical logic as well as classical probability theory for reason-
ing about quantum systems. How could that be? Classical logic is the
foundation of the whole mathematics and the scientic method in general.
All mathematical theorems are being proved with the logic of Aristotle and
Boole. Even theorems of quantum mechanics are being proved using this
logic. However, quantum mechanics insists that propositions about quantum
systems satisfy laws of the non-Boolean quantum logic. The logic of classical
distributive lattices is just an approximation. Isnt there a contradiction?
Not really.
It is still permissible to use classical logic in quantum mechanical proofs
because, thanks to the Pirons theorem, we have replaced real life objects,
such as experimental propositions and probability measures, with abstract
and articial notions of Hilbert spaces, state vectors and Hermitian operators.
These abstractions is the price we pay for the privilege to keep using simple
classical logic. In principle, it should be possible to formulate entire quantum
theory using the language of propositions, quantum logic and probability
measures. However, such an approach has not been developed yet.
1.6. QUANTUM OBSERVABLES AND STATES 45
Table 1.6: Glossary of terms used in general quantum logic, in classical phase
space and in the Hilbert space of quantum mechanics.
nature Quantum logic Phase space Hilbert space
Statement proposition subset closed subspace
Unambiguous Atom Point Ray
statement
AND meet intersection intersection
OR join union linear span
NOT orthocomplement relative orthogonal
complement complement
IF...THEN implication inclusion inclusion
of subsets of subspaces
Observable proposition-valued real function Hermitian
measure on 1 operator
jointly compatible all observables commuting
measurable propositions are compatible operators
observables
mutually exclusive disjoint non-intersecting orthogonal
statements propositions subsets subspaces
Pure state Probability delta function Ray
measure
Mixed state Probability Probability Density
measure function operator
1.6 Quantum observables and states
1.6.1 Observables
Each observable F naturally denes a mapping (called a proposition-valued
measure) from the set of intervals of the real line R to propositions F
E
in
/. These propositions can be described in words: F
E
= the value of the
observable F is inside the interval E of the real line R. We already discussed
properties of propositions about one observable. They can be summarized
as follows:
The proposition corresponding to the intersection of intervals E
1
and
46 CHAPTER 1. QUANTUM MECHANICS
E
2
is the meet of propositions corresponding to these intervals
F
E
1
E
2
= F
E
1
F
E
2
(1.23)
The proposition corresponding to the union of intervals E
1
and E
2
is
the join of propositions corresponding to these intervals
F
E
1
E
2
= F
E
1
F
E
2
(1.24)
The proposition corresponding to the complement of interval E is the
orthocomplement of the proposition corresponding to E
F
R\E
= F
E
(1.25)
The minimal proposition corresponds to the empty subset of the real
line
F
= (1.26)
The maximal proposition corresponds to the real line itself.
F
R
= J (1.27)
Intervals E of the real line form a Boolean (distributive) lattice with re-
spect to set theoretical operations , , and . Due to the isomorphism
(1.23) - (1.27), the corresponding one-observable propositions F
E
also form a
Boolean lattice, which is a sublattice of our full propositional system. There-
fore, according to Theorem 1.35, all propositions about the same observable
are compatible. Due to the isomorphism propositionssubspaces, we
can use the same notation F
E
for subspaces (projections) in 1 corresponding
to intervals E. Then, according to Lemma G.5, all projections F
E
, referring
to one observable F, commute with each other.
Each point f in the spectrum of observable F is called an eigenvalue of
this observable. The subspace F
f
1 corresponding to the eigenvalue f is
called eigensubspace and projection P
f
onto this subspace is called a spectral
projection. Each vector in the eigensubspace is called eigenvector.
27
27
In the next subsection we will see that each vector in the Hilbert space denes a unique
pure quantum state. States dened by the above eigenvectors will be called eigenstates (of
the given observable F). Apparently, the observable F has denite values (=eigenvalues)
in its eigenstates. This means that eigenstates are examples of states whose existence was
guaranteed by Postulate 1.2.
1.6. QUANTUM OBSERVABLES AND STATES 47
Consider two distinct eigenvalues f and g of observable F. The corre-
sponding intervals (=points) of the real line are disjoint. Then propositions
F
f
and F
g
are disjoint too and corresponding (eigen)subspaces are orthogo-
nal. The linear span of subspaces F
f
, where f runs through entire spectrum
of F, is the full Hilbert space 1. Therefore, spectral projections of any
observable form a decomposition of unity.
28
So, according to discussion in
Appendix G.2, we can associate an Hermitian operator
F =
f
fP
f
(1.28)
with each observable F. In what follows we will often use terms observable
and Hermitian operator as synonyms.
1.6.2 States
As we discussed in subsection 1.3.1, each state of the system denes a
probability measure ([x) on propositions in quantum logic /. According
to the isomorphism propositions subspaces,
29
the state also denes
a probability measure ([X) on subspaces X in the Hilbert space 1. This
probability measure is a function from subspaces to the interval [0, 1] R
whose properties follow directly from equations (1.2), (1.3) and Postulate
1.19
The probability corresponding to the whole Hilbert space is 1 in all
states
([1) = 1 (1.29)
The probability corresponding to the empty subspace is 0 in all states
([0) = 0 (1.30)
The probability corresponding to the direct sum of orthogonal sub-
spaces
30
is the sum of probabilities for each subspace
([X Y ) = ([X) + ([Y ), if X Y (1.31)
28
see Appendix G.1
29
See Table 1.6.
30
Note that according to Table 1.6 orthogonal subspaces correspond to disjoint propo-
sitions. For denition of the direct sum of subspaces see Appendix G.1.
48 CHAPTER 1. QUANTUM MECHANICS
The following important theorem provides a classication of all such proba-
bility measures (= all states of the physical system).
Theorem 1.39 (Gleason [Gle57]) If ([X) is a probability measure on
closed subspaces in the Hilbert space 1 with properties (1.29) - (1.31), then
there exists a non-negative Hermitian operator in 1 such that
Tr() = 1 (1.32)
and for any subspace X with projection P
X
the value of the probability mea-
sure is
([X) = Tr(P
X
) (1.33)
Just a few comments about the terminology and notation used here: First,
a Hermitian operator is called non-negative if all its eigenvalues are greater
than or equal to zero. Second, the operator is usually called the density
operator or the density matrix. Third, Tr denotes trace
31
of the matrix of
the operator .
The proof of Gleasons theorem is far from trivial and we refer interested
reader to original works [Gle57, RB99]. Here we will focus on the physical
interpretation of this result. First, we may notice that, according to the
spectral theorem F.8, the operator can be always written as
=
i
[e
i
e
i
[ (1.34)
where [e
i
is an orthonormal basis in 1. Then the Gleasons theorem means
that
i
0 (1.35)
i
= 1 (1.36)
0
i
1 (1.37)
31
Trace is, basically, the sum of all diagonal elements of a matrix. See Appendix F.7.
1.6. QUANTUM OBSERVABLES AND STATES 49
Among all states satisfying equation (1.35) - (1.37) there are simple states
for which just one coecient
i
is non-zero. Then, from (1.36) it follows
that
i
= 1,
j
= 0 if j ,= i and the density operator degenerates to a
projection onto the one-dimensional subspace [e
i
e
i
[.
32
Such states will be
called pure quantum states. It is also common to describe a pure state by a
unit vector from its ray. Any unit vector from this ray represents the same
state, i.e., in the vector representation of states there is a freedom of choosing
an unimodular phase factor of the state vector. In what follows we will often
use the terms pure quantum state and state vector as synonyms.
Mixed quantum states are expressed as weighed sums of pure states
whose coecients
i
in equation (1.34) reect the probabilities with which
the pure states enter in the mixture. Therefore, in quantum mechanics there
are uncertainties of two types. The rst type is the uncertainty present in
mixed states. This uncertainty is already familiar to us from classical (sta-
tistical) physics. This uncertainty results from our insucient control of
preparation conditions (like when a bullet is red from a shaky rie). The
second uncertainty is present even in pure quantum states where probabili-
ties (1.33) can be dierent from classical values 0 and 1. This uncertainty
does not have a counterpart in classical physics and it cannot be avoided by
tightening the preparation conditions. This uncertainty is a reection of the
mysterious unpredictability of microscopic phenomena. We will not discuss
mixed quantum states in this book. So, we will deal only with uncertainties
of the fundamental quantum type.
1.6.3 Commuting and compatible observables
In subsection 1.5.1 we dened the notion of compatible propositions. In
Lemma G.5 we showed that the compatibility of propositions is equivalent
to the commutativity of corresponding projections. The importance of these
denitions for physics comes from the fact that for a pair of compatible
propositions (=projections=subspaces) there are states in which both these
propositions are certain, i.e., simultaneously measurable. A similar state-
ment can be made for two compatible (=commuting) Hermitian operators of
observables. According to Theorem G.9, such two operators have a common
basis of eigenvectors (=eigenstates). In these eigenstates both observables
have denite (eigen)values.
32
One-dimensional subspaces are also called rays.
50 CHAPTER 1. QUANTUM MECHANICS
We will assume that for any physical system there always exists a minimal
set of mutually compatible (= commuting) observables F, G, H, . . ..
33
Then,
we should be able to build an orthonormal basis of common eigenvectors [e
i
i
[e
i
(1.38)
where in the bra-ket notation
34
i
= e
i
[ (1.39)
The set of coecients
i
can be viewed as a function (f, g, h, . . .) on the
common spectrum of observables F, G, H, . . .. In this form, the coecients
i
are referred to as the wave function of the state [ in the representa-
tion dened by observables F, G, H, . . .. When the spectrum of operators
F, G, H, . . . is continuous, the index i is, actually, a continuous variable.
35
1.6.4 Expectation values
Equation (1.28) denes a spectral decomposition for each observable F, where
index f runs over all distinct eigenvalues of F. Then for each pure state [
we can nd the probability of measuring a value f of the observable F in
this state by using formula
36
33
The set F, G, H, . . . is called minimal if no observable from the set can be expressed
as a function of other observables from the same set. Any function of observables from the
minimal commuting set also commutes with F, G, H, . . . and with any other such function.
34
see Appendix F.3
35
Wave functions in the momentum and position representations for a single particle
will be discussed in section 5.2.
36
This is simply the value of the probability measure ([P
f
) (see subsection 1.3.1) cor-
responding to the spectral projection P
f
. One can also see that this formula is equivalent
to the Gleasons expression (1.33).
1.6. QUANTUM OBSERVABLES AND STATES 51
f
=
m
i=1
[e
f
i
[[
2
(1.40)
where [e
f
i
are basis vectors in the range of the projection
P
f
m
i=1
[e
f
i
e
f
i
[ (1.41)
and m is the dimension of the corresponding subspace. Sometimes we also
need to know the weighed average of values f. This is called the expectation
value of the observable F in the state [ and denoted F
F
f
f
Substituting here equation (1.40) we obtain
F =
n
j=1
[e
j
[[
2
f
j
n
j=1
[
j
[
2
f
j
where the summation is carried out over the entire basis [e
j
of eigenvectors
of the operator F with eigenvalues f
j
. By using decompositions (1.38), (1.28)
and (1.41) we obtain a more compact formula for the expectation value F
[F[ =
_
i
e
i
[
__
j
[e
j
f
j
e
j
[
__
k
[e
k
_
=
ijk
i
f
j
k
e
i
[e
j
e
j
[e
k
=
ijk
i
f
j
ij
jk
=
j
[
j
[
2
f
j
= F (1.42)
52 CHAPTER 1. QUANTUM MECHANICS
1.6.5 Basic rules of classical and quantum mechanics
Results obtained in this chapter can be summarized as follows. If we want
to calculate the probability for measuring the value of the observable F
inside the interval E R for a system prepared in a pure state , then we
need to perform the following steps:
In classical mechanics:
1. Dene the phase space S of the physical system;
2. Find a real function f : S R corresponding to the observable F;
3. Find the subset U of S corresponding to the subset E of the spectrum
of the observable F (U is the set of all points s S such that f(s) E);
4. Find the point s
U and = 0 otherwise.
In quantum mechanics:
1. Dene the Hilbert space 1 of the physical system;
2. Find the Hermitian operator F in 1 corresponding to the observable;
3. Find the eigenvalues and eigenvectors of the operator F;
4. Find a spectral projection P
E
corresponding to the subset E of the
spectrum of the operator F.
5. Find the unit vector [ (dened up to an arbitrary unimodular factor)
representing the state of the system.
6. Use formula = [P
E
[
At this point, there seems to be no connection between the classical and
quantum rules. However, we will see in subsection 6.5 that in the macroscopic
world with massive objects and poor resolution of instruments, the classical
rules emerge as a decent approximation to the quantum ones.
1.7. INTERPRETATIONS OF QUANTUM MECHANICS 53
1.7 Interpretations of quantum mechanics
In sections 1.3 - 1.6 of this chapter we focused on the mathematical formalism
of quantum mechanics. Now it is time to discuss the physical meaning and
interpretation of these formal rules.
1.7.1 Quantum unpredictability in microscopic systems
Experiments with quantum microsystems have revealed one simple and yet
mysterious fact: if we prepare N absolutely identical physical systems in the
same conditions and measure the same observable in each of them, we may
nd N dierent results.
Let us illustrate this experimental nding by few examples. We know from
experience that each photon passing through the hole in the camera obscura
will hit the photographic plate at some point on the photographic plate.
However, each new released photon will hit at a dierent place. Quantum
mechanics allows us to calculate the probability density for these points,
but apart from that, the behavior of each individual photon appears to be
completely random. Quantum mechanics does not even attempt to predict
where each individual photon will hit the target.
Another example of such an apparently random behavior is the decay of
unstable nuclei. The nucleus of
232
Th has the lifetime of 14 billion years.
This means that in any sample containing thorium, approximately half of all
232
Th nuclei will decay after 14 billion years. In principle, quantum mechan-
ics can calculate the probability of the nuclear decay as a function of time by
solving the corresponding Schr odinger equation.
37
However, quantum me-
chanics cannot even approximately guess when any given nucleus will decay.
It could happen today, or it could happen 100 billion years from now.
Although, such unpredictability is certainly a hallmark of microscopic
systems it would be wrong to think that it is not aecting our macroscopic
world. Quite often the eect of random microscopic processes can be am-
plied to produce a sizable equally random macroscopic eect. One famous
example of the amplication of quantum uncertainties is the thought exper-
iment with the Schr odinger cat [Sch35].
So, our world (even at the macroscopic scale) is full of truly random events
whose exact description and prediction is beyond capabilities of modern sci-
37
though our current knowledge of nuclear forces is insucient to make a reliable cal-
culation of that sort for thorium.
54 CHAPTER 1. QUANTUM MECHANICS
ence. Nobody knows why physical systems have this random unpredictable
behavior. Quantum mechanics simply accepts this fact and does not attempt
to explain it. Quantum mechanics does not describe what actually happens;
it describes the full range of possibilities of what might have happened and
the probability of each possible outcome. Each time nature chooses just
one possibility from this range, while obeying the probabilities predicted by
quantum mechanics. QM cannot say anything about which particular choice
will be made by nature. These choices are completely random and beyond
explanation by modern science. This observation is a bit disturbing and em-
barrassing. Indeed, we have real physically measurable eects (the actual
choices made by nature) for which we have no control and no power to pre-
dict the outcome. These are facts without an explanation, eects without
a cause. It seems that microscopic particles obey some mysterious random
force. Then it is appropriate to ask what is the reason for such stochastic
behavior of micro-systems? Is it truly random or it just seems to be random?
If quantum mechanics cannot explain this random behavior, maybe there is
a deeper theory that can?
1.7.2 Hidden variables
One school of thought attributes the apparently random behavior of mi-
crosystems to some yet unknown hidden variables, which are currently
beyond our observation and control. According to these views, put some-
what simplistically, each photon in camera obscura has a guiding mechanism
which directs it to a certain predetermined spot on the photographic plate.
Each unstable nucleus has some internal alarm clock ticking inside. The
nucleus decays when the alarm goes o. The behavior of quantum systems
just appears to be random to us because so far we dont have a clue about
these guiding mechanisms and alarm clocks.
According to the hidden variables theory, quantum mechanics is not the
nal word and future theory will be able to fully describe the properties of
individual systems and predict events without relying on chance. There are
two problems with this point of view. First, so far nobody was able to build
a convincing theory of hidden variables and to predict (even approximately)
outcomes of quantum measurements beyond calculated probabilities. The
second reason to reject the hidden variables argument is more formal.
The hidden variables theory says that the randomness of micro-systems
does not have any special quantum-mechanical origin. It is the same classi-
1.7. INTERPRETATIONS OF QUANTUM MECHANICS 55
cal pseudo-randomness as seen in the usual coin-tossing or die-rolling. The
hidden variables theory implies that rules of classical mechanics apply to
micro-systems just as well as to macro-systems. As we saw in section 1.4.1,
these rules are based on the classical Assertion 1.1 of determinism. Quan-
tum mechanics simply discards this unprovable Assertion and replaces it
with a weaker Postulate 1.2. So, quantum mechanics with its probabilities
is a more general mathematical framework and classical mechanics with its
determinism can be represented as a particular case of this framework. As
Mittelstaedt put it [Mit]
...classical mechanics is loaded with metaphysical hypotheses which
clearly exceed our everyday experience. Since quantum mechanics
is based on strongly relaxed hypotheses of this kind, classical me-
chanics is less intuitive and less plausible than quantum mechan-
ics. Hence classical mechanics, its language and its logic cannot
be the basis of an adequate interpretation of quantum mechanics.
P. Mittelstaedt
1.7.3 Measurement problem
If we now accept the probabilistic quantum view of reality, we must address
some deep paradoxes. The major apparent paradox is that in quantum me-
chanics the wave function of a physical system evolves in time smoothly and
unitarily according to the Schr odinger equation (5.49) up until the instant of
measurement, at which point the wave function experiences an unpredictable
and abrupt collapse.
The puzzling part is that it is not clear how the wave function knows
when it can undergo the continuous evolution and when it should collapse.
Where is the boundary between the measuring device and the quantum sys-
tem? For example, it is customary to say that the photon is the quantum
system and the photographic plate is the measuring device. However, we can
adopt a dierent view and include the photographic plate together with the
photon in our quantum system. Then, we should, in principle, describe both
the photon and the photographic plate by a joint wave function. When does
this wave function collapse? Where is the measuring apparatus in this case?
Humans eye? Does it mean that while we are not looking, the entire system
(photon + photographic plate) remains in a superposition state? Following
this logic we may easily reach a seemingly absurd conclusion that the ultimate
56 CHAPTER 1. QUANTUM MECHANICS
measuring device is humans brain and all events remain potentialities until
they are registered by mind. For many physicists these contradictions signify
some troubling incompleteness of quantum theory, its inability to describe
the world as it is.
In order to avoid the controversial wave function collapse, several so-
called interpretations of quantum mechanics were proposed. In the de
BroglieBohms pilot wave interpretation it is postulated that the electron
propagating in the double-slit setup is actually a classical point particle,
whose movement is guided by a separate material wave that obeys the
Schr odinger equation. In the many worlds interpretation it is assumed that
at the instant of measurement (when several outcomes are possible, according
to quantum mechanics) the world splits into several (or even innite number
of) copies, so all outcomes are realized at once. We see only a single outcome
because we live just in one copy of the world and lack the bird view of the
many worlds reality.
Interpretations of quantum mechanics attempt to suggest some kinds of
physical mechanisms of the quantum systems behavior and the measure-
ment process. However, how we can be sure that these mechanisms are
correct? The only method of verication available in physics is experiment,
but suggested mechanisms are related to things happening in the physical
system while it is not observed. So, it is impossible to design experiments
that would (dis)prove interpretations of quantum mechanics. Being unac-
cessible to experimental verication these interpretations should belong to
philosophy rather than physics.
Actually, the collapse or measurement paradoxes are not as serious
as they look. In authors view, their appearance is related simply to our
unrealistic expectations regarding the explanatory power of physical theory.
Intuitively we wish to have a physical theory that encompasses all physical
reality: the physical system, the measuring apparatus, the observer and the
entire universe. However this goal is perhaps too ambitious and misleading.
Recall that the goal of a physical theory declared in Introduction is to provide
a formalism that allows us to predict results of experiments.
38
In physics we
do not want and do not need to describe the whole world as it is. We
should be entirely satised if our theory allows us to calculate the outcome
of any conceivable measurement, which is a more modest task.
Thus certain aspects of reality are beyond the reach of quantum theory.
38
More precisely, the theory should be able to calculate probabilities of measurements.
1.7. INTERPRETATIONS OF QUANTUM MECHANICS 57
In this sense, quantum mechanics can be regarded as incomplete theory.
However, this author believes that this incompleteness is not a problem, but
a reection of the fundamental unavoidable unpredictability of nature. Here
the following quote from Einstein seems appropriate:
I now imagine a quantum theoretician who may even admit that
the quantum-theoretical description refers to ensembles of systems
and not to individual systems, but who, nevertheless, clings to the
idea that the type of description of the statistical quantum theory
will, in its essential features, be retained in the future. He may
argue as follows: True, I admit that the quantum-theoretical de-
scription is an incomplete description of the individual system. I
even admit that a complete theoretical description is, in principle,
thinkable. But I consider it proven that the search for such a com-
plete description would be aimless. For the lawfulness of nature
is thus constructed that the laws can be completely and suitably
formulated within the framework of our incomplete description.
To this I can only reply as follows: Your point of view - taken as
theoretical possibility - is incontestable. A. Einstein [Ein49]
The quantum mechanical distinction between the observed system and
the measuring apparatus is not as problematic as often claimed. This dis-
tinction is naturally present in every experiment. If properly asked, the
experimentalist will always tell you which part of his setup is the observed
system and which part is the measuring apparatus.
39
Therefore there is noth-
ing wrong in applying dierent descriptions to these two parts. In quantum
theory the state of the physical system is described as a vector in the Hilbert
space and the measuring apparatus is described as an Hermitian operator
in the same Hilbert space. The measuring apparatus is not considered to
be a dynamical object. This means that there is no point to describe the
39
For instance, in the above example of the double-slit experiment the photon is the
physical system and the photographic plate is the measuring apparatus. The one-photon
Hilbert space should be used for the quantum-mechanical analysis of this experiment. If
we like, we can consider the photon + photographic plate as our physical system, but
this would mean that we have changed completely the experimental setup and the range
of meaningful questions that can be asked about it. The new setup should be described
quantum-mechanically in a dierent Hilbert space with dierent state vectors and dierent
operators of observables.
58 CHAPTER 1. QUANTUM MECHANICS
act of measurement as interaction between the physical system and the
measuring apparatus by means of some dynamical theory.
40
The collapse of the wave function is not a dynamical process, it is just
a part of a mathematical formalism that allows us to full to true task of
any physical theory - to predict outcomes of experiments. So there is no
any contradiction or paradox between the unitary time evolution of wave
functions and the abrupt collapse at the time of measurement.
1.7.4 Agnostic interpretation of quantum mechanics
The things we said above can be summarized in the following statements:
1. Quantum mechanics does not pretend to provide a description of the
entire universe. It only applies to the description of specic experiments
in which the physical system and the measuring apparatus are clearly
separated.
2. Quantum mechanics does not provide a mechanism of what goes on
while the physical system is not observed or while it is measured. Quan-
tum mechanics is just a mathematical recipe for calculating probabil-
ities of experimental outcomes. The ingredients used in this recipe
(Hilbert space, state superpositions, wave functions, Hermitian opera-
tors, etc.) have no direct relationship to things observable in nature.
They are just mathematical symbols.
3. We cannot measure all observables at once. A realistic experiment
measures only one observable, or, in the best case, a few mutually
compatible observables.
4. Nature is inherently probabilistic. There exists a certain level of ran-
dom noise that leads to unpredictability of the results of measure-
ments.
5. Logical propositions about measurements do not obey the set of clas-
sical Boolean axioms. The distributive law of logic is not valid and
should be replaced by the orthomodular law.
40
See e.g., von Neumanns measurement theory.
1.7. INTERPRETATIONS OF QUANTUM MECHANICS 59
The most important philosophical lesson taught to us by quantum me-
chanics is the unwillingness to speculate about things that are not observable.
Einstein was very displeased with this point of view. He wrote:
I think that a particle must have a separate reality independent of
the measurements. That is an electron has spin, location and so
forth even when it is not being measured. I like to think that the
moon is there even if I am not looking at it. A. Einstein
Actually, quantum mechanics does not deny that things exist even when they
are not being measured. However, we will prefer to remain agnostic about
non-observable features and refuse to use them as the basis for building our
theory.
60 CHAPTER 1. QUANTUM MECHANICS
Chapter 2
THE POINCAR
E GROUP
There are more things in Heaven and on earth, dear Horacio,
than are dreamed of in your philosophy.
Hamlet
In the preceding chapter we have learned that each physical system can
be described mathematically by a Hilbert space. Rays (unit vectors dened
up to a phase multiplier) in this space are in one-to-one correspondence
with (pure) states of the system. Observables are described by Hermitian
operators. This vague description is not sucient for a working theory. We
are still lacking precise classication of possible physical systems; we still
do not know which operators correspond to usual observables like position,
momentum, mass, energy, spin, etc. and how these operators are related to
each other; we still cannot tell how states and observables evolve in time.
Our theory is not complete.
It appears that the missing pieces mentioned above are supplied by the
principle of relativity - which is one of the most powerful ideas in physics.
This principle has a very general character. It works independent on what
physical system, state or observable is considered. Basically, this principle
says that there is no preferred inertial reference frame (or observer or lab-
oratory). All frames are equivalent if they are at rest or move uniformly
without rotation or acceleration. Moreover, the principle of relativity rec-
ognizes certain (group) properties of inertial transformations between these
61
62 CHAPTER 2. THE POINCAR
E GROUP
observers. Our primary goal here is to establish that the group of transforma-
tions between inertial observers is the celebrated Poincare group. In the rest
of this book we will have many opportunities to appreciate the fundamental
importance of this idea for relativistic physics.
One can notice that the principle of relativity discussed here is the same
as the rst postulate of Einsteins special relativity. In this book we will
not need Einsteins second postulate, which claims the independence of the
speed of light on the velocity of the source or observer. Actually, we will
nd out that by combining the rst postulate, the Poincare group idea and
laws of quantum mechanics one can obtain a complete working formalism of
relativistic quantum theory. This will be done in chapter 3. We will also
see in chapter 5 that the second postulate is redundant, because the speed
of photons appears to be invariant anyway. Another distinctive feature of
our relativistic approach is that we never assume the existence of the 4-
dimensional Minkowski manifold, which unies space and time. Time and
position play very dierent roles in our theory. The signicance of this idea
will be discussed in chapter 15 in the second part of this book.
2.1 Inertial observers
2.1.1 Principle of relativity
As has been said in Introduction, in this book we consider only inertial
laboratories. What is so special about them? The answer is that one can
apply the powerful principle of relativity to such laboratories. The essence of
this principle was best explained by Galileo more than 370 years ago [Gal01]:
Shut yourself up with some friend in the main cabin below
decks on some large ship and have with you there some ies, but-
teries and other small ying animals. Have a large bowl of water
with some sh in it; hang up a bottle that empties drop by drop
into a wide vessel beneath it. With the ship standing still, observe
carefully how the little animals y with equal speed to all sides of
the cabin. The sh swim indierently in all directions; the drops
fall into the vessel beneath; and, in throwing something to your
friend, you need to throw it no more strongly in one direction
than another, the distances being equal; jumping with your feet
together, you pass equal spaces in every direction. When you have
2.1. INERTIAL OBSERVERS 63
observed all of these things carefully (though there is no doubt that
when the ship is standing still everything must happen this way),
have the ship proceed with any speed you like, so long as the mo-
tion is uniform and not uctuating this way and that. You will
discover not the least change in all the eects named, nor could
you tell from any of them whether the ship was moving or stand-
ing still. In jumping, you will pass on the oor the same spaces
as before, nor will you make larger jumps toward the stern than
towards the prow even though the ship is moving quite rapidly,
despite the fact that during the time that you are in the air the
oor under you will be going in a direction opposite to your jump.
In throwing something to your companion, you will need no more
force to get it to him whether he is in the direction of the bow or
the stern, with yourself situated opposite. The droplets will fall as
before into the vessel beneath without dropping towards the stern,
although while the drops are in the air the ship runs many spans.
The sh in the water will swim towards the front of their bowl
with no more eort than toward the back and will go with equal
ease to bait placed anywhere around the edges of the bowl. Finally
the butteries and ies will continue their ights indierently to-
ward every side, nor will it ever happen that they are concentrated
toward the stern, as if tired out from keeping up with the course
of the ship, from which they will have been separated during long
intervals by keeping themselves in the air.
These observations can be translated into the statement that all inertial lab-
oratories cannot be distinguished from the laboratory at rest by performing
experiments conned to those laboratories. Any experiment performed in
one laboratory, will yield exactly the same result as an identical experiment
in any other laboratory. The results will be the same independent on how
far apart the laboratories are and what are their relative orientations and
velocities. Moreover, we may repeat the same experiment at any time, to-
morrow, or many years later, still results will be the same. This allows us to
formulate one of the most important and deep postulates in physics
Postulate 2.1 (the principle of relativity) In all inertial laboratories,
the laws of nature are the same: they do not change with time, they do not
depend on the position and orientation of the laboratory in space and on its
64 CHAPTER 2. THE POINCAR
E GROUP
velocity. The laws of physics are invariant against inertial transformations
of laboratories.
2.1.2 Inertial transformations
Our next goal is to study inertial transformations between laboratories in
more detail. To do this we do not need to consider physical systems at all. It
is sucient to think about a world inhabited only by laboratories. The only
thing these laboratories can do is to measure parameters
; v; r; t of their
fellow laboratories. It appears that even in this oversimplied world we can
learn quite a few useful things about properties of inertial laboratories and
their relationships to each other.
Let us rst introduce a convenient labeling of inertial observers and in-
ertial transformations. We choose an arbitrary frame O as our reference
observer, then other examples of observers are
(i) an observer
0; 0; 0; t
1
O displaced in time by the amount t
1
;
1
(ii) an observer
0; 0; r
1
; 0O shifted in space by the vector r
1
;
(iii) an observer
0; v
1
; 0; 0O moving with velocity v
1
;
(iv) an observer
1
; 0; 0; 0O rotated by the vector
1
.
2
Suppose now that we have three dierent inertial observers O, O
and O
.
There is an inertial transformation
1
; v
1
; r
1
; t
1
which connects O and O
1
; v
1
; r
1
; t
1
O (2.1)
where parameters
1
, v
1
, r
1
and t
1
are measured by the ruler and clock
belonging to the reference frame O with respect to its basis vectors. Similarly,
there is an inertial transformation that connects O
and O
2
; v
2
; r
2
; t
2
O
(2.2)
1
Recall that observers considered in this book are instantaneous, so the time ow is
regarded as time translation of observers.
2
The parameterization of rotations by 3-vectors is discussed in Appendix D.5.
2.1. INERTIAL OBSERVERS 65
where parameters
2
, v
2
, r
2
and t
2
are dened with respect to the basis
vectors, ruler and clock of the observer O
3
; v
3
; r
3
; t
3
O (2.3)
with all transformation parameters referring to O. We can represent the
transformation (2.3) as a composition or product of transformations (2.1)
and (2.2)
3
; v
3
; r
3
; t
3
=
2
; v
2
; r
2
; t
2
1
; v
1
; r
1
; t
1
(2.4)
Apparently, this product has the property of associativity.
3
Also, there ex-
ists a trivial (identity) transformation
; v; r; t there is an inverse
transformation
; v; r; t
1
such that their product is the identity transfor-
mation
; v; r; t
; v; r; t
1
=
; v; r; t
1
; v; r; t =
0; 0; 0; 0 (2.5)
In other words, the set of inertial transformations forms a group (see Ap-
pendix A.2). Moreover, since these transformations smoothly depend on
their parameters, this is a Lie group (see Appendix E.1). The main goal of
the present chapter is to study the properties of this group in some detail. In
particular, we will need explicit formulas for the composition and inversion
laws.
First we notice that a general inertial transformation
; v; r; t can be
represented as a product of basic transformations (i) - (iv). As these basic
transformations generally do not commute, we must agree on the canonical
order in this product. For our purposes the following choice is convenient
; v; r; tO =
; 0; 0; 0
0; v; 0; 0
0; 0; r; 00; 0;
0; tO (2.6)
This means that in order to obtain observer O
; v; r; tO we rst shift
observer O in time by the amount t, then shift the time-translated observer
by the vector r, then give it velocity v and nally rotate the obtained observer
by the angle
.
3
see equation (A.1)
66 CHAPTER 2. THE POINCAR
E GROUP
2.2 Galilei group
In this section we begin our study of the group of inertial transformations
by considering a non-relativistic world in which observers move with low
speeds. This is a relatively easy task, because in these derivations we can
use our everyday experience and common sense. The relativistic group of
transformations will be approached in section 2.3 as a formal generalization
of the Galilei group derived here.
2.2.1 Multiplication law of the Galilei group
Let us rst consider four examples of products (2.4) in which
1
; v
1
; r
1
; t
1
2
; v
2
; r
2
; t
2
is one of the basic
transformations from the list (i) - (iv). Applying a time translation to a
general reference frame
1
; v
1
; r
1
; t
1
O will change its time label and change
its position in space according to equation
0; 0; 0; t
2
1
; v
1
; r
1
; t
1
O =
1
; v
1
; r
1
+v
1
t
2
; t
1
+ t
2
O (2.7)
Space translations aect the position
0; 0; r
2
; 0
1
; v
1
; r
1
; t
1
O =
1
; v
1
; r
1
+r
2
; t
1
O (2.8)
Boosts change the velocity
0; v
2
; 0; 0
1
; v
1
; r
1
; t
1
O =
1
; v
1
+v
2
; r
1
; t
1
O (2.9)
Rotations aect all vector parameters
4
2
; 0; 0; 0
1
; v
1
; r
1
; t
1
O =
(R
2
R
1
); R
2
v
1
; R
2
r
1
; t
1
O (2.10)
Now we can calculate the product of two general inertial transformations in
(2.4) by using (2.6) - (2.10)
5
4
For denition of 3 3 rotation matrices R
and function
see Appendix D.5.
5
Note that sometimes the product of Galilei transformations is written in other forms.
See, for example, section 3.2 in [Bal98], where the assumed canonical order of factors was
dierent from our formula (2.6).
2.2. GALILEI GROUP 67
2
; v
2
; r
2
; t
2
1
; v
1
; r
1
; t
1
2
; 0; 0; 0
0; v
2
; 0; 0
0; 0; r
2
; 0
0; 0; 0; t
2
1
; v
1
; r
1
; t
1
2
; 0; 0; 0
0; v
2
; 0; 0
0; 0; r
2
; 0
1
; v
1
; r
1
+v
1
t
2
; t
1
+ t
2
2
; 0; 0; 0
0; v
2
; 0; 0
1
; v
1
; r
1
+v
1
t
2
+r
2
; t
1
+ t
2
2
; 0; 0; 0
1
; v
1
+v
2
; r
1
+v
1
t
2
+r
2
; t
1
+ t
2
(R
2
R
1
); R
2
(v
1
+v
2
); R
2
(r
1
+v
1
t
2
+r
2
); t
1
+ t
2
(2.11)
By direct substitution to equation (2.5) it is easy to check that the inverse
of a general inertial transformation
; v; r; t is
; v; r; t
1
=
; v; r +vt; t (2.12)
Equations (2.11) and (2.12) are multiplication and inversion laws which fully
determine the structure of the Lie group of inertial transformations in non-
relativistic physics. This group is called the Galilei group.
2.2.2 Lie algebra of the Galilei group
In physical applications the Lie algebra of the group of inertial transforma-
tions plays even greater role than the group itself. According to our discus-
sion in Appendix E, we can obtain the basis (1,
T,
/,
) in the Lie algebra
of generators of the Galilei group by taking derivatives with respect to pa-
rameters of one-parameter subgroups. For example, the generator of time
translations is
1 = lim
t0
d
dt
0; 0; 0; t
For generators of space translations and boosts along the x-axis we obtain
T
x
= lim
x0
d
dx
0; 0; x, 0, 0; 0
/
x
= lim
v0
d
dv
0; v, 0, 0; 0; 0
68 CHAPTER 2. THE POINCAR
E GROUP
The generator of rotations around the x-axis is
x
= lim
0
d
d
, 0, 0; 0; 0; 0
Similar formulas are valid for y- and z-components. According to (E.1) we
can also express nite transformations as exponents of generators
0; 0; 0; t = e
Ht
1 +1t (2.13)
0; 0; r; 0 = e
Pr
1 +
Tr (2.14)
0; v; 0; 0 = e
Kv
1 +
/v (2.15)
; 0; 0; 0 = e
1 +
Then each group element can be represented in its canonical form (2.6) as
the following function of parameters
; v; r; t
; 0; 0; 0
0; v; 0; 0
0; 0; r; 00; 0;
0; t
= e
Kv
e
Pr
e
Ht
(2.16)
Let us now nd the commutation relations between generators, i.e., the
structure constants of the Galilei Lie algebra. Consider, for example, trans-
lations in time and space. From equation (2.11) we have
0; 0; 0; t
0; 0; x, 0, 0; 0 =
0; 0; x, 0, 0; 0
0; 0; 0; t
This implies
e
Ht
e
Pxx
= e
Pxx
e
Ht
1 = e
Pxx
e
Ht
e
Pxx
e
Ht
Using equations (2.13) and (2.14) for the exponents we can write to the rst
order in x and to the rst order in t
2.2. GALILEI GROUP 69
1 (1 +T
x
x)(1 +1t)(1 T
x
x)(1 1t)
1 +T
x
1xt T
x
1xt 1T
x
xt +T
x
1xt
= 1 1T
x
xt +T
x
1xt
hence
[T
x
, 1] T
x
11T
x
= 0
So, generators of space and time translations have vanishing Lie bracket.
Similarly we obtain Lie brackets
[1, T
i
] = [T
i
, T
j
] = [/
i
, /
j
] = [/
i
, T
j
] = 0
for any i, j = x, y, z (or i, j = 1, 2, 3). The composition of a time translation
and a boost is more interesting since they do not commute. We calculate
from equation (2.11)
e
Kxv
e
Ht
e
Kxv
=
0; v, 0, 0; 0, 0
0; 0; 0; t
0; v, 0, 0; 0; 0
=
0; v, 0, 0; 0; 0
0; v, 0, 0; vt, 0, 0; t
=
0; 0, 0, 0; vt, 0, 0; t
= e
Ht
e
Pxvt
Therefore, using equations (2.13), (2.15) and (E.13) we obtain
1t + [/
x
, 1]vt = 1t T
x
vt
[/
x
, 1] = T
x
Proceeding in a similar fashion for other pairs of transformations we obtain
the full set of commutation relations for the Lie algebra of the Galilei group.
70 CHAPTER 2. THE POINCAR
E GROUP
[
i
, T
j
] =
3
k=1
ijk
T
k
(2.17)
[
i
,
j
] =
3
k=1
ijk
k
(2.18)
[
i
, /
j
] =
3
k=1
ijk
/
k
(2.19)
[
i
, 1] = 0 (2.20)
[T
i
, T
j
] = [T
i
, 1] = 0 (2.21)
[/
i
, /
j
] = 0 (2.22)
[/
i
, T
j
] = 0 (2.23)
[/
i
, 1] = T
i
(2.24)
From these Lie brackets one can identify several important sub-algebras of
the Galilei Lie algebra and, therefore, subgroups of the Galilei group. In
particular, there is an Abelian subgroup of space and time translations (with
generators
T and 1, respectively), a subgroup of rotations (with generators
= gO
Suppose that observer O performs an inertial transformation with the group
element h (e.g., h is a translation along the x-axis). We want to nd a
transformation h
to O, performing
translation h there and then returning back to the reference frame O
in dier-
ent reference frames. g = exp(J
z
) is a rotation around the z-axis that is
perpendicular to the page.
h
= ghg
1
Similarly, if / is a generator of an inertial transformation in the reference
frame O, then
/
= g/g
1
(2.25)
is the same generator in the reference frame O
= gO.
Let us consider the eect of rotation around the z-axis on generators of
the Galilei group. We can write
/
x
/
x
() = e
Jz
/
x
e
Jz
/
y
/
y
() = e
Jz
/
y
e
Jz
/
z
/
z
() = e
Jz
/
z
e
Jz
where
/ is any of the generators
T,
or
/. From Lie brackets (2.17) -
(2.19) we obtain
72 CHAPTER 2. THE POINCAR
E GROUP
/
x
() = e
Jz
(
z
/
x
/
x
z
)e
Jz
= e
Jz
/
y
e
Jz
= /
y
() (2.26)
/
y
() = e
Jz
(
z
/
y
/
y
z
)e
Jz
= e
Jz
/
x
e
Jz
= /
x
()
/
z
() = e
Jz
(
z
/
z
/
z
z
)e
Jz
= 0 (2.27)
Taking a derivative of equation (2.26) by we obtain a second order dier-
ential equation
/
x
() =
/
y
() = /
x
()
with the general solution
/
x
() = B cos +Tsin
with arbitrary B and T. From the initial conditions we obtain
B = /
x
(0) = /
x
T =
d
d
/
x
()
=0
= /
y
so that nally
/
x
() = /
x
cos +/
y
sin (2.28)
Similar calculations show that
/
y
() = /
x
sin +/
y
cos (2.29)
/
z
() = /
z
(2.30)
Comparing (2.28) - (2.30) with equation (D.12), we see that
2.2. GALILEI GROUP 73
00
SS
S
xx
xx
vv
vv
Figure 2.2: Transformation of generators under space inversion.
/
i
= e
Jz
/
i
e
Jz
=
3
j=1
(R
z
)
ij
/
j
(2.31)
where R
z
is the rotation matrix. As shown in equation (D.21), we can nd
the result of application of a general rotation
; 0; 0; 0 to generators
/
= e
/e
=
/cos +
_
/
_
(1 cos )
_
/
_
sin
= R
/
This means that
T,
and
/ are 3-vectors.
6
The Lie bracket (2.20) obviously
means that 1 is a 3-scalar.
2.2.4 Space inversions
We will not consider physical consequences of discrete transformations (in-
version and time reversal) in this book. It is physically impossible to prepare
6
see Appendix D.2
74 CHAPTER 2. THE POINCAR
E GROUP
an exact mirror image or a time-reversed image of a laboratory, so the rel-
ativity postulate has nothing to say about such transformations. Indeed, it
has been proven by experiment that these discrete symmetries are not ex-
act. Nevertheless, we will nd it useful to know how generators behave with
respect to space inversions. Suppose we have a classical system S and its in-
version image S
(see Fig. 2.2) with respect to the origin 0. The question is:
how the image S
will be shifted by x.
This can be interpreted as the change of sign of the generator of translation
acquires
velocity v if the original was boosted by v. So, inversion changes the sign
of the boost generator as well
/ (2.32)
Vectors, such as
T and
/, changing their sign after inversion are called true
vectors. However, the generator of rotation
is not a true vector. In-
deed, if we rotate S by angle
, then the image S
E GROUP 75
Derivation of the relativistic group of inertial transformations is a di-
cult task, because we lack the experience of dealing with fast-moving objects
in our everyday life. So, we will use more formal mathematical arguments
instead. In this section we will nd that there is almost a unique way to
obtain the Lie algebra of the Poincare group by generalizing the commuta-
tion relations of the Galilei Lie algebra (2.17) - (2.24), so that they remain
compatible with some simple physical requirements.
2.3.1 Lie algebra of the Poincare group
We can be condent about the validity of Galilei Lie brackets involving gen-
erators of space and time translations and rotations, because properties of
these transformations have been veried in everyday life and in physical ex-
periments over a wide range of involved parameters (distances, times and
angles). The situation with respect to boosts is quite dierent. Normally,
we do not experience high speeds in our life and we lack any physical intu-
ition that was so helpful in deriving the Galilei Lie algebra. Therefore the
arguments that lead us to the Lie brackets (2.22) - (2.24) involving boost gen-
erators may be not exact and these formulas may be just approximations that
can be tolerated only for low-speed observers. So, we will base our derivation
of the relativistic group of inertial transformations on the following ideas.
(I) Just as in the non-relativistic world, the set of inertial transformations
should remain a 10-parameter Lie group. However, Lie brackets in
the exact (Poincare) Lie algebra are expected to be dierent from the
Galilei Lie brackets (2.17) - (2.24).
(II) The Galilei group does a good job in describing the low-speed transfor-
mations and the speed of light c is a natural measure of speed. There-
fore we may guess that the correct Lie brackets should include c as a
parameter and they must tend to the Galilei Lie brackets in the limit
c .
7
(III) We will assume that only Lie brackets involving boosts may be subject
to revision.
7
Note that here we do not assume that c is a limiting speed or that the speed of light is
invariant. These facts will come out as a result of application of our approach to massive
and massless particles in chapter 5.
76 CHAPTER 2. THE POINCAR
E GROUP
(IV) We will further assume that relativistic generators of boosts
/ still
form components of a true vector, so equations (2.19) and (2.32) remain
valid.
Summarizing requirements (I) - (IV), we can write the following relativistic
generalizations for the Lie brackets (2.22) - (2.24)
[/
i
, T
j
] = |
ij
(2.33)
[/
i
, /
j
] = T
ij
[/
i
, 1] = T
i
+1
i
(2.34)
where T
ij
, |
ij
and 1
ij
are some yet unknown linear combinations of genera-
tors. The coecients of these linear combinations must be selected in such a
way that all Lie algebra properties
8
are preserved. Let us try to satisfy these
conditions step by step.
First note that the Lie bracket [/
i
, T
j
] is a 3-tensor. Indeed, using equa-
tion (2.31) we obtain the tensor transformation law (D.15)
e
[/
i
, T
j
]e
=
_
3
k=1
R
ik
(
)/
k
,
3
k=1
R
jl
(
)T
l
_
=
3
kl=1
R
ik
(
)R
jl
(
)[/
k
, T
l
]
Since both
/ and
T change their signs upon inversion, this a true tensor.
Therefore |
ij
must be a true tensor as well. This tensor should be constructed
as a linear function of generators among which we have a true scalar 1, a
pseudovector
and two true vectors
T and
/. According to our discussion
in Appendix D.4, the only way to make a true tensor from these ingredients
is by using formulas in the rst and third rows in table D.1. Therefore, the
most general expression for the Lie bracket (2.33) is
[/
i
, T
j
] = 1
ij
+
3
k=1
ijk
k
8
in particular, the Jacobi identity (E.10)
2.3. POINCAR
E GROUP 77
where and are yet unspecied real constants.
Similar arguments suggest that T
ij
is also a true tensor. Due to the
relationship
[/
i
, /
j
] = [/
j
, /
i
]
this tensor must be antisymmetric with respect to indices i and j. This
excludes the term proportional to
ij
, hence
[/
i
, /
j
] =
3
k=1
ijk
k
,
where is, again, a yet undened constant.
The quantity 1
i
in equation (2.34) must be a true vector, so, the most
general form of the Lie bracket (2.34) is
[/
i
, 1] = (1 + )T
i
+ /
i
.
So, we have reduced the task of generalization of Galilei Lie brackets to
nding just ve real parameters , , , and . To proceed further, let us
rst use the following Jacobi identity
0 = [T
x
, [/
x
, 1]] + [/
x
, [1, T
x
]] + [1, [T
x
, /
x
]]
= [T
x
, /
x
]
= 1
which implies
= 0 (2.35)
Similarly,
0 = [/
x
, [/
y
, T
y
]] + [/
y
, [T
y
, /
x
]] + [T
y
, [/
x
, /
y
]]
= [/
x
, 1] [/
y
,
z
] + [T
y
,
z
]
= (1 + )T
x
/
x
/
x
+ T
x
= ( + + )T
x
( + )/
x
= ( + + )T
x
/
x
78 CHAPTER 2. THE POINCAR
E GROUP
implies
= (1 + ) (2.36)
= 0 (2.37)
The system of equations (2.35) - (2.36) has two possible solutions (in both
cases remains undened)
(i) If ,= 0, then = (1 + ) and = 0.
(ii) If = 0, then = 0 and is arbitrary.
From the condition (II) we know that parameters , , , must depend on
c and tend to zero as c
lim
c
= lim
c
= lim
c
= lim
c
= 0 (2.38)
Additional insight into the values of these parameters may be obtained by
examining their dimensions. To keep the arguments of exponents in (2.16)
dimensionless we must assume the following dimensions (denoted by angle
brackets) of the generators
< 1 > = < time >
1
< T > = < distance >
1
< / > = < speed >
1
< > = < angle >
1
= dimensionless
It then follows that
< > =
< / >
2
< >
=< speed >
2
< > =
< / >< T >
< 1 >
=< speed >
2
< > = < 1 >=< time >
1
< > = dimensionless
2.3. POINCAR
E GROUP 79
and we can satisfy condition (2.38) only by setting = = 0 (i.e., the
choice (i) above) and assuming = c
2
. This approach does not
specify the coecient of proportionality between (and ) and c
2
. To be
in agreement with experimental data we must choose this coecient equal
to 1.
= =
1
c
2
Then the resulting Lie brackets are
[
i
, T
j
] =
3
k=1
ijk
T
k
(2.39)
[
i
,
j
] =
3
k=1
ijk
k
(2.40)
[
i
, /
j
] =
3
k=1
ijk
/
k
(2.41)
[
i
, 1] = 0 (2.42)
[T
i
, T
j
] = [T
i
, 1] = 0 (2.43)
[/
i
, /
j
] =
1
c
2
3
k=1
ijk
k
(2.44)
[/
i
, T
j
] =
1
c
2
1
ij
(2.45)
[/
i
, 1] = T
i
(2.46)
This set of Lie brackets is called the Poincare Lie algebra and it diers from
the Galilei algebra (2.17) - (2.24) only by small terms on the right hand sides
of Lie brackets (2.44) and (2.45). The general element of the corresponding
Poincare group has the form
9
e
Kc
Px
e
Ht
(2.47)
9
Note that here we adhere to the conventional order of basic transformations adopted
in (2.6); from right to left: time translation space translation boost rotation.
80 CHAPTER 2. THE POINCAR
E GROUP
In equation (2.47) we denoted the parameter of boost by c
, where = [
[
is a dimensionless quantity called rapidity. Its relationship to the velocity of
boost v is
v(
) =
c tanh
cosh = (1 v
2
/c
2
)
1/2
The reason for introducing this new quantity is that rapidities of successive
boosts in the same direction are additive, while velocities are not.
10
In spite of their simplicity, equations (2.39) - (2.46) are among the most
important equations in physics and they have such an abundance of exper-
imental conrmations that one cannot doubt their validity. We therefore
accept that the Poincare group is the true mathematical expression of rela-
tionships between dierent inertial laboratories.
Postulate 2.2 (the Poincare group) Transformations between inertial lab-
oratories form the Poincare group.
Even a brief comparison of the Poincare (2.39) - (2.46) and Galilei (2.17)
- (2.24) Lie brackets reveals a number of important new features in the
relativistic theory. For example, due to the Lie bracket (2.44), boosts no
longer form a subgroup. However, boosts together with rotations do form
a 6-dimensional subgroup of the Poincare group which is called the Lorentz
group.
2.3.2 Transformations of translation generators under
boosts
Poincare Lie brackets allow us to derive transformation properties of genera-
tors
T and 1 with respect to boosts. Using Equation (2.25) and Lie brackets
(2.45) - (2.46) we nd that if T
x
and 1 are generators in the reference frame
at rest O, then their counterparts T
x
() and 1() in the reference frame O
E GROUP 81
1() = e
Kxc
1e
Kxc
T
x
() = e
Kxc
T
x
e
Kxc
Taking derivatives of these equations with respect to the parameter
1() = ce
Kxc
(/
x
11/
x
)e
Kxc
= ce
Kxc
T
x
e
Kxc
= cT
x
()
T
x
() = ce
Kxc
(/
x
T
x
T
x
/
x
)e
Kxc
=
1
c
e
Kxc
1e
Kxc
=
1
c
1() (2.48)
and taking a derivative of equation (2.48) again, we obtain a dierential
equation
T
x
() =
1
c
1() = T
x
()
with the general solution
T
x
() = /cosh +B sinh
From the initial conditions we obtain
/ = T
x
(0) = T
x
B =
T
x
()
=0
=
1
c
1
and nally
T
x
() = T
x
cosh
1
c
sinh
Similar calculation shows that
82 CHAPTER 2. THE POINCAR
E GROUP
1() = 1cosh cT
x
sinh (2.49)
T
y
() = T
y
T
z
() = T
z
Similar to our discussion of rotations in subsection D.5, we can nd the
transformation of
T and 1 corresponding to a general boost vector
in
the coordinate-independent form. First we decompose
T into sum of two
vectors
T =
T
+
T
. The vector
T
= (
=
T
T
/c
exp(
/c
) =
T
cosh c
1
1sinh
.
Therefore
= e
Kc
Te
Kc
=
T +
__
_
(cosh 1)
1
c
1sinh
_
(2.50)
1
= e
Kc
1e
Kc
= 1cosh c
_
_
sinh (2.51)
It is clear from (2.50) and (2.51) that boosts perform linear transformations
of components c
_
1
cT
x
cT
y
cT
z
_
_
= B(
)
_
_
1
cT
x
cT
y
cT
z
_
_
.
Explicit form of the matrix B(
,
such that all lattice relations between propositions remain unchanged. In
particular, we will require that K
g
transforms atoms to atoms; K
g
maps
minimal and maximal propositions of O to the minimal and maximal propo-
sitions of O
, respectively
1
We will turn to transformations of states g in the next subsection.
3.1. INERTIAL TRANSFORMATIONS IN QUANTUM MECHANICS 85
K
g
(J) = J (3.2)
K
g
() = (3.3)
and for any X, Y /(1)
K
g
(X Y ) = K
g
(X) K
g
(Y ) (3.4)
K
g
(X Y ) = K
g
(X) K
g
(Y ) (3.5)
K
g
(X
) = K
g
(X)
(3.6)
As discussed in subsection 1.5.2, working with propositions is rather in-
convenient. It would be better to translate conditions (3.2) - (3.6) into the
language of vectors in the Hilbert space. In other words, we would like to nd
a vector-to-vector transformation k
g
: 1 1 which generates the subspace-
to-subspace transformation K
g
. More precisely, we demand that for each
subspace X, if K
g
(X) = Y , then the generator k
g
maps all vectors in X into
vectors in Y , so that Sp(k
g
(x)) = Y , where x runs through all vectors in X.
The problem with nding generators k
g
is that there are just too many
of them. For example, if a ray p goes to the ray K
g
(p), then the generator k
g
must map each vector [x p somewhere inside K
g
(p), but the exact value
of k
g
[x remains undetermined. Actually, we can multiply each image vector
k
g
[x by an arbitrary nonzero factor ([x) and still have a valid generator.
Factors ([x) can be chosen independently for each [x 1. This freedom
is very inconvenient from the mathematical point of view.
This problem was solved by the celebrated Wigners theorem, [Wig31]
which states that we can always select factors ([x) in such a way that the
vector-to-vector mapping ([x)k
g
becomes either unitary (linear) or antiu-
nitary (antilinear).
2
Theorem 3.1 (Wigner) For any isomorphic mapping K
g
of a proposi-
tional lattice /(1) onto itself, one can nd either unitary or antiunitary
transformation k
g
of vectors in the Hilbert space 1, which generates K
g
.
This transformation is dened uniquely up to an unimodular factor. For a
given K
g
only one of these two possibilities (unitary or antiunitary) is real-
ized.
2
See Appendix F.7 for denitions of antilinear and antiunitary operators.
86 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
In this formulation, Wigners theorem has been proven in ref. [Uhl63] (see
also [AD78a]). The signicance of this theorem comes from the fact that
there is a powerful mathematical apparatus for working with unitary and
antiunitary transformations, so that their properties (and, thus, properties
of subspace transformations K
g
) can be studied in great detail.
From our study of inertial transformations in chapter 2, we know that
there is always a continuous path from the identity transformation e =
i
[e
i
e
i
[
4
is a projection (proposition) associated
with the observer O, then observer O
i
U
g
[e
i
e
i
[U
1
g
= U
g
XU
1
g
Similarly, if F =
i
f
i
[e
i
e
i
[ is an operator of observable associated with
the observer O then
3
The antiunitary operators may still represent discrete transformations, e.g., time
inversion, but we agreed not to discuss such transformations in this book, because they
do not correspond to exact symmetries.
4
Here [e
i
is an orthonormal basis in the subspace X.
3.1. INERTIAL TRANSFORMATIONS IN QUANTUM MECHANICS 87
F
i
f
i
U
g
[e
i
e
i
[U
1
g
= U
g
FU
1
g
(3.7)
is operator of the same observable from the point of view of the observer
O
= gO.
3.1.2 Inertial transformations of states
In the preceding subsection we analyzed the eect of an inertial transforma-
tion g on observers, measuring apparatuses, propositions and observables.
Now we are going to examine the eect of g on preparation devices and
states. We will try to answer the following question: if [ is a vector de-
scribing a pure state prepared by the preparation device P, then which state
vector [
= gP?
To nd the connection between [ and [
[F
[U
g
FU
1
g
[
(3.8)
This equation should be valid for any choice of observable F. Let us choose
F = [[, i.e., the projection onto the ray containing vector [. Then
equation (3.8) takes the form
[[ =
[U
g
[[U
1
g
[
[U
g
[
[U
g
[
= [
[U
g
[[
2
The left hand side of this equation is equal to 1. So, for each [, the
transformed vector [
is such that
[
[U
g
[[
2
= 1
88 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
Since both U
g
[ and [
= (g)U
g
[
where (g) is an unimodular factor. Operator U
g
is dened up to a unimod-
ular factor,
5
therefore, we can absorb the factor (g) into the uncertainty of
U
g
and nally write the action of the inertial transformation g on states
[ [
= U
g
[ (3.9)
Then, taking into account the transformation law for observables (3.7) we
can check that, in agreement with the relativity principle, the expectation
values remain the same in all laboratories
F
[F
= ([U
1
g
)(U
g
FU
1
g
)(U
g
[) = [F[
= F (3.10)
3.1.3 Heisenberg and Schrodinger pictures
The conservation of expectation values (3.10) is valid only in the case when
inertial transformation g is applied to the laboratory as a whole. What would
happen if only observer or only preparation device is transformed?
Let us rst consider inertial transformations of observers. If we change
the observer without changing the preparation device (=state) then operators
of observables change according to (3.7) while the state vector remains the
same [. As expected, this transformation changes results of experiments.
For example, the expectation values of observable F are generally dierent
for dierent observers O and O
= gO
F
= [(U
g
FU
1
g
)[ , = [F[ = F (3.11)
On the other hand, if the inertial transformation is applied to the preparation
device and the state of the system changes according to equation (3.9), then
the results of measurements are also aected
5
see subsection 3.1.1
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 89
F
= ([U
1
g
)F(U
g
[) ,= [F[ = F (3.12)
Formulas (3.11) and (3.12) play a prominent role because many problems
in physics can be formulated as questions about descriptions of the same
physical system by dierent observers. An important example is dynamics,
i.e., the time evolution of the system. In this case one considers time trans-
lation elements of the Poincare group g =
E GROUP 91
3.2.2 Elimination of central charges in the Poincare
algebra
In principle, we could keep the arbitrarily chosen unitary representatives of
the subspace transformations U
g
1
, U
g
2
, . . ., as discussed above and work with
thus obtained projective representation of the Poincare group, but this would
result in a rather complicated mathematical formalism. The theory would
be signicantly simpler if we could judiciously choose the representatives
6
in
such a way that the factors (g
2
, g
1
) in (3.16) are simplied or eliminated
altogether. Then we would have a much simpler linear unitary group repre-
sentation (see Appendix H) instead of the projective group representation.
In this subsection we are going to demonstrate that in any projective repre-
sentation of the Poincare group such elimination of factors (g
2
, g
1
) is indeed
possible [CJS63].
The proof of the last statement is signicantly simplied if conditions
(3.17), (3.20) and (3.21) are expressed in the Lie algebra notation. In the
vicinity of the unit element of the group we can use vectors
from the
Poincare Lie algebra to identify other group elements (see equation (E.1)),
i.e.
g = e
= exp
_
10
a=1
t
a
_
where t
a
is the basis of the Poincare Lie algebra (1,
T,
/,
) from subsection
2.3.1. Then we can write unitary representatives U
g
of inertial transforma-
tions g in the form
7
U
= exp
_
10
a=1
a
F
a
_
(3.22)
where is a real constant which will be left unspecied at this point,
8
and
F
a
are ten Hermitian operators in the Hilbert space 1 called the generators
6
i.e., multiply unitary operators U
g
by some unimodular factors U
g
(g)U
g
7
Here we have used Stones theorem H.2. The nature of one-parameter subgroups
featuring in the theorem is rather obvious. These are subgroups of similar transformations,
i.e., a subgroup of space translations, a subgroup of rotations about the z-axis, etc.
8
We will identify with the Planck constant in subsection 4.1.1
92 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
of the unitary projective representation. Then we can write equation (3.16)
in the form
U
= (
)U
(3.23)
Since is unimodular we can set (
) = exp[i(
)], where (
) is a
real function. Conditions (3.20) and (3.21) then can be rewritten in terms of
0) = (
0,
) = 0 (3.24)
(
) + ( ,
) = (
) + ( ,
) (3.25)
Note that we can write the lowest order term in the Taylor series for near
the group identity element in the form
9
(
) =
10
ab=1
h
ab
b
(3.26)
The constant term, the terms linear in
a
and
b
, as well as the terms propor-
tional to
a
b
and
a
b
are absent on the right hand of (3.26) as a consequence
of the condition (3.24).
Using the same arguments as during our derivation of equation (E.6), we
can expand all terms in (3.23) around
=
=
0
_
1
i
10
a=1
a
F
a
1
2
2
10
bc=1
c
F
bc
+ . . .
__
1
i
10
a=1
a
F
a
1
2
2
10
bc=1
c
F
bc
+ . . .
_
=
_
1 + i
10
ab=1
h
ab
b
+ . . .
_
_
1
i
10
a=1
_
a
+
a
+
10
bc=1
f
a
bc
c
+ . . .
_
F
a
1
2
2
10
ab=1
(
a
+
a
+ . . .)(
b
+
b
+ . . .)F
ab
+ . . .
_
Equating the coecients multiplying products
a
b
on both sides, we obtain
9
see also the third term on the right hand side of equation (E.4)
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 93
1
2
2
(F
ab
+ F
ba
) =
1
2
F
a
F
b
ih
ab
+
i
10
c=1
f
c
ab
F
c
The left hand side of this equation is symmetric with respect to interchange
of indices a b. The same should be true for the right hand side. From this
condition we obtain commutators of generators F
F
a
F
b
F
b
F
a
= i
10
c=1
C
c
ab
F
c
+ E
ab
(3.27)
where C
c
ab
f
c
ab
f
c
ba
are familiar structure constants of the Poincare Lie
algebra (2.39) - (2.46) and E
ab
= i
2
(h
ab
h
ba
) are imaginary constants,
which depend on our original choice of representatives U
g
in rays [U
g
].
10
These constants are called central charges. Our main task in this subsection
is to prove that representatives U
g
can be chosen in such a way that E
ab
= 0,
i.e., the central charges get eliminated.
First we consider the original (arbitrary) set of representatives U
g
. In
accordance with our notation in section 2.3 we will use symbols
(
H,
P,
J,
K) (3.28)
to denote ten generators F
a
of the projective representation U
g
.
11
These
generators correspond to time translation, space translation, rotations and
boosts, respectively. Then using the structure constants C
a
bc
of the Poincare
Lie algebra from equations (2.39) - (2.46) we obtain the full list of commu-
tators (3.27).
[
J
i
,
P
j
] = i
3
k=1
ijk
P
k
+ E
(1)
ij
(3.29)
10
To be exact, we must write E
ab
on the right hand side of equation (3.27) multiplied
by the identity operator I. However, we will omit the symbol I here for brevity.
11
Note that generators (1,
T,
/,
) in subsection 2.3.1 were abstract quantities
that could be interpreted as derivatives of group transformations, while generators
(
H,
P,
J,
K) here are Hermitian operators in the Hilbert space of states of our physical
system.
94 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
[
J
i
,
J
j
] = i
3
k=1
ijk
(
J
k
+ iE
(2)
k
) (3.30)
[
J
i
,
K
j
] = i
3
k=1
ijk
K
k
+ E
(3)
ij
(3.31)
[
P
i
,
P
j
] = E
(4)
ij
(3.32)
[
J
i
,
H] = E
(5)
i
(3.33)
[
P
i
,
H] = E
(6)
i
(3.34)
[
K
i
,
K
j
] = i
c
2
3
k=1
ijk
(
J
k
+ iE
(7)
k
) (3.35)
[
K
i
,
P
j
] = i
c
2
H
ij
+ E
(8)
ij
, (3.36)
[
K
i
,
H] = i
P
i
+ E
(9)
i
(3.37)
Here we arranged E
ab
into nine sets of central charges E
(1)
. . . E
(9)
. In equa-
tion (3.30) and (3.35) we took into account that their left hand sides are
antisymmetric tensors. So, the central charges must form antisymmetric
tensors as well and, according to Table D.1, they can be represented as
3
k=1
ijk
E
(2)
k
and c
2
3
k=1
ijk
E
(7)
k
, respectively, where E
(2)
k
and E
(7)
k
are 3-vectors.
Next we will use the requirement that commutators (3.29) - (3.37) must
satisfy the Jacobi identity.
12
This will allow us to make some simplications.
For example, using
13
P
3
=
i
[
J
1
,
P
2
] +
i
E
(1)
12
and the fact that all constants
E commute with generators of the group, we obtain
[
P
3
,
P
1
] =
i
[([
J
1
,
P
2
] + E
(1)
12
),
P
1
] =
i
[[
J
1
,
P
2
],
P
1
]
=
i
[[
P
1
,
P
2
],
J
1
]
i
[[
J
1
,
P
1
],
P
2
] =
i
[E
(4)
12
,
J
1
]
i
[E
(1)
11
,
P
2
]
= 0
so E
(4)
31
= 0. Similarly, we can show that E
(4)
ij
= E
(5)
i
= E
(6)
i
= 0 for all values
of indices i, j = 1, 2, 3.
12
equation (E.10), which is equivalent to the associativity condition (3.15)
13
Here we used equation (3.29).
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 95
Using the Jacobi identity we further obtain
i[
J
3
,
P
3
] = [[
J
1
,
J
2
],
P
3
] = [[
P
3
,
J
2
],
J
1
] + [[
J
1
,
P
3
],
J
2
]
= i[
J
1
,
P
1
] + i[
J
2
,
P
2
] (3.38)
and, similarly,
i[
J
1
,
P
1
] = i[
J
2
,
P
2
] + i[
J
3
,
P
3
] (3.39)
By adding equations (3.38) and (3.39) we see that
[
J
2
,
P
2
] = 0 (3.40)
Similarly, we obtain [
J
1
,
P
1
] = [
J
3
,
P
3
] = 0, which means that
E
(1)
ii
= 0 (3.41)
Using the Jacobi identity again, we obtain
i[
J
2
,
P
3
] = [[
J
3
,
J
1
],
P
3
] = [[
P
3
,
J
1
],
J
3
] + [[
J
3
,
P
3
],
J
1
]
= i[
J
3
,
P
2
]
This antisymmetry property is also true in the general case (for any i, j =
1, 2, 3; i ,= j)
[
J
i
,
P
j
] = [
J
j
,
P
i
] (3.42)
Putting together (3.40) and (3.42) we see that tensor [
J
i
,
P
j
] is antisymmetric.
This implies that we can introduce a vector E
(1)
k
such that
E
(1)
ij
=
3
i=1
ijk
E
(1)
k
[
J
i
,
P
j
] = i
3
i=1
ijk
(
P
k
+ iE
(1)
k
) (3.43)
Similarly, we can show that E
(3)
ii
= 0 and
96 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
[
J
i
,
K
j
] = i
3
i=1
ijk
(
K
k
+ iE
(3)
k
)
Taking into account the above results, commutation relations (3.29)-
(3.37) now take the form
[
J
i
,
P
j
] = i
3
k=1
ijk
(
P
k
+ iE
(1)
k
) (3.44)
[
J
i
,
J
j
] = i
3
k=1
ijk
(
J
k
+ iE
(2)
k
) (3.45)
[
J
i
,
K
j
] = i
3
k=1
ijk
(
K
k
+ iE
(3)
k
) (3.46)
[
P
i
,
P
j
] = [
J
i
,
H] = [
P
i
,
H] = 0 (3.47)
[
K
i
,
K
j
] = i
c
2
3
k=1
ijk
(
J
k
+ iE
(7)
k
) (3.48)
[
K
i
,
P
j
] = i
c
2
H
ij
+ E
(8)
ij
, (3.49)
[
K
i
,
H] = i
P
i
+ E
(9)
i
(3.50)
where E on the right hand sides are certain imaginary constants. The next
step in elimination of the central charges E is to use the freedom of choos-
ing unimodular factors (g) in front of operators of the representation U
g
:
Two unitary operators U
and (
)U
)
generate the same subspace transformation K
) are
unimodular, we can write
(
) = exp(i(
)) 1 + i
10
a=1
R
a
a
Therefore, in the rst order, the presence of factors (
) results in adding
some real constants R
a
to generators F
a
. We would like to show that by
adding such constants we can make all central charges equal to zero.
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 97
Let us now add constants R to the generators
P
j
,
J
j
and
K
j
and denote
the redened generators as
P
j
=
P
j
+ R
(1)
j
J
j
=
J
j
+ R
(2)
j
K
j
=
K
j
+ R
(3)
j
Then commutator (3.45) takes the form
[J
i
, J
j
] = [
J
i
+ R
(2)
i
,
J
j
+ R
(2)
j
] = [
J
i
,
J
j
] = i
3
k=1
ijk
(
J
k
+ iE
(2)
k
)
So, if we choose R
(2)
k
= iE
(2)
k
, then
[J
i
, J
j
] = i
3
k=1
ijk
J
k
and central charges are eliminated from this commutator.
Similarly, central charges can be eliminated from commutators
[J
i
, P
j
] = i
3
k=1
ijk
P
k
[J
i
, K
j
] = i
3
k=1
ijk
K
k
(3.51)
by choosing R
(1)
k
= iE
(1)
k
and R
(3)
k
= iE
(3)
k
. From equation (3.51) we then
obtain
[K
1
, K
2
] =
i
[[J
2
, K
3
], K
2
] =
i
[[J
2
, K
2
], K
3
]
i
[[K
2
, K
3
], J
2
]
=
i
[
i
c
2
(J
1
+ iE
(7)
1
), J
2
] =
i
c
2
J
3
98 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
so, our choice of the constants R
(1)
k
, R
(2)
k
and R
(3)
k
eliminates the central
charges E
(7)
i
.
From equation (3.51) we also obtain
[K
3
,
H] =
i
[[J
1
, K
2
],
H] =
i
[[
H, K
2
], J
1
]
i
[[J
1
,
H], K
2
]
= [J
1
, P
2
] = iP
3
which implies that the central charge E
(9)
is canceled as well. Finally
[K
1
, P
2
] =
i
[[J
2
, K
3
], P
2
] =
i
[[J
2
, P
2
], K
3
] +
i
[[K
3
, P
2
], J
3
] = 0
[K
1
, P
1
] =
i
[[J
2
, K
3
], P
1
] =
i
[[J
2
, P
1
], K
3
] +
i
[[K
3
, P
1
], J
3
]
= [K
3
, P
3
]
It then follows that E
(8)
ij
= 0 if i ,= j and we can introduce a real scalar E
(8)
such that
E
(8)
11
= E
(8)
22
= E
(8)
33
i
c
2
E
(8)
[K
i
, P
i
] =
i
c
2
ij
(
H + E
(8)
)
Finally, by redening the generator of time translations H =
H + E
(8)
we
eliminate all central charges from commutation relations of the Poincare Lie
algebra
[J
i
, P
j
] = i
3
k=1
ijk
P
k
(3.52)
[J
i
, J
j
] = i
3
k=1
ijk
J
k
(3.53)
[J
i
, K
j
] = i
3
k=1
ijk
K
k
(3.54)
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 99
[P
i
, P
j
] = [J
i
, H] = [P
i
, H] = 0 (3.55)
[K
i
, K
j
] =
i
c
2
3
k=1
ijk
J
k
(3.56)
[K
i
, P
j
] =
i
c
2
H
ij
(3.57)
[K
i
, H] = iP
i
(3.58)
Thus Hermitian operators H, P, J and K provide a representation of the
Poincare Lie algebra and the redened unitary operators (g)U
g
form a
unique unitary representation of the Poincare group that corresponds to the
given projective representation U
g
in the vicinity of the group identity. We
have proven that projective representations of the Poincare group are equiv-
alent to certain unitary representations, which are much easier objects for
study (see Appendix H).
Commutators (3.52) - (3.58) are probably the most important equations
of relativistic quantum theory. In the rest of this book we will have many
opportunities to appreciate a deep physical content of these formulas.
3.2.3 Single-valued and double-valued representations
In the preceding subsection we eliminated the phase factors (g
2
, g
1
) from
equation (3.16) by resorting to Lie algebra arguments. However, these ar-
guments work only in the vicinity of the groups unit element. There is a
possibility that non-trivial phase factors may reappear in the multiplication
law (3.16) when the group manifold has a non-trivial topology and group
elements are considered which are far from the unit element.
In Appendix H.4 we established that this possibility is realized in the case
of the rotation group. This means that for quantum-mechanical applications
we need to consider both single-valued and double-valued representations of
this group. Since the rotation group is a subgroup of the Poincare group, the
same conclusion is relevant for the Poincare group: both single-valued and
double-valued unitary representations should be considered.
14
In chapter 5
we will see that these two cases correspond to integer-spin and half-integer-
spin systems, respectively.
14
Equivalently, one can choose to consider all single-valued representations of the uni-
versal covering group of the Poincare group.
100 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
3.2.4 Fundamental statement of relativistic quantum
theory
The most important result of this chapter is the connection between relativity
and quantum mechanics summarized in the following statement (see, e.g.,
[Wei95])
Statement 3.2 (Unitary representations of the Poincare group) In a
relativistic quantum description of a physical system, inertial transforma-
tions are represented by unitary operators which furnish a unitary (single- or
double-valued) representation of the Poincare group in the Hilbert space of
the system.
It is important to note that this statement is completely general. The Hilbert
space of any isolated physical system (no matter how complex) must carry a
unitary representation of the Poincare group. Construction of Hilbert spaces
and Poincare group representations in them is the major part of theoretical
description of physical systems. The rest of this book is primarily devoted
to performing these dicult tasks.
Basic inertial transformations from the Poincare group are represented in
the Hilbert space by unitary operators: e
Pr
for spatial translations, e
for rotations, e
ic
Ht
for time translations,
15
A gen-
eral inertial transformation g =
, v(
ic
Pr
e
i
Ht
(3.59)
We will frequently use notation
U
g
U(
; r, t) U(; r, t) (3.60)
where is a Lorentz transformation of inertial frames that combines boost
and rotation
. Then, in the Schr odinger picture
17
state vectors transform
between dierent inertial reference frames according to
18
15
The exponential form of the unitary group representatives follows from equation (3.22).
16
compare with equation (2.16)
17
see subsection 3.1.3
18
We will see in subsection 5.2.4 that this is active transformation of states. In most
physical applications one is interested in passive transformations of states (i.e., how the
3.2. UNITARY REPRESENTATIONS OF THE POINCAR
E GROUP 101
[
= U
g
[ (3.61)
In the Heisenberg picture inertial transformations of observables have the
form
F
= U
g
FU
1
g
(3.62)
For example, the equation describing the time evolution of the observable F
in the Heisenberg picture
19
F(t) = e
i
Ht
Fe
Ht
(3.63)
= F +
i
[H, F]t
1
2
2
[H, [H, F]]t
2
+ . . . (3.64)
can be also written in a dierential form
dF(t)
dt
=
i
[H, F]
which is the familiar Heisenberg equation.
Note also that analogous Heisenberg equations can be written for trans-
formations of observables with respect to space translations, rotations and
boosts
dF(r)
dr
=
i
[P, F]
dF(
)
d
=
i
[J, F]
dF(
)
d
=
ic
[K, F] (3.65)
same state is seen by two dierent observers), which are given by the inverse operator
U
1
g
.
19
see equation (E.13)
102 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY
We already discussed the point that transformations of observables with
respect to inertial transformations of observers cover many interesting prob-
lems in physics (the time evolution, boost transformations, etc.). From the
above formulas we see that solution of these problems requires the knowledge
of commutators between observables F and generators (H, P, J and K) of the
relevant Poincare group representation. In the next chapter we will discuss
denitions of various observables, their connections to Poincare generators
and their commutation relations.
Chapter 4
OPERATORS OF
OBSERVABLES
Throwing pebbles into the water, look at the ripples they form on
the surface, otherwise, such occupation becomes an idle pastime.
Kozma Prutkov
In chapters 1 and 3 we established that in quantum theory any physical sys-
tem is described by a complex Hilbert space 1, pure states are represented
by rays in 1, observables are represented by Hermitian operators in 1 and
there is a unitary representation U
g
of the Poincare group in 1 which de-
termines how state vectors and operators of observables change when the
preparation device or the measuring apparatus undergoes an inertial trans-
formation. Our next goal is to clarify the structure of the set of observables.
In particular, we wish to nd which operators correspond to such familiar ob-
servables as velocity, momentum, energy, mass, position, etc, what are their
spectra and what are the relationships between these operators? We will also
nd out how these observables change under inertial transformations from
the Poincare group. This implies that we will use the Heisenberg picture
everywhere in this chapter.
We should stress that physical systems considered in this chapter are
completely arbitrary: they can be either elementary particles or compound
systems of many elementary particles or even systems (such as unstable par-
ticles) in which the number of particles is not precisely dened. The only
103
104 CHAPTER 4. OPERATORS OF OBSERVABLES
signicant requirement is that our system must be isolated, i.e., its interac-
tion with the rest of the universe can be neglected.
In this chapter we will focus on observables whose operators can be ex-
pressed as functions of generators (P, J, K, H) of the Poincare group rep-
resentation U
g
. In chapter 8, we will meet other observables, such as the
number of particles. They cannot be expressed through ten generators of the
Poincare group.
4.1 Basic observables
4.1.1 Energy, momentum and angular momentum
The generators of the Poincare group representation in the Hilbert space of
any system are Hermitian operators H, P, J and K and we might suspect
that they are related to certain observables pertinent to this system. What
are these observables? In order to get a hint, let us now postulate that the
constant introduced in subsection 3.2.2 is the Planck constant
= 6.626 10
34
kg m
2
s
(4.1)
whose dimension can be also expressed as < >=< mass >< speed ><
distance >. Then the dimensions of generators can be found from the con-
dition that the arguments of exponents in (3.59) must be dimensionless
< H >=
<>
<time>
=< mass >< speed >
2
;
< P >=
<>
<distance>
=< mass >< speed >;
< J >=< >=< mass >< speed >< distance >
< K >=
<>
<speed>
=< mass >< distance >;
Based on these dimensions we can guess that we are dealing with observables
of energy (or Hamiltonian) H, momentum P, and angular momentum
J of the system.
1
We will call them basic observables. Operators H, P
1
There is no common observable directly associated with the boost generator K, but
we will see later that K is intimately related to systems position and spin.
4.1. BASIC OBSERVABLES 105
and J generate transformations of the system as a whole, so we will assume
that these are observables for the entire system, i.e., the total energy, the
total momentum and the total angular momentum. Of course, these dimen-
sionality considerations are not a proof. The justication of these choices
will become more clear later, when we consider properties of operators and
relations between them.
Using this interpretation and commutators in the Poincare Lie algebra
(3.52) - (3.58), we immediately obtain commutation relations between op-
erators of observables. Then we know which pairs of observables can be
simultaneously measured.
2
For example, we see from (3.55) that energy
is simultaneously measurable with the momentum and angular momentum.
From (3.53) it is clear that dierent components of the angular momentum
cannot be measured simultaneously. These facts are well-known in non-
relativistic quantum mechanics. Now we have them as direct consequences
of the principle of relativity and the Poincare group structure.
From commutators (3.52) - (3.58) we can also nd formulas for transfor-
mations of operators H, P, J and K from one inertial frame to another. For
example, each vector observable F = P, J or K transforms under rotations
as
3
F(
) = e
Fe
i
= Fcos +
_
F
_
(1 cos )
_
F
_
sin
(4.2)
The boost transformation law for generators of translations is
4
P() = e
ic
Pe
ic
= P+
_
P
_
(cosh 1)
c
H sinh
(4.3)
H() = e
ic
He
ic
= H cosh c
_
P
_
sinh (4.4)
2
i.e., they have a common basis of eigenvectors, as explained in Appendix G.2
3
see equation (D.21)
4
see equations (2.50) and (2.51)
106 CHAPTER 4. OPERATORS OF OBSERVABLES
It also follows from (3.55) that energy H, momentum P and angular mo-
mentum J do not depend on time, i.e., they are conserved observables.
4.1.2 Operator of velocity
The operator of velocity is dened as
5
(see, e.g., [AW75, Jor77])
V
Pc
2
H
(4.5)
Denoting V() the velocity measured in the frame of reference moving with
the speed v = c tanh along the x-axis, we obtain
V
x
() = e
ic
Kx
P
x
c
2
H
e
ic
Kx
=
c
2
P
x
cosh cH sinh
H cosh cP
x
sinh
=
c
2
P
x
H
1
c tanh
1 cP
x
H
1
tanh
=
V
x
v
1 V
x
v/c
2
(4.6)
V
y
() =
V
y
(1
Vx
c
tanh ) cosh
=
V
y
_
1 v
2
/c
2
1 V
x
v/c
2
, (4.7)
V
z
() =
V
z
(1
Vx
c
tanh ) cosh
=
V
z
_
1 v
2
/c
2
1 V
x
v/c
2
(4.8)
These formulas coincide with the usual relativistic law of addition of veloci-
ties. In the limit c they reduce to the familiar non-relativistic form
V
x
(v) = V
x
v
V
y
(v) = V
y
V
z
(v) = V
z
4.2 Casimir operators
Observables H, P, V and J depend on the observer, so they do not represent
intrinsic fundamental properties of the system. For example, if a system has
5
The ratio of operators is well-dened here because P and H commute with each other.
4.2. CASIMIR OPERATORS 107
momentum p R
3
in one frame of reference, then, according to (4.3), there
are other (moving) frames of reference in which momentum takes any other
value from R
3
. The measured momentum depends on both the state of the
system and the reference frame in which the observation is made. Are there
observables which reect some intrinsic observer-independent properties of
the system? If there are such observables, then their operators (they are
called Casimir operators) must commute with all generators of the Poincare
group. It can be shown that the Poincare group has only two independent
Casimir operators [FN94]. Any other Casimir operator of the Poincare group
is a function of these two. So, there are two invariant physical properties of
any physical system. One such property is mass, which is a measure of the
matter content in the system. The corresponding Casimir operator will be
considered in subsection 4.2.2. Another invariant property is related to the
speed of rotation of the system around its own axis or spin.
6
The Casimir
operator corresponding to this invariant property will be found in subsection
4.2.3.
4.2.1 4-vectors
Before addressing Casimir operators, let us introduce some useful deni-
tions. We will call a quadruple of operators (/
0
, /
x
, /
y
, /
z
) a 4-vector
7
if
(/
x
, /
y
, /
z
) is a 3-vector, /
0
is a 3-scalar and their commutators with the
boost generators are
[/
i
, /
j
] =
i
c
/
0
ij
(i, j = x, y, z) (4.9)
[
/, /
0
] =
i
c
/ (4.10)
Then, it is easy to show that the 4-square
/
2
/
2
x
+ /
2
y
+ /
2
z
/
2
0
of the
4-vector
/ is a 4-scalar, i.e., it commutes with both rotations and boosts.
For example
6
The invariance of the absolute value of spin is evident for macroscopic freely moving
objects. Indeed, no matter how we translate, rotate or boost the frame of reference we
cannot stop the spinning motion of the system or force it to spin in the opposite direction.
7
see also Appendix I.1
108 CHAPTER 4. OPERATORS OF OBSERVABLES
[K
x
,
/
2
] = [K
x
, /
2
x
+/
2
y
+/
2
z
/
2
0
]
=
i
c
(/
x
/
0
+/
0
/
x
/
0
/
x
/
x
/
0
) = 0
Therefore, in order to nd the Casimir operators of the Poincare group we
should be looking for two functions of the Poincare generators, which are
4-vectors and, in addition, commute with H and P. Then 4-squares of these
4-vectors are guaranteed to commute with all Poincare generators.
4.2.2 Operator of mass
It follows from (4.2) - (4.4) that four operators (H, cP) satisfy all condi-
tions specied in subsection 4.2.1 for 4-vectors. These operators are called
the energy-momentum 4-vector. Then we can construct the rst Casimir
invariant called the mass operator as the 4-square of this 4-vector
M = +
1
c
2
H
2
P
2
c
2
(4.11)
The operator of mass must be Hermitian, therefore we demand that for any
physical system H
2
P
2
c
2
0, i.e., that the spectrum of operator H
2
P
2
c
2
does not contain negative values. Honoring the fact that masses of all known
physical systems are non-negative we choose the positive value of the square
root in (4.11). Then the relationship between energy, momentum and mass
takes the form
H = +
P
2
c
2
+ M
2
c
4
(4.12)
In the non-relativistic limit (c ) we obtain from equation (4.12)
H Mc
2
+
P
2
2M
which is the sum of the famous Einsteins rest mass energy E = Mc
2
and
the usual kinetic energy term P
2
/(2M).
4.2. CASIMIR OPERATORS 109
4.2.3 Pauli-Lubanski 4-vector
The second 4-vector commuting with H and P is the Pauli-Lubanski operator
whose components are dened as
8
W
0
= (P J) (4.13)
W =
1
c
HJ c[PK] (4.14)
Let us check that all required 4-vector properties are, indeed, satised for
(W
0
, W). We can immediately observe that
[J, W
0
] = 0
so W
0
is a scalar. Moreover, W
0
changes its sign after changing the sign of
P so it is a pseudoscalar. W is a pseudovector, because it does not change
its sign after changing the signs of K and P and
[J
i
, W
j
] = i
3
k=1
ijk
W
k
Let us now check the commutators with boost generators
[K
x
, W
0
] = [K
x
, P
x
J
x
+ P
y
J
y
+ P
z
J
z
]
= i
_
HJ
x
c
2
P
y
K
z
+ P
z
K
y
_
=
i
c
W
x
(4.15)
[K
x
, W
x
] =
_
K
x
,
HJ
x
c
cP
y
K
z
+ cP
z
K
y
_
=
i
c
(P
x
J
x
P
y
J
y
P
z
J
z
) =
i
c
W
0
(4.16)
[K
x
, W
y
] =
_
K
x
,
HJ
y
c
cP
z
K
x
+ cP
x
K
z
_
=
i
c
(HK
z
P
x
J
y
HK
z
+ P
x
J
y
) = 0 (4.17)
[K
x
, W
z
] = 0 (4.18)
8
These denitions involve products of Hermitian commuting operators, therefore oper-
ators W
0
and W are guaranteed to be Hermitian.
110 CHAPTER 4. OPERATORS OF OBSERVABLES
Putting equations (4.15) - (4.18) together we obtain the characteristic 4-
vector relations (4.9) - (4.10)
[K, W
0
] =
i
c
W (4.19)
[K
i
, W
j
] =
i
c
ij
W
0
(4.20)
Next we need to verify that commutators with generators of translations
(H, P) are all zero. First, for W
0
we obtain
[W
0
, H] = [P J, H] = 0
[W
0
, P
x
] = [J
x
P
x
+ J
y
P
y
+ J
z
P
z
, P
x
] = P
y
[J
y
, P
x
] + P
z
[J
z
, P
x
]
= iP
y
P
z
+ iP
z
P
y
= 0
For the vector part W we obtain
[W, H] = c[[PK], H] = c[[P, H] K] c[P[K, H]] = 0
[W
x
, P
x
] =
1
c
[HJ
x
, P
x
] c[[PK]
x
, P
x
] = c[P
y
K
z
P
z
K
y
, P
x
] = 0
[W
x
, P
y
] =
1
c
[HJ
x
, P
y
] c[[PK]
x
, P
y
] =
i
c
HP
z
c[P
y
K
z
P
z
K
y
, P
y
]
=
i
c
HP
z
i
c
HP
z
= 0
This completes the proof that the 4-square of the Pauli-Lubanski 4-vector
2
= W
2
W
2
0
is a Casimir operator. Although operators (W
0
, W) do not have direct phys-
ical interpretation, we will nd them very useful in the next section for de-
riving the operators of position R and spin S. For these calculations we will
need commutators between components of the Pauli-Lubanski 4-vector. For
example,
4.3. OPERATORS OF SPIN AND POSITION 111
[W
x
, W
y
] =
_
W
x
,
HJ
y
c
+ cP
x
K
z
cP
z
K
x
_
= i
_
HW
z
c
W
0
P
z
_
[W
0
, W
x
] =
_
W
0
,
HJ
x
c
cP
y
K
z
+ cP
z
K
y
_
= iP
y
W
z
+ iP
z
W
y
= i[P W]
x
The above equations are easily generalized for all components
[W
i
, W
j
] =
i
c
3
k=1
ijk
(HW
k
cW
0
P
k
) (4.21)
[W
0
, W
j
] = i[P W]
j
(4.22)
4.3 Operators of spin and position
Now we are ready to tackle the problem of nding expressions for spin and
position as functions of the Poincare group generators [Pry48, NW49, Ber65,
Jor80].
4.3.1 Physical requirements
We will be looking for the total spin operator S and the center-of-mass posi-
tion operator R which have the following natural properties:
(I) Owing to the similarity between spin and angular momentum,
9
we
demand that S is a pseudovector (just like J)
9
It is often stated that spin is a purely quantum-mechanical observable which does not
have a classical counterpart. We do not share this point of view. From classical mechanics
we know that the total angular momentum of a body is a sum of two parts. The rst part
is the angular momentum resulting from the linear movement of the body as a whole with
respect to the observer. The second part is related to the rotation of the body around its
own axis, or spin. The only signicant dierence between classical and quantum intrinsic
angular momenta (spins) is that the latter has a discrete spectrum, while the former is
continuous. In addition, components of the quantum spin operator do not commute with
each other.
112 CHAPTER 4. OPERATORS OF OBSERVABLES
[J
j
, S
i
] = i
3
k=1
ijk
S
k
(II) and that components of S satisfy the same commutation relations as
components of J (3.53)
[S
i
, S
j
] = i
3
k=1
ijk
S
k
(4.23)
(III) We also demand that spin can be measured simultaneously with mo-
mentum
[P, S] = 0
(IV) and with position
[R, S] = 0 (4.24)
(V) From the physical meaning of R it follows that space translations of
the observer simply shift the values of position.
e
Pxa
R
x
e
i
Pxa
= R
x
a
e
Pxa
R
y
e
i
Pxa
= R
y
e
Pxa
R
z
e
i
Pxa
= R
z
This implies the following commutation relations
[R
i
, P
j
] = i
ij
(4.25)
(VI) Finally, we will assume that position is a true vector
[J
i
, R
j
] = i
3
k=1
ijk
R
k
(4.26)
4.3. OPERATORS OF SPIN AND POSITION 113
4.3.2 Spin operator
Now we would like to make the following guess about the form of the spin
operator
10
S =
W
Mc
W
0
P
M(Mc
2
+ H)
(4.27)
=
HJ
Mc
2
[PK]
M
P(P J)
(H + Mc
2
)M
(4.28)
which is a pseudovector commuting with P as required by the above con-
ditions (I) and (III). Next we are going to verify that condition (II) is also
valid for this operator. To calculate the commutators (4.23) between spin
components we denote
F
1
M(Mc
2
+ H)
(4.29)
use commutators (4.21) and (4.22), the equality
(P W) =
1
c
H(P J) =
1
c
HW
0
(4.30)
and equation (D.17). Then
[S
x
, S
y
] =
_
FW
0
P
x
+
W
x
Mc
, FW
0
P
y
+
W
y
Mc
_
= i
_
FP
x
[PW]
y
Mc
+
FP
y
[PW]
x
Mc
+
HW
z
cW
0
P
z
M
2
c
3
_
= i
_
F[P[PW]]
z
Mc
+
HW
z
cW
0
P
z
M
2
c
3
_
= i
_
F(P
z
(P W) W
z
P
2
)
Mc
+
HW
z
cW
0
P
z
M
2
c
3
_
10
Note that operator S has the mass operator M in the denominator, so expressions
(4.27) and (4.28) have mathematical sense only for systems with strictly positive mass
spectrum.
114 CHAPTER 4. OPERATORS OF OBSERVABLES
= i
_
F(P
z
HW
0
c
1
W
z
P
2
)
Mc
+
HW
z
cW
0
P
z
M
2
c
3
_
= iW
z
_
P
2
F
Mc
+
H
M
2
c
3
_
+ iP
z
W
0
_
HF
Mc
2
1
M
2
c
2
_
For the expressions in parentheses we obtain
P
2
F
Mc
+
H
M
2
c
3
=
P
2
M
2
c(Mc
2
+ H)
+
H
M
2
c
3
=
H(Mc
2
+ H) P
2
c
2
M
2
c
3
(Mc
2
+ H)
=
H(Mc
2
+ H) (Mc
2
+ H)(H Mc
2
)
M
2
c
3
(Mc
2
+ H)
=
1
Mc
HF
Mc
2
1
M
2
c
2
=
H
M
2
c
2
(Mc
2
+ H)
1
M
2
c
2
=
H (Mc
2
+ H)
M
2
c
2
(Mc
2
+ H)
=
1
M(Mc
2
+ H)
= F
Thus, property (4.23) follows
[S
x
, S
y
] = i
_
W
z
Mc
+ FW
0
P
z
_
= iS
z
Let us now prove that spin squared S
2
is a function of M
2
and
2
, i.e., a
Casimir operator
S
2
=
_
W
Mc
+ W
0
PF
_
2
=
W
2
M
2
c
2
+
2W
0
F(P W)
Mc
+ W
2
0
P
2
F
2
=
W
2
M
2
c
2
+ W
2
0
F
_
2H
Mc
2
+ P
2
F
_
=
W
2
M
2
c
2
+ W
2
0
F
2H(Mc
2
+ H) P
2
c
2
Mc
2
(Mc
2
+ H)
=
W
2
M
2
c
2
W
2
0
H
2
+ 2HMc
2
+ M
2
c
4
M
2
c
2
(Mc
2
+ H)
2
=
W
2
W
2
0
M
2
c
2
=
2
M
2
c
2
So far we guessed the form of the spin operator and veried that the required
properties are satised. In subsection 4.3.6 we will demonstrate that S is the
unique operator satisfying all conditions from subsection 4.3.1.
4.3. OPERATORS OF SPIN AND POSITION 115
Sometimes it is convenient to use the operator of spins projection on
momentum (S P)/P that is called helicity. This operator is related to the
0-th component of the Pauli-Lubanski 4-vector
(P S) =
(P J)H
Mc
2
P
2
(P J)(H Mc
2
)
P
2
Mc
2
= (P J) = W
0
(4.31)
4.3.3 Position operator
Now we are going to switch to the derivation of the position operator. Here
we will follow a route similar to that for S: we will rst guess the form of the
operator R and then in subsection 4.3.7 we will prove that this is the unique
expression satisfying all requirements from subsection 4.3.1. Our guess for
R is the Newton-Wigner position operator
11
[Pry48, NW49, Ber65, Jor80,
Can65]
R =
c
2
2
(H
1
K+KH
1
)
c
2
[PS]
H(Mc
2
+ H)
(4.32)
=
c
2
H
K
ic
2
P
2H
2
c[PW]
MH(Mc
2
+ H)
(4.33)
which is a true vector having properties (V) and (VI), e.g.,
[R
x
, P
x
] =
c
2
2
[(H
1
K
x
+ K
x
H
1
), P
x
] =
i
2
(H
1
H + HH
1
) = i
[R
x
, P
y
] =
c
2
2
[(H
1
K
x
+ K
x
H
1
), P
y
] = 0
Let us now calculate
12
J [RP] = J +
c
2
H
[KP] +
c
2
[[PS] P]
H(Mc
2
+ H)
11
Similarly to the operator of spin, the Newton-Wigner position operator is dened only
for systems whose mass spectrum is strictly positive.
12
Note that [K
x
P
y
K
y
P
x
, H] = i(P
x
P
y
P
y
P
x
) = 0, therefore [KP] commutes
with H and operator H
1
[KP] is Hermitian.
116 CHAPTER 4. OPERATORS OF OBSERVABLES
= J +
c
2
H
[KP]
c
2
(P(P S) SP
2
)
H(Mc
2
+ H)
= J +
c
2
H
[KP]
(c
2
P(P S) S(H Mc
2
)(H + Mc
2
))
H(Mc
2
+ H)
= J +
c
2
H
[KP] +S
c
2
P(P S)
H(Mc
2
+ H)
Mc
2
H
S
= J +
c
2
H
[KP] +S
c
2
P(P S)
H(Mc
2
+ H)
J +
c
2
P(P J)
H(Mc
2
+ H)
+
c
2
H
[PK]
= S
Therefore, just as in classical physics, the total angular momentum is a sum
of two parts: the orbital angular momentum [RP] and the intrinsic angular
momentum or spin S
J = [RP] +S
Next we can check that condition (IV) is satised as well, e.g.,
[S
x
, R
y
] = [J
x
[RP]
x
, R
y
] = iR
z
[P
y
R
z
P
z
R
y
, R
y
]
= iR
z
iR
z
= 0
Theorem 4.1 All components of the position operator commute with each
other: [R
i
, R
j
] = 0.
Proof. First, we calculate the commutator [HR
x
, HR
y
] which is related to
[R
x
, R
y
] via formula
13
[HR
x
, HR
y
] = [HR
x
, H]R
y
+ H[HR
x
, R
y
]
= H[R
x
, H]R
y
+ H[H, R
y
]R
x
+ H
2
[R
x
, R
y
]
= ic
2
(P
x
R
y
R
y
P
x
) + H
2
[R
x
, R
y
]
= ic
2
[PR]
z
+ H
2
[R
x
, R
y
]
= ic
2
J
z
+ ic
2
S
z
+ H
2
[R
x
, R
y
] (4.34)
13
here we used (E.11)
4.3. OPERATORS OF SPIN AND POSITION 117
Using formula (4.33) for the position operator, we obtain
[HR
x
, HR
y
]
=
_
c
2
K
x
ic
2
P
x
2H
+ cF[PW]
x
, c
2
K
y
ic
2
P
y
2H
+ cF[PW]
y
_
Non-zero contributions to this commutator are
[c
2
K
x
, c
2
K
y
] = c
4
[K
x
, K
y
] = ic
2
J
z
(4.35)
_
ic
2
P
x
2H
, c
2
K
y
_
=
ic
2
2
_
K
y
,
P
x
H
_
=
2
c
4
P
y
P
x
2H
2
(4.36)
_
c
2
K
x
,
ic
2
P
y
2H
_
=
2
c
4
P
y
P
x
2H
2
(4.37)
[c
2
K
x
, cF[PW]
y
]
=
c
3
M
_
K
x
,
P
z
W
x
P
x
W
z
H + Mc
2
_
=
c
3
M
_
P
z
W
x
P
x
W
z
(H + Mc
2
)
2
[K
x
, H] +
P
z
[K
x
, W
x
]
H + Mc
2
[K
x
, P
x
]W
z
H + Mc
2
_
= ic
3
(MF
2
(P
z
W
x
P
x
W
z
)P
x
+ FP
z
W
0
c
1
FHW
z
c
2
)
[cF[PW]
x
, c
2
K
y
]
= c
3
(MF
2
(P
y
W
z
P
z
W
y
)[K
y
, H] + FP
z
[K
y
, W
y
] F[K
y
, P
y
]W
z
)
= ic
3
(MF
2
(P
y
W
z
P
z
W
y
)P
y
+ FP
z
W
0
c
1
FHW
z
c
2
)
Adding together two last results and using (4.30) we obtain
[c
2
K
x
, cF[PW]
y
] + [cF[PW]
x
, c
2
K
y
]
= ic
3
(MF
2
[P[PW]]
z
+ 2FP
z
W
0
c
1
2FHW
z
c
2
)
= ic
3
(MF
2
(P
z
(P W) W
z
P
2
) + 2FP
z
W
0
c
1
2FHW
z
c
2
)
= ic
3
(MF
2
(P
z
HW
0
c
1
W
z
P
2
) + 2FP
z
W
0
c
1
2FHW
z
c
2
)
118 CHAPTER 4. OPERATORS OF OBSERVABLES
= ic
2
MF
2
P
z
W
0
(H 2(H + Mc
2
)) + icMF
2
W
z
((H Mc
2
)(H + Mc
2
)
+2H(H + Mc
2
))
= ic
3
P
z
W
0
(Fc
1
M
2
F
2
c) +
icW
z
M
(4.38)
One more commutator is
[cF[PW]
x
, cF[PW]
y
]
= c
2
F
2
[P
y
W
z
P
z
W
y
, P
z
W
x
P
x
W
z
]
= c
2
F
2
(P
z
P
y
[W
z
, W
x
] P
2
z
[W
y
, W
x
] + P
x
P
z
[W
y
, W
z
])
= icF
2
(P
z
P
y
(HW
y
cW
0
P
y
) + P
2
z
(HW
z
cW
0
P
z
) + P
x
P
z
(HW
x
cW
0
P
x
))
= icF
2
(W
0
cP
z
(P
2
x
+ P
2
y
+ P
2
z
) + HP
z
(P
x
W
x
+ P
y
W
y
+ P
z
W
z
))
= ic
2
F
2
(W
0
P
z
P
2
+ H
2
P
z
W
0
/c
2
)
= iF
2
W
0
(P
z
(H
2
M
2
c
4
) + H
2
P
z
)
=
ic
4
W
0
P
z
(H + Mc
2
)
2
(4.39)
Now we collect all terms (4.35) - (4.39) and nally calculate
[HR
x
, HR
y
] = ic
2
J
z
+ ic
3
P
z
W
0
(F M
2
F
2
c) +
icW
z
M
+ ic
4
M
2
F
2
W
0
P
z
= ic
2
J
z
+ ic
2
_
P
z
W
0
M(H + Mc
2
)
+
W
z
Mc
_
= ic
2
J
z
+ ic
2
S
z
Comparing this with equation (4.34) we obtain
H
2
[R
x
, R
y
] = 0
Operator H
2
= M
2
c
4
+ P
2
c
2
has no zero eigenvalues, because we have as-
sumed that M is strictly positive. Thus we get the desired result
[R
x
, R
y
] = 0
4.3. OPERATORS OF SPIN AND POSITION 119
4.3.4 Alternative set of basic operators
So far, our plan was to construct operators of observables from 10 basic
generators P, J, K, H. However, this set of operators is sometimes dicult
to use in calculations due to rather complicated commutation relations in
the Poincare Lie algebra (3.52) - (3.58). For systems with a strictly positive
spectrum of the mass operator, we may nd it more convenient to use an
alternative set of basic operators P, R, S, M whose commutation relations
are much simpler
[P, M] = [R, M] = [S, M] = [R
i
, R
j
] = [P
i
, P
j
] = 0 (4.40)
[R
i
, P
j
] = i
ij
[P, S] = [R, S] = 0
[S
i
, S
j
] = i
3
k=1
ijk
S
k
(4.41)
Summarizing our previous results, we can express operators in this set through
generators of the Poincare group
14
R =
c
2
2
(H
1
K+KH
1
)
c[PW]
MH(Mc
2
+ H)
(4.42)
S = J [RP] (4.43)
M = +
1
c
2
H
2
P
2
c
2
(4.44)
Conversely, we can express generators of the Poincare group P, K, J, H
through operators P, R, S, M. For the energy and angular momentum we
obtain
H = +
M
2
c
4
+ P
2
c
2
(4.45)
J = [RP] +S (4.46)
and the expression for the boost operator is
14
Operator P is the same in both sets.
120 CHAPTER 4. OPERATORS OF OBSERVABLES
1
2c
2
(RH + HR)
[PS]
Mc
2
+ H
=
1
2
_
1
2
(H
1
KH +K)
[PS]
Mc
2
+ H
_
1
2
_
1
2
(K+ HKH
1
)
[PS]
Mc
2
+ H
_
[PS]
Mc
2
+ H
=
1
4
(H
1
KH +K+K+ HKH
1
)
= K
i
4
(H
1
PPH
1
)
= K (4.47)
These two sets provide equivalent descriptions of Poincare invariant theories.
Any function of operators from the set P, J, K, H can be expressed as a
function of operators from the set P, R, S, M and vice versa. We will use
this property in subsections 4.3.5, 6.3.2 and 7.2.2.
4.3.5 Canonical form and power of operators
In this subsection, we would like to mention some mathematical facts which
will be helpful in further calculations. When performing calculations with
functions of Poincare generators, we meet a problem that the same operator
can be expressed in many equivalent functional forms. For example, accord-
ing to (3.58) K
x
H and HK
x
iP
x
are two forms of the same operator. To
solve this non-uniqueness problem, we will agree to write operator factors
always in the canonical form, i.e., from left to right in the following order:
15
C(P
x
, P
y
, P
z
, H), J
x
, J
y
, J
z
, K
x
, K
y
, K
z
(4.48)
Consider, for example, the non-canonical product K
y
P
y
J
x
. To bring it to the
canonical form, we rst move factor P
y
to the leftmost position using (3.57)
15
Since H, P
x
, P
y
and P
z
commute with each other, the part of the operator depend-
ing on these factors can be written as an ordinary function of commuting arguments
C(P
x
, P
y
, P
z
, H), whose order is irrelevant.
4.3. OPERATORS OF SPIN AND POSITION 121
K
y
P
y
J
x
= P
y
K
y
J
x
+ [K
y
, P
y
]J
x
= P
y
K
y
J
x
i
c
2
HJ
x
The second term on the right hand side is already in the canonical form, but
the rst term is not. We need to switch factors J
x
and K
y
there:
K
y
P
y
J
x
= P
y
J
x
K
y
+ P
y
[K
y
, J
x
]
i
c
2
HJ
x
= P
y
J
x
K
y
iP
y
K
z
i
c
2
HJ
x
(4.49)
Now all terms in (4.49) are in the canonical form.
The procedure for bringing a general operator to the canonical form is
not more dicult than in the above example. If we call the original operator
the primary term, then this procedure can be formalized as the following
sequence of steps: First we transform the primary term itself to the canonical
form. We do that by switching the order of pairs of neighboring factors if
they occur in the wrong order. Let us call them the left factor L and
the right factor R. If R happens to commute with L, then such a change
has no other eect. If R does not commute with L, then the result of the
switch is LR RL+[L, R]. This means that apart from switching we must
add another secondary term to the original expression. The secondary term
is obtained from the primary term by replacing the product LR with the
commutator [L, R].
16
At the end of the rst step we have all factors in the
primary term in the canonical order. If during this process all commutators
[L, R] were zero, then we are done. If there were nonzero commutators, then
we have a number of additional secondary terms. In the general case, these
terms are not yet in the canonical form and the above procedure should be
repeated for them resulting in tertiary, etc. terms until all terms are in the
canonical order.
Then, for each operator there is a unique representation as a sum of terms
in the canonical form
F = C
00
+
3
i=1
C
10
i
J
i
+
3
i=1
C
01
i
K
i
+
3
i,j=1
C
11
ij
J
i
K
j
+
3
i,j=1;ij
C
02
ij
K
i
K
j
+ . . .
(4.50)
16
The second and third terms on the right hand side of (4.49) are secondary.
122 CHAPTER 4. OPERATORS OF OBSERVABLES
where C
= C
(P
x
, P
y
, P
z
, H) are functions of translation generators.
We will also nd useful the notion of power of terms in (4.50). We
will denote pow(A) the number of factors J and/or K in the term A. For
example, the rst term on the right hand side of (4.50) has power 0. The
second and third terms have power 1, etc. The power of a general operator F
(which is a sum of several terms A) is dened as the maximum power among
terms in F. For operators considered earlier in this chapter, we have
pow(H) = pow(P) = pow(V) = 0
pow(W
0
) = pow(W) = pow(S) = pow(R) = 1
Lemma 4.2 If L and R are operators from the list (4.48) and [L, R] ,= 0,
then
pow([L, R]) = pow(L) + pow(R) 1
Proof. The commutator [L, R] is non-zero in two cases.
1. pow(L) = 1 and pow(R) = 0 (or, equivalently, pow(L) = 0 and
pow(R) = 1). From commutation relations (3.52), (3.55), (3.57) and (3.58),
it follows that non-vanishing commutators between Lorentz generators and
translation generators are functions of translation generators, i.e., have zero
power. The same is true for commutators between Lorentz generators and
arbitrary functions of translation generators C(P
x
, P
y
, P
z
, H).
2. If pow(L) = 1 and pow(R) = 1, then pow([L, R]) = 1 follows directly
from commutators (3.53), (3.54) and (3.56). For example, if C and D are
two functions of P
x
, P
y
, P
z
, H, then using (E.11) and [C, D] = 0 we obtain
[CJ
x
, DJ
y
] = [CJ
x
, D]J
y
+ D[CJ
x
, J
y
] = C[J
x
, D]J
y
+ DC[J
x
, J
y
] + DJ
x
[C, J
y
]
The power of the right hand side is 1.
The primary term for the product of two terms AB has exactly the same
number of Lorentz generators as the original operator, i.e., pow(A)+pow(B).
4.3. OPERATORS OF SPIN AND POSITION 123
Lemma 4.3 For two terms A and B, either secondary term in the product
AB is zero or its power is equal to pow(A) + pow(B) -1.
Proof. Each secondary term results from replacing a product of two gener-
ators LR in the primary term with their commutator [L, R]. According to
Lemma 4.2, if [L, R] ,= 0 such a replacement decreases the power of the term
by one unit.
The powers of tertiary and higher order terms are less than the power of
secondary terms. Therefore, for any product AB its power is determined by
the primary term only
pow(AB) = pow(BA) = pow(A) + pow(B)
This implies
Theorem 4.4
17
For two non-commuting terms A and B
pow([A, B]) = pow(A) + pow(B) 1
Proof. In the commutator AB BA, the primary term of AB cancels
out the primary term of BA. If [A, B] ,= 0, then the secondary terms do not
cancel. Therefore, there is at least one non-zero secondary term whose power
is pow(A) + pow(B) 1 according to Lemma 4.3.
Having at our disposal basic operators P, R, S and M we can form a
number of Hermitian scalars, vectors and tensors which are classied in table
4.1 according to their true/pseudo character and power:
4.3.6 Uniqueness of the spin operator
Let us now prove that (4.27) is the unique spin operator satisfying conditions
(I) - (IV) from subsection 4.3.1. Suppose that there is another spin operator
S
satisfying the same conditions. Denoting the power of the spin components
17
This theorem was used by Berg in ref. [Ber65].
124 CHAPTER 4. OPERATORS OF OBSERVABLES
Table 4.1: Scalar, vector and tensor functions of basic operators
power 0 power 1 power 2
True scalar P
2
; M P R+R P R
2
; S
2
Pseudoscalar P S R S
True vector P R; [PS] [RS]
Pseudovector S; [PR]
True tensor P
i
P
j
3
k=1
ijk
S
k
; P
i
R
j
+ R
j
P
i
S
i
S
j
+ S
j
S
i
; R
i
R
j
Pseudotensor
3
k=1
ijk
P
k
3
k=1
ijk
R
k
; P
i
S
j
R
i
S
j
by p = pow(S
x
) = pow(S
y
) = pow(S
z
) we obtain from (4.23) and Theorem
4.4
pow([S
x
, S
y
]) = pow(S
z
)
2p 1 = p
Therefore, the components of S
= b(M, P
2
)S + f(M, P
2
)[PR] + e(M, P
2
)(S P)P
where b, f and e are arbitrary real functions.
18
From condition (III) we
obtain f(M, P
2
) = 0. Comparing commutator
19
[S
x
, S
y
] = [bS
x
+ e(S P)P
x
, bS
y
+ e(S P)P
y
]
= b
2
[S
x
, S
y
] iebP
x
[S P]
y
+ iebP
y
[S P]
x
= ib
2
S
z
ieb(P[S P])
z
= i(b
2
S
z
ebP
2
S
z
+ eb(S P)P
z
)
with the requirement (II)
[S
x
, S
y
] = iS
z
= i(bS
z
+ e(S P)P
z
)
18
These functions depend on scalars P
2
and M in order to satisfy condition (I).
19
Here we used equation [S, (S P)] = i[S P].
4.3. OPERATORS OF SPIN AND POSITION 125
we obtain the system of equations
b
2
ebP
2
= b
eb = e
whose non-trivial solution is b = 1 and e = 0. Therefore, the spin operator
is unique S
= S.
4.3.7 Uniqueness of the position operator
Assume that in addition to the Newton-Wigner position operator R there
is another position operator R
= a(P
2
, M)R+ d(P
2
, M)[S P] + g(P
2
, M)P
where a, d and g are arbitrary real functions. From condition (IV) it follows,
for example, that
0 = [R
x
, S
y
] = d(P
2
, M)[S
y
P
z
S
z
P
y
, S
y
] = id(P
2
, M)P
y
S
x
which implies that d(P
2
, M) = 0. From (V) we obtain
i = [R
x
, P
x
] = a(P
2
, M)[R
x
, P
x
] = ia(P
2
, M)
and a(P
2
, M) = 1. Therefore the most general form of the position operator
is
R
= R+ g(P
2
, M)P (4.51)
In Theorem 15.1 we will consider boost transformations for times and posi-
tions of events in non-interacting systems of particles. If the term g(P
2
, M)P
in (4.51) were non-zero, we would not get an agreement with Lorentz trans-
formations known from Einsteins special relativity.
20
Therefore, we will
20
see (15.5) - (15.8) and Appendix I.2
126 CHAPTER 4. OPERATORS OF OBSERVABLES
assume that the factor g(P
2
, M) vanishes and R
P
2
c
2
+ M
2
c
4
] = i
P
2
c
2
+ M
2
c
4
P
=
iPc
2
P
2
c
2
+ M
2
c
4
= i
Pc
2
H
= iV
where V is the velocity operator (4.5). Therefore, as expected, for an observer
shifted in time by the amount t, the position of the physical system appears
shifted by Vt:
R(t) = exp
_
i
Ht
_
Rexp
_
Ht
_
(4.54)
= R+
i
ic
Kx
K
y
e
ic
Kx
= K
y
ic
[K
x
, K
y
]
c
2
2
2!
2
[K
x
, [K
x
, K
y
]] +
ic
3
3
3!
3
[K
x
, [K
x
, [K
x
, K
y
]]] + . . .
= K
y
c
J
z
+
2
2!
K
y
3
3!c
J
z
. . . = K
y
cosh
1
c
J
z
sinh
Then the y-component of position in the reference frame O
moving along
the x-axis is
21
R
y
() = e
ic
Kx
R
y
e
ic
Kx
=
c
2
2
e
ic
Kx
(K
y
H
1
+ H
1
K
y
)e
ic
Kx
=
c
2
2
(K
y
cosh
J
z
c
sinh )(H cosh cP
x
sinh )
1
c
2
2
(H cosh cP
x
sinh )
1
(K
y
cosh
J
z
c
sinh ) (4.57)
Similarly, for the x- and z-components
R
x
() =
c
2
2
K
x
(H cosh cP
x
sinh )
1
c
2
2
(H cosh cP
x
sinh )
1
K
x
(4.58)
R
z
() =
c
2
2
(K
z
cosh +
J
y
c
sinh )(H cosh cP
x
sinh )
1
c
2
2
(H cosh cP
x
sinh )
1
(K
z
cosh +
J
y
c
sinh ) (4.59)
21
Here we used (4.4).
128 CHAPTER 4. OPERATORS OF OBSERVABLES
These transformations do not resemble usual Lorentz formulas from special
relativity.
22
This is not surprising, because the Newton-Wigner position op-
erator does not constitute a 3-vector component of any 4-vector quantity.
23
Furthermore, we can nd the time dependence of the position operator in
the moving reference frame O
. We use label t
in O
= e
Kxc
He
i
Kxc
(4.60)
Then we obtain
R(, t
) = e
i
R()e
= e
i
ic
Kx
Re
ic
Kx
e
=
_
e
ic
Kx
e
i
Ht
e
ic
Kx
_
e
ic
Kx
Re
ic
Kx
_
e
ic
Kx
e
Ht
e
ic
Kx
_
= e
ic
Kx
e
i
Ht
Re
Ht
e
ic
Kx
= e
ic
Kx
(R+Vt
)e
ic
Kx
= R() +V()t
(4.61)
where velocity V() in the reference frame O
[0 belongs to 1
0
6
Pe
[0 = e
e
i
Pe
[0
= e
__
P
(1 cos ) +Pcos
_
P
_
sin
_
[0
= 0 (5.1)
This means that representation of the rotation subgroup dened in the full
Hilbert space 1 induces a unitary representation V
g
of this subgroup in 1
0
.
The generators of rotations in 1are, of course, represented by the angular
momentum vector J. However, in the subspace 1
0
, they can be equivalently
represented by the vector of spin S, because
S
z
[0 = J
z
[0 [RP]
z
[0 = J
z
[0 (R
x
P
y
R
y
P
x
)[0 = J
z
[0
6
Here we used equation (4.2).
5.1. MASSIVE PARTICLES 133
We will show later that the representation of the full Poincare group
is irreducible if and only if the representation V
g
of the rotation group in
1
0
is irreducible. So, we will be interested only in such irreducible rep-
resentations V
g
. The classication of unitary irreducible representations of
the rotation group (single- and double-valued) depends on one integer or
half-integer parameter s
7
that we will identify with particles spin. The
trivial one-dimensional representation is characterized by spin zero (s = 0)
and corresponds to a spinless particle. The two-dimensional representation
corresponds to particles with spin one-half (s = 1/2). The 3-dimensional
representation corresponds to particles with spin one (s = 1), etc.
It is customary to choose a basis of eigenvectors of S
z
in 1
0
and denote
these vectors by [0, , i.e.,
P[0, = 0
H[0, = mc
2
[0,
M[0, = m[0,
S
2
[0, =
2
s(s + 1)[0,
S
z
[0, = [0,
where = s, s + 1, . . . , s 1, s. The action of a rotation on these basis
vectors is
e
[0, = e
[0, =
s
=s
D
s
)[0,
(5.2)
where D
s
are (2s + 1) (2s + 1) matrices of the representation V
g
. This
denition implies that
8
s
=s
D
s
2
)[0,
= e
1
e
2
[0, = e
1
s
=s
D
s
2
)[0,
7
see Appendix H.5
8
Here
2
denotes the composition of two rotations parameterized by vectors
1
and
2
, respectively.
134 CHAPTER 5. SINGLE PARTICLES
=
s
=s
_
s
=s
D
s
1
)D
s
2
)
_
[0,
and
D
s
2
) =
s
=s
D
s
1
)D
s
2
)
which means that matrices D
s
furnish a representation of the rotation group.
5.1.2 Momentum-spin basis
In the preceding subsection we constructed basis vectors [0, in the subspace
1
0
. We also need basis vectors [p, in other subspaces 1
p
with p ,= 0. We
will build basis [p, by propagating the basis [0, to other points in the
3D momentum space using pure boost transformations.
9
The unique pure
boost, which transforms momentum 0 to p, will be denoted by
p
.
10
The
corresponding unitary operator in the Hilbert space is
11
U(
p
; 0, 0) e
ic
p
(5.3)
where
p
=
p
p
sinh
1
p
mc
(5.4)
Therefore we can write
[p, = N(p)U(
p
; 0, 0)[0, = N(p)e
ic
p
[0, (5.5)
9
Of course, this choice is rather arbitrary. A dierent choice of transformations con-
necting momenta 0 and p (e.g., boosts coupled with rotations) would result in a dierent
but equivalent basis set. However, once the basis set has been xed, all formulas should
be written with respect to it.
10
see Fig. 5.1
11
for this notation see (3.60)
5.1. MASSIVE PARTICLES 135
LL
pp
p
00
pp
=
=
= =
=
=
1/2
1/2
1/2
1/2
1/2
1/2
Figure 5.1: Construction of the momentum-spin basis for a spin one-half par-
ticle. Spin eigenvectors (with eigenvalues = 1/2, 1/2) at zero momentum
are propagated to non-zero momenta p and p
p
, respectively. As discussed in subsection 5.1.3, there is a unique pure
boost L which connects momenta p and p
.
where N(p) is a normalization factor. The explicit expression for N(p) will
be given in equation (5.27).
To verify that vector (5.5) is indeed an eigenvector of the momentum
operator with eigenvalue p we use equation (4.3)
P[p, = N(p)Pe
ic
p
[0, = N(p)e
ic
p
e
ic
p
Pe
ic
p
[0,
= N(p)e
Kc
p
c
p
H sinh
p
[0, = N(p)e
ic
p
mc sinh
p
[0,
= N(p)pe
ic
p
[0, = p[p, (5.6)
Let us now nd the action of the spin component S
z
on the basis vectors
[p,
12
S
z
[p, = N(p)S
z
e
ic
p
[0,
12
Here we use (4.27) and take into account that W
0
[0, = P[0, = 0 and H[0, =
Mc
2
[0, . We also use boost transformations (4.3) and (4.4) of the energy-momentum 4-
vector (H, cP) and similar formulas for the Pauli-Lubanski 4-vector (W
0
, W). For brevity,
we denote
z
the z-component of the vector
p
and its absolute value.
136 CHAPTER 5. SINGLE PARTICLES
= N(p)e
ic
p
e
ic
p
_
W
z
Mc
W
0
P
z
M(Mc
2
+ H)
_
e
ic
p
[0,
= N(p)e
ic
p
_
W
z
+
z
[(W
)(cosh 1) W
0
sinh ]
Mc
_
W
0
cosh (W
) sinh
__
P
z
+
z
[(P
)(cosh 1)
1
c
H sinh ]
_
M(Mc
2
+ H cosh c(P
) sinh )
_
[0,
= N(p)e
ic
p
_
W
z
+
z
(W
)(cosh 1)
Mc
(W
) sinh (
z
Mc sinh )
M(Mc
2
+ Mc
2
cosh )
_
[0,
= N(p)e
ic
p
_
W
z
Mc
+
z
_
W
_
_
cosh 1
Mc
sinh
2
Mc(1 + cosh )
_
_
[0,
= N(p)e
ic
p
W
z
Mc
[0, = N(p)e
ic
p
S
z
[0, = N(p)e
ic
p
[0,
= [p,
So, [p, are eigenvectors of the momentum, energy and z-component of
spin
13
P[p, = p[p,
H[p, =
p
[p,
M[p, = m[p,
S
2
[p, =
2
s(s + 1)[p,
S
z
[p, = [p,
where we denoted
p
_
m
2
c
4
+ p
2
c
2
(5.7)
13
Note that eigenvectors of the spin operator S are obtained here by applying pure boosts
to vectors at p = 0. Dierent transformations (involving rotations) connecting bases in
points 0 and p (see footnote on page 134) would result in dierent momentum-spin basis
and in dierent spin operator S
) will
not be expressed as functions of basic generators of the Poincare group. This condition
was important for our proof of the uniqueness of S (and R) in section 4.3.
5.1. MASSIVE PARTICLES 137
p p
pp p
LL
PP
HH
00
m
>
0
m
=
0
Figure 5.2: Mass hyperboloid in the energy-momentum space for massive
particles and the zero-mass cone for m = 0.
the one-particle energy
The common spectrum of the energy-momentum eigenvalues (
p
, p) can
be conveniently represented as points on the mass hyperboloid in the 4-
dimensional energy-momentum space (see Fig. 5.2). For massive particles,
the spectrum of the velocity operator V = Pc
2
/H is the interior of a 3-
dimensional sphere [v[ < c in the 4D energy-momentum space. This spec-
trum does not include the surface of the sphere, therefore massive particles
cannot reach the speed of light.
14
5.1.3 Action of Poincare transformations
We can now dene the action of transformations from the Poincare group
on the basis vectors [p, constructed above.
15
Translations act by simple
multiplication
14
In quantum mechanics, the speed of propagation of particles is not a well-dened
concept. The value of particles speed is denite in states having certain momentum.
However, such states are described by innitely extended plane waves (5.39) and one
cannot speak about particle propagation in such states. So, strictly speaking, the speed
of a particle cannot be obtained by measuring its positions at two dierent time instants
and dividing the traveled distance by the time interval. This is a consequence of the
non-commutativity of the operators of position and velocity.
15
We are working in the Schrodinger picture here.
138 CHAPTER 5. SINGLE PARTICLES
e
Pa
[p, = e
pa
[p, (5.8)
e
i
Ht
[p, = e
i
pt
[p, (5.9)
Let us now apply rotation e
[p, = N(p)e
ic
p
[0,
= N(p)e
ic
p
e
i
[0,
= N(p)e
ic
(R
1
K)
p
s
=s
D
)[0,
= N(p)e
ic
KR
p
s
=s
D
)[0,
=
s
=s
D
)[R
p,
(5.10)
This means that both momentum and spin of the particle are rotated by the
angle
, as expected.
Applying a boost L e
ic
)Q(p,
) (5.12)
Here B and Q are yet undened quantum-mechanical operators and now
we are going to learn more about them. Multiplying both sides of equation
(5.12) by B
1
(p,
), we obtain
5.1. MASSIVE PARTICLES 139
B
1
(p,
)LU(
p
; 0, 0) = Q(p,
) (5.13)
Since operator Q on the right hand side is a representative of a rotation, it
keeps invariant the subspace with zero momentum 1
0
. Therefore, the se-
quence of boosts on the left hand side of equation (5.13) when acting on a
vector with zero momentum [0, returns this vector back to the zero mo-
mentum subspace. This fact is clearly seen in Fig. 5.1: The zero momentum
vector is mapped to a vector with momentum p by the boost
p
. Subse-
quent application of L transforms this vector to another eigenstate of the
momentum operator with eigenvalue p
= PL[p = Pe
ic
[p = e
ic
e
ic
Pe
ic
[p
= e
ic
_
P+
__
P
_
(cosh 1) +
H
c
sinh
__
[p
=
_
p +
__
p
_
(cosh 1) +
p
c
sinh
__
[p
So we conclude that
p
= p +
__
p
_
(cosh 1) +
p
c
sinh
_
p (5.14)
It then follows that B(p,
) =
1
p
W
(p,)
; 0, 0) Q(p,
) = U
1
(
p
; 0, 0)LU(
p
; 0, 0) (5.15)
where
W
(p, ) is called the Wigner angle
17
16
For simplicity, in this derivation we omit spin indices.
17
Here function
assigns a unique rotation angle to the given rotation matrix, as
explained in Appendix D.5.
140 CHAPTER 5. SINGLE PARTICLES
W
(p, ) =
(
1
p
p
) (5.16)
Explicit formulas for this angle can be found, e.g., in ref. [Rit61]. Then,
substituting (5.15) in (5.12), we obtain
e
ic
[p, = N(p)LU(
p
; 0, 0)[0, = N(p)U(
p
; 0, 0)R
W
(p,)
[0,
= N(p)U(
p
; 0, 0)
s
=s
D
s
W
(p, ))[0,
=
N(p)
N(p)
s
=s
D
s
W
(p, ))[p,
(5.17)
Equations (5.10) and (5.17) show that rotations and boosts are accompa-
nied with turning the spin vector in each subspace 1
p
by rotation matrices
D
s
. If the representation of the rotation group D
s
were reducible, then each
subspace 1
p
would be represented as a direct sum of irreducible components
1
k
p
1
p
=
k
1
k
p
and each subspace
1
k
=
pR
3 1
k
p
would be irreducible with respect to the entire Poincare group. Therefore, in
order to construct an irreducible representation of the Poincare group in 1,
the representation D
s
must be an irreducible unitary representation of the
rotation group, as was mentioned already in subsection 5.1.1. In this book we
will be interested in describing interactions between electrons and protons,
which are massive particles with spin 1/2. Then the relevant representation
D
s
of the rotation group is the 2-dimensional representation from Appendix
H.5.
Let us now review the above construction of unitary irreducible repre-
sentations of the Poincare group for massive particles.
18
First we chose a
18
This construction is known as the induced representation method [Mac00].
5.2. MOMENTUM AND POSITION REPRESENTATIONS 141
standard momentum vector p = 0 and found a little group, which was a
subgroup of the Lorentz group leaving this vector invariant. The little group
turned out to be the rotation group in our case. Then we found that if the
subspace 1
0
corresponding to the standard vector carries an irreducible rep-
resentation of the little group, then the entire Hilbert space is guaranteed to
carry an irreducible representation of the Poincare group. In this represen-
tation, translations are represented by multiplication (5.8) - (5.9), rotations
and boosts are represented by formulas (5.10) and (5.17), respectively. It can
be shown that a dierent choice of the standard vector in the spectrum of
momentum would result in a representation of the Poincare group isomorphic
to the one found above.
5.2 Momentum and position representations
So far we discussed the action of inertial transformations on common eigen-
vectors [p, of the operators Pand S
z
. All other vectors in the Hilbert space
1 can be represented as linear combinations of these basis vectors, i.e., they
can be represented as wave functions (p, ) in the momentum-spin repre-
sentation. Similarly one can construct the position space basis [r, from
common eigenvectors of the (commuting) Newton-Wigner position operator
and operator S
z
. Then arbitrary states in 1 can be represented in this basis
by their position-spin wave functions (r, ). In this section we will consider
the wave function representations of states in greater detail. For simplic-
ity, we will omit the spin label and consider only spinless particles. It is
remarkable that formulas for the momentum-space and position-space wave
functions appear very similar to those in non-relativistic quantum mechanics.
5.2.1 Spectral decomposition of the identity operator
Two basis vectors with dierent momenta [p and [p
= 0 if p ,= p
= e
Ie
i
= e
__
dp[pp[
_
e
i
=
_
dp[R
pR
p[
=
_
dqdet
dp
dq
[qq[ =
_
dq[qq[ = I
where we used (5.10) and the fact that det [dp/dq[ = det(R
) = 1 is the
Jacobian of transformation from variables p to q = R
p.
Let us consider more closely the invariance of I with respect to boosts.
Using equation (5.17) we obtain
I
= e
ic
Ie
ic
= e
ic
__
dp[pp[
_
e
ic
=
_
dp[pp[
N(p)
N(p)
2
=
_
dqdet
d
1
q
dq
[qq[
N(
1
q)
N(q)
2
(5.21)
where N(q) is the normalization factor introduced in (5.5) and det [d
1
q/dq[
is the Jacobian of transformation from variables p to q = p. So, now we
have the opportunity to calculate N(q). The Jacobian should not depend on
the direction of the boost
and we can choose this direction along the z-axis
to simplify calculations. Then from (5.14) we obtain
1
q
x
= q
x
(5.22)
1
q
y
= q
y
(5.23)
1
q
z
= q
z
cosh
1
c
_
m
2
c
4
+ q
2
c
2
sinh (5.24)
1
q
=
m
2
c
4
+ c
2
q
2
x
+ c
2
q
2
y
+ c
2
_
q
z
cosh
1
c
_
m
2
c
4
+ q
2
c
2
sinh
_
2
=
q
cosh cq
z
sinh
144 CHAPTER 5. SINGLE PARTICLES
and
20
det
d
1
q
dq
= det
_
_
1 0 0
0 1 0
cqx sinh
m
2
c
4
+q
2
c
2
cqy sinh
m
2
c
4
+q
2
c
2
cosh
cqz sinh
m
2
c
4
+q
2
c
2
_
_
= cosh
cq
z
sinh
_
m
2
c
4
+ q
2
c
2
=
1
q
q
(5.26)
Inserting this result in equation (5.21) we obtain
I
=
_
dp
1
p
p
[pp[
N(
1
p)
N(p)
2
Thus, to ensure the invariance of I, we should dene our normalization factor
as
21
N(p) =
mc
2
p
(5.27)
Putting together our results from equations (5.8) - (5.10), (5.17) and
(5.27), we can nd the action of an arbitrary Poincare group element on
basis vectors [p, . Bearing in mind that in a general Poincare transfor-
mation
22
(; r; t) we agreed
23
rst to perform translations (r, t) and then
boosts/rotations , we obtain for the general case of a particle with spin
24
20
This means that in 3D momentum integrals we are allowed to use the equality
dq
q
=
d(q)
q
(5.25)
for any element of the Lorentz group, i.e., that dq/
q
is a Lorentz invariant measure.
21
We could also multiply this expression for N(p) by an arbitrary unimodular factor,
but this would not have any eect, because state vectors and their wave functions are
dened up to an unimodular factor anyway.
22
is a product of a rotation and a boost, as in equation (I.12).
23
see equations (2.6) and (3.59)
24
Here we use active transformation of the state as explained in subsection 5.2.4.
5.2. MOMENTUM AND POSITION REPRESENTATIONS 145
U(; r, t)[p, = U(; 0, 0)e
Pr
e
i
Ht
[p,
=
_
p
e
pr+
i
pt
s
=s
D
W
(p, ))[p,
(5.28)
5.2.2 Wave function in the momentum representation
The inner product of two normalized vectors [ =
_
dp(p)[p and [ =
_
dp(p)[p can we written in terms of their wave functions
[ =
_
dpdp
(p)(p
)p[p
=
_
dpdp
(p)(p
)(p p
)
=
_
dp
(p)(p) (5.29)
So, for a state vector [ with unit normalization, the wave function (p)
must satisfy the condition
1 = [ =
_
dp[(p)[
2
This wave function has a direct probabilistic interpretation, e.g., if is a
region in the momentum space, then the integral
_
dp[(p)[
2
gives the prob-
ability of nding particles momentum inside this region.
Poincare transformations of the state vector [ can be viewed as transfor-
mations of the corresponding momentum-space wave function. For example,
using equation (5.28) we obtain
25
e
ic
(p) p[e
ic
[ =
_
1
p
1
p[
=
_
1
p
p
(
1
p) (5.30)
25
Strictly speaking, operators always act on state vectors. When we apply operators to
wave functions, as in (5.30), we will place a caret above the operator symbol.
146 CHAPTER 5. SINGLE PARTICLES
Then the boost invariance of the inner product (5.29) can be easily proven
using property (5.25)
=
_
dp
(
1
p)(
1
p)
1
p
p
=
_
d(
1
p)
1
p
(
1
p)(
1
p)
1
p
=
_
dp
(p)(p) = [.
The action of Poincare generators and the Newton-Wigner position op-
erator on momentum-space wave functions of a massive spinless particle can
be derived from formula (5.28)
P
x
(p) = i lim
a0
d
da
e
Pxa
(p) = p
x
(p) (5.31)
H(p) = i lim
t0
d
dt
e
i
Ht
(p) =
p
(p) (5.32)
K
x
(p) =
i
c
lim
0
d
d
e
ic
Kx
(p)
=
i
c
lim
0
d
d
_
m
2
c
4
+ p
2
c
2
cosh cp
x
sinh
_
m
2
c
4
+ p
2
c
2
_
p
x
cosh
1
c
_
m
2
c
4
+ p
2
c
2
sinh , p
y
, p
z
_
= i
_
p
c
2
d
dp
x
p
x
2
p
_
(p) (5.33)
R
x
(p) =
c
2
2
(
H
1
K
x
+
K
x
H
1
)(p)
=
i
2
_
1
p
p
d
dp
x
p
d
dp
x
1
p
p
x
c
2
2
p
_
(p)
= i
d
dp
x
(p) (5.34)
J
x
(p) = (
R
y
P
z
R
z
P
y
)(p) = i
_
p
z
d
dp
y
p
y
d
dp
z
_
(p) (5.35)
5.2. MOMENTUM AND POSITION REPRESENTATIONS 147
5.2.3 Position representation
In the preceding subsection we considered particles wave functions in the
momentum representation, i.e., with respect to common eigenvectors of three
commuting components of momentum P
x
, P
y
and P
z
. Three components of
the position operator R
x
, R
y
and R
z
also commute with each other,
26
and
their common eigenvectors [r also form a basis in the Hilbert space 1 of
one massive spinless particle. In this section we will describe particles wave
functions with respect to this basis set, i.e., in the position representation.
First we can expand eigenvectors [r in the momentum basis
[r =
_
dp
r
(p)[p (5.36)
The momentum-space eigenfunctions are
r
(p) = p[r = (2)
3/2
e
pr
(5.37)
as can be veried by substitution of (5.34) and (5.37) to the eigenvalue equa-
tion
R
r
(p) = (2)
3/2
R e
pr
= i(2)
3/2
d
dp
e
pr
= r(2)
3/2
e
pr
= r
r
(p)
As operator R is Hermitian, its eigenvectors with dierent eigenvalues r
and r
[r = (2)
3
_
dpdp
pr+
i
[p
= (2)
3
_
dpdp
pr+
i
(p p
)
= (2)
3
_
dpe
p(rr
)
= (r r
) (5.38)
26
see Theorem 4.1
148 CHAPTER 5. SINGLE PARTICLES
which means that [r are improper states just as [p are. Similarly to (5.18),
a normalized state vector [ can be represented as an integral over the
position space
[ =
_
dr(r)[r
where (r) = r[ is the wave function of the state [ in the position
representation. The absolute square [(r)[
2
of the wave function is the prob-
ability density for particles position. The inner product of two vectors [
and [ can be expressed through their position-space wave functions
[ =
_
drdr
(r)(r
)r[r
=
_
drdr
(r)(r
)(r r
) =
_
dr
(r)(r)
Using Equations (5.36) and (5.37) we nd that the position space wave
function of a momentum eigenvector is the usual plane wave
p
(r) = r[p = (2)
3/2
e
i
pr
(5.39)
As expected, eigenvectors of the position operator in its own representation
are given by delta-functions (5.38).
27
From (5.20) we can also obtain a position-space representation of the
identity operator
_
dr[rr[ = (2)
3
_
drdpdp
r
[pp
[e
i
pr
=
_
dpdp
[pp
[(p p
) =
_
dp[pp[ = I
Similar to momentum-space formulas (5.31) - (5.35) we can represent
generators of the Poincare group by their actions on position-space wave
functions. For example, it follows from (5.8), (5.36) and (5.37) that
27
Note that position eigenfunctions derived in ref. [NW49] do not satisfy this important
requirement.
5.2. MOMENTUM AND POSITION REPRESENTATIONS 149
e
Pa
_
dr(r)[r =
_
dr(r)[r +a =
_
dr(r a)[r
Therefore we can apply operators directly to wave functions
e
Pa
(r) = (r a)
P(r) = i lim
a0
d
da
e
Pa
(r) = i lim
a0
d
da
(r a) = i
d
dr
(r)
(5.40)
Other operators in the position representation have the following forms
28
H(r) =
_
m
2
c
4
2
c
2
d
2
dr
2
(r)
J
x
(r) = i
_
y
d
dz
z
d
dy
_
(r)
K
x
(r) =
1
2
_
_
m
2
c
4
2
c
2
d
2
dr
2
x + x
_
m
2
c
4
2
c
2
d
2
dr
2
_
(r)
R(r) = r(r)
The switching between the position-space and momentum-space wave
functions of the same state are achieved by Fourier transformation formulas.
To derive them, assume that the state [ has a position-space wave function
(r). Then using (5.36) and (5.37) we obtain
[ =
_
dr(r)[r = (2)
3/2
_
dr(r)
_
dpe
pr
[p
= (2)
3/2
_
dp
__
dr(r)e
pr
_
[p
28
Here we used a formal notation for the Laplacian operator
d
2
dr
2
2
x
2
+
2
y
2
+
2
z
2
150 CHAPTER 5. SINGLE PARTICLES
and the corresponding momentum-space wave function is
(p) = (2)
3/2
_
dr(r)e
pr
(5.41)
Inversely, if the momentum-space wave function is (p), then the position-
space wave function is
(r) = (2)
3/2
_
dpe
i
pr
(p) (5.42)
5.2.4 Inertial transformations of observables and states
Here we would like to discuss how observables and states change under iner-
tial transformations of observers. We already touched this issue in few places
in the book, but it would be useful to summarize the denitions and to clarify
the physical meaning of transformations. What do we mean exactly when
expressing observables and states in the reference frame O
(primed) through
observables and states in the reference frame O (unprimed)?
F
= U
g
FU
1
g
(5.43)
[
= U
g
[ (5.44)
where
29
U
g
= U
g
(; a, t) = e
ic
Pa
e
i
Ht
is the unitary representative of an inertial transformation g in the Hilbert
space of the system.
Let us start with transformations of observables (5.43). For deniteness
we will assume that F is the x-component of position (F = R
x
) and that g
is a translation by the distance a along the x-axis
U
g
= U
g
(1; a, 0, 0, 0) = e
Pxa
29
see equation (3.59)
5.2. MOMENTUM AND POSITION REPRESENTATIONS 151
xx
yy
zz
x
y
z
00 11 22 1 2 00 11 22 1 2
O OO
aa
Figure 5.3: Rods for measuring the x-component of position in the reference
frame O and in the frame O
x
= e
Pxa
R
x
e
i
Pxa
(5.45)
is the operator that describes measurements of position in the reference frame
O
with respect to axes and measuring rods in this frame. In Fig. 5.3 we
show a measuring rod in the reference frame O. The zero pointer on this rod
coincides with the origin of the coordinate system O. The operator R
x
is the
mathematical representation of position measurements performed with this
rod. The measuring rod R
x
associated with the observer O
is shifted by the
distance a with respect to the rod X. The zero pointer on X
coincides with
the origin of the coordinate system O
yields x
x
are
related by
R
x
= R
x
a
Of course, the same relationship is obtained by a formal application of (5.45)
R
x
= R
x
[P
x
, R
x
]a = R
x
a
152 CHAPTER 5. SINGLE PARTICLES
The position operator R
x
can be represented also through its spectral
decomposition
R
x
=
dxx[xx[
where [x are eigenvectors (states) with positions x. Then equation (5.45)
can be rewritten as
R
x
= e
Pxa
_
_
dxx[xx[
_
_
e
i
Pxa
=
dxx[x + ax + a[ =
dx(x a)[xx[ = R
x
a
From this we see that the action of U
g
on state vectors
30
e
Pxa
[x = [x + a
should be interpreted as an active shift of the states, i.e., translation of the
states by the distance a in this case.
31
For example, operator e
Pxa
moves a
particle localized in the origin of the frame O (x = 0) to the origin of the frame
O
= U
1
g
[ (5.46)
30
Here we switch to the Schrodinger representation, in which operators of observables
are assumed to be xed.
31
One can also interpret this as a result of the inertial transformation g being applied
to the state preparation device.
5.2. MOMENTUM AND POSITION REPRESENTATIONS 153
This means, in particular, that if the vector [ = [x describes a state of the
particle located at point x from the point of view of the observer O (measured
by the rod X), then the same state is described by the vector
[
= U
1
g
[ = e
i
Pxa
[x = [x a (5.47)
from the point of view of the observer O
this same
particle is described by the vector [ a.
We can also apply inertial transformations to wave functions instead of
state vectors. For example, the state vector
[ =
_
dr(r)[r
has wave function (x, y, z) in the position representation. When we shift
the observer by the distance a in the positive direction along the x-axis, we
should apply a passive transformation to the state vector
32
[
= e
i
Pxa
[ =
_
dr(x, y, z)[x a, y, z =
_
dr(x + a, y, z)[x, y, z
This means that passive transformation of the wave function has the form
e
i
Pxa
(x, y, z) = (x + a, y, z)
The above considerations nd their most important applications in the
case when the inertial transformation U
g
is a time translation. As we estab-
lished in (4.55), the position operator in the time translated reference frame
O
x
= e
i
Ht
R
x
e
Ht
= R
x
+ V
x
t
32
see equation (5.47)
154 CHAPTER 5. SINGLE PARTICLES
If we want to nd how the state vector [ looks from the time translated
reference frame O
= e
Ht
[ (5.48)
It is common to consider a continuous sequence of time shifts parameterized
by the value of time t and to speak about time evolution of the state vector
[(t). Then equation (5.48) can be regarded as a solution of the time-
dependent Schr odinger equation
i
d
dt
[(t) = H[(t) (5.49)
For actual calculations it is more convenient to deal with numerical functions
(wave functions in a particular basis) rather than with abstract state vectors.
To get this type of description, (5.49) can be multiplied on the left by certain
basis bra-vectors. For example, if we multiply (5.49) by position eigenvectors
r[, we obtain Schr odinger equation in the position representation
i
d
dt
r[(t) = r[H[(t)
i
d
dt
(r, t) =
H(r, t) (5.50)
where (r, t) r[(t) is a wave function in the position representation and
the action of the Hamiltonian on this wave function is denoted by
H(r, t)
r[H[(t).
5.3 Massless particles
5.3.1 Spectra of momentum, energy and velocity
In the case of massless particles (m = 0), such as photons, the method
used in section 5.1 to construct irreducible unitary representations of the
Poincare group does not work. For massless particles the position operator
(4.33) cannot be dened. Therefore we cannot apply the Stone-von Neumann
5.3. MASSLESS PARTICLES 155
theorem H.3 to gure out the spectrum of the operator P. To nd the
spectrum of P in the massless case we will use another argument.
Let us choose a state of the massless particle with some nonzero momen-
tum p.
33
There are two kinds of inertial transformations that can aect this
momentum value: rotations and boosts. Any vector p
obtained from p by
rotations and boosts is also in the spectrum of P.
34
So, we can use these
transformations to explore the spectrum of the momentum operator. Rota-
tions generally change the direction of the momentum vector, but preserve
its length p, so all rotation images of p form a surface of a sphere with radius
p. Boosts along the momentum vector p do not change the direction of this
vector, but do change its length. To decrease the length of the momentum
vector we can use a boost vector
which points in the direction opposite to
p, i.e.,
/ = p/p. Then, using formula (5.14) and equality
35
p
= cp (5.51)
we can write
p
= p = p +
p
p
[p(cosh 1) p sinh ] = p[cosh sinh ]
= pe
(5.52)
so the transformed momentum reaches zero only in the limit . This
means that the point p = 0 does not belong to the spectrum of the momen-
tum of any massless particle
36
. Then we see that for massless particles the
mass hyperboloid (5.7) degenerates to a cone (5.51) with the point p = 0
deleted (see Fig. 5.2). Therefore, the spectrum of velocity V = Pc
2
/H is the
surface of a sphere [v[ = c. This means that massless particles can move only
with the speed of light in any reference frame. This is the famous second
postulate of Einsteins special theory of relativity.
Statement 5.1 (invariance of the speed of light) The speed of massless
particles (e.g., photons) is equal to c independent on the velocity of the source
and/or observer.
33
We assume that such a value exists in the spectrum of the momentum operator P.
34
The proof of this statement is the same as in equations (5.1) and (5.6).
35
This equality follows from (5.7) if m = 0.
36
The physical meaning of this result is clear because there are no photons with zero
momentum and energy.
156 CHAPTER 5. SINGLE PARTICLES
5.3.2 Representations of the little group
Next we need to construct unitary irreducible representations of the Poincare
group for massless particles. To do that we can slightly modify the method
of induced representations used for massive particles in section 5.1.
We already established that vector p = (0, 0, 0)
37
does not belong to
the momentum spectrum of a massless particle. We also mentioned that
the choice of the standard vector is arbitrary and representations built on
dierent standard vectors are unitarily equivalent. Therefore, in the massless
case we will choose a dierent standard momentum vector
k = (0, 0, 1) (5.53)
The next step is to nd the little group whose transformations leave this
vector invariant. The energy-momentum 4-vector corresponding to the stan-
dard vector (5.53) is (ck, ck) = (c, 0, 0, c). Therefore, in the 4D notation from
Appendix I.1, the matrices S of little group elements must satisfy equation
S
_
_
c
0
0
c
_
_
=
_
_
c
0
0
c
_
_
Since the little group is a subgroup of the Lorentz group, condition (I.3) must
be fullled as well
S
T
gS = g
One can verify that the most general matrix S with these properties has the
form [Wei64a]
S(X
1
, X
2
, ) =
_
_
1 +
1
2
(X
2
1
+ X
2
2
) X
1
X
2
1
2
(X
2
1
+ X
2
2
)
X
1
cos + X
2
sin cos sin X
1
cos X
2
sin
X
1
sin + X
2
cos sin cos X
1
sin X
2
cos
1
2
(X
2
1
+ X
2
2
) X
1
X
2
1
1
2
(X
2
1
+ X
2
2
)
_
_
(5.54)
37
which was chosen as the standard vector for the construction of induced representa-
tions for massive particles in section 5.1
5.3. MASSLESS PARTICLES 157
which depends on three independent real parameters X
1
, X
2
and . The
three generators of these transformations are obtained by dierentiation
T
1
= lim
X
1
,X
2
,0
X
1
S(X
1
, X
2
, ) =
_
_
0 1 0 0
1 0 0 1
0 0 0 0
0 1 0 0
_
_
=
x
c/
y
T
2
= lim
X
1
,X
2
,0
X
2
S(X
1
, X
2
, ) =
_
_
0 0 1 0
0 0 0 0
1 0 0 1
0 0 1 0
_
_
=
y
+ c/
x
R = lim
X
1
,X
2
,0
S(X
1
, X
2
, ) =
_
_
0 0 0 0
0 0 1 0
0 1 0 0
0 0 0 0
_
_
=
z
where
and
/ are Lorentz group generators (I.13) and (I.14). The commu-
tators are easily calculated
[T
1
, T
2
] = 0
[R, T
2
] = T
1
[R, T
1
] = T
2
These are commutation relations of the Lie algebra for the group of trans-
lations (T
1
and T
2
) and rotations (R) in a 2D plane.
The next step is to nd the full set of unitary irreducible representations of
the little group constructed above. We will do that by following the induced
representation prescription outlined at the end of subsection 5.1.3. First we
introduce three Hermitian operators
= (
1
,
2
) and J
z
, which provide
a representation of the Lie algebra generators T and R, respectively.
38
So,
38
One can notice a formal analogy of operators
and with 2-dimensional momen-
tum and angular momentum, respectively.
158 CHAPTER 5. SINGLE PARTICLES
little group translations and rotations are represented in the subspace 1
k
39
by unitary operators e
1
x
, e
2
y
and e
.
Next we should clarify the structure of the Hilbert subspace 1
k
, keeping
in mind that this subspace should carry an irreducible representation of the
little group. Suppose that the subspace 1
k
contains a state vector [ which
is an eigenvector of
with a nonzero momentum ,= 0
[ = (
1
,
2
)[
Then the rotated vector
e
[
1
,
2
= [
1
cos +
2
sin ,
1
sin
2
cos (5.55)
also belongs to the subspace 1
k
. Vectors (5.55) form a circle
2
1
+
2
2
= const
in the 2D momentum plane. The linear span of these vectors form an
innite-dimensional Hilbert space. This means that 1
k
, is innite-dimensional.
If we used this representation of the little group to build the unitary irre-
ducible representation of the full Poincare group,
40
we would obtain massless
particles having an innite number of internal (spin) degrees of freedom, or
continuous spin. Such particles have not been observed in nature, so we will
not discuss this possibility further. The only case having relevance to physics
is the zero-radius circle = 0. These vectors form a one-dimensional irre-
ducible subspace 1
k
, where translations are represented trivially
e
r
[ = 0 = [ = 0 (5.56)
and rotations around the z-axis are represented by unimodular factors
e
[ = 0 e
Jz
[ = 0 = e
i
[ = 0 (5.57)
The allowed values of the parameter can be obtained from the fact that the
representation must be either single- or double-valued.
41
Therefore, the rota-
tion through the angle = 2 can be represented by either 1 or -1 and must
39
This is the eigensubspace of the momentum operator, corresponding to the standard
eigenvector k.
40
see next subsection
41
see Statement 3.2
5.3. MASSLESS PARTICLES 159
be either integer or half-integer number: = . . . , 1, 1/2, 0, 1/2, 1, . . .. We
will refer to the parameter as to helicity.
42
This parameter distinguishes
dierent massless unitary irreducible representations of the Poincare group,
i.e., dierent types of elementary massless particles.
5.3.3 Massless representations of the Poincare group
In the preceding subsection we have built an unitary representation of the
little group (which is a subgroup of the Poincare group) in the 1-dimensional
subspace 1
k
of the standard momentum k = (0, 0, 1). In this subsection our
goal is to build irreducible unitary representations of the full Poincare group
in the entire Hilbert space
1 =
p
1
p
(5.58)
of a massless particle with helicity .
First we would like to build a basis in the Hilbert space 1. To do that
we choose an arbitrary basis vector [k, in the subspace 1
k
. Similarly to
what we did in the massive case, we are going to propagate this basis vector
to other values of momentum p by using transformations from the Lorentz
group. So, we need to dene elements
p
of the Lorentz group, which connect
the standard momentum k with all other momenta p
p
k = p
Just as in the massive case, the choice of the set of transformations
p
is not
unique. However, one can show that representations of the Poincare group
constructed with dierently chosen
p
are unitarily equivalent. So, we are
free to choose any set
p
, which makes our calculations more convenient.
Our decision is to dene
p
as a Lorentz boost along the z-axis
B
p
=
_
_
cosh 0 0 sinh
0 1 0 0
0 0 1 0
sinh 0 0 cosh
_
_
(5.59)
42
Note that is eigenvalue of the helicity operator (J P)/P.
160 CHAPTER 5. SINGLE PARTICLES
followed by a rotation R
p
which brings direction k = (0, 0, 1) to
p
p
p
= R
p
B
p
(5.60)
(see Fig. 5.4). The rapidity of the boost = log(p) is such that the length
of B
p
k is equal to p.
43
The absolute value of the rotation angle in R
p
is
cos =
_
k
p
p
_
= p
z
/p (5.61)
and the direction of
is
=
_
k
p
p
_
/ sin = (p
2
x
+ p
2
y
)
1/2
(p
y
, p
x
, 0)
The full basis [p, in 1 is now obtained by propagating the basis vector
[k, to other xed momentum subspaces 1
p
44
[p, =
1
_
[p[
U(
p
; 0, 0)[k, (5.62)
Here we used notation from subsection 3.2.4 in which U(
p
; 0, 0) denotes
the unitary representative of the Poincare transformation consisting of the
Lorentz transformation
p
and zero translations in space and time.
The next step is to consider how general elements of the Poincare group
act on the basis vectors (5.62). First we apply a general transformation from
the Lorentz subgroup U(; 0, 0) to an arbitrary basis vector [p,
U(; 0, 0)[p, = U(; 0, 0)U(
p
; 0, 0)[k,
= U(
p
; 0, 0)U(
1
p
; 0, 0)U(; 0, 0)U(
p
; 0, 0)[k,
= U(
p
; 0, 0)U(
1
p
p
; 0, 0)[k,
= U(
p
; 0, 0)U(B
1
p
R
1
p
R
p
B
p
; 0, 0)[k,
43
see formula (5.52)
44
compare with equations (5.5) and (5.27)
5.3. MASSLESS PARTICLES 161
pp
x x
pp
zz
00
pp
pp
BB
p p
BB
p p
RR
p p
RR
p p
k=(0,0,1)
Figure 5.4: Each point p (except p = 0) in the momentum space of a massless
particle can be reached from the standard vector k = (0, 0, 1) by applying a
boost B
p
along the z-axis followed by a rotation R
p
.
The product of Lorentz group transformations
1
p
p
= B
1
p
R
1
p
R
p
B
p
on the right hand side brings vector k back to k (see Fig. 5.4), therefore
this product is an element of the little group. The translation part of this
element is irrelevant for us due to equation (5.56). The relevant angle of
rotation around the z-axis is called the Wigner angle
W
(p, ).
45
According
to equation (5.57), this rotation acts as multiplication by a unimodular factor
U(
1
p
p
; 0, 0)[k, = e
i
W
(p,)
[k,
Thus, taking into account (5.62) we can write for arbitrary Lorentz transfor-
mation
U(; 0, 0)[p, = U(
p
; 0, 0)e
i
W
(p,)
[k, =
_
[p[
_
[p[
e
i
W
(p,)
[p,
For a general Poincare group element we nally obtain a transformation that
is similar to the massive case result (5.28)
U(; r, t)[p, =
[p[
[p[
e
pr+
ic
|p|t
e
i
W
(p,)
[p, (5.63)
45
Explicit expressions for the Wigner rotation angle can be found in [Rit61, CR].
162 CHAPTER 5. SINGLE PARTICLES
As was mentioned in the beginning of this chapter, photons are described
by a reducible representation of the Poincare group which is a direct sum
of two irreducible representations with helicities = 1 and = 1. In the
classical language these two irreducible components correspond to the left
and right circularly polarized light.
5.3.4 Doppler eect and aberration
To illustrate results obtained in this section, here we are going to derive well-
known formulas for the Doppler eect and the aberration of light. These for-
mulas connects energies and propagation directions of photons viewed from
reference frames in relative motion.
We denote H(0) the photons energy and P(0) its momentum in the
reference frame O at rest. We also denote H() and P() the photons energy
and momentum in the reference frame O
tanh.
Then using (4.4) we obtain the usual formula for the Doppler eect
46
H() = H(0) cosh cP(0)
sinh
= H(0) cosh
_
1
cP(0)P(0)
H(0)P(0)
tanh
_
= H(0) cosh
_
1
v
c
cos
0
_
(5.64)
where we denoted the angle between the direction of photons propagation
(seen in the reference frame O) and the direction of movement of the reference
frame O
with respect to O
cos
0
P(0)
P(0)
(5.65)
Sometimes the Doppler eect formula is written in another form where
the angle between the photon momentum and the reference frame velocity
is measured from the point of view of O
46
The frequency of light is proportional to the photons energy (H = ), so our formula
(5.64) applies to frequencies as well.
5.3. MASSLESS PARTICLES 163
cos
P()
P()
(5.66)
From (4.4) we can write
H(0) = H() cosh + cP()
sinh
= H() cosh
_
1 +
cP()P()
H()P()
tanh
_
= H() cosh
_
1 +
v
c
cos
_
Therefore
H() =
H(0)
cosh (1 +
v
c
cos )
(5.67)
The dierence between angles
0
and , i.e., the dependence of the direc-
tion of light propagation on the observer is known as the aberration eect.
In order to see the same star in the sky observers O and O
. The
connection between these angles can be found by taking the scalar product
of both sides of (4.3) with
move with the same speed c and arrive to Earth at the same
time. Two stars are seen by O in the same region of the sky independent
on the velocity v. We also assume that sources S and S
registered by O
seen by
O we can apply formula (5.67) with the opposite sign of velocity
5.3. MASSLESS PARTICLES 165
h() =
h(0)
cosh (1
v
c
cos )
(5.68)
where is the angle between velocity v of the star S
) = (f
1
(x))
1
see sections 1.3 - 1.5
6.1. HILBERT SPACE OF A MANY-PARTICLE SYSTEM 169
The same properties are valid for the mapping f
2
: /
2
/.
(II) The results of measurements on two subsystems are independent.
This means that in the compound system all propositions about subsystem
1 are compatible with all propositions about subsystem 2:
f
1
(x
1
) f
2
(x
2
)
where x
1
/
1
, x
2
/
1
(III) If we have full information about subsystems 1 and 2, then we have
full information about the combined system. Therefore, if x
1
/
1
and x
2
/
2
are atoms then the meet of their images f
1
(x
1
) f
2
(x
2
) is also an atomic
proposition in /.
The following theorem [Mat75, AD78b] allows us to translate the above
properties of the compound system from the language of quantum logic to
the more convenient language of Hilbert spaces.
Theorem 6.2 (Matolcsi) Suppose that 1
1
, 1
2
and 1 are three complex
Hilbert spaces corresponding to the propositional lattices /
1
, /
2
and /, re-
spectively. Suppose also that f
1
and f
2
are two mappings satisfying all condi-
tions from Postulate 6.1. Then the Hilbert space 1 of the compound system
is either one of the four tensor products
2
1 = 1
1
1
2
, or 1 = 1
1
1
2
,
or 1 = 1
1
1
2
, or 1 = 1
1
1
2
.
The proof of this theorem is beyond the scope of our book.
So we have four ways to couple two one-particle Hilbert spaces into one
two-particle Hilbert space. Quantum mechanics uses only the rst possibility
1 = 1
1
1
2
.
3
This means that if particle 1 is in a state [1 1
1
and particle
2 is in a state [2 1
2
, then the state of the compound system is described
by the vector [1 [2 1
1
1
2
.
2
For denition of the tensor product of two Hilbert spaces see Appendix F.4. The star
denotes a dual Hilbert space as described in Appendix F.3.
3
It is not yet clear what is the physical meaning of the other three possibilities.
170 CHAPTER 6. INTERACTION
6.1.2 Particle observables in a multiparticle system
The mappings f
1
and f
2
from Postulate 6.1 map propositions (projections)
from Hilbert spaces of individual particles 1
1
and 1
2
into the Hilbert space
1 = 1
1
1
2
of the compound system. Therefore, they also map particle
observables from 1
1
and 1
2
to 1. For example, consider an 1-particle ob-
servable G
(1)
that is represented in the Hilbert space 1
1
by the Hermitian
operator with a spectral decomposition (1.28)
G
(1)
=
g
gP
(1)
g
Then the mapping f
1
transforms G
(1)
into a Hermitian operator f
1
_
G
(1)
_
in
the Hilbert space 1 of the compound system
f
1
(G
(1)
) =
g
gf
1
_
P
(1)
g
_
which has the same spectrum g as G
(1)
. Thus we conclude that observables
of individual particles, e.g., P
1
, R
1
in 1
1
and P
2
, R
2
in 1
2
have well-dened
meanings in the Hilbert space 1 of the combined system.
In what follows we will use small letters to denote observables of individual
particles in 1.
4
For example, the position and momentum of the particle 1
in the two-particle system will be denoted as p
1
and r
1
. The operator of
energy of the particle 1 will be written as h
1
=
_
m
2
1
c
4
+ p
2
1
c
2
, etc. Similarly,
observables of the particle 2 in 1 will be denoted as p
2
, r
2
and h
2
. According
to Postulate 6.1(II), spectral projections of observables of dierent particles
commute with each other in 1. Therefore, observables of dierent particles
commute with each other as well.
Just as in the single-particle case discussed in chapter 5, two-particle
states can be also described by wave functions. From the properties of the
tensor product of Hilbert spaces it can be derived that if
1
(r
1
) is the wave
function of particle 1 and
2
(r
2
) is the wave function of particle 2, then the
wave function of the compound system is simply a product
4
We will keep using capital letters for the total observables (H, P, J and K) of the
compound system.
6.1. HILBERT SPACE OF A MANY-PARTICLE SYSTEM 171
(r
1
, r
2
) =
1
(r
1
)
2
(r
2
) (6.1)
In this case, both particles 1 and 2 and the compound system are in pure
quantum states. However, the most general pure 2-particle state in 1
1
1
2
is described by a general (normalizable) function of two vector variables
(r
1
, r
2
) which is not necessarily expressed in the product form (6.1). In this
case, individual subsystems are in mixed states: the results of measurements
performed on the particle 1 are correlated with the results of measurements
performed on the particle 2, even though the particles do not interact with
each other. The existence of such entangled states is a distinctive feature of
quantum mechanics, which is not present in the classical world.
6.1.3 Statistics
The above construction of the two-particle Hilbert space 1 = 1
1
1
2
is
valid when particles 1 and 2 belong to dierent species. If particles 1 and
2 are identical, then there are vectors in 1
1
1
2
that do not correspond
to physically realizable states and the Hilbert space of states is less than
1
1
1
2
. Indeed, if two particles 1 and 2 are identical, then no measurable
quantity will change if these particles are interchanged. Therefore, after
permutation of two particles, the wave function may at most acquire an
insignicant unimodular phase factor
(r
2
,
2
; r
1
,
1
) = (r
1
,
1
; r
2
,
2
) (6.2)
If we swap the particles again then the original wave function must be re-
stored
(r
1
,
1
; r
2
,
2
) = (r
2
,
2
; r
1
,
1
) =
2
(r
1
,
1
; r
2
,
2
)
Therefore
2
= 1, which implies that the factor for any physical state
(r
1
,
1
; r
2
,
2
) in 1 can be either 1 or -1. If a vector in 1 does not have
this property, then this vector does not correspond to a physically realizable
state. Thus the Hilbert space of physical states of two identical particles is
only a subspace in 1.
172 CHAPTER 6. INTERACTION
Is it possible that in a system of two identical particles one state
1
(r
1
,
1
; r
2
,
2
)
has factor equal to 1
1
(r
1
,
1
; r
2
,
2
) =
1
(r
2
,
2
; r
1
,
1
) (6.3)
and another state
2
(r
1
,
1
; r
2
,
2
) has factor equal to -1?
2
(r
1
,
1
; r
2
,
2
) =
2
(r
2
,
2
; r
1
,
1
) (6.4)
If equations (6.3) and (6.4) were true, then the linear combination of the
states
1
and
2
(r
1
,
1
; r
2
,
2
) = a
1
(r
1
,
1
; r
2
,
2
) + b
2
(r
1
,
1
; r
2
,
2
)
would not transform like (6.2) after permutation
(r
2
,
2
; r
1
,
1
) = a
1
(r
2
,
2
; r
1
,
1
) + b
2
(r
2
,
2
; r
1
,
1
)
= a
1
(r
1
,
1
; r
2
,
2
) b
2
(r
1
,
1
; r
2
,
2
)
,= (r
1
,
1
; r
2
,
2
)
It then follows that the factor must be the same for all states in the Hilbert
space 1 of the system of two identical particles. This result implies that all
particles in nature are divided in two categories: bosons and fermions.
For bosons = 1 and two-particle wave functions are symmetric with
respect to permutations. Wave functions of two bosons form a linear subspace
1
1
sym
1
2
1
1
1
2
. This means, in particular, that two identical bosons
may occupy the same quantum state, i.e., the wave function (r
1
,
1
)(r
2
,
2
)
belongs to the bosonic subspace 1
1
sym
1
2
.
For fermions, = 1 and two-particle wave functions are antisymmetric
with respect to permutations of particle variables. The Hilbert space of two
identical fermions is the subspace of antisymmetric functions 1
1
asym
1
2
1
1
1
2
. This means, in particular, that two identical fermions may not
occupy the same quantum state (this is called the Pauli exclusion principle),
i.e., the wavefunction (r
1
,
1
)(r
2
,
2
) does not belong to the antisymmetric
fermionic subspace 1
1
asym
1
2
.
6.2. RELATIVISTIC HAMILTONIAN DYNAMICS 173
A remarkable spin-statistics theorem has been proven in the framework
of quantum eld theory. This theorem establishes (in full agreement with
experiment) that the symmetry of two-particle wave functions is related to
their spin: all particles with integer spin (e.g., photons) are bosons and
all particles with half-integer spin (e.g., neutrinos, electrons, protons) are
fermions.
All results of this section can be immediately generalized to the case of
n-particle system, where n > 2. For example, the Hilbert space of n identical
bosons is the symmetrized tensor product 1 = 1
1
sym
1
2
sym
. . .
sym
1
n
and the Hilbert space of n identical fermions is the antisymmetrized tensor
product 1 = 1
1
asym
1
2
asym
. . .
asym
1
n
.
6.2 Relativistic Hamiltonian dynamics
To complete our description of the 2-particle system initiated in the preceding
section we need to specify an unitary representation U
g
of the Poincare group
in the Hilbert space 1 = 1
1
1
2
.
5
We already know from chapter 4
that generators of this representation (and some functions of generators)
will dene total observables of the compound system. From subsection 6.1.2
we also know how to dene observables of individual particles in 1. If we
assume that total observables in 1 may be expressed as functions of particle
observables p
1
, r
1
, p
2
and r
2
, then the construction of U
g
is equivalent to
nding 10 Hermitian operator functions
H(p
1
, r
1
, p
2
, r
2
) (6.5)
P(p
1
, r
1
, p
2
, r
2
) (6.6)
J(p
1
, r
1
, p
2
, r
2
) (6.7)
K(p
1
, r
1
, p
2
, r
2
) (6.8)
which satisfy commutation relations of the Poincare Lie algebra (3.52) -
(3.58). Even in the two-particle case this problem does not have a unique so-
lution and additional physical principles should be applied to make sure that
generators (6.5) - (6.8) are selected in agreement with observations. For a
general multiparticle system, the construction of the representation U
g
is the
5
For simplicity we will assume that particles 1 and 2 are massive, spinless and not
identical.
174 CHAPTER 6. INTERACTION
most dicult and the most important part of relativistic quantum theories.
A large portion of the rest of this book is devoted to the analysis of dierent
ways to construct representation U
g
. It is important to understand that once
this step is completed, we get everything we need for a full description of the
physical system and for comparison with experimental data.
6.2.1 Non-interacting representation of the Poincare
group
There are innitely many ways to dene the representation U
g
of the Poincare
group, in the Hilbert space 1 = 1
1
1
2
. Let us start our analysis from one
legitimate choice which has a transparent physical meaning. We know from
chapter 5 that one-particle Hilbert spaces 1
1
and 1
2
carry irreducible unitary
representations U
1
g
and U
2
g
of the Poincare group. Functions f
1
and f
2
dened
in subsection 6.1.1 allow us to map these representations to the Hilbert space
1 of the compound system, i.e., to have two representations of the Poincare
group f
1
(U
1
g
) and f
2
(U
2
g
) in 1.
6
We can then dene a new representation
U
0
g
of the Poincare group in 1 by making a (tensor product) composition of
f
1
(U
1
g
) and f
2
(U
2
g
). More specically, for any vector [1 [2 1 we dene
U
0
g
([1 [2) = f
1
(U
1
g
)[1 f
2
(U
2
g
)[2 (6.9)
and the action of U
0
g
on other vectors in 1 follows by linearity. Represen-
tation (6.9) is called the tensor product of unitary representations U
1
g
and
U
2
g
and is denoted by U
0
g
= U
1
g
U
2
g
. Generators of this representation are
expressed as sums of one-particle generators
H
0
= h
1
+ h
2
(6.10)
P
0
= p
1
+p
2
(6.11)
J
0
= j
1
+j
2
(6.12)
K
0
= k
1
+k
2
(6.13)
The Poincare commutation relations for generators (6.10) - (6.13) follow im-
mediately from the facts that one-particle generators corresponding to par-
6
These representations are no longer irreducible, of course. For example, f
1
(U
1
g
) is a
direct sum of (innitely) many irreducible representations isomorphic to U
1
g
.
6.2. RELATIVISTIC HAMILTONIAN DYNAMICS 175
ticles 1 and 2 satisfy Poincare commutation relations separately and that
operators of dierent particles commute with each other.
With denitions (6.10) - (6.13), inertial transformations of particle ob-
servables with respect to the representation U
0
g
are easy to nd. For example,
positions of particles 1 and 2 change with time as
r
1
(t) = e
i
H
0
t
r
1
e
H
0
t
= e
i
h
1
t
r
1
e
h
1
t
= r
1
+v
1
t
r
2
(t) = r
2
+v
2
t
Comparing this with equation (4.55), we conclude that all observables of
particles 1 and 2 transform independently from each other as if these particles
were alone. So, the representation (6.10) - (6.13) corresponds to the absence
of interaction and is called the non-interacting representation of the Poincare
group.
6.2.2 Diracs forms of dynamics
Obviously, the simple choice of generators (6.10) - (6.13) is not realistic, be-
cause particles in nature do interact with each other. Therefore, to describe
interactions in multi-particle systems one should choose an interacting repre-
sentation U
g
of the Poincare group in 1 which is dierent from U
0
g
. First we
write the generators (H, P, J, K) of the desired representation U
g
in the most
general form where all generators are dierent from their non-interacting
counterparts by the presence of interaction terms denoted by V, U, Y, Z
7
H = H
0
+ V (r
1
, p
1
, r
2
, p
2
) (6.14)
P = P
0
+U(r
1
, p
1
, r
2
, p
2
) (6.15)
J = J
0
+Y(r
1
, p
1
, r
2
, p
2
) (6.16)
K = K
0
+Z(r
1
, p
1
, r
2
, p
2
) (6.17)
7
Our approach to the description of interactions based on equations (6.14) - (6.17)
and their generalizations for multiparticle systems is called the relativistic Hamiltonian
dynamics [KP91]. For completeness, we should mention that there is a number of other
methods for describing interactions which can be called non-Hamiltonian. Overviews of
these methods and further references can be found in [Kei, DW65, Pol85]. We will not
discuss the non-Hamiltonian approaches in this book.
176 CHAPTER 6. INTERACTION
It may happen that some interaction operators on the right hand sides of
equations (6.14) - (6.17) are zero. Then these generators and corresponding
nite transformations coincide with generators and transformations of the
non-interacting representation U
0
g
. Such generators and transformations are
called kinematical. Generators which contain interaction terms are called
dynamical.
Table 6.1: Comparison of three relativistic forms of dynamics
Instant form Point form Front form
Kinematical generators
P
0x
K
0x
P
0x
P
0y
K
0y
P
0y
P
0z
K
0z
1
2
(H
0
+ P
0z
)
J
0x
J
0x
1
2
(K
0x
+ J
0y
)
J
0y
J
0y
1
2
(K
0y
J
0x
)
J
0z
J
0z
J
0z
K
0z
Dynamical generators
H H
1
2
(H P
z
)
K
x
P
x
1
2
(K
x
J
y
)
K
y
P
y
1
2
(K
y
+ J
x
)
K
z
P
z
The description of interaction by equations (6.14) - (6.17) generalizes
traditional classical non-relativistic Hamiltonian dynamics in which the only
dynamical generator is the Hamiltonian H. To make sure that our theory
reduces to the familiar non-relativistic approach in the limit c , we
will also assume that time translations are generated by a dynamical Hamil-
tonian H = H
0
+ V . The choice of other generators is restricted by the
observation that kinematical transformations should form a subgroup of the
Poincare group, so that kinematical generators should form a subalgebra of
the Poincare Lie algebra.
8
The set (P, J, K) does not form a subalgebra.
This explains why in the relativistic case we cannot introduce interaction
in the Hamiltonian alone. We must add interaction terms to some of the
8
Indeed, if two generators A and B do not contain interaction terms, then their com-
mutator [A, B] should be interaction-free as well.
6.3. INSTANT FORM OF DYNAMICS 177
other generators P, J, or K in order to be consistent with relativity. We will
say that interacting representations having dierent kinematical subgroups
belong to dierent forms of dynamics. In his famous paper [Dir49], Dirac
provided a classication of forms of dynamics based on this principle. There
are three Diracs forms of dynamics most frequently discussed in the litera-
ture. They are shown in Table 6.1. In the case of the instant form dynamics
the kinematical subgroup is the subgroup of spatial translations and rota-
tions. In the case of the point form dynamics the kinematical subgroup is
the Lorentz subgroup [Tho52]. In both mentioned cases the kinematical sub-
group has dimension 6. The front form dynamics has the largest number (7)
of kinematical generators.
6.2.3 Total observables in a multiparticle system
Once the interacting representation of the Poincare group and its generators
(H, P, J, K) are dened, we immediately have expressions for total observ-
ables of the physical system considered as a whole. These are the total energy
H, the total momentum P and the total angular momentum J. Other total
observables of the system (the total mass M, spin S, center-of-mass position
R, etc.) can be obtained as functions of these generators by formulas derived
in chapter 4.
Note also that inertial transformations of the total observables (H, P, J, K)
coincide with those presented in chapter 4. This is because total observables
coincide with the Poincare group generators and this coincidence is indepen-
dent on the interaction present in the system. For example, the total energy
H and the total momentum P form a 4-vector whose boost transformations
are given by equation (4.3) - (4.4). Boost transformations of the center-of-
mass position R are derived in subsection 4.3.8. Time translations result
in a uniform movement of the center-of-mass with constant velocity along a
straight line (4.55). Thus we conclude that inertial transformations of total
observables are completely independent on the form of dynamics and on the
details of interactions acting within the multiparticle system.
6.3 Instant form of dynamics
In sections 15.2 and 13.5 we will see that instant form of dynamics agrees
with observations better than other forms. So, in this book we will prefer to
178 CHAPTER 6. INTERACTION
use instant form interactions to describe realistic systems.
6.3.1 General instant form interaction
In the instant form we can rewrite equations (6.14) - (6.17) as
H = H
0
+ V (6.18)
P = P
0
(6.19)
J = J
0
(6.20)
K = K
0
+Z (6.21)
As we discussed in subsections 4.1.1 and 6.2.3, the observables H, P, J and
K are total observables that correspond to the compound system as a whole.
The total momentum P (6.11) and the total angular momentum J (6.12) are
simply vector sums of the corresponding operators for individual particles.
The total energy H and the boost operator K are written as sums of one-
particle operators plus interaction terms. The interaction term V in the
energy operator is usually called the potential energy operator. Similarly, we
will call Z the potential boost. It is important to note that in an instant-form
relativistic interacting system the potential boost operator cannot vanish.
We will see later in this book that the non-vanishing boost interaction has
a profound eect on transformations of observables between moving reference
frames. The potential boost Z will play a crucial role in our non-traditional
approach to relativity.
Other total observables (e.g., the total mass M, spin S, center-of-mass
position R and its velocity V, etc.) are dened as functions of generators
(6.18) - (6.21) by formulas from chapter 4. For interacting systems, these
Hermitian operators may become interaction-dependent as well.
Even with interaction potentials V and Z present, ten operators (6.18)
- (6.21) must obey Poincare commutation relations (3.52) - (3.58). This
requirement leads to the following equivalent relationships
[J
0
, V ] = [P
0
, V ] = 0 (6.22)
[Z
i
, P
0j
] =
i
ij
c
2
V (6.23)
6.3. INSTANT FORM OF DYNAMICS 179
[J
0i
, Z
j
] = i
3
k=1
ijk
Z
k
(6.24)
[K
0i
, Z
j
] + [Z
i
, K
0j
] + [Z
i
, Z
j
] = 0 (6.25)
[Z, H
0
] + [K
0
, V ] + [Z, V ] = 0 (6.26)
So, the task of constructing a Poincare invariant theory of interacting parti-
cles has been reduced to nding a non-trivial solution for the set of equations
(6.22) - (6.26) with respect to V and Z. These equations are necessary and
sucient conditions for the Poincare invariance of our theory.
6.3.2 Bakamjian-Thomas construction
The set of equations (6.22) - (6.26) is rather complicated. The rst non-
trivial solution of these equations for multiparticle systems was found by
Bakamjian and Thomas [BT53]. The idea of their approach was as follows.
Instead of working with 10 generators (P, J, K, H), it is convenient to use
an alternative set of operators P, R, S, M introduced in subsection 4.3.4.
Denote P
0
, R
0
, S
0
, M
0
and P
0
, R, S, M the sets of operators obtained by
using formulas (4.42) - (4.44) from the non-interacting (P
0
, J
0
, K
0
, H
0
) and
interacting (P
0
, J
0
, K, H) generators, respectively. In a general instant form
dynamics all three operators R, S and M may contain interaction terms.
However, Bakamjian and Thomas decided to look for a simpler solution in
which the position operator remains kinematical R = R
0
. It then immedi-
ately follows that the spin operator is kinematical as well
S = J [RP] = J
0
[R
0
P
0
] = S
0
Then interaction term N is present in the mass operator only.
M = M
0
+ N
From commutators (6.22), the interaction N must satisfy
[P
0
, N] = [R
0
, N] = [J
0
, N] = 0 (6.27)
So, we have reduced our task of solving (6.22) - (6.26) to a simpler problem
of nding one operator N satisfying conditions (6.27). Indeed, by knowing N
180 CHAPTER 6. INTERACTION
and non-interacting operators M
0
, P
0
, R
0
, S
0
, we can restore the interacting
generators using formulas (4.45) - (4.47)
P = P
0
(6.28)
H = +
_
M
2
c
4
+ P
2
0
c
2
(6.29)
K =
1
2c
2
(R
0
H + HR
0
)
[P
0
S
0
]
Mc
2
+ H
(6.30)
J = J
0
= [R
0
P
0
] +S
0
(6.31)
Now let us turn to the construction of N in the case of two massive
spinless particles. Suppose that we found two vector operators and such
that they form a 6-dimensional Heisenberg Lie algebra
[
i
,
j
] = i
ij
(6.32)
[
i
,
j
] = [
i
,
j
] = 0 (6.33)
commuting with the center-of-mass position R
0
and the total momentum P
0
.
[, P
0
] = [, R
0
] = [ , P
0
] = [ , R
0
] = 0 (6.34)
Suppose also that these relative operators have the following non-relativistic
(c ) limits
p
1
p
2
r
1
r
2
Then observables and can be interpreted as relative momentum and rela-
tive position in the two-particle system, respectively. Moreover, any operator
in the Hilbert space 1 can be expressed either as a function of (p
1
, r
1
, p
2
, r
2
)
or as a function of (P
0
, R
0
, , ). Moreover, the interaction operator N sat-
isfying conditions [N, P
0
] = [N, R
0
] = 0 can be expressed as a function of
and only. To satisfy the last condition [N, J
0
] = 0 we will simply require
N to be a function of rotationally invariant combinations of the 2-particle
relative observables
6.3. INSTANT FORM OF DYNAMICS 181
N = N(
2
,
2
, ( )) (6.35)
In this ansatz, the problem of building a relativistically invariant interaction
has reduced to nding operators of relative positions and momenta sat-
isfying equations (6.32) - (6.34). This problem has been solved in a number
of works [BT53, BF62, Osb68, FS64]. We will not need explicit formulas for
the operators of relative observables, so we will not reproduce them here.
For systems of n massive spinless particles (n > 2) similar arguments
apply, but instead of one pair of relative operators and we will have n1
pairs,
r
,
r
, r = 1, 2, . . . , n 1 (6.36)
These operators should form a 6(n1)-dimensional Heisenberg algebra com-
muting with P
0
and R
0
. Explicit expressions for
r
and
r
were constructed,
e.g., in ref. [Cha64]. As soon as these expressions are found, we can build
a Bakamjian-Thomas interaction in an n-particle system by dening the in-
teraction N as a function of rotationally invariant combinations of relative
operators (6.36)
N = N(
2
1
,
2
1
, (
1
1
),
2
2
,
2
2
, (
2
2
), (
1
2
), (
2
1
), . . .) (6.37)
6.3.3 Non-Bakamjian-Thomas instant forms of dynam-
ics
In the Bakamjian-Thomas construction, it was assumed that R = R
0
, but
this limitation is rather articial and we will see later that realistic particle
interactions do not satisfy this condition. Any other variant of the instant
form dynamics has position operator R dierent from the non-interacting
Newton-Wigner position R
0
. Let us now establish a connection between
such a general instant form interaction and the Bakamjian-Thomas form. We
are going to demonstrate that corresponding representations of the Poincare
group are related by a unitary transformation.
182 CHAPTER 6. INTERACTION
Suppose that operators
(P
0
, J
0
, K, H) (6.38)
dene a Bakamjian-Thomas dynamics. Let us now choose an unitary op-
erator W commuting with P
0
and J
0
.
9
and apply this transformation to
generators (6.38).
J
0
= WJ
0
W
1
(6.39)
P
0
= WP
0
W
1
(6.40)
K
= WKW
1
(6.41)
H
= WHW
1
(6.42)
Since unitary transformations preserve commutators
W[A, B]W
1
= [WAW
1
, WBW
1
]
the transformed generators (6.39) - (6.42) satisfy commutation relations of
the Poincare Lie algebra in the instant form. However, generally, the new
mass operator M
= c
2
_
(H
)
2
P
2
0
c
2
does not commute with R
0
, so (6.39)
- (6.42) are not necessarily in the Bakamjian-Thomas form.
Thus we have a way to build a non-Bakamjian-Thomas instant form repre-
sentation (P
0
, J
0
, K
, H
) if a Bakamjian-Thomas representation (P
0
, J
0
, K, H)
is given. However, this construction does not answer the question if all in-
stant form interactions can be connected to the Bakamjian-Thomas dynamics
by a unitary transformation? The answer to this question is yes: For any
instant form interaction
10
P
0
, R
, S
, M
, S
, M
W = P
0
, R
0
, S
0
, M (6.43)
9
In the case of two massive spinless particles such an operator must be a function of
rotationally invariant combinations of vectors P
0
, and .
10
Here it is convenient to use alternative sets of basic operators introduced in subsec-
tion 4.3.4.
6.3. INSTANT FORM OF DYNAMICS 183
To see that, let us consider the simplest two-particle case. Operator
T R
R
0
commutes with P
0
. Therefore, it can be written as a function of P
0
and
relative operators and : T(P
0
, , ). Then one can show that unitary
operator
11
W = e
i
W
J =
_
P
0
0
T(P, , )dP (6.44)
performs the desired transformation (6.43). Indeed
W
1
P
0
W = P
0
W
1
J
0
W = J
0
because W is a scalar, which explicitly commutes with P
0
. Operator J has
the following commutators with the center-of-mass position
[J, R
0
] =
_
R
0
,
_
P
0
0
T(P, , )dP
_
= i
P
0
__
P
0
0
T(P, , )dP
_
= iT(P
0
, , ) = i(R
R
0
)
[J, [J, R
0
]] = 0
Therefore
WR
0
W
1
= e
i
W
R
0
e
W
= R
0
+
i
[J, R
0
]
1
2!
2
[J, [J, R
0
]] + . . .
= R
0
+ (R
R
0
) = R
W
1
R
W = R
0
W
1
S
W = W
1
(J
0
R
P
0
)W = J
0
R
0
P
0
= S
0
11
The integral in (6.44) can be treated formally as an integral of ordinary function
(rather than operator) along the segment [0, P
0
] in the 3D space of variable P
0
with
arguments and being xed.
184 CHAPTER 6. INTERACTION
Finally we can apply transformation W to the mass operator M
and obtain
operator
M = W
1
M
W
which commutes with both R
0
and P
0
. This demonstrates that operators
on the right hand side of (6.43) describe a Bakamjian-Thomas instant form
of dynamics.
6.3.4 Cluster separability
As we saw above, the requirement of Poincare invariance imposes rather
loose conditions on interaction. Relativistic invariance can be satised in
many dierent ways. However, there is another physical requirement which
limits the admissible form of interaction. We know from experiment that all
interactions between particles vanish when particles are separated by large
distances.
12
So, if in a 2-particle system we remove particle 2 to innity by
using the space translation operator e
i
p
2
a
, then interaction (6.35) must tend
to zero
lim
a
e
p
2
a
N(
2
,
2
, ( ))e
i
p
2
a
= 0 (6.45)
This condition is not dicult to satisfy in the two-particle case. However, in
the relativistic multi-particle case the mathematical form of this condition
becomes rather complicated. This is because now there is more than one
way to separate particles in mutually non-interacting groups. The form of
the n-particle interaction (6.37) must ensure that each spatially separated
m-particle group (m < n) behaves as if it were alone. This, in particular,
implies that we cannot independently choose interactions in systems with
dierent number of particles. The interaction in the n-particle sector of the
theory must be consistent with interactions in all m-particle sectors, where
m < n.
Interactions satisfying these conditions are called cluster separable. We
will postulate that all interactions in nature have the property of separability.
12
We are not considering here a hypothetical potential between quarks, which supposedly
grows as a linear function of the distance and results in the connement of quarks inside
hadrons.
6.3. INSTANT FORM OF DYNAMICS 185
Postulate 6.3 (cluster separability of interactions) : All interactions
are cluster separable. This means that for any division of an n-particle system
(n 2) into two spatially separated groups (or clusters) of l and m particles
(l + m = n)
1. the interaction separates too, i.e., the clusters move independent of each
other;
2. the interaction in each cluster is the same as in separate l-particle and
m-particle systems, respectively.
A counterexample of a non-separable interaction can be built in the 4-
particle case. The interaction Hamiltonian
V =
1
[r
1
r
2
[[r
3
r
4
[
(6.46)
has the property that no matter how far two pairs of particles (1+2 and 3+4)
are from each other, the relative distance between 3 and 4 aects the force
acting between particles 1 and 2. Such innite-range interactions are not
present in nature.
In the non-relativistic case the cluster separability is achieved without
much eort. For example, the non-relativistic Coulomb potential energy in
the system of two charged particles is
13
V
12
=
1
[ [
1
[r
1
r
2
[
(6.47)
which clearly satises condition (6.45). In the system of three charged par-
ticles 1, 2 and 3, the potential energy can be written as a simple sum of
two-particle terms
V = V
12
+ V
13
+ V
23
=
1
[r
1
r
2
[
+
1
[r
2
r
3
[
+
1
[r
1
r
3
[
(6.48)
13
Here we are interested just in the general functional form of interaction, so we are not
concerned with putting correct factors in front of the potentials.
186 CHAPTER 6. INTERACTION
The spatial separation between particle 3 and the cluster of particles 1+2 can
be increased by applying a large space translation to the particle 3. In agree-
ment with Postulate 6.3, such a translation eectively cancels interaction
between particles in clusters 3 and 1+2, i.e.
lim
a
e
i
p
3
a
(V
12
+ V
13
+ V
23
)e
p
3
a
= lim
a
1
[r
1
r
2
[
+
1
[r
2
r
3
+a[
+
1
[r
1
r
3
+a[
=
1
[r
1
r
2
[
This is the same potential (6.47) as in an isolated 2-particle system. There-
fore, both conditions (1) and (2) are satised and interaction (6.48) is cluster
separable. As we will see below, in the relativistic case construction of a
general cluster-separable multi-particle interaction is a more dicult task.
Let us now make some denitions which will be useful in discussions
of cluster separability. A smooth m-particle potential V
(m)
is dened as
operator that depends on variables of m particles and tends to zero if any
particle or a group of particles is removed to innity.
14
For example, the
potential (6.47) is smooth while (6.46) is not. Generally, a cluster separable
interaction in a n-particle system can be written as a sum
V =
{2}
V
(2)
+
{3}
V
(3)
+ . . . + V
(n)
(6.49)
where
{2}
V
(2)
is a sum of smooth 2-particle potentials over all pairs of
particles;
{3}
V
(3)
is a sum of smooth 3-particle potentials over all triples of
particles, etc. The example in equation (6.48) is a sum of smooth 2-particle
potentials.
6.3.5 Non-separability of the Bakamjian-Thomas dy-
namics
We expect that the property of cluster separability (Postulate 6.3) must be
valid for both potential energy and potential boosts in realistic interacting
14
In section 8.4 we will explain why we call such potentials smooth.
6.3. INSTANT FORM OF DYNAMICS 187
systems. For example, in the relativistic case of 3 massive spinless particles
with interacting generators
H = H
0
+ V (p
1
, r
1
; p
2
, r
2
; p
3
, r
3
)
K = K
0
+Z(p
1
, r
1
; p
2
, r
2
; p
3
, r
3
)
the cluster separability requires, in particular, that
lim
a
e
i
p
3
a
V (p
1
, r
1
; p
2
, r
2
; p
3
, r
3
)e
i
i
p
3
a
= V
12
(p
1
, r
1
; p
2
, r
2
) (6.50)
lim
a
e
i
p
3
a
Z(p
1
, r
1
; p
2
, r
2
; p
3
, r
3
)e
p
3
a
= Z
12
(p
1
, r
1
; p
2
, r
2
) (6.51)
where V
12
and Z
12
are interaction operators for the 2-particle system.
Let us see if these principles can be satised by Bakamjian-Thomas in-
teractions. In this case the potential energy is
V = H H
0
=
_
(p
1
+p
2
+p
3
)
2
c
2
+ (M
0
+ N(p
1
, r
1
; p
2
, r
2
; p
3
, r
3
))
2
c
4
_
(p
1
+p
2
+p
3
)
2
c
2
+ M
2
0
c
4
By removing particle 3 to innity we obtain
lim
a
e
i
p
3
a
V (p
1
, r
1
; p
2
, r
2
; p
3
, r
3
)e
p
3
a
=
_
(p
1
+p
2
+p
3
)
2
c
2
+ (M
0
+ N(p
1
, r
1
; p
2
, r
2
; p
3
, ))
2
c
4
_
(p
1
+p
2
+p
3
)
2
c
2
+ M
2
0
c
4
(6.52)
According to (6.50) we should require that the right hand side of equation
(6.52) depends only on variables pertinent to particles 1 and 2. Then we
must set
N(p
1
, r
1
; p
2
, r
2
; p
3
, ) = 0
which also means that
188 CHAPTER 6. INTERACTION
V (p
1
, r
1
; p
2
, r
2
; p
3
, ) = V
12
(p
1
, r
1
; p
2
, r
2
) = 0
and interaction in the 2-particle sector 1+2 vanishes. Similarly, we can show
that interaction V tends to zero when either particle 1 or particle 2 is removed
to innity. Therefore, V is a smooth 3-particle potential, and there is no
interaction in any 2-particle subsystem: the interaction turns on only if there
are three or more particles close to each other. This is clearly unphysical.
So, we conclude that the Bakamjian-Thomas construction cannot describe
a non-trivial cluster-separable interaction in many-particle systems (see also
[Mut78]).
6.3.6 Cluster separable 3-particle interaction
The problem of constructing relativistic cluster separable many-particle in-
teractions can be solved by allowing non-Bakamjian-Thomas instant form in-
teractions. Our goal here is to construct the interacting Hamiltonian H and
boost K operators in the Hilbert space 1 = 1
1
1
2
1
3
of a 3-particle sys-
tem so that interaction satises the separability Postulate 6.3, i.e., it reduces
to a non-trivial 2-particle interaction when one of the particles is removed to
innity. In this construction we follow ref. [CP82].
Let us assume that 2-particle potentials V
ij
and Z
ij
, i, j = 1, 2, 3 resulting
from removing particle k ,= i, j to innity are known. They depend on
variables of the i-th and j-th particles only. For example, when particle 3 is
removed to innity, the interacting operators take the form
15
lim
a
e
i
p
3
a
He
p
3
a
= H
0
+ V
12
H
12
(6.53)
lim
a
e
i
p
3
a
Ke
p
3
a
= K
0
+Z
12
K
12
(6.54)
lim
a
e
i
p
3
a
Me
p
3
a
=
1
c
2
_
H
2
12
P
2
0
c
2
M
12
(6.55)
lim
a
e
i
p
3
a
Re
p
3
a
=
c
2
2
(K
12
H
12
+ H
12
K
12
)
c[P
0
W
12
]
M
12
H
12
(M
12
c
2
+ H
12
)
R
12
(6.56)
15
Here we used (4.32) and took into account that [P
0
S
12
] = [P
0
W
12
]/(M
12
c).
Similar equations result from the removal of particles 1 or 2 to innity. They are obtained
from (6.53) - (6.56) by permutation of indices (1,2,3).
6.3. INSTANT FORM OF DYNAMICS 189
where operators H
12
, K
12
, M
12
and R
12
(energy, boost, mass and center-
of-mass position, respectively) will be considered as given. Now we want to
combine the two-particle potentials V
ij
and Z
ij
together in a cluster-separable
3-particle interaction in analogy with (6.48). It appears that we cannot
form the interactions V and Z in the 3-particle system simply as a sum of
2-particle potentials. One can verify that such a denition would violate
Poincare commutators. Therefore
V ,= V
12
+ V
23
+ V
13
Z ,= Z
12
+Z
23
+Z
13
and the relativistic addition of interactions should be more complicated.
When particles 1 and 2 are split apart, operators V
12
and Z
12
must tend
to zero, therefore
lim
a
e
i
p
1
a
M
12
e
p
1
a
= M
0
(6.57)
lim
a
e
i
p
2
a
M
12
e
p
2
a
= M
0
(6.58)
lim
a
e
i
p
3
a
M
12
e
p
3
a
= M
12
(6.59)
The Hamiltonian H
12
and boost K
12
dene an instant form representation
U
12
of the Poincare group in the 3-particle Hilbert space 1. The correspond-
ing position operator (6.56) is generally dierent from the non-interacting
Newton-Wigner position operator
R
0
=
c
2
2
(K
0
H
0
+ H
0
K
0
)
c[P
0
W
0
]
M
0
H
0
(M
0
+ H
0
)
(6.60)
which is characteristic for the Bakamjian-Thomas form of dynamics. How-
ever, we can unitarily transform the representation U
12
, so that it acquires a
Bakamjian-Thomas form with operators R
0
, H
12
, K
12
, M
12
.
16
Let us denote
such an unitary transformation operator by B
12
. We can repeat the same
steps for two other pairs of particles 1+3 and 2+3 and write in the general
case i, j = 1, 2, 3; i ,= j
16
see subsection 6.3.3
190 CHAPTER 6. INTERACTION
B
ij
R
ij
B
1
ij
= R
0
B
ij
H
ij
B
1
ij
= H
ij
B
ij
K
ij
B
1
ij
= K
ij
B
ij
M
ij
B
1
ij
= M
ij
Operators B
ij
= B
12
, B
13
, B
23
commute with P
0
and J
0
. Since representa-
tion U
ij
becomes non-interacting when the distance between particles i and
j tends to innity, we can write
lim
a
e
i
p
3
a
B
13
e
p
3
a
= 1 (6.61)
lim
a
e
i
p
3
a
B
23
e
p
3
a
= 1 (6.62)
lim
a
e
i
p
3
a
B
12
e
p
3
a
= B
12
(6.63)
The transformed Hamiltonians H
ij
and boosts K
ij
dene Bakamjian-Thomas
representations and their mass operators M
ij
now commute with R
0
. So, we
can add M
ij
together to build a new mass operator
M = M
12
+ M
13
+ M
23
2M
0
= B
12
M
12
B
1
12
+ B
13
M
13
B
1
13
+ B
23
M
23
B
1
23
2M
0
which also commutes with R
0
. Using this mass operator, we can build a
Bakamjian-Thomas representation with generators
H =
_
P
2
0
+ M
2
(6.64)
K =
1
2c
2
(R
0
H + HR
0
)
c[P
0
W
0
]
MH(Mc
2
+ H)
(6.65)
This representation has interactions between all particles, however, it does
not satisfy the cluster property yet. For example, by removing particle 3 to
innity we do not obtain the interaction M
12
characteristic for the subsystem
of two particles 1 and 2. Instead, we obtain a unitary transform of M
12
17
17
Here we used (6.57) - (6.59) and (6.61) - (6.63).
6.3. INSTANT FORM OF DYNAMICS 191
lim
a
e
i
p
3
a
Me
p
3
a
= lim
a
e
i
p
3
a
(B
12
M
12
B
1
12
+ B
13
M
13
B
1
13
+ B
23
M
23
B
1
23
2M
0
)e
p
3
a
= B
12
M
12
B
1
12
2M
0
+ lim
a
(e
i
p
3
a
M
13
e
p
3
a
+ e
i
p
3
a
M
23
e
p
3
a
)
= B
12
M
12
B
1
12
2M
0
+ 2M
0
= B
12
M
12
B
1
12
(6.66)
To x this deciency, let us perform a unitary transformation of the repre-
sentation (6.64) - (6.65) with operator B
18
H = B
1
HB (6.67)
K = B
1
KB (6.68)
M = B
1
MB (6.69)
We choose the transformation B from the requirement that it must cancel
factors B
ij
and B
1
ij
in equation (6.66) as particle k is removed to innity. In
other words, B can be any unitary operator, which has the following limits
lim
a
e
i
p
3
a
Be
p
3
a
= B
12
(6.70)
lim
a
e
i
p
2
a
Be
p
2
a
= B
13
(6.71)
lim
a
e
i
p
1
a
Be
p
1
a
= B
23
(6.72)
One can check that one possible choice of B is
B = exp(lnB
12
+ ln B
13
+ ln B
23
)
Indeed, using equations (6.61) - (6.63) we obtain
lim
a
e
i
p
3
a
Be
p
3
a
= lim
a
e
i
p
3
a
exp(ln B
12
+ ln B
13
+ ln B
23
)e
p
3
a
= exp(ln B
12
) = B
12
18
which must commute with P
0
and J
0
, of course, to preserve the instant form of
interaction
192 CHAPTER 6. INTERACTION
Then, it is easy to show that the interacting representation of the Poincare
group generated by operators (6.67) and (6.68) satises cluster separability
properties (6.53) - (6.56). For example,
lim
a
e
i
p
3
a
He
p
3
a
= lim
a
e
i
p
3
a
B
1
HBe
p
3
a
= lim
a
B
1
12
e
i
p
3
a
_
P
2
0
c
2
+ M
2
c
4
e
p
3
a
B
12
= B
1
12
_
P
2
0
c
2
+ (B
12
M
12
B
1
12
)
2
c
4
B
12
=
_
P
2
0
c
2
+ M
2
12
c
4
= H
12
Generally, operator B does not commute with the Newton-Wigner position
operator (6.60). Therefore, the mass operator (6.69) also does not com-
mute with R
0
and the representation generated by operators (P
0
, J
0
, K, H)
does not belong to the Bakamjian-Thomas form. This is consistent with our
conclusion in subsection 6.3.5 that Bakamjian-Thomas dynamics cannot be
made cluster-separable.
Obviously, this method of constructing relativistic cluster-separable in-
teractions is very cumbersome. Moreover, its applicability is limited to inter-
actions that conserve the number of particles. In chapters 9 and 11 we will
consider a simpler and more general approach
19
that can be easily adapted
to physically relevant interactions.
6.4 Bound states and time evolution
We already mentioned that the knowledge of U
g
in the Hilbert space 1 of a
multiparticle system is sucient for getting any desired physical information
about the system. In this section, we would like to make this statement
more concrete by examining two types of information, which can be com-
pared with experiment: the mass and energy spectra of the system and the
time evolution of its observables. In the next section we will discuss scatter-
ing experiments, which are currently the most informative way of studying
microscopic systems.
19
which is based on the idea of quantum elds
6.4. BOUND STATES AND TIME EVOLUTION 193
6.4.1 Mass and energy spectra
The mass operator of a non-interacting 2-particle system is
M
0
= +
1
c
2
_
H
2
0
P
2
0
c
2
= +
1
c
2
_
(h
1
+ h
2
)
2
(p
1
+p
2
)
2
c
2
= +
1
c
2
_
_
m
2
1
c
4
+ p
2
1
c
2
+
_
m
2
2
c
4
+ p
2
2
c
2
_
2
(p
1
+p
2
)
2
c
2
(6.73)
As particles momenta can have any value in the 3D momentum space, the
eigenvalues m of the mass operator have continuous spectrum in the range
m
1
+ m
2
m < (6.74)
where the minimum value of mass m
1
+ m
2
is obtained from (6.73) when
both particles are at rest p
1
= p
2
= 0. It then follows that the common
spectrum of mutually commuting operators P
0
and
H
0
= +
_
M
2
0
c
4
+ P
2
0
c
2
is the union of mass hyperboloids
20
in the 4-dimensional momentum-energy
space. This spectrum is shown by the hatched region in Fig. 6.1(a).
In the presence of interaction, the eigenvalues
n
of the mass operator
M = M
0
+ N can be found by solving the stationary Schr odinger equation
M[
n
=
n
[
n
(6.75)
It is well-known that in the presence of attractive interaction N, new dis-
crete eigenvalues in the mass spectrum may appear below the threshold
m
1
+ m
2
. The eigenvectors of the interacting mass operator with eigen-
values
n
< m
1
+ m
2
are called bound states. The mass eigenvalues
n
are
highly degenerate. For example, if [
n
is an eigenvector corresponding to
n
, then for any Poincare group element g the vector U
g
[
n
is also an eigen-
vector with the same mass eigenvalue.
21
To remove this degeneracy (at least
20
with masses in the interval (6.74)
21
This means that eigensubspaces with xed mass
n
are invariant with respect to
Poincare group actions.
194 CHAPTER 6. INTERACTION
(m
11
+m
22
)c
22
PP
xx
cc
H H
0 0
(m
11
+m
22
)c
22
PP
x x
cc
H H
00
(a) (b)
Figure 6.1: Typical momentum-energy spectrum of (a) non-interacting and
(b) interacting two-particle system.
partially) one can consider operators P
0
and H, which commute with M and
among themselves, so that they dene a basis of common eigenvectors
M[
p,n
=
n
[
p,n
P
0
[
p,n
= p[
p,n
H[
p,n
=
M
2
c
4
+ P
2
c
2
[
p,n
=
_
2
n
c
4
+ p
2
c
2
[
p,n
2
n
c
4
+ p
2
c
2
which are shown in Fig. 6.1(b) below the continuous part of the common
spectrum of P
0
and H. An example of a bound system whose mass spec-
trum has both continuous and discrete parts - the hydrogen atom - will be
considered in greater detail in section 12.2.
6.4.2 Doppler eect revisited
In our discussion of the Doppler eect in subsection 5.3.4 we were interested
in the energy of free photons measured by moving observers or emitted by
6.4. BOUND STATES AND TIME EVOLUTION 195
moving sources. There we applied boost transformations to the energy E of
a free massless photon. It is instructive to look at this problem from another
point of view. Photons are usually emitted by compound massive physical
systems (atoms, molecules, nuclei, etc.) in transitions between two discrete
energy levels E
2
and E
1
, so that the photons energy is
22
E = E
2
E
1
When the source is moving with respect to the observer (or observer is
moving with respect to the source), the energies of levels 1 and 2 experience
inertial transformations given by formula (4.4). Therefore, to check our the-
ory for consistency, we would like to prove that the Doppler shift calculated
with this formula is the same as that obtained in subsection 5.3.4.
Suppose that the compound system has two bound states characterized by
mass eigenvalues m
1
and m
2
> m
1
(see Fig. 6.2). Suppose also that initially
the system is in the excited state with mass m
2
, total momentum p
2
and
energy E
2
=
_
m
2
2
c
4
+ p
2
2
c
2
. In the nal state we have the same system with a
lower mass m
1
, dierent total momentum p
1
and energy E
1
=
_
m
2
1
c
4
+ p
2
1
c
2
.
In addition, there is a photon with momentum k and energy ck. From the
momentum and energy conservation laws we can write
p
2
= p
1
+k
E
2
= E
1
+ ck
_
m
2
2
c
4
+ p
2
2
c
2
=
_
m
2
1
c
4
+ p
2
1
c
2
+ ck
=
_
m
2
1
c
4
+ (p
2
k)
2
c
2
+ ck
Taking squares of both sides of this equality we obtain
k
_
m
2
1
c
2
+ (p
2
k)
2
=
1
2
2
c
2
+ p
2
k cos k
2
22
The transition energy E is actually not well-dened, because the excited state 2 is
not a stationary state. (See section 13.1.) Therefore our discussion in this subsection is
valid only approximately for long-living states 2, for which the uncertainty of energy can
be neglected.
196 CHAPTER 6. INTERACTION
PP
xx
cc
EE
00
mm
22
mm
11
(p
2 2
,E
2 2
))
(p
11
,E
1 1
))
BB
AA
kk
Figure 6.2: Energy level diagram for a bound system with the ground state
of mass m
1
and the excited state of mass m
2
. If the system is at rest, its
excited state is represented by point A. Note that the energy of emitted
photons (arrows) is less than (m
2
m
1
)c
2
. A moving excited state with
momentum p
2
is represented by point B. The energies and momenta k of
emitted photons depend on the angle between k and p
2
.
6.4. BOUND STATES AND TIME EVOLUTION 197
where
2
m
2
2
m
2
1
and is the angle between vectors p
2
and k.
23
Taking
squares of both sides again we obtain a quadratic equation
k
2
(m
2
2
c
2
+ p
2
2
p
2
2
cos
2
) k
2
c
2
p
2
cos
1
4
4
c
4
= 0
with the solution
24
k =
2
c
2
2m
2
2
c
2
+ 2p
2
2
sin
2
_
p
2
cos +
_
m
2
2
c
2
+ p
2
2
_
Introducing the rapidity of the initial state, we obtain p
2
= m
2
c sinh ,
_
m
2
2
c
2
+ p
2
2
= m
2
c cosh and
k =
2
c(sinh cos + cosh )
2m
2
(cosh
2
sinh
2
cos
2
)
=
2
c
2m
2
cosh (1
v
c
cos )
This formula gives the energy of the photon emitted by a system moving
with the speed v = c tanh
E(, ) ck =
E(0)
cosh (1
v
c
cos )
where
E(0) =
2
c
2
2m
2
is the energy of the photon emitted by a source at rest. This is in agreement
with our earlier result (5.68).
23
Note also that vector k points from the light emitting system to the observer, so the
angle can be interpreted as the angle between the velocity of the source and the line of
sight, which is equivalent to the denition of in subsection 5.3.4.
24
Only positive sign of the square root leads to the physical solution with positive k
198 CHAPTER 6. INTERACTION
6.4.3 Time evolution
In addition to stationary energy spectra discussed above, we are often inter-
ested in the time evolution of a compound system. This includes reactions,
scattering, decays, etc. As we discussed in subsection 5.2.4, in quantum the-
ory the time evolution of states from (earlier) time t
to (later) time t is
described by the time evolution operator
U(t t
) = e
H(tt
)
(6.76)
This operator has the following useful properties
U(t t
) = e
H(tt
1
)
e
H(t
1
t
)
= U(t t
1
)U(t
1
t
) (6.77)
for any t, t
, t
1
and
U(t t
) = U
1
(t
t) (6.78)
In the Schr odinger picture, the time evolution of a state vector is given
by (5.48)
[(t) = U(t t
)[(t
) = e
H(tt
)
[(t
) (6.79)
[(t) is also a solution of the time dependent Schr odinger equation
i
d
dt
[(t) = i
d
dt
e
H(tt
)
[(t
) = He
H(tt
)
[(t
)
= H[(t) (6.80)
In spite of simple appearance of formula (6.79), the evaluation of the expo-
nents of the Hamilton operator is an extremely dicult task. In rare cases
when all eigenvalues E
n
and eigenvectors [
n
of the Hamiltonian are known
H[
n
= E
n
[
n
6.4. BOUND STATES AND TIME EVOLUTION 199
the initial state can be represented as a sum (and/or integral) of basis eigen-
vectors
[(0) =
n
C
n
[
n
and the time evolution can be calculated as
[(t) = e
Ht
[(0) = e
Ht
n
C
n
[
n
=
n
C
n
e
Ent
[
n
(6.81)
There is another useful formula for the state vectors time evolution in a
theory with Hamiltonian H = H
0
+ V . Denoting
V (t) = e
i
H
0
(tt
0
)
V e
H
0
(tt
0
)
it is easy to verify that the time-dependent state vector
25
[(t) = e
H
0
(tt
0
)
_
1
i
_
t
t
0
V (t
)dt
2
_
t
t
0
V (t
)dt
_
t
t
0
V (t
)dt
+ . . .
_
[(t
0
)
(6.82)
satises the Schr odinger equation (6.80) with the additional condition that
at t = t
0
the solution coincides with the given initial state [(t
0
). Indeed
i
d
dt
[(t)
= i
d
dt
e
H
0
(tt
0
)
_
1
i
_
t
t
0
V (t
)dt
2
_
t
t
0
V (t
)dt
_
t
t
0
V (t
)dt
+ . . .
_
[(t
0
)
= H
0
e
H
0
(tt
0
)
_
1
i
_
t
t
0
V (t
)dt
2
_
t
t
0
V (t
)dt
_
t
t
0
V (t
)dt
+ . . .
_
[(t
0
)
+e
H
0
(tt
0
)
V (t)
_
1
i
_
t
t
0
V (t
)dt
+ . . .
_
[(t
0
)
= (H
0
+ V )[(t)
25
Note that time integration variables satisfy inequalities t t
. . . t
0
.
200 CHAPTER 6. INTERACTION
Formula (6.82) will be found useful in our discussion of scattering in subsec-
tion 7.1.2.
Unfortunately, the above methods for calculating the time evolution of
quantum systems have very limited practical value: The full spectrum of
eigenvalues and eigenvectors of the interacting Hamiltonian H
26
can be found
only for very simple models. The convergence of the perturbative expansion
(6.82) is usually rather poor. So, the time evolution is a challenging task
for quantum mechanics. There are, however, two areas where we can make
further progress in solving this problem. First, in most circumstances, quan-
tum eects are too small to be observable. So, it is important to understand
how solutions of the Schr odinger equation correspond to classical trajecto-
ries of particles that we see in everyday life. The classical limit of quantum
mechanics will be tackled in section 6.5. Second, there is an important class
of scattering experiments, which do not require a detailed description of the
time evolution of quantum states. A powerful formalism of scattering theory
can be applied, as discussed in chapter 7.
6.5 Classical Hamiltonian dynamics
In section 1.5.2 we have established that distributive (classical) propositional
systems are particular cases of orthomodular (quantum) propositional sys-
tems. Therefore, we may expect that quantum mechanics includes classical
mechanics as a particular case. However, this is not obvious how exactly
the phase space of classical mechanics is linked to the Hilbert space of quan-
tum mechanics. We would like to analyze this link in the present section.
For simplicity, we will use as an example a system of spinless particles with
non-zero masses m
i
> 0. For classical treatment of massless particles, e.g.,
photons, see discussion in subsection 14.6.2.
6.5.1 Quasiclassical states
In the macroscopic world we do not meet localized eigenvectors [r of the
position operator. According to equation (5.37), such states have innite un-
certainty of momentum which is rather unusual. Similarly, we do not meet
states with sharply dened momentum. Such states are delocalized over large
26
which are required for formula (6.81)
6.5. CLASSICAL HAMILTONIAN DYNAMICS 201
distances (5.39). The reason why it is unusual see such states
27
is not well
understood yet. The most plausible hypothesis is that eigenstates of the
position or eigenstates of the momentum are susceptible to small perturba-
tions (e.g., due to temperature or external radiation) and rapidly transform
to more robust wave packets or quasiclassical states in which both position
and momentum have good, but not perfect localization.
So, when discussing the classical limit of quantum mechanics, we will
not consider general states allowed by quantum mechanics. We will limit
our attention only to the class of particle states [
r
0
,p
0
that we will call
quasiclassical. Wave functions of these states are assumed to be well-localized
around a point r
0
in the position space and also well-localized around a point
p
0
in the momentum space. Without loss of generality such wave functions
in the position representation can be written as
r
0
,p
0
(r) r[
r
0
,p
0
= (r r
0
)e
i
e
i
p
0
(rr
0
)
(6.83)
where (r r
0
) is a real smooth (non-oscillating) function with a maximum
near the expectation value of position r
0
and is a real phase.
28
The last
factor in (6.83) is needed to ensure that the expectation value of momentum
is p
0
.
29
As we will see later, in order to discuss the classical limit of quantum
mechanics the exact choice of the function (r r
0
) is not important. For
example, it is convenient to choose it in the form of a Gaussian
r
0
,p
0
(r) = Ne
(rr
0
)
2
/d
2
e
i
p
0
r
(6.84)
where = 0, parameter d controls the degree of localization and N is a
coecient required for the proper normalization
_
dr[
r
0
,p
0
(r)[
2
= 1
27
Spatially delocalized states of particles play a role in such low-temperature eects as
superconductivity and superuidity.
28
such that e
i
0,0
(r) = Ne
r
2
/d
2
(6.85)
and its counterpart in the momentum space is
30
0,0
(p) = (2)
3/2
N
_
dre
r
2
/d
2
e
pr
= (2)
3/2
Nd
3
e
p
2
d
2
/(4
2
)
(6.86)
The product of the uncertainties of the momentum-space (p
2
d
) and
position-space (r d) wave functions is
rp 2 (6.87)
This is an example of the Heisenberg uncertainty relation, which tells us
that for all quantum states the above uncertainties must satisfy the famous
inequality
rp /2 (6.88)
30
Here we used equations (5.41) and (B.13).
6.5. CLASSICAL HAMILTONIAN DYNAMICS 203
6.5.3 Spreading of quasiclassical wave packets
Suppose that at time t = 0 the particle was prepared in the state with well-
localized wave function (6.85), i.e., the uncertainty of position r d is
small. The corresponding time-dependent wave function in the momentum
representation is
(p, t) = e
Ht
0,0
(p, 0)
=
Nd
3
(2)
3/2
e
p
2
d
2
/(4
2
)
e
it
m
2
c
4
+p
2
c
2
Returning back to the position representation we obtain the time-dependent
wave function
31
(r, t) =
Nd
3
(4
2
)
3/2
_
dpe
p
2
d
2
/(4
2
)
e
i
pr
e
it
m
2
c
4
+q
2
c
2
Nd
3
(4
2
)
3/2
e
mc
2
t
_
dpexp
_
p
2
_
d
2
4
2
+
it
2m
_
+
i
pr
_
= N
_
d
2
m
d
2
m + 2it
_
3/2
e
mc
2
t
exp
_
mr
2
d
2
m+ 2it
_
and the probability density
(r, t) = [(r, t)[
2
= [N[
2
_
d
4
m
2
d
4
m
2
+ 4
2
t
2
_
3/2
exp
_
2r
2
d
2
m
2
d
4
m
2
+ 4
2
t
2
_
The size of the wave packet at large times t is easily found as
r(t)
_
d
4
m
2
+ 4
2
t
2
d
2
m
2
2t
dm
31
Due to the factor e
p
2
d
2
/(4
2
)
, only small values of momentum contribute to the in-
tegral and we can use the non-relativistic approximation
_
m
2
c
4
+p
2
c
2
mc
2
+
p
2
2m
and
equation (B.13).
204 CHAPTER 6. INTERACTION
So, the position-space wave packet is spreading out and the speed of spread-
ing v
s
is directly proportional to the uncertainty of velocity in the initially
prepared state
32
v
s
2
dm
p
m
(6.89)
One can verify that at large times this speed does not depend on the shape
of the initial wave packet. The important parameters are the size d of this
wave packet and the particles mass m.
A simple estimate demonstrates that for macroscopic objects this spread-
ing phenomenon can be safely neglected. For example, for a particle of mass
m = 1 mg and the initial position uncertainty of d = 1 micron, the time
needed for the wave function to spread to 1 cm is more than 10
11
years.
Therefore, for quasiclassical states of macroscopic particles with suciently
high masses, their positions and momenta are well dened at all times and
their time evolution can be described by a classical trajectory. So, in these
conditions one can safely replace quantum mechanics with its classical coun-
terpart.
6.5.4 Phase space
In subsection 6.5.1 we have established that quasiclassical wave packets have
general form (6.83). If resolution of measuring instruments is poor, then
the shape of the envelope function (r r
0
) cannot be discerned.
33
All
quantum states (6.83) with dierent shapes of the function (r r
0
) can
now be treated as the same classical state. So, each classical state is fully
characterized by three parameters: the average position of the packet r
0
,
the average momentum p
0
and the phase . These states are approximate
eigenstates of both position and momentum operators simultaneously:
32
Here we used equality (6.87).
33
This shape has a negligible eect on the time evolution of the wave packet as a whole.
It controls only the packets spreading, which is ignored in the classical limit anyway.
Apparently, the quality of the classical approximation depends on the value of the Planck
constant (4.1). All quantities proportional to positive powers of are ignored in the
classical limit. In most circumstances, the resolution of measuring devices is much poorer
than the quantum of action [KvB], so the classical picture of the world works extremely
well.
6.5. CLASSICAL HAMILTONIAN DYNAMICS 205
R[
r
0
,p
0
r
0
[
r
0
,p
0
(6.90)
P[
r
0
,p
0
p
0
[
r
0
,p
0
(6.91)
All such states can be represented by one point (r
0
, p
0
) in a 6-dimensional
manifold R
6
with coordinates r
x
, r
y
, r
z
, p
x
, p
y
, p
z
.
34
This is the one-particle
phase space that was discussed from the logico-probabilistic point of view in
subsection 1.4.4. Thus we have established a series of approximations that
allow us to represent quantum states in the familiar classical phase space
notation.
We can continue this line of reasoning and translate other quantum no-
tions to the classical language as well. For example, we know that any
1-particle quantum observable F can be expressed as a function of the par-
ticle position r, momentum p and mass M.
35
The eigenvalue of M is just a
constant. Therefore, in the classical phase space picture, all observables (the
energy, angular momentum, velocity, etc.) are represented as real functions
f(p, r) on the phase space.
For example, consider logical propositions, which form a special class of
observables, whose spectrum consists of only two points 0 and 1. In classical
theory, to dene a proposition one selects a subset T in the phase space and
the characteristic function
36
of that subset. Let us consider two examples
of propositions in R
6
. The proposition R = position of the particle is r
0
[f, g] = ik
1
+ i
2
k
2
+ i
3
k
3
. . .
where k
i
are Hermitian operators. From equation (6.95) it is clear that the
Poisson bracket is equal to the coecient of the dominant term of the rst
order in
[f, g]
P
= k
1
As a consequence, the classical Poisson bracket [f, g]
P
is much easier to calcu-
late than the full quantum commutator [f, g]. The following theorem demon-
strates that calculation of the Poisson bracket can be reduced to simple dif-
ferentiation.
208 CHAPTER 6. INTERACTION
Theorem 6.4 If f(R, P) and g(R, P) are two observables of a massive spin-
less particle, then
37
[f(R, P), g(R, P)]
P
=
f
R
g
P
f
P
g
R
(6.96)
Proof. Consider for simplicity the one-dimensional case (the 3D proof is
similar) in which the desired result (6.96) becomes
lim
0
i
, m
, q
, s
, where n
n, m
m,
q
q, s
[R
n
P
m
, R
q+1
P
s
]
= lim
0
i
[R
n
P
m
, R]R
q
P
s
lim
0
i
R[R
n
P
m
, R
q
P
s
]
= [R
n
P
m
, R]
P
R
q
P
s
+ R[R
n
P
m
, R
q
P
s
]
P
= mR
n+q
P
m+s1
+ (ns mq)R
n+q
P
m+s1
= (ns mq m)R
n+q
P
m+s1
Therefore, by induction, equation (6.97) holds for all values of n, m, q, s 0
and for all smooth functions f(R, P) and g(R, P).
Since Poisson brackets are obtained from commutators (6.95), all prop-
erties of commutators from Appendix E.2 remain valid for Poisson brackets
as well.
For a concrete example, let us apply the above formalism of Poisson
brackets to the time evolution. We can use formulas (6.94) and (6.96) in the
case when F is either position or momentum and obtain
38
38
Here we use equation (4.53) and a similar formula [P
x
, f(R
x
)] = if(R
x
)/R
x
.
210 CHAPTER 6. INTERACTION
dP(t)
dt
= [H(R, P), P]
P
=
H(R, P)
R
(6.99)
dR(t)
dt
= [H(R, P), R]
P
=
H(R, P)
P
(6.100)
where one recognizes the classical Hamiltons equations of motion. This
means that trajectories of centers of quasiclassical wave packets are exactly
the same as those predicted by classical Hamiltonian mechanics.
6.5.6 Time evolution of wave packets
In this subsection we will investigate further the connection between the time
evolution of quantum wave functions and classical particle trajectories. In
particular, we will pay attention to the time dependence of the phase factor
that was ignored in the preceding discussion.
Earlier in this section we have established that in many cases the spread-
ing of a quasiclassical wave packet can be ignored and that the center of the
packet moves along classical trajectory (r
0
(t), p
0
(t)). This means that the
shape of the wave packet is described by the function (r r
0
(t)), which
remains localized around the classical path r = r
0
(t). This also means that
the exponential r-dependent factor in (6.83) is approximately exp(
i
p
0
(t)r).
For generality, we also need to assume that the phase (t) is time-dependent
too. Then the time-dependent quasiclassical wave packet is described by the
following ansatz
(r, t) = (r r
0
(t)) exp
_
i
A(t)
_
(6.101)
A(t) p
0
(t)(r r
0
(t)) + (t) (6.102)
where r
0
(t), p
0
(t), (t) are yet undetermined numerical functions. In order
to nd these functions we insert (6.101) - (6.102) in the Schr odinger equation
(5.50)
i
(r, t)
t
+
2
2m
2
(r, t)
r
2
V (r)(r, t) = 0 (6.103)
6.5. CLASSICAL HAMILTONIAN DYNAMICS 211
which is valid for the position-space wave function (r, t) of a single particle
moving in an external potential V (r).
39
Omitting for brevity time arguments
and denoting time derivatives by dots we can write
i
(r)
t
=
_
i
_
r
r
0
_
( p
0
r) + ( p
0
r
0
) + (p
0
r
0
)
_
exp
_
i
A
_
2
2m
2
(r)
r
2
=
2
2m
r
(r)
r
=
2
2m
r
_
r
exp
_
i
A
_
+
i
p
0
exp
_
i
A
__
=
2
2m
_
r
2
+
2i
r
p
0
_
p
2
0
2
_
exp
_
i
A
_
V (r)(r) =
_
V (r
0
)
V (r)
r
r=r
0
(r r
0
)
_
exp
_
i
A
_
Thus, there are three kinds of terms on the left hand side of (6.103): those
proportional to
0
,
1
and
2
. They must vanish independently. The
2
-
dependent terms are too small; they are beyond the accuracy of the quasi-
classical approximation and can be ignored. The terms that are rst order
in result in
r
0
=
p
0
m
which is the usual relationship between velocity and momentum for momentum-
independent potentials.
40
0
terms lead us to equation
0 = ( p
0
r) + ( p
0
r
0
) + (p
0
r
0
)
p
2
0
2m
V (r
0
)
V (r)
r
r=r
0
(r r
0
)
39
Here we make several assumptions and approximations to simplify our calculations.
First, we consider a particle moving in a xed potential. This is not an isolated system,
which is a subject of most discussions in the book. Nevertheless, this is still a good
approximation in the case when the object creating the potential V (r) is heavy, so that its
dynamics does not depend on the dynamics of the considered (light) particle. Second, the
position-dependent potential V (r) does not depend on the particles momentum. Third, we
are working in the non-relativistic approximation. The quality of all these approximations
will become clear in section 12, where we will construct a realistic interaction potential V
between charged particles.
40
This is also the 2nd Hamiltons equation (6.100).
212 CHAPTER 6. INTERACTION
(6.104)
The function p
0
(t) can be determined from the rst Hamiltons equation
(6.99)
p
0
=
V (r)
r
r=r
0
So, we can rewrite (6.104) as an equation for the last undetermined (phase)
function (t)
t
=
p
2
0
(t)
2m
V (r
0
(t))
The solution of this equation for a particle propagating in the time interval
[t
0
, t] is given by the so-called action integral
41
(t) = (t
0
) +
t
_
t
0
dt
_
p
2
0
(t
)
2m
V (r
0
(t
))
_
(6.105)
From the above discussion we conclude that the center of a quasiclassical
wave packet moves along a trajectory determined by Hamiltons equations
of motion (6.99) - (6.100). This is not dierent from classical mechanics. In
addition, there is a genuine quantum eect: the change of the overall phase
of the wave packet according to equation (6.105).
This phase change explains the double-slit (or double-hole) interference
eect discussed in section 1.1. Suppose that a monochromatic source
42
emits
electrons, which pass through two slits and form an image on the scintillating
screen as shown in Fig. 6.3. The electron wave packets can reach the point
C on the screen by two alternative ways: either through slit A or through slit
B. Both kinds of packets contribute to the wave function at point C. Their
complex phase factors exp(
i
) and U
0
( ),
respectively.
3
Then we can write the time evolution operator from the innite
past to the innite future
4
U()
U
0
()U(
)U
0
(
)
= U
0
()U
0
( 0)[U
0
(0 )U(
)U
0
(
0)]U
0
(0
)U
0
(
)
= U
0
(0)S
,
U
0
(0 ) (7.1)
where
S
,
U
0
(0 )U(
)U
0
(
0) (7.2)
Equation (7.1) means that a simplied description of the time evolution in
scattering events is possible in which the evolution is free at all times except
sudden change at t = 0 described by the unitary operator S
,
: Approxima-
tion (7.1) becomes more accurate if we increase the time interval [
, ] during
which the exact time evolution is taken into account, i.e.,
, .
5
2
The short interaction time can be guaranteed if three conditions are met: First, the
interaction between particles is short-range or, more generally, cluster separable. Second,
states of particles are describable by localized wave packets, such as those in subsection
6.5.1. Third, particles velocities (or momenta) are suciently high.
3
Here we denoted U
0
(t t
) exp(
i
H
0
(t t
,
S
,
= lim
,
U
0
(0 )U(
)U
0
(
0)
= lim
,
e
i
H
0
H(
)
e
H
0
(7.4)
= lim
S()
S() lim
e
i
H
0
H(
)
e
H
0
(7.5)
A better understanding of how scattering theory describes time evolution
can be obtained from Fig. 7.1. In this gure we plot the state of the scattering
system (represented abstractly as a point on the vertical axis) as a function
of time (the horizontal axis). The exact evolution of the state is governed
by the full time evolution operator U and is shown by the thick line A
218 CHAPTER 7. SCATTERING
D. In asymptotic regions (when t is large negative or large positive) the
interaction between parts of the scattering system is weak and the exact time
evolution can be well approximated by the free time evolution (described by
the operator U
0
and shown in the gure by two thin straight lines with
arrows, one for large positive times C D and one for large negative times
A B). The thick line (the exact interacting time evolution) asymptotically
approaches thin lines (free time evolutions) in the remote past (around A)
and in the remote future (around D). The past and future free evolutions
can be extrapolated to time t = 0 and there is a gap B C between these
extrapolated states. The S-operator (which connects states B and C as
shown by the dashed arrow) is designed to bridge this gap. This operator
provides a mapping between free states extrapolated to time t = 0. Thus,
in scattering theory the exact time evolution A D is approximated by
three steps: the system rst evolves freely until time t = 0, i.e., from A to
B. Then there is a sudden jump B C represented by the S-operator.
Finally, the time evolution is free again C D. As seen from the gure,
this description of the scattering process is perfectly exact, as long as we are
interested only in the mapping from asymptotic states in the remote past A
to asymptotic states in the remote future D. However, it is also clear that
scattering theory does not provide a good description of the time evolution
in the interacting region around t = 0, because in the scattering operator S
the information about particle interactions enters integrated over the innite
time interval t (, ). In order to describe the time evolution in the
interaction region (t 0) the S-matrix approach is not suitable. The full
interacting time evolution operator U is needed for this purpose.
In applications we are mostly interested in matrix elements of the S-
operator
S
if
= f[S[i (7.6)
where [i is a state of non-interacting initial particles and [f is a state of
non-interacting nal particles. Such matrix elements are called the S-matrix.
Formulas relating the S-matrix to observable quantities, such as scattering
cross-sections, can be found in any textbook on scattering theory.
An important property of the S-operator is its Poincare invariance,
i.e., zero commutators with generators of the non-interacting representation
of the Poincare group [Wei95, Kaz71]
7.1. SCATTERING OPERATORS 219
[S, H
0
] = [S, P
0
] = [S, J
0
] = [S, K
0
] = 0 (7.7)
The vanishing commutator [S, H
0
] = 0 implies that in (7.3) one can change
places of U
0
and S, so that the interacting time evolution operator can be
written as the full free time evolution operator times the S-operator
U() = SU
0
() = U
0
()S (7.8)
7.1.2 S-operator in perturbation theory
There are various techniques available for calculations of the S-operator.
Currently, the perturbation theory is the most powerful and eective one. To
derive the perturbation expansion for the S-operator, rst note that operator
S(t) in (7.5) satises equation
d
dt
S(t)
=
d
dt
lim
t
e
i
H
0
t
e
H(tt
)
e
H
0
t
= lim
t
_
e
i
H
0
t
_
i
H
0
_
e
H(tt
)
e
H
0
t
+ e
i
H
0
t
_
H
_
e
H(tt
)
e
H
0
t
_
=
i
lim
t
e
i
H
0
t
(H H
0
)e
H(tt
)
e
H
0
t
=
i
lim
t
e
i
H
0
t
V e
H(tt
)
e
H
0
t
=
i
lim
t
e
i
H
0
t
V e
H
0
t
e
i
H
0
t
e
H(tt
)
e
H
0
t
=
i
lim
t
V (t)e
i
H
0
t
e
H(tt
)
e
H
0
t
=
i
V (t)S(t) (7.9)
where we denoted
6
6
Note that the t-dependence of V (t) does not mean that we are considering time-
220 CHAPTER 7. SCATTERING
V (t) = e
i
H
0
t
V e
H
0
t
(7.10)
One can directly check that solution of equation (7.9) with the natural
initial condition S() = 1 is given by the old-fashioned perturbation
expansion
S(t) = 1
i
_
t
V (t
) dt
2
_
t
V (t
) dt
_
t
V (t
) dt
+ . . . ,
Therefore, the S-operator can be calculated by putting t = +as the upper
limit of t-integrals
S = 1
i
_
+
V (t) dt
1
2
_
+
V (t) dt
_
t
V (t
) dt
+ . . . (7.11)
This formula can be also derived from equation (6.82) in the case when the
initial time t
0
= is in the remote past and the nal time t = + is in
the distant future
[(+)
= lim
t+
e
H
0
(tt
0
)
_
1
i
e
i
H
0
(t
t
0
)
V e
H
0
(t
t
0
)
dt
2
_
e
i
H
0
(t
t
0
)
V e
H
0
(t
t
0
)
dt
_
t
e
i
H
0
(t
t
0
)
V e
H
0
(t
t
0
)
dt
+ . . .
_
[()
Next we shift integration variables t
t
0
t
and t
t
0
t
, so that
7
dependent interactions. The argument t has very little to do with actual time dependence
of operators in the Heisenberg representation, which must be generated by the full inter-
acting Hamiltonian H and not by the free Hamiltonian H
0
as in equation (7.10). In such
cases we will use the term t-dependence instead of time dependence.
7
Note that the trick of adiabatic switching described in the next subsection (and
tacitly assumed to be working here) allows us to keep unchanged the innite limits (
and ) of integrals.
7.1. SCATTERING OPERATORS 221
[(+)
= lim
t+
e
H
0
(tt
0
)
_
1
i
e
i
H
0
t
V e
H
0
t
dt
2
_
e
i
H
0
t
V e
H
0
t
dt
_
t
e
i
H
0
t
V e
H
0
t
dt
+ . . .
_
[()
Comparing this formula with representation (7.8) of the time evolution op-
erator we conclude that the S-factor is the same as (7.11).
We will avoid discussion of (non-trivial) convergence properties of the
series on the right hand side of equation (7.11). Throughout this book we
will tacitly assume that all innite perturbation series do converge.
We will often use the following convenient symbols for t-integrals
Y (t)
i
_
t
Y (t
)dt
(7.12)
Y (t)
..
i
_
+
Y (t
)dt
= Y () (7.13)
In this notation the perturbation expansion of the S-operator (7.11) can be
written compactly as
S = 1 + (t)
..
(7.14)
(t) = V (t) + V (t)V (t
) + V (t)V (t
)V (t
) + V (t)V (t
)V (t
)V (t
) + . . .
(7.15)
Formula (7.11) is not the only way to write the perturbation expansion for
the S-operator and, perhaps, not the most convenient one. In most books on
quantum eld theory the covariant FeynmanDyson perturbation expansion
[Wei95] is used, which involves a time ordering of operators in the integrands
8
8
When applied to a product of several t-dependent bosonic operators, the time ordering
symbol T changes the order of operators in such a way that the t label increases from right
to left, e.g.
222 CHAPTER 7. SCATTERING
S = 1
i
+
_
dt
1
V (t
1
)
1
2!
2
+
_
dt
1
dt
2
T[V (t
1
)V (t
2
)]
+
i
3!
3
+
_
dt
1
dt
2
dt
3
T[V (t
1
)V (t
2
)V (t
3
)]
+
1
4!
4
+
_
dt
1
dt
2
dt
3
dt
4
T[V (t
1
)V (t
2
)V (t
3
)V (t
4
)] + . . . (7.17)
For our purposes in chapter 11 we found more useful yet another equivalent
perturbative expression suggested by Magnus [Mag54, PL66, BCOR]
S = exp(F(t)
..
) (7.18)
where Hermitian operator F(t) will be referred to as the scattering phase
operator. It is represented as a series of multiple commutators with t-integrals
F(t) = V (t)
1
2
[V (t
), V (t)] +
1
6
[V (t
), [V (t
), V (t)]]
+
1
6
[[V (t
), V (t
)], V (t)]
1
12
[V (t
), [[V (t
), V (t
)], V (t)]]
1
12
[[V (t
), [V (t
), V (t
)]], V (t)]
1
12
[[V (t
), V (t
)], [V (t
), V (t)]] + . . . (7.19)
One important advantage of this representation is that expression (7.18) for
the S-operator is manifestly unitary in each perturbation order. The three
T[A(t
1
)B(t
2
)] =
_
A(t
1
)B(t
2
), if t
1
> t
2
B(t
2
)A(t
1
), if t
1
< t
2
(7.16)
For the time ordered product of fermionic (anticommuting) operators see equation (J.86).
7.1. SCATTERING OPERATORS 223
perturbative expansions (old-fashioned, FeynmanDyson and Magnus) are
equivalent in the sense that they converge to the same result if all perturba-
tion orders are added up to innity. However, in each xed order n the three
types of terms can be dierent.
9
7.1.3 Adiabatic switching of interaction
In formulas for scattering operators (7.15) and (7.19) we meet t-integrals
V (t). A straightforward calculation of such integrals gives rather discourag-
ing result. Let us introduce a complete basis [n of eigenvectors of the free
Hamiltonian
H
0
[n = E
n
[n (7.20)
n
[nn[ = 1 (7.21)
and calculate matrix elements of V (t) in this basis
n[V (t)[m
i
t
_
n[e
i
H
0
t
V e
i
H
0
t
[mdt
=
i
V
nm
t
_
e
i
(EnEm)t
dt
= V
nm
_
e
i
(EnEm)t
E
n
E
m
e
i
(EnEm)()
E
n
E
m
_
(7.22)
What shall we do with the meaningless term containing () on the right
hand side?
This term can be made harmless if we take into account an important
fact that the S-operator cannot be applied to all states in the Hilbert space.
It can be applied only to scattering states [ in which free particles are
far from each other in asymptotic limits t . Then the time evolu-
tion of these states coincides with the free evolution in the distant past and
distant future
10
and the full time evolution in the innite time interval is
given exactly by formula (7.8). Certainly, the above assumptions cannot be
applied to all states in the Hilbert space. For example, the time evolution
9
the dierence being of the order n + 1 or higher
10
Of course, interaction V must be cluster separable to ensure that.
224 CHAPTER 7. SCATTERING
of bound states of the interacting Hamiltonian H, does not resemble the
free evolution at any time. It appears that if we exclude such bound states
from consideration and limit our application of the S-operator and t-integrals
(7.22) only to scattering states consisting of one-particle wave packets with
good localization in both position and momentum spaces, then no ambiguity
arises.
For scattering states the interaction operator is eectively zero in asymp-
totic regimes, so we can write
lim
t
V e
H
0
t
[ = 0
lim
t
V (t)[ = 0 (7.23)
One approach to the exact treatment of scattering is to explicitly consider
only wave packets described above. Then the cluster separability of V will
ensure the correct asymptotic behavior of the colliding wave packets and the
validity of equation (7.23). However, such an approach is rather complicated
and we would like to stay away from working with wave packets.
There is another way to achieve the same goal by using a trick called the
adiabatic switching of interaction. The trick is to add the property (7.23)
to the interaction operator by hand. To do that we multiply V (t) by a
numerical function of t which slowly grows from the value of zero at t =
to the value of one at t 0 (turning the interaction on) and then slowly
decreases back to zero at t = (turning the interaction o). For example,
it is convenient to choose
V (t) = e
i
H
0
t
V e
H
0
t
e
|t|
(7.24)
If the parameter is small and positive, such a modication would not aect
the movement of wave packets and the S-matrix. At the end of calculations
we will take the limit +0. Then the t-integral (7.22) takes the form
n[V (t)[m V
nm
_
e
i
(EnEm)t|t|
E
n
E
m
e
i
(EnEm)()()
E
n
E
m
_
V
nm
e
i
(EnEm)t
E
n
E
m
(7.25)
so the embarrassing expression e
i
does not appear.
7.1. SCATTERING OPERATORS 225
7.1.4 T-matrix
In this subsection we will introduce the idea of T-matrix, which is often useful
in scattering calculations.
11
Let us now calculate matrix elements of the S-
operator (7.11) in the basis of eigenvectors of the free Hamiltonian (7.20) -
(7.21)
12
n[S[m
=
nm
n[e
i
H
0
t
V e
H
0
t
[mdt
2
_
n[e
i
H
0
t
V e
H
0
t
[kdt
_
t
k[e
i
H
0
t
V e
H
0
t
[mdt
+ . . .
=
nm
e
i
(EnEm)t
V
nm
dt
2
_
e
i
(EnE
k
)t
V
nk
dt
_
t
e
i
(E
k
Em)t
V
km
dt
+ . . .
=
nm
2i(E
n
E
m
)V
nm
+
i
e
i
(EnE
k
)t
dt
e
i
(E
k
Em)t
E
m
E
k
V
nk
V
km
+ . . .
=
nm
2i(E
n
E
m
)V
nm
+ 2i(E
n
E
m
)
1
E
m
E
k
V
nk
V
km
+ . . .
=
nm
2i(E
n
E
m
)V
nk
km
+
1
E
m
E
k
V
km
+
1
E
m
E
k
V
kl
1
E
m
E
l
V
lm
+ . . .
_
(7.26)
The matrix in the second term is called the T-matrix (or transition matrix).
T
nm
V
nk
_
km
+
1
E
m
E
k
V
km
+
1
E
m
E
k
V
kl
1
E
m
E
l
V
lm
+ . . .
_
= n[V
_
1 +
1
E
m
H
0
V +
1
E
m
H
0
V
1
E
m
H
0
V + . . .
_
[m
11
I am indebted to Cao Bin for numerous discussions which resulted in writing this
subsection.
12
Summation over indices k and l is implied. Formula (7.25) is used for t-integrals.
226 CHAPTER 7. SCATTERING
The series in the parentheses can be summed up using the standard formula
(1 x)
1
= 1 + x + x
2
+ . . .
T
nm
= n[V
1
1 (E
m
H
0
)
1
V
[m (7.27)
= n[V (E
m
H
0
)(E
m
H
0
V )
1
[m
= n[V (E
m
H
0
)(E
m
H)
1
[m (7.28)
The beauty of this result is that it provides a non-perturbative closed-form
expression for the S-operator and scattering amplitudes. Expression (7.27)
has been used in numerical scattering calculations [RMM74]. The major
computational diculty is associated with the inversion operation in (1
(E H
0
)
1
V ).
According to (7.26) and (7.28), the S-matrix can be represented as a
function of energy (E = E
m
= E
n
)
S
nm
(E) =
nm
2i(E
n
E
m
)T
nm
=
nm
2i(E
n
E
m
)n[V (E H
0
)(E H)
1
[m (7.29)
Let us analyze the structure of this expression in more detail. As we discussed
in subsection 7.1.3, the S-matrix is dened only for states that behave asymp-
totically as free states. For such states, the energy E is not lower than the
energy of separated reactants E
0
. In the center-of-mass reference frame this
threshold is the sum of rest energies of all N particles participating in the
scattering event
E
0
=
N
a=1
m
a
c
2
Therefore
E [E
0
, ) (7.30)
Let us pick a value E in this interval. Then T
nm
in (7.29) can be calculated
as a matrix of an energy-dependent T-operator T(E)
7.1. SCATTERING OPERATORS 227
T
nm
= n[V (E H
0
)(E H)
1
[m = n[T(E)[m
The product (E
n
E)T
nm
can be interpreted as a matrix, which is zero
everywhere except the diagonal sub-block corresponding to the eigenvalue E
of H
0
. If we denote P
E
the projection on this eigensubspace, then (E
n
E)T(E) = P
E
T(E)P
E
and the full S-operator (for all values of E) can be
written in a basis-independent form
S = 1 2i
E
P
E
V (E H
0
)(E H)
1
P
E
(7.31)
The corresponding S-matrix has a block-diagonal form, i.e., the matrix ele-
ment S
nm
is non-zero only if the indices n and m satisfy condition E
m
= E
n
.
This implies that the S-operator commutes with the free Hamiltonian H
0
.
13
Exact formula (7.31) is not very useful in practical scattering calculations.
However, it helps to derive some interesting results, such as the connection
between poles of the S-matrix and energies of bound states derived in the
next subsection.
7.1.5 S-matrix and bound states
There are good reasons to believe that the S-operator is an analytical function
of energy E. So, it is interesting to nd where the poles of this function are
located. If operator V is non-singular, then poles of S coincide with those
values of E at which operator (EH
0
)(EH)
1
in (7.31) is singular. In other
words, these are values E for which the denominator has zero eigenvalue.
Thus poles E
= E
Obviously, this is equivalent to the stationary Schr odinger equation for bound
states. This means that there exists a correspondence between poles of the S-
operator (or T-operator) and bound state energies E
= e
i
He
i
Then they yield the same scattering (and Hamiltonians H and H
are called
scattering equivalent) as long as condition
lim
t
e
i
H
0
t
e
H
0
t
= 0 (7.32)
is satised.
16
Indeed, in the limit +,
= lim
,
e
i
H
0
)
e
H
0
= lim
,
e
i
H
0
_
e
i
e
H(
)
e
i
_
e
H
0
= lim
,
_
e
i
H
0
e
i
e
H
0
_
e
i
H
0
H(
)
e
H
0
_
e
i
H
0
e
i
e
H
0
_
15
The two latter quantities are represented by positions of poles of the S-operator on
the complex energy plane, as discussed in the preceding subsection.
16
A rather general class of operators that satisfy this condition will be found in
Theorem 11.2 from subsection 11.2.3.
230 CHAPTER 7. SCATTERING
= lim
,
e
i
H
0
H(
)
e
H
0
= S (7.33)
Note that due to Lemma F.11, the energy spectra of two scattering equiv-
alent Hamiltonians H and H
[Q
0
S
0
]
1 +
_
1 + c
2
Q
2
0
H
0
= M
0
c
2
_
1 + c
2
Q
2
0
.
A point form interaction can be now introduced by modifying the mass op-
erator M
0
M provided that the following conditions are satised
17
[M, Q
0
] = [M, X
0
] = [M, S
0
] = 0
These conditions, in particular, guarantee that M is Lorentz invariant
[M, K
0
] = [M, J
0
] = 0.
This modication of the mass operator introduces interaction in generators
of the translation subgroup
P = Mc
2
Q
0
H = Mc
2
_
1 + c
2
Q
2
0
(7.34)
while Lorentz subgroup generators K
0
and J
0
remain interaction-free.
7.2.3 Scattering equivalence of forms of dynamics
The S-matrix equivalence of Hamiltonians established in subsection 7.2.1
remains valid even if the transformation e
i
changes the relativistic form of
dynamics [Sok75, SS78]. Here we would like to demonstrate this equivalence
on an example of Diracs point and instant forms of dynamics [Sok75]. We
will use denitions and notation from subsections 7.2.2 and 7.1.3.
17
Just as in subsection 6.3.2, these conditions can be satised by dening M = M
0
+L,
where L is a rotationally-invariant function of relative position and momentum operators
commuting with Q
0
and X
0
.
232 CHAPTER 7. SCATTERING
First we assume that a Bakamjians point form representation of the
Poincare group is given, which is built on operators
M ,= M
0
P = Q
0
Mc
2
J = J
0
R =
X
0
Mc
2
Then we introduce the unitary operator
=
0
1
where
0
= exp(i ln(M
0
c
2
)Z
0
)
= exp(i ln(Mc
2
)Z
0
)
Z
0
=
1
2
(Q
0
X
0
+X
0
Q
0
)
Our goal here is to demonstrate that the set of operators M
1
, P
1
,
J
1
and R
1
generates a representation of the Poincare group in the
Bakamjian-Thomas instant form of dynamics. Moreover, the S-operators
computed with the point-form Hamiltonian H =
M
2
c
4
+ P
2
c
2
and the
instant form Hamiltonian H
= H
1
are the same.
Let us denote
Q
0
(b) = e
ibZ
0
Q
0
e
ibZ
0
, b R
From the commutator
[Z
0
, Q
0
] = iQ
0
it follows that
7.2. SCATTERING EQUIVALENCE 233
d
db
Q
0
(b) = i[Z
0
, Q
0
] = Q
0
Q
0
(b) = e
b
Q
0
This formula remains valid even if b is a Hermitian operator commuting with
both Q
0
and X
0
. For example, if b = ln(M
0
c
2
), then
e
i ln(M
0
c
2
)Z
0
Q
0
e
i ln(M
0
c
2
)Z
0
= e
ln(M
0
c
2
)
Q
0
= M
1
0
c
2
Q
0
Similarly, one can prove
e
i ln(Mc
2
)Z
0
Q
0
e
i ln(Mc
2
)Z
0
= M
1
c
2
Q
0
e
i ln(M
0
c
2
)Z
0
X
0
e
i ln(M
0
c
2
)Z
0
= M
0
c
2
X
0
e
i ln(Mc
2
)Z
0
X
0
e
i ln(Mc
2
)Z
0
= Mc
2
X
0
which imply
P
1
= e
i ln(M
0
c
2
)Z
0
e
i ln(Mc
2
)Z
0
Q
0
Mc
2
e
i ln(Mc
2
)Z
0
e
i ln(M
0
c
2
)Z
0
= e
i ln(M
0
c
2
)Z
0
Q
0
e
i ln(M
0
c
2
)Z
0
= Q
0
M
0
c
2
= P
0
J
0
1
= J
0
R
1
= e
i ln(M
0
c
2
)Z
0
e
i ln(Mc
2
)Z
0
X
0
M
1
c
2
e
i ln(Mc
2
)Z
0
e
i ln(M
0
c
2
)Z
0
= e
i ln(M
0
c
2
)Z
0
X
0
e
i ln(M
0
c
2
)Z
0
= X
0
M
1
0
c
2
= R
0
From these formulas it is clear that the transformed dynamics corresponds
to the Bakamjian-Thomas instant form.
Let us now demonstrate that the scattering operator S computed with
the point form Hamiltonian H (7.34) is the same as S
= H
1
. Note that we can write equation
(7.4) as
S =
+
(H, H
0
)
(H, H
0
)
where operators
234 CHAPTER 7. SCATTERING
(H, H
0
) lim
t
e
i
H
0
t
e
Ht
are called Mller wave operators. Now we can use the Birman-Kato invariance
principle [Dol76] which states that
(H, H
0
) =
(f(H), f(H
0
)) where f is
any smooth function with positive derivative. Using the following connection
between the point form mass operator M and the instant form mass operator
M
M =
1
M =
1
1
M
=
1
1
0
M
1
=
1
0
M
0
we obtain
(H, H
0
)
_
Mc
2
_
1 + c
2
Q
2
0
, M
0
c
2
_
1 + c
2
Q
2
0
_
=
(Mc
2
, M
0
c
2
) =
(
1
0
M
0
c
2
, M
0
c
2
)
=
1
0
(M
c
2
, M
0
c
2
)
0
=
1
0
_
_
(M
)
2
c
4
+P
0
c
2
,
_
M
2
0
c
4
+P
0
c
2
_
0
=
1
0
(H
, H
0
)
0
S
=
+
(H
, H
0
)
(H
, H
0
)
=
0
+
(H, H
0
)
1
0
0
(H, H
0
)
1
0
=
0
+
(H, H
0
)
(H, H
0
)
1
0
=
0
S
1
0
but S commutes with free generators (7.7) and thus commutes with
0
, which
implies that S
, positrons e
+
, protons p
+
, antiprotons p
and
photons within, allegedly, the most successful physical theory quantum
electrodynamics (QED).
8.1 Annihilation and creation operators
In this section we are going to build the Fock space 1 of QED and introduce
creation and annihilation operators which provide a very convenient notation
for working with operators in 1.
8.1.1 Sectors with xed numbers of particles
The numbers of particles of any type are readily measured in experiments, so
we can introduce 5 new observables in our theory: the numbers of electrons
(N
el
), positrons (N
po
), protons (N
pr
), antiprotons (N
an
) and photons (N
ph
).
According to general rules of quantum mechanics, these observables must
be represented by ve Hermitian operators in the Hilbert (Fock) space 1.
Apparently, the allowed values (the spectrum) for the number of particles of
each type are non-negative integers (0,1,2,...). We assume that these observ-
ables can be measured simultaneously, therefore the corresponding operators
commute with each other and have common spectrum. So, the Fock space
1 separates into a direct sum of corresponding orthogonal eigensubspaces or
sectors 1(i, j, k, l, m) with i electrons, j positrons, k protons, l antiprotons
and m photons
1 =
ijklm=0
1(i, j, k, l, m) (8.1)
where
N
el
1(i, j, k, l, m) = i1(i, j, k, l, m)
N
po
1(i, j, k, l, m) = j1(i, j, k, l, m)
N
pr
1(i, j, k, l, m) = k1(i, j, k, l, m)
N
an
1(i, j, k, l, m) = l1(i, j, k, l, m)
N
ph
1(i, j, k, l, m) = m1(i, j, k, l, m)
8.1. ANNIHILATION AND CREATION OPERATORS 239
The one-dimensional subspace with no particles 1(0, 0, 0, 0, 0) is called
the vacuum subspace. The vacuum vector [0 is then dened as a vector in
this subspace, up to an insignicant phase factor. The one-particle sectors
are built using prescriptions from chapter 5. The subspaces 1(1, 0, 0, 0, 0)
and 1(0, 1, 0, 0, 0) correspond to one electron and one positron, respectively.
They are subspaces of unitary irreducible representations of the Poincare
group characterized by the mass m = 0.511 MeV/c
2
and spin 1/2 (see Table
5.1). The subspaces 1(0, 0, 1, 0, 0) and 1(0, 0, 0, 1, 0) correspond to one pro-
ton and one antiproton, respectively. They have mass M = 938.3 MeV/c
2
and spin 1/2. The subspace 1(0, 0, 0, 0, 1) correspond to one photon. It is
characterized by zero mass and it is a direct sum of two irreducible subspaces
with helicities 1 and -1.
1
Sectors with two or more particles are constructed as (anti)symmetrized
tensor products of one-particle sectors.
2
For example, if we denote 1
el
the
one-electron Hilbert space and 1
ph
the one-photon Hilbert space, then sectors
having only electrons and photons can be written as
1(0, 0, 0, 0, 0) = [0 (8.2)
1(1, 0, 0, 0, 0) = 1
el
(8.3)
1(0, 0, 0, 0, 1) = 1
ph
(8.4)
1(1, 0, 0, 0, 1) = 1
el
1
ph
(8.5)
1(2, 0, 0, 0, 0) = 1
el
asym
1
el
(8.6)
1(0, 0, 0, 0, 2) = 1
ph
sym
1
ph
(8.7)
1(1, 0, 0, 0, 2) = 1
el
(1
ph
sym
1
ph
) (8.8)
1(2, 0, 0, 0, 1) = 1
ph
(1
el
asym
1
el
) (8.9)
1(2, 0, 0, 0, 2) = (1
ph
sym
1
ph
) (1
el
asym
1
el
) (8.10)
. . .
In each sector of the Fock space we can dene observables of individual
particles, e.g., position momentum, spin, etc., as described in subsection
6.1.2.
For example, in each (massive) 1-particle subspace of the Fock space there
is a Newton-Wigner operator that describes position measurements on this
1
see subsection 5.3.3
2
See section 6.1. Note that electrons and protons are fermions, while photons are
bosons.
240 CHAPTER 8. THE FOCK SPACE
particle. In 2-particle sectors we can dene two dierent position operators
for each one of the two particles. In addition, we can also dene the center-
of-mass position operators for the 2-particle system in a usual way. Similar
position operators exist in each N-particle sector.
Then, in each sector we can select a basis of common eigenvectors of a
full set of mutually commuting one-particle observables. A general state [
in the Fock space may have components in all sectors.
3
Thus the number of
particles in [ may be not well-dened.
For future discussions it will be convenient to use the basis in which
momenta and z-components of the spin of massive particles (or helicity
of massless particles) are diagonal. For example, basis vectors in the two-
electron sector 1
el
asym
1
el
are denoted by [p
1
1
; p
2
2
. This allows us
to dene in each sector multi-particle wave functions in the momentum-spin
representation.
8.1.2 Non-interacting representation of the Poincare
group
The above construction provides us with the Hilbert (Fock) space 1 where
multiparticle states and observables of our theory reside and where a conve-
nient orthonormal basis set is dened. To complete the formalism we need to
build a realistic interacting representation of the Poincare group in 1. Let
us rst fulll an easier task and construct the non-interacting representation
U
0
g
of the Poincare group in the Fock space 1.
From subsection 6.2.1, we already know how to build a non-interacting
representation of the Poincare group in each individual sector of 1. This
can be done by making tensor products (with proper (anti)symmetrization)
of single-particle irreducible representations U
el
g
, U
ph
g
, etc. Then the non-
interacting representation of the Poincare group in the entire Fock space can
be constructed as a direct sum of such sector representations. In agreement
with the sector decomposition (8.2) - (8.10) we can write
U
0
g
= 1 U
el
g
U
ph
g
(U
el
g
U
ph
g
) (U
el
g
asym
U
el
g
) . . . (8.11)
Generators of this representation will be denoted as (H
0
, P
0
, J
0
, K
0
). In
3
Superselection rules forbid linear combinations of states with, e.g., dierent charges.
8.1. ANNIHILATION AND CREATION OPERATORS 241
each sector these generators are simply sums of one-particle generators.
4
As
usual, we assume that operators H
0
, P
0
and J
0
describe the total energy,
linear momentum and angular momentum of the non-interacting system,
respectively.
Here we immediately face a serious problem. For example, according to
(8.11), the free Hamiltonian can be represented as a direct sum of sector
components
H
0
= H
0
(0, 0, 0, 0, 0) H
0
(1, 0, 0, 0, 0) H
0
(0, 0, 0, 0, 1) H
0
(1, 0, 0, 0, 1) . . .
It is tempting to use notation from section 6.2 and express each sector Hamil-
tonian using observables of individual particles there: p
1
, p
2
, etc. For exam-
ple, in the sector 1(1, 0, 0, 0, 0), the free Hamiltonian is
H
0
(1, 0, 0, 0, 0) =
_
m
2
c
4
+ p
2
c
2
(8.12)
while in the sector 1(2, 0, 0, 0, 2) the Hamiltonian is
5
H
0
(2, 0, 0, 0, 2) = p
1
c + p
2
c +
_
m
2
c
4
+ p
2
3
c
2
+
_
m
2
c
4
+ p
2
4
c
2
(8.13)
Clearly, this notation is very cumbersome because it does not provide a
unique expression for the operator H
0
in the entire Fock space. Moreover,
it is not clear at all how one can use this notation to express operators
changing the number of particles, i.e., moving state vectors across sector
boundaries. We need to nd a better and simpler way to write operators in
the Fock space. This task is accomplished by introduction of annihilation
and creation operators in the rest of this section.
8.1.3 Creation and annihilation operators. Fermions
First, it is instructive to consider the case of the discrete spectrum of momen-
tum. This can be achieved by using the standard trick of putting the system
in a box or applying periodic boundary conditions. Then eigenvalues of the
4
For example, in each 2-particle sector equations (6.10) - (6.13) are valid.
5
Two photons are denoted by indices 1 and 2 and two electrons are denoted by indices
3 and 4
242 CHAPTER 8. THE FOCK SPACE
momentum operator form a discrete 3D lattice p
i
and the usual continuous
momentum spectrum can be obtained as a limit when the size of the box
tends to innity.
Let us examine the case of electrons. We dene the (linear) creation
operator a
p,
for the electron with momentum p and spin projection by its
action on basis vectors with n electrons
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
(8.14)
We need to distinguish two cases. The rst case is when the one-particle state
(p, ) created by a
p,
is among the states listed in (8.14), for example (p, ) =
(p
i
,
i
). Since electrons are fermions and two fermions cannot occupy the
same state due to the Pauli exclusion principle, this action leads to a zero
result, i.e.
a
p,
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
= 0 (8.15)
The second case is when the created one-particle state (p, ) is not among
the states listed in (8.14). Then the creation operator a
p,
just adds one
electron in the state (p, ) to the beginning of the list of particles
a
p,
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
= [p, ; p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
(8.16)
Operator a
p,
has transformed a state with n electrons to a state with n +1
electrons. Applying multiple creation operators to the vacuum state [0 we
can construct all basis vectors in the Fock space. For example,
a
p
1
,
1
a
p
2
,
2
[0 = [p
1
,
1
; p
2
,
2
p,
. It can be proven [Wei95] that the action of
a
p,
on the n-electron state (8.14) is the following: If the electron state with
parameters (p, ) was already occupied, e.g. (p, ) = (p
i
,
i
) then this state
is annihilated and the number of particles in the system is reduced by one
8.1. ANNIHILATION AND CREATION OPERATORS 243
a
p,
[p
1
,
1
; . . . ; p
i1
,
i1
; p
i
,
i
; p
i+1
,
i+1
; . . . ; p
n
,
n
= (1)
P
[p
1
,
1
; . . . ; p
i1
,
i1
; p
i+1
,
i+1
; . . . ; p
n
,
n
(8.17)
where P is the number of permutations of particles required to bring the
one-particle i to the rst place in the list. If the state (p, ) is not present
in the list, i.e., (p, ) ,= (p
i
,
i
) for each i, then
a
p,
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
= 0 (8.18)
Annihilation operators always yield zero when acting on the vacuum state
a
p,
[0 = 0
The above formulas fully dene the action of creation and annihilation
operators on basis vectors in purely electronic sectors. These rules are easily
generalized to all states: they do not change if other particles are present and
they can be extended to linear combinations of the basis vectors by linearity.
Creation and annihilation operators for other fermions - positrons, protons
and antiprotons - are constructed similarly.
For brevity we will refer to creation and annihilation operators collec-
tively as to particle operators. This will distinguish them from operators of
momentum, position, energy, etc. of individual particles which will be called
particle observables. Let us emphasize that creation and annihilation oper-
ators are not intended to directly describe any real physical process in the
system and they do not correspond to physical observables. They are just
formal mathematical objects that simplify our notation for other operators
having more direct physical meaning. We will see how operators of observ-
ables are built from particle operators later in this book, e.g., in subsection
8.1.8.
8.1.4 Anticommutators of particle operators
In practical calculations one often uses anticommutators of fermion operators.
First consider the case of unequal particle states (p, ) ,= (p
)
244 CHAPTER 8. THE FOCK SPACE
a
p
,
, a
p,
a
p,
a
p
,
+ a
p
,
a
p,
Then, acting on a state [p
, we obtain
(a
p,
a
p
,
+ a
p
,
a
p,
)[p
= a
p
,
[p, ; p
= 0
Similarly, we obtain
(a
p,
a
p
,
+ a
p
,
a
p,
)[p, = 0
(a
p,
a
p
,
+ a
p
,
a
p,
)[p
= a
p,
[0 + a
p
,
[p, ; p
= [p, [p, = 0
One can easily demonstrate that the result is still zero when acting on zero-,
two-, three-, etc. particle states as well as on their linear combinations. So,
we conclude that in the entire Fock space
a
p
,
, a
p,
= 0, if (p, ) ,= (p
)
Similarly, in the case (p, ) = (p
) we obtain
a
p,
, a
p,
= 1
Therefore for all values of p, p
, and
we have
a
p,
, a
p
,
=
p,p
,
(8.19)
Using similar arguments one can show that
a
p,
, a
= a
p,
, a
p
,
= 0
8.1. ANNIHILATION AND CREATION OPERATORS 245
8.1.5 Creation and annihilation operators. Photons
For photons, which are bosons, the properties of creation and annihilation
operators are slightly dierent from those described above. Two or more
bosons may coexist in the same state. Therefore, we dene the action of the
photon creation operator c
p,
on a many-photon state as
c
p,
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
= [p, ; p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
independent of whether or not the state (p, ) already existed. The action
of the adjoint photon annihilation operator c
p,
is
c
p,
[p
1
,
1
; p
2
,
2
; . . . ; p
n
,
n
= 0
if the annihilated state (p, ) was not present in the original state and
c
p
i
,
i
[p
1
,
1
; . . . ; p
i1
,
i1
; p
i
,
i
; p
i+1
,
i+1
; . . . ; p
n
,
n
= [p
1
,
1
; . . . ; p
i1
,
i1
; p
i+1
,
i+1
; . . . ; p
n
,
n
otherwise.
The above formulas are easily extended to states where, in addition to
photons, other particles are also present. The action of creation and an-
nihilation operators on linear combinations of basis vectors is obtained by
linearity.
Similar to subsection 8.1.4, we obtain the following commutation relations
for photon creation and annihilation operators
[c
p,
, c
] =
p,p
,
[c
p,
, c
p
,
] = [c
p,
, c
] = 0
8.1.6 Particle number operators
With the help of particle creation and annihilation operators we can now
build explicit expressions for various operators in the Fock space. Consider,
for example, the product of two photon operators
246 CHAPTER 8. THE FOCK SPACE
N
p,
= c
p,
c
p,
(8.20)
Acting on a state with two photons with quantum numbers (p, ) this oper-
ator yields
N
p,
[p, ; p, = N
p,
c
p,
c
p,
[0 = c
p,
c
p,
c
p,
c
p,
[0
= c
p,
c
p,
c
p,
c
p,
[0 + c
p,
c
p,
[0 = c
p,
c
p,
c
p,
c
p,
[0 + 2c
p,
c
p,
[0
= 2[p, ; p,
while acting on the state [p, ; p
we obtain
N
p,
[p, ; p
= N
p,
c
p,
c
[0 = c
p,
c
p,
c
p,
c
[0
= c
p,
c
p,
c
p,
c
[0 + c
p,
c
[0 = c
p,
c
p,
c
c
p,
[0 + c
p,
c
[0
= [p, ; p
,
, a
p,
=
,
(p p
) (8.21)
The following chain of formulas
,
(p p
)
= p, [p
= 0[a
p,
a
[0 = 0[a
a
p,
[0 +
,
(p p
)
=
,
(p p
)
8.1. ANNIHILATION AND CREATION OPERATORS 247
conrms that our choice (8.21) is consistent with normalization of momentum
eigenvectors (5.19).
The same arguments now can be applied to positrons (operators b
p,
and b
p,
), protons (d
p,
and d
p,
), antiprotons (f
p,
and f
p,
) and photons
(c
p,
and c
p,
). So, nally, we obtain the full set of anticommutation and
commutation relations pertinent to QED
a
p,
, a
= b
p,
, b
= d
p,
, d
= f
p,
, f
= (p p
(8.22)
a
p,
, a
p
,
= b
p,
, b
p
,
= d
p,
, d
p
,
= f
p,
, f
p
,
= a
p,
, a
= b
p,
, b
= d
p,
, d
= f
p,
, f
= 0 (8.23)
[c
p,
, c
] = (p p
(8.24)
[c
p,
, c
] = [c
p,
, c
p
,
] = 0 (8.25)
Commutators of operators related to dierent particles are always zero.
In the continuous momentum limit, the analog of the particle counter
operator (8.20)
p,
= c
p,
c
p,
(8.26)
can be interpreted as the density of photons with helicity at momentum p.
By summing over photon polarizations and integrating density (8.26) over
entire momentum space we can dene an operator for the total number of
photons in the system
N
ph
=
_
dpc
p,
c
p,
We can also write down similar operator expressions for the numbers of other
particles. For example
N
el
=
_
dpa
p,
a
p,
(8.27)
248 CHAPTER 8. THE FOCK SPACE
is the electron number operator. Then we conclude that operator
N = N
el
+ N
po
+ N
pr
+ N
an
+ N
ph
(8.28)
corresponds to the total number of all particles in the system.
8.1.8 Generators of the non-interacting representation
Now we can fully appreciate the benets of introducing annihilation and cre-
ation operators. The expression for the non-interacting Hamiltonian H
0
can
be simply obtained from the particle number operator (8.28) by multiplying
the integrands (particle densities in the momentum space) by energies of free
particles
H
0
=
_
dp
p
=1/2
[a
p,
a
p,
+ b
p,
b
p,
] +
_
dp
p
=1/2
[d
p,
d
p,
+ f
p,
f
p,
]
+c
_
dpp
=1
c
p,
c
p,
(8.29)
where we denoted
p
=
_
m
2
c
4
+ p
2
c
2
the energy of free electrons and
positrons,
p
=
_
M
2
c
4
+ p
2
c
2
the energy of free protons and antiprotons
and cp is the energy of free photons. One can easily verify that H
0
in (8.29)
acts on states in the sector 1(1, 0, 0, 0, 0) just as equation (8.12) and it acts
on states in the sector 1(2, 0, 0, 0, 2) exactly as equation (8.13). So, we have
obtained a single expression which works equally well in all sectors of the
Fock space. Similar arguments demonstrate that operator
P
0
=
_
dpp
=1/2
[a
p,
a
p,
+ b
p,
b
p,
] +
_
dpp
=1/2
[d
p,
d
p,
+ f
p,
f
p,
]
+
_
dpp
=1
c
p,
c
p,
(8.30)
can be regarded as the total momentum operator in QED.
Expressions for the generators J
0
and K
0
are more complicated as they
involve derivatives of particle operators. Let us illustrate their derivation on
8.1. ANNIHILATION AND CREATION OPERATORS 249
an example of a massive spinless particle. Consider the action of a space
rotation e
J
0z
J
0z
[p = [p
x
cos + p
y
sin , p
y
cos p
x
sin , p
z
p
and
p
are, respectively, creation and annihilation operators for
the particle, then
e
J
0z
[p
x
, p
y
, p
z
=
px,py,pz
[p
x
, p
y
, p
z
Rz()p
p
[p
x
, p
y
, p
z
Therefore, for arbitrary 1-particle state, the operator of nite rotation takes
the form
e
J
0z
=
_
dp
Rz()p
p
(8.31)
It is easy to show that the same form is valid everywhere on the Fock space.
An explicit expression for the generator J
0z
can be obtained now by taking
a derivative of (8.31) with respect to
J
0z
= i lim
0
d
d
e
J
0z
= i lim
0
d
d
_
dp
Rz()p
p
= i
_
dp
_
p
y
p
p
x
p
x
p
p
y
_
p
(8.32)
The action of a boost along the z-axis is obtained from (5.28)
e
ic
K
0z
[p =
p
cosh + cp
z
sinh
p
[p
x
, p
y
, p
z
cosh +
p
cosh
(8.33)
6
See equation (5.28).
250 CHAPTER 8. THE FOCK SPACE
This transformation can be represented as annihilation of the state [p =
[p
x
, p
y
, p
z
followed by creation of the state (8.33)
e
ic
K
0z
[p =
p
cosh + cp
z
sinh
px,py,pz
[p
x
, p
y
, p
z
Therefore, for arbitrary 1-particle state in the Fock space, the operator of a
nite boost takes the form
e
ic
K
0z
=
_
dp
_
p
(8.34)
An explicit expression for the generator K
0z
can be now obtained by taking
a derivative of (8.34) with respect to
K
0z
=
i
c
lim
0
d
d
e
ic
K
0z
=
i
c
lim
0
d
d
_
dp
p
cosh + cp
z
sinh
px,py,pz cosh +c
1
p sinh
p
= i
_
dp
_
p
z
2
p
p
+
p
c
2
p
p
z
p
_
(8.35)
Similar derivations can be done for other components of J
0
and K
0
.
8.1.9 Poincare transformations of particle operators
From transformations (5.28) of 1-particle state vectors with respect to the
non-interacting representation
U
0
(; r, t) e
J
0
ic
K
0
P
0
r
e
i
H
0
t
in the Fock space we can nd corresponding transformations of creation-
annihilation operators. For electron creation operators we obtain
7
7
Here we took into account that the vacuum vector is invariant with respect to U
0
.
8.2. INTERACTION POTENTIALS 251
U
0
(; r, t)a
p,
U
1
0
(; r, t)[0 = U
0
(; r, t)a
p,
[0 = U
0
(; r, t)[p,
=
_
p
e
pr+
i
pt
D
1/2
W
(p, ))[p,
Therefore
8
U
0
(; r, t)a
p,
U
1
0
(; r, t) =
_
p
e
pr+
i
pt
D
1/2
W
(p, ))a
p,
=
_
p
e
pr+
i
pt
(D
1/2
)
W
(p, ))a
p,
(8.36)
Similarly, we obtain the transformation law for annihilation operators
U
0
(; r, t)a
p,
U
1
0
(; r, t) =
_
p
e
i
pr
i
pt
D
1/2
W
(p, ))a
p,
(8.37)
Transformation laws for photon operators are obtained from equation
(5.63)
U
0
(; r, t)c
p,
U
1
0
(; r, t) =
[p[
p
e
(pr)+
ic
pt
e
i
W
(p,)
c
p,
(8.38)
U
0
(; r, t)c
p,
U
1
0
(; r, t) =
[p[
p
e
i
(pr)
ic
pt
e
i
W
(p,)
c
p,
(8.39)
8.2 Interaction potentials
Our primary goal in the rest of this rst part of the book is to learn how to
calculate the S-operator in QED, which is the quantity most readily com-
parable with experiment. Equations in subsection 7.1.2 tell us that in order
8
Here
and
denote complex conjugation and Hermitian conjugation, respectively.
We also use the property D
T
(
) = (D
))
= (D
1
(
))
= D
Ht
F(0)e
Ht
= F(0)
It then follows that conserved observables commute with the Hamiltonian
[F, H] = [F, H
0
+V ] = 0, which imposes some restrictions on the interaction
operator V . For example, the conservation of the total momentum and the
total angular momentum implies that
[V, P
0
] = 0 (8.40)
[V, J
0
] = 0 (8.41)
These commutators are automatically satised in the instant form of dynam-
ics (6.22) adopted in our study. It is also well-established that all interactions
8.2. INTERACTION POTENTIALS 253
conserve the lepton number (the number of electrons minus the number of
positrons, in our case). Therefore, H = H
0
+ V must commute with the
lepton number operator
L = N
el
N
po
=
_
dp(a
p,
a
p,
b
p,
b
p,
) (8.42)
Since H
0
already commutes with L, we obtain
[V, L] = 0 (8.43)
Moreover, all known interactions conserve the baryon number (=the number
of protons minus the number of antiprotons in our case). So, V must also
commute with the baryon number operator
B = N
pr
N
an
=
_
dp(d
p,
d
p,
f
p,
f
p,
) (8.44)
[V, B] = 0 (8.45)
Taking into account that electrons have charge e, protons have charge e and
antiparticles have charges opposite to those of particles, we can introduce the
electric charge operator
Q = e(B L)
= e
_
dp(b
p,
b
p,
a
p,
a
p,
+ d
p,
d
p,
f
p,
f
p,
) (8.46)
and obtain the charge conservation law
[H, Q] = [V, Q] = e[V, B L] = 0 (8.47)
from equations (8.43) and (8.45).
As we saw above, both operators H
0
and V in QED commute with P
0
,
J
0
, L, B and Q. Then from subsection 7.1.2 it follows that scattering op-
erators F, and S also commute with P
0
, J
0
, L, B and Q, which means
254 CHAPTER 8. THE FOCK SPACE
that corresponding observables (total momentum, total angular momentum,
lepton number, baryon number and charge, respectively) are conserved in
scattering events. Although, separate numbers of particles of individual
species, i.e., electrons, or protons may not be conserved, the above con-
servation laws require that charged particles may be created or annihilated
only together with their antiparticles, i.e., in pairs. Creation of pairs is sup-
pressed in low energy reactions as such processes require additional energy
of 2m
el
c
2
= 2 0.51MeV = 1.02MeV for an electron-positron pair and
2m
pr
c
2
= 1876.6MeV for an proton-antiproton pair. Therefore such high-
energy processes can be safely neglected in most applications usually con-
sidered in classical electrodynamics. Since photons have zero mass, charge,
lepton and baryon numbers, the energetic threshold for the photon emission
is zero and there are no restrictions on creation and annihilation of photons.
They can be created and destroyed in any quantities.
8.2.2 Normal ordering
In the next subsection we are going to express operators in the Fock space
as polynomials in particle creation and annihilation operators. But rst we
need to overcome one notational problem related to the non-commutativity
of particle operators: two dierent polynomials may, actually, represent the
same operator. To have a unique polynomial representative for each operator,
we will agree always to write products of operators in the normal order, i.e.,
creation operators to the left from annihilation operators. Among creation
(annihilation) operators we will enforce a certain order based on particle
species: We will write particle operators in the order proton - antiproton -
electron - positron - photon from left to right. With these rules and with
(anti)commutation relations (8.22) - (8.25) we can always convert a product
of particle operators to the normally ordered form. This is illustrated by the
following example
a
p
,
c
q
,
a
p,
c
q,
= a
p
,
a
p,
c
q
,
c
q,
= (a
p,
a
p
,
+ (p p
)
,
)(c
q,
c
q
,
+ (q q
)
,
))
= a
p,
c
q,
a
p
,
c
q
,
+ a
p,
a
p
,
(q q
)
,
q,
c
q
,
(p p
)
,
+ (p p
)
,
(q q
)
,
N=0
M=0
V
NM
(8.49)
V
NM
=
{,
}
_
dq
1
. . . dq
N
dq
1
. . . dq
M
D
NM
(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)
_
N
i=1
q
j=1
q
j
_
1
,
1
. . .
N
,
q
1
,
1
. . .
q
M
,
M
(8.50)
where the summation is carried over all spin/helicity indices ,
of cre-
ation and annihilation operators and integration is carried over all particle
momenta. Individual terms V
NM
in the expansion (8.49) of the interaction
Hamiltonian will be called potentials. Each potential is a normally ordered
product of N creation operators
_
N
i=1
q
j=1
q
j
_
which expresses the conservation of momentum: the sum of momenta of
annihilated particles is equal to the sum of momenta of created particles.
The interaction Hamiltonian V enters in formulas (7.15), (7.17) and (7.19)
for the S-operator in the t-dependent form
V (t) = e
i
H
0
t
V e
H
0
t
(8.52)
Operators with t-dependence determined by the free Hamiltonian H
0
as in
equation (8.52) and satisfying conservation laws (8.48), (8.51) will be called
regular. Such operators will play an important role in our calculations of
the S-operator below. In what follows, when we write a regular operator V
without its t-argument, this means that either this operator is t-independent,
i.e., it commutes with H
0
, or that we take its value at t = 0.
One nal notational remark. If potential V
NM
has coecient function
D
NM
, we introduce notation V
NM
for the operator whose coecient func-
tion D
NM
is a product of D
NM
and a numerical function of the same
arguments
D
NM
(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)
= D
NM
(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)
Then, substituting (8.50) in (8.52) and using (8.36) - (8.39), the t-dependent
form of any regular potential V
NM
(t) can be written as
V
NM
(t) = e
i
H
0
t
V
NM
e
H
0
t
= V
NM
e
i
E
NM
t
where
8.2. INTERACTION POTENTIALS 257
E
NM
(q
1
, . . . , q
N
, q
1
, . . . , q
M
)
N
i=1
j=1
q
j
(8.53)
is the dierence of energies of particles created and destroyed by V
NM
, which
is called the energy function of the term V
NM
. We can also extend this
notation to a general sum of potentials V
NM
V (t) = e
i
H
0
t
V e
H
0
t
= V e
i
E
V
t
which means that for each potential V
NM
(t) in the sum V (t), the argument
of the t-exponent contains the corresponding energy function E
NM
. In this
notation we can conveniently write
d
dt
V (t) = V (t)
_
i
E
V
_
V (t)
..
i
V (t)dt = 2iV (E
V
) (8.54)
Equation (8.54) means that each term in V (t)
..
is non-zero only on the hyper-
surface of solutions of the equation
E
NM
(q
1
, . . . , q
N
, q
1
, . . . , q
M
) = 0 (8.55)
(if such solutions exist). This hypersurface in the 3(N + M) dimensional
momentum space is called the energy shell of the potential V . We will also
say that V (t)
..
in equation (8.54) is zero outside the energy shell of V . Note
that the scattering operator (7.14) S = 1+(t)
..
is dierent from 1 only on the
energy shell, i.e., where the energy conservation condition (8.55) is satised.
8.2.4 Five types of regular potentials
Here we would like to introduce a classication of regular potentials (8.50) by
dividing them into ve groups depending on their index (N, M). We will call
258 CHAPTER 8. THE FOCK SPACE
RR
R, O
unphys
u
n
p
h
y
s p
h
y
s
M
NN 0 11 2 33 4
00
1
22
3
44
unphys, decay
u
n
p
h
y
s
,
d
e
c
a
y
Figure 8.1: Positions of dierent operator types in the index space (N, M).
N and M are numbers of creation and annihilation operators, respectively.
R = renorm, O = oscillation.
these types of operators renorm, oscillation, decay, phys and unphys.
10
The
rationale for introducing this classication and nomenclature will become
clear in chapters 10 and 11 where we will examine renormalization and the
dressed particle approach in quantum eld theory.
Renorm potentials have either index (0,0) (such operator is simply a
numerical constant) or index (1,1) in which case both created and annihi-
lated particles are required to have the same mass. The most general form
of a renorm potential obeying conservation laws is the sum of a numerical
constant C and (1,1) terms corresponding to each particle type.
11
R = a
a + b
b + d
d + f
f + c
c + C (8.56)
10
The correlation between potentials index (N, M) and its type is shown in Fig. 8.1.
There is no established terminology for the types of potentials. In the literature, our phys
operators are sometimes called good; unphys operators may be called bad or virtual.
11
Here we write just the operator structure of R omitting all numerical factors, indices,
integration and summation signs. Note also that terms like a
b or d
f are forbidden by
the charge conservation law (8.47).
8.2. INTERACTION POTENTIALS 259
Note that the free Hamiltonian (8.29) and the total momentum (8.30) are
examples of renorm operators, i.e., sums of renorm potentials. Renorm po-
tentials are characterized by the property that the energy function (8.53)
is identically zero. So, renorm potentials always have energy shell where
they do not vanish. Renorm potentials commute with H
0
, therefore regular
renorm operators
12
do not depend on t.
Oscillation potentials have index (1, 1). In contrast to renorm po-
tentials with index (1,1), oscillation potentials destroy and create dierent
particle species having dierent masses. For this reason, the energy function
(8.53) of an oscillation potential never turns to zero, so there is no energy
shell. In QED there can be no oscillation potentials, because they would
violate either lepton number or baryon number conservation law. However,
there are particles in nature, such as kaons and neutrinos, for which oscil-
lation interactions play a signicant role. These interactions are responsible
for time-dependent oscillations between dierent particle species [GL].
Decay potentials satisfy two conditions:
1. they must have indices (1, N) or (N, 1) with N 2;
2. they must have a non-empty energy shell on which the coecient func-
tion does not vanish;
Apparently, these potentials describe decay processes 1 N
13
in which
one particle decays into N decay products, so that the energy conservation
condition is satised. There are no decay terms in the QED Hamiltonian and
in the corresponding S-matrix: decays of electrons, protons, or photons would
violate conservation laws.
14
Nevertheless, particle decays play important
roles in other areas of high energy physics, and they will be considered in
chapter 13.
12
whose t-dependence is determined by (8.52)
13
as well as reverse processes N 1
14
Exceptions to this rule are given by operators describing the decay of a photon into odd
number of photons, e.g., c
k1,1
c
k2,2
c
k3,3
c
k1+k2+k3,4
. This potential obeys all conserva-
tion laws if momenta of involved photons are collinear and k
1
+k
2
+k
3
[k
1
+k
2
+k
3
[ = 0.
However, it was shown in [FM96] that such contributions to the S-operator are zero on
the energy shell, so photon decays are forbidden in QED.
260 CHAPTER 8. THE FOCK SPACE
Phys potentials have at least two creation operators and at least two
destruction operators (index (N, M) with N 2 and M 2). For phys
potentials the energy shell always exists. For example, in the case of a phys
potential d
p+k,
f
qk,
a
p,
b
q,
the energy shell is determined by the solution
of equation
p+k
+
qk
=
p
+
q
which is not empty.
Table 8.1: Types of potentials in the Fock space.
Potential Index of potential Energy shell Examples
(N, M) exists?
Renorm (0, 0),(1, 1) yes a
p
a
p
Oscillation (1, 1) no forbidden in QED
Unphys (0, N 1),(N 1, 0) no a
p
b
pk
c
k
Unphys (1, N 2),(N 2, 1) no a
p
a
pk
c
k
Decay (1, N 2),(N 2, 1) yes forbidden in QED
Phys (N 2, M 2) yes d
q+k
a
pk
d
q
a
p
All regular operators not mentioned above belong to the class of
Unphys potentials. They come in two subclasses with following indices
1. (0, N), or (N, 0), where N 1. Obviously, there is no energy shell in
this case.
2. (1, M) or (M, 1), where M 2. This is the same condition as for
decay potentials, however, in contrast to decay potentials, for unphys
potentials it is required that either the energy shell does not exist or
the coecient function vanishes on the energy shell.
An example of an unphys potential satisfying condition 2. is
a
p,
a
pk,
c
k,
(8.57)
Its energy shell equation is
pk
+ ck =
p
whose only solution is k = 0.
However zero vector is excluded from the photon momentum spectrum,
15
so the energy shell of the potential (8.57) is empty. This means that a
free electron cannot decay into the pair electron+photon without violating
energy-momentum conservation laws.
15
see subsection 5.3.1
8.2. INTERACTION POTENTIALS 261
Properties of potentials discussed above are summarized in Table 8.1.
These ve types of potentials exhaust all possibilities, therefore any regular
operator V must have a unique decomposition
V = V
ren
+ V
unp
+ V
dec
+ V
ph
+ V
osc
As mentioned above, in QED interaction, oscillation and decay contributions
are absent. So, everywhere in this book
16
we will assume that all interaction
operators are sums of renorm, unphys and phys potentials
V = V
ren
+ V
unp
+ V
ph
Now we need to learn how to perform various operations with these three
classes of potentials, i.e., how to calculate products, commutators and t-
integrals required for calculations of scattering operators in (7.14) - (7.19).
8.2.5 Products and commutators of potentials
Lemma 8.1 The product of two (or any number of ) regular operators is
regular.
Proof. If operators A(t) and B(t) are regular, then
A(t) = e
i
H
0
t
Ae
H
0
t
B(t) = e
i
H
0
t
Be
H
0
t
and their product C(t) = A(t)B(t) has t-dependence
C(t) = e
i
H
0
t
Ae
H
0
t
e
i
H
0
t
Be
H
0
t
= e
i
H
0
t
ABe
H
0
t
characteristic for regular operators. The conservation laws (8.48), (8.51) are
valid for the product AB if they are valid for A and B separately. Therefore
C(t) is regular.
16
except chapter 13 where we discuss decays
262 CHAPTER 8. THE FOCK SPACE
Lemma 8.2 A Hermitian operator A is phys if and only if it yields zero
when acting on the vacuum [0 and one-particle states [1
[0
17
A[0 = 0 (8.58)
A[1 = A
[0 = 0 (8.59)
Proof. Normally ordered phys operators have two annihilation operators
on the right, so equations (8.58) and (8.59) are satised. Let us now prove
the inverse statement. Renorm operators cannot satisfy (8.58) and (8.59)
because they conserve the number of particles. Unphys operators (1, N) can
satisfy equations (8.58) and (8.59), e.g.,
3
[0 = 0
3
[1 = 0.
However, for Hermiticity, such operators should be always present in pairs
with (N, 1) operators
1
. Then, there exists at least one one-particle
state [1 for which equation (8.59) is not valid, e.g.,
1
[1 =
2
[0 , = 0
The same argument is valid for unphys operators having index (0, N). There-
fore, the only remaining possibility for A is to be phys.
Lemma 8.3 Product and commutator of any two phys operators A and B
is phys.
Proof. By Lemma 8.2 if A and B are phys, then
A[0 = B[0 = A[1 = B[1 = 0.
Then the same conditions are true for the Hermitian combinations i(AB
BA) and AB +BA. Therefore, the commutator [A, B] and anticommutator
AB + BA are phys and
17
Here denotes any one of the ve particle operators (a, b, d, f, c) relevant for QED.
The spin index is omitted for simplicity.
8.2. INTERACTION POTENTIALS 263
AB =
1
2
(AB + BA) +
1
2
[A, B]
is phys as well.
Lemma 8.4 If R is a renorm operator and [A, R] ,= 0, then operator [A, R]
has the same type (i.e., renorm, phys, or unphys) as A.
Proof. The general form of a renorm operator is given in equation (8.56).
Let us consider just one term in that sum
R =
_
dpf(p)
p
We calculate the commutator [A, R] = AR RA by moving the factor R
in the term AR step-by-step to the leftmost position. If the product
p
(from R) changes places with a particle operator (from A) dierent from
q
(from A) then, as discussed in subsection 8.2.2, a secondary term
should be added which, instead of
q
contains the commutator
18
q
__
dpf(p)
p
_
__
dpf(p)
p
_
q
=
_
dpf(p)
_
dpf(p)
q
=
_
dpf(p)
_
dpf(p)
p
(p q)
_
dpf(p)
q
= f(q)
q
This commutator is proportional to
q
, so the secondary term has the same
operator structure as the primary term and it is already in the normal order,
so no tertiary terms need to be created. If the product
changes places
with an annihilation operator
q
then the commutator f(q)
q
is propor-
tional to the annihilation operator. If there are many
and operators in A
18
The upper sign is for bosons and the lower sign is for fermions.
264 CHAPTER 8. THE FOCK SPACE
having non-vanishing commutators with R, then each one of them results in
one additional term whose type remains the same as in the original operator
A.
Lemma 8.5 A commutator [P, U] of a Hermitian phys operator P and a
Hermitian unphys operator U can be either phys or unphys, but not renorm.
Proof. Acting by [P, U] on a one-particle state [1, we obtain
[P, U][1 = (PU UP)[1 = PU[1
If U is Hermitian then the state U[1 has at least two particles (see proof of
Lemma 8.2) and the same is true for the state PU[1. Therefore, [P, U] cre-
ates several particles when acting on a one-particle state, which is impossible
if [P, U] were renorm.
Finally, there are no limitations on the type of commutator of two unphys
operators [U, U
H
0
t
Ae
H
0
t
=
i
e
i
H
0
t
[H
0
, A]e
H
0
t
=
i
[H
0
, A(t)] (8.60)
In addition, it is easy to check that the derivative obeys all conservation laws
(8.48), (8.51). Therefore, it is regular.
Suppose that
d
dt
A(t) has a non-zero renorm part R. Then Ris t-independent
and originates from a derivative of the term Rt + S in A(t), where S is t-
independent. Since A(t) is regular, its renorm part must be t-independent,
therefore R = 0.
From formula (7.25) we conclude that t-integrals of regular phys and
unphys operators are regular
19
V (t) = V (t)
1
E
V
(8.61)
However, this property does not hold for t-integrals of renorm operators.
These operators are t-independent, therefore
V
ren
= lim
+0
V
ren
ie
t
= lim
+0
V
ren
V
ren
it
+ . . . (8.62)
V
ren
..
= (8.63)
Thus, renorm operators are dierent from others in the sense that their t-
integrals (8.62) are innite and non-regular. Innite-limit t-integrals (8.63)
are innite, unless V
ren
= 0.
20
Since for any unphys operator V
unp
either the energy shell does not ex-
ist or the coecient function is zero on the energy shell, we conclude from
equation (8.54) that
19
Here we assume the adiabatic switching of interaction (7.24).
20
This fact does not limit the applicability of our theory, because, as we will see in
subsection 10.1.2, a properly renormalized expression for scattering operators F in (7.19)
and in (7.15) should not contain renorm terms and pathological expressions like (8.62)
- (8.63).
266 CHAPTER 8. THE FOCK SPACE
V
unp
..
= 0 (8.64)
From equations (7.15) and (7.19) it is then clear that unphys terms in F and
do not make contributions to the S-operator. Results obtained so far in
this subsection are presented in last three columns of Table 8.2.
In calculations of scattering operators we always assume adiabatic switch-
ing of interaction.
21
So, we can simplify our notation by dropping t-arguments
and dening t-integrals by formulas (8.61) and (8.54)
V = V
1
E
V
(8.65)
V
..
= 2iV (E
V
) (8.66)
Then expressions for the S-operator (7.14) and (7.18) can be written in a
shorthand notation
S = exp( F
..
) = 1 +
..
(8.67)
F = V
1
2
[V , V ] + . . . (8.68)
= V + V V + V V V + . . . (8.69)
8.2.7 Solution of one commutator equation
In section 11.2 we will nd it necessary to solve equation of the type
i[H
0
, A] = V (8.70)
where H
0
is the free Hamiltonian, V is a given regular Hermitian operator in
the Fock space and A is the desired solution (an unknown Hermitian opera-
tor). What can we say about the solution of this commutator equation? Let
us rst multiply both side of (8.70) by the usual t-exponents e
i
H
0
t
. . . e
H
0
t
21
See subsection 7.1.3.
8.2. INTERACTION POTENTIALS 267
i[H
0
, A(t)] = V (t)
Using (8.60) this can be written as
d
dt
A(t) = V (t) (8.71)
Note that it follows from Lemma 8.6 that the operator on the right hand
side of (8.70) cannot contain renorm terms. Indeed, according to (8.71), the
left hand side of (8.70) is a t-derivative, which cannot be renorm. Luckily,
for our purposes in this book we will never meet equations of the above type
with renorm right hand sides. Therefore, we will assume V
ren
= 0.
Next we assume that the usual adiabatic switching (7.24) works, so
that V () = 0 and the same property is valid for the solution A(t)
A() = 0 (8.72)
Equation (8.71) with the initial condition (8.72) has a simple solution
A(t) =
1
t
_
V (t
)dt
= iV (t) (8.73)
In order to get a t-independent solution of our original equation (8.70), we
can simply set t = 0 and obtain
A A(0) = iV iV
1
E
V
(8.74)
8.2.8 Two-particle potentials
Our next goal is to express n-particle potentials (n 2)
22
using the formalism
of annihilation and creation operators. These potentials conserve the number
and types of particles, so they must have equal numbers of creation and
22
studied in subsection 6.3.4
268 CHAPTER 8. THE FOCK SPACE
annihilation operators (N = M, N 2, M 2). Therefore, their type must
be phys.
Consider now a two-particle subspace 1(1, 0, 1, 0, 0) of the Fock space.
This subspace describes states of the system consisting of one electron and
one proton. A general phys operator leaving this subspace invariant must
have N = 2, M = 2 and, according to equation (8.50), it can be written as
23
V =
_
dpdqdp
dq
D
22
(p, q, p
, q
)(p +q p
)d
p
a
q
d
p
a
q
=
_
dpdqdp
D
22
(p, q, p
, p +q p
)d
p
a
q
d
p
a
p+qp
=
_
dpdqdkV (p, q, k)d
p
a
q
d
pk
a
q+k
(8.75)
where we denoted k = p p
dq
(p
, q
)d
[0 (8.76)
we obtain
V [
=
_
dpdqdkV (p, q, k)d
p
a
q
d
pk
a
q+k
_
dp
dq
(p
, q
)d
[0
=
_
dpdqdkV (p, q, k)
_
dp
dq
(p
, q
)(p k p
)(q +k q
)d
p
a
q
[0
=
_
dpdq
_
_
dkV (p, q, k)(p k, q +k)
_
d
p
a
q
[0 (8.77)
Comparing this with (8.76) we see that the momentum-space wave function
(p, q) has been transformed by the action of
V to the new wave function
23
In this subsection we use variables p and q to denote momenta of the proton and
electron, respectively. We also omit spin indices for brevity.
8.2. INTERACTION POTENTIALS 269
(p, q)
V (p, q) =
_
dkV (p, q, k)(p k, q +k)
This is the most general linear transformation of a two-particle wave function
which conserves the total momentum.
For comparison with traditionally used inter-particle potentials, it is more
convenient to have expression for operator
V in the position space. We can
write
24
(x, y)
V (x, y) =
1
(2)
3
_
e
i
px+
i
qy
dpdq
(p, q)
=
1
(2)
3
_
e
i
px+
i
qy
dpdq
__
dkV (p, q, k)(p k, q +k)
_
=
1
(2)
3
_
e
i
(p+k)x+
i
(qk)y
dpdq
_
dkV (p +k, q k, k)(p, q)
=
_
dke
i
k(xy)
V (p +k, q k, k)
_
1
(2)
3
_
dpdqe
i
px+
i
qy
(p, q)
_
(8.78)
where expression in square brackets is recognized as the original position-
space wave function
(x, y) =
1
(2)
3
_
dpdqe
i
px+
i
qy
(p, q) (8.79)
and the rest is an operator acting on this wave function. This operator
acquires especially simple form if we assume that V (p, q, k) does not depend
on p and q
V (p, q, k) = v(k)
Then
24
Here x and y are positions of the proton and the electron, respectively; and we use
(5.42) to change from the momentum representation to the position representation.
270 CHAPTER 8. THE FOCK SPACE
V (x, y) =
_
dke
i
k(xy)
v(k)(x, y) = w(x y)(x, y) (8.80)
where
w(r) =
_
dke
i
kr
v(k)
is the Fourier transform of v(k). We see that interaction (8.75) acts as
multiplication by the function w(r) in the position space. So, it is a usual
position-dependent potential. Note that the requirement of the total mo-
mentum conservation implies automatically that this potential depends on
the relative position r x y.
As an example consider interaction operator of the form (8.75)
V =
q
1
q
2
(2)
3
_
dpdqdk
k
2
d
p
a
q
d
pk
a
q+k
(8.81)
where constants q
1
and q
2
can be interpreted as charges of the two particles
and v(k) = q
1
q
2
/(8
3
k
2
). Then the position-space interaction is the usual
Coulomb potential
25
w(r) =
q
1
q
2
(2)
3
_
dk
k
2
e
i
kr
=
q
1
q
2
4r
(8.82)
Let us now consider the general case (8.78). Without loss of generality
we can represent function V (p +k, q k, k) as a series
26
V (p +k, q k, k) =
j
(p, q)v
j
(k)
Then we obtain
25
see equation (B.7)
26
For example, a series of this form can be obtained by writing a Taylor expansion with
respect to the variable k with
j
being the coecients depending on p and q.
8.2. INTERACTION POTENTIALS 271
V (x, y)
=
j
_
dke
i
k(xy)
v
j
(k)
_
1
(2)
3
_
dpdq
j
(p, q)e
i
px+
i
qy
(p, q)
_
=
j
w
j
(x y)
j
( p, q)
_
1
(2)
3
_
dpdqe
i
px+
i
qy
(p, q)
_
=
j
w
j
(x y)
j
( p, q)(x, y) (8.83)
where p = i(d/dx) and q = i(d/dy) are dierential operators, i.e.,
position-space representations (5.40) of the momentum operators of the two
particles. Expression (8.83) then demonstrates that interaction d
da can
be always represented as a general 2-particle potential depending on the
distance between particles and their momenta. We will use equation (8.83)
in our derivation of 2-particle RQD potentials in subsection 12.1.2.
8.2.9 Cluster separability in the Fock space
We know that a cluster separable interaction potential can be constructed as a
sum of smooth potentials (6.49) depending on particle observables (positions,
momenta and spins). However, this notation is very inconvenient to use in
the Fock space, because such sums have rather dierent forms in dierent
Fock sectors. For example, the Coulomb interaction has the form (6.47)
in the 2-particle sector and the form (6.48) in the 3-particle sector. This
notational dierence is very inconvenient. It would be more preferable to
have a unique formula for inter-particle interaction, which remains valid in
all N-particle sectors. Fortunately, it is easy to satisfy cluster separability
within our standard notation (8.49) - (8.50). We just need to make sure that
factors D
NM
are smooth functions of particle momenta.
27
Let us verify this statement on a simple example. We are going to nd
out how the 2-particle potential (8.75)
28
acts in the 3-particle (one proton
and two electrons) sector of the Fock space 1(2, 0, 1, 0, 0) where state vectors
have the form
27
see section 4 in [Wei95].
28
As a concrete example, it is instructive to choose the Coulomb interaction (8.81) in
these calculations.
272 CHAPTER 8. THE FOCK SPACE
[ =
_
dpdq
1
dq
2
(p, q
1
, q
2
)d
p
a
q
1
a
q
2
[0 (8.84)
Applying operator (8.75) to this state vector we obtain
V [
=
_
dp
dq
dk
_
dpdq
1
dq
2
V (p
, q
, k)(p, q
1
, q
2
)d
d
p
k
a
q
+k
d
p
a
q
1
a
q
2
[0
(8.85)
The product of particle operators acting on the vacuum state can be normally
ordered as
d
d
p
k
a
q
+k
d
p
a
q
1
a
q
2
[0
= d
p
a
q
1
a
q
2
d
p
k
a
q
+k
[0 + (p
k p)d
a
q
+k
a
q
1
a
q
2
[0
(q
1
q
k)d
p
a
q
2
d
p
k
[0 + (q
2
q
k)d
p
a
q
1
d
p
k
[0
= (p
k p)(q
1
q
k)d
q
2
[0 (q
2
q
k)(p
k p)d
q
1
[0
Inserting this result in (8.85) we obtain
V [
=
_
dkdpdq
1
dq
2
V (k +p, q
1
k, k)(p, q
1
, q
2
)d
k+p
a
q
1
k
a
q
2
[0
_
dkdpdq
1
dq
2
V (k +p, q
2
k, k)(p, q
1
, q
2
)d
k+p
a
q
2
k
a
q
1
[0
=
_
dpdq
1
dq
2
_
_
dkV (p, q
1
, k)(p k, q
1
+k, q
2
)
+
_
dkV (p, q
2
, k)(p k, q
1
, q
2
+k)
_
d
p
a
q
1
a
q
2
[0 (8.86)
Comparing this with equation (8.77) we see that, as expected from the con-
dition of cluster separability, the two-particle interaction in the three-particle
sector separates in two terms. One term acts on the pair of variables (p, q
1
).
The other term acts on variables (p, q
2
).
8.2. INTERACTION POTENTIALS 273
Removing the electron 2 to innity is equivalent to multiplying the momentum-
space wave function (p, q
1
, q
2
) by exp(
i
q
2
a) where a . The action of
V on such a wave function
29
is
lim
a
_
_
dkV (p, q
1
, k)(p k, q
1
+k, q
2
)e
i
q
2
a
+
_
dkV (p, q
2
, k)(p k, q
1
, q
2
+k)e
i
(q
2
+k)a
_
In the limit a the exponent in the integrand of the second term is a
rapidly oscillating function of k. If the coecient function V (p, q, k) is a
smooth function of k then the integral on k is zero due to the Riemann-
Lebesgue lemma B.1. Therefore, only the interaction proton - electron(1)
does not vanish
30
lim
a
V e
i
q
2
a
[
= lim
a
_
dpdq
1
dq
2
__
dkV (p, q
1
, k)(p k, q
1
+k, q
2
)
_
e
i
q
2
a
d
p
a
q
1
a
q
2
[0
=
_
lim
a
_
dq
2
e
i
q
2
a
a
q
2
[0
___
dpdq
1
dkV (p, q
1
, k)(p k, q
1
+k, q
2
)d
p
a
q
1
[0
_
which demonstrates that
V is a cluster separable potential.
For general potentials (8.49) - (8.50) with smooth coecient functions the
above arguments can be repeated: If some particles are removed to innity
such potentials automatically separate into sums of smooth terms, as required
by cluster separability. Therefore,
Statement 8.7 (cluster separability) The cluster separability of the in-
teraction (8.49) is guaranteed if coecient functions D
NM
of all interaction
potentials V
NM
are smooth functions of momenta.
The power of this statement is that when expressing interacting potentials
through particle operators in the momentum representation (as in (8.50)) we
have a very simple criterion of cluster separability: the coecient functions
29
i.e., the term in parentheses in (8.86)
30
compare with equation (8.77)
274 CHAPTER 8. THE FOCK SPACE
must be smooth, i.e., they should not contain singular factors, like delta
functions.
31
This is the great advantage of writing interactions in terms
of particle (creation and annihilation) operators (8.50) instead of particle
(position and momentum) observables as in section 6.3.
32
8.3 A toy model theory
Before considering real QED interactions in the next chapter, in this section
we are going to perform a warm-up exercise. We will introduce a simple yet
quite realistic model theory with variable number of particles in the Fock
space. In this theory, the perturbation expansion of the S-operator can be
evaluated with minimal eorts, in particular, with the help of a convenient
diagram technique.
8.3.1 Fock space and Hamiltonian
The toy model introduced here is a rough approximation to QED. This ap-
proximation describes only electrons and photons and their interactions. No
particle-antiparticle pair creation is allowed. So, we will work in the part of
the Fock space with electrons, photons and no other particles. This part is a
direct sum of electron-photon sectors like those described in formulas (8.2) -
(8.10). We will also assume that interaction does not aect the electron spin
and photon polarization degrees of freedom, so the corresponding labels will
be omitted. Then relevant (anti)commutation relations of particle operators
can be taken from (8.22) - (8.25)
a
p
, a
= (p p
) (8.87)
[c
p
, c
] = (p p
) (8.88)
a
p
, a
p
= a
p
, a
= 0 (8.89)
[c
p
, c
p
] = [c
p
, c
] = 0 (8.90)
31
This is the reason why cluster separable potentials were called smooth in subsection
6.3.4.
32
Recall that in subsection 6.3.6 it was a very non-trivial matter to ensure the cluster
separability for interaction potentials written in terms of particle observables even in a
simplest 3-particle system.
8.3. A TOY MODEL THEORY 275
[a
p
, c
] = [a
p
, c
p
] = [a
p
, c
] = [a
p
, c
p
] = 0
The full Hamiltonian
H = H
0
+ V
1
(8.91)
as usual, is the sum of the free Hamiltonian
H
0
=
_
dp
p
a
p
a
p
+ c
_
dkkc
k
c
k
and interaction, which we choose in the following unphys form
V
1
=
ec
(2)
3/2
_
dpdk
ck
a
p
c
k
a
p+k
+
ec
(2)
3/2
_
dpdk
ck
a
p
a
pk
c
k
(8.92)
The coupling constant e is equal to the absolute value of the electrons charge.
Here and in what follows the perturbation order of an operator (= the power
of the coupling constant e in the operator) is shown by the subscript. For
example, the free Hamiltonian H
0
does not depend on e, so it is of zero
perturbation order; the perturbation order of V
1
is one, etc.
The above theory satises conservation laws
[H, Q] = [H, P
0
] = [H, J
0
] = 0
where operators P
0
, J
0
and Q refer to the total momentum, total angular
momentum operator and total charge, respectively
P
0
=
_
dpp(a
p
a
p
+ c
p
c
p
)
Q = e
_
dpa
p
a
p
The number of electrons is conserved, due to the conservation of charge.
However the number of photons can vary. So, this theory can describe im-
portant processes of the photon emission and absorption. However, our toy
model has two major drawbacks:
276 CHAPTER 8. THE FOCK SPACE
First, it is not Poincare invariant. This means that we have not con-
structed an interacting boost operator K such that the Poincare commuta-
tion relations with H, P
0
and J
0
are satised. In this section we will tolerate
the lack of invariance, but in chapter 9 we will show how the Poincare invari-
ance can be satised in a more comprehensive theory (QED) which includes
both particles and antiparticles.
The second drawback is that due to the presence of singularities k
1/2
in
the coecient functions of (8.92), the interaction V
1
formally does not satisfy
our criterion of cluster separability in Statement 8.7. Luckily, for our low-
perturbation-order calculations here, these singularities are harmless. The
eective interaction
33
remains cluster separable anyway.
8.3.2 Drawing a diagram in the toy model
In this subsection we would like to introduce a diagram technique which
would greatly facilitate perturbative calculations of scattering operators (8.67)
- (8.69). Let us graphically represent each term in the interaction potential
(8.92) as a vertex (see Fig. 8.2). Each particle operator in V
1
is represented
as an oriented line or arrow. The line corresponding to an annihilation oper-
ator enters the vertex and the line corresponding to a creation operator leaves
the vertex. Electron lines are shown by full arcs and photon lines are shown
by dashed arrows on the diagram. Each line is marked with the momentum
label of the corresponding particle operator. Free ends of the electron lines
are attached to the vertical electron order bar on the left hand side of the
diagram. The order of these external lines (from bottom to top of the order
bar) corresponds to the order of electron particle operators in the potential
(from right to left). An additional numerical factor is indicated in the upper
left corner of the diagram.
The t-integral
V
1
=
ec
(2)
3/2
_
dpdk
ck
a
p
c
k
a
p+k
p
+ ck
p+k
ec
(2)
3/2
_
dpdk
ck
a
p
a
pk
c
k
p
ck
pk
(8.93)
33
see subsection 8.3.5
8.3. A TOY MODEL THEORY 277
+1 +1
p+k
pp
kk k k
pk
pp
(a)
((bb))
Figure 8.2: Diagram representation of the interaction operator V
1
.
diers from V
1
only by the factor E
1
V
1
(see equation (8.65)), which is rep-
resented in the diagram 8.3 by drawing a box that crosses all external lines.
A line entering (leaving) the box contributes its energy with the negative
(positive) sign to the energy function E
V
1
.
In order to calculate the -operator (8.69) in the 2nd perturbation order
we need products V
1
V
1
. The diagram corresponding to the product of two
diagrams AB is obtained by simply placing the diagram B below the diagram
A and attaching external electron lines of both diagrams to the same order
bar. For example, the diagram for the product of the second term in (8.92)
(Fig. 8.2(b)) and the rst term in (8.93) (Fig. 8.3(a))
V
1
V
1
(a
p
a
pk
c
k
)(a
q
c
a
q+k
) + . . . (8.94)
is shown in Fig. 8.4(a).
34
This product should be further converted to the
normal form, i.e., all creation operators should be moved to the left. On the
diagram, the movement of creation operators from right to left is represented
by the movement of free outward pointing arrows upward, so that at the end
of this process all outgoing lines are positioned below incoming lines. Due to
34
By convention, we will place free ends of photon external lines on the right hand side
of the diagram. The order of these free ends (from top to bottom of the diagram) will
correspond to the order of photon particle operators in the expression (from left to right).
For example, in Fig. 8.4(a) the incoming photon line is above the outgoing photon line,
which corresponds to the order cc
q
to the leftmost position and add
another term due to the anticommutator a
pk
, a
q
= (q p +k).
V
1
V
1
a
q
a
p
a
pk
a
q+k
c
k
c
+ (q p +k)a
p
a
q+k
c
k
c
= a
q
a
p
a
pk
a
q+k
c
k
c
+ a
p
a
pk+k
c
k
c
+ . . . (8.95)
Expression (8.95) is represented by two diagrams 8.4(b) and 8.4(c). In
the diagram 8.4(b) the electron line marked q has been moved to the top of
the electron order bar. In the diagram 8.4(c) the product (q p +k) and
the integration by q are represented by joining or pairing the incoming elec-
tron line carrying momentum p k with the outgoing electron line carrying
momentum q. This produces an internal electron line carrying momentum
p k between two vertices.
8.3. A TOY MODEL THEORY 279
+1
q+k
kk
(a) pp
pk
qq
k
+1
kk
pk
qq ((bb))
k
q+k
pp
+1
k k
pk+k
((c c))
k
pp
==
+1
kk
pk
qq
((dd))
k
q+k
pp
+1
kk
qq
pk
((ee))
pp
q+k
+1
((ff)) +1
k k
p p
((g g))
pk
p p
==
Figure 8.4: The normal product of operators in Fig. 8.2(b) and 8.3(a).
In the expression (8.95), electron operators are in the normal order, how-
ever, photon operators are not there yet. The next step is to change the
order of photon operators
V
1
V
1
a
q
a
p
c
a
pk
a
q+k
c
k
+ a
q
a
p
a
pk
a
q+k
(k
k)
+a
p
c
a
pk+k
c
k
+ a
p
a
pk+k
(k
k) + . . .
= a
q
a
p
c
a
pk
a
q+k
c
k
+ a
q
a
p
a
pk
a
q+k
+a
p
c
a
pk+k
c
k
+ a
p
a
p
+ . . . (8.96)
The normal ordering of photon operators in 8.4(b) yields diagrams 8.4(d) and
8.4(e) according to equation (8.88). Diagrams 8.4(f) and 8.4(g) are obtained
from 8.4(c) in a similar way.
8.3.3 Reading a diagram in the toy model
With the above diagram rules and some practice, one can perform calcula-
tions of scattering operators (8.68) and (8.69) much easier than in the usual
280 CHAPTER 8. THE FOCK SPACE
algebraic way. During these diagram manipulations we, actually, do not need
to keep track of the momentum labels of lines. The algebraic expression of
the result can be easily restored from an unlabeled diagram by following
these steps:
(I) Assign a distinct momentum label to each external line, except one,
whose momentum is obtained from the (momentum conservation) con-
dition that the sum of all incoming external momenta minus the sum
of all outgoing external momenta is zero.
(II) Assign momentum labels to internal lines so that the momentum con-
servation law is satised at each vertex: The sum of momenta of lines
entering the vertex is equal to the sum of momenta of outgoing lines.
If there are loops, one needs to introduce new independent loop mo-
menta
35
(III) Read external lines from top to bottom of the diagram and write corre-
sponding particle operators from left to right.
36
Do it rst for electron
lines and then for photon lines.
(IV) For each box, write a factor (E
f
E
i
)
1
, where E
f
is the sum of
energies of particles going out of the box and E
i
is the sum of energies
of particles coming into the box.
(V) For each vertex introduce a factor
ec
(2)
3
ck
, where k is the momentum
of the photon line attached to the vertex.
(VI) Integrate the obtained expression by all independent external momenta
and loop momenta.
8.3.4 Electron-electron scattering
Let us now try to extract some physical information from the above theory.
We will calculate low order terms in the perturbation expansion (8.69) for
the -operator
35
see diagram 8.4(g) in which k is the loop momentum.
36
Incoming lines correspond to annihilation operators; outgoing lines correspond to cre-
ation operators.
8.3. A TOY MODEL THEORY 281
1
= V
1
(8.97)
2
= (V
1
V
1
)
unp
+ (V
1
V
1
)
ph
+ (V
1
V
1
)
ren
(8.98)
To obtain corresponding contributions to the S-operator we need to take
t-integrals
S = 1 +
1
..
+
2
..
+. . .
Note that the right hand side of (8.97) and the rst term on the right hand
side of (8.98) are unphys, so, due to equation (8.64), they do not contribute
to the S-operator. For now, we also ignore the contribution of the renorm
term in (8.98).
37
Then we obtain in the 2nd perturbation order
S
2
= (V
1
V
1
)
ph
. .
+. . . (8.99)
Operator V
1
V
1
has several terms corresponding to dierent scattering pro-
cesses. Some of them were calculated in subsection 8.3.2. For example, the
term of the type a
aa
described by the diagram in Fig. 8.4(e). According to the rules (I) - (VII),
this diagram can be written algebraically as
Fig.8.4(e) =
2
e
2
c
(2)
3
_
dpdqdk
k(ck +
pk
p
)
a
pk
a
q+k
a
p
a
q
The t-integral of this expression is
Fig.8.4(e)
. .
=
2ie
2
2
c
(2)
3
_
dpdqdk
(
pk
+
q+k
p
)
k(ck +
pk
p
)
a
pk
a
q+k
a
p
a
q
(8.100)
37
In fact, the renorm term (V
1
V
1
)
ren
should have been canceled by a renormalization
counterterm, as will be discussed in chapter 10.
282 CHAPTER 8. THE FOCK SPACE
The delta function in (8.100) expresses the conservation of energy in the
scattering process. We will also say that expression (8.100) is non-zero only
on the energy shell which is a solution of the equation
pk
+
q+k
=
q
+
p
In the non-relativistic approximation (p, q mc)
p
_
p
2
c
2
+ m
2
c
4
mc
2
+
p
2
2m
(8.101)
Then in the limit of small momentum transfer (k mc) the denominator
can be approximated as
k (ck +
pk
p
) k
_
ck + mc
2
+
(p k)
2
2m
mc
2
p
2
2m
_
ck
2
(8.102)
Substituting this result in (8.100), we obtain the second order contribution
to the S-operator
S
2
[a
aa]
ie
2
4
2
_
dpdqdk
(
pk
+
q+k
p
)
k
2
a
pk
a
q+k
a
q
a
p
(8.103)
8.3.5 Eective potential
We have discussed in subsection 7.2.1 that the same scattering matrix can
correspond to dierent total Hamiltonians with dierent interaction poten-
tials. Here we would like to demonstrate this idea by constructing the fol-
lowing 2nd order eective interaction between our model electrons
V
2eff
=
e
2
(2)
3
_
dpdqdk
k
2
a
pk
a
q+k
a
q
a
p
(8.104)
With this interaction the 2nd order amplitude for electron-electron scattering
is obtained by the usual formula (8.67)
8.4. DIAGRAMS IN A GENERAL THEORY 283
S
2eff
= V
2eff
. .
=
ie
2
4
2
_
dpdqdk
(
pk
+
q+k
p
)
k
2
a
pk
a
q+k
a
q
a
p
This is the same result as (8.103), in spite of the fact that the new eective
Hamiltonian H
0
+ V
2eff
is completely dierent from the original Hamilto-
nian (8.91). In particular, it is important to note that interaction (8.104)
is phys, while (8.92) is unphys. The replacement of an unphys interaction
with a scattering-equivalent eective phys potential is the central idea of the
dressed particle approach to quantum eld theory in chapter 11.
From equations (8.80) and (B.7) it follows that interaction (8.104) corre-
sponds to the ordinary position-space Coulomb potential
38
w(r) =
e
2
(2)
3
_
dk
e
i
kr
k
2
=
e
2
4r
(8.105)
So, our toy model is quite realistic.
8.4 Diagrams in a general theory
8.4.1 Properties of products and commutators
The diagrammatic approach developed for the toy model above can be easily
extended to interactions in the general form (8.49): Each potential V
NM
with
N creation operators and M annihilation operators can be represented by a
vertex with N outgoing and M incoming lines. In calculations of scattering
operators (8.68) and (8.69) we meet products of such potentials.
39
Y = V
(1)
V
(2)
. . . V
(V)
(8.106)
As explained in subsection 8.2.2, we should bring these products to the nor-
mal order. The normal ordering transforms (8.106) into a sum of terms y
(j)
38
see also equation (8.82)
39
1 is the number of potentials in the product.
284 CHAPTER 8. THE FOCK SPACE
Y =
j
y
(j)
(8.107)
each of which can be described by a diagram with 1 vertices.
Each potential V
(i)
in the product (8.106) has N
(i)
creation operators,
M
(i)
annihilation operators and N
(i)
+M
(i)
momentum integrals. Then each
term y
(j)
in the expansion (8.107) has
A =
V
i=1
(N
(i)
+ M
(i)
) (8.108)
integrals and independent momentum integration variables. This term also
has a product of 1 delta functions, which express the conservation of the total
momentum in each of the factors V
(i)
. In the process of normal ordering of the
product (8.106), a certain number of pairs of external lines in the factors V
(i)
have to be joined together to make internal lines and to introduce additional
delta-functions. Let us denote by J the number of such delta functions in
y
(j)
. Then the total number of delta functions in y
(j)
is
N
= 1 +J (8.109)
and the number of external lines is
c = A 2J (8.110)
The terms y
(j)
in the normally ordered product (8.107) can be either discon-
nected or connected. In the latter case there is a continuous sequence (path)
of internal lines connecting any two vertices. In the former case such a path
does not exist and the diagram splits into several separated pieces.
Consider a product of two potentials V
(1)
and V
(2)
V
(1)
V
(2)
=
j
y
(j)
(8.111)
where the right hand side is written in the normally ordered form. From the
example (8.94) it should be clear that in a general product (8.111) there is
only one disconnected term in the sum on the right hand side. Let us denote
this term y
(0)
(V
(1)
V
(2)
)
disc
. This is the term in which the factors from
8.4. DIAGRAMS IN A GENERAL THEORY 285
the original product are simply rearranged and no pairings are introduced.
40
All other terms y
(1)
, y
(2)
, . . . on the right hand side of (8.111) are connected,
because they have at least one pairing which is represented on the diagram
by one or more internal lines connecting vertices V
(1)
and V
(2)
.
Lemma 8.8 The disconnected part of a product of two connected bosonic
operators
41
does not depend on the order of the product
(V
(1)
V
(2)
)
disc
= (V
(2)
V
(1)
)
disc
(8.112)
Proof. Operators V
(1)
V
(2)
and V
(2)
V
(1)
dier only by the order of particle
operators. So, after all particle operators are brought to the normal order in
(V
(1)
V
(2)
)
disc
and (V
(2)
V
(1)
)
disc
, they may dier, at most, by a sign. So, our
goal is to show that this sign is plus (+). Any reordering of boson particle
operators does not aect the sign of an expression, so for our proof we do
not need to pay attention to creation and annihilation operators of bosons
in the two factors. Let us now focus only on fermion particle operators
in V
(1)
and V
(2)
. For simplicity, we will assume that only electron and/or
positron particle operators are present in V
(1)
and V
(2)
. The inclusion of
the proton and antiproton operators will not change anything in this proof,
except its length. For the two factors V
(i)
(where i = 1, 2) let us denote N
(i)
e
the numbers of electron creation operators, N
(i)
p
the numbers of positron
creation operators, M
(i)
e
the numbers of electron annihilation operators and
M
(i)
p
the numbers of positron annihilation operators. Taking into account
that V
(i)
are assumed to be normally ordered, we may formally write
V
(1)
[N
(1)
e
][N
(1)
p
][M
(1)
e
][M
(1)
p
]
V
(2)
[N
(2)
e
][N
(2)
p
][M
(2)
e
][M
(2)
p
]
where the bracket [N
(1)
e
] denotes the product of N
(1)
e
electron creation op-
erators from the term V
(1)
, the bracket [N
(1)
p
] denotes the product of N
(1)
p
positron creation operators from the term V
(1)
, etc. Then
40
see, for example, the rst term on the right hand side of (8.96) and Fig. 8.4(d)
41
As discussed in subsection 8.2.3, all potentials considered in this book are bosonic.
286 CHAPTER 8. THE FOCK SPACE
V
(1)
V
(2)
[N
(1)
e
][N
(1)
p
][M
(1)
e
][M
(1)
p
][N
(2)
e
][N
(2)
p
][M
(2)
e
][M
(2)
p
] (8.113)
V
(2)
V
(1)
[N
(2)
e
][N
(2)
p
][M
(2)
e
][M
(2)
p
][N
(1)
e
][N
(1)
p
][M
(1)
e
][M
(1)
p
] (8.114)
Let us now bring particle operators on the right hand side of (8.114) to the
same order as on the right hand side of (8.113). First we move N
(1)
e
electron
creation operators to the leftmost position in the product. This involves
N
(1)
e
M
(2)
e
permutations with electron annihilation operators from the factor
V
(2)
and N
(1)
e
N
(2)
e
permutations with electron creation operators from the
factor V
(2)
. Each of these permutations changes the sign of the disconnected
term, so the acquired factor is (1)
N
(1)
e
(N
(2)
e
+M
(2)
e
)
.
Next we need to move the [N
(1)
p
] factor to the second position from the
left. The factor acquired after this move is (1)
N
(1)
p
(N
(2)
p
+M
(2)
p
)
. Then we move
the factors [M
(1)
e
] and [M
(1)
p
] to the third and fourth places in the product,
respectively. Finally, the total factor acquired by the expression (V
(2)
V
(1)
)
disc
after all its terms are rearranged in the same order as in (V
(1)
V
(2)
)
disc
is
f = (1)
K
(1)
e
K
(2)
e
+K
(1)
p
K
(2)
p
(8.115)
where we denoted
K
(i)
e
N
(i)
e
+ M
(i)
e
K
(i)
p
N
(i)
p
+ M
(i)
p
the total (= creation + annihilation) numbers of electron and positron oper-
ators, respectively, in the factor V
(i)
. Then it is easy to prove that the power
of (-1) in (8.115) is even, so that f = 1. Indeed, consider the case when K
(1)
e
is even and K
(2)
e
is odd. Then the product K
(1)
e
K
(2)
e
is odd. From the bosonic
character of V
(1)
and V
(2)
it follows that K
(1)
e
+ K
(1)
p
and K
(2)
e
+ K
(2)
p
are
even numbers. Therefore K
(1)
p
is odd and K
(2)
p
is even, so that the product
K
(1)
p
K
(2)
p
is odd and the total power of (-1) in (8.115) is even.
The same result is obtained for any other assumption about the even/odd
character of K
(1)
e
and K
(2)
e
. This proves (8.112).
8.4. DIAGRAMS IN A GENERAL THEORY 287
Theorem 8.9 A multiple commutator of bosonic potentials is connected.
Proof. Let us rst consider a single commutator of two potentials V
(1)
and
V
(2)
.
V
(1)
V
(2)
V
(2)
V
(1)
(8.116)
According to Lemma 8.8, the disconnected terms (V
(1)
V
(2)
)
disc
and (V
(2)
V
(1)
)
disc
in the commutator (8.116) are canceled and all remaining terms are con-
nected. This proves the theorem for a single commutator (8.116). Since this
commutator is also bosonic, by repeating the above arguments, we conclude
that all multiple commutators of bosonic operators are connected.
Lemma 8.10 In a connected diagram the number of independent loops is
42
/ = J 1 + 1 (8.117)
Proof. If there are 1 vertices, they can be connected together without
making loops by 1 1 internal lines. Each additional internal line will make
one independent loop. Therefore, the total number of independent loops is
J (1 1).
An example of a connected diagram is shown in Fig. 8.5. This diagram
has 1 = 4 vertices, c = 5 external lines, J = 7 internal lines and / = 4 inde-
pendent loops. This diagram describes a nine-fold momentum integral. Five
integration momentum variables correspond to external lines in the diagram:
There are two incoming momenta p
1
, p
2
and three outgoing momenta p
3
,
p
4
and p
5
. These ve integrals/variables are a part of the general expression
for the potential (8.50). Four additional integrals are performed by loop mo-
menta p
6
, p
7
, p
8
and p
9
. These integrals can be absorbed in the denition
of the coecient function
D
3,2
(p
3
, p
4
, p
5
; p
1
, p
2
) =
_
dp
6
dp
7
dp
8
dp
9
D
A
(p
6
, p
7
, p
8
, p
1
+p
2
p
6
p
7
p
8
; p
1
, p
2
)
42
Here 1 is the number of vertices in the diagram and J is the number of internal lines.
288 CHAPTER 8. THE FOCK SPACE
pp
11
+p
22
p
55
p
88
p
99
pp
11
+p
22
p
77
p
88
p
99
pp
11
+p
22
p
66
p
77
p
88
pp
22
pp
11
pp
66
pp
99 pp
77
pp
88
pp
33
pp
44
pp
55
DD
CC
BB
AA
Figure 8.5: A diagram representing one term in the 4-order product of a
hypothetical theory. Here we do not draw the order bars as in subsection
8.3.2. However, we draw all outgoing lines on the top of the diagram and
all incoming lines at the bottom to indicate that the diagram is normally
ordered. Note that all internal lines are oriented upwards because all paired
operators (i.e., those operators whose order should be changed by the normal
ordering procedure) in the product (8.106) always occur in the order
.
8.4. DIAGRAMS IN A GENERAL THEORY 289
D
B
(p
9
, p
1
+p
2
p
7
p
8
p
9
; p
6
, p
1
+p
2
p
6
p
7
p
8
)
D
C
(p
5
, p
1
+p
2
p
5
p
8
p
9
; p
7
, p
1
+p
2
p
7
p
8
p
9
)
D
D
(p
3
, p
4
; p
8
, p
9
, p
1
+p
2
p
5
p
8
p
9
)
1
E
A
(E
C
+ E
D
)
(8.118)
where E
A
, E
C
and E
D
are energy functions of the corresponding vertices and
D
A
, D
B
, D
C
, D
D
are coecient functions at the vertices.
8.4.2 Cluster separability of the S-operator
In agreement with Postulate 6.3 (cluster separability) and Statement 8.7
(cluster separability of smooth potentials), interactions considered in this
book are cluster-separable. In this subsection we are going to prove that the
S-operator calculated with such interaction is always cluster-separable too.
Physically, this means that if a multiparticle scattering system is separated
into two (or more) distant subsystems, then the result of scattering in each
subsystem will not depend on what is going on in other subsystems.
From perturbation formulas for the S-operator in subsection 7.1.2 we
know that S is, generally, a sum of products of interaction potentials like
(8.106). Mathematically, the cluster separability of interactions means that
coecient functions of interaction potentials V
(i)
in the product (8.106) are
smooth. If we could show that the product (8.106) itself is a sum of smooth
operators, then the desired cluster separability of the S-operator would follow
directly from Statement 8.7. The question about the smoothness of (8.106)
is not trivial, because bringing such a product to the normal order involves
permutations of particle operators that produce singular delta functions.
The following theorem establishes an important connection between the
smoothness of terms on the right hand side of (8.107) and the connectivity
of corresponding diagrams.
Theorem 8.11 Each term y
(j)
in the expansion (8.107) of the product of
smooth potentials is smooth if and only if it is represented by a connected
diagram.
Proof. Let us rst assume that y
(j)
is represented by a connected diagram.
We will establish the smoothness of the term y
(j)
by proving that it can be
represented in the general form (8.50) in which the integrand contains only
290 CHAPTER 8. THE FOCK SPACE
one delta function required by the momentum conservation condition and the
coecient function D
NM
is smooth.
43
From equation (8.108), the original
number of integrals in y
(j)
is A. Integrals corresponding to c external lines
are parts of the general form (8.50) and integrals corresponding to / loops
can be absorbed into the denition of the coecient function of y
(j)
. The
number of remaining integrals is then
A
= A c / = J +1 1 (8.119)
Then we have just enough integrals to cancel all momentum delta functions
(8.109) except one, which proves that the term y
(i)
is smooth.
Inversely, suppose that the term y
(j)
is represented by a disconnected
diagram with 1 vertices and J internal lines. Then the number of indepen-
dent loops / is greater than the value J 1 +1 characteristic for connected
diagrams. Then the number of integrations A
2
c
(2)
3
_
dpdk
a
p
a
p
(
pk
p
+ ck)k
(8.121)
Substituting this result to the right hand side of (8.98) we see that the S-
operator in the second order
2
..
is innite, which makes it meaningless and
unacceptable.
The appearance of innities in products and commutators of potentials in
formulas for the S-operator (8.68) and (8.69) is a commonplace in quantum
eld theories. So, we need to better understand this phenomenon. In this
292 CHAPTER 8. THE FOCK SPACE
subsection, we will discuss the convergence of general loop integrals and we
will formulate a sucient condition under which loop integrals are conver-
gent. We will nd these results useful in our discussion of the renormalization
of QED in chapter 10 and in our construction of a divergence-free theory in
section 11.2.
Let us consider, for example, the diagram in Fig. 8.5. There are three
dierent reasons why loop integrals may diverge there:
(I) The coecient functions D
A
, D
B
, . . . in (8.118) may contain singular-
ities. A good example is interaction (8.92), which produces the k = 0
divergence in the integral (8.121).
(II) There can be also singularities due to zeroes in energy denominators
E
A
and E
C
+ E
D
.
(III) The coecient functions D
A
, D
B
, . . . may not decay fast enough at
large values of loop momenta, so that the integrals may be divergent
due to the innite integration range.
The singularities in coecient functions of interaction potentials are usually
related to the vanishing photon mass. They correspond to the so-called
infrared divergences of loop integrals. We will discuss them in greater detail
in chapters 10 and 13.6. The energy denominators may be rendered nite
and harmless if we use the adiabatic switching prescription from subsection
7.1.3.
44
The divergence of loop integrals at large integration momentum or
ultraviolet divergence (point (III) above) is a more serious problem, which
we are going to discuss here. We would like to show that this divergence is
closely related to the behavior of coecient functions of potentials far from
the energy shell. In particular, we would like to prove the following
Theorem 8.13 If coecient functions of potentials decay suciently rapidly
(e.g., exponentially) when arguments move away from the energy shell, then
all loop integrals converge.
Idea of the proof. Equation (8.118) is an integral in a 12-dimensional
space of 4 loop momenta p
6
, p
7
, p
8
and p
9
. Let us denote this space .
Consider for example the dependence of the integrand in (8.118) on the loop
44
see also footnote on page 290
8.4. DIAGRAMS IN A GENERAL THEORY 293
momentum p
9
as p
9
and all other momenta xed. Note that we
have chosen integration variables in Fig. 8.5 in such a way that each loop
momentum is present only in the internal lines forming the corresponding
loop, e.g., momentum p
9
is conned to the loop BDCB and the energy
function E
A
of the vertex A does not depend on p
9
. Such a selection of
integration variables can be done for any arbitrary diagram. Taking into
account that at large values of momentum
p
cp, we obtain in the limit
p
9
E
A
const,
E
B
=
p
1
+p
2
p
7
p
8
p
9
+
p
9
p
6
p
1
+p
2
p
6
p
7
p
8
2cp
9
,
E
C
=
p
1
+p
2
p
5
p
8
p
9
+
p
5
p
7
p
1
+p
2
p
7
p
8
p
9
const,
E
D
=
p
3
+
p
4
p
8
p
9
p
1
+p
2
p
5
p
8
p
9
2cp
9
.
So, in this limit, according to the condition of the theorem, the coecient
functions at vertices B and D tend to zero rapidly, e.g., exponentially. So,
the loop integral on p
9
converges. In order to prove the convergence of
all four loop integrals, we need to make sure that the same rapid decay is
characteristic for all directions in the space . Here are some arguments that
this is, indeed, true.
The above analysis is applicable to all loop variables: Any loop has a
bottom vertex (vertex B in our example), a top vertex (vertex D in our
example) and possibly a number of intermediate vertices (vertex C in our
example). As the loop momentum goes to innity, energy functions of the top
and bottom vertices tend to innity, i.e., move away from the energy shell.
This ensures a fast (e.g., exponential) decay of the corresponding coecient
function.
Now we can take an arbitrary direction to innity in the space . Along
this direction, there is at least one loop momentum which goes to innity.
Then there is at least one energy function (E
A
, E
B
, E
C
, or E
D
) which grows
linearly, while others stay constant (in the worst case). Therefore, according
to the condition of the theorem, the integrand decreases rapidly (e.g., expo-
nentially) along this direction. Therefore the integrand rapidly tends to zero
in all directions and integral (8.118) converges.
294 CHAPTER 8. THE FOCK SPACE
In chapter 10 we will see that in realistic theories, like QED, the asymp-
totic decay of the coecient functions of potentials at large momenta is not
fast enough, so Theorem 8.13 is not applicable and loop integrals usually
diverge. A detailed discussion of such divergences in quantum eld theory
and their elimination will be presented in chapter 10, in section 11.2 and in
chapter 13.6.
Chapter 9
QUANTUM
ELECTRODYNAMICS
If it turned out that some physical system could not be described
by a quantum eld theory, it would be a sensation; if it turned
out that the system did not obey the rules of quantum mechanics
and relativity, it would be a cataclysm.
Steven Weinberg
So far we have developed a general formalism of quantum theory. We
emphasized that any such theory must obey, at least, three important re-
quirements:
the theory must be relativistically invariant, in the instant form of
dynamics;
the interaction must be cluster separable;
the theory must allow for processes involving creation and annihilation
of particles.
We have considered a few model examples, but they were purely academic
and not directly relevant to real systems observed in nature. The reason for
such inadequacy was that our models failed to satisfy all three requirements
mentioned above.
295
296 CHAPTER 9. QUANTUM ELECTRODYNAMICS
For example, in subsection 6.3.6 we constructed an interacting model
that explicitly satised the requirement of relativistic invariance. We also
managed to ensure that the model is cluster separable in the 3-particle sector.
In principle, it is possible to show that by following this approach one can
build cluster-separable interactions in all n-particle sectors. There is even a
possibility for describing systems with variable number of particles [Pol03].
However, the resulting formalism is very cumbersome and it can be applied
only to model systems. It seems that the major problem is with cluster
separability, which is formulated in terms of dicult-to-use conditions like
(6.53) - (6.56).
In section 8.3 we considered another toy model of interacting particles,
which was based on the formalism of creation and annihilation operators.
The great advantage of this formalism was that the cluster separability con-
dition could be conveniently expressed in terms of smoothness of interaction
potentials.
1
The processes of particle creation and annihilation were easily
described as well. However, the dicult part was to ensure the relativistic
invariance. In our toy model we have not even tried to make the theory
relativistic.
Fortunately, there is a class of theories, which allow one to satisfy all
three conditions listed above. What is even more important, these theories
are directly applicable to realistic physical systems and allow one to achieve
an impressive agreement with experiments. These are quantum eld theo-
ries (QFT). A particular version of QFT for describing interactions between
electrically charged particles and photons is called quantum electrodynamics
(QED), which is the topic of our discussion in this chapter.
In section 9.1 we will write down interaction terms V (potential energy)
and Z (potential boost) in QED. The relativistic invariance of this approach is
proven in Appendix N.2. In section 9.2 S-matrix elements will be calculated
in the lowest non-trivial order of perturbation theory.
9.1 Interaction in QED
Our goal in this section is to build a realistic interacting representation
U(, a) of the Poincare group in the Fock space (8.1). In this book we do not
pretend to derive QED interactions from rst principles. We simply borrow
from the traditional approach the form of four interacting generators of the
1
see Statement 8.7
9.1. INTERACTION IN QED 297
Poincare group H and K in terms of quantum elds for electrons/positrons
( x), protons/antiprotons
( x) and photons A
i
(x, t)
dened on an abstract Minkowski space-time /
4
and satisfying following
conditions:
(I) Operator
i
(x, t) contains only terms linear in creation or annihilation
operators of the particle and its antiparticle.
(II) Quantum elds are supposed to have simple transformation laws
2
A particle and its antiparticle are assumed to belong to the same particle type.
3
This means that for each value of its arguments (x, t) and index i, symbol denotes
an operator acting in the Fock space.
4
see Appendix I.1
298 CHAPTER 9. QUANTUM ELECTRODYNAMICS
U
0
(; a)
i
( x)U
1
0
(; a) =
j
D
ij
(
1
)
j
(( x + a)) (9.1)
with respect to the non-interacting representation
5
U
0
(; a) of the Poincare
group in the Fock space, where is a boost/rotation, a is a space-
time translation and D
ij
is a nite-dimensional representation
6
of the
Lorentz group.
(III) Quantum elds turn to zero at x-innity, i.e.
lim
|x|
i
(x, t) = 0 (9.2)
(IV) Fermionic elds (i.e., elds for particles with half-integer spin)
i
( x)
and
j
( x
) is a space-like 4-
vector, or equivalently
i
(x, t),
j
(y, t) = 0 if x ,= y (9.3)
Fermionic quantum elds for electrons-positrons
( x) and protons-
antiprotons
) is a space-like
4-vector, or equivalently
[
i
(x, t),
j
(y, t)] = 0 if x ,= y (9.4)
Bosonic quantum eld for photons A
( x) is discussed in Appendix K.
5
see subsection 8.1.8
6
The representation D
ij
is denitely non-unitary, because the Lorentz group is non-
compact and it is known that non-compact groups cannot have nite-dimensional unitary
representations.
9.1. INTERACTION IN QED 299
Step 2. Having at our disposal quantum elds
i
( x),
j
( x),
k
( x), . . . for all
particles we can build the potential energy density
V ( x) V (x, t) =
n
V
n
(x, t) (9.5)
in the form of polynomial where each term is a product of elds at the same
(x, t) point
V
n
(x, t) =
i,j,k,...
G
n
ijk...
i
(x, t)
j
(x, t)
k
(x, t) . . . (9.6)
and coecients G
n
ijk...
are such that V ( x)
(I) is a bosonic
7
Hermitian operator function on the space-time /;
(II) transforms as a scalar with respect to the non-interacting representa-
tion of the Poincare group:
U
0
(; a)V ( x)U
1
0
(; a) = V ( x + a) (9.7)
From properties (9.3) - (9.4) and the bosonic character of V ( x) it is easy
to prove that V ( x) commutes with itself at space-like separations, e.g.,
[V (x, t), V (y, t)] = 0 if x ,= y (9.8)
Step 3. Instant-form interactions in the Hamiltonian and boost operator are
obtained by integrating the potential energy density (9.5) on x and setting
t = 0
H = H
0
+ V = H
0
+
_
dxV (x, 0) (9.9)
K = K
0
+Z = K
0
+
1
c
2
_
dxxV (x, 0) (9.10)
7
i.e., there is an even number of fermionic elds in each product in (9.6)
300 CHAPTER 9. QUANTUM ELECTRODYNAMICS
Non-interacting generators H
0
, P
0
, J
0
and K
0
can be found in (8.29),
(8.30), (8.32) and (8.35), respectively. With these denitions, the commu-
tation relations of the Poincare Lie algebra are proved in Appendix N.1.
Coecient functions of potentials in (9.9) are smooth, so the cluster separa-
bility is established by reference to Statement 8.7. The terms that change the
number of particles appear naturally, too. Then all three conditions listed
in the beginning of this chapter are readily satised. This explains why
quantum eld theories are so useful for describing realistic physical systems.
9.1.2 Interaction operators in QED
Unfortunately, formulas (9.9) and (9.10) work only for simplest QFT mod-
els. More interesting cases, such as QED, require some modications in this
scheme. In particular, the presence of the additional term
( x, ) in the
transformation law of the photon eld (K.23) does not allow us to dene the
boost interaction in QFT by simple formula (9.10). Let us now formulate
QED interaction operators without proof.
The total Hamiltonian of QED has the usual form
H = H
0
+ V (9.11)
where the non-interacting Hamiltonian H
0
is that from equation (8.29) and
interaction is composed of two terms
8
V = V
1
+ V
2
(9.12)
The rst order interaction is a pseudoscalar product of two 4-component
quantities. One of them is the 4-vector fermion current operator
j( x) dened
in Appendix L.1. The other is the photon quantum eld
A( x)
9
8
Here and in what follows we denote the power of the coupling constant e (the pertur-
bation order of an operator) by a subscript, i.e., H
0
is zero order, V
1
is rst order, V
2
is
second order, etc.
9
Here we mark the photon quantum eld
A by tilde as if it was a 4-vector. However,
as shown in Appendix K.6, the components of
A do not transform by 4-vector rules. The
last equality in (9.13) follows from equation (K.8).
9.1. INTERACTION IN QED 301
V
1
=
1
c
_
dx
j(x, 0)
A(x, 0)
1
c
_
dxj
(x, 0)A
(x, 0)
=
1
c
_
dxj(x, 0) A(x, 0) (9.13)
The second order interaction is
V
2
=
1
2c
2
_
dxdyj
0
(x, 0)((x y)j
0
(y, 0) (9.14)
where
((x)
1
4[x[
(9.15)
Interaction in the boost operator
K = K
0
+Z (9.16)
is dened as
Z =
1
c
3
_
dxxj(x, 0)A(x, 0) +
1
2c
4
_
dxdyj
0
(x, 0)x((x y)j
0
(y, 0)
+
1
c
3
_
dxj
0
(x, 0)C(x, 0) (9.17)
where components of the operator function C(x, t) are given by equation
(N.7).
The above operators of energy H and boost K are those usually written
in the Coulomb gauge version of QED [Wei95, Wei64b]. In Appendix N.2 we
prove that this theory is Poincare invariant. For this proof it is convenient
to represent interaction operators of QED in terms of quantum elds, as
above. However, for some calculations in this book it will be more useful to
express interactions through particle creation and annihilation operators, as
in chapter 8. To do that, we just need to insert eld expansions (J.26) and
(K.2) in equations (9.13) and (9.14). The resulting expressions are rather
long and cumbersome, so this derivation is moved to Appendix L.
302 CHAPTER 9. QUANTUM ELECTRODYNAMICS
9.2 S-operator in QED
Having at our disposal all 10 generators of the Poincare group representation
in the Fock space, in principle, we should be able to calculate all physical
quantities related to systems of charged particles and photons. However, this
statement is overly optimistic. In chapters 10 and 11 we will see that the
theory outlined above has serious problems and internal contradictions. In
fact, this theory allows one to calculate only simplest physical properties in
low perturbation orders. An example of such a calculation will be given in
this section: Here we will calculate the S-operator for the proton-electron
scattering in the 2nd perturbation order.
9.2.1 S-operator in the second order
We are interested in S-operator terms of the type d
1
2
[V
1
, V
1
] (9.18)
. . .
Taking into account that operator V
1
is unphys, so that F
1
..
= V
1
..
= 0,
10
we obtain the following perturbation expansion
S = e
F
..
= 1 + F
..
+
1
2!
F
..
F
..
+. . .
= 1 + F
1
..
+ F
2
..
+
1
2!
F
1
..
F
1
..
+
1
2!
F
2
..
F
1
..
+
1
2!
F
1
..
F
2
..
+ F
3
..
+. . .
= 1 + F
2
..
+ F
3
..
. . .
= 1 + V
2
..
1
2
[V
1
, V
1
]
. .
+ F
3
..
. . . (9.19)
10
see equation (8.64)
9.2. S-OPERATOR IN QED 303
Let us rst evaluate expression
1
2
[V
1
, V
1
] in (9.19). Only the four rst
terms in equation (L.8) for V
1
are relevant for our calculation:
11
V
1
=
e
(2)
3/2
_
dkdpA
(p +k)A
(p)C
(k)
e
(2)
3/2
_
dkdpA
(p k)A
(p)C
(k)
+
e
(2)
3/2
_
dkdpD
(p +k)D
(p)C
(k)
+
e
(2)
3/2
_
dkdpD
(p k)D
(p)C
(k) + . . .
According to (8.65), the corresponding terms in V
1
are
V
1
=
e
(2)
3/2
_
dkdpA
(p +k)A
(p)C
(k)
1
p+k
p
ck
+
e
(2)
3/2
_
dkdpA
(p k)A
(p)C
(k)
1
pk
p
+ ck
e
(2)
3/2
_
dkdpD
(p +k)D
(p)C
(k)
1
p+k
p
ck
e
(2)
3/2
_
dkdpD
(p k)D
(p)C
(k)
1
pk
p
+ ck
+. . . (9.20)
In order to obtain terms of the type D
DA in the expression [V
1
, V
1
],
we need to consider four commutators: the 1st term in V
1
commuting with
the 4th term in V
1
, the 2nd term in V
1
commuting with the 3rd term in V
1
,
the 3rd term in V
1
commuting with the 2nd term in V
1
and the 4th term in
V
1
commuting with the 1st term in V
1
. Using commutator (K.12) we obtain
1
2
[V
1
, V
1
]
11
Operators A, C, D are dened in (J.49) - (J.56) and (K.11).
304 CHAPTER 9. QUANTUM ELECTRODYNAMICS
=
e
2
2(2)
3
_
dkdpdk
dp
(p +k)A
(p)D
(p
)D
(p
)
[C
(k), C
(k
)]
1
p+k
p
ck
e
2
2(2)
3
_
dkdpdk
dp
(p k)A
(p)D
(p
+k
)D
(p
)
[C
(k), C
(k
)]
1
pk
p
+ ck
e
2
2(2)
3
_
dkdpdk
dp
(p +k)D
(p)A
(p
)A
(p
)
[C
(k), C
(k
)]
1
p+k
p
ck
e
2
2(2)
3
_
dkdpdk
dp
(p k)D
(p)A
(p
+k
)A
(p
)
[C
(k), C
(k
)]
1
pk
p
+ ck
+ . . .
=
e
2
2
c
4(2)
3
_
dkdpdq
k
(k)
_
D
(p k)D
(p)A
(q +k)A
(q)
1
q+k
q
ck
+ D
(p +k)D
(p)A
(q k)A
(q)
1
qk
q
+ ck
D
(p +k)D
(p)A
(q k)A
(q)
1
p+k
p
ck
+ D
(p k)D
(p)A
(q +k)A
(q)
1
pk
p
+ ck
+ . . .
_
=
e
2
2
c
4(2)
3
_
dkdpdq
k
(k)
_
D
(p k)D
(p)A
(q +k)A
(q)
1
q+k
q
ck
+ D
(p k)D
(p)A
(q +k)A
(q)
1
q+k
q
+ ck
D
(p k)D
(p)A
(q +k)A
(q)
1
pk
p
ck
9.2. S-OPERATOR IN QED 305
+ D
(p k)D
(p)A
(q +k)A
(q)
1
pk
p
+ ck
+ . . .
_
=
e
2
2
c
2
2(2)
3
_
dkdpdq
(k)
( q +
k q)
2
(p k)A
(q +k)D
(p)A
(q)
e
2
2
c
2
2(2)
3
_
dkdpdq
(k)
(
P
K
P)
2
(p k)A
(q +k)D
(p)A
(q) (9.21)
where we denoted
12
( p q)
2
(
p
q
)
2
c
2
(p q)
2
(9.22)
(
P
Q)
2
(
p
q
)
2
c
2
(p q)
2
(9.23)
Next take into account that we need to know our S-operator only in the
vicinity of the energy shell where
pk
p
=
q
q+k
(
P
K
P)
2
= ( q +
k q)
2
(9.24)
Also use notation (J.62) - (J.63) in which
A
(q +k)
A(q) =
mc
2
q+k
(q +k, ; q,
)a
q+k
a
q
D
(p k)
(p) =
Mc
2
_
pk
(p k, ; p,
)d
pk
d
p
and equation (K.14). Then
1
2
[V
1
, V
1
]
12
( p q)
2
is a 4-square of the dierence between 4-vectors p and q, i.e., ( p q)
2
=
( p q)
( p q)
q
)
2
c
2
k
2
.
306 CHAPTER 9. QUANTUM ELECTRODYNAMICS
=
e
2
2
c
2
(2)
3
Mmc
4
q+k
q
_
pk
_
dkdpdq
h
(k)U
(q +k, ; q,
)W
(p k, ; p,
)
( q +
k q)
2
d
pk
d
p
a
q+k
a
q
=
e
2
2
c
2
(2)
3
Mmc
4
q+k
q
_
pk
_
dkdpdq
_
(U(q +k, ; q,
) W(p k, ; p,
)
( q +
k q)
2
(k U(q +k, ; q,
))(k W(p k, ; p,
)
k
2
( q +
k q)
2
_
d
pk
d
p
a
q+k
a
q
Combining this expression with the term D
DA in V
2
,
13
we see that
operator F
2
in (9.18) can be written in the form
F
2
=
e
2
2
c
2
(2)
3
Mmc
4
_
q+k
pk
_
dkdpdq
_
U(q +k, ; q,
) W(p k, ; p,
)
( q +
k q)
2
+
(k U(q +k, ; q,
))(k W(p k, ; p,
))
k
2
( q +
k q)
2
U
0
(q +k, ; q,
)W
0
(p k, ; p,
)
c
2
k
2
_
d
pk
d
p
a
q+k
a
q
(9.25)
From (J.83) we obtain
(k W(p k, ; p,
)) =
pk
p
c
W
0
(p k, ; p,
)
=
q+k
q
c
W
0
(p k, ; p,
)
(k U(q +k, ; q,
)) =
q+k
q
c
U
0
(q +k, ; q,
)
and
13
the third term in equation (L.11)
9.2. S-OPERATOR IN QED 307
(k U)(k W)
k
2
( q +
k q)
2
U
0
W
0
c
2
k
2
=
(
q+k
q
)
2
U
0
W
0
c
2
k
2
( q +
k q)
2
[(
q+k
q
)
2
c
2
k
2
]U
0
W
0
c
2
k
2
( q +
k q)
2
=
U
0
W
0
( q +
k q)
2
so that our nal expression for the F-operator is
F
2
=
e
2
2
c
2
(2)
3
Mmc
4
_
q+k
pk
_
dkdpdq
U
0
(q +k, ; q,
)W
0
(p k, ; p,
) (U(q +k, ; q,
) W(p k, ; p,
)
( q +
k q)
2
pk
d
p
a
q+k
a
q
=
e
2
2
c
2
(2)
3
Mmc
4
_
q+k
pk
_
dkdpdq
U
(q +k, ; q,
)W
(p k, ; p,
)
( q +
k q)
2
d
pk
d
p
a
q+k
a
q
(9.26)
Now we insert this result in formula (9.19) for the S-operator. According to
(8.66), in order to perform the integration on t from to , we just need
to multiply the coecient function by the factor 2i(E(p, q, k)), where
E(p, q, k) =
q+k
q
+
pk
p
is the energy function. This makes the S-operator non-trivial only on the
energy shell E(p, q, k) = 0. Finally, we can represent the 2nd order scattering
operator in the general form (8.50)
S
2
[d
da] =
,,
_
dpdqdp
dq
s
2
(p, q, p
, q
; , ,
)d
p,
a
q,
d
p
,
a
q
(9.27)
308 CHAPTER 9. QUANTUM ELECTRODYNAMICS
with the coecient function
s
2
(p, q, p
, q
; , ,
)
=
ie
2
c
2
mMc
4
4
( p + q p
)U
(q, ; q
)W
(p, ; p
)
4
2
q
p
p
( q q
)
2
(9.28)
where the 4-dimensional delta function
14
4
( p + q p
) (p +q p
)(
p
+
q
p
q
) (9.29)
guarantees the conservation of both momentum and energy in the collision
process.
Note that s
2
(p, q, p
, q
; , ,
da]d
[0
=
,,
0[a
q,
d
p,
_
dsdtds
dt
s,
a
t,
s
2
(s, t, s
, t
; , ,
)
d
s
,
a
t
,
d
[0
=
,,
_
dsdtds
dt
s
2
(s, t, s
, t
; , ,
)
(s p)
,
(t q)
,
(s
,
(t
= s
2
(p, q, p
, q
; , ,
) (9.30)
9.2.2 Lorentz invariance of the S-operator
In this subsection we would like to verify the Lorentz invariance of the S-
operator calculated above. More specically, in accordance with (7.7) we
would like to check that
e
ic
K
0
S
2
[d
da]e
ic
K
0
= S
2
[d
da]
14
see Appendix M.1
9.2. S-OPERATOR IN QED 309
On the left hand side of this equality we apply boost operators to particle
operators and use formulas (8.36) - (8.37)
,,
_
dpdqdp
dq
s
2
(p, q, p
, q
; , ,
)e
ic
K
0
p,
a
q,
d
p
,
a
q
,
e
ic
K
0
,,
_
dpdqdp
dq
s
2
(p, q, p
, q
; , ,
)
_
(D
1/2
)
W
(p, ))d
p,
(D
1/2
)
W
(q, ))a
q,
p
p
D
1/2
W
(p
, ))d
p
q
q
D
1/2
W
(q
, ))a
q
,
where
q
=
_
m
2
c
4
+ q
2
c
2
, q is the boost transformation of the electrons
momentum
15
and
W
(q, ) is the corresponding Wigner angle (5.16). The
analogous proton-related quantities
q
, q and
W
(q, ) are obtained by
replacing the electron mass m with the proton mass M. Changing summation
and integration variables and denoting R(p, ) and R(p, ) the results of
Wigner rotations on the electron and proton spin components, respectively,
we obtain
16
=
,,
_
d(
1
p)d(
1
q)d(
1
p
)d(
1
q
)
_
1
p
1
q
1
p
q
1
q
1
p
1
q
1
p
1
q
s
2
(
1
p,
1
q,
1
p
,
1
q
; R
1
(p, ), R
1
(q, ), R(p
, )
, R(q
, )
)
d
p,
a
q,
d
p
,
a
q
,,
_
dpdqdp
dq
1
p
1
q
1
p
1
q
p
q
s
2
(
1
p,
1
q,
1
p
,
1
q
; R
1
(p, ), R
1
(q, ), R(p
, )
, R(q
, )
)
d
p,
a
q,
d
p
,
a
q
Comparing this result with (9.27) we conclude that the Lorentz invariance
15
see formula (5.14)
16
We also used equation (5.25) here.
310 CHAPTER 9. QUANTUM ELECTRODYNAMICS
condition will be satised if the coecient function can be written in the
form
s
2
(p, q, p
, q
; , ,
)
Mmc
4
_
p
q
o
2
(p, q, p
, q
; , ,
)
and the new function o
2
satises a simpler invariance condition
o
2
(p, q, p
, q
; , ,
)
= o
2
(
1
p,
1
q,
1
p
,
1
q
; R
1
(p, ), R
1
(q, ), R(p
, )
, R(q
, )
)
(9.31)
In our case (9.28)
o
2
=
ie
2
c
2
4
2
4
( p + q p
)
U
(q, ; q
)W
(p, ; p
)
( q q
)
2
As shown in Appendix J.8, the quantities U
and W
transform as 4-vectors
under the change of arguments indicated on the right hand side of (9.31). So
the 4-product U
)
2
is invariant as
well. This proves Lorentz invariance of the 2nd order contribution (9.28) to
the S-operator.
9.2.3 S
2
in Feynman-Dyson perturbation theory
The relativistic invariance of the S
2
operator looks almost accidental in our
approach. Indeed, we have used the interacting Hamiltonian V
1
+ V
2
, which
did not have simple transformation properties with respect to boosts. We
have also used a non-covariant form (K.14) of the matrix h
(x, 0)A
(x, 0)
=
_
dx(e(x, 0)
(x, 0) + e(x, 0)
(x, 0))A
(x, 0)
(9.32)
2. In calculations involving photon elds
18
one should use the covariant
expression (g
, q
; , ,
)
= 0[a
q,
d
p,
S
2
d
[0
= 0[a
q,
d
p,
_
_
1
2!
2
+
_
dt
1
dt
2
T[V
1
(t
1
)V
1
(t
2
)]
_
_
d
[0
=
1
2!
2
c
2
_
d
4
x
1
d
4
x
2
0[a
q,
d
p,
T[(J
( x
1
)A
( x
1
) +
( x
1
)A
( x
1
))
(J
( x
2
)A
( x
2
) +
( x
2
)A
( x
2
))]d
[0
=
1
2!
2
c
2
_
d
4
x
1
d
4
x
2
0[a
q,
d
p,
_
T[J
( x
1
)A
( x
1
)
( x
2
)A
( x
2
)]
17
So, we will call V
1
the Feynman-Dyson interaction operator to distinguish it from the
Hamiltonian interaction operator V
1
+V
2
.
18
commutators (K.12) and photon propagators (K.16)
312 CHAPTER 9. QUANTUM ELECTRODYNAMICS
+T[
( x
1
)A
( x
1
)J
( x
2
)A
( x
2
)]
_
d
[0
=
e
2
2
_
d
4
x
1
d
4
x
2
0[a
q,
d
p,
T[( x
1
)
( x
1
)A
( x
1
)( x
2
)
( x
2
)A
( x
2
)]d
[0
(9.33)
If the operator sandwiched between vacuum vectors 0[ . . . [0 is converted
to the normal order, then all its terms will not contribute to the matrix ele-
ment, except the c-number term. In order to provide such a c-number term
the operator under the T-symbol should have the structure d
da. From
expressions (J.26) and (J.29) for quantum elds and we conclude that
operator d
mc
2
(2)
3
q
exp
_
i
q x
i
_
u
(q, ) (9.34)
After these routine manipulations the coecient function takes the form
19
s
2
(p, q, p
, q
; ,
, ,
_
d
4
x
1
d
4
x
2
e
2
mMc
4
2
(2)
6
_
q
p
exp
_
i
q x
1
_
exp
_
x
1
_
exp
_
i
p x
2
_
exp
_
x
2
_
19
where the matrix element 0[T[A
( x
1
)A
( x
2
)][0 (the photon propagator) was taken
from (K.17)
9.2. S-OPERATOR IN QED 313
u(q, )
u(q
)w(p, )
w(p
)0[T[A
( x
1
)A
( x
2
)][0
=
1
2i
_
d
4
x
1
d
4
x
2
d
4
s
e
2
c
2
mMc
4
(2)
9
_
q
p
exp
_
i
( q q
s) x
1
_
exp
_
i
( p p
+ s) x
2
_
u
a
(q, )
ab
u
b
(q
)
g
s
2
w
c
(p, )
cd
w
d
(p
)
=
ie
2
c
2
mMc
4
4
2
q
p
4
( p + q p
)
u
a
(q, )
ab
u
b
(q
)
g
( q q
)
2
w
c
(p, )
cd
w
d
(p
) (9.35)
=
ie
2
c
2
mMc
4
4
2
q
p
4
( p + q p
)
U
(q, ; q
)W
(p, ; p
)
( q q
)
2
(9.36)
which, as expected, is exactly the same as in the non-covariant approach
(9.28).
Using results from Appendix J.9 and assuming that the proton is very heavy
(M ), we obtain its coecient function in the (v/c)
2
approximation
20
s
2
(p, q, p
, q
; ,
, ,
ie
2
(2)
2
mc
2
1
k
2
U
0
(q +k, ; q,
ie
2
(2)
2
1
k
2
_
1
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
_
_
1 +
(2q +k)
2
8m
2
c
2
+
i
el
[k q]
4m
2
c
2
_
ie
2
,
,
(2)
2
1
k
2
_
1
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
+
q
2
2m
2
c
2
+
qk
2m
2
c
2
+
k
2
8m
2
c
2
_
+
ie
2
(2)
2
1
k
2
i
el
[k q]
4m
2
c
2
=
ie
2
,
,
(2)
2
_
1
k
2
1
8m
2
c
2
_
4m
2
c
el
[k q]
k
2
20
We omit the energy delta function and introduce the vector of transferred momentum
k = q
q = p p
. Here e
2
/(4c) is the ne structure constant.
314 CHAPTER 9. QUANTUM ELECTRODYNAMICS
q,
q,
kk
aa bb
cc dd
p,
p,
Figure 9.1: Feynman diagram for the electron-proton scattering in the 2nd
perturbation order.
In the extreme non-relativistic approximation for the S-operator, we can
ignore terms with c in denominators and obtain
S
2
[d
da] =
ie
2
(2)
2
_
dpdqdk
(E(p, q, k))
k
2
d
pk,
d
p,
a
q+k,
a
q,
(9.37)
This is consistent with our toy model (8.103). The dierence in sign is related
to the fact that equation (8.103) describes scattering of two electrons having
the same charge and, therefore, repelling each other, while equation (9.37)
refers to a Coulomb-type attractive electron-proton potential.
21
9.2.4 Feynman diagrams
Expression (9.35) for the scattering amplitude can be conveniently repre-
sented by the Feynman diagram shown in Fig. 9.1
22
Here we would like to
formulate general rules for drawing and interpreting Feynman diagrams in
QED. They are called Feynman rules.
21
see subsection 8.3.5
22
Note that in spite of some similarities, Feynman diagrams are fundamentally dierent
from diagrams considered in sections 8.3 and 8.4. Feynman diagrams describe expressions
obtained in the Feynman-Dyson perturbation theory, while diagrams from sections 8.3
and 8.4 are for the use in the old-fashioned (non-covariant) perturbation theory.
9.2. S-OPERATOR IN QED 315
The initial state of the electron with momentum q and spin component
is represented by the factor
23
mc
2
(2)
3
q
u
a
(q, ) (9.38)
in (9.35) and by a thin incoming arrow in the diagram 9.1. Similarly, the
nal state of the electron is represented by the factor
mc
2
(2)
3
u
b
(q
) (9.39)
in (9.35) and by a thin outgoing arrow in the diagram 9.1. The incoming
and outgoing proton lines are described by thick arrows in the diagram and
by factors
Mc
2
(2)
3
p
w
c
(p, ),
Mc
2
(2)
3
w
d
(p
) (9.40)
respectively. Similarly incoming and outgoing external photon lines (not
present in the graph 9.1) correspond to factors
c
_
2p(2)
3
e
(p, ),
c
_
2p(2)
3
e
(p, ) (9.41)
respectively.
An internal photon line carrying 4-momentum
k corresponds to the pho-
ton propagator in (9.35)
24
2
c
2
g
2i(2)
3
k
2
(9.42)
23
This is expression as (9.34) without the exponential factor. Exponential factors coming
from dierent sources will be collected and analyzed together later.
24
Compare with (K.17). Here we omit the integration sign and the exponential factor,
which will be tackled later.
316 CHAPTER 9. QUANTUM ELECTRODYNAMICS
An internal electron line
25
connects two vertices and corresponds to the elec-
tron propagator
26
(,k + mc
2
)
bc
2i(2)
3
(
k
2
m
2
c
4
)
(9.43)
Similarly, an internal proton line is associated with the factor
(,k + Mc
2
)
bc
2i(2)
3
(
k
2
M
2
c
4
)
(9.44)
The number of vertices (1) in the graph is the same as the order of
perturbation theory (=2 in our case). Each vertex is associated with the
factor
i(2)
4
e
ab
This factor has three summation indices. Two bispinor indices a and b are
coupled to indices of fermion factors (9.38) - (9.40) or (9.43) - (9.44). Thus,
in the diagram, each vertex is connected to two fermion lines (either external
or internal). The 4-vector index is coupled to indices of either free pho-
ton factor (9.41) or photon propagator (9.42). Correspondingly, each vertex
connects to one photon line (either external or internal). So, we conclude
that each contribution to the QED S-matrix can be represented simply by
drawing a connected
27
diagram, whose edges and vertices respect the above
connectivity rules.
Let us now discuss exponential factors and integrations. Each interaction
vertex is associated with a 4-integral
_
d
4
x. Each incoming external particle
line (attached to the vertex with integration variable x
) is associated with
exponential factor exp(
i
p x
p ( x x
)).
So, the full exponential factor that depends on x is exp(
i
( p
1
+ p
2
+ p
3
) x),
where p
i
are 4-momenta (with appropriate signs) of the three lines attached
to the vertex. Integrating on d
4
x we obtain the 4-momentum -function
(2)
4
4
( p
1
+ p
2
+ p
3
), which expresses conservation of the 4-momentum
28
at the interaction vertex. This conservation rule helps us to assign 4-
momentum labels to all lines: The external lines correspond to real observable
particles, so their energy-momenta are considered given (p
0
, p) and (p
0
, p
)
and they are always on the mass shell. I.e., they satisfy conditions
p
0
=
p
=
_
m
2
c
4
+ p
2
c
2
(9.45)
p
0
=
p
=
_
m
2
c
4
+ (p
)
2
c
2
(9.46)
We can arbitrarily assign directions of the momentum ow through inter-
nal lines. For each independent loop we should also introduce an additional
4-momentum, which, as we will see, is a dummy integration variable. Then,
following the above conservation rule, we can assign a unique 4-momentum
label to each line in the diagram.
In addition to d
4
x integrations discussed above we also have 4-momentum
integrals associated with each external line and each internal line (propaga-
tor). As we discussed in subsection 8.4.1, these integrations are sucient to
kill all 4-momentum delta functions, leaving just one delta function express-
ing the conservation of the overall momentum and energy in the scattering
process. The number of integrals left is equal to the number of independent
loops in the diagram.
To summarize, we have the following rules for writing 1-order matrix
element of the scattering operator
1. Draw a Feynman diagram with 1 vertices, J internal lines and / =
J 1 + 1 independent loops. Each vertex in the diagram should be
connected to two electron (or proton) lines (either external or internal)
and to one photon line (either external or internal). External incoming
(outgoing) lines correspond to initial (nal) conguration of particles
28
We put the word conservation in quotes, because the 4-momenta of virtual parti-
cles involved here are just integration variables. They cannot be measured and, therefore,
unphysical. See next subsection.
318 CHAPTER 9. QUANTUM ELECTRODYNAMICS
in the considered scattering event. Momenta and spins of particles in
these asymptotic states are assumed to be given.
2. Assign arbitrary 4-momentum labels to / internal loop lines.
3. Following the 4-momentum conservation rule at each vertex, assign
4-momentum labels to all remaining internal lines.
4. The integrand is now obtained by putting together numerical factors
corresponding to all lines and vertices in the diagram as shown in Table
9.1.
5. Integrate the obtained expression on all loop 4-momenta.
6. Multiply by 1/1! (which comes from formula (7.17)) and by an appro-
priate symmetry factor.
29
7. Multiply by a 4D delta function that expresses the conservation of the
total energy-momentum in the scattering process.
9.2.5 Virtual particles?
As we mentioned above, external lines of Feynman diagrams represent asymp-
totic states of real particles. The momentum and spin labels attached to these
lines directly correspond to the values of observables that can be measured
in these states.
In some QFT textbooks one can also read about interpretation, according
to which internal lines in Feynman diagrams correspond to so-called virtual
particles. Thus, internal photon lines correspond to virtual photons and
internal electron (positron) lines correspond to virtual electrons (positrons).
Clearly, energy-momentum labels attached to internal lines do not satisfy
the basic energy-momentum relationships (9.45) - (9.46) characteristic for
29
For example, in our calculation (9.33) the symmetry factor of 2 is needed due to
the appearance of two identical expressions under the t-ordering sign. This symmetry
factor canceled the 1/2! multiplier exactly. In all S-matrix calculations in this book such
cancelation of the symmetry factor and the 1/1! multiplier also occurs. For more complex
cases this may not be true.
9.2. S-OPERATOR IN QED 319
Table 9.1: The correspondence between elements in a Feynman diagram
and factors in the Feynman-Dyson perturbative expression for the scattering
amplitude.
Diagram element Factor Physical interpretation
incoming electron line
_
mc
2
(2)
3
p
u
a
(p, ) electron in the state [p,
at t =
outgoing electron line
_
mc
2
(2)
3
p
u
a
(p, ) electron in the state [p,
at t = +
incoming photon line
(2)
3
2p
e
(2)
3
2p
e
interaction vertex
i(2)
4
e
ab
no interpretation
real particles.
30
This is usually described as virtual particles being out of
the mass shell. In spite of their strange properties, virtual particles are
often regarded as real objects and their exchange is claimed to be the
origin of interactions. For example, diagram 9.1 is interpreted as a depiction
of a process in which a virtual photon is exchanged between electron and
proton, thus leading to their attraction.
These interpretations are not supported by evidence. They are rather
misleading. In fact, Feynman diagrams are not depictions of particle tra-
jectories or real events. Lines and vertices in Feynman diagrams are simply
graphical representations for certain factors in integrals for scattering am-
30
In the example shown in diagram 9.1, the momentum of the virtual photon is qq
[ =
q
q
is not satised.
320 CHAPTER 9. QUANTUM ELECTRODYNAMICS
plitudes. Quantum theory does not provide any mechanistic description of
interactions (like exchange of virtual particles). The only reliable informa-
tion about interactions is contained in the interaction Hamiltonian, which
does not suggest that some invisible virtual particles are emitted, absorbed
and exchanged during particle collisions.
Chapter 10
RENORMALIZATION
There is no great thing that would not be surmounted by a still
greater thing. There is no thing so small that no smaller thing
could t into it.
Kozma Prutkov
All 2nd and 4th order Feynman diagrams that are relevant to the electron-
proton scattering are shown in Fig. 10.1.
1
The tree diagram 10.1(a)
2
is the
only one in the 2nd perturbation order. In section 9.2 we calculated the
corresponding scattering amplitude (9.35), which agreed with experiments
very well. Similarly, one can obtain rather accurate 2nd order results for
other scattering events, such as Compton scattering or electron-positron an-
nihilation. In general, all amplitudes whose Feynman diagrams are tree-like
(i.e., do not contain loops), come out fairly accurate. However, in higher
perturbation orders of QED the appearance of loops is inevitable, and this
leads to serious problems.
4th order diagrams 10.1(b-g) contain troublesome loops,
3
and there are
1
Diagrams 10.1(h-k) are obtained from renormalization counterterms that will be dis-
cussed in section 10.2.
2
This is the same diagram as 9.1.
3
Here we show only diagrams, in which loops are associated with electron lines. For a
complete treatment, one should also take into account loops, similar to 10.1(b), (c), (g),
but associated with proton lines. However, it can be shown that their contributions to
scattering amplitudes is much smaller due to the inequality m M. So, we will omit
321
322 CHAPTER 10. RENORMALIZATION
two types of problems associated with them. First, it can be shown
4
that loop
integrals in diagrams 10.1(b-c) and (e-g) diverge due to singularity at zero
loop momentum. These are infrared divergences [Wei95] whose cancelation
will be discussed in Chapter 13.6.
Another problem is the divergence of loop integrals 10.1(b-e) at high loop
momenta.
5
These are also known as ultraviolet divergences. The way to x
this problem is provided by the renormalization theory developed by Tomon-
aga, Schwinger and Feynman in the late 1940s. Basically, this approach says
that the QED interaction operator (9.13) - (9.14)
V = V
1
+ V
2
(10.1)
is not complete. It must be corrected by adding certain counterterms. The
counterterms are formally innite operators. However, if they are carefully
selected, then one can cancel the innities occurring in loop integrals, so that
only some residual nite contributions (radiative corrections ) remain in each
perturbation order. These small radiative corrections are exactly what is
needed to obtain scattering cross sections, energies of bound states and some
other properties in remarkable agreement with experiments. The procedure
of adding counterterms to the interaction operator is called renormalization.
10.1 Renormalization conditions
In this section we will be interested in rather general physical principles that
underlie the renormalization approach. We will summarize these principles
in the form of two renormalization conditions. We will call them the no-self-
scattering condition and the charge renormalization condition.
10.1.1 Regularization
As we mentioned above, loop integrals in QED are ultraviolet-divergent
and/or infrared-divergent and it is dicult to do calculations with innite
quantities. To make things easier, it is convenient to perform regularization.
them in our analysis.
4
see Appendix M
5
see subsection 8.4.3
10.1. RENORMALIZATION CONDITIONS 323
(a)
(b)
(c)
(i)
(j)
(k)
(d) (e)
(h)
(f) (g)
qq
q
hh
qh
qh
kk
pp
p
qq
q
pp
p
kk
qq
q
pp
p
qh p+h
hh
h+k
qq
q p
pp
qh
hh
h+k
ph
qq
q p
pp
kk kk
hh
kh
Figure 10.1: Feynman diagrams for the electron-proton scattering up to the
4th perturbation order. Thick full lines - protons, thin full lines - electrons,
dotted lines - photons.
324 CHAPTER 10. RENORMALIZATION
The idea is to modify the theory by hand in such a way that all loop integrals
are forced to be nite, so that intermediate manipulations do not involve in-
nities. The simplest regularization approach adopted in Appendix M is to
introduce momentum cutos in all loop integrals. The cutos depend on two
parameters having the dimensionality of mass: the ultraviolet cuto limits
integrals at high loop momenta and infrared cuto controls integrations
at low momenta singularities. Of course, the theory with such truncated
integrals cannot be exact. To obtain nal results, at the end of calculations
the ultraviolet cuto momentum should be set to innity and the
infrared cuto should be set to zero 0.
6
If counterterms in the Hamilto-
nian are chosen correctly, then resulting S-matrix elements should be nite,
accurate and cuto-independent.
10.1.2 No-self-scattering renormalization condition
First we would like to note that the divergence of loop integrals is not the
biggest problem that we face in QED. Even if all loop integrals were conver-
gent, the S-operator might still contain nasty innities. Let us now consider
in more detail how these innities appear and what we can do about them.
Recall that QED interaction operator (10.1) has unphys, phys and renorm
terms. So the corresponding scattering phase operator F
7
must contain terms
of the same types. Then we can write the S-operator as
S = exp( F
..
) = exp(F
unp
..
+ F
ph
..
+F
ren
..
)
= exp(F
ph
..
+F
ren
..
) (10.2)
where we noticed that unphys terms in F do not contribute to the S-operator
due to equation (8.64). Let us now apply scattering operator (10.2) to an one-
electron state a
p,
[0. It follows from Lemma 8.2 that phys operators yield
zero when acting on one-particle states. Renorm operators do not change
the number of particles. Therefore, we can write
Sa
p,
[0
6
In this chapter we will keep non-zero. The limit 0 and associated infrared
divergences will be discussed in chapter 13.6.
7
see equation (9.19)
10.1. RENORMALIZATION CONDITIONS 325
= exp(F
ph
..
+F
ren
..
)a
p,
[0
=
_
1 + F
ph
..
+F
ren
..
+
1
2!
(F
ph
..
+F
ren
..
)
2
+ . . .
_
a
p,
[0
=
_
1 + F
ph
..
+F
ren
..
+
1
2!
(F
ph
..
)
2
+
1
2!
F
ph
..
F
ren
..
+
1
2!
F
ren
..
F
ph
..
+
1
2!
(F
ren
..
)
2
+ . . .
_
a
p,
[0
=
_
1 + F
ren
..
+
1
2!
(F
ren
..
)
2
+ . . .
_
a
p,
[0
= exp(F
ren
..
)a
p,
[0 (10.3)
A similar derivation can be performed for a one-photon state c
p,
[0 and the
vacuum vector
Sc
p,
[0 = exp(F
ren
..
)c
p,
[0 (10.4)
S[0 = exp(F
ren
..
)[0 (10.5)
So, the scattering in these states depends only on the renorm part of F.
We know from (8.63) that the t-integral F
ren
..
is innite, even if F
ren
is itself
nite. Therefore, if F
ren
,= 0 then the S-operator multiplies 0- and 1-particle
states by innite phase factors. This divergence is unacceptable.
Intuitively, we expect single particle states and the vacuum to evolve
freely during the entire time interval from t = to t = +. This means
that there can be no non-trivial scattering in such states. This also means
that the S-operator must be equal to the identity operator when acting on
such states. But this condition is not satised in the QED theory presented
so far.
We have two options to deal with this problem. One option is to claim (as
advised in many textbooks) that [0 is not the true (physical) vacuum state
of the system and that a
p,
[0, c
p,
[0 are not true (physical) one-particle
states. These are examples of the so-called bare states.
8
The real physical
0-particle and 1-particle states should be obtained as linear combinations of
the bare states for which the self-scattering is absent. Then scattering theory
of such physical particles would not have divergences and paradoxes.
8
Some say that the bare electron is surrounded by a cloud of virtual photons and
particle-antiparticle pairs.
326 CHAPTER 10. RENORMALIZATION
In this book we will adopt a dierent
9
point of view. We will maintain
that [0, a
p,
[0, c
p,
[0, . . . are true representatives of real physical 0-particle
and 1-particle states. Then our explanation for the divergent results (10.3) -
(10.5) is that we have started to develop our theory from a wrong interaction
operator (10.1). We insist that this operator must be modied or renormal-
ized, so that the theory is forced to be nite. In particular, we will demand
that the new renormalized interaction satises the condition
F
ren
= 0 (10.6)
This implies that operator F
..
is purely phys
F
..
= F
ph
..
If this condition is satised then (10.3) - (10.5) take physically acceptable
forms
Sa
p,
[0 = a
p,
[0
Sc
p,
[0 = c
p,
[0
S[0 = [0
Taking into account the perturbation expansion S = 1 + S
2
+ S
3
+ . . ., we
can also write in each perturbation order
S
n
a
p,
[0 = 0 (10.7)
S
n
c
p,
[0 = 0 (10.8)
S
n
[0 = 0 (10.9)
where n = 2, 3, . . .. And for elements of the S-matrix we have
0[S
n
[0 = 0 (10.10)
0[a
p,
S
n
a
[0 = 0 (10.11)
0[c
p,
S
n
c
[0 = 0 (10.12)
The above conditions can be summarized as the following
9
but, in some respect, equivalent
10.1. RENORMALIZATION CONDITIONS 327
Statement 10.1 (no-self-scattering renormalization condition) There
should be no (self-)scattering in the vacuum and one-particle states.
10
The physical interpretation is obvious: scattering is expected to occur only
when there are at least two particles which interact with each other. One
particle has nothing to collide with, so it cannot experience scattering. Sim-
ilarly, no scattering can happen in the no-particle vacuum state.
10.1.3 Charge renormalization condition
The no-self-scattering renormalization condition sets strict requirements (10.10)
- (10.12) for matrix elements of the S-operator between 0-particle and 1-
particle states. However, this condition alone is not sucient to guarantee
cancelation of ultraviolet divergences in scattering calculations. On physical
grounds we can derive another necessary renormalization condition.
Recall that the 2nd order electron-proton scattering amplitude (9.35) has
a singularity e
2
/
k
2
at zero transferred momentum
k = q
q = 0. As
shown in subsection 8.3.5, in the position space this singularity gets Fourier-
transformed into the long-range Coulomb potential e
2
/(4r). From clas-
sical physics and experiment we also know that the Coulomb potential is a
very accurate description of the interaction of charges at large distances and
low energies. We are now raising this observation to the level of fundamental
physical principle
Postulate 10.2 (charge renormalization condition) Scattering of charged
particles at large distances and low energies is described exactly by the 2nd
order term S
2
in the S-operator. All higher order contributions to the low-
energy scattering should vanish.
Mathematically, the charge renormalization condition implies that in orders
higher than 2nd, scattering amplitudes for charged particles should not be
singular at low values of the transfer momentum
k. Suppose that this is
not true and that the 4th order electron-proton scattering amplitude has a
singularity e
4
/
k
2
. Then the long-range electron-proton potential would
obtain the unacceptable form
10
Note that our no-self-scattering condition is actually equivalent to the more traditional
mass renormalization condition. For example, in section 10.2 we will see that our 2nd
order renormalization counterterms are exactly the same as electron and photon self-energy
counterterms in textbook QED.
328 CHAPTER 10. RENORMALIZATION
V (r)
e
2
+ Ce
4
4r
with a non-zero constant C. From experiments we know that C = 0 to a
high degree of precision.
10.1.4 Renormalization in Feynman-Dyson theory
In the next section we will see that the no-self-scattering and charge renor-
malization conditions are not satised in QED with interaction Hamiltonian
(10.1). This can be seen already from the fact that interaction operator V
1
(9.13) contains unphys terms like a
a + a
1
2
[V
1
, V
1
] + . . .
in violation of the condition (10.6). We will see that interaction (10.1) vio-
lates the charge renormalization condition as well. These two problems are
very serious even though they are not directly related to divergences in loop
diagrams. The presence of such divergences is just another argument that
QED interaction V
1
+ V
2
must be modied.
The idea of the renormalization approach is to switch from the interaction
Hamiltonian V
1
+V
2
to another interaction V
c
by addition of renormalization
counterterms, which will be denoted collectively by Q
V
c
= V
1
+ V
2
+ Q (10.13)
The form of Q must be chosen such that the no-self-scattering and charge
renormalization conditions are satised. In particular, the new scattering
phase operator
F
c
= V
c
1
2
[V
c
, V
c
] + . . . (10.14)
11
see Table 8.2
10.2. COUNTERTERMS 329
should not contain renorm terms. Moreover, high-order contributions F
c
n
(n >
2) to the scattering of charged particles should be nonsingular at
k = 0.
Unfortunately, the program outlined above is dicult to implement. The
reason is that scattering calculations with the QED Hamiltonian (10.1) are
rather cumbersome. As we discussed in subsection 9.2.3, it is much easier
to use the Feynman-Dyson approach, in which the 2nd order interaction op-
erator V
2
is omitted and the momentum space photon propagator is taken
simply as g
/ p
2
. This is the standard way to calculate the S-matrix
in QED and we will adopt this approach in the present chapter. The gen-
eral idea of renormalization remains the same. We are looking for certain
counterterms Q
FD
that can be added to the original interaction operator
V
1
(t) = e
_
dx( x)
( x)A
( x) (10.15)
to obtain renormalized interaction
V
c
FD
= V
1
+ Q
FD
(10.16)
with which the Feynman-Dyson perturbation expansion (7.17)
S
c
= 1
i
+
_
dt
1
V
c
FD
(t
1
)
1
2!
2
+
_
dt
1
dt
2
T[V
c
FD
(t
1
)V
c
FD
(t
2
)] . . . (10.17)
becomes nite and our two renormalization conditions are satised.
10.2 Counterterms
Next we would like to see how the program outlined above works in practice.
In this section we are going to derive explicit expressions for counterterms
Q
FD
in the 2nd and 3rd order.
10.2.1 Electron self-scattering
Let us rst discuss the electronelectron scattering and see exact how con-
dition (10.11) is violated.
330 CHAPTER 10. RENORMALIZATION
p,
p','
a b
c d
k
p, p','
(a)
(b)
Figure 10.2: Feynman diagrams for the scattering electron electron in the
2nd perturbation order.
There are only two connected diagrams that contribute to the electronelectron
scattering in the 2nd order. They are shown in Fig. 10.2. Here we will inves-
tigate the eect of 10.2(a). The other diagram 10.2(b) can be analyzed in a
similar manner, and this analysis is left as an exercise for interested readers.
Applying Feynman rules from Table 9.1 to 10.2(a), we obtain
0[a
p,
S
FD
2
a
[0
=
me
2
c
4
4
( p p
)
(2i)
2
(2)
p
u
a
(p, )
__
d
4
k
ab
(,p ,k + mc
2
)
bc
( p
k)
2
m
2
c
4
k
2
cd
_
u
b
(p
)
(10.18)
=
me
2
c
4
4
( p p
)
(2)
2
(2)
p
u
a
(p, )
_
C
0
ab
+ C
1
(,p mc
2
)
ab
+ R
ab
(,p)
_
u
b
(p,
)
(10.19)
where (divergent) constants C
0
and C
1
are calculated in (M.30) and (M.31),
respectively.
12
The nite quantity R(,p) includes terms quadratic, cubic and
higher order in ,p mc
2
R(,p) = C
2
(,p mc
2
)
2
+ C
3
(,p mc
2
)
3
+ . . . (10.20)
By stripping away factors corresponding to external electron lines and
the delta function in (10.19), we obtain the contribution from the loop itself
and two attached vertices
12
Quantities C
0
and R have the dimension of mc
1
and C
1
= c
3
. Therefore,
the dimension of (10.19) is p
3
, as expected from (O.1).
10.2. COUNTERTERMS 331
T
loop
(,p) =
2
e
2
c
2
_
C
0
+ C
1
(,p mc
2
) + R(,p)
_
(10.21)
This non-vanishing result contradicts the no-self-scattering condition (10.11).
Moreover, this expression is clearly innite in the limit . So, we are
dealing with an ultraviolet divergence here.
Now let us consider an arbitrary Feynman diagram, which can contain
electron-photon loops in external and/or internal lines. For loops in external
electron lines, the 4-momentum p is on the mass shell,
13
and only the constant
term in (10.21) survives
14
T
loop
(,p = mc
2
) =
2
e
2
c
2
C
0
= 3ie
2
2
mc
_
2 ln
m
+
1
2
_
(10.22)
For loops in internal electron lines, the 4-momentum p is not necessarily on
the mass shell, so the full factor (10.21) should be taken into account.
10.2.2 Electron self-energy counterterm
From subsection 10.1.4 we know that the above divergences should be can-
celed by a 2nd order counterterm. Of course, that counterterm cannot be
chosen arbitrarily. It becomes a part of the interaction operator, so it should
obey the conditions formulated for such operators in subsection 9.1.1. In
particular, our addition of the counterterm should not aect the relativistic
invariance of the theory, so the condition (9.7) is essential. Taking these
considerations into account, let us choose the following electron self-energy
counterterm
Q
FD
2el
(t) = m
2
_
dx( x)( x) + (Z
2
1)
2
_
dx( x)(ic
+ mc
2
)( x)
(10.23)
where the 4-gradient
is dened as
13
This is true for loops in diagrams 10.1(b-c) and also for the diagram 10.2.
14
Here we formally write the mass shell condition ( p
2
= m
2
c
4
) as ,p = mc
2
due to
(J.23).
332 CHAPTER 10. RENORMALIZATION
1
c
t
,
x
,
y
,
z
_
(10.24)
and parameters m
2
and (Z
2
1)
2
must be adjusted to satisfy renormalization
conditions.
15
The 2nd order contribution to the electronelectron scattering
amplitude resulting from interaction (10.23) is
16
0[a
p,
S
count
2
a
[0
=
im
2
0[a
p,
_
d
4
x( x)( x)a
[0
i(Z
2
1)
2
0[a
p,
_
d
4
x( x)(ic
+ mc
2
)( x)a
[0
=
im
2
_
d
4
x
1
(2)
3
mc
2
e
i
p x
e
x
u
a
(p, )u
a
(p
i(Z
2
1)
2
_
d
4
x
mc
2
(2)
3
e
i
p x
e
x
u
a
(p, )( ,p + mc
2
)u
a
(p
)
=
2i(m
2
)mc
2
4
( p p
p
u
a
(p, )u
a
(p
2i(Z
2
1)
2
mc
2
4
( p p
p
u
a
(p, )( ,p + mc
2
)u
a
(p
) (10.25)
Dropping factors corresponding to external electron lines and the delta func-
tion, we obtain the pure counterterm contribution
T
count
(,p) =
i(2)
4
m
2
+
i(2)
4
(Z
2
1)
2
(,p mc
2
) (10.26)
This factor can be ascribed to a new interaction vertex, which will be denoted
by a cross placed on electron lines, as in gs. 10.1(i-j). Such vertices can be
placed on electron lines in Feynman diagrams of arbitrary topological shape
15
m
2
has the dimension of energy and (Z
2
1)
2
is dimensionless. Both of them are
second-order quantities, as indicated by the subscripts.
16
This formula is obtained by inserting Q
FD
2el
(t) instead of V
c
FD
(t) in the second term on
the right hand side of (10.17).
10.2. COUNTERTERMS 333
thus increasing the order of the diagram by 2. If the counterterm vertex is
placed on an external electron line, then the 4-momentum p is on the mass
shell where the 2nd term in (10.26) vanishes. So, in this case
T
count
(,p = mc
2
) =
i(2)
4
m
2
This contribution should be added to the loop term (10.22). Thus we con-
clude that loops in external electron lines will be canceled exactly
17
if we
choose
18
m
2
=
ie
2
c
2
C
0
(2)
4
=
3mc
3
e
2
16
2
_
1
2
+ 2 ln
m
_
(10.27)
In other words, when doing calculations in the renormalized theory the self-
energy loops in external electron lines and contributions from the counterterm
(10.23) can be simply ignored. In our study this means that diagrams 10.1(b-
c) and (i-j) can be omitted.
Electron-photon loops and cross vertices can also appear in internal
electron lines whose 4-momentum p is not necessarily on the mass shell. Then
the loop contribution (10.21) has a non-vanishing divergent term proportional
to C
1
. In order to cancel this divergence, it is sucient to choose the other
renormalization factor in (10.23)
19
(Z
2
1)
2
=
ie
2
c
2
C
1
(2)
4
=
e
2
8
2
c
_
ln
m
+ 2 ln
m
+
9
4
_
(10.28)
17
In the language of traditional mass renormalization theory this cancelation comes from
the requirement that the renormalized electron propagator has a pole at ,p = mc
2
, where
m is the physical mass of the electron.
18
This expression for the mass shift can be compared with formula for m in (21)
[Fey49], with expression right after equation (8.42) in [BD64] and with second equation
on page 523 in [Sch61].
19
Compare this result with (8.43) in [BD64] and with equation (94b) in section 15 in
[Sch61]. Note that the niteness requirement does not specify this factor uniquely. One
can replace C
1
with C
1
+ , where is any nite constant and still have a nite result.
Usually, the correct choice = 0 is justied by the requirement that the residue of the
renormalized electron propagator is equal to 1. Note that this requirement is not covered
by our no-self-scattering and charge renormalization conditions. This is an additional
demand whose physical meaning is not clear to the author.
334 CHAPTER 10. RENORMALIZATION
Then innite contributions from the loop and the counterterm cancel each
other, and only a nite and harmless R-correction remains
T
loop
(,p) +T
count
(,p) =
2
e
2
c
2
R(,p)
It is responsible for so-called electron self-energy radiative corrections to
scattering amplitudes. Such corrections do not play any role in processes
discussed in this book, so we will not discuss them any further.
10.2.3 Photon self-scattering
The amplitude of scattering photonphoton in the second order is obtained
from the diagram 10.3
20
0[c
p,
S
FD
2
c
[0
=
ce
2
(2)2p(2i)
2
4
( p
p)e
(p, )
_
_
d
4
k
(,k + mc
2
)
ac
k
2
m
2
c
4
ab
(,p ,k + mc
2
)
bd
( p
k)
2
m
2
c
4
cd
_
e
(p,
)
=
ce
2
(2)2p(2)
2
4
( p
p)e
(p, )( p
2
g
)( p
2
)e
(p,
)
(10.29)
where ( p
2
) is a divergent function. It is convenient to write ( p
2
) as a sum
of its (innite) value (0) on the photons mass shell ( p
2
= 0) plus a nite
remainder ( p
2
):
( p
2
) = (0) + ( p
2
)
Function ( p
2
) can be represented by integral (11.2.22) in [Wei95], which
takes the following form at low values of energy-momentum p
( p
2
) =
(2)
4
2
2
ic
3
_
1
0
x(1 x) ln
_
1 +
p
2
x(1 x)
m
2
c
4
_
i(2)
4
p
2
60
2
m
2
c
7
(10.30)
20
We omit calculation of the integral in square brackets. This calculation can be found,
e.g., in section 11.2 of [Wei95], in section 7.5 of [PS95b] and in section 8.2 of [BD64].
10.2. COUNTERTERMS 335
p,
p,
aa
bb
cc
dd
pk
kk
Figure 10.3: Feynman diagram for the scattering photonphoton in the 2nd
perturbation order.
In particular, this function vanishes on the photons mass shell
(0) = 0 (10.31)
The factor in (10.29) associated only with the loop and two attached vertices
(no contributions from external lines) is
T
loop
( p) = e
2
( p)( p
2
g
) (10.32)
In equation (10.29), the 4-momentum p is on the mass shell, therefore the
loop contribution (10.32) vanishes there
21
and the no-self-scattering renor-
malization condition (10.12) is satised without extra eort. The same can
be said for loops in external photon legs of any diagram: these loops can
be simply ignored. However, we cannot ignore loop contributions in internal
photon lines.
22
In such cases the 4-momentum p is not necessarily on the
mass shall and factor (10.32) is divergent.
10.2.4 Photon self-energy counterterm
Similar to the electron self-energy renormalization described above, we are
going to cancel this innity by adding to the interaction operator of QED a
21
The second term in parentheses in (10.32) is not contributing due to the property
p e(p, ) = 0 proved in (K.10).
22
See, for example, the graph 10.1(d).
336 CHAPTER 10. RENORMALIZATION
new counterterm
23
Q
FD
2ph
(t) =
(Z
3
1)
2
4
_
dxF
( x)F
( x) (10.33)
where we denoted
F
= (
)(
)
=
= 2
and (Z
3
1)
2
is yet unspecied 2nd order renormalization factor. Let us
now evaluate the eect of the counterterm (10.33) on the photonphoton
scattering amplitude. From denitions (K.2) and (10.24) it follows
24
(x, t) =
i
c
(2)
3/2
_
dq
2q
q
[e
q x
e
(q, )c
q,
+ e
i
q x
e
(q, )c
q,
]
0[c
p,
(x, t) 0[
i
(2)
3/2
2pc
e
i
p x
p
(p, )
(x, t)c
[0
i
(2)
3/2
2p
c
e
x
p
(p
)[0
Then the S-matrix contribution from (10.33) has a form similar to (10.29)
0[c
p,
S
count
2
c
[0
=
i(Z
3
1)
2
2
0[c
p,
_
d
4
x
_
( x)
( x)
( x)
( x)
_
c
[0
=
i(Z
3
1)
2
2c
_
d
4
x
1
(2)
3
4pp
e
i
( p p
) x
p
(p, )e
(p
, )
23
From the dimension (O.2) of the photon quantum eld, it is easy to show that this
operator has the required dimension of energy if (Z
3
1)
2
is dimensionless.
24
Here symbol denotes the part that is relevant for calculation of the matrix element
(10.34).
10.2. COUNTERTERMS 337
+
i(Z
3
1)
2
4c
_
d
4
x
1
(2)
3
4pp
e
i
( p p
)x
p
(p, )e
(p
, )
=
i(Z
3
1)
2
4
( p p
)
2pc
[p
(p, )e
(p, ) p
(p, )e
(p, )]
=
i(Z
3
1)
2
4
( p p
)
2pc
e
(p, )[ p
2
g
]e
(p, ) (10.34)
In the Feynman diagram notation the counterterm (10.33) gives rise to a new
type of vertex,
25
which corresponds to the factor
26
T
count
( p) =
8i(Z
3
1)
2
c
2
( p
2
g
) (10.35)
The constant (Z
3
1)
2
should be chosen such that this factor exactly cancels
the loop factor (10.32) when p = 0, i.e., for external photon lines
(Z
3
1)
2
=
ie
2
c
2
(0)
8
4
(10.36)
Then for internal photon lines the sum of the loop (10.32) and the countert-
erm (10.35) is nite
T
loop
( p) +T
count
( p) = e
2
( p
2
)( p
2
g
) (10.37)
This means that an electron-positron loop and a photon-line cross taken
together
27
result in a nite correction to the scattering amplitude. This is
the so-called vacuum polarization radiative correction.
10.2.5 Charge renormalization
If our only goal is to make the perturbation theory expansion nite, then the
choice of the renormalization parameter (10.36) is not unique. Indeed, we
could add to (0) in (10.36) an arbitrary nite number , so that
25
denoted by a cross placed on photon lines, as in Fig. 10.1(k)
26
This factor is obtained from (10.34) by stripping o the 4-momentum delta func-
tion as well as factors c
1/2
(2)
3/2
(2p)
1/2
e
(p, ) and c
1/2
(2)
3/2
(2p)
1/2
e
(p, )
associated with external photon lines.
27
e.g., the sum of diagrams 10.1(d) and 10.1(k)
338 CHAPTER 10. RENORMALIZATION
(Z
3
1)
2
=
ie
2
c
2
((0) + )
8
4
) (10.38)
Why dont we do that? The answer is that such an addition would be
inconsistent with the charge renormalization condition in Postulate 10.2.
To see that, let us evaluate the contribution to the electron-proton scat-
tering from diagrams 10.1(d) and (k). Using Feynmans rules and (10.38) we
obtain
28
0[a
q,
d
p,
S
(d)+(k)
4
d
[0
=
e
2
mMc
4
4
c
4
(2i)
2
(2)
4
_
q
p
4
( q q
+ p)
e
2
((
k
2
) )
2
U
(q, ; q
)
g
k
2
(
k
2
g
)
g
k
2
W
(p, ; p
)
=
e
4
mMc
4
c
4
2
(2)
6
_
q
p
4
( q q
+ p)
(
k
2
)
k
2
U
(q, ; q
)W
(p, ; p
e
4
c
4
(2)
6
4
( q q
+ p)
(
k
2
)
k
2
,
,
(10.39)
If ,= 0, then this matrix element has a singularity /
k
2
at small values of
q = p p
= U
= 0.
10.2. COUNTERTERMS 339
10.2.6 Vertex renormalization
Our renormalization task is not completed yet. One more type of counterterm
is required in order to make QED calculations nite and accurate.
Let us evaluate diagram 10.1(e) using Feynman rules
0[a
q,
d
p,
S
(e)
4
d
[0
e
4
c
4
mMc
4
(2i)
4
(2)
2
_
q
p
4
( q q
+ p)u(q, )
__
d
4
h
,h+ ,q + mc
2
(
h q)
2
m
2
c
4
,h+ ,q
+ mc
2
(
h q
)
2
m
2
c
4
h
2
_
u(q
)
1
( q
q)
2
W
(p, ; p
) (10.40)
The integral in square brackets I
( q, q
) is calculated in (M.46)
I
( q, q
)
=
2
ic
3
_
8
tan(2)
ln
m
+
8
tan(2)
_
0
tan d +
1
2
+ 6 cot + 2 ln
m
_
2
2
(q + q
imc
5
sin(2)
(10.41)
To apply the charge renormalization condition, we should consider this
expression at small values of the transferred momentum (i.e., when q q
,
0). In this case we can use equation (J.4)
0 = u(q, )(
(,q mc
2
) + (,q mc
2
)
)u(q,
)
= u(q, )(
mc
2
)u(q,
)
= u(q, )(2g
mc
2
)u(q,
)
= 2u(q, )(q
mc
2
)u(q,
)
Therefore, in (10.41) we can set q
(q
mc
2
and obtain
29
29
In our notation (M.40), (M.41) sin
1
_
k
2
2mc
2
_
.
340 CHAPTER 10. RENORMALIZATION
lim
k0, q0
I
( q, q
) =
F
2
ic
3
where we introduced an ultraviolet-divergent and infrared-divergent con-
stant
30
F 4 ln
m
+ 2 ln
m
+
9
2
Then at small values of
k the scattering amplitude (10.40)
0[a
q,
d
p,
S
(e)
4
d
[0
=
ie
4
c
4
F
2
mMc
4
4
( q q
+ p)
c
3
(2i)
4
(2)
2
_
q
p
p
( q
q)
2
U
(q, ; q
)W
(p, ; p
)
(10.42)
has a singularity F/
k
2
. This means that in disagreement with our Postu-
late 10.2, the 4th perturbation order makes a non-trivial contribution to the
long-distance electron-proton scattering. Even more disturbing is the fact
that this eect is innite in the limit .
This unacceptable situation can be xed by adding one more (vertex)
renormalization counterterm to the QED interaction
Q
FD
3
(t) = e(Z
1
1)
2
_
dx( x)
( x)A
( x) (10.43)
In Feynman diagrams we will denote the corresponding three-leg vertex by a
circle, as shown in diagram 10.1(h). The renormalization constant (Z
1
1)
2
is of the 2nd perturbation order, so the order of the counterterm (10.43) is 3.
It has the same form as the basic QED interaction (10.15), so the diagram
10.1(h) is easily evaluated
31
30
See equation (23) in [Fey49].
31
compare with equation (9.36)
10.3. RENORMALIZED S-MATRIX 341
0[a
q,
d
p,
S
(h)
4
d
[0
=
ie
2
c
2
(Z
1
1)
2
mMc
4
4
( q + p q
)
4
2
q
p
p
( q
q)
2
U
(q, ; q
)W
(p, ; p
)
Our requirement to cancel the innite/singular term (10.42) tells us that we
need to choose our renormalization constant as
32
(Z
1
1)
2
=
e
2
F
16
2
c
= (Z
2
1)
2
After adding all three renormalization counterterms (10.23), (10.33) and
(10.43) the full Feynman-Dyson interaction operator (10.16) takes the form
V
c
FD
(t)
= V
1
(t) + Q
FD
2el
(t) + Q
FD
2ph
(t) + Q
FD
3
(t) + . . .
= e
_
dx( x)
( x)A
( x) + e
_
dx( x)
( x)A
( x)
+m
2
_
dx( x)( x) + (Z
2
1)
2
_
dx( x)(ic
+ mc
2
)( x)
(Z
3
1)
2
4
_
dxF
( x)F
( x) e(Z
1
1)
2
_
dx( x)
( x)A
( x) + . . .
(10.44)
10.3 Renormalized S-matrix
Equation (10.44) is the renormalized QED interaction operator that is ac-
curate up to the 3rd perturbation order. Our claim was that inserting this
interaction in the usual formula (10.17) for the S-operator we can obtain
ultraviolet-nite scattering amplitudes. Let us now support this claim with
explicit calculation of all 4th order diagrams in Fig. 10.1.
33
In this section
32
The equality with the electron-photon loop renormalization factor (Z
2
1)
2
in (10.28)
is not accidental. It is explained in section 8.6 in [BD64].
33
As we already know, diagrams 10.1(b), (c), (i), (j) cancel out exactly.
342 CHAPTER 10. RENORMALIZATION
we are going to calculate four coecient functions s
4
on the right hand side
of
0[a
q,
d
p,
S
c
4
d
[0 = (s
(d)+(k)
4
+ s
(e)+(h)
4
+ s
(f)
4
+ s
(g)
4
)
4
( q + p q
)
(10.45)
For our purposes it will be sucient to work in the limit of low momenta
of particles
34
and small transferred momentum
k.
10.3.1 Diagrams 10.1(d) + (k)
Inserting (10.30) in (10.39) and setting = 0 we nd that in our approxi-
mation the S-matrix elements described by diagrams 10.1(d) and (k) do not
depend on particle momenta and spins
35
s
(d)+(k)
4
ie
4
2
(2)
2
60
2
m
2
c
3
,
,
=
i
2
15
2
m
2
c
,
,
(10.46)
10.3.2 Diagrams 10.1(e) + (h)
The full electron vertex contribution
36
is given by equation (10.40) where the
square bracket should be replaced by the ultraviolet-nite expression
I
( q, q
)
F
2
ic
3
=
2
ic
3
_
8
tan(2)
ln
m
+
8
tan(2)
_
0
tan d +
1
2
+ 6 cot + 2 ln
m
_
2
2
( q + q
imc
5
sin(2)
2
ic
3
_
4 ln
m
+ 2 ln
m
+
9
2
_
=
2
ic
3
_
_
8
tan(2)
4
_
ln
m
+
8
tan(2)
_
0
tand 4 + 6 cot
_
34
see Appendix J.9
35
Here e
2
/(4c) 1/137 is the ne structure constant.
36
Figs. 10.1(e) and (h)
10.3. RENORMALIZED S-MATRIX 343
2
2
( q + q
imc
5
sin(2)
In the limit of small momentum transfer ( 0) this formula simplies
_
k
2
2mc
2
8
tan(2)
4[
k[
mc
2
tan
_
|
k|
mc
2
_
4[
k[
mc
2
_
|
k|
mc
2
+
|
k|
3
3m
3
c
6
_ 4
4
k
2
3m
2
c
4
8
tan(2)
_
0
tand
4
_
0
2
d =
4
3
3
k
2
3m
2
c
4
6 cot 6
_
1
3
_
= 6 2
2
6
k
2
2m
2
c
4
2
sin(2)
2
2 (4/3)
3
1 +
2
2
3
1 +
k
2
6m
2
c
4
I
( q, q
)
F
2
ic
3
2
2
ic
3
_
1
k
2
12m
2
c
4
_
2
( q + q
imc
5
_
1 +
k
2
6m
2
c
4
_
ic
3
4
k
2
3m
2
c
4
ln
m
so that diagrams 10.1(e) + (h) become
s
(e)+(h)
4
ic
3
2
4
2
k
2
Mmc
4
_
q
p
(p, ; p
)
u(q, )
_
2
_
1
k
2
12m
2
c
4
_
( q + q
mc
2
_
1 +
k
2
6m
2
c
4
_
k
2
3m
2
c
4
ln
m
_
u(q
)
(10.47)
This expression can be divided into two parts
s
(e)+(h)
4
= s
(e)+(h)AMM
4
+ s
(e)+(h)div
4
344 CHAPTER 10. RENORMALIZATION
where s
(e)+(h)AMM
4
remains nite in the infrared limit 0.
37
and s
(e)+(h)div
4
contains the infrared-divergent logarithm ln(/m). Let us now introduce the
vector of transferred momentum k = q
q = pp
pk
q+k
u(q +k, )
_
2
k
2
6m
2
c
2
(q + k)
+ q
mc
2
_
1
k
2
6m
2
c
2
__
u(q,
)
W
(p k, ; p,
) (10.48)
ic
2
4
2
k
2
Mmc
4
_
pk
q+k
_
2
_
1 +
k
2
12m
2
c
2
_
U
0
(q +k, ; q,
)
_
1
k
2
6m
2
c
2
_
q+k
+
q
mc
2
u(q +k, )u(q,
)
_
We use formulas from Appendices J.5, J.9 and (H.12) to obtain
2ic
2
4
2
k
2
Mmc
4
_
pk
q+k
q
_
1 +
k
2
12m
2
c
2
_
U
0
(q +k, ; q,
ic
2
2
2
k
2
_
1
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
__
1 +
k
2
12m
2
c
2
_
_
1 +
(2q +k)
2
+ 2i
el
[k q]
8m
2
c
2
_
ic
2
2
2
k
2
_
1
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
+
k
2
12m
2
c
2
+
q
2
2m
2
c
2
+
qk
2m
2
c
2
+
k
2
8m
2
c
2
+ i
el
[k q]
4m
2
c
2
_
ic
2
2
2
_
1
k
2
1
24m
2
c
2
+ i
el
[k q]
4m
2
c
2
k
2
_
(10.49)
ic
2
4
2
k
2
Mmc
4
_
pk
q+k
q
k
2
_
1
k
2
6m
2
c
2
_
q+k
+
q
mc
2
u(q +k, )u(q,
)
37
In chapter 13.6 we will see that this expression is related to the electrons anomalous
magnetic moment (AMM).
38
see Appendix J.9. For example, in this limit we can replace W
0
,
and W 0.
10.3. RENORMALIZED S-MATRIX 345
=
ic
2
2
2
k
2
_
1
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
k
2
6m
2
c
2
+
q
2
+ k
2
+ 2qk
4m
2
c
2
+
q
2
4m
2
c
2
_
_
_
q+k
+ mc
2
,
_
q+k
mc
2
_
q +k
[q +k[
el
__
_ _
q
+ mc
2
q
mc
2
_
q
q
el
_
_
2mc
2
=
ic
2
2
2
k
2
_
1
k
2
6m
2
c
2
_
_
_
q+k
+ mc
2
_
q
+ mc
2
2mc
2
_
q+k
mc
2
_
q
mc
2
2mc
2
_
q +k
[q +k[
el
__
q
q
el
_
_
ic
2
2
2
k
2
_
1
k
2
6m
2
c
2
_
__
1 +
(q +k)
2
8m
2
c
2
__
1 +
q
2
8m
2
c
2
_
[q +k[q
4m
2
c
2
_
q +k
[q +k[
el
__
q
q
el
__
=
ic
2
2
2
1
k
2
+
1
24m
2
c
2
+
i
el
[k q]
4m
2
c
2
k
2
_
(10.50)
Putting (10.49) and (10.50) together, we obtain
39
s
(e)+(h)AMM
4
=
2
4
2
m
2
c
el
[k q]
k
2
(10.51)
For the -dependent part of (10.47) we use non-relativistic approximation
s
(e)+(h)div
4
=
i
2
Mmc
4
3
2
m
2
c
_
pk
q+k
q
ln
_
m
_
(
U
W)
i
2
3
2
m
2
c
ln
_
m
_
,
,
(10.52)
10.3.3 Diagram 10.1(f)
Let us investigate the contribution in S
c
4
corresponding to the ladder diagram
10.1(f). According to Feynman rules, it is given by the integral
40
39
As expected, Coulomb-like terms 1/k
2
have canceled out.
40
Here we dropped spin indices, as in our approximation the spin dependence will be
lost anyway.
346 CHAPTER 10. RENORMALIZATION
s
(f)
4
=
e
4
(2)
16
4
1
(2)
2
(2)
6
4
c
4
(2)
2
(2)
6
Mmc
4
(2)
6
_
q
p
_
d
4
h
u(q)
(,q ,h + mc
2
)
u(q
)
( q
h)
2
m
2
c
4
w(p)
(,p+ ,h + Mc
2
)
w(p
)
( p +
h)
2
M
2
c
4
1
[
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
In the numerator we use Dirac equations (J.82) - (J.81) for functions u(q)
and w(p) and the anticommutator relationship (J.4) for gamma matrices to
write
[u(q)
(,q ,h + mc
2
)
u(q
)] [w(p)
(,p+ ,h + Mc
2
)
w(p
)]
= [u(q)
(,q + mc
2
)
u(q
) u(q)
,h
u(q
)]
[w(p)
(,p + Mc
2
)
w(p
) + w(p)
,h
w(p
)]
= [2u(q)q
u(q
) u(q)
,h
u(q
)][2w(p)p
w(p
) + w(p)
,h
w(p
)]
= 4( q p)u(q)
u(q
)w(p)
w(p
) + 2u(q)
u(q
)w(p) ,q
w(p
)h
2u(q) ,p
u(q
)w(p)
w(p
)h
u(q)
u(q
)w(p)
w(p
)h
h)
2
m
2
c
4
=
h
2
2( q
h)
( p +
h)
2
M
2
c
4
=
h
2
+ 2( p
h)
Using non-relativistic approximations we then obtain
s
(f)
4
e
4
c
4
(2)
4
(2)
2
[4( q p)u(q)
u(q
)w(p)
w(p
)b(p, q, k)
+ 2u(q)
u(q
)w(p) ,q
w(p
)b
(p, q, k)
2u(q) ,p
u(q
)w(p)
w(p
)b
(p, q, k)
u(q)
u(q
)w(p)
w(p
)b
h
2
2( q
h][
h
2
+ 2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
(10.54)
b
(p, q, k) =
_
d
4
hh
h
2
2( q
h)][
h
2
+ 2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
b
(p, q, k) =
_
d
4
hh
h
2
2( q
h)][
h
2
+ 2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
In our calculations we are interested only in leading infrared-divergent
terms. They come from those regions in the 4D space of the integration vari-
able
h, where integrands denominators vanish in the limit 0. These are
regions
h 0 and
h
(p, q, k) k
b(p, q, k)
b
(p, q, k) k
b(p, q, k)
Now substitute these results in (10.53) and use denitions (J.62) - (J.63) of
functions U
and W
s
(f)
4
=
e
4
c
4
(2)
4
(2)
2
b(p, q, k)
[4( q p)(u(q)
u(q
)w(p)
w(p
) 2u(q)
u(q
)w(p) ,q ,k
w(p
)
+ 2u(q) ,p ,k
u(q
)w(p)
w(p
) u(q)
,k
u(q
)w(p)
,k
w(p
)]
=
e
4
c
4
(2)
4
(2)
2
b(p, q, k)[4( q p)(
U
W) 2U
w(p) ,q ,k
w(p
)
+ 2u(q) ,p ,k
u(q
)W
u(q)
,k
u(q
)w(p)
,k
w(p
)]
Next we need to simplify separate pieces of this expression
w(p) ,q ,k
w(p
) = w(p) ,q ,p
w(p
) w(p) ,q ,p
w(p
)
= w(p) ,q ,p
w(p
) w(p) ,q(2(p
mc
2
)w(p
)
= w(p)( ,p ,q + 2( q p))
w(p
) 2(p
( q
W) + mc
2
w(p) ,q
w(p
)
348 CHAPTER 10. RENORMALIZATION
= w(p)(mc
2
,q + 2( q p))
w(p
) 2(p
( q
W) + mc
2
w(p) ,q
w(p
)
= mc
2
w(p) ,q
w(p
) + 2( q p)W
2(p
( q
W)
+ mc
2
w(p) ,q
w(p
) = 2( q p)W
2(p
( q
W) (10.55)
u(q) ,p ,k
u(q
) = u(q) ,p(,q
,q)
u(q
)
= u(q) ,p ,q
u(q
) u(q) ,p ,q
u(q
)
= u(q) ,p(
,q
+ 2(q
)u(q
) u(q)( ,q ,p + 2( q p))
u(q
)
= u(q) ,p
,q
u(q
) + 2u(q) ,p(q
u(q
) + u(q) ,q ,p
u(q
)
2( q p)U
= mc
2
u(q) ,p
u(q
) + 2(q
( p
U) + mc
2
u(q) ,p
u(q
) 2( q p)U
= 2(q
( p
U) 2( q p)U
u(q)
,k
u(q
)
= u(q)
,q
u(q
) u(q)
,q
u(q
)
= u(q)
,q
+ 2(q
)u(q
) u(q)( ,q
+ 2q
u(q
)
= u(q)
,q
u(q
) + 2u(q)
(q
u(q
) + u(q) ,q
u(q
)
2u(q)q
u(q
)
= mc
2
u(q)
u(q
) + 2(q
+ mc
2
u(q)
u(q
) 2q
= 2(q
2q
(10.56)
w(p)
,k
w(p
) = 2(p
+ 2p
(10.57)
Then we use equalities (M.47), (M.48) and non-relativistic approximations
( q
) Mmc
4
, (
U
W) 1 to write
s
(f)
4
=
e
4
c
4
(2)
4
(2)
2
b(p, q, k)
[4( q p)(
U
W) 2U
(2( q p)W
2(p
( q
W))
+ 2(2(q
(p U) 2( q p)U
)W
(2(q
2q
)(2(p
+ 2p
)]
10.3. RENORMALIZED S-MATRIX 349
=
4e
4
c
4
(2)
4
(2)
2
b(p, q, k)
[( q p)(
U
W) ( q p)(
U
W) + ( p
U)( q
W) + ( q
W)( p
U) ( q p)(
U
W)
+ ( q
)(
U
W) ( q
W)( p
U) ( q
W)( p
U) + ( q p)(
U
W)]
=
4e
4
c
4
(2)
4
(2)
2
b(p, q, k)( q
)(
U
W)
4e
4
Mmc
8
(2)
4
(2)
2
b(p, q, k) (10.58)
Function b(p, q, k) is evaluated in (M.52)
b(p, q, k) =
2
ic
3
k
2
ln
_
k
2
2
c
4
_
1
_
0
dy
( p + q)
2
y
2
2 p( p + q)y + p
2
(10.59)
This integral is of the table form
_
dy
ay
2
+ by + c
=
2
4ac b
2
tan
1
_
2ax + b
4ac b
2
_
+ const
So, we get
1
_
0
dy
( p + q)
2
y
2
2 p( p + q)y + p
2
= 2B
1
tan
1
_
2( p + q)
2
y 2 p( p + q)
B
_
y=1
y=0
= 2B
1
_
tan
1
_
2 q
2
+ 2( p q)
B
_
+ tan
1
_
2 p
2
+ 2( p q)
B
__
1
iMc
3
q
_
tan
1
_
mc
iq
_
+ tan
1
_
Mc
iq
__
(10.60)
where we used M m and denoted
B
_
4( p + q)
2
p
2
4( p
2
+ ( p q))
2
= 2
_
p
2
q
2
( p q)
2
= 2
_
M
2
m
2
c
8
( p q)
2
350 CHAPTER 10. RENORMALIZATION
2
M
2
m
2
c
8
__
Mc
2
+
p
2
2M
__
mc
2
+
q
2
2m
_
c
2
(pq)
_
2
2
_
M
2
c
6
q
2
= 2iMc
3
q
Putting results (10.58) - (10.60) together and using
k
2
c
2
k
2
, we nally
obtain
s
(f)
4
2
mc
2
2
qk
2
_
tan
1
_
mc
iq
_
+ tan
1
_
Mc
iq
__
ln
_
k
2
2
c
2
_
(10.61)
We will not elaborate this result further as we expect some cancelations with
the crossed ladder diagram evaluated in the next subsection.
10.3.4 Diagram 10.1(g)
Similar to the above ladder diagram we calculate the crossed ladder diagram
shown in Fig. 10.1(g)
s
(g)
4
=
e
4
c
4
(2)
4
(2)
2
_
d
4
h
u(q)
(,q ,h + mc
2
)
u(q
)
( q
h)
2
m
2
c
4
+ i
w(p)
(,p
,h + Mc
2
)
w(p
)
( p
h)
2
M
2
c
4
1
[
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
In the numerator we use (J.4), (J.82) - (J.81) to write
[u(q)
(,q ,h + mc
2
)
u(q
)] [w(p)
(,p
,h + Mc
2
)
w(p
)]
= [u(q)
(,q + mc
2
)
u(q
) u(q)
,h
u(q
)]
[w(p)
(,p
+ Mc
2
)
w(p
) w(p)
,h
w(p
)]
= [2u(q)q
u(q
) u(q)
,h
u(q
)]
[2w(p)(p
w(p
) w(p)
,h
w(p
)]
= 4u(q) ,p
u(q
)w(p) ,qw(p
) 2u(q)
u(q
)w(p) ,q
w(p
)h
2u(q)
,p
u(q
)w(p)
w(p
)h
+ u(q)
u(q
)w(p)
w(p
)h
u(q
)w(p) ,qw(p
)b(p
, q, k)
2u(q)
u(q
)w(p) ,q
w(p
)b
(p
, q, k)
2u(q)
,p
u(q
)w(p)
w(p
)b
(p
, q, k)
+ u(q)
u(q
)w(p)
w(p
)b
(p
, q, k)] (10.62)
Here we noticed that integral
b(p
, q, k)
_
d
4
h
[
h
2
2( q
h)][
h
2
2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
can be obtained from (10.54) by replacing p p
, q, k)
b
(p
, q, k)
_
d
4
hh
h
2
2( q
h)][
h
2
2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
k
b(p
, q, k)
b
(p
, q, k)
_
d
4
hh
h
2
2( q
h)][
h
2
2( p
h)][
h
2
2
c
4
][(
h +
k)
2
2
c
4
]
k
b(p
, q, k)
Then in (10.62) we can use (10.55), (10.56), (10.57) and
u(q)
,k ,p
u(q
) = u(q)
(,q
,q) ,p
u(q
)
= u(q)
,q
,p
u(q
) u(q)
,q ,p
u(q
)
= u(q)
,p
,q
u(q
) + 2u(q)
(q
)u(q
) + u(q) ,q
,p
u(q
)
2u(q)q
,p
u(q
) = 2u(q)
(q
)u(q
) 2u(q)q
,p
u(q
)
= 2( q
)U
2( p
U)q
to obtain
352 CHAPTER 10. RENORMALIZATION
s
(g)
4
=
e
4
c
4
(2)
4
(2)
2
b(p
, q, k)
[4( p
U)( q
W) + 2U
(2( q p)W
2(p
(q W))
+ 2(2( q
)U
2( p
U)q
)W
+ (2(q
2q
)(2(p
+ 2p
)]
=
4e
4
c
4
(2)
4
(2)
2
b(p
, q, k)
[( p
U)( q
W) + ( q p)(
U
W) ( p
U)( q
W) + ( q
)(
U
W)
( p
U)( q
W) ( q
)(
U
W) + ( q
W)( p
U) + ( q
W)( p
U)
( q p)(
U
W)]
=
4e
4
c
4
(2)
4
(2)
2
b(p
, q, k)( q
W)( p
U)
4e
4
Mmc
8
(2)
4
(2)
2
b(p
, q, k)
For the integral
b(p
, q, k) =
2
ic
3
k
2
ln
_
k
2
2
c
4
_
1
_
0
dy
( p
+ q)
2
y
2
+ 2 p
( p
+ q)y + p
2
we use the same method as in (10.60). This time in our non-relativistic
approximation ( p
q) Mmc
4
B
_
4( q p
)
2
( p
)
2
4(( p
)
2
( p
q))
2
B = 2iMc
3
q
1
_
0
dy
( p
+ q)
2
y
2
+ 2 p
( p
+ q)y + p
2
2B
1
tan
1
_
2( p
+ q)
2
y + 2 p
( p
+ q)
B
_
y=1
y=0
= 2B
1
_
tan
1
_
2 q
2
2( p
q)
B
_
+ tan
1
_
2 p
2
2( p
q)
B
__
1
iMc
3
q
_
tan
1
_
mc
iq
_
+ tan
1
_
Mc
iq
__
10.3. RENORMALIZED S-MATRIX 353
and
s
(g)
4
2
mc
2
2
qk
2
_
tan
1
_
mc
iq
_
+ tan
1
_
Mc
iq
__
ln
_
k
2
2
c
2
_
Adding this result to (10.61) and using approximation tan
1
(Mc/(iq))
/2 we obtain the joint contribution of the ladder and crossed ladder dia-
grams
s
(f)+(g)
4
2
mc
2
qk
2
ln
_
k
2
2
c
2
_
(10.63)
10.3.5 Renormalizability
Combining results (10.46), (10.51), (10.52), (10.63) we get the following -
independent 4th order amplitude for the electron-proton scattering
0[a
q,
d
p,
S
c
4
d
[0
4
( q q
+ p)
_
i
2
15
2
m
2
c
+
i
2
3
2
m
2
c
ln
_
m
_
mc
2
2
qk
2
ln
_
k
2
2
c
2
_
(
el
[k q])
4
2
m
2
ck
2
_
(10.64)
This amplitude contains unpleasant infrared-divergent logarithms. Their
physical origin is in the fact that any collision involving charged particles
is inevitably accompanied by the emission of a large (even innite) num-
ber of low-energy (soft) photons. In most cases these soft photons escape
experimental detection, but in a rigorous theoretical treatment one must
take them all into account in order to obtain scattering cross-sections in
good agreement with experiments. This would require rather involved non-
perturbative calculations [Wei95, PS95b] that are beyond the scope of this
book. The cancelation of infrared divergences in calculations of the hydrogen
energy spectrum will be discussed in chapter 13.6.
Can we get even better accuracy by extending our renormalization ap-
proach to higher perturbation orders? Yes, but then we would need to add
higher-order ultraviolet-divergent counterterms to our interaction (10.44),
354 CHAPTER 10. RENORMALIZATION
so that the no-self-scattering and charge renormalization conditions are en-
forced in each perturbation order. It is remarkable that all these higher-order
counterterms will have exactly the same functional form as those already
discussed. In other words, the complete innite-order interaction operator
of the renormalized QED has the same form as our 3rd order expression
(10.44). Only the values of exact renormalization constants m, Z
2
1, Z
3
1
and Z
1
1 get more complicated forms than our 2nd order expressions
m
2
, (Z
2
1)
2
, (Z
3
1)
2
, (Z
1
1)
2
. This fact is referred to as the renor-
malizability of QED. The usual interpretation is that renormalization (=the
addition of counterterms) is equivalent to redenition of parameters (masses
and charges) in the Lagrangian. In fact, these parameters become innite
after the renormalization.
Our interpretation is dierent. We claim that [0, a
p
[0 and c
p
[0 rep-
resent real physical 0-particle and 1-particle states with nite masses and
charges. The only eect of renormalization is to add certain divergent coun-
terterms to the original QED interaction V
1
, as shown in (10.44). In the
second part of this book we will show how one can get rid of these articial
divergences in the Hamiltonian.
We have applied renormalization only to the potential energy operator V
in QED. In order to have a relativistic theory one also need to nd appropri-
ate counterterms for the potential boost operator Z, so that the renormal-
ized boost satises appropriate Poincare commutation relations with the
renormalized energy. As far as I know, there were no attempts to extend
the renormalization theory to boosts. Nevertheless, we will assume that such
a construction is possible and that renormalized QED is a fully relativistic
theory.
10.3.6 On the origins of QED interaction
So far we simply postulated the QED interaction operator (9.12) - (9.15) or
its renormalized form (10.44). What are the physical origins of these expres-
sions? Are there deeper fundamental principles that demand this particular
form of electromagnetic interactions? The standard textbook answer is yes,
and it is claimed that the true reason for interactions between charged par-
ticles and photons is the principle of local gauge invariance. It is usually
postulated that the Lagrangian of electromagnetic theory must be invariant
with respect to certain simultaneous gauge transformations of the fermion
elds (x, t), (x, t) and the photon eld A
, where B
is a Lorentz
scalar despite the non-4-vector character of A
is the fermion current density (L.1). This line of arguments leads one to the
electromagnetic interaction operator (9.12) - (9.15) that is consistent with
gauge-based derivations.
Weinbergs arguments for choosing the interacting eld theory Hamilto-
nian (or Lagrangian) do not appear very convincing, especially if one takes
into account the need for adding divergent renormalization counterterms as
41
see condition (II) in step 2. on page 299
42
see equation (K.23)
356 CHAPTER 10. RENORMALIZATION
in (10.44). The apparent success of the renormalization program seems com-
pletely mysterious. It is puzzling how innite counterterms (almost) cancel
divergences in the S-matrix expansion and how the tiny residual radiative
corrections come out in perfect agreement with measurements.
43
With our
present ad hoc theoretical basis of renormalized QED, this remarkable agree-
ment looks like an accident.
In conclusion we would like to stress that the presently accepted formu-
lation of QED lacks solid theoretical foundation. Quantum elds and gauges
seem to be heuristic devices, and the whole construction is supported by
agreement with experiments more than by reliance on well-tested physical
principles. Bearing this weakness in mind, in the second part of this book we
will attempt to reformulate the QED formalism. Instead of elds and gauges,
our approach will be based on the ideas of point particles and instantaneous
interaction potentials.
43
see chapter 13.6
Part II
QUANTUM THEORY OF
PARTICLES
357
359
In the rst part of this book we presented a fairly traditional view on
relativistic quantum eld theory. This well-established approach had great
successes in many important areas of high energy physics, in particular, in
the description of scattering events. However, it also has a few troubling
spots. The rst one is the problem of ultraviolet divergences. The idea of
self-interacting bare particles with innite masses and charges seems com-
pletely unphysical. Moreover, QFT is not suitable for the description of time
evolution of particle observables and their wave functions. In this second part
of the book, we suggest that these problems can be solved by abandoning
the idea of quantum elds as basic ingredients of nature and returning to the
old (going back to Newton) concept of particles interacting via direct forces.
This reformulation of QFT is achieved by applying the dressed particle
approach rst developed by Greenberg and Schweber [GS58].
Instantaneous forces acting between dressed particles imply the real pos-
sibility of sending superluminal signals. Then we nd ourselves in contra-
diction with special relativity, where faster-than-light signaling is strictly
forbidden (see Appendix I.3). This paradox forces us to take a second look
on the derivation of basic results of special relativity, such as Lorentz trans-
formations for space and time coordinates of events. We nd that previous
theories missed one important point. Specically, they ignored the fact that
in interacting systems generators of boost transformations are interaction-
dependent. The recognition of this fact allows us to reconcile faster-than-
light interactions with the principle of causality in all reference frames and
to build a consistent relativistic theory of interacting quantum particles in
this second part of the book.
360
Chapter 11
DRESSED PARTICLE
APPROACH
The rst principle is that you must not fool yourself - and you
are the easiest person to fool.
Richard Feynman
In this chapter we will continue our discussion of quantum electrodynam-
ics - the theory of interacting charged particles (electrons, protons, etc.) and
photons. Great successes of this theory are well known. In section 11.1 we
are going to focus on its weak points. The most obvious problem of QED is
related to extremely weird properties of its fundamental ingredients - bare
particles. The masses and charges of bare electrons and protons are innite
and the Hamiltonian H
c
of QED is formally innite as well. More precisely,
coecient functions of interaction terms in H
c
= H
0
+ V
c
written in the
bare-particle representation (10.13) diverge as the ultraviolet cuto momen-
tum is sent to innity. As we explained earlier, this operator can be used for
S-matrix calculations, where all divergences cancel out. However, its use for
bound state or time evolution studies seems problematic.
We are going to demonstrate that the formalism of QED can be signi-
cantly improved by removing ultraviolet-divergent terms from the Hamilto-
nian and abandoning the ideas of non-observable virtual and bare particles.
In particular, in sections 11.2 - 11.3 we will nd a nite dressed parti-
cle Hamiltonian H
d
which, in addition to accurate scattering operators, also
361
362 CHAPTER 11. DRESSED PARTICLE APPROACH
provides a good description of the time evolution and bound states. We
will call this approach the relativistic quantum dynamics (RQD). The word
dynamics is used here because, unlike the traditional quantum eld theory
concerned with calculations of time-independent S-matrices, RQD empha-
sizes the dynamical, i.e., time-dependent, nature of interacting processes.
11.1 Troubles with renormalized QED
11.1.1 Renormalization in QED revisited
Let us now take a closer look at the renormalized QED and recall the logic
which led us from the original Feynman-Dyson interaction Hamiltonian V
1
in (10.15) to the interaction with counterterms V
c
in (10.44).
One distinctive feature of the operator V
1
is that it contains unphys (U)
terms. In order to obtain the scattering operator F one needs to calculate
multiple commutators of V
1
as in (8.68). It is clear from Table 8.2 that these
commutators will give rise to renorm terms
1
in each perturbation order of
F. However, according to equation (10.6) and Statement 10.1 (the no-self-
scattering renormalization condition), there should be no renorm terms in
the operator F of any sensible theory. This requirement can be satised only
if there is a certain balance of unphys and renorm terms in V
1
, so that all
renorm terms in F cancel out. However, there is no such balance in the
purely unphys Feynman-Dyson interaction V
1
. So, we have a contradiction.
The traditional renormalization approach discussed in chapter 10 sug-
gests the following resolution of this paradox: change the interaction oper-
ator from V
1
to V
c
by adding (innite) counterterms. Two conditions are
used to select the counterterms. The rst (no-self-scattering renormaliza-
tion) condition requires cancelation of all renorm terms in the scattering
phase operator F
c
calculated with V
c
. The second (charge renormalization)
condition demands the consistency with classical electrodynamics in the low
energy regime. Somewhat miraculously, the S-matrix obtained with thus
modied Hamiltonian H
c
= H
0
+ V
c
agrees with experiment at all energies
and for all scattering processes.
Frequently one can meet interpretations of the renormalization approach,
which say that innities in the Hamiltonian H
c
(10.44) have a real physical
1
in commutators [U, U
]
11.1. TROUBLES WITH RENORMALIZED QED 363
meaning. One can hear also that bare electrons and protons really have in-
nite masses and innite charges.
2
The fact that such particles were never
observed in nature is then explained as follows: Bare particles are not eigen-
states of the total Hamiltonian H
c
. The physical electrons and protons ob-
served in experiments are complex linear combinations of multiparticle bare
states. These linear combinations are eigenstates of the total Hamiltonian
and they do have correct (nite) measurable masses m and M and charges
e. This situation is often described as bare particles being surrounded by
clouds of virtual particles, thus forming physical or dressed particles. The
virtual cloud modies the mass of the bare particle by an innite amount,
so that the resulting mass is exactly the one measured in experiments. The
cloud also shields the (innite) charge of the bare particle, so that the
eective charge becomes e.
Even if we accept this weird description of physical reality, it is clear
that the renormalization program did not solve the problem of ultraviolet
divergences in quantum eld theory. The divergences were removed from the
S-operator, but they reappeared in the Hamiltonian H
c
in the form of innite
counterterms and this approach just shifted the problem of innities from one
place to another. Inconsistencies of the renormalization approach concerned
many prominent scientists, such as Dirac and Landau. For example, Rohrlich
wrote
Thus, present quantum electrodynamics is one of the strangest
achievements of the human mind. No theory has been conrmed
by experiment to higher precision; and no theory has been plagued
by greater mathematical diculties which have withstood repeated
attempts at their elimination. There can be no doubt that the
present agreement with experiments is not fortuitous. Neverthe-
less, the renormalization procedure can only be regarded as a tem-
porary crutch which holds up the present framework. It should be
noted that, even if the renormalization constants were not innite,
2
It is also common to hypothesize that these bare parameters may be actually very large
rather than innite. The idea is that the granularity of space-time or other yet unknown
Planck-scale eect sets a natural momentum cuto. This eective eld theory approach
assumes that QED is just a low energy approximation to some unknown divergence-free
truly fundamental theory operating at the Planck scale. Speculations of this kind are
not needed for the dressed particle approach developed in the next section. The dressed
particle Hamiltonian and the corresponding S-operator remain nite, even if the cuto
momentum is set to innity.
364 CHAPTER 11. DRESSED PARTICLE APPROACH
the theory would still be unsatisfactory, as long as the unphysical
concept of bare particle plays a dominant role. F. Rohrlich
[Roh]
11.1.2 Time evolution in QED
In addition to peculiar innite masses and charges of bare particles, tra-
ditional QED predicts rather complex dynamics of bare vacuum and one-
particle states. Let us forget for a moment that interaction terms in H
c
are
innite and apply the time evolution operator U
c
(t 0) = exp(
i
H
c
t)
to the vacuum (no-particle) state. Expressing V
c
in terms of creation and
annihilation operators of particles we obtain
3
[0(t) = e
H
c
t
[0 = (1
it
(H
0
+ V
c
) + . . .)[0
[0 + ta
[0 + td
[0 + . . .
[0 + t[abc + t[dfc + . . . (11.1)
We see that various multiparticle states ([abc, [dfc, etc.) are created from
the vacuum during time evolution. So, the vacuum is full of appearing and
disappearing virtual particles. The physical vacuum in QED is not just
an empty state without particles. It is more like a boiling soup of bare
particles, antiparticles and photons.
Similar disturbing problems become evident if we consider the time evo-
lution of one-electron states. Such behaviors have not been seen in experi-
ments. Obviously, if a theory cannot get right the time evolutions of simplest
zero-particle and one-particle states, there is no hope of predicting the time
evolution in more complex multiparticle states.
The reason for these unphysical time evolutions is the presence of unphys
(e.g., a
+d
) and renorm (a
a+b
b+d
d+f
and d
interaction terms
in (L.8). All other terms are omitted. We also omit factors i, and coecient functions
which are not relevant in this context.
4
Note that in the traditional renormalized QED S-matrix elements are calculated on
bare particle states. This appears to be in contradiction with the absence of well-dened
11.1. TROUBLES WITH RENORMALIZED QED 365
The important point is that unphys and renorm interaction terms in H
c
are absolutely harmless when the time evolution in the innite time range
(from to ) is considered. As we saw in equation (7.8), such time
evolution is represented exactly by the product of the non-interacting time
evolution operator and the S-operator
U
c
() = S
c
U
0
() = U
0
()S
c
The factor U
0
in this product leaves invariant no-particle and one-particle
states. The factor S
c
has the same property due to the cancelation of unphys
and renorm terms in F
c
as discussed in subsection 10.1.2. So, in spite of
ill-dened operators H
c
and exp(
i
H
c
t), the renormalized QED is perfectly
capable of describing scattering.
Luckily for QED, current experiments with elementary particles are not
designed to measure time-dependent dynamics in the interaction region. In-
teraction processes occur almost instantaneously in particle collisions. High
energy particle physics experiments are, basically, limited to measurements of
scattering cross-sections and energies of bound states, i.e., properties encoded
in the S-matrix. In this situation, the inability of the theory to describe time
evolution can be tolerated. But there is no doubt that time-dependent pro-
cesses in high energy physics will be eventually accessible to more advanced
experimental techniques. Time dynamics of wave functions can be resolved
in some experiments in atomic physics, e.g., with Rydberg states of atoms
[AZ91]. Moreover, time evolution is clearly observable in everyday life. So,
a consistent theory of subatomic phenomena should be able to describe such
phenomena at least in the low-energy limit. The renormalized QED cannot
do that. Clearly, without accurate description of time evolution, we cannot
claim a success in developing a comprehensive theory.
11.1.3 Unphys and renorm operators in QED
In the preceding subsection we saw that the presence of unphys and renorm
interaction operators in V
c
was responsible for unphysical time evolution of
bare particles. It is not dicult to see that the presence of such questionable
time evolution of such states and with the general understanding (see subsection 11.1.1)
that bare particles are not physical.
366 CHAPTER 11. DRESSED PARTICLE APPROACH
interaction terms is inevitable in any local quantum eld theory where inter-
action Hamiltonian is constructed as a polynomial of quantum elds [Shi07].
We saw in (J.26) and (K.2) that quantum elds of both massive and mass-
less particles always have the form of a sum (creation operator + annihilation
operator)
+
Therefore, if we constructed interaction as a product (or polynomial) of
elds,
5
we would necessarily have unphys and renorm terms there. For ex-
ample, converting a product of four elds to the normally ordered form
V
4
= (
+ )(
+ )(
+ )(
+ )
=
+ +
+ (11.2)
+
+ C (11.3)
+
(11.4)
we obtain unphys terms (11.2) together with renorm terms (11.3) and one
phys term (11.4). The presence of unphys terms is an indication that bare
states created by operators
+ . . . (11.5)
According to Table 8.2 in subsection 8.2.5, commutators of phys terms can
be only phys. Therefore, when the scattering operator F
d
is calculated from
6
Recall that decay and oscillation operators are not present in QED. So, we will not
consider them here.
368 CHAPTER 11. DRESSED PARTICLE APPROACH
V
d
via equation (8.68) only phys terms can appear there. So, F
d
has a form
similar to (11.5) and both V
d
and F
d
yield zero when acting on zero-particle
and one-particle states
V
d
[0 = V
d
[0 = 0
F
d
[0 = F
d
[0 = 0
as required by the no-self-scattering renormalization condition 10.1. More-
over, time evolutions of the vacuum and one-particle states are the same as
their free time evolutions
[0(t) = e
H
d
t
[0 =
_
1
it
(H
0
+ (V
d
)
ph
) + . . .
_
[0
=
_
1
it
H
0
+ . . .
_
[0 = e
H
0
t
[0
[(t) = e
H
d
t
[0 =
_
1
it
(H
0
+ (V
d
)
ph
) + . . .
_
[0
=
_
1
it
H
0
+ . . .
_
[0 = e
H
0
t
[
as it should be. Physically this means that, in addition to forbidding self-
scattering in zero-particle and one-particle states,
7
by demanding V
d
=
(V
d
)
phys
we also forbid any self-interaction in these states. This is an im-
portant restriction on the allowed form of interaction and we are going to
formulate this restriction as an additional postulate
Postulate 11.1 (stability of vacuum and one-particle states) There is
no (self-)interaction in the vacuum and one-particle states, i.e., the time evo-
lution of these states is not aected by interaction and is governed by the
non-interacting Hamiltonian H
0
. Mathematically, this means that the type
of the interaction Hamiltonian V
d
is phys.
Summarizing discussions from various parts of this book, we can put together
a list of conditions that should be satised by any realistic interaction
7
see Statement 10.1
11.2. DRESSING TRANSFORMATION 369
(A) Poincare invariance (Statement 3.2);
(B) instant form of dynamics (Postulate 15.2);
(C) cluster separability (Postulate 6.3);
(D) no self-interactions = phys character of V
d
(Postulate 11.1);
(E) niteness of coecient functions of interaction potentials;
(F) coecient functions should rapidly tend to zero at large values of mo-
menta
8
As we saw in subsection 11.1.3, requirement (D) practically excludes all
usual eld-theoretical Hamiltonians. The question is whether there are non-
trivial good interactions that have all the properties (A) - (F)? And the
answer is yes.
One set of examples of allowed interacting theories is provided by direct
interaction models.
9
Two-particle models of this kind were rst constructed
by Bakamjian and Thomas [BT53]. Sokolov [Sok75, SS78], Coester and Poly-
zou [CP82] showed how this approach can be extended to cover multi-particle
systems. There are recent attempts [Pol03] to extend this formalism to in-
clude description of systems with variable number of particles. In spite of
these achievements, the direct interaction approach is currently applicable
only to model systems. One of the reasons is that conditions for satisfying
the cluster separability are very cumbersome. This mathematical complexity
is evident even in the simplest 3-particle case discussed in subsection 6.3.6.
In the direct interaction approach, interactions are expressed as func-
tions of (relative) particle observables, e.g., relative distances and momenta.
However, it appears more convenient to write interactions as polynomials in
particle creation and annihilation operators (8.50). We saw in Statement 8.7
that in this case the cluster separability condition (C) is trivially satised
if coecient functions have smooth dependence on particle momenta. The
no-self-interaction condition (D) simply means that all interaction terms are
phys. The instant form condition (B) means that generators of space transla-
tions P = P
0
and rotations J = J
0
are interaction-free and that interaction V
8
According to Theorem 8.13, this condition guarantees convergence of all loop integrals
involving vertices V
d
and, therefore, the niteness of the operator S
d
.
9
Some of them were discussed in section 6.3.
370 CHAPTER 11. DRESSED PARTICLE APPROACH
commutes with P
0
and J
0
. The most dicult part is to ensure the relativistic
invariance (condition (A)), i.e., commutation relations of the Poincare group.
One way to solve this problem is to x the operator structure of interaction
terms and then nd the momentum dependence of coecient functions by
solving a set of dierential equations resulting from Poincare commutators
(6.22) - (6.26) [Kaz71, Kit66, Kit68, Kit70, Kit72b, Kit72a, Kit73]. Kita
demonstrated that there is an innite number of solutions for these equa-
tions and provided some non-trivial examples. Apparently, there should be
additional physical principles that would single out a unique theory of inter-
acting particles that agrees with experimental observations. Unfortunately,
these additional principles are not known at this moment.
11.2.2 Main idea of the dressed particle approach
The Kita-Kazes approach [Kaz71, Kit66, Kit68, Kit70, Kit72b, Kit72a, Kit73]
is dicult to apply to realistic particle interactions, so, currently, it cannot
compete with QFT. It might be more promising to abandon the idea to build
relativistic interactions from scratch and, instead, try to modify traditional
QFT to make it consistent with the requirements (D), (E) and (F). One idea
how to make this possible is to note that the S-matrix of the usual renormal-
ized QED agrees with experiments very well. So, we may add the following
requirement to the above list (A) - (F):
(G) the scattering operator S
d
in our dressed theory is exactly the same
(in each perturbation order) as the operator S
c
in renormalized QED.
We have denoted the desired phys interaction operator by V
d
. Then,
condition (G) means that the dressed Hamiltonian H
d
= H
0
+V
d
is scattering
equivalent to the renormalized QED Hamiltonian H
c
= H
0
+V
c
. According
to our discussion in subsection 7.2.1, this means that H
d
and H
c
are related
by a unitary transformation
H
d
= H
0
+ V
d
= e
i
H
c
e
i
(11.6)
= e
i
(H
0
+ V
c
)e
i
= (H
0
+ V
c
) + i[, (H
0
+ V
c
)]
1
2!
[, [, (H
0
+ V
c
)]] + . . .(11.7)
where Hermitian operator satises condition (7.32). Transformation e
i
will be called the dressing transformation.
11.2. DRESSING TRANSFORMATION 371
11.2.3 Unitary dressing transformation
Now our goal is to nd a unitary transformation e
i
, which ensures that the
dressed particle Hamiltonian H
d
satises all properties (A) - (G). In this
study we will need the following useful results
Theorem 11.2 (transformations preserving the S-operator) A unitary
transformation of the Hamiltonian
H
= e
i
He
i
preserves the S-operator if the Hermitian operator has the form (8.49) -
(8.50) where all terms
NM
have smooth coecient functions.
Idea of the proof. Assume that operator has the standard form (8.49)
- (8.50)
=
N=0
M=0
NM
NM
=
{,
}
_
dq
1
. . . dq
N
dq
1
. . . dq
M
D
NM
(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)
_
N
i=1
q
j=1
q
j
_
1
,
1
. . .
N
,
q
1
,
1
. . .
q
M
,
M
Then the left hand side of the scattering-equivalence condition (7.32) for each
term
NM
is
lim
t
e
i
H
0
t
NM
e
H
0
t
= lim
t
{,
}
_
dq
1
. . . dq
N
dq
1
. . . dq
M
D
NM
(q
1
; . . . ; q
N
; q
1
1
; . . . ; q
M
M
)
_
N
i=1
q
j=1
q
j
_
e
i
E
NM
t
1
,
1
. . .
N
,
q
1
,
1
. . .
q
M
,
M
372 CHAPTER 11. DRESSED PARTICLE APPROACH
where E
NM
is the energy function of this term. In the limits t mo-
mentum integrals tend to zero by Riemann-Lebesgue lemma B.1, because
the coecient function D
NM
is smooth, while the factor e
E
NM
t
oscillates
rapidly in the momentum space. Therefore, according to (7.33), Hamiltoni-
ans H and H
are scattering-equivalent.
Lemma 11.3 Potential B
10
is smooth
11
if B is either unphys with arbitrary
smooth coecient function or phys with a smooth coecient function, which
is identically zero on the energy shell.
Proof. The only possible source of singularity in B is the energy denom-
inator E
1
B
, which is singular on the energy shell. However, for operators
satisfying conditions of this Lemma, either the energy shell does not exist,
or the coecient function vanishes there. So, the product of the coecient
function with E
1
B
is smooth and nite on the energy shell.
We will assume that all relevant operators can be written as expansions
in powers of the coupling constant and that all series converge
H
c
= H
0
+ V
c
1
+ V
c
2
+ . . . (11.8)
H
d
= H
0
+ V
d
1
+ V
d
2
+ . . . (11.9)
=
1
+
2
+ . . . (11.10)
As usual, the subscript denotes the power of e (= the perturbation order).
Next, following the plan outlined in subsection 10.1.1, we introduce reg-
ularization cutos and , which ensure that in all perturbation orders in-
teractions and counterterms V
c
i
in the Hamiltonian of QED are non-singular
and nite. Moreover, with these cutos all loop integrals involved in calcula-
tions of products and commutators of V
c
i
become convergent. In this section
we are going to prove that in this regulated theory the operator can be
chosen so that conditions (A) - (G) are satised in all perturbation orders.
Of course, in order to get accurate results, in the end of calculations the
momentum cuto should be lifted. Only those quantities may have physi-
cal meaning, which remain nite in this limit, which will be considered in
subsection 11.2.7.
10
For denition of an underlined symbol see (7.12) and (8.65).
11
i.e., it has a smooth coecient function
11.2. DRESSING TRANSFORMATION 373
Using expansions (11.8) - (11.10) in (11.7) and collecting together terms
of equal order we obtain an innite set of equations
V
d
1
= V
c
1
+ i[
1
, H
0
] (11.11)
V
d
2
= V
c
2
+ i[
2
, H
0
] + i[
1
, V
c
1
]
1
2!
[
1
, [
1
, H
0
]] (11.12)
V
d
3
= V
c
3
+ i[
3
, H
0
] + i[
2
, V
c
1
] + i[
1
, V
c
2
]
1
2!
[
2
, [
1
, H
0
]]
1
2!
[
1
, [
2
, H
0
]]
1
2!
[
1
, [
1
, V
c
1
]]
i
3!
[
1
, [
1
, [
1
, H
0
]]] . . . (11.13)
. . .
Now we need to solve these equations order-by-order. This means that we
need to choose appropriate operators
i
=
ph
i
+
unp
i
+
ren
i
, so that inter-
action terms V
d
i
on left hand sides satisfy above conditions (B) - (G).
12
Let
us start with equation (11.11).
11.2.4 Dressing in the rst perturbation order
In renormalized QED the 1st order interaction operator V
c
1
= V
1
is unphys,
13
therefore in equation (11.11) we can choose
14
ph
1
=
ren
1
= 0 and use (8.74)
to solve the commutator equation
i[
unp
1
, H
0
] = V
1
unp
1
= iV
1
(11.14)
This choice ensures that the unphys part of V
d
1
is zero and that V
d
1
= 0, so
that conditions (B) - (F) are trivially satised in this order. The coecient
function of V
1
is non-singular. By Lemma 11.3 this implies that
unp
1
in
equation (11.14) is smooth. By Theorem 11.2, the presence of this term in
the dressing transformation e
i
does not aect the S-operator in agreement
with our condition (G).
12
We will discuss condition (A) separately in subsection 11.2.8.
13
see equation (L.8)
14
More generally, we can also choose
ph
1
to be any phys operator whose coecient
function vanishes on the energy shell. See next subsection.
374 CHAPTER 11. DRESSED PARTICLE APPROACH
11.2.5 Dressing in the second perturbation order
Now we can substitute the operator
1
found above into equation (11.12)
and obtain expression for the 2nd order dressed potential
V
d
2
= V
c
2
+ i[
2
, H
0
] [V
1
, V
1
] +
1
2!
[V
1
, V
1
]
= V
c
2
+ i[
2
, H
0
]
1
2
[V
1
, V
1
] (11.15)
It is convenient to write separately unphys, phys and renorm parts of this
equation and take into account that [
ren
2
, H
0
] = 0
(V
d
2
)
unp
= (V
c
2
)
unp
+ i[
unp
2
, H
0
]
1
2
[V
1
, V
1
]
unp
(11.16)
(V
d
2
)
ph
= (V
c
2
)
ph
+ i[
ph
2
, H
0
]
1
2
[V
1
, V
1
]
ph
(11.17)
(V
d
2
)
ren
= (V
c
2
)
ren
1
2
[V
1
, V
1
]
ren
(11.18)
Operators (V
c
2
)
unp
and (V
c
2
)
ph
are basically the same as V
unp
2
in (L.12) and
V
ph
2
in (L.11). There is also a contribution from the vertex renormalization
counterterm (10.43) in (V
c
2
)
unp
. Operator (V
c
2
)
ren
is coming from electron and
photon self-energy counterterms discussed in subsections 10.2.2 and 10.2.4,
respectively.
From the condition (D) it follows that (V
d
2
)
unp
must vanish, therefore we
should choose in (11.16)
15
unp
2
= iV
unp
2
i
2
[V
1
, V
1
]
unp
(11.19)
Operators V
1
and V
unp
2
are smooth. Then, by Lemma 11.3, the operator V
1
is also smooth and by Lemma 8.12 the commutator [V
1
, V
1
]
unp
is smooth as
well. Using Lemma 11.3 again, we see that operator
unp
2
is smooth and
by Theorem 11.2 its presence in the transformation e
i
does not aect the
S-operator. This is exactly what we need.
Let us now turn to equation (11.17) for the phys part of the dressed
particle interaction V
d
2
. What are the conditions for choosing
ph
2
? For
15
Here we use result (8.74).
11.2. DRESSING TRANSFORMATION 375
example, we cannot simply choose
ph
2
= 0, because in this case the dressed
particle interaction would acquire the form
(V
d
2
)
ph
= V
ph
2
1
2
[V
1
, V
1
]
ph
and there is absolutely no guarantee that the coecient function of (V
d
2
)
ph
rapidly tends to zero at large values of particle momenta (condition (F)). In
order to have this guarantee, we are going to choose
ph
2
such that the right
hand side of (11.17) rapidly tends to zero when momenta are far from the
energy shell. In addition, we will require that
ph
2
is non-singular.
16
Both
conditions can be satised by choosing
17
ph
2
=
_
iV
ph
2
i
2
[V
1
, V
1
]
ph
_
(1
2
) (11.20)
where
2
is a real function,
18
such that
(I)
2
is equal to 1 on the energy shell;
(II)
2
depends on rotationally invariant combinations of momenta (to make
sure that V
d
2
commutes with P
0
and J
0
);
(III)
2
is smooth;
(IV)
2
rapidly tends to zero when the arguments move away from the energy
shell.
19
16
This is needed to obey the charge renormalization postulate from subsection 10.1.3.
17
Note that this part of our dressing transformation closely resembles the similarity
renormalization procedure suggested by G lazek and Wilson [GW93, G la].
18
The arguments of
2
(particle momenta and spin projections) should be the same as
arguments of coecient functions in V
ph
2
and [V
1
, V
1
]
ph
. The small circle notation was
dened in subsection 8.2.3
19
For example, we can choose
2
= e
E
2
where is a positive constant and E is the
energy function of the operator on the right hand side of (11.20). Actually, it may happen
that loop integrals involving V
d
2
converge even without involvement of convergency factors
2
. For example, in subsection 13.6.2 we will see that in QED the loop integral in the
product V
d
2
V
d
2
converges even if
2
= 1 everywhere.
376 CHAPTER 11. DRESSED PARTICLE APPROACH
With the choice (11.20) we obtain
(V
d
2
)
ph
=
_
V
ph
2
1
2
[V
1
, V
1
]
ph
_
2
(11.21)
so that (V
d
2
)
ph
rapidly tends to zero when momenta of particles move away
from the energy shell in agreement with condition (F). Moreover, property
(I) guarantees that expression under the t-integral in (11.20) vanishes on
the energy shell. Therefore, the t-integral
20
is non-singular and, according
to Theorem 11.2,
ph
2
does not modify the S-operator, i.e., condition (G) is
satised.
Now, let us choose
ren
2
= 0 (11.22)
and prove that condition (D) is satised automatically with this choice. We
already proved (V
d
2
)
unp
= 0, so we are left to demonstrate (V
d
2
)
ren
= 0.
First note that with the above denitions (11.14), (11.19), (11.20), (11.22)
the operator
1
+
2
is smooth. Therefore, according to Theorem 11.2,
the S-operator obtained with the transformed interaction V
d
2
agrees with the
S-operator S
c
up to the second perturbation order (condition (G)). In partic-
ular, F
d
2
= F
c
2
. This would be impossible if V
d
2
contained a non-zero renorm
term. Indeed, (V
d
2
)
ren
,= 0 would imply that operator F
d
2
and, therefore, F
c
2
have non-zero renorm terms in disagreement with equation (10.6). Thus we
must conclude that (V
d
2
)
ren
= 0 and that two terms on the right hand side
of (11.18) cancel each other. This cancelation can be veried by a direct
calculation as well.
Finally, in (11.21) V
1
and V
1
are smooth operators, so, according to The-
orem 8.12, their commutator is also smooth. Operator V
ph
2
and function
2
are also smooth. So, due to Statement 8.7, we conclude that the second-order
dressed interaction (V
d
2
)
ph
is separable in accordance with our requirement
(C).
11.2.6 Dressing in arbitrary order
For any perturbation order i > 2, the selection of
i
and the proofs of (B)
- (G) are similar to those described above for the 2nd order. The dening
20
calculated by formula (8.65)
11.2. DRESSING TRANSFORMATION 377
equation for V
d
i
can be written in a general form
21
V
d
i
= V
c
i
+ i[
i
, H
0
] +
i
(11.23)
where
i
is a sum of multiple commutators involving V
c
j
from lower orders
(1 j < i) and their t-integrals (= underlines). This equation is solved
by
ren
i
= 0
unp
i
= i
unp
i
+ i(V
c
i
)
unp
,
ph
i
= i(
ph
i
+ (V
c
i
)
ph
) (1
i
) (11.24)
where functions
i
have properties (I) - (IV) from the preceding subsection.
Similar to the 2nd order discussed above, one then proves that
i
is smooth,
so that condition (G) is satised in the i-th order.
Solving equations (11.23) order-by-order we obtain the dressed particle
Hamiltonian
H
d
= e
i
H
c
e
i
= H
0
+ V
d
2
+ V
d
3
+ V
d
4
+ . . . (11.25)
which satises properties (B) - (G) as promised.
11.2.7 Innite momentum cuto limit
So far our calculations of operators and V
d
were performed under the
assumption of nite momentum cuto . This permitted us to avoid ultravi-
olet divergences in our formulas. In the complete and nal theory we must,
obviously, take the limit . Our approach can be viable only if we
can prove that all physically relevant dressed operators remain nite in this
limit.
22
It seems rather obvious that conditions (B) - (D) and (F) are independent
on the momentum cuto . Therefore, they also remain valid in the limit
21
compare with (11.15)
22
Note that operator providing the link (11.25) between Hamiltonians H
c
and H
d
does not correspond to any observable property, so it is OK if does not converge in the
large cuto limit.
378 CHAPTER 11. DRESSED PARTICLE APPROACH
. Let us now demonstrate that condition (E) is satised in this limit
as well. To do that, we note than on one hand traditional QED gives us a
perturbation series for the S-operator
S
c
= 1 + S
c
2
+ S
c
3
+ S
c
4
+ . . .
= 1 +
c
2
..
+
c
3
..
+
c
4
..
+. . .
On the other hand, in the dressed particle approach with Hamiltonian (11.25),
the S-operator can be written using formulas (8.67) and (8.69)
S
d
= 1 +
d
2
..
+
d
3
..
+
d
4
..
+. . .
= 1 + V
d
2
..
+ V
d
3
..
+ V
d
4
..
+V
d
2
V
d
2
. .
. . .
According to our condition (G), the S-operator of the dressed approach
should be equal to the renormalized QED S-operator order-by-order (S
c
i
=
S
d
i
). Thus we obtain the following set of relations between V
d
i
and S
c
i
on the
energy shell
23
V
d
2
..
= S
c
2
=
c
2
..
(11.26)
V
d
3
..
= S
c
3
=
c
3
..
(11.27)
V
d
4
..
= S
c
4
V
d
2
V
d
2
. .
=
c
4
..
V
d
2
V
d
2
. .
(11.28)
V
d
i
..
= S
c
i
+ Y
i
..
, i > 4 (11.29)
where Y
i
stands for a sum of certain products of V
d
j
from lower orders (2
j i 2) with t-integrations (underlines). The relations (11.26) - (11.29)
are independent on the cuto , so they remain valid when . In this
limit operators S
c
i
and
c
i
are nite and assumed to be known on the energy
shell from the standard renormalized QED theory. This immediately implies
23
Recall that S-operator is dened only on the energy shell. Moreover the underbrace
symbol was dened in (8.66) as V
..
= 2iV (E
V
). So, V
..
is non-zero only on the
energy shell of the operator V .
11.2. DRESSING TRANSFORMATION 379
that V
d
2
and V
d
3
are nite on the energy shell and, due to the condition (F),
they are nite for all momenta even outside the energy shell.
Can we say that operator V
d
4
is nite too? The equation for the 4th order
potential (11.28) is dierent from (11.26) and (11.27) by the presence of an
additional term
V
d
2
V
d
2
. .
(11.30)
on its right hand side. How can we be sure that this expression is nite
on the energy shell? This is where the yet undened factor
2
comes into
play. According to our discussion in subsection 11.2.6, this factor can be
chosen fast decaying, so that loop integrals present in the product (11.30)
are guaranteed to converge. See Theorem 8.13. Then operator (11.30) is
nite on the energy shell and V
d
4
in (11.28) is also nite on the energy shell
and everywhere else. These arguments can be repeated in all higher orders,
thus proving that the dressed particle Hamiltonian H
d
is free of ultraviolet
divergences.
11.2.8 Poincare invariance of the dressed particle ap-
proach
The next question is whether our theory with the transformed Hamiltonian
H
d
is Poincare invariant (condition (A))? In other words, whether there exists
a boost operator K
d
such that the set of generators P
0
, J
0
, K
d
, H
d
satis-
es Poincare commutators? With the dressing operator exp(i) constructed
above, this problem has a simple solution. If we dene K
d
= e
i
K
c
e
i
, then
we can obtain a full set of dressed generators via unitary transformation of
the old generators
24
P
0
, J
0
, K
d
, H
d
= e
i
P
0
, J
0
, K
c
, H
c
e
i
The dressing transformation e
i
is unitary and, therefore, preserves commu-
tators. Since old operators obey the Poincare commutators, the same is true
for the new generators. This proves that the transformed theory is Poincare
invariant and belongs to the instant form of dynamics [SS98].
24
Note that operator exp(i) commutes with P
0
and J
0
by construction.
380 CHAPTER 11. DRESSED PARTICLE APPROACH
11.3 Dressed interactions between particles
11.3.1 General properties of dressed potentials
One may notice that even after conditions (I) - (IV) on page 375 are satised
for functions
i
, there is a great deal of ambiguity in choosing their behavior
outside the energy shell. Therefore, the dressing transformation e
i
is not
unique and there is an innite set of dressed particle Hamiltonians that satisfy
our requirements (A) - (G). Which dressed Hamiltonian should we choose?
Before trying to answer this question, we can notice that all Hamiltonians
satisfying conditions (A) - (G) have some important common properties,
which will be described here.
Note that electromagnetic interactions are rather weak. In most situa-
tions the (expectation value) of the interaction potential energy is much less
than the sum of particle energies (mc
2
). To describe such situations it is
sucient to know the coecient functions of the interaction only near the
energy shell where we can use condition (I) and set approximately
i
1 for
each perturbation order i. This observation immediately allows us to obtain
a good approximation for the second-order interaction from equation (11.21)
by setting
2
1
V
d
2
V
ph
2
1
2
[V
1
, V
1
]
ph
(11.31)
Operator V
ph
2
can be taken from formula (L.11) and calculations involved
in [V
1
, V
1
]
ph
have been explained in subsection 9.2.1. So, obtaining the full
operator V
d
2
is not that dicult.
However, in higher perturbation orders commutator formulas (11.23) be-
come rather complicated. It is much easier to t V
d
i
directly to the renor-
malized S-operator (or its components
c
i
) of traditional QED, as described
in subsection 11.2.7. In the 2nd order we obtain from (11.26) and our as-
sumption
i
1
V
d
2
(
c
2
)
ph
which is consistent with (11.31).
Some examples of potentials present in V
d
2
are shown in Table 11.1. We
can classify them into two groups: elastic potentials and inelastic potentials.
11.3. DRESSED INTERACTIONS BETWEEN PARTICLES 381
Elastic potentials do not change the particle content of the system: they
have equal number of annihilation and creation operators of the same particle
types. As shown in subsection 8.2.8, elastic potentials correspond to particle
interactions familiar from ordinary quantum mechanics and classical physics.
Inelastic potentials change the number and/or types of particles. Among
inelastic 2nd order potentials in RQD there are potentials for pair creation,
pair annihilation, and pair conversion.
Similarly to the 2nd order discussed above, the third-order interaction V
d
3
can be unambiguously obtained near the energy shell by setting
3
1 in
(11.27)
V
d
3
(
c
3
)
ph
All 3rd order potentials are inelastic. Two of them are shown in Table
11.1: The term d
da + a
aa + . . . (11.33)
The contribution (
c
4
)
ph
in equation (11.32) is well-dened near the energy
shell, because we assume exact knowledge of the S-operator of renormalized
QED in all perturbation orders. However, there is less clarity about the con-
tribution (V
d
2
V
d
2
)
ph
. This product depends on the behavior of V
d
2
everywhere
25
See section 13.3.
382 CHAPTER 11. DRESSED PARTICLE APPROACH
Table 11.1: Examples of interaction potentials in RQD. Bold numbers in
the third column indicate perturbation orders in which explicit interaction
operators can be unambiguously obtained near the energy shell as discussed
in subsection 11.3.1.
Operator Physical meaning Perturbation
Orders
Elastic potentials
a
aa e
potential 2, 4, 6, . . .
d
da e
p
+
potential 2, 4, 6, . . .
a
ac e
aaa e
potential 4, 6, . . .
Inelastic potentials
a
cc e
e
+
pair creation 2, 4, 6, . . .
c
ab e
e
+
annihilation 2, 4, 6, . . .
d
ab conversion of e
e
+
pair to p
p
+
pair 2, 4, 6, . . .
d
da e
p
+
bremsstrahlung 3, 5, . . .
d
p
+
collision 3, 5, . . .
a
aa pair creation in e
collision 4, 6, . . .
in the momentum space. So, it depends on our global choice of
2
outside
the energy shell. The function
2
satises conditions (I) - (IV), but still there
is a great freedom which is reected in the uncertainty of V
d
4
even on the
energy shell. Therefore, we have two possibilities depending on the operator
structure of the 4th order potential we are interested in.
First, there are potentials contained only in the term (
c
4
)
ph
in (11.32)
and not present in the product (V
d
2
V
d
2
)
ph
. For example, this product does
not contain operator a
m
2
c
4
+ P
2
c
2
. Energy-momenta of 2-particle states form a dense (hatched)
region limited from below by the hyperboloid E =
_
(2m)
2
c
4
+ P
2
c
2
. Energy-
momenta of 3-particle states form a (double-hatched) region limited from
below by the hyperboloid E =
_
(3m)
2
c
4
+ P
2
c
2
, etc.
We know that dressed interaction does not aect 0-particle and 1-particle
states, so the corresponding energies remain exactly the same as in the
non-interacting case.
29
Dressed interaction does perturb states with two or
more particles. In particular, if inter-particle potentials in V
d
are attractive,
one can expect formation of hyperboloids of bound states, as shown in Fig.
11.1(b). In the next section we will illustrate the description of bound states
in RQD using the hydrogen atom as an example.
Traditional renormalized quantum eld theories also make similar state-
27
which is the most relevant for comparison with experiments
28
compare with Fig. 6.1(a)
29
See Fig. 11.1(b) and compare with Fig. 6.1(b).
11.3. DRESSED INTERACTIONS BETWEEN PARTICLES 385
ments about the energy-momentum spectrum of multiparticle states.
30
How-
ever, in eld theories these statements are not obvious. They cannot be
deduced directly from the renormalized Hamiltonian H
c
.
11.3.3 Comparison with other dressed particle approaches
In this subsection, we would like to discuss another point of view on the
dressing transformation. This point of view is philosophically dierent but
mathematically equivalent to ours. It is exemplied by the works of Shirokov
and coauthors [Shi93, Shi94, SS01]. In contrast to our approach in which the
dressing transformation e
i
was applied to the eld-theoretical Hamiltonian
H
c
of QED while (bare) particle creation and annihilation operators were not
aected, Shirokov et al. kept the H
c
intact, but applied the (inverse) dressing
transformation e
i
to creation and annihilation operators of particles
d
= e
i
e
i
d
= e
i
e
i
to the vacuum state
[0
d
= e
i
[0
and to particle observables. Physically, this means that instead of bare par-
ticles (created and annihilated by
d
and
d
, respectively), i.e., particles together with their virtual
clouds. Within this approach the Hamiltonian H
c
must be expressed as a
function of the new particle operators H
c
= T(
d
,
d
). Apparently, the same
function T expresses the Hamiltonian H
d
of our approach through original
(bare) particle operators: H
d
= T(
, )e
i
= T(e
i
e
i
, e
i
e
i
)
= T(
d
,
d
)
30
See, for example, Fig. 17.4 in [Sch61], Fig. 16.1 in [BD65] and Fig. 7.1 in [PS95b].
386 CHAPTER 11. DRESSED PARTICLE APPROACH
So, mathematically, these two approaches are equivalent. Let us demon-
strate this equivalence on a simple example. Suppose we want to calculate
a trajectory (=the time dependence of the expectation value of the position
operator) of the electron in a 2-particle system (electron + proton). In our
approach the initial state of the system has two particles
[ = a
[0
and the expectation value of the electrons position is given by formula
r(t) = [R(t)[
= 0[dae
i
H
d
t
Re
H
d
t
a
[0
= 0[da(e
i
e
i
H
c
t
e
i
)R(e
i
e
H
c
t
e
i
)a
[0 (11.34)
where R is the position operator for the electron. However, we can rewrite
this expression in the following form characteristic for the Shirokovs ap-
proach
r(t) = 0[e
i
(e
i
dae
i
)e
i
H
c
t
(e
i
Re
i
)e
H
c
t
(e
i
a
e
i
)e
i
[0
=
d
0[d
d
a
d
e
i
H
c
t
R
d
e
H
c
t
a
d
d
d
[0
d
=
d
0[d
d
a
d
R
d
(t)a
d
d
d
[0
d
(11.35)
where the time evolution is generated by the original Hamiltonian H
c
, while
dressed denitions are used for the vacuum state, particle operators and
the position observable
[0
d
= e
i
[0
a
d
, d
d
, a
d
, d
d
= e
i
a, d, a
, d
e
i
R
d
= e
i
Re
i
In spite of dierent formalisms, physical predictions of both theories, e.g.,
expectation values of observables (11.34) and (11.35), are exactly the same.
An interesting and somewhat related approach to particle interactions in
QFT was recently developed by Weber and co-authors [Webb, Weba, WL02,
WL].
Chapter 12
COULOMB POTENTIAL
AND BEYOND
This work contains many things which are new and interesting.
Unfortunately, everything that is new is not interesting, and ev-
erything which is interesting, is not new.
Lev D. Landau
In the preceding chapter we obtained formulas (11.31) and (11.32) for the
dressed particle Hamiltonian H
d
in a rather abstract form. In this chapter
we would like to demonstrate how this Hamiltonian can be cast into a form
suitable for calculations, i.e., expressed through creation and annihilation
operators of electrons, protons, photons, etc. Here we will focus on pair
interactions between electrons and protons in the lowest (second) order of
the perturbation theory.
1
In the (v/c)
2
approximation we will obtain what
is commonly known as the Darwin-Breit potential. The major part of this
interaction is the usual Coulomb potential. In addition, there are relativistic
corrections responsible for magnetic, contact, spin-orbit, spin-spin and other
interactions which are routinely used in relativistic calculations of atomic and
molecular systems. This derivation demonstrates how formulas familiar from
ordinary quantum mechanics and classical electrodynamics follow naturally
from RQD. In section 12.2 we will solve the stationary Scrodinger equation
1
Other potentials in Table 11.1) can be obtained by similar methods.
387
388 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
with H
d
and obtain energy spectrum of the hydrogen atom with relativistic
corrections.
12.1 Darwin-Breit Hamiltonian
12.1.1 Electron-proton potential in the momentum space
Note that the second-order dressed interaction near the energy shell (11.31)
is given by the same formula as F
2
in (9.18). So, for the electron-proton
potential we can simply reuse our result (9.25)
V
d
2
[d
da] = V
d
2A
+ V
d
2B
+ V
d
2C
(12.1)
V
d
2A
=
e
2
2
(2)
3
_
dkdqdpMmc
4
_
q+k
pk
1
k
2
W
0
(p k, ; p,
)U
0
(q +k, ; q,
)
d
pk,
a
q+k,
d
p,
a
q,
(12.2)
V
d
2B
=
e
2
2
c
2
(2)
3
_
dkdqdpMmc
4
_
q+k
pk
1
( q +
k q)
2
W(p k, ; p,
) U(q +k, ; q,
)
d
pk,
a
q+k,
d
p,
a
q,
(12.3)
V
d
2C
=
e
2
2
c
2
(2)
3
_
dkdqdpMmc
4
_
q+k
pk
1
( q +
k q)
2
k
2
(k W(p k, ; p,
))
d
pk,
a
q+k,
d
p,
a
q,
(12.4)
In these formulas we integrate over the electron (q), proton (p) and trans-
ferred (k) momenta and sum over spin projections of the two particles ,
and ,
.
Operator (12.1) has non-trivial action in all sectors of the Fock space
which contain at least one electron and one proton. However, for simplicity,
we will limit our attention to the 1 proton + 1 electron subspace 1
pe
in
12.1. DARWIN-BREIT HAMILTONIAN 389
the Fock space. If (p, ; q, ) is the wave function of a two-particle state,
then the interaction Hamiltonian (12.1) will transform it to
2
(p, , q, ) = V
d
2
[d
da](p, ; q, )
=
_
dkv
2
(p, q, k; , ,
)(p k,
; q +k,
) (12.5)
where v
2
is the coecient function of the interaction operator V
d
2
[d
da].
We are going to write our formulas with the accuracy of (v/c)
2
. So, we use
(J.66) - (J.69), (J.65) and (J.64) to obtain the coecient function in (12.5)
as a sum of three terms
v
2
= v
2A
+ v
2B
+ v
2C
(12.6)
where
3
v
2A
=
e
2
2
(2)
3
(el)
(pr)
_
1
k
2
1
8M
2
c
2
1
8m
2
c
2
i
pr
[k p]
4M
2
c
2
k
2
+ i
el
[k q]
4m
2
c
2
k
2
_
(el)
(pr)
v
2B
=
e
2
2
(2)
3
(el)
(pr)
_
pq
Mmc
2
k
2
kq
2Mmc
2
k
2
+
pk
2Mmc
2
k
2
1
4Mmc
2
i[
pr
k] q
2Mmc
2
k
2
+
ip [
el
k]
2Mmc
2
k
2
+
(
el
pr
)
4Mmc
2
(
pr
k)(
el
k)
4Mmc
2
k
2
_
(el)
(pr)
v
2C
=
e
2
2
(2)
3
k
4
1
4Mmc
2
(el)
(pr)
(2pk k
2
)(2qk + k
2
)
(el)
(pr)
=
e
2
2
(2)
3
(el)
(pr)
_
(pk)(qk)
Mmc
2
k
4
qk
2Mmc
2
k
2
+
pk
2Mmc
2
k
2
1
4Mmc
2
_
(el)
(pr)
2
See subsection 8.2.8.
3
We used properties of Pauli matrices from Appendix H.6 and formulas from Appendix
J.9. Our calculations in this section can be compared with 83 in ref. [BLP01] and with
[Ito65].
390 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
Putting these three terms together we nally rewrite (12.6) in the form
v
2
(p, q, k; , ,
)
=
e
2
2
(2)
3
(el)
(pr)
1
k
2
+
1
8M
2
c
2
+
1
8m
2
c
2
+
pq
Mmc
2
k
2
(pk)(qk)
Mmc
2
k
4
+ i
pr
[k p]
4M
2
c
2
k
2
i
el
[k q]
4m
2
c
2
k
2
i
pr
[k q]
2Mmc
2
k
2
+ i
el
[k p]
2Mmc
2
k
2
+
(
el
pr
)
4Mmc
2
(
pr
k)(
el
k)
4Mmc
2
k
2
_
(el)
(pr)
(12.7)
In most applications we can assume that the proton is innitely heavy (M
m) and skip terms with M in denominators. Then
v
2
(p, q, k; , ,
)
e
2
2
(2)
3
,
(el)
1
k
2
+
1
8m
2
c
2
i
el
[k q]
4m
2
c
2
k
2
_
(el)
(12.8)
12.1.2 Position representation
The physical meaning of interaction (12.7) is more transparent in the po-
sition representation, which is derived by replacing variables p and q with
dierential operators p = i(d/dx) and q = i(d/dy) and taking the
Fourier transform
4
V
d
2
[d
da](x, ; y, )
=
_
dke
i
k(xy)
v
2
( p, q, k; , ,
)(x,
; y,
)
=
e
2
2
(2)
3
_
dke
i
k(xy)
_
1
k
2
+
1
8M
2
c
2
+
1
8m
2
c
2
+
p q
Mmc
2
k
2
( pk)( qk)
Mmc
2
k
4
+i
pr
[k p]
4M
2
c
2
k
2
i
el
[k q]
4m
2
c
2
k
2
i
pr
[k q]
2Mmc
2
k
2
+ i
el
[k p]
2Mmc
2
k
2
+
(
el
pr
)
4Mmc
2
(
pr
k)(
el
k)
4Mmc
2
k
2
_
(x, ; y, )
4
see subsection 8.2.8; x is the protons position and y is the electrons position
12.1. DARWIN-BREIT HAMILTONIAN 391
Using integral formulas (B.7) - (B.11) we obtain the following position-space
representation of this potential (where r x y)
5
V
d
2
[d
da] =
e
2
4r
+
e
2
2
8c
2
_
1
M
2
+
1
m
2
_
(r) +
e
2
8Mmc
2
r
_
p q +
(r q)(r p)
r
2
_
e
2
[r p]
pr
16M
2
c
2
r
3
+
e
2
[r q]
el
16m
2
c
2
r
3
+
e
2
[r q]
pr
8Mmc
2
r
3
e
2
[r p]
el
8Mmc
2
r
3
+
e
2
2
4Mmc
2
_
pr
el
4r
3
+ 3
(
pr
r)(
el
r)
4r
5
+
2
3
(
pr
el
)(r)
_
(12.9)
With the accuracy of (v/c)
2
the free Hamiltonian H
0
can be written as
H
0
=
_
M
2
c
4
+ p
2
c
2
+
_
m
2
c
4
+ q
2
c
2
= Mc
2
+ mc
2
+
p
2
2M
+
q
2
2m
p
4
8M
3
c
2
q
4
8m
3
c
2
+ . . .
where the rest energies of particles Mc
2
and mc
2
are simply constants, which
can be eliminated by a proper choice of zero on the energy scale. Note
also that Pauli matrices are proportional to particle spin operators (H.5):
S
el
=
2
el
,
S
pr
=
2
pr
. So, nally, the QED Hamiltonian responsible for
the electron-proton interaction in the 2nd order is obtained in the form of
Darwin-Breit potential
H
d
=
H
0
+
V
d
2
( p, q, r,
S
el
,
S
pr
) + . . .
=
p
2
2M
+
q
2
2m
+
V
Coulomb
+
V
orbit
+
V
spinorbit
+
V
spinspin
+ . . . (12.10)
This form is similar to the familiar non-relativistic Hamiltonian in which
p
2
/(2M) + q
2
/(2m) is treated as the kinetic energy operator and
V
Coulomb
is
the usual Coulomb interaction between two charged particles
V
Coulomb
=
e
2
4r
(12.11)
5
Some of these interaction terms are non-Hermitian due to the non-commutativity of
operators r and p, q. This minor problem can be solved by symmetrizing non-commutative
products, e.g., by replacing AB (AB +BA)/2.
392 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
This is the only interaction term which survives in the non-relativistic limit
c .
V
orbit
is a spin-independent relativistic correction to the Coulomb
interaction
V
orbit
=
p
4
8M
3
c
2
q
4
8m
3
c
2
+
e
2
2
8c
2
_
1
M
2
+
1
m
2
_
(r)
+
e
2
8Mmc
2
r
_
p q +
(r q)(r p)
r
2
_
(12.12)
The rst two terms do not depend on relative variables, so they can be
regarded as relativistic corrections to energies of single particles. The contact
interaction (proportional to
2
(r)) can be neglected in the classical limit
0. Keeping the (v/c)
2
accuracy and substituting p/M v
pr
and
q/m v
el
the remaining terms can be rewritten in a more familiar form of
the Darwin potential [Bre68]
V
Darwin
=
e
2
8c
2
r
_
v
el
v
pr
+
(r v
pr
)(r v
el
)
r
2
_
(12.13)
which describes velocity-dependent (magnetic) interaction between charged
particles.
Two other terms
V
spinorbit
and
V
spinspin
in (12.10) depend on particle
spins
V
spinorbit
=
e
2
[r p]
S
pr
8M
2
c
2
r
3
+
e
2
[r q]
S
el
8m
2
c
2
r
3
+
e
2
[r q]
S
pr
4Mmc
2
r
3
e
2
[r p]
S
el
4Mmc
2
r
3
(12.14)
V
spinspin
=
e
2
Mmc
2
_
S
pr
S
el
4r
3
+ 3
(
S
pr
r)(
S
el
r)
4r
5
+
2
3
(
S
pr
S
el
)(r)
_
(12.15)
Since our dressing transformation preserved commutation relations of the
Poincare Lie algebra,
6
we can be condent that the Darwin-Breit Hamilto-
nian is relativistically invariant, at least up to the order (v/c)
2
. In Appendix
N.3 we additionally verify this important fact by a direct calculation.
7
6
See subsection 11.2.8.
7
see also [CV68, CO70, KF74]
12.2. HYDROGEN ATOM 393
The Darwin-Breit Hamiltonian was successfully applied to various elec-
tromagnetic problems, such as the ne structure of atomic spectra [BLP01,
Bre68], superconductivity and properties of plasma [Ess07, Ess95, Ess96,
Ess99]. In chapter 14 we will see that in the classical limit this Hamilto-
nian reproduces correctly all major results of classical electrodynamics. In
chapter 13.6 we will calculate small radiative corrections to the Darwin-Breit
potential.
12.2 Hydrogen atom
Having derived the electron-proton interaction potential, now we can study
the bound state of these two particles - the hydrogen atom. We are interested
in energies and wave functions of its stationary states. In the dressed particle
approach, this task is accomplished simply by diagonalizing the dressed par-
ticle Hamiltonian in the electron+proton sector 1
pe
of the Fock space. In
other words, the stationary Schr odinger equation needs to be solved. In this
section, we will study this solution with the Hamiltonian (12.10) including
interaction terms up to the 2nd perturbation order. Higher order corrections
will be considered in chapter 13.6.
12.2.1 Non-relativistic Schrodinger equation
We can use the fact that Hamiltonian (12.10) commutes with the operator of
total momentum P = p+q. Therefore this Hamiltonian leaves invariant the
zero-total-momentum subspace of 1
pe
. Working in this subspace we can set
Q = i
r
If we make these substitutions in (12.10), then the energies and wave func-
tions
H
d
_
i
r
, r, S
el
, S
pr
_
(r, , ) =
(r, , ) (12.16)
394 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
Analytical solution of equation (12.16) is not possible. Realistically, one can
rst solve equation (12.16) leaving just the Coulomb interaction term (12.11)
there and rewriting the rst two terms in (12.10) as
p
2
2M
+
q
2
2m
=
(m+ M)
Q
2
2mM
=
Q
2
2
where = mM/(m + M) m is the reduced mass. In this approximation
equation (12.16) takes the form
_
Q
2
2
e
2
4r
_
(r, , ) =
(r, , ) (12.17)
It does not depend on spin variables , , so the solution can be written as a
product of orbital and spin parts
(r, , ) =
(r)(, )
The energy is independent on the spin part (, ), which can be chosen
as an arbitrary set of four complex numbers, satisfying the normalization
condition
[(1/2, 1/2)[
2
+[(1/2, 1/2)[
2
+[(1/2, 1/2)[
2
+[(1/2, 1/2)[
2
= 1
The orbital parts and their energy eigenvalues satisfy the dierential equation
_
2
2
2
r
2
e
2
4r
_
(r) =
(r) (12.18)
or in spherical coordinates
_
2
2
_
1
r
2
r
_
r
2
r
_
+
1
r
2
sin
_
sin
_
+
1
r
2
sin
2
2
_
e
2
4r
_
(r, , )
=
(r, , )
12.2. HYDROGEN ATOM 395
This is the familiar non-relativistic problem with a well-known analytical
solution, which can be found in any textbook on quantum mechanics, e.g.,
[Bal98, LL77]. Eigenstates will be labeled by the principal (n), orbital (l)
and magnetic (m) quantum numbers. Energy eigenvalues are degenerate
with respect to l and m
(n, l, m) =
c
2
2
2n
2
(12.19)
Few low energy solutions are shown in table 12.1, where a
0
4
2
/(e
2
)
/(mc) denotes the Bohr radius and e
2
/(4c) 1/137 is the ne
structure constant.
Table 12.1: Normalized low energy solutions for non-relativistic hydrogen
atom
State(n, l, m) Wave function
(r, , ) Energy ()
1S(1, 0, 0)
1
a
3
0
e
r/a
0
c
2
2
/2 = -13.6 eV
2S(2, 0, 0)
1
4
2a
3
0
(2
r
a
0
)e
r/(2a
0
)
c
2
2
/8 = -3.4 eV
2P(2, 1, 0)
1
4
2a
3
0
r
a
0
e
r/(2a
0
)
cos c
2
2
/8 = -3.4 eV
2P(2, 1, 1)
1
8
a
3
0
r
a
0
e
r/(2a
0
)
sin e
i
c
2
2
/8 = -3.4 eV
2P(2, 1, 1)
1
8
a
3
0
r
a
0
e
r/(2a
0
)
sin e
i
c
2
2
/8 = -3.4 eV
For further calculations we will need expectation values for inverse powers
of r in dierent eigenstates. For example
r
1
(2S)
_
dr
2S
(r)
1
r
2S
(r) =
1
8a
3
0
_
0
drr
_
2
r
a
0
_
2
e
r/a
0
=
1
8a
3
0
_
4
_
0
drre
r/a
0
4
a
0
_
0
drr
2
e
r/a
0
+
1
a
2
0
_
0
drr
3
e
r/a
0
_
=
1
8a
3
0
_
4a
2
0
8a
2
0
+ 6a
2
0
_
=
1
4a
0
These results are shown in Table 12.2 along with probability densities at the
origin [(0)[
2
.
396 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
Table 12.2: Properties of low energy solutions for non-relativistic hydrogen
atom
State [(0)[
2
r
1
r
2
r
3
1S
1
a
3
0
1
a
0
2
a
2
0
2S
1
8a
3
0
1
4a
0
1
4a
2
0
2P 0
1
4a
0
1
12a
2
0
1
24a
3
0
12.2.2 Relativistic energy corrections (orbital)
In the preceding subsection we obtained energies and wave functions
for
a simple model of the hydrogen atom in which the electron-proton interaction
is approximated by the Coulomb potential e
2
/(4r). We can consider these
results as a zero-order approximation for the full exact solution. Then other
interaction terms in (12.10) can be treated as a perturbation V
pert
V
orbit
+
V
spinorbit
+V
spinspin
. In the rst approximation, this perturbation does not
aect the wave functions but shift energies [Bal98]. The resulting energy
correction for the state [
[V
pert
[
(12.20)
Then perturbations V
orbit
and V
spinorbit
are responsible for the ne structure
of the hydrogen atom and V
spinspin
is responsible for the hyperne structure
(see Fig. 12.1). More details can be found in 84 of ref. [BLP01].
Let us rst calculate energy level corrections due to the perturbation
V
orbit
. We will take into account that M m, thus ignoring terms propor-
tional to inverse powers of M in (12.12)
8
and assuming that = m. The
energy correction due to the second term in (12.12) is
relat
=
1
8m
3
c
2
_
dr
Q
4
(12.21)
If
Q
4
=
Q
2
Q
2
= 2m
Q
2
_
+
e
2
4r
_
= 2m
2
r
_
r
+
r
_
e
2
4r
__
= 2m
2
_
r
2
+
e
2
4
_
2
r
2
1
r
_
+
e
2
2
_
r
1
r
r
_
+
e
2
4r
r
2
_
= 2m
2
_
2m
2
_
+
e
2
4r
_
2
e
2
(r)
+
e
2
2
_
r
1
r
r
_
_
Using expression for the gradient in spherical coordinates
10
f(r, , )
r
=
f
r
r +
r sin
f
r
f
in (12.21) we obtain
relat
=
2
4m
2
c
2
_
dr
2m
2
_
+
e
2
4r
_
2
e
2
(r)
e
2
2r
2
r
_
The last term in square brackets can be evaluated as
11
e
2
2
_
dr
1
r
2
r
=
e
2
2
_
2
0
d
_
0
sin d
_
0
dr
r
=
e
2
4
_
2
0
d
_
0
sin d
_
0
dr
[
[
2
r
=
e
2
4
_
2
0
d
_
0
sin d[
(0)[
2
= e
2
[
(0)[
2
(12.22)
The second term in square brackets is
10
Here r r/r,
,
are unit vectors directed along directions of growth of the corre-
sponding coordinates.
11
we take into account that
(r, , ) 0 as r
12.2. HYDROGEN ATOM 399
e
2
_
dr
(r)
= e
2
[
(0)[
2
so it cancels with (12.22) and
relat
=
1
2mc
2
_
dr
2
+
e
2
2r
+
e
4
16
2
r
2
_
=
1
2mc
2
_
2
+
e
2
2
r
1
+
e
4
16
2
r
2
_
Energy correction due to the third term in (12.12) is
contact
=
e
2
2
8m
2
c
2
_
dr(r)[
(r)[
2
=
e
2
2
8m
2
c
2
[
(0)[
2
(12.23)
Using data from Tables 12.1 and 12.2 we obtain orbital energy corrections
for individual states
nlm
as shown in the 2nd and 3rd rows of Table 12.3.
Table 12.3: 2nd order perturbative relativistic energy corrections to low-lying
states of the hydrogen atom.
1S
1/2
2S
1/2
2P
1/2
2P
3/2
non-relativistic energy (12.19)
mc
2
2
2
mc
2
2
8
mc
2
2
8
mc
2
2
8
Energy corrections:
relativistic (12.21)
1
8m
3
c
2
Q
4
5mc
2
4
8
13mc
2
4
128
7mc
2
4
384
7mc
2
4
384
contact (12.23)
e
2
2
8m
2
c
2
(r)
mc
2
4
2
mc
2
4
16
0 0
spin-orbit (12.24)
e
2
8m
2
c
2
LS
el
r
3
0 0
mc
2
4
48
mc
2
4
48
Total correction
mc
2
4
8
5mc
2
4
128
5mc
2
4
128
mc
2
4
384
400 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
12.2.3 Relativistic energy corrections (spin-orbital)
Let us now consider the eect of the spin-orbit interaction (12.14)
V
spinorbit
e
2
[r q]
S
el
8m
2
c
2
r
3
=
e
2
L
S
el
8m
2
c
2
r
3
(12.24)
where L = [r
Q] is the orbital angular momentum of the atom. This
interaction does not act on states with l = 0, which are eigenvectors of the
orbital angular momentum operator L
2
with eigenvalue 0. So, we need to
consider only 2P-states, where l = 1.
Totally, there are 6 dierent substates in 2P: those with dierent com-
binations of l = 1, 0, 1 and s = 1/2, 1/2. In these substates the total
angular momentum
12
J = L + S
el
can be either j = (1 (1/2)) = /2
or j = (1 + (1/2)) = 3/2. So, the 6 substates separate into two groups.
One group of two states corresponds to j = 1/2. It is denoted by 2P
1/2
. The
other group of four states corresponds to j = 3/2 and is denoted by 2P
3/2
.
The non-perturbed Hamiltonian
H
ep
=
Q
2
2m
e
2
4r
(12.25)
commutes with the orbital angular momentum operator L, with the electron
spin operator S
el
and with the total angular momentum operator J. So, all
six substates are degenerate with respect to (12.25).
On the other hand, the total Hamiltonian (12.10) commutes only with
the total angular momentum J and it does not commute with L and S
el
.
Therefore, the total energies of the two groups 2P
1/2
and 2P
3/2
are dierent.
Let us demonstrate the eect of perturbation (12.24) on the state 2P
1/2
. We
use formula
J
2
= (L +S
el
)
2
= L
2
+ S
2
el
+ 2(L S
el
)
Then
12
Here we ignore the protons spin S
pr
whose contribution to the energy can be ignored
in our approximation.
12.2. HYDROGEN ATOM 401
J
2
2P
1/2 =
2
j(j + 1)
2P
1/2 = 3/4
2
2P
1/2
L
2
2P
1/2 =
2
l(l + 1)
2P
1/2 = 2
2
2P
1/2
S
2
el
2P
1/2 =
2
s(s + 1)
2P
1/2 = 3/4
2
2P
1/2
(L S
el
)
2P
1/2 =
(J
2
L
2
S
2
el
)
2
2P
1/2 =
(3/4 2 3/4)
2
2
2P
1/2
=
2
2P
1/2
spinorbit
(2P
1/2
) =
e
2
2
8m
2
c
2
r
3
=
mc
2
4
48
A similar calculation gives us the spin-orbit correction to the energy of 2P
3/2
spinorbit
(2P
3/2
) =
mc
2
4
48
One can see from Table 12.3 that the total 2nd order energy corrections to
states 2S
1/2
and 2P
1/2
are the same. So, these two states remain degenerate
in our approximation
(2S
1/2
) (2P
1/2
) = 0 (12.26)
In chapter 13.6 we will derive 4th order radiative corrections to the electron-
proton interaction potential. These corrections will result in a small gap
between 2S
1/2
and 2P
1/2
levels, which is known as the Lamb shift.
402 CHAPTER 12. COULOMB POTENTIAL AND BEYOND
Chapter 13
DECAYS AND RADIATION
Many things are incomprehensible to us not because our compre-
hension is weak, but because those things are not within the frames
of our comprehension.
Kozma Prutkov
The formulation of quantum theory in the Fock space with unspecied
number of particles gives us an opportunity to describe not just inter-particle
interactions, but also processes of creation and absorption of particles. A
simplest example of such processes is the decay of an unstable particle. This
is the topic of the present chapter.
Unstable particles are interesting objects for study for several reasons.
First, an unstable particle is a rare example of a quantum interacting system
whose time evolution can be observed relatively easily. This time evolution
is especially simple, because in many cases it can be described by just one
parameter - the non-decay probability . Second, a rigorous description of
the decay is possible in a small portion of the Fock space that contains only
states of the particle and its decay products, so a rather accurate solution of
this time-dependent problem can be obtained in a closed form.
In sections 13.1 - 13.2 we will discuss the decay law of a general unstable
system at rest. Single particle decays are forbidden in quantum electrody-
namics, because, as we discussed in subsection 8.2.4, there are no decay type
interactions in the Hamiltonian of QED. However, even in QED decays may
403
404 CHAPTER 13. DECAYS AND RADIATION
occur in compound systems. Section 13.3 studies a specic example of an
unstable QED system an excited energy level of the hydrogen atom. Based
on RQD approach, we calculate the probability of the photon emission from
such a state. In section 13.4 we are interested in the decay law observed
from a moving reference frame. In section 13.5 it is shown that the famous
Einsteins time dilation formula is not exactly applicable to such decays.
13.1 Unstable system at rest
In this section we will pursue two goals. The rst goal is to present a pre-
liminary material for our discussion of decays of moving particles in sections
13.4 and 13.5. The second goal is to derive a beautiful result, due to Breit
and Wigner, which explains why the time dependence of particle decays is
(almost) always exponential.
13.1.1 Quantum mechanics of particle decays
The decay of unstable particles is described mathematically by the non-decay
probability which has the following denition. Suppose that we have a piece
of radioactive material with N unstable nuclei prepared simultaneously at
time t = 0 and denote N
u
(t) the number of nuclei that remain undecayed at
time t > 0. So, at each time point the piece of radioactive material can be
characterized by the ratio N
u
(t)/N.
In this paper, in the spirit of quantum mechanics, we will treat N unstable
particles as an ensemble of identically prepared systems and consider the
ratio N
u
(t)/N as a property of a single particle (nucleus) the probability of
nding this particle in the undecayed state. Then the non-decay probability
(t)
1
is dened as a large N limit
(t) = lim
N
N
u
(t)/N (13.1)
Let us now turn to the description of an isolated unstable system from
the point of view of quantum theory. We will consider a model theory with
1
Function (t) will be called the decay law of the particle. Perhaps, it would be more
consistent to call this quantity the non-decay law or survival probability.
13.1. UNSTABLE SYSTEM AT REST 405
particles a, b and c, so that particle a is massive and unstable, while its decay
products b and c are stable and their masses satisfy the inequality
m
a
> m
b
+ m
c
(13.2)
which makes the decay a b +c energetically possible. In order to simplify
calculations and avoid being distracted by issues that are not relevant to the
problem at hand we assume that particle a is spinless and has only one decay
channel. For our discussion, the nature of this particle is not important. For
example, this could be a muon or a radioactive nucleus or an atom in an
excited state.
Observations performed on the unstable system may result in only two
outcomes. One can nd either a non-decayed particle a intact or its decay
products b + c. Thus it is appropriate to describe states of this system in
just two sectors of the Fock space
2
1 = 1
a
1
bc
(13.3)
where 1
a
is the subspace of states of the unstable particle a and 1
bc
1
b
1
c
is the orthogonal subspace of the decay products.
The simplest interaction Hamiltonian of the type (8.50) that can be re-
sponsible for the decay a b + c is
3
V =
_
dpdq
_
G(p, q)a
p+q
b
p
c
q
+ G
(p, q)b
p
c
q
a
p+q
_
(13.4)
As expected, the interaction operator (13.4) leaves invariant the sector 1 =
1
a
1
bc
of the total Fock space.
2
In principle, a rigorous description of systems involving these three types of particles
must be formulated in the full Fock space where particle numbers N
a
, N
b
and N
c
are
allowed to take any values from zero to innity. However, for most unstable particles the
interaction between decay products b and c in the nal state can be ignored. Creation of
additional particles due to this interaction can be ignored too. So, considering only the
subspace (13.3) in the full Fock space is a reasonable approximation.
3
Note that in order to have a Hermitian Hamiltonian we need to include in the inter-
action both the term b
Ht
[ (13.8)
and the decay law is
(t) = [e
i
Ht
Te
Ht
[ (13.9)
From this equation it is clear that the Hamiltonian H describing the unstable
system should not commute with the projection operator T
[H, T] ,= 0 (13.10)
13.1. UNSTABLE SYSTEM AT REST 407
HH
aa
HH
bc
t=0
t>0
(tt))
11
Figure 13.1: Time evolution of the state vector of an unstable system.
Otherwise, the subspace 1
a
of states of the particle a would be invariant
with respect to time translations and the particle would be stable.
Now we can suggest a schematic visual representation of the decay process
in the Hilbert space. In Fig. 13.1 we show the full Hilbert space 1
a
1
bc
as
a sum of two orthogonal subspaces 1
a
and 1
bc
. We assume that the initial
normalized state vector [ at time t = 0 lies entirely in the subspace 1
a
. So
that the non-decay probability (0) is equal to 1 as in equation (13.7). From
equation (13.10) we know that the subspace 1
a
is not invariant with respect
to time translations. Therefore, at time t > 0 the vector [(t) e
Ht
[
develops a component
5
lying in the subspace of decay products 1
bc
. The
decay is described as a gradual time evolution of the state vector from the
subspace of unstable particle 1
a
to the subspace of decay products 1
bc
. Then
the non-decay probability (t) decreases with time.
Before calculating the decay law (13.9) we will need to do some prepara-
tory work rst. In subsections 13.1.2 - 13.1.4 we are going to construct two
useful bases. One is the basis [p of eigenvectors of the total momentum
4
In this chapter we are working in the Schrodinger picture.
5
shown by a broken-line arrow in the gure
408 CHAPTER 13. DECAYS AND RADIATION
operator P
0
in 1
a
. Another is the basis [p, m of common eigenvectors of
P
0
and the interacting mass operator M in 1.
13.1.2 Non-interacting representation of the Poincare
group
Let us rst consider a simple case when the interaction responsible for the
decay is turned o. This means that dynamics of the system is governed
by the non-interacting representation of the Poincare group U
0
g
in 1.
6
This
representation is constructed in accordance with the structure of the Hilbert
space (13.3) as
U
0
g
U
a
g
(U
b
g
U
c
g
) (13.11)
where U
a
g
, U
b
g
and U
c
g
are unitary irreducible representations of the Poincare
group corresponding to particles a, b and c, respectively. Generators of this
representation are denoted by P
0
, J
0
, H
0
and K
0
. According to (13.2), the
operator of non-interacting mass
M
0
= +
1
c
2
_
H
2
0
P
2
0
c
2
has a continuous spectrum in the interval [m
b
+m
c
, ) and a discrete point
m
a
embedded in this interval.
From denition (13.11) it is clear that the subspaces 1
a
and 1
bc
are
separately invariant with respect to U
0
g
. Moreover, the projection operator
T commutes with non-interacting generators
[T, P
0
] = [T, J
0
] = [T, K
0
] = [T, H
0
] = 0 (13.12)
Exactly as we did in subsection 5.1.2, we can use the non-interacting rep-
resentation U
0
g
to build a basis [p of eigenvectors of the momentum operator
P
0
in the subspace 1
a
. Then any state [ 1
a
can be represented by a
linear combination of these basis vectors
6
see subsection 8.1.2
13.1. UNSTABLE SYSTEM AT REST 409
[ =
_
dp(p)[p (13.13)
and the projection operator T can be written as
T =
_
dp[pp[ (13.14)
13.1.3 Normalized eigenvectors of momentum
Basis vectors [p are convenient for writing arbitrary states [ 1
a
as
linear combinations (13.13). However vectors [p themselves are not good
representatives of quantum states, because they are not normalized. For
example, the momentum space wave function of the basis vector [q is a
delta function
(p) = p[q = (p q) (13.15)
and the corresponding probability of nding the particle is innite
_
dp[(p)[
2
=
_
dp[(p q)[
2
=
Therefore, states [q cannot be used in formula (13.5) to calculate the non-
decay probability. If we would like to have a method to calculate the decay
law for states with denite (or almost denite) momentum p
0
, then we should
use state vectors
7
that have normalized momentum-space wave functions
sharply localized near p
0
. In order to satisfy the normalization condition
_
dp[(p)[
2
= 1
wave functions of [p
0
) may be formally represented as a square root of the
Diracs delta function
8
7
which we denote by [p
0
) to distinguish them from [p
0
8
Another way to achieve the same goal would be to keep the delta-function represen-
tation (13.15) of denite-momentum states, but use (formally vanishing) normalization
factors, like N = (
_
dp[(p)[
2
)
1/2
. Perhaps such manipulations with innitely large and
innitely small numbers can be justied within non-standard analysis [Fri94].
410 CHAPTER 13. DECAYS AND RADIATION
(p) =
_
(p p
0
) (13.16)
According to equation (5.34), the exponent of the Newton-Wigner posi-
tion operator e
i
R
0z
b
acts as a translation operator in the momentum space.
In particular, we can apply this operator to the wave function with zero mo-
mentum
_
(p) and obtain a wave function localized at momentum p
0
=
(0, 0, m
a
c sinh )
e
i
R
0z
mac sinh
_
(p) =
_
(p p
0
)
On the other hand, applying a boost transformation (5.30) to
_
(p) we
obtain
e
ic
K
0z
_
(p) =
L
1
p
p
_
(L
1
p) =
L
1
p
1
[J[
(p p
0
)
=
_
(p p
0
)
where
p
=
_
m
2
a
c
4
+ p
2
c
2
(13.17)
and [J[ =
L
1
p
p
is the Jacobian of transformation p L
1
p. This suggests
that momentum eigenvectors have a useful representation
[p) = e
i
R
0
p
[0)
[p = e
i
R
0
p
[0 (13.18)
13.1.4 Interacting representation of the Poincare group
In order to study dynamics of the unstable system we need to dene an
interacting unitary representation of the Poincare group in the Hilbert space
1. This representation will allow us to relate results of measurements in
dierent reference frames. In this section we will take the point of view of
13.1. UNSTABLE SYSTEM AT REST 411
the observer at rest. We will discuss particle decay from the point of view of
a moving observer in sections 13.4 and 13.5.
Let us now turn on the interaction responsible for the decay and discuss
the interacting representation U
g
of the Poincare group in 1 with generators
P, J, K and H. As usual, we prefer to work in the Diracs instant form
of dynamics. Then the generators of space translations and rotations are
interaction-free,
P = P
0
J = J
0
while generators of time translations (the Hamiltonian H) and boosts contain
interaction-dependent terms
H = H
0
+ V
K = K
0
+Z
We will further assume that the interacting representation U
g
belongs to the
Bakamjian-Thomas form of dynamics
9
in which the interacting operator of
mass M commutes with the Newton-Wigner position operator (4.32)
M c
2
_
H
2
P
2
0
c
2
[R
0
, M] = 0 (13.19)
Our next goal is to dene the basis of common eigenvectors of commuting
operators P
0
and M in 1.
10
These eigenvectors must satisfy conditions
9
For the Bakamjian-Thomas theory see subsection 6.3.2. The possibilities for the decay
interaction to be in other forms of dynamics are discussed in subsection 13.5.3.
10
In addition to these two operators, whose eigenvalues are used for labeling eigenvectors
[p, m, there are other independent operators in the mutually commuting set containing
P
0
and M. These are, for example, the operators of the square of the total angular
momentum J
2
0
and the projection of J
0
on the z-axis J
0z
. Therefore a unique character-
ization of any basis vector requires specication of all corresponding quantum numbers
as [p, m, j
2
, j
z
, . . .. However these additional quantum numbers are not relevant for our
discussion and we omit them.
412 CHAPTER 13. DECAYS AND RADIATION
P
0
[p, m = p[p, m (13.20)
M[p, m = m[p, m (13.21)
They are also eigenvectors of the interacting Hamiltonian H =
_
M
2
c
4
+P
2
0
c
2
H[p, m =
p
[p, m
where
p
_
m
2
c
4
+ c
2
p
2
.
11
In the zero-momentum eigensubspace of the
momentum operator P
0
we can introduce a basis [0, m of eigenvectors of
the interacting mass M
P
0
[0, m = 0
M[0, m = m[0, m
Then the basis [p, m in the entire Hilbert space 1 can be built by formula
12
[p, m =
mc
2
p
e
ic
[0, m
where vector
is related to momentum by formula p = mc
1
sinh . These
improper eigenvectors are normalized to delta functions
q, m[p, m
= (q p)(mm
) (13.22)
The actions of inertial transformations on these states are found by the same
method as in section 5.1. In particular, for boosts along the x-axis and time
translations we obtain
13
11
Note the dierence between
p
that depends on the eigenvalue m of the interacting
mass operator and
p
in equation (13.17) that depends on the xed value of mass m
a
of
the particle a.
12
compare with (5.5) and (5.27)
13
compare with equation (5.28)
13.1. UNSTABLE SYSTEM AT REST 413
e
ic
Kx
[p, m =
_
p
[p, m (13.23)
e
i
Ht
[p, m = e
i
pt
[p, m (13.24)
p =
_
p
x
cosh +
p
c
sinh , p
y
, p
z
_
(13.25)
Next we notice that due to equations (4.25) and (13.19) vectors e
i
R
0
p
[0, m
also satisfy eigenvector equations (13.20) - (13.21), so they must be propor-
tional to the basis vectors [p, m
e
i
R
0
p
[0, m = (p, m)[p, m (13.26)
where (p, m) is an unimodular factor. Unlike in (13.18), we cannot conclude
that (p, m) = 1. However, if the interaction is not pathological we can
assume that the factor (p, m) is smooth, i.e., without rapid oscillations.
Obviously, vector [0 from the basis (13.18) can be expressed as a linear
combination of zero-momentum basis vectors [0, m, so we can write
14
[0 =
_
m
b
+mc
dm(m)[0, m (13.27)
where (m) is yet unknown function, which depends on the choice of the
interaction Hamiltonian V and satises equations
(m) = 0, m[0 (13.28)
_
m
b
+mc
dm[(m)[
2
= 1
14
Here we assume that interaction responsible for the decay does not change the spec-
trum of mass. In particular, we will neglect the possibility of existence of bound states of
particles b and c, i.e., discrete eigenvalues of M below m
b
+m
c
. Then the spectrum of M
(similar to the spectrum of M
0
) is continuous in the interval [m
b
+m
c
, ) and integration
in (13.27) should be performed from m
b
+m
c
to innity. Note that this assumption does
not hold in the example considered in subsection 13.2.2, where one eigenvalue of the mass
operator M is lower than m
b
+m
c
, as shown in Figs. 13.2 and 13.3.
414 CHAPTER 13. DECAYS AND RADIATION
The physical meaning of (m) is the probability amplitude for nding the
value m of the interacting mass M in the initial unstable state [0 1
a
.
We now use equations (13.18) and (13.27) to expand vectors [p 1
a
in the
basis [p, m
[p = e
i
R
0
p
[0 = e
i
R
0
p
_
m
b
+mc
dm(m)[0, m
=
_
m
b
+mc
dm(m)(p, m)[p, m (13.29)
Then any state vector from the subspace 1
a
can be written as
[ =
_
dp(p)[p (13.30)
=
_
dp
_
m
b
+mc
dm(m)(p, m)(p)[p, m (13.31)
From (13.22) we also obtain a useful formula
q[p, m =
_
m
b
+mc
dm
(m
(q, m
)q, m
[p, m
=
(p, m)
(m)(q p) (13.32)
13.1.5 Decay law
Let us nd the time evolution of the state vector (13.30) prepared within the
subspace 1
a
at time t = 0. We apply equations (13.8), (13.24) and (13.29)
[(t) =
_
dp(p)e
Ht
[p
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
Ht
[p, m
13.1. UNSTABLE SYSTEM AT REST 415
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
pt
[p, m
The inner product of this vector with [q is found by using (13.32)
q[(t)
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
pt
q[p, m
=
_
dp(p)
_
m
b
+mc
dm[(m)[
2
(p, m)
(p, m)e
pt
(q p)
= (q)
_
m
b
+mc
dm[(m)[
2
e
qt
(13.33)
The decay law is then obtained by substituting (13.14) in equation (13.9)
and using (13.33)
(t) =
_
dq(t)[qq[(t) =
_
dq[q[(t) [
2
=
_
dq[(q)[
2
_
m
b
+mc
dm[(m)[
2
e
qt
2
(13.34)
This formula is valid for the decay law of any state [ 1
a
. In the particular
case of the normalized state [0) whose wave function (q) is well-localized
in the momentum space near zero momentum, we can set approximately
(q)
_
(q)
_
dq[(q)[
2
_
dq(q) = 1
and
15
15
compare, for example, with equation (3.8) in [FGR78]
416 CHAPTER 13. DECAYS AND RADIATION
|0)
(t)
_
m
b
+mc
dm[(m)[
2
e
mc
2
t
2
(13.35)
This result demonstrates that the decay law is fully determined by the func-
tion [(m)[
2
which is referred to as the mass distribution of the unstable
particle. In the next section we will consider an exactly solvable decay model
for which the mass distribution and the decay law can be explicitly calcu-
lated.
13.2 Breit-Wigner formula
13.2.1 Schrodinger equation
In this section we are discussing decay of a particle at rest. Therefore, it
is sucient to consider the subspace 1
0
1 of states having zero total
momentum. The subspace 1
0
can be further decomposed into the direct
sum
1
0
= 1
a0
1
(bc)0
where
16
1
a0
= 1
0
1
a
1
(bc)0
= 1
0
(1
b
1
c
)
1
a0
is, of course, the one-dimensional subspace spanning the zero-momentum
vector [0 of the particle a. In the subspace 1
(bc)0
of decay products the total
momentum is zero P = p
b
+ p
c
= 0. Then 2-particle basis states [ can be
labeled by eigenvectors of the relative momentum operator
= p
b
= p
c
16
Recall that symbol denotes intersection of two subspaces in the Hilbert space.
13.2. BREIT-WIGNER FORMULA 417
Therefore, each state in 1
0
can be written as an expansion in the above basis
([0, [ )
[ = [0 +
_
d ( )[
The coecients of this expansion can be represented as an innite column
vector
[ =
_
(
1
)
(
2
)
(
3
)
. . .
_
_
whose rst component is a complex number .
17
All other components are
values of the complex function ( ) at dierent momenta .
18
For brevity,
we will use the following notation
[ =
_
( )
_
(13.36)
The vector [ should be normalized, hence its wave function satises the
normalization condition
[[
2
+
_
d [( )[
2
= 1 (13.37)
The probability of nding the unstable particle in the state [ is
= [[
2
17
which is projection of the vector [ on the basis state [0
18
These are projections of [ on the relative momentum eigenvectors [ . Of course,
the spectrum of is continuous and, strictly speaking, cannot be represented by a set of
discrete values
i
. However, we can avoid this diculty by the usual trick of placing the
system in a nite box (then the momentum spectrum becomes discrete) and then taking
the limit in which the size of the box goes to innity. This approach will play an important
role in the next subsection.
418 CHAPTER 13. DECAYS AND RADIATION
and in the initial state
[0 =
_
1
0
_
(13.38)
the unstable particle is found with 100% probability.
We can now nd representations of various operators in the basis ([0, [ ).
In the subspace 1
0
of zero total momentum, the Hamiltonian H =
M
2
c
4
+ P
2
c
2
coincides with the mass operator M (multiplied by c
2
), which is a sum of the
non-interacting mass M
0
and interaction V
M = M
0
+ V
The non-interacting mass is diagonal
M
0
=
_
_
m
a
0 0 0 . . .
0
1
0 0 . . .
0 0
2
0 . . .
0 0 0
3
. . .
0 0 0 0 . . .
_
_
m
a
0
0
_
where
=
1
c
2
_
_
m
2
b
c
4
+ c
2
2
+
_
m
2
c
c
4
+ c
2
2
_
(13.39)
is the mass of the two-particle (b + c) system expressed as a function of the
relative momentum. In the subspace 1
0
interaction operator (13.4) takes
the form
V =
_
d
_
G( , )a
0
b
c
+ G
( , )b
a
0
_
_
d
_
g( )a
0
b
c
+ g
( )b
a
0
_
13.2. BREIT-WIGNER FORMULA 419
Its matrix representation is
19
V =
_
_
0 g(
1
) g(
2
) g(
3
) . . .
g
(
1
) 0 0 0 . . .
g
(
2
) 0 0 0 . . .
g
(
3
) 0 0 0 . . .
. . . 0 0 0 . . .
_
_
0
_
dqg(q) . . .
g
( ) 0
_
where g( ) is the matrix element of the interaction operator between states
[0 and [
g( ) = 0[V [ (13.40)
Then the action of the full Hamiltonian H = H
0
+ V on vectors (13.36) is
H
_
( )
_
=
_
_
m
a
c
2
g(
1
) g(
2
) g(
3
) . . .
g
(
1
)
1
c
2
0 0 . . .
g
(
2
) 0
2
c
2
0 . . .
g
(
3
) 0 0
3
c
2
. . .
. . . 0 0 0 . . .
_
_
_
(
1
)
(
2
)
(
3
)
. . .
_
_
=
_
_
m
a
c
2
+
_
dqg(q)(q)
g
(
1
) +
1
(
1
)c
2
g
(
2
) +
2
(
2
)c
2
g
(
3
) +
3
(
3
)c
2
. . .
_
_
m
a
c
2
+
_
dqg(q)(q)
g
( ) +
( )c
2
_
The next step is to nd eigenvalues (which we denote mc
2
) and eigenvec-
tors
[0, m
_
(m)
m
( )
_
(13.41)
19
Here symbol
_
dqg(q) . . . denotes a linear operator, which produces a number
_
dqg(q)(q) when acting on arbitrary test function (q). The function g( ) coincides
with G(p, q) on the zero-momentum subspace g( ) G( , ).
420 CHAPTER 13. DECAYS AND RADIATION
of the Hamiltonian H.
20
This task is equivalent to the solution of the follow-
ing system of linear equations:
m
a
c
2
(m) +
_
dqg(q)
m
(q) = mc
2
(m) (13.42)
g
( )
(m) +
c
2
m
( ) = mc
2
m
( ) (13.43)
From Equation (13.43) we obtain
m
( ) =
g
( )
(m)
mc
2
c
2
(13.44)
Substituting this result to equation (13.42) we obtain equation which deter-
mines the spectrum of eigenvalues m
mm
a
=
1
c
4
_
dq
[g(q)[
2
m
q
(13.45)
To comply with the law of conservation of the angular momentum, the func-
tion [g(q)[ should depend only on the absolute value q [q[ of its argument.
Therefore, we can rewrite equation (13.45) in the form
mm
a
= F(m) (13.46)
where
F(m)
_
0
dq
G(q)
m
q
(13.47)
G(q)
4q
2
c
4
[g(q)[
2
(13.48)
From the normalization condition (13.37)
20
Note that the function (m) in (13.41) is the same as in (13.28). So, in order to
calculate the decay law (13.35), all we need to know is [(m)[
2
.
13.2. BREIT-WIGNER FORMULA 421
[(m)[
2
+
_
dq[
m
(q)[
2
= 1 (13.49)
and equation (13.44) we obtain
[(m)[
2
_
1 +
_
dq
G(q)
(m
q
)
2
_
= 1
[(m)[
2
=
1
1 F
(m)
(13.50)
where F
i=1
[g(
i
)[
2
mm
i
= F(m) (13.51)
where m
i
=
i
are eigenvalues of the non-interacting mass of the 2-particle
system b + c and the lowest eigenvalue is m
1
= m
b
+ m
c
. The function
on the right hand side of equation (13.51) is a superposition of functions
c
2
[g(
i
)[
2
(m m
i
)
1
for all values of i = 1, 2, 3, . . .. These functions have
singularities at points m
i
. Positions of these singularities are shown as open
circles and dashed vertical lines in Fig. 13.3. The overall shape of the function
F(m) in this approximation is shown by the thick full line. According to
equation (13.51), the spectrum of the interacting mass operator can be found
at points where the line mm
a
intersects with the graph F(m). These points
M
i
are shown by full circles in Fig. 13.3. So, the derivatives required in
equation (13.50) are graphically represented as slopes of the function F(m)
at points M
1
, M
2
, M
3
, . . .. The diculty is that in the limit of continuous
spectrum the distances between points m
i
tend to zero, function F(m) wildly
oscillates and its derivative tends to innity everywhere.
To overcome this diculty we will use the following idea [Liv47]. Let rst
us change the integration variable in (13.47)
z =
_
m
b
+mc
dz
(z)
2(mz)
=
m
_
m
b
+mc
dz
(z)
2(mz)
+
m+
_
m
dz
(z)
2(mz)
+
_
m+
dz
(z)
2(mz)
(13.53)
where is a small number such that function (z) may be considered con-
stant ((z) = (m)) in the interval [m, m+ ]. When 0, the rst
and third terms on the right hand side of (13.53) give the principal value
integral (denoted by P
_
)
m
_
m
b
+mc
dz
(z)
2(mz)
+
_
m+
dz
(z)
2(mz)
P
_
m
b
+mc
dz
(z)
2(mz)
T(m)
(13.54)
Let us now look more closely at the second integral on the right hand side
of (13.53). The interval [m, m+ ] can be divided into 2N small equal
segments
m
j
= m
0
+ j
N
where m
0
= m, integer j runs from N to N and the integral can be ap-
proximated as a partial sum
m+
_
m
dz
(z)
2(mz)
(m
0
)
2
m+
_
m
dz
1
mz
(m
0
)
2
N
j=N
/N
mm
0
j
N
(13.55)
Next we assume that N and index j runs from to . Then the
right hand side of equation (13.55) denes an analytical function with poles
at points
13.2. BREIT-WIGNER FORMULA 425
m
j
= m
0
+ j
N
(13.56)
and with residues (m
0
)/(2N). As any analytical function is uniquely
determined by the positions of its poles and the values of its residues, we
conclude that integral (13.55) has the following representation
m+
_
m
dz
(z)
2(mz)
(m
0
)
2
cot
_
N
(mm
0
)
_
(13.57)
Indeed, the cot function on the right hand side of (13.57) also has poles at
points (13.56). The residues of this function are exactly as required too. For
example, near the point m
0
(where j = 0) the right hand side of (13.57) can
be approximated as
(m
0
)
2
cot
_
N
(mm
0
)
_
(m
0
)/N
2(mm
0
)
which agrees with (13.55). Now we can put equations (13.54) and (13.57)
together and write
F(m) = T(m) +
(m)
2
cot
_
Nm
_
Then, using
cot(ax)
= a(1 + cot
2
(ax))
and ignoring derivatives of smooth functions T(m) and (m) we obtain
F
(m) =
(m)N
2
(1 + cot
2
(N
1
m)) (13.58)
For formula (13.50) we need values of F
2
(m)
Substituting this to (13.58) and (13.50) we obtain the desired result
F
(m) =
(m)N
2
_
1 +
4(mm
a
T(m))
2
2
(m)
_
[(m)[
2
=
1
1 + (m)N/(2)
_
1 +
4(ma+P(m)m)
2
)
2
(m)
_ (13.59)
(m)/(2N)
2
(m)/4 + (m
a
+T(m) m)
2
(13.60)
where we neglected the unity in the denominator of (13.59) as compared to
the large factor N
1
. Formula (13.60) gives the probability for nding
particle a at each point of the discrete spectrum M
1
, M
2
, M
3
, . . .. This proba-
bility tends to zero as the density of points N
1
tends to innity. However,
when approaching the continuous spectrum in the limit N we do not
need the probability at each spectrum point. We, actually, need the prob-
ability density which can be obtained by multiplying the right hand side of
equation (13.60) by the number of points per unit interval N
1
. Then the
mass distribution for the unstable particle takes the famous Breit-Wigner
form
[(m)[
2
=
(m)/(2)
(m)
2
/4 + (m
a
+T(m) m)
2
(13.61)
13.2. BREIT-WIGNER FORMULA 427
mm
b b
+m
cc
mm
AA
mm
||(m)|
22
/(2)
2
/4 + (m
a
m)
2
(13.62)
13.2.3 Exponential decay law
To complete our discussion of the unstable system at rest we are now going
to calculate its time-dependent wave function. For the initial state vector
at t = 0 we choose state (13.38) of the particle a. Its time dependence is
described by the time evolution operator
22
the center of the resonance
428 CHAPTER 13. DECAYS AND RADIATION
[0, t = e
Ht
[0
To evaluate this expression it is convenient to represent [0 as an expansion
(13.27) in the basis of eigenvectors of the Hamiltonian H. Then, using (13.41)
and (13.44) we obtain
e
Ht
[0 =
_
m
b
+mc
dm(m)e
Ht
[0, m =
_
m
b
+mc
dm(m)e
mc
2
t
[0, m
=
_
m
b
+mc
dme
mc
2
t
[(m)[
2
_
1
g
( )/(mc
2
c
2
)
_
_
I(t)
J( , t)
_
(13.63)
The rst integral I(t) determines the decay law for the particle at rest. Sub-
stituting (13.62) in the integrand, we obtain
(t) = [I(t)[
2
1
4
2
_
m
b
+mc
dm
e
mc
2
t
2
/4 + (m
a
m)
2
2
(13.64)
For most unstable systems
m
a
(m
b
+ m
c
) (13.65)
so the integrand is well localized around the value m m
a
and we can
introduce further approximation by setting the lower integration limit in
(13.64) to . Then the decay law obtains a familiar exponential form
(t)
1
4
2
2e
mac
2
t
exp
_
c
2
t
2
_
2
= exp
_
c
2
t
_
= exp
_
0
_
(13.66)
where
13.2. BREIT-WIGNER FORMULA 429
0
=
c
2
(13.67)
is the lifetime of the unstable particle. For a particle prepared initially in
the undecayed state (0) = 1, the nondecay probability decreases from 1 to
1/e during its lifetime.
Using formulas (13.40), (13.48) and (13.52) we can also see that the decay
rate
23
1
0
=
c
2
=
2c
2
G(
1
(m
a
))
d
1
(z)
dz
z=ma
=
8
2
2
c
2
[0[V [[
2
d
1
(z)
dz
z=ma
(13.68)
is proportional to the square of the matrix element of the perturbation V the
initial and nal states of the system. It is also proportional to the kinemat-
ical factor d
1
(z)/dz[
z=ma
, which is fully determined by the three involved
masses m
a
, m
b
and m
c
.
The importance of formulas (13.62), (13.66) and (13.68) is that they were
derived from very general assumptions. We have not used the perturbation
theory.
24
Actually, the only signicant approximation is the weakness of the
interaction responsible for the decay, i.e., the narrow width of the resonance
(13.65). This condition is satised for all known decays.
25
Therefore, the
exponential decay law is expected to be universally valid. This prediction is
conrmed by experiment: so far no deviations from the exponential decay
law (13.66) were observed.
13.2.4 Wave function of decay products
The second integral J( , t) in (13.63) describes the wave function of decay
product b and c. We note that function [(m)[
2
has poles at m
a
i/2 and
m
a
+ i/2. Then
23
Actually, parameter has the dimensionality of mass, so the true decay rate is 1/
0
=
c
2
/ (Hz). The momentum in (13.68) should be calculated as =
1
(m
a
).
24
Thus our result is more general than the perturbative Fermis golden rule.
25
Approximation (13.65) may be not accurate for particles (or resonances) decaying due
to strong nuclear forces. However, their lifetime is very short
0
10
23
s, so the time
dependence of their decays cannot be observed experimentally.
430 CHAPTER 13. DECAYS AND RADIATION
0
m
a
+i/2
m
a
-i/2
Re(m)
Im(m)
Figure 13.5: Complex plane integration contour for integral (13.69).
J( , t) =
g
( )
2c
2
mc
2
t
dm
(m
)(mm
a
i/2)(mm
a
+ i/2)
(13.69)
The integration contour should be closed as shown in Fig. 13.5, because
then the integral along the large semi-circle in the lower half-plane can be
ignored,
26
so we obtain
J( , t) =
g
( )
c
2
(m
a
i/2)
_
e
(mai/2)c
2
t
ie
c
2
t
2(m
a
) + i
_
(13.70)
To be consistent with the initial condition (13.38), our solution must satisfy
J( , 0) = 0. This is, indeed, true as within our approximations we can set
= m
a
in the second term in the parentheses, so that the whole expression
vanishes at t = 0.
The rst term in the parentheses is signicant only at short times compa-
rable with the particles lifetime
0
. In the limit t only the second term
contributes and we can write the wave function in the position representation
(5.42)
26
the factor e
mc
2
t
tends to zero there
13.2. BREIT-WIGNER FORMULA 431
J(r, t)
i
2c
2
(2)
3/2
_
d e
i
r
g
( )
e
c
2
t
(m
a
)
2
+
2
/4
(13.71)
where r r
b
r
c
is the relative position of the two decay products, which is
an observable conjugate to . In our model, the interaction strength [g( )[ is
spherically symmetric, so let us assume g
0
m
a
. We can assume that within this layer the
modulus of the interaction function stays nearly constant [g()[ [g(
0
)[.
Similarly, we can approximate equation (13.39) as
0
+
1
c
2
_
c
2
0
_
m
2
b
c
4
+ c
2
2
0
+
c
2
0
_
m
2
c
c
4
+ c
2
2
0
_
(
0
)
=
0
+
1
c
2
(v
b
+ v
c
) (
0
)
where v
b
v
c
are the average speeds with which the two decay products leave
the region of their creation. With these approximations we rewrite equation
(13.71)
J(r, t)
C
(2)
3
_
d e
i
r
e
(v
b
+vc)t
(13.72)
where C is a constant whose value is not important to us here. Evaluating
this integral in spherical coordinates we obtain
J(r, t)
2C
(2)
3
_
0
sin d
_
0
2
de
i
r cos
e
(v
b
+vc)t
2C
ir(2)
3
d(e
i
r
e
r
)e
(v
b
+vc)t
1
r
[(r (v
b
+ v
c
)t) + (r + (v
b
+ v
c
)t)]
432 CHAPTER 13. DECAYS AND RADIATION
The second delta function in square brackets can be ignored, because r, v
b
, v
c
, t
are positive quantities and, in addition, the limit t + is taken. Then
J(r, t)
1
r
(r (v
b
+ v
c
)t)
which means that in the semiclassical approximation presented here the wave
function of the decay products has the form of a spherical shell expanding
around the decay point with a constant speed. The separation between two
decay products changes as
r = (v
b
+ v
c
)t (13.73)
This indicates that particles b and c are not interacting in the asymptotic
regime: they move apart with constant velocities, as expected.
13.3 Spontaneous radiative transitions
In the preceding section we discussed rather general properties of the decay
process. In particular, we have not specied the exact type of the unstable
system and the form of the decay interaction operator V . So, we left our
formula for the decay constant in an unprocessed form (13.68). In this section
we would like to ll this gap and perform a complete calculation of the decay
rate for a realistic system an excited state of the hydrogen atom.
13.3.1 Instability of excited atomic states
We have mentioned in the preceding section that decay theory developed
there applies not only to unstable elementary particles, but also to excited
states of any compound physical system. Let us now apply this theory to
excited states of the hydrogen atom. We found in section 12.2 that in the
non-relativistic approximation the 2nd order Darwin-Breit Hamiltonian is
27
H
ep
=
p
2
e
2m
+
p
2
p
2M
e
2
4r
(13.74)
27
p
e
and p
p
are the electrons and protons momentum operators, respectively; r
r
e
r
p
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 433
This Hamiltonian has a well-dened spectrum of bound stationary states
with negative energies. Consider two stationary eigenstates [
i
and [
f
of
this Hamiltonian with dierent energies E
i
and E
f
, respectively
H
ep
[
i
= E
i
[
i
H
ep
[
f
= E
f
[
f
E
i
> E
f
The Coulomb potential in (13.74) is just the leading part of interaction. As
we discussed in subsection 11.3.1, there is an innite number of additional
smaller interaction terms. In this section we will be interested in 3rd order
interaction potentials of the type
V
3
= V
3
[d
da] (13.75)
If interaction V
3
is added to the Hamiltonian H
ep
then states [
i
and [
f
plus a photon.
28
Thus interaction (13.75) is responsible for spontaneous light emission by
the atom. The two-particle subspace 1
pe
is not invariant with respect to this
potential. For example, operator d
da][2P
1/2
All excited atomic states become unstable and their energy (mass) distri-
butions becomes shifted and broadened according to the Breit-Wigner reso-
nance formula (13.62). The lowest-energy ground state [1S
1/2
cannot decay
28
This is exactly the situation discussed in the preceding section. The particles a, b, c
from section 13.1 are analogous to our states [
i
, [
f
and the emitted photon, respec-
tively.
29
see Fig. 12.1
434 CHAPTER 13. DECAYS AND RADIATION
spontaneously with the emission of a photon, simply because there are no
any lower energy states where to decay. Thus only the ground state [1S
1/2
dt
1
dt
2
dt
3
T[V
1
(t
1
)V
1
(t
2
)V
1
(t
3
)]
=
i
3!
3
_
d
4
x
1
d
4
x
2
d
4
x
3
T[V
1
( x
1
)V
1
( x
2
)V
1
( x
3
)] (13.76)
=
i
3!
3
_
d
4
x
1
d
4
x
2
d
4
x
3
T[(J
( x
1
)A
( x
1
) +
( x
1
)A
( x
1
))
(J
( x
2
)A
( x
2
) +
( x
2
)A
( x
2
))
(J
( x
3
)A
( x
3
) +
( x
3
)A
( x
3
))] (13.77)
where J
and
, q
, s; , ,
, )
=
i
2
3
c
3
_
d
4
x
1
d
4
x
2
d
4
x
3
0[a
q,
d
p,
T[J
( x
1
)A
( x
1
)J
( x
2
)A
( x
2
)
( x
3
)A
( x
3
)]d
s,
[0
=
ie
3
2
3
_
d
4
x
1
d
4
x
2
d
4
x
3
0[a
q,
d
p,
T[(( x
1
)
( x
1
)A
( x
1
))(( x
2
)
( x
2
)A
( x
2
))
(( x
3
)
( x
3
)A
( x
3
))]d
s,
[0
This function can be evaluated by drawing two Feynman diagrams shown in
Fig. 13.6 and processing them according to Feynman rules from subsection
9.2.4.
S
JJJ
3
(p, q, p
, q
, s; , ,
, )
=
ie
3
c
3/2
4
2
mMc
4
_
q
p
c
(2)
3/2
2s
4
( s + q + p p
)
1
( p p
)
2
u
a
(q, )
_
,e
ab
(s, )
(,q ,s + mc
2
)
bc
( q s)
2
m
2
c
4
,W
cd
(p, ; p
)
+ ,W
ab
(p, ; p
)
(,s+ ,q
+ mc
2
)
bc
( s + q
)
2
m
2
c
4
,e
cd
(s, )
_
u
d
(q
)
Now let us assume that the electron and the proton are non-relativistic
and simplify the above expression. According to approximations derived in
Appendix J.9 and using
p
2
= ( p
)
2
= m
2
c
4
s
2
= 0
( p p
)
2
c
2
(p p
)
2
30
Summation is assumed on indices , , = 0, 1, 2, 3.
436 CHAPTER 13. DECAYS AND RADIATION
p,
p,
q,
q,
s,
p,
p,
q,
q,
s,
(a)
(b)
Figure 13.6: 3rd order Feynman diagrams for the photon emission in
electron-proton collisions.
( s + q
)
2
m
2
c
4
= 2 s q
( q s)
2
m
2
c
4
= 2 s q
W
0
(p, , p
)
,
W 0
_
q
p
p
Mmc
4
we obtain
S
JJJ
3
(p, q, p
, q
, s; , ,
, )
ie
3
c
5/2
1
4
2
1
(2)
3/2
2s
4
( s + q + p p
c
2
(p p
)
2
u
a
(q, )
_
,e
ab
(s, )
(,q ,s + mc
2
)
bc
2 q s
cd
0
+
ab
0
(,s+ ,q
+ mc
2
)
bc
2 q
s
,e
cd
(s, )
_
u
d
(q
)
Next we assume that the energy and momentum of the emitted photon
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 437
is much less than energies and momenta of charged particles. We also use
Dirac equations (J.82), (J.81) and the non-relativistic approximation (J.70)
to write
31
u(q, ) ,e(,q ,s + mc
2
)
0
u(q
)
u(q, )e
(q
+ mc
2
)
0
u(q
)
= u(q, )((q
+ mc
2
)e
+ 2g
)
0
u(q
)
= u(q, )(( ,q + mc
2
)e
+ 2 q e)
0
u(q
)
= 2u(q, )( q e)
0
u(q
) = 2U
0
(q, ; q
)( q e)
2
,
( q e)
u(q, )
0
(,s+ ,q
+ mc
2
) ,eu(q
)
= 2U
0
(q, ; q
)( q
e) 2
,
( q
e)
Further approximations yield
s q
= cs
q
c
2
(s q
) mc
3
s
s q mc
3
s
q e = c(q e)
q
e = c(q
e)
Therefore
32
S
JJJ
3
(p, q, p
, q
, s; , ,
, )
ie
3
c
4
( s + q + p p
)
4
2
(2)
3/2
2s
(p p
)
2
_
q
e
q
s
q e
q s
_
(13.78)
ie
3
4
( q + p p
)
4
2
m(2)
3/2
_
2(cs)
3
,
(q
q) e(s, )
(q
q)
2
(13.79)
31
Here we used the tilde to write e in order to stress the 4-component nature of this
quantity despite the fact that it does not transform as a 4-vector.
32
Our result in (13.78) can be compared with equations (7.57) - (7.58) in [BD64].
438 CHAPTER 13. DECAYS AND RADIATION
This is our nal expression for the terms JJ in the scattering operator
(13.77). The contribution from J terms can be obtained simply by re-
placing the electrons mass m in (13.79) by the protons mass M. So, this
contribution is much smaller and will be neglected.
13.3.3 Perturbation Hamiltonian
From results in the preceding subsection we can nd the 3rd order contri-
bution V
3
to the dressed particle interaction Hamiltonian. The relationship
between the dressed Hamiltonian and the scattering operator in the 3rd order
is given by equation (11.27). So
V
3
=
_
dpdqdp
dq
dsV
JJJ
3
(p, q, p
, q
, s; , ,
, )a
s,
a
q,
d
p,
(13.80)
whose coecient function is
33
V
JJJ
3
(p, q, p
, q
, s; , ,
, )
e
3
8
3
m(2)
3/2
_
2(cs)
3
(q +p p
,
(q
q) e(s, )
(q
q)
2
The action of the operator (13.80) on a two particle (electron+proton) initial
state
[
i
_
dp
dq
(p
, q
; , )a
,
d
,
[0
is
33
This formula is obtained simply by dividing scattering amplitude (13.79) by the factor
(2i) and omitting the delta function (
q
+
p
q
p
). This rule follows immediately
from formulas (8.66) and (8.67).
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 439
V
3
[
i
=
_
dp
dq
_
dpdqdp
dq
dsV
3
(p, q, p
, q
, s; , ,
, )
(p
, q
; , )a
s,
a
q,
d
p,
a
,
d
,
[0
=
_
dp
dq
_
dpdqdp
dq
dsV
3
(p, q, p
, q
, s; , ,
, )
(p
, q
; , )a
s,
(q q
(p p
[0
=
_
dpdqdp
dq
dsV
3
(p, q, p
, q
, s; , ,
, )
(p, q; , )a
s,
[0
=
_
dp
dq
ds
_
_
dpdqV
3
(p, q, p
, q
, s; , ,
, )
(p, q; , )
_
a
s,
[0
where expression in big parentheses is the transformed wave function of the 3-
particle system (electron+proton+photon). Using (13.80), this wave function
can be written as
(p
, q
, s;
)
=
_
dpdq
e
3
(q +p p
,
(q
q) e(s, )
8
3
m(2)
3/2
_
2(cs)
3
(q
q)
2
(p, q; , )
=
_
dk
e
3
8
3
m(2)
3/2
_
2(cs)
3
k e(s, )
k
2
(p
+k, q
k;
)
By taking a Fourier transform and using (8.79), (B.8) we can switch to
the position representation for fermions
34
(x, y, s;
)
=
1
(2)
3
_
dp
dq
e
i
x+
i
y
_
dk
e
3
k e(s, )
8
3
m(2)
3/2
_
2(cs)
3
k
2
34
x and y are position vectors of the proton and the electron, respectively.
440 CHAPTER 13. DECAYS AND RADIATION
(p
+k, q
k;
)
=
1
(2)
3
_
dp
dq
e
i
(p
k)x+
i
(q
+k)y
_
dk
e
3
k e(s, )
8
3
m(2)
3/2
_
2(cs)
3
k
2
(p
, q
)
=
3/2
e
3
(2)
3
m(2)
3/2
_
2(cs)
3
_
dke
i
k(yx)
k e(s, )
k
2
_
1
(2)
3
_
dp
dq
e
i
x+
i
y
(p
, q
)
_
=
e
3
1/2
m(2)
3/2
_
2(cs)
3
i(y x) e(s, )
4[y x[
3
(x, y;
)
This means that the position-space interaction potential between two charged
particles that leads to the creation of a photon with momentum s and helicity
is
V
3
(r, s, ) =
i
1/2
e
3
r e(s, )
4m
_
2(2cs)
3
r
3
(13.81)
In contrast to 2nd order interaction terms in (13.74), this potential does not
conserve the number of particles. It is responsible for the emission of photons
by an electron moving in the eld of a heavy proton.
35
We can expect that
the radiation emission rate should be proportional to the square of the matrix
element of this operator between appropriate initial and nal states. One can
notice that potential (13.81) is proportional to the electrons acceleration
a
e
2
r
4mr
3
Thus we conclude that the total radiated power should depend on the square
of electrons acceleration a
2
. This is in agreement with the well-know Lar-
mors formula of classical electrodynamics. Thus the 3rd order bremsstrahlung
interaction V
3
has direct relevance to the radiation reaction eect [McDb,
Par02, Par].
35
To maintain the Hermiticity of the full Hamiltonian it must contain also a term, which
is Hermitian conjugate to V
3
. Apparently, this term is responsible for the absorption of
photons by the interacting system electron+proton.
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 441
13.3.4 Transition rate
The (bremsstrahlung) perturbation V
3
derived in the preceding subsection is
also responsible for radiative transitions between energy levels in atoms and
other bound systems. In this subsection we are going to calculate the rate of
such transitions, i.e., the brightness of spectral lines in the hydrogen atom.
The decay constant for the radiative transition between two stationary
atomic states can be obtained from formula (13.68)
36
=
8
2
(s)
2
c
4
[
i
[V
3
(r, s, )[
f
[
2
d
1
(z)
dz
z=ma
(13.82)
Using equality
[p
e
, H
ep
] =
_
p
e
,
e
2
4r
_
= ie
2
r
4r
3
(13.83)
and denoting E E
i
E
f
= cs the energy of the emitted photon we obtain
for the matrix element
i
[V
3
(r, s, )[
f
=
i
1/2
e
3
m
_
2(2cs)
3
_
r e(s, )
4r
3
f
_
=
e
m
_
2(2E)
3
i
[(p
e
e)H
ep
H
ep
(p
e
e)[
f
=
e
m
_
2(2E)
3
E
i
[(p
e
e)[
f
Next we use
im
[r, H
ep
] =
im
[r
e
, H
ep
] +
im
[r
p
, H
ep
] p
e
m
M
p
p
p
e
36
The derivation in this subsection is not completed yet. First, equation (13.68) was
obtained in the case of isotropic interaction, while in our case interaction V
3
is not isotropic.
Second, formula (13.82) should be summed over two possible values (+1 and -1) of the
photons helicity . So, our results in this subsection should be considered as qualitative
(up to a constant factor) only. The correct numerical factor in (13.85) should be 4/3
instead of our 1/2. See equation (19.85) in [Bal98].
442 CHAPTER 13. DECAYS AND RADIATION
to obtain
i
[V
3
(r, s, )[
f
=
ie
_
2(2E)
3
E
i
[(r e)H
ep
H
ep
(r e)[
f
=
ie
_
2(2E)
3
E
2
i
[(r e)[
f
=
ie
E
_
2(2)
3
i
[(r e)[
f
(13.84)
In order to nd function
1
(z) in (13.82) we turn to the denition (13.39).
In the case of atomic radiative transitions considered here, one of the decay
products (the photon) is massless (m
c
= 0) and its energy is much smaller
than rest energies of the atomic states cs m
a
c
2
m
b
c
2
z =
s
=
1
c
2
_
_
m
2
b
c
4
+ c
2
s
2
+ cs
_
1
c
2
(m
b
c
2
+ cs)
s =
1
(z) (z m
b
)c
d
1
(z)
dz
z=ma
c
Putting these results in (13.82) we nd that the rate of emission of photons
with momentum s and helicity is given by the formula
1
0
=
c
2
=
8
2
E
2
c
3
_
e
E
_
2(2)
3
_
2
[
i
[(r e)[
f
[
2
=
E
3
2
4
c
3
[
i
[(d e(s, ))[
f
[
2
(Hz) (13.85)
where d er is the atoms dipole moment operator.
13.3.5 Energy correction due to level instability
Let us now assume that the photon has a small mass . This can be modeled
by adding a rest energy term
2
c
4
to the photons energy c
2
s
2
. Then
interaction (13.81) can be rewritten as
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 443
V
3
(r, s, ) =
i
1/2
e
3
r e(s, )
4m
_
2(2)
3
(
2
c
4
+ c
2
s
2
)
3/4
r
3
(13.86)
Consider state [n of the hydrogen atom with energy E
n
. Perturbation
(13.86) changes the energy of this state by the amount E
n
, which can
be calculated by the perturbation theory formula
37
E
n
=
_
ds
n[V
3
[l; s, l; s, [V
3
[n
E
n
E
l
cs
=
e
6
2m
2
(2)
3
l
+1
=1
_
ds
_
n
(r e(s, ))
4r
3
l; s,
__
l; s,
(r e(s, ))
4r
3
n
_
1
(
2
c
4
+ c
2
s
2
)
3/2
(E
n
E
l
cs)
where [l; s, [l[s, is a basis state, which has the atom in a stationary
state [l
38
and a free photon in the state [s, with momentum s and helicity
. From equality (K.13)
+1
=1
e
i
(s, )e
j
(s, ) =
ij
s
i
s
j
s
2
we obtain
E
n
=
e
6
2m
2
(2)
3
lij
_
ds
_
ij
s
i
s
j
s
2
__
n
r
i
4r
3
l
__
l
r
j
4r
3
n
_
1
(
2
c
4
+ c
2
s
2
)
3/2
(E
n
E
l
cs)
Let us now set E
nl
E
n
E
l
and
37
See equation (10.70) in [Bal98]. This energy shift can be compared with the mass shift
T(m
A
) in (13.54) and (13.61).
38
[l is an eigenstate of the non-perturbed 2nd order Hamiltonian H
ep
(13.74) with
eigenvalue E
l
.
444 CHAPTER 13. DECAYS AND RADIATION
I
nl
_
0
s
2
ds
(
2
c
4
+ c
2
s
2
)
3/2
(E
nl
cs)
Then
_
ds
s
i
s
j
s
2
(
2
c
4
+ c
2
s
2
)
3/2
(E
nl
cs)
=
ij
_
ds
s
2
z
s
2
(
2
c
4
+ c
2
s
2
)
3/2
(E
nl
cs)
= 2
ij
_
0
sin d
_
0
ds
s
2
cos
2
(
2
c
4
+ c
2
s
2
)
3/2
(E
nl
cs)
= 2
ij
1
_
1
dt
_
0
ds
s
2
t
2
(
2
c
4
+ c
2
s
2
)
3/2
(E
nl
cs)
=
4
ij
3
I
nl
and we can write
E
n
e
6
2m
2
(2)
3
lij
_
n
r
i
4r
3
l
__
l
r
j
4r
3
n
_
_
4
ij
I
nl
4
ij
3
I
nl
_
=
4e
6
3m
2
(2)
3
li
_
n
r
i
4r
3
l
__
l
r
i
4r
3
n
_
I
nl
Integral I
nl
is calculated as follows
I
nl
=
1
c
3
_
2
c
4
E
nl
cs
(
2
c
4
+ E
2
nl
)
2
c
4
+ c
2
s
2
+
E
2
nl
(
2
c
4
+ E
2
nl
)
3/2
ln
_
_
2
c
4
+ E
2
nl
2
c
4
+ c
2
s
2
+
2
c
2
+ E
nl
cs
E
nl
cs
_
_
s=
s=0
1
c
3
_
1
E
nl
+
1
[E
nl
[
ln
_
([E
nl
[ +
2
c
4
[E
nl
[
1
/2) E
nl
_
1
[E
nl
[
ln
_
[E
nl
[c
2
E
nl
_
_
1
c
3
[E
nl
[
ln
_
[[E
nl
[ +
2
c
4
[E
nl
[
1
/2 + E
nl
]E
nl
[E
nl
[c
2
_
13.3. SPONTANEOUS RADIATIVE TRANSITIONS 445
If E
nl
> 0
I
nl
=
1
c
3
E
nl
ln
_
2E
nl
c
2
_
If E
nl
< 0
I
nl
=
1
c
3
E
nl
ln
_
[
2
c
4
E
1
nl
/2]E
nl
E
nl
c
2
_
=
1
c
3
E
nl
ln
_
2E
nl
c
2
_
So, taking into account that ln(1) ln (2[E
nl
[/(c
2
)), we can write for all
values of E
nl
I
nl
1
c
3
E
nl
ln
_
2[E
nl
[
c
2
_
Then
E
n
4e
6
3m
2
(2)
3
c
3
li
_
n
r
i
4r
3
l
_
2
1
E
nl
ln
_
2[E
nl
[
c
2
_
Next we use equation (13.83)
r
i
4r
3
=
1
ie
2
[p
i
, H
ep
]
_
n
r
i
4r
3
l
_
2
=
1
2
e
4
n[(p
i
H
ep
H
ep
p
i
)[ l l [(p
i
H
ep
H
ep
p
i
)[ n
=
E
2
nl
2
e
4
n[p
i
[ l l [p
i
[ n
to obtain
E
n
e
2
6m
2
2
c
3
li
[n[p
i
[l[
2
E
nl
ln
_
c
2
2[E
nl
[
_
446 CHAPTER 13. DECAYS AND RADIATION
Our next step is to use the so-called Bethe logarithm E
n
dened for s-states
as
39
li
[n[p
i
[l[
2
E
nl
ln
_
c
2
2[E
nl
[
_
ln
_
c
2
2E
n
_
li
[n[p
i
[l[
2
E
nl
Then
40
E
n
2
3m
2
c
2
ln
_
c
2
2E
n
_
l
[n[p
i
[l[
2
E
nl
=
3m
2
c
2
ln
_
c
2
2E
n
_
l
_
n[(H
ep
p
i
p
i
H
ep
)[ll[p
i
[n
n[p
i
[ll[(H
ep
p
i
p
i
H
ep
)[n
_
=
3m
2
c
2
ln
_
c
2
2E
n
_
(n[(H
ep
p
i
p
i
H
ep
)p
i
[n
n[p
i
(H
ep
p
i
p
i
H
ep
)[n)
=
3m
2
c
2
ln
_
c
2
2E
n
_
n[([H
ep
, p
i
]p
i
p
i
[H
ep
, p
i
])[n
=
3m
2
c
2
ln
_
c
2
2E
n
_
n[[p
i
, [p
i
, H
ep
]][n
=
2
3m
2
c
2
ln
_
c
2
2E
n
__
n
2
r
2
e
2
4r
n
_
=
4
3
2
3m
2
c
ln
_
c
2
2E
n
_
n[(r)[n (13.87)
This energy correction aects only spherically-symmetric S-states of the
atom. From formula (13.87) and the well-known result
41
2E
2S
1/2 = 16.64mc
2
2
we can calculate the eect of spontaneous emission on the hydrogen 2S
1/2
states energy
E
se
2S
1/2
=
4
3
2
3m
2
c
ln
_
16.64m
2
_
[
2S
1/2(0)[
2
=
mc
2
5
6
ln
_
16.64m
2
_
39
See, e.g., formulas (8.87) in [BD64] and (14.3.51) in [Wei95].
40
here we used equation (B.2)
41
See section 14.3 in [Wei95].
13.4. DECAY LAW FOR MOVING PARTICLES 447
(13.88)
It is rather shocking that in the limit of zero photon mass ( 0) this
energy shift becomes innite. This is an example of infrared divergence dis-
cussed in greater detail in chapter 13.6. We will see there, that if this result
is combined with 4th perturbation order radiative corrections from renor-
malized QED, then infrared divergences get canceled and the residual small
energy correction yields the famous Lamb shift in agreement with experi-
ments.
13.4 Decay law for moving particles
In equation (13.34) we found the decay law (0, t) observed from the reference
frame O at rest. In the present section we will derive an exact formula for
the decay law (, t) in a moving frame O
= t = 0,
where t
Kx
[
The time dependence of this state is
[(, t
) = e
Ht
e
ic
Kx
[ (13.89)
According to the general formula (13.6), the decay law from the point of view
of O
is
(, t
) = (, t
)[T[(, t
) (13.90)
= |T[(, t
)|
2
(13.91)
448 CHAPTER 13. DECAYS AND RADIATION
Let us use the basis set decomposition (13.31) of the state vector [.
Then, applying equations (13.89), (13.23) and (13.24) we obtain
[(, t
) =
_
dp(p)e
Ht
e
ic
Kx
[p
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
Ht
e
ic
Kx
[p, m
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
p
t
p
[p, m
The inner product of this vector with [q can be found with the help of
(13.32) and new integration variables r = p
q[(, t
)
=
_
dp(p)
_
m
b
+mc
dm(m)(p, m)e
p
t
q[p, m
_
p
=
_
dp(p)
_
m
b
+mc
dm[(m)[
2
(p, m)
(p, m)e
p
t
(q p)
_
p
=
_
m
b
+mc
dm
_
dr
1
r
r
_
r
1
r
(
1
r)(
1
r)
(r)[(m)[
2
e
rt
(q r)
=
_
m
b
+mc
dm
_
1
q
q
(
1
q)(
1
q, m)
(q, m)[(m)[
2
e
qt
)
=
_
dq(, t
)[qq[(, t
) =
_
dq[q[(, t
) [
2
13.4. DECAY LAW FOR MOVING PARTICLES 449
=
_
dq
_
m
b
+mc
dm
_
1
q
q
(
1
q)(
1
q, m)
(q, m)[(m)[
2
e
qt
2
(13.92)
which is an exact formula valid for all values of and t
.
13.4.2 Decays of states with denite momentum
In the reference frame at rest ( = 0), formula (13.92) coincides exactly with
our earlier result (13.34)
(0, t) =
_
dq[(q)[
2
_
m
b
+mc
dm[(m)[
2
e
qt
2
(13.93)
In section 13.1 we applied this formula to calculate the decay law of a particle
with zero momentum. Here we will consider the case when the unstable par-
ticle has a non-zero momentum p, i.e., the state is described by a normalized
vector [p) whose wave function is (13.16)
(q) =
_
(q p) (13.94)
From equation (13.93) the decay law for such a state is
|p)
(0, t) =
_
m
b
+mc
dm[(m)[
2
e
pt
2
(13.95)
In a number of works [Ste96, Kha97, Shi04] it was noticed that this result dis-
agrees with Einsteins time dilation formula (I.25). Indeed, if one interprets
the state [p) as a state of unstable particle moving with denite speed
v =
c
2
p
_
m
2
a
c
4
+ p
2
c
2
= c tanh
450 CHAPTER 13. DECAYS AND RADIATION
then the decay law (13.95) cannot be connected with the decay law of the
particle at rest (13.35) by Einsteins formula (I.25)
42
|p)
(0, t) ,=
|0)
(0, t/ cosh) (13.96)
This observation prompted authors of [Ste96, Kha97, Shi04] to question the
applicability of special relativity to particle decays. However, at a closer
inspection it appears that this result does not challenge the special-relativistic
time dilation (I.25) directly. Formula (13.96) is comparing decay laws of two
dierent momentum eigenstates [0) and [p) viewed from the same reference
frame. This is quite dierent from (I.25) which compares observations made
on the same particle from two frames of reference moving with respect to
each other. If from the point of view of observer O the particle is described
by the state vector [0) which has zero momentum and zero velocity, then
from the point of view of O
[0) (13.97)
which is not an eigenstate of the momentum operator P
0
. So, strictly speak-
ing, formula (13.95) is not applicable to this state. However, it is not dicult
to see that (13.97) is an eigenstate of the velocity operator [Shi06]. Indeed,
taking into account V
x
[0) = 0 and equations (4.3) - (4.4), we obtain
V
x
e
ic
Kx
[0) = e
ic
Kx
e
ic
Kx
V
x
e
ic
Kx
[0) = e
ic
Kx
V
x
c tanh
1
Vx tanh
c
[0)
c tanh e
ic
Kx
[0) (13.98)
A fair comparison with the time dilation formula (I.25) requires consideration
of unstable states having denite values of velocity for both observers. This
will be done in subsection 13.4.4.
13.4.3 Decay law in the moving reference frame
Let us now calculate decay laws in dierent reference frames using equation
(13.92). Unfortunately exact computation for ,= 0 is not possible, so we
42
In subsection 13.5.1 we will illustrate this inequality with numerical calculations.
13.4. DECAY LAW FOR MOVING PARTICLES 451
need to make approximations. To see what kinds of approximations may
be appropriate, let us discuss properties of the initial state [ 1
a
in
more detail. First, in all realistic cases this is not an exact eigenstate of
the total momentum operator: the wave function of the unstable particle
is not localized at one point in the momentum space (as was assumed, for
example, in (13.94)) but has a spread (or uncertainty) of momentum [p[
and, correspondingly, an uncertainty of position [r[ /[p[. Second, the
state [ 1
a
is not an eigenstate of the mass operator M. The initial
state [ is characterized by the uncertainty of mass (see Fig. 13.4) that
is related to the particles lifetime (
0
) by formula (13.67). It is important
to note that in all cases of practical interest the mentioned uncertainties are
related by inequalities
[p[ c (13.99)
[r[ c
0
(13.100)
In particular, the latter inequality means that the uncertainty of position
is mush less than the distance passed by light during the lifetime of the
particle. For example, in the case of muon
0
2.2 10
6
s and, accord-
ing to (13.100), the spread of the wave function in the position space must
be much less than 600m, which is a reasonable assumption. Therefore, we
can safely assume that factor [(m)[
2
in (13.92) has a sharp peak near the
value m = m
a
. Then we can move the value of the smooth
43
function
_
q
q
(q)(q, m)
(q, m) at m = m
a
outside the integral on m
(, t
_
dq
1
q
q
(
1
q)(
1
q, m
a
)
(q, m
a
)
_
m
b
+mc
dm[(m)[
2
e
qt
2
=
_
dq
L
1
q
q
[(L
1
q)[
2
_
m
b
+mc
dm[(m)[
2
e
qt
2
43
see discussion after equation (13.26)
452 CHAPTER 13. DECAYS AND RADIATION
=
_
dp[(p)[
2
_
m
b
+mc
dm[(m)[
2
e
Lp
t
2
(13.101)
where p is given by equation (13.25) and Lp = (p
x
cosh +
p
c
sinh , p
y
, p
z
).
This is our nal result for the decay law of a particle in a moving reference
frame.
13.4.4 Decays of states with denite velocity
Next we consider an initial state which has zero velocity from the point of
view of observer O. The wave function of this state is localized near zero
momentum p = 0. So, we can set in equation (13.101)
44
[(p)[
2
(p) (13.102)
and obtain
|0)
(, t
_
m
b
+mc
dm[(m)[
2
e
it
m
2
c
4
+m
2
a
c
4
sinh
2
2
(13.103)
If we approximately identify m
a
c sinh with the momentum p of the particle
a from the point of view of the moving observer O
45
then
|0)
(, t
_
m
b
+mc
dm[(m)[
2
e
pt
2
(13.104)
So, in this approximation the decay law (13.104) in the frame of reference O
moving with the speed c tanh takes the same form as the decay law (13.95)
of a particle moving with momentum m
a
c sinh with respect to the stationary
observer O.
46
In the next section we will evaluate (13.104) numerically.
44
As we mentioned in the preceding subsection, in reality this state is not exactly an
eigenstate of momentum (velocity) [0). However, its wave function is still much better
localized in the p-space than the slowly varying second factor under the integral in (13.101),
so, approximation (13.102) is justied.
45
From the point of view of this observer, particles velocity is c tanh, however its
momentum is not well-dened due to the uncertainty of mass.
46
Note the contradiction between this result and conclusions of ref. [Shi06].
13.5. TIME DILATION IN DECAYS 453
13.5 Time dilation in decays
In this section we will present a specic example, in which predictions of our
RQD approach deviate from special relativity. In particular, we will demon-
strate the approximate character of the Einsteins time dilation formula
(I.25) for decays of fast moving particles.
13.5.1 Numerical results
In this subsection we will calculate the dierence between the accurate quan-
tum mechanical result (13.104) and the special-relativistic time dilation for-
mula (I.25)
SR
|0)
(, t) =
|0)
_
0,
t
cosh
_
(13.105)
In this calculation we assume that the mass distribution [(m)[
2
of the un-
stable particle has the Breit-Wigner form
47
[(m)[
2
=
_
_
_
/2
2
/4+(mma)
2
, if m m
b
+ m
c
0, if m < m
b
+ m
c
(13.106)
where parameter is a factor that ensures the normalization to unity
_
m
b
+mc
[(m)[
2
= 1
The following parameters of this distribution were chosen: The mass of the
unstable particle was m
a
= 1000 MeV/c
2
, the total mass of the decay prod-
ucts was m
b
+m
c
= 900 MeV/c
2
and the width of the mass distribution was
= 20 MeV/c
2
. These values do not correspond to any real particle, but
they are typical for strongly decaying baryon resonances.
It is convenient to measure time in units of the lifetime
0
cosh . Denoting
t/(
0
cosh ), we nd that special-relativistic decay laws (13.105) for any
47
see equation (13.62) and Fig. 13.4
454 CHAPTER 13. DECAYS AND RADIATION
|0)
(,)
SR
()
=10
=1.4
=0.2
0.001
0.002
0.001
0.002
11
22 33 44 5 5 6 6
SR
()
Figure 13.7: Corrections to the Einsteins time dilation formula (I.25)
for the decay law of unstable particle moving with the speed v = c tanh.
Parameter is time measured in units of
0
/ cosh .
rapidity are given by the same universal function
SR
(). This function as
well as equation (13.95) were evaluated for values of in the interval from 0
to 6 with the step of 0.1. Calculations were performed by direct numerical
integration of equation (13.95) using the Mathematica program shown below
gamma = 20
mass = 1000
theta = 0.0
Do[Print[(1/0.9375349) Abs[NIntegrate [gamma/(2 Pi) / (gamma^2/4 +(x
- mass)^2) Exp[ I t Sqrt [x^2 + mass^2 (Sinh [theta])^2] Cosh
[theta] / gamma], {x, 900, 1010, 1100, 300000}, MinRecursion -> 3,
MaxRecursion -> 16, PrecisionGoal -> 8, WorkingPrecision -> 18]]^2],
{t, 0.0, 6.0, 0.1}]
As expected, function
SR
() (shown by the thick solid line in Fig. 13.7) is
very close to the exponent e
Kx
[ 1
a
Then the subspace 1
a
is invariant under action of boosts e
ic
Kx
, which means
that operator K
x
commutes with the projection T on the subspace 1
a
. Then
49
see subsection 15.3.6
50
see Appendix I.5
51
This assertion does not hold in our RQD theory, as illustrated in Fig. 15.3.
13.5. TIME DILATION IN DECAYS 457
from Poincare commutator (3.57) and [T, P
0x
] = 0 it follows by Jacobi iden-
tity that
[T, H] =
ic
2
[T, [K
x
, P
0x
]] =
ic
2
[K
x
, [T, P
0x
]]
ic
2
[P
0x
, [T, K
x
]]
= 0
which contradicts the fundamental property (13.10) of unstable systems.
This contradiction implies that, in fact, the state e
ic
Kx
[ does not cor-
respond to the particle a with 100% probability. This state must contain
contributions from decay products even at t = 0
e
ic
Kx
[ / 1
a
(13.108)
(, 0) < 1, for ,= 0 (13.109)
This is the decay caused by boost, which means that special-relativistic
equations (I.25) and (13.107) are not accurate and that boosts of the observer
have a non-trivial eect on the internal state of the observed unstable system.
The presence of decays caused by boosts means that particle composi-
tion of systems involving unstable states is not a relativistic invariant. For
example, one should be careful when making assertions like this one:
Flavor is the quantum number that distinguishes the dierent
types of quarks and leptons. It is a Lorentz invariant quantity.
For example, an electron is seen as an electron by any observer,
never as a muon. C. Giunti and M. Lavender [GL]
Although this statement about the electron is correct (because the electron
is a stable particle), it is not true about the muon. According to (13.108)
an unstable muon can be seen as a single particle by the observer at rest
and as a group of three decay products (an electron, a neutrino
and an
antineutrino
e
) by a moving observer.
In spite of its fundamental importance, the eect of boosts on the non-
decay probability is very small. For example, our rather accurate approxi-
mation (13.101) failed to catch this eect. Indeed, for t = 0 this formula
predicted
458 CHAPTER 13. DECAYS AND RADIATION
(, 0) =
_
dp[(p)[
2
_
m
b
+mc
dm[(m)[
2
2
= 1
instead of the expected (, 0) < 1.
13.5.3 Particle decays in dierent forms of dynamics
Throughout this section we assumed that interaction responsible for the de-
cay belongs to the Bakamjian-Thomas instant form of dynamics. However, as
we saw in subsection 6.3.5, the Bakamjian-Thomas form does not allow sep-
arable interactions, so, most likely, this is not the form preferred by nature.
Therefore, it is important to calculate decay laws in non-Bakamjian-Thomas
instant forms of dynamics as well. Although no such calculations have been
done yet, one can say with certainty that there is no form of interaction in
which special-relativistic result (13.105) is exactly valid. This follows from
the fact that in any instant form of dynamics boost operators contain inter-
action terms, so the decays caused by boosts - which contradict equation
(13.105) - are always present.
What if the interaction responsible for the decay has a non-instant form?
Is it possible that there is a form of dynamics in which Einsteins time dilation
formula (I.25) is exactly true? Our answer to this question is No. Let us
consider, for example, the point form of dynamics.
52
In this case the subspace
1
a
of the unstable particle is invariant with respect to boosts, [K
0x
, T] = 0 so
there can be no boost-induced decays (13.108). However, we obtain a rather
surprising relationship between decay laws of the same particle viewed from
the moving reference frame (, t) and from the frame at rest (0, t)
53
(, t) = 0[e
ic
K
0x
e
i
Ht
Te
Ht
e
ic
K
0x
[0
= 0[e
ic
K
0x
e
i
Ht
e
ic
K
0x
ic
K
0x
Te
ic
K
0x
ic
K
0x
Ht
e
ic
K
0x
[0
= 0[e
ic
K
0x
e
i
Ht
e
ic
K
0x
Te
ic
K
0x
Ht
e
ic
K
0x
[0
= 0[e
it
it
H cosh
Te
it
H cosh
[0
= (0, t cosh)
where the last equality follows from comparison with equation (13.9). This
means that the decay rate in the moving frame is cosh times faster than
that in the rest frame. This is in direct contradiction with experiments.
The point form of dynamics is not acceptable for the description of de-
cays for yet another reason. Due to the interaction-dependence of the total
momentum operator (15.37), one should expect decays induced by space
translations
e
i
Pxa
[ / 1
a
, for a ,= 0 (13.110)
Translation-induced and/or rotation-induced decays are expected in all forms
of dynamics (except the instant form). This is in contradiction with our
experience, which suggests that the composition of an unstable particle is
not aected by these kinematical transformations. Therefore only the instant
form of dynamics is appropriate for the description of particle decays.
13.6 Radiative corrections
So far our studies conrmed that the 2nd order dressed particle Hamiltonian
(12.10) - (12.15) provides a very good approximation for electromagnetic in-
teractions between charged particles. In particular, in section 12.2 we used
this Hamiltonian to calculate the energy spectrum of the hydrogen atom.
There are two ways how these results can be improved. One possibility is to
go beyond the (v/c)
2
approximation as was indicated, for example, in section
14.5. Another direction is to nd interaction potentials in higher perturba-
tion orders. This direction is the topic of the present section. We are going
to derive 4th order radiative corrections to the electron-proton interaction
potential. In subsection 13.6.1 we will outline the general technique for t-
ting high-perturbation-order dressed particle potentials to the S-matrix from
the traditional renormalized QED. Then we will derive 4th order radiative
corrections to the electron-proton interaction potential. In particular, we
will see how these corrections are responsible for such well-known eects as
the electrons anomalous magnetic moment and Lamb shifts in the hydrogen
atom.
460 CHAPTER 13. DECAYS AND RADIATION
13.6.1 Fitting particle potentials to the S-matrix
In section 11.2 we derived the 2nd order dressed particle Hamiltonian H
d
by
applying the unitary dressing transformation to the eld-based Hamiltonian
of QED. However, this approach is hardly applicable to higher perturbation
orders. The source of the problem is in the fact that dressing transforma-
tion requires separate processing of terms of dierent types (unphys, renorm,
phys). To achieve that, one needs to keep all operators expressed through cre-
ation and annihilation operators as in (8.49) - (8.50). In this representation
even the original QED Hamiltonian takes rather inconvenient cumbersome
form shown in Appendix L. The task of dressing transformation is further
complicated by the necessity to calculate commutators of these long expres-
sions.
Fortunately, there is a much simpler approach, which leads to the same
desired result for the dressed Hamiltonian H
d
. This approach is based on
formulas from subsection 11.2.7. Suppose, for example, that we want to nd
the 4th order contribution V
d
4
to the electron-proton dressed interaction. To
do that we can use equation (11.32) and write
V
d
4
..
S
c
4
V
d
2
V
d
2
. .
(13.111)
All terms on the right hand side can be found relatively easily. The 4th order
scattering matrix S
c
4
has been calculated in (10.64). In the next subsection
we will calculate the product term in (13.111).
13.6.2 Product term in (13.111)
In (13.111) V
d
2
is the usual non-relativistic Coulomb potential
54
V
d
2
(t)
54
This is the leading term in the momentum representation of the Darwin-Breit inter-
action (12.7) with a screening provided by the photon mass . Here we are interested
only in the dominant infrared-divergent term in the product V
d
2
V
d
2
, so we will work in
the non-relativistic approximation from Appendix J.9 and thus assume that momenta of
interacting particles are much less that mc. Accordingly, we will omit all terms contain-
ing positive powers of p, q and/or
k. In this approximation the product V
d
2
V
d
2
becomes
spin-independent. So, we disregard spins and drop spin labels of particle states. The same
approximations will be employed for other infrared-divergent terms.
13.6. RADIATIVE CORRECTIONS 461
=
e
2
2
(2)
3
_
dpdqdp
dq
(p +q p
)e
i
(p+q
p
q
)t
(q q
)
2
+
2
c
2
d
p
a
q
d
p
a
q
and V
d
2
is obtained with the help of (8.61)
V
d
2
(t)
=
e
2
2
(2)
3
_
dtdsdt
ds
(t +s t
)e
i
(t+s
t
s
)t
[(s s
)
2
+
2
c
2
][
s
s
+
t
t
]
d
t
a
s
d
t
a
s
Using
d
p
a
q
d
p
a
q
d
t
a
s
d
t
a
s
= (d
p
d
p
d
t
d
t
)(a
q
a
q
a
s
a
s
)
= (d
p
d
t
d
p
d
t
+ d
p
d
t
(t p
))(a
q
a
s
a
q
a
s
+ a
q
a
s
(s q
))
= d
p
a
q
d
t
a
s
(t p
)(s q
) + . . . (13.112)
we obtain
55
V
d
2
(t)V
d
2
(t)
=
e
4
4
(2)
6
_
dpdqdp
dq
dsdtds
dt
p
a
q
d
p
a
q
d
t
a
s
d
t
a
s
(p +q p
)(s +t s
)
[(q q
)
2
+
2
c
2
][
s
s
+
t
t
][(s s
)
2
+
2
c
2
]
(13.113)
=
e
4
4
(2)
6
_
dpdqdp
dq
dsdtds
dt
p
a
q
d
t
a
s
(p +q p
)(s +t s
)(q
s)(p
t)
[(q q
)
2
+
2
c
2
][
s
s
+
t
t
][(s s
)
2
+
2
c
2
]
55
For denition of the underbrace sign see (8.66). As we have mentioned in subsection
11.3.1, operators V
d
2
and V
d
2
are well-dened only on the energy shell. However, momentum
integrations in (13.113) include regions outside the energy shell, where the integrands are
not known precisely. (See discussion in subsection 11.2.5.) Nevertheless, this uncertainty
does not seem to be signicant, because here we are interested only in the leading infrared-
divergent contribution, which comes from integration in a small region near the k
q
4
(2)
6
_
dqdpdp
dq
(q +p p
)d
p
a
q
d
p
a
q
_
ds
[(q s)
2
+
2
c
2
][
s
q
+
q+ps
p
][(s q
)
2
+
2
c
2
]
Next we introduced non-relativistic approximations
q
mc
2
+q
2
/(2m) and
p
p
Mc
2
. We also choose the center-of-mass frame, where the heavy
proton remains motionless and the electron scattering is elastic: q = q
.
56
e
4
4
(2)
6
_
ds
[(q s)
2
+
2
c
2
][(s q
)
2
+
2
c
2
]
1
2e
4
4
m
(2)
6
_
ds
[(q s)
2
+
2
c
2
][s
2
q
2
i][(q
s)
2
+
2
c
2
]
This integral is calculated in equation (M.53) in Appendix M.7. Noting also
that the underbrace sign introduces an additional factor of 2i, we nd
that the coecient function of the operator V
d
2
(t)V
d
2
(t)
. .
is
2
mc
2
qk
2
ln
_
k
2
2
c
2
_
(13.114)
This means that the product term (13.114) cancels ladder and crossed ladder
diagrams, i.e., the third term in square brackets in (10.64).
57
Then for the
left hand side of (13.111) we obtain
0[a
q,
d
p,
V
d
4
..
d
[0
4
( q q
+ p)
_
i
2
15
2
m
2
c
(
el
[k q])
4
2
m
2
ck
2
+
i
2
3
2
m
2
c
ln
_
m
_
_
(13.115)
56
Here is a small positive constant, which should be taken to 0 at the end of calcula-
tions.
57
This observation justies the omission of contributions (13.114) and (10.63) in most
textbook calculations of the Lamb shift.
13.6. RADIATIVE CORRECTIONS 463
13.6.3 Radiative corrections to the Coulomb potential
Equation (13.115) gives us the 4th order interaction V
d
4
only on the energy
shell. Outside the energy shell we can adopt the usual assumption about
the near constancy of the coecient function.
58
Then the momentum-space
coecient function of the operator V
d
4
has three components
59
v
d
4
(p, q, k; , ,
)
=
,
,
30
3
m
2
c
_
_
e
2
2
(2)
3
i
el
[k q]
4m
2
c
2
k
2
,
2
,
,
6
3
m
2
c
ln
_
m
_
The corresponding position-space potential is obtained using the Fourier
transform, formulas from Appendix B and S
el
=
el
/2.
V
d
4
(q, r, S
el
) =
8
3
2
30m
2
c
(r) +
e
2
[r q] S
el
8m
2
c
2
r
3
_
4
3
2
3m
2
c
ln
_
m
_
(r)
(13.116)
In a theory of electron-proton interaction valid to the 4th perturbation order
this expression must be added to the 2nd order interaction operator (12.9),
thus leading to the following potential that depends on the position, momen-
tum and spin of the electron
V
d
2+4
(q, r, S
el
) =
e
2
4r
+
e
2
[r q] S
el
8m
2
c
2
r
3
_
1 +
_
+
e
2
2
8c
2
m
2
_
1
8
15
_
(r)
4
3
2
3m
2
c
ln
_
m
_
(r) (13.117)
The 1st term is the usual Coulomb potential. The 2nd term describes the
spin-orbit interaction of the electrons spin with its own momentum. The
3rd and 4th terms are contact potentials. The latter one is rather troubling:
it diverges in the limit of zero photon mass 0. From the point of
view of classical electrodynamics,
60
this is not a reason for concern, because
58
See condition
i
1 in subsections 11.2.5 - 11.2.6 and discussion in subsection 11.3.1.
59
They are obtained by dividing the coecient function in (13.115) by (2i). Compare
with (12.7).
60
See chapter 14.
464 CHAPTER 13. DECAYS AND RADIATION
such short-range potentials do not aect macroscopic dynamics of point-
like charges. However, this potential has innite eect on eigenstates and
eigenvalues of quantum compound systems, e.g., the hydrogen atom. This
problem will be solved in the next subsection.
13.6.4 Lamb shift
As we saw in (12.26), in the 2nd order theory hydrogen levels 2S
1/2
and 2P
1/2
have exactly the same energies. However, in 1947 Lamb and Retherford found
a small gap between the two levels, which is now known as the Lamb shift.
Its modern experimental value is [WHSK
+
95]
2S
1/2
2P
1/2 = 4.37 10
6
eV
= 2 1057.8 MHz (13.118)
The presence of the Lamb shift was completely unexpected from the point
of view of the quantum theory available at the time. Attempts to explain
this eect played an important role in the development of quantum elec-
trodynamics. Successful calculation of the shift value (13.118) was a major
triumph of QED. Here we are going to calculate the Lamb shift within our
dressed particle approach.
The eects of the 4th order potentials (13.116) on energies of the 2S
1/2
and 2P
1/2
hydrogen states can be calculated using perturbation theory, as in
section 12.2. The results are collected in Table 13.1.
The short-range contact potential in the 1st term in (13.116) shifts only
the s-state, whose wave function is non-zero at the origin
contact
2S
1/2
=
8
3
2
30m
2
c
[
2S
(0)[
2
=
mc
2
5
30
The energy correction due to the last term in (13.116)
vertex
2S
1/2
=
4
3
2
3m
2
c
ln
_
m
_
[
2S
(0)[
2
=
mc
2
5
6
ln
_
m
_
(13.119)
diverges in the limit 0, which seems to be unphysical. Luckily, there
is another interaction that cancels this divergence. This is the 3rd order
13.6. RADIATIVE CORRECTIONS 465
Table 13.1: 4th order perturbative energy corrections to 2S
1/2
and 2P
1/2
energy levels of the hydrogen atom.
contribution potential
2S
1/2
2P
1/2
contact (13.116)
e
2
30m
2
c
2
(r)
mc
2
5
30
0
vertex (13.116)
4
3
2
3m
2
c
ln
_
m
_
(r)
mc
2
5
6
ln
_
m
_
0
emission (13.88)
mc
2
5
6
ln
_
17.6
2
m
_
0
spin-orbit (13.116)
e
2
[rq]S
el
16
2
m
2
c
2
r
3
0
mc
2
5
48
Total correction
mc
2
5
6
_
1
5
ln(17.6
2
)
mc
2
5
48
bremsstrahlung potential,
61
(13.81) which induces the energy shift (13.88)
emission
2S
1/2
=
mc
2
5
6
ln
_
16.64
2
m
_
The only interaction aecting the 2P
1/2
level is the 2nd term in (13.116). The
corresponding energy shift can be evaluated by the method from subsection
12.2.3.
Finally, within our approximations, the full 4th order contribution to the
Lamb shift
2S
1/2
2P
1/2 =
contact
2S
1/2
+
emission
2S
1/2
+
vertex
2S
1/2
spinorbit
2P
1/2
=
mc
2
5
6
_
3
40
ln(16.64
2
)
_
= 3.91 10
6
eV
= 2 945 MHz
is in a good agreement with the experimental value (13.118). Relative posi-
tions of lowest energy levels of the hydrogen atom are shown in Fig. 12.1.
61
This interaction couples the electron+proton subspace of the Fock space with the
electron+proton+photon subspace. So, strictly speaking, this is not a true electron-
proton potential.
466 CHAPTER 13. DECAYS AND RADIATION
Let us stress once again that in RQD we do not assume the existence of
virtual particles or non-trivial vacuum. Therefore, we explain the high-order
eects entirely in terms of small corrections to inter-particle potentials with-
out any reference to virtual particle exchanges, vacuum polarization and
other eld-theoretical terminology. In the literature one can nd a number
of similar eective particle approaches [Hol04, PS98, PS, GR80, GRI89,
FS88], which use inter-particle potentials with radiative corrections.
13.6.5 Electrons anomalous magnetic moment
As we discussed in chapter 10, renormalization has no eect on the electrons
charge, because this is forbidden by the charge renormalization condition
postulate 10.2. However, there is another electrons property - the magnetic
moment - which is not restricted by any postulate. The eect of renormaliza-
tion on the electrons magnetic moment was rst calculated by Schwinger in
1948. This was a major triumph of the renormalized QED. In this subsection
we are going to reproduce this result within our dressed particle approach.
The electrons magnetic moment manifests itself by the electrons dy-
namics in external magnetic elds.
62
In our electron-proton system, in
principle, the proton can play the role of a source of such a magnetic eld.
Unfortunately, so far we assumed that the proton is innitely massive and
motionless, so we have lost this eect completely. In order to have a model
of electron-magnet interaction, let us consider the eect of nite proton mass
M < . In the 2nd perturbation order the relevant potential was obtained
in equation (12.14)
V
so
=
e
2
[r p] S
el
4Mmc
2
r
3
(13.120)
where p is the protons momentum. It is customary to dene the electrons
magnetic moment and its interaction with a moving charge e by formulas
63
el
geS
el
2mc
V
so
=
e[r p]
el
4Mcr
3
62
Experimental manifestations of particle magnetic moments will be discussed in more
detail in chapter 14.
63
see equation (11.100) in [Jac99]
13.6. RADIATIVE CORRECTIONS 467
where g is the so-called gyromagnetic ratio or simply the g-factor. Thus,
in the 2nd perturbation order we have g = 2. In higher orders we expect
some corrections to this value. Here we will be interested in the 4th order
correction
4
, such that
g = 2(1 +
4
) (13.121)
Recall that interaction (13.120) has resulted from the momentum-space
coecient function in (12.7)
v
2so
(p, q, k; , ,
) =
ie
2
(2)
3
(el)
el
[k p]
2Mmc
2
k
2
_
(el)
Now our plan is to nd a 4th order terms with the similar structure. One
can easily see that the only relevant S-matrix term is (10.48)
s
4so
(p, q, k; , ,
) =
ic
2
2
2
k
2
U(q +k, ; q,
) W(p k, ; p,
)
Next we apply the (v/c)
2
approximation from Appendix J.9, introduce the
usual factor 1/(2i) required for the potential coecient function and ob-
tain
v
4so
=
ic
2
(2i)2
2
k
2
(el)
_
i[
el
k]
2mc
_
_
2p k
2Mc
_
(el)
=
ie
2
(2)
3
_
2
_
(el)
el
[k p]
2Mmc
2
k
2
_
(el)
This 4th order potential diers from the 2nd order potential (13.122) only
by the factor /(2). This is the value of
4
in (13.121). So, in our approxi-
mation, the g-factor is
g = 2
_
1 +
2
_
2.0023 (13.122)
which is the standard 4th order QED result.
468 CHAPTER 13. DECAYS AND RADIATION
Chapter 14
CLASSICAL
ELECTRODYNAMICS
All of physics is either impossible or trivial. It is impossible until
you understand it and then it becomes trivial.
Ernest Rutherford
In this chapter we will apply the direct interaction approach to classical
electrodynamics. Our goal is to show that this is a plausible alternative to
the standard theory based on Maxwells equations.
14.1 Hamiltonian formulation
The central idea of Maxwells electrodynamics is that charged particles in-
teract with each other indirectly via electric and magnetic elds and that
electromagnetic radiation is an electromagnetic eld varying in time and
space. In this book we are challenging this universally accepted point of
view. Our main concern is that such primary ingredients of the classical
theory as Maxwell elds, Lienard-Wiechert potentials and the Lorentz force
law cannot be expressed in the language of Poincare group generators, e.g.,
the Hamiltonian. In our opinion, this is the universal language in which
all physical theories ought to be formulated. In particular, Poincare group
469
470 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
representations are essential for ensuring conservation laws and correct trans-
formations of observables between dierent reference frames. In Maxwells
approach the validity of these important requirements is not obvious at all.
We argue that all results of conventional classical electromagnetic theory
can be equally well (or even better) explained from the viewpoint of Hamil-
tonian dynamics of charged particles with direct interactions, where elds
are not involved at all. In our approach light is described as a ow of massless
particles photons, rather than the transverse electromagnetic wave.
In section 12 we already derived the Darwin-Breit Hamiltonian (12.10) for
charged particles as an approximation to the full-edged RQD. This Hamil-
tonian was obtained in the 2nd order perturbation theory within the (v/c)
2
approximation. Our goal now is to demonstrate that this Hamiltonian can
be used successfully even in classical (non-quantum) approximation. Then
it provides a reasonably accurate description of electromagnetic processes
in which acceleration of charged particles is low, so that one can neglect
the emission of electromagnetic radiation (photons). The Darwin-Breit ap-
proach adopted here is fundamentally dierent from the generally accepted
Maxwells theory.
1
In the Darwin-Breit approach charged particles interact
via instantaneous potentials; there are no electromagnetic elds and no spe-
cic eld energy associated with them. In spite of these dierences, we
will see that in many cases it is very dicult to distinguish these two ap-
proaches experimentally, as both of them lead to very similar predictions.
We will also nd situations
2
in which the traditional Maxwells theory leads
to contradictions and paradoxes. These paradoxes nd their resolution in
the Darwin-Breit electrodynamics.
In this chapter, we will be working in the classical approximation: ig-
noring all quantum eects,
3
not paying attention to the order of dynamical
variables in their products and using Poisson brackets [. . . , . . .]
P
instead of
quantum commutators (i/)[. . . , . . .]. We will also represent all quantities
as series in powers of v/c and leave only terms whose order is not higher than
1
This classical theory of electromagnetic phenomena should be called more appropri-
ately Maxwell-Abraham-Lorentz theory, because it was signicantly modied by Abra-
ham and Lorentz in the beginning of the 20th century to take into account the existence
(not known to Maxwell) of pointlike charged elementary particles - electrons. However,
for brevity, we will call this theory by the name of its original inventor.
2
See sections 14.2 - 14.5.
3
The only exception is our discussion of the Aharonov-Bohm eect in section 14.4.
14.1. HAMILTONIAN FORMULATION 471
(v/c)
2
.
4
14.1.1 Darwin-Breit Hamiltonian
The Darwin-Breit Hamiltonian H = H
0
+ V for a system of two charges q
1
and q
2
consists of the free part the free part is
H
0
= h
1
+ h
2
=
_
m
2
1
c
4
+ p
2
1
c
2
+
_
m
2
2
c
4
+ p
2
2
c
2
m
1
c
2
+ m
2
c
2
+
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
(14.1)
and the potential energy V (12.11) - (12.15)
5
V
q
1
q
2
4r
q
1
q
2
8m
1
m
2
c
2
r
_
(p
1
p
2
) +
(p
1
r)(p
2
r)
r
2
_
q
1
q
2
[r p
1
] s
1
8m
2
1
c
2
r
3
+
q
1
q
2
[r p
2
] s
2
8m
2
2
c
2
r
3
+
q
1
q
2
[r p
2
] s
1
4m
1
m
2
c
2
r
3
q
1
q
2
[r p
1
] s
2
4m
1
m
2
c
2
r
3
+
q
1
q
2
(s
1
s
2
)
4m
1
m
2
c
2
r
3
3q
1
q
2
(s
1
r)(s
2
r)
4m
1
m
2
c
2
r
5
(14.2)
In order to use this Hamiltonian in practical calculations, we introduce
few adjustments. First, we omit the rest energies of the two particles, because
they have no eect on dynamics. Second, we notice that particle spins s
i
are not easily measurable in classical experiments. It is more convenient to
replace them with magnetic moments
i
, which are known to be proportional
to spins. This dependence includes anomalous contributions, i.e., those not
described by the classical formula
i
= q
i
s
i
/(m
i
c). The electrons anomalous
4
This restriction will be lifted in section 14.5.
5
We denote r r
1
r
2
throughout this chapter. In the case of electron-proton system,
the charges are q
1
= e and q
2
= +e. Contact terms proportional to (r) are not relevant
for classical mechanics and are omitted. We also omitted 3rd and 4th order corrections to
this Hamiltonian that were derived in sections 13.3 and 13.6. In Appendix N.3 we veried
that with a properly chosen boost generator K = K
0
+ Z the Hamiltonian H = H
0
+ V
satises all Poincare Lie algebra relationships within the (v/c)
2
approximation.
472 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
magnetic moment has been discussed in subsection 13.6.5. We will not dwell
on this issue here and simply postulate that the full Hamiltonian for two
charged spinning classical particles takes the form
H =
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
+
q
1
q
2
4r
q
1
q
2
8m
1
m
2
c
2
r
_
p
1
p
2
+
(r p
2
)(r p
1
)
r
2
_
q
1
[r p
1
]
2
4m
1
cr
3
+
q
1
[r p
2
]
2
8m
2
cr
3
q
2
[r p
1
]
1
8m
1
cr
3
+
q
2
[r p
2
]
1
4m
2
cr
3
+
(
1
2
)
4r
3
3(
1
r)(
2
r)
4r
5
(14.3)
14.1.2 Two charges
Let us consider a system of two spinless charged particles. The full Hamilto-
nian of this system (which is called the Darwin Hamiltonian) is obtained by
dropping spin-dependent terms from the Darwin-Breit Hamiltonian (14.3)
H =
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
+
q
1
q
2
4r
q
1
q
2
8m
1
m
2
c
2
r
_
(p
1
p
2
) +
(p
1
r)(p
2
r)
r
2
_
(14.4)
This Hamiltonian fully determines the dynamics in the system via Hamiltons
equations of motion (6.99) - (6.100) and Poisson brackets (6.96). The time
derivative of the rst particles momentum can be obtained from the rst
Hamiltons equation
dp
1
dt
= [p
1
, H]
P
=
H
r
1
=
q
1
q
2
r
4r
3
q
1
q
2
(p
1
p
2
)r
8m
1
m
2
c
2
r
3
+
q
1
q
2
p
1
(p
2
r)
8m
1
m
2
c
2
r
3
+
q
1
q
2
(p
1
r)p
2
8m
1
m
2
c
2
r
3
3q
1
q
2
(p
1
r)(p
2
r)r
8m
1
m
2
c
2
r
5
(14.5)
14.1. HAMILTONIAN FORMULATION 473
Since Hamiltonian (14.4) is symmetric with respect to permutations of the
two particles, we can obtain the time derivative of the second particles mo-
mentum by replacing indices 1 2 in (14.5)
dp
2
dt
=
dp
1
dt
(14.6)
Velocities of particles 1 and 2 are obtained from the second Hamiltons equa-
tion
6
v
1
dr
1
dt
= [r
1
, H]
P
=
H
p
1
=
p
1
m
1
p
2
1
p
1
2m
3
1
c
2
q
1
q
2
p
2
8m
1
m
2
c
2
r
q
1
q
2
(p
2
r)r
8m
1
m
2
c
2
r
3
(14.7)
v
2
dr
2
dt
=
p
2
m
2
p
2
2
p
2
2m
3
2
c
2
q
1
q
2
p
1
8m
1
m
2
c
2
r
q
1
q
2
(p
1
r)r
8m
1
m
2
c
2
r
3
(14.8)
From these results we can calculate second time derivatives of particle posi-
tions (=accelerations)
7
d
2
r
1
dt
2
=
p
1
m
1
p
2
1
p
1
2m
3
1
c
2
2(p
1
p
1
)p
1
2m
3
1
c
2
+
q
1
q
2
p
2
(r r)
8m
1
m
2
c
2
r
3
q
1
q
2
(p
2
r)r
8m
1
m
2
c
2
r
3
+
3q
1
q
2
(p
2
r)r(r r)
8m
1
m
2
c
2
r
5
q
1
q
2
(p
2
r) r
8m
1
m
2
c
2
r
3
q
1
q
2
r
4m
1
r
3
q
1
q
2
(p
1
p
2
)r
8m
2
1
m
2
c
2
r
3
+
q
1
q
2
p
1
(p
2
r)
8m
2
1
m
2
c
2
r
3
+
q
1
q
2
(p
1
r)p
2
8m
2
1
m
2
c
2
r
3
3q
1
q
2
(p
1
r)(p
2
r)r
8m
2
1
m
2
c
2
r
5
p
2
1
2m
2
1
c
2
q
1
q
2
r
4m
1
r
3
2q
1
q
2
(p
1
r)p
1
8m
3
1
c
2
r
3
+
q
1
q
2
p
2
(r p
1
)
8m
2
1
m
2
c
2
r
3
q
1
q
2
p
2
(r p
2
)
8m
1
m
2
2
c
2
r
3
q
1
q
2
(p
2
p
1
)r
8m
2
1
m
2
c
2
r
3
+
q
1
q
2
(p
2
p
2
)r
8m
1
m
2
2
c
2
r
3
+
3q
1
q
2
(p
2
r)r(r p
1
)
8m
2
1
m
2
c
2
r
5
3q
1
q
2
(p
2
r)r(r p
2
)
8m
1
m
2
2
c
2
r
5
q
1
q
2
(p
2
r)p
1
8m
2
1
m
2
c
2
r
3
+
q
1
q
2
(p
2
r)p
2
8m
1
m
2
2
c
2
r
3
6
This relationship between velocity and momentum is interaction-dependent because
interaction energy in (14.4) is momentum-dependent.
7
In this derivation we omitted terms proportional to q
2
1
q
2
2
due to their smallness (for-
mally they belong to the 4th perturbation order). Also keeping the accuracy of (v/c)
2
we
can set r =
dr1
dt
dr2
dt
v
1
v
2
p1
m1
p2
m2
in terms that already have the factor (1/c)
2
.
474 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
=
q
1
q
2
r
4m
1
r
3
+
q
1
q
2
(v
1
v
2
)
2
r
8m
1
c
2
r
3
q
1
q
2
v
2
1
r
4m
1
c
2
r
3
+
q
1
q
2
(v
1
r)(v
2
v
1
)
4m
1
c
2
r
3
3q
1
q
2
(v
2
r)
2
r
8m
1
c
2
r
5
(14.9)
d
2
r
2
dt
2
q
1
q
2
r
4m
2
r
3
q
1
q
2
(v
1
v
2
)
2
r
8m
2
c
2
r
3
+
q
1
q
2
v
2
2
r
4m
2
c
2
r
3
q
1
q
2
(v
2
r)(v
1
v
2
)
4m
2
c
2
r
3
+
3q
1
q
2
(v
1
r)
2
r
8m
2
c
2
r
5
(14.10)
14.1.3 Denition of force
There are two denitions of force commonly used in classical mechanics.
In one denition the force acting on a particle is identied with the time
derivative of that particles momentum
f
i
dp
i
dt
(14.11)
In another denition [CV68] the force is a product of the particles rest mass
and its acceleration
8
f
i
m
i
d
2
r
i
dt
2
(14.12)
These two denitions are identical only for not-so-interesting potentials that
do not depend on momenta (or velocities) of particles. In the Darwin-Breit
electrodynamics we are dealing with momentum-dependent potentials, so we
need to decide which denition of force we are going to use.
The usual denition (14.11) has the advantage that the third Newtons
law of motion (the law of action and reaction) in a two-body system has a
simple formulation
9
f
1
= f
2
(14.13)
8
This is equivalent to the second Newtons law of motion.
9
see (14.6)
14.1. HAMILTONIAN FORMULATION 475
This is a trivial consequence of the law of conservation of the total momen-
tum. It follows immediately from the vanishing Poisson bracket [P
0
, H]
P
= 0
in the instant form of relativistic dynamics
10
dp
2
dt
= [p
2
, H]
P
= [P
0
p
1
, H]
P
= [p
1
, H]
P
=
dp
1
dt
(14.14)
Contrary to the usual practice, in this book we will use an alternative
denition of force (14.12). Although this denition does not imply the bal-
ance of forces (14.13), as can be seen from comparing (14.9) and (14.10), it
is preferable for several reasons. First, denition (14.12) is consistent with
the standard notion that equilibrium (or zero acceleration d
2
r/dt
2
= 0) is
achieved when the force vanishes.
11
Second, denition (14.11) is less conve-
nient because it is rather dicult to measure momenta of particles and their
time derivatives in experiments. It is much easier to measure velocities and
accelerations of particles, e.g., by time-of-ight techniques. For example, by
measuring current in a wire we actually measure the amount of charge pass-
ing through the cross-section of the wire in a unit of time. This quantity is
directly related to the velocity of electrons, while it has no direct connection
to electrons momenta.
14.1.4 Wire with current
Experimentally, it is very dicult to isolate two charged particles and mea-
sure their trajectories with the precision sucient to verify theoretical pre-
dictions. In many cases it is more convenient to study behavior of electrons
whose movement is conned inside wires made of conducting materials. In
this subsection we will consider forces acting between electrons in wires and
outside charges.
Let us consider the force exerted by a metal wire on a test charge q
1
located at point r
1
outside the wire and moving with velocity v
1
.
12
There
are two kinds of charges in the wire: xed positive ions of the lattice and
10
Note that in the traditional Maxwells theory the proof of the validity of the third
Newtons law is rather non-trivial. This proof requires introduction of such dubious
notions as hidden momentum and/or momentum of electromagnetic elds [Kel42, PN45,
SJ67, Jef99a].
11
This is the rst Newtons law of motion.
12
Here we ignore the magnetic moment of the test particle:
1
= 0.
476 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
mobile negatively charged electrons. In most cases the total charge of the
ions compensates exactly the total charge of the electrons, so that the wire
is electrically neutral. We assume that the spins (or magnetic moments
2
)
of ions and electrons in the wire are oriented randomly. Therefore, all
2
-
dependent terms in (14.3) vanish after averaging over angles. If the wire
moves as a whole with velocity w, then w-dependent Darwin interactions
13
of the charge 1 with electrons and ions in the wire cancel each other. So,
the force acting on the charge 1 does not depend on the wires velocity and
we can assume that the wire remains stationary and that only electrons in
the wire are moving with velocity v
2
. Electrons in the wire participate in
two kinds of movements: thermal and drift movements. The velocities of
the thermal movement are rather high, but their orientations are distributed
randomly. The drift velocity is directed along the applied voltage and its
magnitude is very small (mm/sec).
Let us rst see the eect of thermally agitated electrons
14
on the external
charge 1. In this case we can omit terms that do not depend on v
2
in
equation (14.9), because these terms are canceled by forces from positively
charged lattice ions. We can also neglect terms having linear dependence on
v
2
, because they average out to zero due to isotropy of the thermal movement.
So, the remaining force acting on the charge 1 due to the thermal chaotic
movement of electrons 2 is proportional to v
2
2
f
1
=
q
1
q
2
v
2
2
r
8c
2
r
3
3q
1
q
2
(v
2
r)
2
r
8c
2
r
5
(14.15)
Now consider a small piece of conductor located in the origin and the charge
1 at the point (0, 0, R) on the z-axis (see Fig. 14.1). Our goal is to show that
the total force
15
acting on the charge 1 is zero. To prove this fact it is sucient
to show that expression (14.15) yields zero when averaged over directions of
v
2
with the absolute value v
2
kept xed. The x- and y-components of this
average are zero by symmetry and the z-component is given by the integral
on the surface of a sphere of radius v
2
16
13
the second line in equation (14.3)
14
They are marked by the index 2 in this derivation.
15
or the average of forces (14.15) over dierent values of v
2
16
Here we use spherical coordinates with angles [0, 2) and [0, ], so that
(v
2
r) = v
2
Rcos .
14.1. HAMILTONIAN FORMULATION 477
(0,0,R)
xx
yy
zz
vv
22
Figure 14.1: Interaction of a charge 1 at (0, 0, R) with a piece of conductor
placed in the origin. It is assumed that conductors electrons have thermal
velocities with absolute value v
2
and random orientations.
I
z
=
q
1
q
2
8
_
0
sin d
2
_
0
d
_
v
2
2
R
2
3v
2
2
cos
2
R
2
_
=
q
1
q
2
8
_
_
4v
2
2
R
2
+
6v
2
2
R
2
1
_
1
t
2
dt
_
_
= 0
By similar arguments one can show that the reciprocal force exerted by the
charge on the conductor without current vanishes as well. So, we conclude
that the thermal movement of electrons can be ignored in conductor-charge
calculations.
Let us now consider a charge 1 and an innite straight wire with a non-
zero drift velocity of electrons v
2
in the geometry shown in Fig. 14.2. The
linear density of conduction electrons in the wire is
2
. First we would like
to calculate the force acting on the charge 1 from a small portion dr
2z
of the
wire. Then we use formula (14.9), keeping only terms dependent on v
2
df
1
= q
1
2
dr
2z
_
(v
1
v
2
)r
4c
2
r
3
+
v
2
2
r
8c
2
r
3
+
(v
1
r)v
2
4c
2
r
3
3(v
2
r)
2
r
8c
2
r
5
_
= q
1
2
dr
2z
_
[v
1
[v
2
r]]
4c
2
r
3
+
v
2
2
r
8c
2
r
3
3(v
2
r)
2
r
8c
2
r
5
_
(14.16)
478 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
xx
yy
zz
(R,0,0)
vv
22
rr
22
Figure 14.2: Interaction of a charge at (R, 0, 0) with an innite straight
vertical wire with current.
The full force is obtained by integrating (14.16) on the full length of the
wire. Let us rst show that the integral of the 2nd and 3rd term vanishes.
The y- and z-components of this integral are zero due to symmetry. For the
x-component we obtain
I
x
=
q
1
2
v
2
2
8m
1
c
2
dr
2z
_
R
(R
2
+ r
2
2z
)
3/2
3r
2
2z
R
(R
2
+ r
2
2z
)
5/2
_
= 0
This result means, in particular, that a neutral superconducting (=zero re-
sistance) wire with current does not create v
2
2
-dependent electrostatic poten-
tial in the surrounding space. In other words, a straight wire with current
does not act on charges at rest. The observation of such a potential was
erroneously reported in [EKL76]. Subsequent more accurate measurements
[LEK92, SSS
+
02] did not conrm that report.
So, the full force acting on the charge 1 is obtained by integration of the
rst term in (14.16) on the length of the wire
F
1
=
q
1
2
4c
2
dr
2z
[v
1
[v
2
r]]
r
3
(14.17)
In this expression one easily recognizes the Biot-Savart force law of the tra-
ditional Maxwells theory. This means that all results of Maxwells theory
14.1. HAMILTONIAN FORMULATION 479
xx
yy
zz
00
aa
uu
rr
11
pp
11
qq
11
22
Figure 14.3: Interaction between current loop and charge. The charge q
1
is
located at a general point in space r
1
= (r
1x
, r
1y
, r
1z
) and has an arbitrary
momentum p
1
= (p
1x
, p
1y
, p
1z
).
referring to magnetic properties of wires with currents remain valid in our
approach.
14.1.5 Charge and current loop
Let us use the Darwin Hamiltonian (14.4) to calculate the interaction en-
ergy between a neutral circular current-carrying wire of a small radius a
and a point charge in the geometry shown in Fig. 14.3. As we saw in the
preceding subsection, the movement of the wire as a whole does not have
any eect on its interaction with the charge. So, we will assume that the
current loop is xed in the origin. We need to take into account only the
velocity-dependent interaction between the charge 1 and negative charges of
conduction electrons having linear density
2
and drift velocity v
2
p
2
/m
2
,
whose tangential component is u, as shown in Fig. 14.3. Then the potential
energy of interaction between the charge 1 and the loop element dl is given
by the Darwins formula
480 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
V
dl2q1
q
1
2
dl
8m
1
c
2
_
(p
1
v
2
)
r
+
(p
1
r)(v
2
r)
r
3
_
In the coordinate system shown in Fig. 14.3 the line element in the loop
is dl = ad and v
2
= (u sin , u cos , 0). In the limit a 0 we can
approximate
1
r
1
[r
1
r
2
[
1
r
1
+
a(r
1x
cos + r
1y
sin )
r
3
1
(14.18)
1
r
3
1
[r
1
r
2
[
3
1
r
3
1
+
3a(r
1x
cos + r
1y
sin )
r
5
1
(14.19)
The full interaction between the charge and the loop is obtained by inte-
grating V
dl2q1
on from 0 to 2 and neglecting small terms proportional to
a
3
V
loop2q1
aq
1
2
8m
1
c
2
2
_
0
d
_
(up
1x
sin + up
1y
cos )
_
1
r
1
+
a(r
1x
cos + r
1y
sin )
r
3
1
_
+(ur
1x
sin + ur
1y
cos )((p
1
r
1
) p
1x
a cos p
1y
a sin )
_
1
r
3
1
+
3a(r
1x
cos + r
1y
sin )
r
5
1
_
_
a
2
uq
1
2
[r
1
p
1
]
z
4m
1
c
2
r
3
1
(14.20)
Taking into account the usual denition of the loops magnetic moment
17
as
a vector
2
whose length is
2
= a
2
2
u/c and whose direction is orthogonal
to the plane of the loop, we can generalize (14.20) for arbitrary position and
orientation of the loop
V
loop2q1
q
1
[
2
r] p
1
4m
1
cr
3
(14.21)
17
see equation (5.42) in [Jac99]
14.1. HAMILTONIAN FORMULATION 481
So, the full Hamiltonian for the system of charge 1 and current loop 2 is
H =
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
q
1
[
2
r] p
1
4m
1
cr
3
(14.22)
Now we can use this Hamiltonian to obtain the dynamics in the loop+charge
system. The time derivative of the particles momentum can be obtained
from the Hamiltons equation of motion (6.99)
dp
1
dt
=
H
r
1
=
q
1
[p
1
2
]
4m
1
cr
3
3q
1
([p
1
2
] r)r
4m
1
cr
5
The time derivative of the loops momentum follows from the momentum
conservation law (14.14)
dp
2
dt
=
dp
1
dt
The velocity of the charge 1 is obtained from the 2nd Hamiltons equation
of motion
v
1
=
H
p
1
=
p
1
m
1
p
2
1
p
1
2m
3
1
c
q
1
[
2
r]
4m
1
cr
3
(14.23)
Acceleration of this particle is obtained as a time derivative of (14.23)
18
a
1
dv
1
dt
p
1
m
1
q
1
[
2
r]
4m
1
cr
3
+
3q
1
[
2
r](r r)
4m
1
cr
5
18
Here we noticed that p
1
(v/c)
2
, therefore the time derivative of the second term
on the right hand side of (14.23) is (v/c)
3
, so it can be ignored in (14.24). We also
neglected the time derivative of the magnetic moment
2
= [
2
, H]
P
, because, due to
(15.14), the Poisson bracket [
2i
,
2j
]
P
= q
2
/(m
2
c)
ijk
2k
has an extra factor of c
in the denominator, which means that terms proportional to
2
are much smaller than
other terms in (14.24). Vector identities (D.17) and (D.18) were used in the derivation of
(14.24).
482 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
=
q
1
[p
1
2
]
2m
2
1
cr
3
3q
1
([
2
r] p
1
)r
4m
2
1
cr
5
+
3q
1
[
2
r](r p
1
)
4m
2
1
cr
5
+
q
1
[
2
p
2
]
4m
1
m
2
cr
3
3q
1
[
2
r](r p
2
)
4m
1
m
2
cr
5
=
q
1
[p
1
2
]
2m
2
1
cr
3
3q
1
[p
1
[r [
2
r]]]
4m
2
1
cr
5
_
d
dt
_
2
q
1
[
2
r]
4m
1
cr
3
=
q
1
[p
1
2
]
4m
2
1
cr
3
+
3q
1
[p
1
r](
2
r)
4m
2
1
cr
5
_
d
dt
_
2
q
1
[
2
r]
4m
1
cr
3
(14.24)
The notation (
d
dt
)
2
means the time derivative (of r) when only particle 2 (the
loop) is allowed to move. For example
_
d
dt
_
2
r = v
2
p
2
m
2
14.1.6 Charge and spins magnetic moment
Let us now consider the system of a spinless charged particle 1 and a spins
magnetic moment 2. The relevant Hamiltonian is obtained from (14.3) by
dropping terms depending on
1
and q
2
19
H =
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
+
q
1
[
2
r] p
2
8m
2
cr
3
q
1
[
2
r] p
1
4m
1
cr
3
(14.25)
As usual, we employ Hamiltons equations of motion to calculate the time
derivative of the momentum, the velocity and acceleration
dp
1
dt
= [p
1
, H]
P
=
H
r
1
=
q
1
[p
1
2
]
4m
1
cr
3
3q
1
([p
1
2
] r)r
4m
1
cr
5
q
1
[p
2
2
]
8m
2
cr
3
+
3q
1
([p
2
2
] r)r
8m
2
cr
5
19
Here we ignore the charge of the particle 2. Note that if the spins magnetic moment is
not moving (p
2
= 0) then the interaction energy of charge + moment (the last term in
(14.25)) is exactly the same as the interaction energy of charge + current loop (14.22).
For a moving spin the interaction energy has an additional term (the last term in (14.25))
which is absent in (14.22).
14.1. HAMILTONIAN FORMULATION 483
dp
2
dt
=
dp
1
dt
v
1
dr
1
dt
= [r
1
, H]
P
=
H
p
1
=
p
1
m
1
p
2
1
p
1
2m
3
1
c
q
1
[
2
r]
4m
1
cr
3
a
1
dv
1
dt
p
1
m
1
q
1
[
2
r]
4m
1
cr
3
+
3q
1
[
2
r](r r)
4m
1
cr
5
=
q
1
[p
1
2
]
2m
2
1
cr
3
3q
1
([p
1
2
] r)r
4m
2
1
cr
5
+
3q
1
[
2
r](r p
1
)
4m
2
1
cr
5
3q
1
[v
2
2
]
8m
1
cr
3
+
3q
1
([
2
r] v
2
)r
8m
1
cr
5
3q
1
[
2
r](r v
2
)
4m
1
cr
5
=
q
1
[p
1
2
]
2m
2
1
cr
3
3q
1
[p
1
[r [
2
r]]]
4m
2
1
cr
5
q
1
[v
2
2
]
8m
1
cr
3
+
3q
1
([
2
r] v
2
)r
8m
1
cr
5
+
q
1
[
2
v
2
]
4m
1
cr
3
3q
1
[
2
r](r v
2
)
4m
1
cr
5
=
q
1
[p
1
2
]
4m
2
1
cr
3
+
3q
1
[p
1
r](
2
r)
4m
2
1
cr
5
_
d
dt
_
2
q
1
[
2
r]
4m
1
cr
3
d
dr
1
q
1
([v
2
2
] r)
8m
1
cr
3
(14.26)
This means that acceleration of the charge 1 in the eld of the spins mag-
netic moment is basically the same as in the eld of a current loop (14.24).
The only dierence is the presence of an additional gradient term.
20
This
dierence will be discussed in subsections 14.3.1 - 14.3.3 in greater detail.
14.1.7 Two types of magnets
Let us now consider the system moving charge 1 + magnetic moment at rest
2. As we discussed above, the magnetic moment can be produced either by
a spinning particle or by a small current loop. The Hamiltonian
21
is obtained
either from (14.22) or from (14.25) by setting p
2
= 0
H =
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
q
1
[r p
1
]
2
4m
1
cr
3
(14.27)
20
the last term on the right hand side of (14.26)
21
which is the same in both cases
484 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
The force acting on the charge 1 is given by formula (14.24)
f
1
= m
1
a
1
=
q
1
[p
1
2
]
4m
1
cr
3
+
3q
1
[p
1
r](
2
r)
4m
1
cr
5
q
1
c
[v
1
b
1
] (14.28)
which is the standard denition of the magnetic part of the Lorentz force if
another standard expression
b
1
=
2
4r
3
+
3(
2
r)r
4r
5
(14.29)
is used for the magnetic eld of the magnetic moment
2
at point r
1
.
22
There are, however, important dierences between our formulas and the
standard approach. First, in the usual Lorentz force equation
23
the force
is identied with the time derivative of momentum (14.11). In our case,
the force is mass times acceleration. Second, in our approach, there are
no elds (electric or magnetic) having independent existence at each space
point. There are only direct inter-particle forces. This is why we put mag-
netic eld in quotes.
For comparison with experiment it is not sucient to discuss point mag-
netic moments. We need to apply the above results to macroscopic magnets
as well. It is important to mention that there are two origins of magnetization
in materials. The rst origin is due to the orbital motion of electrons. The
second one is due to spin magnetic moments of electrons
24
. In permanent
magnets both components play roles. The relative strength of the orbital
and spin magnetizations varies among dierent types of magnetic mate-
rials. However, in most cases the dominant contribution is due to electron
spins [RF69]. The full magnetization can be described by summing up total
magnetic moments over all atoms in the body, and the full magnetic eld of
the macroscopic magnet is obtained by adding up contributions like (14.29).
The above discussion referred to permanent bar magnets. However, there
is an alternative way to produce magnetic eld by means of electromag-
nets solenoids with current. In solenoids only the orbital component of
22
See equation (5.56) in [Jac99].
23
See, e.g., equation (11.124) in [Jac99].
24
the contribution from nuclear spins is much weaker
14.1. HAMILTONIAN FORMULATION 485
xx
yy
zz
uu
22
Figure 14.4: A thin solenoid can be represented as a stack of small current
loops. The magnetization vector
2
is directed along the solenoids axis.
C
11
C
3
C
22
I
II
I
II
Figure 14.5: A wire coil (black thick line) with current I can be represented as
a superposition of innitesimally small wire loops C
1
, C
2
, C
3
, . . . (grey lines)
with the same current I. All (imaginary) inside currents cancel each other,
so that only the (real) peripheral current remains.
486 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
magnetization (due to electrons moving in wires) is present. For example,
a straight thin solenoid of nite length can be represented as a collection
of small current loops
25
stacked on top of each other (see Fig. 14.4). The
magnetic eld of such a stack can be obtained by integrating (14.29) along
the length of the stack.
This result can be also generalized for macroscopic solenoids with non-
vanishing cross-sections. It is easy to see that each current-carrying coil
in such a solenoid can be represented as a superposition of innitely small
loops (see Fig. 14.5). Then a macroscopic thick cylindrical solenoid can be
represented as a set of parallel thin solenoids joined together.
14.2 Experiments and paradoxes
In this section we will discuss a number of real or thought electromagnetic
experiments, whose description in classical Maxwells electrodynamics is in-
adequate or paradoxical. We will also consider these experiments from the
point of view of the RQD direct interaction approach developed in the preced-
ing section. Our goal is to demonstrate that in all cases the RQD description
is more logical and consistent.
14.2.1 Conservation laws in Maxwells theory
One important class of diculties characteristic to Maxwells electrodynam-
ics is related to the apparent non-conservation of total observables (energy,
momentum, angular momentum, etc.) in systems of interacting charges.
Indeed, in the theory based on Maxwells equations there is no guarantee
that total observables are conserved and that the total energy and mo-
mentum form a 4-vector quantity. Suggested solutions of these paradoxes
[But69, Roh60, Fur69, APV88, Com96, Com00, Kho05, Hni04, SG03, Teu96,
Jac04, McD06, KY07a, KY07b, KY08, TY13] involved such ad hoc construc-
tions as hidden momentum, the energy and momentum of electromagnetic
elds, Poincare stresses, etc.
In order to discuss conservation laws in Maxwells theory we rst consider
a system of two charges 1 and 2, which are free to move without inuence of
external forces, i.e., they form an isolated system. By applying the standard
Biot-Savart force law
25
see subsection 14.1.5
14.2. EXPERIMENTS AND PARADOXES 487
f
1
=
q
1
q
2
4c
2
[v
1
[v
2
r]]
r
3
(14.30)
f
2
=
q
1
q
2
4c
2
[v
2
[v
1
r]]
r
3
(14.31)
and the traditional force denition f = dp/dt it is easy to see that the
Newtons third law (f
1
= f
2
) is not satised for most geometries [How44].
As we discussed in subsection 14.1.3, this means that the total momentum
of particles P
p
is not conserved. The usual explanation [Kel42, PN45] of
this paradox is that the two charges alone do not constitute a closed physical
system. In order to restore the momentum conservation one needs to take into
account the momentum contained in the electromagnetic eld surrounding
two charges.
According to Maxwells theory, electric and magnetic elds E(r), B(r)
generally have non-zero momentum and energy given by integrals over entire
space
P
f
=
1
4c
_
dr[E(r) B(r)] (14.32)
H
f
=
1
8
_
dr(E
2
(r) + B
2
(r)) (14.33)
So, in the standard explanation, the idea is that the total momentum of
particles + elds (P
p
+P
f
) is conserved in all circumstances.
From the point of view of RQD, it is understandable when Maxwells
theory associates momentum and energy with transverse time-varying elec-
tromagnetic elds in free space. As we will discuss in subsection 14.6.2,
these elds can be accepted as rough models of electromagnetic radiation.
For free propagating elds, equations (14.32) and (14.33) are supposed to be
equivalent to the sums of momenta and energies of photons, respectively.
However, Maxwells theory goes even farther and claims that bound
26
electromagnetic elds surrounding charges or magnets also have non-zero
momentum and energy. If this were true then one could easily imagine sta-
tionary systems (e.g., a charged magnet) where nothing is moving and where
26
stationary, non-radiating
488 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
elds E, B would possess a non-zero momentum.
27
This electromagnetic
eld energy idea does not seem attractive for a couple of reasons.
First, the electromagnetic energy integral (14.33) for the electric eld
E = qr/(4r
3
) associated with a stationary point charge (e.g., an electron) is
innite.
28
To avoid this diculty, various classical models of the electron
were suggested, the simplest of which is a charged sphere of a small but nite
radius. However, these models led to other problems. One of them is the
famous 4/3 paradox: It can be shown that the momentum of the electro-
magnetic eld associated with a nite-radius electron does not form a 4-vector
quantity together with its electromagnetic energy [Roh60, But69, Com97].
This violation of relativistic invariance can be xed if one introduces an
extra factor of 4/3 in the formula for the eld momentum. To justify this
extra factor the ad hoc idea of Poincare stresses is sometimes introduced.
29
On the other hand, if one adopts the RQD no-elds approach, then
only observable and nite momenta and energies of particles should be taken
into account and correct relativistic transformation laws of these quantities
hold exactly without any ad hoc assumptions. In relativistic Hamiltonian
dynamics (which is the basis of our RQD approach to electrodynamics) the
conservation laws and transformation properties of observables are direct
consequences of the Poincare group structure. The Poisson bracket of any
observable F with the Hamiltonian H determines the time evolution of this
observable
dF(t)
dt
= [F, H]
P
Then the conservation of observables H, P and J follows automatically from
their vanishing Poisson brackets with H.
30
Similarly, boost transformations
of F can be obtained as solutions of (3.65)
27
See, e.g., subsection 14.4.3. The angular momentum of static electromagnetic elds
in Maxwells theory was discussed in [Rom66].
28
See also [Fra07] for discussion of other diculties related to the idea of energy and mo-
mentum contained in the electromagnetic eld. An interesting critical review of Maxwells
electrodynamics and Minkowski space-time picture can be found in section 1 of [GZL].
29
see sections 16.4 - 16.6 in [Jac99]
30
For example, the explicit conservation of the total momentum P in our theory guar-
antees the resolution of paradoxes 6 and 7 in [Kho].
14.2. EXPERIMENTS AND PARADOXES 489
dF(
)
d
= c[F, K]
P
In the case of total momentum-energy (P, H), the commutators (3.57) -
(3.58) hold independent on the strength of interaction. Then the 4-vector
transformation formulas (4.3) - (4.4) follow.
14.2.2 Kislev-Vaidman paradox
In both Maxwells theory and in RQD, the Darwin Hamiltonian (14.4) is a
(v/c)
2
approximation to the full interaction of two charges. However, there is
a signicant dierence between these two approaches. In Maxwells theory it
is assumed that the direct-interaction form of the Darwin potential is only an
approximation: in higher (v/c) orders one would get retarded (propagating
with the speed of light) Lienard-Wiechert potentials between charges. On
the other hand, in RQD the interaction remains instantaneous in all (v/c)
orders.
31
There is a remarkable paradox [KV02] associated with the assumption
of retarded interactions in standard Maxwells electrodynamics.
32
Consider
two particles 1 and 2 both having the unit charge. Let us assume that
their electromagnetic interaction is transmitted by retarded potentials and
that the movement of both particles is conned on the x axis. Let us now
force the two particles to move along certain prescribed paths plotted in Fig.
14.6 by full thick lines. Initially (at times t < 0) both particles are kept at
rest with the distance L between them. The Coulomb interaction energy is
1/(4L). At time t = 0 we apply external force which displaces particle 1 by
the distance d < L toward the particle 2. The work performed by this force
will be denoted W
1
4W
1
=
1
L d
1
L
Then we wait
33
until time t
2
and move both particles simultaneously by the
distance d/2 away from each other. If we make this move rapidly during a
31
see subsection 14.5.2
32
A number of related paradoxes were discussed also in [Eng03, Kho05, Kho06].
33
The displacement of the charge 1 and its acceleration results in emission of electro-
490 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
ct
xx
00
LL
dd
Ld
d/2
LL
11
22
aa
tt
11
tt
22
tt
33
11
22
d/2
Figure 14.6: Movements of two charged particles in the Kislev-Vaidman
paradox plotted in the t x plane. The time on the horizontal axis is mul-
tiplied by c, so that photon trajectories (dashed arrows) are at 45
angles.
short time interval (t
3
t
2
) < (Ld)/c, then the retarded eld of the particle
2 in the vicinity of the particle 1 remains unperturbed as if the particle 2 has
not been moved at all. The same is true for the eld of the particle 1 in the
vicinity of the particle 2. Therefore the work performed by such a move is
4W
2
= 2
_
1
L d/2
1
L d
_
The total work performed in these two steps is nonzero
4(W
1
+ W
2
)
=
1
L d
1
L
+
2
L d/2
2
L d
magnetic radiation (indicated by dashed arrows in Fig. 14.6). So, we would need to wait
for a suciently long time until the emitted photons propagated far enough, so that they
do not have any eect on our two-charge system anymore.
14.2. EXPERIMENTS AND PARADOXES 491
1
L(1 d/L)
1
L
+
2
2(1 d/(2L))
2
L(1 d/L)
1
L
_
1 +
d
L
+
d
2
L
2
_
1
L
+
2
L
_
1 +
d
2L
+
d
2
4L
2
_
2
L
_
1 +
d
L
+
d
2
L
2
_
=
1
L
+
d
L
2
+
d
2
L
3
1
L
+
2
L
+
d
L
2
+
d
2
2L
3
2
L
2d
L
2
2d
2
L
3
=
d
2
2L
3
(14.34)
This means that after the shifts are completed we nd both charges in the
same conguration as before (at rest and separated by the distance L), how-
ever we gained some amount of energy (14.34). Of course, the balance of
energy (14.34) is not complete. It does not include the energy of photons
emitted by accelerated charges.
34
However, one could, in principle, recap-
ture this emitted energy by surrounding the pair of particles by appropriate
photon absorbers and redirect the captured energy to perform the work of
moving the charges again. Then, it would become possible to build a per-
petuum mobile machine in which the two steps described above are repeated
indenitely and each time the energy (14.34) is gained.
The following explanation of this paradox was suggested by Kislev and
Vaidman [KV02]: They claim that there is another energy term missed in the
above analysis which is related to the interference of electromagnetic waves
emitted by the two particles
35
and which restores the energy balance. This
explanation does not look plausible, because there is actually no interaction
energy associated with the interference of light waves: The interference re-
sults in a redistribution of the wave amplitude (formation of minima and
maxima) and its local energy in space, while the total energy of the waves
remains unchanged [Gau03]. In other words, there is no interaction between
photons.
36
The true explanation of the Kislev-Vaidman paradox is provided by
the Darwin-Breit instantaneous action-at-a-distance theory. In the absence
of retardation of the Coulomb potential, it is easy to show that W
1
+W
2
= 0,
34
According to Larmors formula, the energy of emitted photons is proportional to the
square of acceleration of the charges. See subsection 13.3.3.
35
For example, in Fig. 14.6 electromagnetic waves emitted by the two charges meet at
point a and their interference proceeds from that time on.
36
QED predicts a very weak photon-photon interaction in the 4th perturbation order,
however it is negligibly small in the situation considered here.
492 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
and the total work performed by moving the charges is equal to the energy
of the emitted radiation.
14.2.3 Trouton-Noble paradox
In RQD the total angular momentum J of any isolated system of interacting
particles is conserved. In other words, there can be no torque
37
in any isolated
system of charges. This follows directly from the following Poisson bracket
in the Poincare Lie algebra
dJ
dt
= [J, H]
P
= 0
This result should hold in any inertial frame of reference. For example, in a
moving frame dynamical variables are
38
J(
) = e
ic
Je
ic
H(
) = e
ic
He
ic
)
dt
= [J(
), H(
)]
P
= [e
ic
Je
ic
, e
ic
He
ic
]
P
= e
ic
[J, H]
P
e
ic
= 0
Maxwells classical electrodynamics cannot make such a clear statement
about the conservation of the total angular momentum and the absence of
torque in all frames. This failure is in the center of the Trouton-Noble
paradox which haunted Maxwells theory for more than a century [TN04,
PN45, Fur69, SG03, Jac04, But69, Teu96, Jef99b].
Imagine two charges moving with the same velocity vector v, which makes
an angle
40
with the vector r = r
1
r
2
connecting positions of the charges
37
The torque is dened here as the time derivative of the total angular momentum of
the system.
38
in quantum notation
39
Here t
is time measured by the moving observer. Note that this result is valid only if
K is the full interaction-dependent boost (N.27) - (N.28).
40
that is dierent from 0 and 90
2
] r)
8cr
3
_
d
dt
_
2
q
1
[
2
r]
4cr
3
(14.35)
If the magnetic moment is created by a small current loop, then we should
use (14.24)
f
orb
1
=
_
d
dt
_
2
q
1
[
2
r]
4cr
3
(14.36)
In other words, the force produced by a moving spin has two components,
the rst of which is conservative and the second is non-conservative
45
f
spin
1
= f
cons
1
+f
noncons
1
(14.37)
The force produced by the current loop has only a non-conservative compo-
nent
f
orb
1
= f
noncons
1
(14.38)
Let us rst focus on the non-conservative force component f
noncons
1
, which
is common for both spin and orbital magnetic moments. We will return to
the conservative force component in subsection 14.3.3.
For macroscopic magnets the innitesimal quantities considered thus far
should be integrated on the magnets volume V , e.g., the full non-conservative
force exerted by a macroscopic magnet on the charge at rest q
1
is
F
noncons
1
=
_
d
dt
_
2
_
V
q
1
[
2
r]
4cr
3
dr
2
(14.39)
This means that the magnet,
46
moving near a wire loop L, induces a current
in the loop as shown in Fig. 14.8.
45
The force is dened as conservative if it can be represented as a gradient of a scalar
function (an example is given by the rst term on the right hand side of (14.35)). Otherwise
the force is called non-conservative. The integral of a conservative force vector along any
closed loop is zero. Therefore, conservative forces on electrons cannot be detected by
measuring a current in a closed circuit.
46
either a permanent magnet or a solenoid with current
14.3. ELECTROMAGNETIC INDUCTION 497
Let us now show that this prediction agrees quantitatively with Maxwells
electrodynamics. We denote by symbol e
1
the force with which a microscopic
magnetic moment acts on a unit charge
47
e
1
f
noncons
1
/q
1
If we take curl of this quantity, we obtain
_
r
1
e
1
_
=
1
4c
_
d
dt
_
2
_
r
1
[
2
r]
r
3
_
=
1
4c
_
d
dt
_
2
_
2
r
3
+
3(
2
r)r
r
5
_
=
1
c
_
d
dt
_
2
b
1
(14.40)
where b
1
is the magnetic eld (14.29) of the magnetic moment. After
integrating both sides of equation (14.40) on the magnets volume we obtain
exactly Maxwells equation
_
r
1
E
1
_
=
1
c
_
d
dt
_
2
B
1
which expresses the Faradays law of induction.
It is important to stress that the origin of electromagnetic induction pro-
posed in our work is fundamentally dierent from that adopted in Maxwells
theory. The traditional explanation is that electromagnetic induction results
from inter-dependence of time-varying electric and magnetic elds. In our
approach, this is the consequence of velocity-dependent interactions between
magnetic dipoles and charges.
14.3.2 Homopolar induction: non-conservative forces
One interesting application of the electromagnetic induction law is the ho-
mopolar generator shown in Fig. 14.9. It consists of a conducting disk C and
47
In Maxwells electrodynamics this is the denition of the electric eld e
1
.
498 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
Amp
N
NN
N
NNN NNN
NN
N
NN
S
SS
S
SSS SSS
S S
S S
S S
Amp
NN
N
NN
N NNN NN
N
NN
N
S S
S S
S S
S S S S S S S S
S S
S S
S
AA
B
CC
A
BB
C
MM
M
(a)
(b)
Figure 14.9: Homopolar generator. (a) the conducting disk C rotates; (b)
the magnet M rotates.
a cylindrical magnet M. Both the conducting disk and the magnet are rigidly
attached to their own shafts and both can independently rotate about their
common axis. The magnetization vector
2
of each small volume element
of the magnet is directed along the axis, so the total magnets moment is
time-independent for both stationary and rotating magnets. The shaft AB
is conducting. Points A and C are connected to sliding contacts (shown by
arrows), and the circuit is closed through the galvanometer.
There are two modes of operation of this device. In the rst mode (see Fig.
14.9(a)) the magnet is stationary while the conducting disk rotates about its
axis. The galvanometer detects a current in the circuit. This has a simple
explanation: The force acting on electrons in the metal can be obtained by
integrating formula (14.28),
48
on the magnets volume V
F
1
=
_
V
q
1
c
[v
1
b
1
]dr
2
(14.41)
The full electromotive force in the circuit is obtained by integrating expres-
sion (14.41) along the closed contour A B C galvanometer A.
48
As we saw in subsection 14.1.7, this formula is applicable to both orbital and spin
magnets at rest.
14.3. ELECTROMAGNETIC INDUCTION 499
The velocity v
1
is non-zero only on the segment B C,
49
where the force
F
1
is directed radially. The integral is non-zero and the galvanometer must
show a non-vanishing current in agreement with experiments.
In the second operation mode (see Fig. 14.9(b)) the disk C is xed
and the magnet rotates. It was established by careful experiments [Gup63,
The62] that there is no current in this case. If both the magnet and the
disk rotate, then the current is the same as in the rst mode, i.e., with
a xed magnet. This means that rotation of the magnet has no eect on
the produced current. This experimental result looked somewhat surprising,
because from the principle of relativity one could expect that the physical
outcome (the current) should depend only on the relative movement of the
magnet and the disk. However, this conclusion is incorrect, because the
principle of relativity is applicable only to inertial movements. It cannot be
applied to rotational movements without contradictions.
Let us now analyze the rotating magnet case shown in Fig. 14.9(b) from
the point of view of the Darwin-Breit electromagnetic theory. We need to
know the integral of the force acting on electrons along the closed circuit
A B C galvanometer A. The conservative portion of the force
f
cons
1
does not contribute to this integral. Since here we have a cylindrical
magnet rotating about its axis of magnetization, the volume integral in the
expression (14.39) for the non-conservative force is time-independent and the
total non-conservative force acting on electrons is zero. This agrees with the
observed absence of the current.
14.3.3 Homopolar induction: conservative forces
So far in our discussion of homopolar induction we considered only non-
conservative forces, mainly because they can be detected rather easily by
measuring induced currents in closed circuits. In the beginning of the 20th
century Barnett and Kennard performed experiments [Bar12, Ken17]
50
with
the specic purpose to detect the conservative part of forces from moving
magnets. Barnetts experimental setup (shown schematically in Fig. 14.10)
resembled the homopolar generator discussed above. Its main parts were a
cylindrical solenoid S with current and two conducting cylinders C
1
and C
2
placed inside the solenoid. All three cylinders shared the same rotation axis
49
Velocities of electrons in the rotating conductor are shown by small arrows in Fig.
14.9(a).
50
see also [Kho03]
500 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
SS
CC
11
CC
22
WW
zz
Figure 14.10: The Barnetts experiment.
z. Conductors C
1
and C
2
formed a cylindrical capacitor. Initially they were
connected by a conducting wire W. Note that in contrast to the homopolar
generator experiment, where a current in a closed circuit was measured, the
system C
1
W C
2
in the Barnetts setup did not form a closed circuit.
So, the capacitor would obtain a non-zero charge even if the force acting on
electrons in W was conservative.
Similar to the homopolar generator discussed above, this apparatus could
operate in two dierent modes. In the rst mode the cylindrical capacitor
spun about its axis. Due to the presence of the magnetic eld inside S a
current ran through the wire W and the capacitor C
1
C
2
became charged.
Then the wire was disconnected, capacitors rotation stopped and the ca-
pacitors charge measured. As expected, the measured charge was consistent
with the standard Lorentz force formula (14.41).
In the second operation mode, the capacitor was xed while the solenoid
rotated about its axis. No charge on the capacitor was registered in this
case. This result is consistent with our theory, because, just as in the case
of homopolar generator, the non-conservative force (14.39) vanishes due to
the cylindrical symmetry of the setup and the conservative force is absent
in the case of moving solenoid (14.38). So, the null result of the Barnetts
experiment conrms our earlier conclusion that moving solenoids with current
do not exert conservative forces on nearby charges.
A dierent result is expected in the case of rotating permanent magnet.
In this case the conservative force component is non-zero. It can be obtained
by integrating the rst term on the right hand side of (14.35) on the volume
V of the magnet
14.3. ELECTROMAGNETIC INDUCTION 501
MM
zz
SS
11
SS
22
dd
BB
Figure 14.11: Schematic of the Wilson-Wilson experiment.
F
cons
1
=
d
dr
1
_
V
q
1
([v
2
2
] r)
8cr
3
dr
2
(14.42)
So, rotating cylindrical permanent magnet should induce a non-zero charge
in a stationary capacitor. A relevant experiment was performed in 1913 by
Wilson and Wilson [WW13]. This experiment was repeated again in 2001
with improved accuracy [HBH
+
01]. For theoretical discussion of the Wilson-
Wilson experiment from the point of view of Maxwells electrodynamics see
[McDc, PS95a].
Schematic representation of the Wilson-Wilson experiment is shown in
Fig. 14.11. A hollow cylinder M made of magnetic dielectric (non-conducting
material) was placed in a constant magnetic eld B parallel to the axis z.
The inner and outer surfaces of the cylinder (S
1
and S
2
, respectively) were
covered by metal and the electrostatic potential between the two surfaces
was measured. When the cylinder was at rest, no potential was recorded,
as expected. However, when the cylinder was rotated a non-zero potential
dierence was observed. This potential is a result of electric dipoles d created
in the bulk of the magnet. There are two physical mechanisms for the ap-
pearance of these dipoles. First, molecules of the dielectric material moving
in the magnetic eld B get polarized (the Lorentz forces act in opposite di-
rections on positive and negative charges in the molecules). Second, moving
induced magnetic moments create electric elds similar to the eld of
an electric dipole. Indeed, if we compare expression (14.42) with the force
exerted on the charge q
1
by an electric dipole d
502 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
f
dipole
1
=
r
1
q
1
(d r)
4r
3
we see that a magnetic moment moving with velocity v acquires an electric
dipole of the magnitude
51
d =
[v ]
2c
(14.43)
The Wilson-Wilson experiment clearly demonstrated that both kinds of dipole
moments (those due to dielectric polarization and those due to moving dipole
moments) are present in the rotating magnet. This conrms qualitatively
our conclusion about the presence of conservative forces (14.42) near moving
permanent magnets. A quantitative description of this experiment would re-
quire calculation of the polarization and magnetization of bodies moving in
an external magnetic eld. This is beyond the scope of the theory developed
here.
14.4 Aharonov-Bohm eect
The central idea of our approach to classical electrodynamics is the rejection
of electric and magnetic elds. This also means that we reject the notion
of electromagnetic potentials A
dz
_
(0, 0,
2
)
4(x
2
1
+ y
2
1
+ z
2
)
3/2
3
2
z(x
1
, y
1
, z)
4(x
2
1
+ y
2
1
+ z
2
)
5/2
_
= 0 (14.44)
A solenoid with arbitrary cross-section can be represented as a bunch of
thin solenoids.
54
If the observation point r
1
is outside the solenoids volume,
then equation (14.44) holds for each thin segment, and the total magnetic
eld at point r
1
also vanishes. The same analysis applies to innitely long
bar magnets of arbitrary cross-section. Thus we conclude that the force
acting on a moving charge outside innitely long magnet (either permanent
magnet or solenoid with current) is zero. This conclusion agrees with calcu-
lations based on Maxwells equations. See, for example, Problem 5.2(a) in
[Jac99].
However, the vanishing force does not mean that the potential energy of
the charge-solenoid interaction is zero as well. In the case of a thin solenoid,
the potential energy can be found by integrating the last term in equation
(14.27) along the length of the solenoid and noticing that the mixed product
([
2
v
1
] r
1
) is independent of z. Denoting r (x
2
1
+ y
2
1
)
1/2
the particle-
solenoid distance, we obtain
52
i.e., solenoids points have coordinates r
2
= (0, 0, z)
53
Here we integrate equation (14.29) on the (innite) length of the solenoid. This time
2
should be understood as magnetization per unit length of the solenoid.
54
see Fig. 14.5
504 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
V
long
=
dz
q
1
([
2
v
1
] r
1
)
4c(x
2
1
+ y
2
1
+ z
2
)
3/2
=
q
1
([
2
v
1
] r
1
)
2cr
2
(14.45)
The acceleration of the moving charge is found, as usual, by application of
the Hamiltons equations of motion
dp
1
dt
= [p
1
, H]
P
=
V
long
r
1
=
q
1
[p
1
2
]
2m
1
cr
2
+
q
1
([
2
p
1
] r
1
)r
1
m
1
cr
4
dr
1
dt
= [r
1
, H]
P
=
H
p
1
=
p
1
m
1
p
2
1
p
1
2m
3
1
c
+
q
1
[r
1
2
]
2m
1
cr
2
d
2
r
1
dt
2
p
1
m
1
+
q
1
[p
1
2
]r
2
1
2m
2
1
cr
4
q
1
[r
1
2
](r
1
p
1
)
m
2
1
cr
4
=
q
1
m
2
1
cr
4
([p
1
2
]r
2
1
([p
1
2
] r
1
)r
1
[r
1
2
](r
1
p
1
))
=
q
1
m
2
1
cr
4
([r
1
[r
1
[p
1
2
]]] [r
1
2
](r
1
p
1
))
=
q
1
m
2
1
cr
4
([r
1
p
1
](r
1
2
) + [r
1
2
](r
1
p
1
) [r
1
2
](r
1
p
1
))
= 0 (14.46)
where we took into account that (r
1
2
) = 0. This agrees with the van-
ishing magnetic eld found earlier and presents a curious example of a
non-vanishing potential, which does not produce any force on charges. Ex-
perimental manifestations of such potentials will be discussed in this section.
14.4.2 Aharonov-Bohm experiment
In the preceding subsection we concluded that charges do not experience
any force (acceleration) when they move in the vicinity of a straight innite
magnetized solenoid or a permanent bar magnet. However, the absence of
force does not mean that charges do not feel the presence of the magnet.
In spite of zero magnetic (and electric) eld, innite solenoids/rods have
a non-zero eect on particle wave functions and their interference. This
eect was rst predicted by Aharonov and Bohm [AB59] and later conrmed
in experiments [Cha60, TOM
+
86, OMK
+
86]. The explanation proposed by
14.4. AHARONOV-BOHM EFFECT 505
xx
yy
zz
22
RR
00
vv
11
R
AA
BB
AA
11
AA
22
BB
22
BB
11
Figure 14.12: The Aharonov-Bohm experiment. The vertical innite thin
magnetized rod with linear magnetization density
2
is shown by grey arrows.
Aharonov and Bohm was based on electromagnetic potentials in the multiply-
connected topology of space induced by the presence of the solenoid. Here
we suggest a dierent explanation [Stea].
Let us consider the idealized version of the Aharonov-Bohm experiment
shown in Fig. 14.12: An innite solenoid or ferromagnetic ber with negli-
gible cross-section and linear magnetization density
2
is erected vertically
in the origin. The electron wave packet is split into two parts (e.g., by
using a double-slit) at point A. The subpackets travel on both sides of
the solenoid/bar with constant velocity v
1
and the distance of the closest
approach is R. The subpackets rejoin at point B, where the interference
is measured. (The two trajectories AA
1
B
1
B and AA
2
B
2
B are denoted by
dashed lines.) The distance AB is suciently large, so that the two paths
can be regarded as parallel to the y-axis everywhere
r
1
(t) = (R, v
1y
t, 0) (14.47)
Experimentally it was found that the interference of the two wave packets at
point B depends on the magnetization of the solenoid/rod, in spite of zero
force acting on the electrons.
506 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
There exist attempts to explain the Aharonov-Bohm eect as a result of
classical electromagnetic force that creates a time lag between wave packets
moving on dierent sides of the solenoid [RGR91, Boy06, Boy05, Boy07a,
Boy07b]. However, this approach seems to be in contradiction with recent
measurements, which failed to detect such a time lag [CBB07]. Moreover,
our result (14.46) clearly shows that velocities of charges remain constant in
the eld of the solenoid, thus refuting the time lag idea. Several other non-
conventional explanations of the Aharonov-Bohm eect were also suggested
in the literature [SC92, Wes98, Pin04, HN08], but they will not be discussed
here.
To estimate the solenoids eect on the interference, we need to turn to
the quasiclassical representation of particle dynamics from subsection 6.5.6.
We have established there that the center of the wave packet is moving in
accordance with Heisenbergs equations of motion. In our case, no force
is acting on the electrons, so their trajectories (14.47) are independent on
magnetization. We also established in 6.5.6 that the overall phase factor of
the wave packet changes in time as exp(
i
)
2
V
long
(t
)
_
dt
(14.48)
and V
long
(t) is the time dependence of the potential (14.45) experienced by
the electron. In the Aharonov-Bohm experiment the electrons wave packet
separates into two subpackets that travel along dierent paths AA
1
B
1
B and
AA
2
B
2
B. Therefore, the phase factors accumulated by the two subpackets
are generally dierent and the interference of the left and right wave
packets at point B will depend on this phase dierence
=
1
(
left
right
)
Let us now calculate the relative phase dierence in the geometry of
Fig. 14.12. The kinetic energy term in (14.48) does not contribute, because
velocity remains constant and equal for both paths. However, the potential
energy of the charge 1 is dierent for the two paths. For all points on the
14.4. AHARONOV-BOHM EFFECT 507
right path the numerator of the expression (14.45) is q
1
2
v
1y
R and for
the left path the numerator is q
1
2
v
1y
R. Then the total phase dierence
=
1
q
1
2
Rv
1y
c(R
2
+ v
2
1
t
2
)
dt =
e
2
c
(14.49)
does not depend on the electrons velocity and on the value of R. This
phase dierence is proportional to the solenoids magnetization
2
. So, all
essential properties of the Aharonov-Bohm eect are fully reproduced within
our approach.
55
It is interesting that the presence of the phase dierence is not specic to
line magnets of innite length. This eect was also seen in experiments with
short magnetized nanowires [MIB03]. This observation presents a challenge
for the traditional explanation, which must apply one logic (electromagnetic
potential in the space with multiple-connected topology) for innitely long
magnets and another logic (the presence of the magnetic eld) for nite
magnets. Our description of the Aharonov-Bohm eect is more economical,
as it applies the same logic independent on whether the magnet is innite
or nite.
56
In both cases there is a dierence between action integrals for
electrons paths passing the line magnet on the right and on the left.
14.4.3 Toroidal magnet and moving charge
The system consisting of a toroidal magnet and a moving charge is inter-
esting for two reasons. First, toroidal permanent magnets were used in
Tonomuras experiments [TOM
+
86, OMK
+
86], which are regarded as the
best evidence for the Aharonov-Bohm eect. Second, classical Maxwells
electrodynamics has a serious trouble in explaining how the total momen-
tum is conserved in this system. This is known as the Cullwicks paradox
[Cul52, AHR04, McDa]. In an attempt to explain this paradox, let us apply
Maxwells theory to a charge moving along the symmetry axis through the
center of a magnetized torus (see Fig. 14.13). As we will see below, there
is no magnetic eld outside the toroidal magnet, so the force acting on
55
Our result was derived for thin ferromagnetic rods and solenoids, however the same
arguments apply to innite cylindrical rods and solenoids of any cross-section.
56
To nd the potential energy in the case of a nite linear magnet one should simply
use nite integral limits in (14.45).
508 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
xx
yy
zz
RR
aa
22
qq
11
qq
11
path 1
path 2
Figure 14.13: Toroidal magnet and moving charge. path 1 passes through
the center of the torus; path 2 is outside the torus.
the charge is zero. However, the moving charge creates its own magnetic
eld which does act on the torus with a non-zero force.
57
So, the New-
tons third law is apparently violated. According to McDonald [McDa], the
balance of force can be restored if one takes into account the hypothetical
momentum of the electromagnetic eld. However, this is not the whole
story yet. The eld momentum turns out to be non-zero even in the case
when both the magnet and the charge are at rest. This leads to the absurd
conclusion that the linear momentum of the system does not vanish even if
nothing moves. The problem is allegedly xed by assuming the existence
of the hidden momentum in the magnet. However this explanation does
not seem satisfactory and here we would like to suggest a dierent version of
events.
First we need to derive the Hamiltonian describing dynamics of the sys-
tem charge 1 + toroidal magnet 2. We introduce a Cartesian coordinate
system shown in g 14.13. Assume that the torus has radius a and linear
magnetization density
2
and that particle 1 moves straight through the cen-
57
Note that in the McDonalds treatment the force is identied with the time derivative
of momentum, while in our approach the force is dened as (mass)(acceleration).
14.4. AHARONOV-BOHM EFFECT 509
ter of the torus with momentum p
1y
(path 1 in Fig. 14.13). Then we can use
symmetry arguments to disregard x and z components of forces and write
58
r
2
= (a cos , 0, a sin )
r = r
1
r
2
= (a cos , r
1y
, a sin )
2
= (
2
sin , 0,
2
cos )
[
2
r]
y
= a
2
sin
2
+ a
2
cos
2
= a
2
[
2
r] p
1
= a
2
p
1y
[
2
r] P
2
= a
2
M
2
V
2y
Then the potential energy of interaction between the charge and the magnet
is obtained by integrating the potential energy in (14.25) on the length of
the torus
59
V =
2
_
0
d
_
q
1
a
2
2
p
1y
4m
1
c(a
2
cos
2
+ (r
1y
R
2y
)
2
+ a
2
sin
2
)
3/2
+
q
1
a
2
2
V
2y
8c(a
2
cos
2
+ (r
1y
R
2y
)
2
+ a
2
sin
2
)
3/2
_
=
q
1
a
2
2
p
1y
2m
1
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
+
q
1
a
2
2
P
2y
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
(14.50)
and the full Hamiltonian can be written as
H =
p
2
1y
2m
1
+
P
2
2y
2M
2
p
4
1y
8m
3
1
c
2
P
4
2y
8M
3
2
c
2
q
1
a
2
2
p
1y
2m
1
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
+
q
1
a
2
2
P
2y
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
58
Here m
2
is the linear density (mass per unit length) of the torus. We do not assume
that the magnet is stationary. It can move along the y-axis. The y-component of its
velocity is denoted V
2y
P
2y
/M
2
, where M
2
is the full mass of the magnet and P
2
is the
magnets momentum.
59
R
2
is the center of mass of the toroid. Here we assume that we are dealing with
a permanent toroidal magnet. For a toroidal solenoid one should integrate the potential
energy expression (14.21). Then the second term on the right hand side of (14.50) would
be absent.
510 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
Hamiltons equations of motion lead to the following results
dp
1y
dt
=
V
r
1y
=
3q
1
a
2
2
p
1y
(r
1y
R
2y
)
2m
1
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
+
3q
1
a
2
2
P
2y
(r
1y
R
2y
)
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
dP
2y
dt
=
dp
1y
dt
So, unlike in Maxwells theory, the rate of change of the 1st particles momen-
tum is non-zero and the 3rd Newtons law is satised without involvement
of the electromagnetic eld momentum. Acceleration of the charge 1 is
calculated as follows
dr
1y
dt
=
p
1y
m
1
p
3
1y
2m
3
1
c
+
V
p
1y
=
p
1y
m
1
p
3
1y
2m
3
1
c
q
1
a
2
2
2m
1
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
d
2
r
1y
dt
2
=
p
1y
m
1
+
3q
1
a
2
2
(r
1y
R
2y
)(v
1y
V
2y
)
2m
1
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
3q
1
a
2
2
V
2y
(r
1y
R
2y
)
4m
1
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
When the magnet is at rest (V
2y
= 0) this expression vanishes, so there is no
force (acceleration) on the particle 1, as expected.
60
The force (acceleration)
acting on the magnet is found by the following steps
dR
2y
dt
=
P
2y
M
2
P
3
2y
2M
3
2
c
+
V
P
2y
=
P
2y
M
2
P
3
2y
2M
3
2
c
+
q
1
a
2
2
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
3/2
d
2
R
2y
dt
2
=
p
2y
M
2
3q
1
a
2
2
(r
1y
R
2y
)(v
1y
V
2y
)
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
3q
1
a
2
2
v
1y
(r
1y
R
2y
)
4M
2
c(a
2
+ (r
1y
R
2y
)
2
)
5/2
60
This result holds also for a toroidal solenoid.
14.4. AHARONOV-BOHM EFFECT 511
So, the magnets acceleration does not vanish even if V
2y
= 0. This is an ex-
ample of situation described in subsection 14.1.3: the forces are not balanced
despite exact conservation of the total momentum.
To complete consideration of the quasiclassical wave packet passing through
the center of a stationary torus we need to calculate the action integral. It
is obtained by the time integration of the potential (14.50), where we set
R
2y
= P
2y
= 0, p
1y
m
1
v
1y
, r
1y
= v
1y
t
0
=
dt
q
1
a
2
2
v
1y
2c(a
2
+ v
2
1y
t
2
)
3/2
=
q
1
2
c
(14.51)
Now let us consider a charge whose trajectory passes outside the station-
ary torus (path 2 in Fig. 14.13). The force acting on the charge vanishes,
so we can assume that the wave packet travels with constant velocity along
straight line
r(t) r
1
(t) = (R, v
1y
t, 0) (14.52)
To calculate the action integral we repeat our earlier derivation of the po-
tential energy (14.50), this time taking into account x- and z-components of
vectors. We will assume that the torus is small, so that at all times r a,
r
1
r and
[
2
r] = (
2
r
1y
cos ,
2
r
1z
sin +
2
r
1x
cos
2
a, r
1y
sin )
[
2
r] p
1
=
2
(p
1x
r
1y
cos + p
1y
r
1z
sin + p
1y
r
1x
cos p
1y
a p
1z
r
1y
sin )
=
1
a
(N
2
p
1
) +
2
[r
1
p
1
]
z
cos
2
[r
1
p
1
]
x
sin
Here we characterized magnetic properties of the toroidal magnet by the
vector N
2
= (0,
2
a
2
, 0) which is perpendicular to the plane of the torus and
whose length is
2
a
2
. Then using approximation (14.19), setting p
2
= 0 and
integrating the potential energy in (14.25) on the length of the torus, we
obtain
V =
2
_
0
d
q
1
a[
2
r] p
1
4m
1
cr
3
512 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
q
1
4m
1
c
2
_
0
d ((N
2
p
1
) +
2
a[r p
1
]
z
cos
2
a[r p
1
]
x
sin )
_
1
r
3
+
3a(r
x
cos + r
z
sin )
r
5
_
=
q
1
(N
2
p
1
)
2m
1
cr
3
+
3q
1
2
a
2
4m
1
cr
5
2
_
0
d
_
[r p
1
]
z
r
x
cos
2
[r p
1
]
x
r
z
sin
2
_
=
q
1
(N
2
p
1
)
2m
1
cr
3
+
3q
1
2
a
2
4m
1
cr
5
([r p
1
]
z
r
x
[r p
1
]
x
r
z
)
=
q
1
(N
2
p
1
)
2m
1
cr
3
+
3q
1
4m
1
cr
5
([[p
1
r] r] N
2
)
=
q
1
(N
2
p
1
)
2m
1
cr
3
3q
1
4m
1
cr
5
((p
1
N
2
)r
2
(r N
2
)(p
1
r))
=
q
1
(N
2
p
1
)
4m
1
cr
3
+
3q
1
(r N
2
)(p
1
r)
4m
1
cr
5
(14.53)
The time dependence of this potential energy is obtained by substitution of
(14.52) in (14.53). As expected, the corresponding action integral vanishes
R
=
V (t)dt =
dt
_
q
1
N
2
v
1y
4c(R
2
+ v
2
1y
t
2
)
3/2
+
3q
1
N
2
v
3
1y
t
2
4c(R
2
+ v
2
1y
t
2
)
5/2
_
= 0
Comparing this result with (14.51) we see that the phase dierence for the
two paths (inside and outside the torus) is
=
1
(
0
R
) =
e
c
This is the same result as in the case of innite linear solenoid (14.49). Note
that this phase shift does not depend on the radius of the magnet a and on the
charges velocity v
1y
. This is in full agreement with Tonomuras experiments
[TOM
+
86, OMK
+
86].
14.5. FAST MOVING CHARGE 513
14.5 Fast moving charge
In subsection 14.1.2 we calculated forces (14.9) - (14.10) acting between two
charges in relative motion. These formulas were approximate as they included
only terms of order (v/c)
2
and lower. So, they could not be applied to
situations in which charges move with high velocities comparable to the speed
of light. With modern accelerators it is not dicult to produce such fast-
moving particles and measure their properties. An interesting experiment of
this kind was performed at Frascati National Laboratory in Italy [CdSF
+
12].
In subsection 14.5.3 we will discuss this work in greater detail.
Here we will be interested in a specic setup in which charge q
1
is moving
with a high constant momentum p
1
m
1
c along the z-axis, while the charge
2 is resting at the distance y from the beam line, as shown in g. 14.15. For
simplicity, we will choose our axes in such a way that the point z
1
= 0 on
the beam line corresponds to the closest approach between the two charges.
Likewise, t = 0 is time when particle 1 passes through this point. We will
assume that charge 2 is very small (q
2
q
1
), so that its presence has no
visible eect on the straight-line movement of q
1
. We will also assume that the
mass m
2
is innitely large, so that in the course of our experiment this particle
does not move (v
2
= 0). Our goal is to calculate the force f
2
experienced by
the test charge q
2
. More exactly, we are interested in the ratio
e f
2
/q
2
(14.54)
which in Maxwells electrodynamics goes by the name electric eld.
14.5.1 Fast moving charge in RQD
Let us start from evaluating the interaction energy between charges 1 and 2
beyond the (v
1
/c)
2
approximation. Near the energy shell we can use formula
(9.26)
61
V
d
2
q
1
q
2
2
c
2
(2)
3
_
dkdp
2
dp
1
m
1
c
2
p
1
p
1
+k
W
(p
2
k; p
2
)U
(p
1
+k; p
1
)
(
p
1
p
1
+k
)
2
c
2
k
2
61
we ignore spins of the two particles and take the limit m
2
.
514 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
d
p
2
k
a
p
1
+k
d
p
2
a
p
1
q
1
q
2
2
c
2
(2)
3
_
dkdp
2
dp
1
m
1
c
2
p
1
p
1
+k
U
0
(p
1
+k; p
1
)
(
p
1
p
1
+k
)
2
c
2
k
2
d
p
2
k
a
p
1
+k
d
p
2
a
p
1
(14.55)
According to subsection 8.2.8, the position-space representation of this
potential can be obtained by Fourier-transforming the coecient function
v
d
2
(p
1
+ k, p
1
, k) in (14.55). Since we are interested only in long-distance
interactions, the relevant integration range is around [k[ = 0. So, we will
assume k p
1
and
p
cp. Then from (J.67) and (H.12) we obtain
m
1
c
2
p
1
p
1
+k
m
1
c
p
1
U
0
(p
1
+k; p
1
) =
_
_
p
1
+k
+ m
1
c
2
_
p
1
+ m
1
c
2
+
_
p
1
+k
m
1
c
2
_
p
1
m
1
c
2
(p
1
+k) p
1
[p
1
+k[p
1
_
1
2m
1
c
2
p
1
m
1
c
v
d
2
(p
1
+k, p
1
, k)
q
1
q
2
2
(2)
3
c
2
(
p
1
p
1
+k
)
2
c
2
k
2
x
c
2
k
2
y
c
2
k
2
z
(14.56)
The non-negative expression (k
x
, k
y
, k
z
) (
p
1
p
1
+k
)
2
in the denomi-
nator is a function, which vanishes at k
x
= k
y
= k
z
= 0 and has zero rst
derivatives there
62
k
y
k=0
= 2(
p
1
p
1
+k
)
c
2
k
y
p
1
+k
k=0
= 0
k
z
k=0
= 2(
p
1
p
1
+k
)
c
2
(p
1z
k
z
)
p
1
+k
k=0
= 0
For second derivatives we obtain
k
2
z
k=0
= 2
c
2
(p
1z
k
z
)
p
1
+k
k
z
(
p
1
p
1
+k
)
k=0
62
We took into account that p
1x
= p
1y
= 0.
14.5. FAST MOVING CHARGE 515
= 2
c
2
(p
1z
k
z
)
p
1
+k
c
2
(p
1z
k
z
)
p
1
+k
k=0
=
2c
4
p
2
1z
2
p
1
= 2v
2
1z
k
2
y
k=0
=
2
k
y
k
z
k=0
= 0
Then the Taylor expansion around k = 0 yields (k
x
, k
y
, k
z
) k
2
z
v
2
1
. Substi-
tuting this expression in (14.56), we obtain
v
d
2
(p
1
+k, p
1
, k)
q
1
q
2
2
(2)
3
1
k
2
x
+ k
2
y
+ k
2
z
(1 v
2
1
/c
2
)
and the position-space potential is
v
d
2
(p
1
, r) =
_
dkv
2A
(p
1
+k, p
1
, k)e
i
kr
=
q
1
q
2
2
(2)
3
_
dk
e
i
kr
k
2
x
+ k
2
y
+ k
2
z
/
2
=
q
1
q
2
4
_
x
2
+ y
2
+
2
z
2
(14.57)
where we dened 1/
_
1 v
2
1
/c
2
1 and r r
2
r
1
.
Formula (14.57) is the potential energy of interaction between charges 1
and 2. The full Hamiltonian can be written as
H m
2
c
2
+
p
2
2
2m
2
+ cp
1
+
q
1
q
2
4
_
x
2
+ y
2
+
2
z
2
The force acting on the particle 2 is
f
2
m
2
dr
2
dt
2
=
dp
2
dt
=
H
r
2
and the electric eld (14.54) at t = 0 can be obtained as the gradient of
the potential (14.57)
e
()
x
(x, y, z) =
1
q
2
v
d
2
x
2
=
q
1
x
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.58)
e
()
y
(x, y, z) =
1
q
2
v
d
2
y
2
=
q
1
y
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.59)
e
()
z
(x, y, z) =
1
q
2
v
d
2
z
2
=
q
1
3
z
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.60)
516 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
E
y E
z
z
z
0
0
(a)
(b)
Figure 14.14: Schematic electric eld proles along z-direction (x = 0, y >
0 and t = 0 are xed) for a charge moving with velocity v = c
_
2
1/
along the z-axis. Broken line - charge at rest ( = 0); thick line - RQD
electric eld e for a moving charge ( = 2); thin full line - eld E for the
moving charge ( = 2) in Maxwells theory: (a) transversal eld components
e
y
and E
y
coincide for all ; (b) longitudinal eld components [e
z
[ > [E
z
[.
It is interesting to compare this eld with the one produced by a charge at
rest ( = 1)
e
(=1)
(x, y, z) =
q
1
r
4(x
2
+ y
2
+ z
2
)
3/2
(14.61)
For x = 0 and xed value y > 0 we plotted e
y
- and e
z
-components of (14.61)
as functions of z in Fig. 14.14. They are shown by broken lines. Field
components for the moving charge ((14.59) and (14.60) with 2) are
shown there by thick full lines. The eect of the charges velocity is twofold:
First, the eld prole gets squeezed towards the charges position (z
1
= 0).
Second, the magnitude of the eld increases. This means that the electric
eld conguration around a fast-moving charge is concentrated in a narrow
disk perpendicular to the direction of motion. This disk moves together with
the charge as if it was rigidly attached to the instantaneous charges position.
14.5. FAST MOVING CHARGE 517
z
y
r
(
t
'
)
c
|
t
'
|
v
1
|t'|
v
1
v
1
1
2
0
Figure 14.15: For calculation of Lienard-Wiechert elds in (14.62). Full
circles mark positions of particles 1 and 2 at time t = 0. The open circle
marks position of the particle 1 at an earlier time t
.
14.5.2 Fast moving charge in Maxwells electrodynam-
ics
In the preceding subsection we used our RQD formalism to nd the electric
eld generated by a fast-moving charge. Let us now see how the same
problem is solved in classical Maxwells electrodynamics.
The standard derivation
63
involves the concept of retarded Lienard-Wiechert
elds. The idea is that electric elds are not rigidly attached to the moving
charge. They radially spread around the charge with the speed equal to the
speed of light. So, the total eld around the charge is not determined by the
charges instantaneous position. It is rather a function of previous locations
of the particle. The formula for the Lienard-Wiechert eld produced by the
uniformly moving charge 1 at the point 2 at time t = 0 is
64
63
see section 14.1 in [Jac99]
64
See equation (14.14) in [Jac99]. The Lienard-Wiechert eld is denoted by the capital
E in order to distinguish it from the electric eld e (14.58) - (14.60) predicted in our
theory.
518 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
E(r
2
) =
q
1
4
r(t
) r(t
)v
1
/c
2
[r(t
) (r(t
) v
1
)/c]
3
(14.62)
Various components in this formula are shown in Fig. 14.15. In particular,
r(t
) = r
2
r
1
(t
= r(t
) = (0, 0, z
1
(t
=
_
y
2
+ z
2
1
(t
) (14.63)
On the other hand, in the time interval [t
= z
1
(t
) (14.64)
Solving the system of equations (14.63) - (14.64) we obtain
t
=
y
_
c
2
v
2
1
=
y
c
(14.65)
r(t
) = (0, y, y
v
1
c
)
r(t
) = y
_
1 +
2
v
2
1
c
2
= y (14.66)
Using these results in the Lienard-Wiechert formula (14.62) we nd electric
eld components at time t = 0
65
65
For a detailed derivation see references quoted in [CdSF
+
12], e.g., section 14.1 in
[Jac99]. Note that this is the same result as the one obtained by Lorentz-transforming
eld components (14.61) to the moving frame (see section 11.10 in [Jac99]). However, the
method of Lorentz transformations is questionable, because, as we explained in section
15.3, standard special-relativistic formulas for such transformations are valid only in the
absence of interactions and cannot be used for transforming forces between particles (or
electric elds).
14.5. FAST MOVING CHARGE 519
E
x
(x, y, z) =
q
1
x
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.67)
E
y
(x, y, z) =
q
1
y
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.68)
E
z
(x, y, z) =
q
1
z
4(x
2
+ y
2
+
2
z
2
)
3/2
(14.69)
Field components E
x
and E
y
perpendicular to the direction of motion are
exactly the same as in our result (14.58) - (14.59). But the parallel component
(14.69) is
2
time smaller than (14.60). This component is shown by the thin
full line in Fig. 14.14(b). More experimental studies will be needed in order
to decide, which theoretical prediction (e
z
or E
z
) is the correct one.
Formulas (14.65) and (14.66) from the traditional theory mean that the
electric eld grows older as the observation point moves away from the
beam line, i.e., as y increases. The eld observed at the transverse distance y
from the beam line originates from an earlier location of the charge, which is
removed from the observation point by the distance of y y. This means
that in Lienard-Wiechert theory results (14.67) - (14.69) are valid for large
_
x
2
+ y
2
only if, prior to the eld observation, the charge was in the state of
uniform rectilinear motion for a long time (14.65). On the other hand, in our
approach formulas (14.58) - (14.60) describe an action-at-a-distance force.
This force depends only on the instantaneous position and velocity of the
charge 1 and it does not depend on the charges trajectory at previous times.
This distinction between the two approaches will play an important role in
our discussion of Frascati experiments in the following two subsections.
14.5.3 Experiment at Frascati
Relativistic electron bunches are available from modern accelerators. Electric
elds generated by such bunches can be measured with high accuracy. So,
theories presented above can be veried by experimental means. One such
experiment [CdSF
+
12] used 500 MeV electron beams ( 10
3
) with (0.5
5.0) 10
8
electrons per pulse. The transverse y-component of the electric
eld was measured at distances y = 3, 5, 10, 20, 40 and 55 cm from the beam
line. This experiment conrmed theoretical results (14.59) or (14.68) within
experimental errors.
As the authors correctly pointed out, this conrmation spells trouble
for the traditional Lienard-Wiechert theory, whose result (14.68) relies on
520 CHAPTER 14. CLASSICAL ELECTRODYNAMICS
the requirement that the electron bunch has moved uniformly long before
the measurement was made.
66
For example, for the validity of (14.68) at
y = 55 cm and t = 0,
67
the uniform rectilinear movement of the bunch
should extend as far in the past as t
ic
Kx
(p) = (2)
3/2
p
cosh cp
x
sinh
p
and transforming back to the position representation via (5.42) we obtain
1
see subsection 8.1.1
2
This is a non-normalizable state that we called improper in section 5.2. Similar
arguments apply to normalized localized wave functions, like
_
(r).
3
see equation (5.30)
15.1. LOCALIZABILITY OF PARTICLES 529
AA
BB
t=0
t>0
rr
ct
Figure 15.1: Spreading of the probability distribution of a localized wave
function. Full line: at time t = 0; dashed line: at time t > 0 (the distance
between points A and B is greater than ct).
e
ic
Kx
(r) = (2)
3
_
dp
_
cosh (cp
x
/
p
) sinh e
i
pr
(15.3)
We are not going to calculate this integral explicitly, but one property of the
function (15.3) must be clear: for non-zero this function is non-vanishing
for all values of r.
4
Therefore, the moving observer O
.
The non-invariant nature of localization is a property not familiar in clas-
sical physics. Although this property has not been observed in experiments
yet, it does not contradict any postulates of relativistic quantum theory and
does not constitute a sucient reason to reject the notion of localizability.
15.1.3 Spreading of well-localized states
Here we are going to discuss the wide-spread opinion that superluminal
spreading of particle wave functions violates the principle of causality [HC,
4
This property follows from the non-analyticity of the square root in the integrand
[Str04].
530 CHAPTER 15. PARTICLES AND RELATIVITY
Walc, Mal96, Heg98, DB94].
In the preceding subsection we found how a localized state (15.1) looks
from the point of view of a moving observer. Now, let us nd the appearance
of this state from the point of view of an observer displaced in time. Again, we
rst make a detour to the momentum space (15.2), apply the time translation
operator
(p, t) = e
Ht
(p, 0) = (2)
3/2
e
it
m
2
c
4
+p
2
c
2
and then use equation (5.42) to nd the position-space wave function at
non-zero t
(r, t) = (2)
3/2
_
dp(p, t)e
i
pr
= (2)
3
_
dpe
it
m
2
c
4
+p
2
c
2
e
i
pr
This integral can be calculated analytically [Rui]. However, for us the most
important result is that the wave function is non-zero at distances larger
than ct from the initial point A (r > ct), i.e., outside the light cone.
5
The corresponding probability density [(r, t)[
2
is shown schematically by
the dashed line in Fig. 15.1. Although the probability density outside the
light cone is very small, there is still a non-zero chance that the particle
propagates faster than the speed of light.
Note that superluminal propagation of the particles wave function in the
position space does not mean that particles speed is greater than c. As we
have established in subsection 5.1.2, for a free massive particle eigenvalues
of the quantum-mechanical operator of speed are less than c. So, the possi-
bility of wave functions propagating faster than c is a purely quantum eect
associated with the non-commutativity of operators R and V.
15.1.4 Superluminal spreading and causality
The superluminal spreading of localized wave packets described in the pre-
ceding subsection holds under very general assumptions in relativistic quan-
tum theory [Heg98]. It is usually regarded as a sign of a serious trouble
[HC, Walc, Mal96, DB94, Zub00], because the superluminal propagation of
any signal is strictly forbidden in special relativity.
6
This contradiction is
5
This fact can be justied by the same analyticity argument as in footnote on page
529. See also [WWS
+
12] and section 2.1 in [PS95b].
6
see Appendix I.3
15.1. LOCALIZABILITY OF PARTICLES 531
OO O
AA
CC
xx
BB
x
ct ct
Figure 15.2: Space-time diagram demonstrating the causality paradox as-
sociated with superluminal spreading of wave functions. Observers O and
O
, ct
), respec-
tively. The situation here is similar to that shown in Fig. I.1. Observers
O and O
(r
, t
(x, y, z, t) =
( x) (15.4)
This wave function is shown schematically in Fig. 15.2 by inclined parallel
thin dashed lines. Then we see that there is a non-zero probability of nding
particles emitted by O
to
O!
What is wrong with this picture? The traditional answer is that this weird
behavior is the consequence of the superluminal propagation of the wave
function. The usual conclusion is that sanity can be restored by forbidding
such superluminal eects. However, this would go against the entire theory
developed in this book. Could there be a dierent answer?
We would like to suggest the following explanation: Apparently, the cru-
cial step in the above derivation is the use of the wave function transfor-
mation law (15.4). However, there are serious reasons to doubt that this
formula is applicable even approximately. First, here we are dealing with
a system (particles conned in a box) where interactions play a signicant
role. In such a system the boost operator is interaction-dependent and boost
transformations of wave functions should depend on the details of interaction
15.1. LOCALIZABILITY OF PARTICLES 533
potentials.
7
Therefore, it is obvious that the wave function transformation
cannot be described by the universal interaction-independent formula (15.4).
Moreover, our system is not isolated. It is described by a time-dependent
Hamiltonian (the box is opened at some point in time). This makes Poincare
group arguments unapplicable and further complicates the analysis of boost
transformations. Even if we assume that interaction-dependence of boost
transformations can be neglected in some approximation, there is no justi-
cation for equation (15.4). This formula cannot be used to transform even
wave functions of free particles. For example, this formula contradicts the
transformation law (15.3) derived earlier for localized particle states. So,
there is absolutely no evidence that the wave function of particles emitted
by observer O
arrives to O
at point C in violation of the causality law.
It is plausible that using the correct boost transformation law one would
obtain that the wave function of particles released by O
at point B propa-
gates superluminally in the reference frame O as well.
8
This conclusion can
be supported by the following argument. From the point of view of observer
O, the particle emitted at point B (t = 0) is in a localized state with denite
position. Such states do not have any denite velocity (or momentum), so
their free
9
time evolution is determined only by the value of position char-
acteristic for the initial state. Therefore particles emitted by boxes at rest
(e.g., the box A) and by moving boxes (e.g., the box B) are described by
essentially the same time-dependent wave functions at t > 0. The only dif-
ference being a relative shift along the x-axis. Then instead of the acausal
response signal B C
10
one would have a signal B A, which, in spite of
being instantaneous, does not violate the principle of causality.
7
This dependence will be discussed in the next section in greater detail.
8
this means that thin dashed lines around point B in Fig. 15.2 should be drawn parallel
to the axis x
9
At times t > 0 the box B remains opened and the particles evolution is described by
the non-interacting Hamiltonian H
0
.
10
as predicted incorrectly in the traditional approach based on equation (15.4)
534 CHAPTER 15. PARTICLES AND RELATIVITY
15.2 Inertial transformations in multiparticle
systems
One of the goals of physics declared in Introduction
11
includes nding trans-
formations of observables between dierent inertial reference frames. In chap-
ter 4 and in subsection 6.2.3 we discussed inertial transformations of total
observables in a multiparticle system and we found that these transforma-
tions have universal forms, which do not depend on the systems composition
and interactions acting there. In this section we will be interested in estab-
lishing inertial transformations for observables of individual particles within
an interacting multiparticle system. Our goal is to compare these predic-
tions of RQD with Lorentz transformations for time and position of events
in special relativity
t
= y (15.7)
z
= z (15.8)
Here we will reach a surprising conclusion that formulas of special relativity
may be not accurate.
15.2.1 Events and observables
One of the most fundamental concepts in physics is the concept of an event.
Generally, event is some physical process or phenomenon occurring in a small
volume of space in a short interval of time. So, each event can be charac-
terized by four numbers: its time t and its position r. These numbers are
referred to as space-time coordinates (t, r) of the event. For the event to be
observable, there should be some material particles present at time t at the
point r. The simplest example of an event is an intersection of trajectories
of two particles. We will dene t as the reading of the clock belonging to the
observer witnessing the particles collision and r as the (expectation) value
of the position of particles present in the events volume.
11
see page xxxvii
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS535
In this section we would like to derive the relationship between events
space-time coordinates (t, r) measured in the reference frame at rest O and
space-time coordinates (t
, r
.
Since we just identied events position with the expectation value of par-
ticles position operators, nding boost transformations r r
is just an
exercise in straightforward application of the general rule for transforma-
tions of operators of observables between dierent reference frames.
12
By
following this rule we should be able to derive analogs of Lorentz transfor-
mations (15.5) - (15.8) without articial assumptions from Appendix I and
we should be able to tell whether Lorentz transformations formulas are exact
or approximate. This is the plan of our presentation in this section.
For simplicity, here we will consider a system of two massive spinless
particles described in the Hilbert space 1 = 1
1
1
2
, where one-particle
observables (position, momentum, velocity, angular momentum, spin, en-
ergy,...) are denoted by lowercase letters:
r
1
, p
1
, v
1
, j
1
, s
1
, h
1
, . . . (15.9)
r
2
, p
2
, v
2
, j
2
, s
2
, h
2
, . . . (15.10)
Transformations of these observables between reference frames O and O
Fb
ge
i
Fb
= g
ib
[F, g]
b
2
2!
[F, [F, g]] + . . . (15.11)
Application of this formula to events position is not straightforward, because
particle localization does not have absolute meaning in quantum mechanics.
If observer O registers a localized event (or locallized particles constituting
this event), then other observers may disagree that the event is localized
or that it has occurred at all. Examples of such a behavior are common
in quantum mechanics. Some of them were discussed in subsections 6.5.3,
12
see subsections 4.3.8 and 5.2.4
536 CHAPTER 15. PARTICLES AND RELATIVITY
15.1.2 and 15.1.3. Thus we are going to apply boost transformations only to
expectation values of positions. In other words, in the rest of this chapter we
will work in the classical limit, where wave packet spreading can be ignored
and particle positions and trajectories can be unambiguously dened. Then
we will interpret (15.9) and (15.10) as numerical (expectation) values of ob-
servables in quasiclassical states and instead of quantum operator equation
(15.11) with commutators we will use its classical analog involving Poisson
brackets (6.95)
g(b) g + b[F, g]
P
+
b
2
2!
[F, [F, g]
P
]
P
+ . . .
In order to perform calculations with this formula one needs two major things.
First, one needs to know expressions for Poincare generators F in terms
of one-particle observables (15.9) - (15.10). This is equivalent to having a
full dynamical description of the system. Such a description can be easily
obtained in the case of a non-interacting particle system. However, for in-
teracting particles this is a rather non-trivial problem that can be solved
only approximately. Second, one needs to know Poisson brackets between all
one-particle observables (15.9) - (15.10). This is an easier task, which has
been accomplished already in chapters 4, 5 and in section 6.1. In particular,
we found there that observables of dierent particles always have vanishing
Poisson brackets. The Poisson brackets for observables referring to the same
particle are (i, j, k = 1, 2, 3)
[r
i
, r
j
]
P
= [p
i
, p
j
]
P
= [r
i
, s
j
]
P
= [p
i
, s
j
]
P
= 0 (15.12)
[r
i
, p
j
]
P
=
ij
(15.13)
[s
i
, s
j
]
P
=
3
k=1
ijk
s
k
(15.14)
[p, h]
P
= [s, h]
P
= 0 (15.15)
[r, h]
P
=
pc
2
h
(15.16)
15.2.2 Non-interacting particles
First we assume that the two particles 1 and 2 are non-interacting, so that
generators of inertial transformations in the Hilbert space 1 are
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS537
H
0
= h
1
+ h
2
(15.17)
P
0
= p
1
+p
2
(15.18)
J
0
= j
1
+j
2
(15.19)
K
0
= k
1
+k
2
(15.20)
The trajectory of the particle 1 in the reference frame O is obtained from the
usual formula (4.55)
r
1
(t) = e
i
H
0
t
r
1
e
H
0
t
= e
i
(h
1
+h
2
)t
r
1
e
(h
1
+h
2
)t
= e
i
h
1
t
r
1
e
h
1
t
r
1
+ t[h
1
, r
1
]
P
+
t
2
2!
[h
1
, [h
1
, r
1
]
P
]
P
+ . . . = r
1
+v
1
t (15.21)
Applying boost transformations to (15.21) and taking into account (4.6) -
(4.8), (4.57) - (4.59) and (4.61) we nd the trajectory of this particle in the
reference frame O
) =
_
r
1x
cosh
+ (v
1x
v)t
_
(15.22)
r
1y
(, t
) =
_
r
1y
+
j
1z
v
h
1
+
v
1y
t
cosh
_
= r
1y
+
_
r
1x
v
1y
v
c
2
+
v
1y
t
cosh
_
(15.23)
r
1z
(, t
) =
_
r
1z
+
j
1y
v
h
1
+
v
1z
t
cosh
_
= r
1z
+
_
r
1x
v
1z
v
c
2
+
v
1z
t
cosh
_
(15.24)
where we denoted (1 v
1x
vc
2
)
1
. Similar formulas are valid for the
particle 2.
The important feature of these formulas is that inertial transformations
for particle observables are completely independent on the presence of other
13
If we set t
= 0 then these formulas coincide with (23) - (24) in ref. [MM97]. Set-
ting also v
1
= 0 we obtain the usual Lorentz length contraction formulas r
1x
(, 0) =
r
1x
/(cosh), r
1y
(, 0) = r
1y
, r
1z
(, 0) = r
1z
. Compare with equation (I.20).
538 CHAPTER 15. PARTICLES AND RELATIVITY
particles in the system, e.g. formulas for r
1
(, t
as well. So O
? The answer to
this question is given by the following theorem.
Theorem 15.1 (Lorentz transformations for time and position) For
events dened as intersections of trajectories of non-interacting particles, the
Lorentz transformations for time and position (15.5) - (15.8) are exactly
valid.
Proof. Let us rst prove that Lorentz formulas (15.5) - (15.8) are correct
transformations for the trajectory of the particle 1 between reference frames
O and O
. For simplicity, we will consider only the case in which the particle
is moving along the x-axis: r
1y
(t) = r
1z
(t) = v
1y
= v
1z
= 0. (More general
situations can be analyzed similarly.) Then we can neglect the y- and z-
coordinates in our proof. So, we need to prove that
14
r
1x
(, t
) = r
1x
(0, t) cosh ct sinh
= (r
1x
+ v
1x
t) cosh ct sinh (15.25)
14
Here the left hand side is the Newton-Wigner position of the particle 1 seen from the
reference frame O
at time t
= t cosh
r
1x
(t)
c
sinh (15.26)
To do that, we calculate the dierence between the right hand sides of equa-
tions (15.22) and (15.25) with t
Ht
r
1
e
Ht
= e
i
(h
1
+h
2
+V )t
r
1
e
(h
1
+h
2
+V )t
= r
1
+ t[h
1
+ V, r
1
]
P
+
t
2
2
[(h
1
+ h
2
+ V ), [h
1
+ V, r
1
]
P
]
P
+ . . .
= r
1
+v
1
t + t[V, r
1
]
P
+
t
2
2
[V, v
1
]
P
+
t
2
2
[(h
1
+ h
2
), [V, r
1
]
P
]
P
+
t
2
2
[V, [V, r
1
]
P
]
P
+ . . . (15.31)
In the simplest case when interaction V commutes with particle positions
and in the non-relativistic approximation v
1
p
1
/m
1
this formula simplies
15
see equations (6.14) - (6.17)
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS541
r
1
(t) r
1
+v
1
t
t
2
2m
1
V
r
1
+ . . . = r
1
+v
1
t +
f
1
t
2
2m
1
+ . . .
= r
1
+v
1
t +
a
1
t
2
2
+ . . .
where we denoted
f
1
(r
1
, p
1
, r
2
, p
2
)
V (r
1
, p
1
, r
2
, p
2
)
r
1
the force with which particle 2 acts on the particle 1. The vector a
1
f
1
/m
1
can be interpreted as acceleration of the particle 1 in agreement with the
Newtons second law of mechanics. The trajectory r
1
(t) of the particle 1
depends in a non-trivial way on the trajectory r
2
(t) of the particle 2 and
vice versa. Curved trajectories of particles 1 and 2 are denitely observable
in macroscopic experiments. However, this interacting time evolution, by
itself, cannot tell us which form of relativistic dynamics is responsible for the
interaction. Other types of inertial transformations should be examined in
order to make this determination.
As an example, in this section we will explain which experimental mea-
surements should be performed to tell apart two popular forms of dynamics:
the instant form
H = h
1
+ h
2
+ V (15.32)
P = p
1
+p
2
(15.33)
J = j
1
+j
2
(15.34)
K = k
1
+k
2
+Z (15.35)
and the point form
H = h
1
+ h
2
+ V (15.36)
P = p
1
+p
2
+U (15.37)
J = j
1
+j
2
(15.38)
K = k
1
+k
2
(15.39)
542 CHAPTER 15. PARTICLES AND RELATIVITY
15.2.6 Boost transformations in interacting systems
Similar to the above analysis of time translations, we can examine boost
transformations. For interactions in the point form (15.36) - (15.39), the po-
tential boost Z is zero, so boost transformations of the position and velocity
are the same as in the non-interacting case
16
r
1x
() = e
ic
K
0x
r
1x
e
ic
K
0x
= e
ic
k
1x
r
1x
e
ic
k
1x
r
1x
c[k
1x
, r
1x
]
P
+ . . .
=
r
1x
cosh (1 v
1x
vc
2
)
(15.40)
v
1x
() = e
ic
K
0x
v
1x
e
ic
K
0x
= e
ic
k
1x
v
1x
e
ic
k
1x
v
1x
c[k
1x
, v
1x
]
P
+ . . .
=
v
1x
v
1 v
1x
vc
2
(15.41)
On the other hand, in the instant form generators of boosts (15.35) are
dynamical, and transformation formulas are dierent. For example, the boost
transformation of position is
r
1x
() = e
ic
Kx
r
1x
e
ic
Kx
= e
ic
(K
0x
+Zx)
r
1x
e
ic
(K
0x
+Zx)
r
1x
c[k
1x
, r
1x
]
P
c[Z
x
, r
1x
]
P
+ . . .
=
r
1x
cosh (1 v
1x
vc
2
)
c[Z
x
, r
1x
]
P
+ . . . (15.42)
The rst term on the right hand side is the same interaction-independent term
as in (15.40). This term is responsible for the well-known relativistic eect
of length contraction (I.20). The second term in (15.42) is a correction due
to interaction with the particle 2. This correction depends on observables
of both particles 1 and 2 and it makes boost transformations of positions
dependent in a non-trivial way on the state of the system and on interactions
acting there. So, in the instant form of dynamics, there is a strong analogy
between time translations and boosts of particle observables. They are both
interaction-dependent, i.e., dynamical.
16
For simplicity, we consider only x-components here. For a general case, see (4.6) -
(4.8) and (15.22) - (15.24). As usual, v c tanh.
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS543
In order to observe the dynamical eect of boosts described above, one
would need to use measuring devices moving with very high speeds com-
parable to the speed of light. This presents enormous technical diculties.
So, boost transformations of particle positions have not been directly ob-
served with an accuracy sucient to detect the kinematical relativistic eect
(15.40), let alone the deviation [Z
x
, r
1x
]
P
due to interactions.
Similarly, we can consider boost transformations of velocity in the instant
form of dynamics
v
1x
() = e
ic
Kx
v
1x
e
ic
Kx
= e
ic
(K
0x
+Zx)
v
1x
e
ic
(K
0x
+Zx)
= v
1x
c[k
1x
, v
1x
]
P
c[Z
x
, v
1x
]
P
+ . . . =
v
1x
v
1 v
1x
vc
2
c[Z
x
, v
1x
]
P
+ . . .
= (v
1x
v) +
v
1x
v(v
1x
v)
c
2
c[Z
x
, v
1x
]
P
+ . . .
The terms on the right hand side have clear physical meaning: The rst
term v
1x
v is the usual non-relativistic change of velocity in the moving
reference frame. This is the most obvious eect of boosts that is visible in our
everyday life. The second term is a relativistic correction that is valid for both
interacting and non-interacting particles. This correction is a contribution of
the order c
2
to the relativistic law of addition of velocities (4.6). Currently,
there is abundant experimental evidence for the validity of this law.
17
The
third term is a correction due to the interaction between particles 1 and 2.
This eect has not been seen experimentally, because it is very dicult to
perform accurate measurements of observables of interacting particles from
fast moving reference frames, as we mentioned above.
To summarize, detailed measurements of boost transformations of parti-
cle observables are very dicult and with the present level of experimental
precision they cannot help us to decide which form of dynamics is active
in any given physical system. Let us now turn to space translations and
rotations.
15.2.7 Spatial translations and rotations
In both instant and point forms of dynamics, rotations are interaction-
independent, so the term Y in the generator of rotations (15.29) is zero
17
see subsection 15.3.2
544 CHAPTER 15. PARTICLES AND RELATIVITY
and rotation transformations of particle positions (and other observables)
are exactly the same as in the non-interacting case, e.g.,
18
r
1
(
) = e
r
1
e
i
= e
j
1
r
1
e
i
j
1
= R
r
1
(15.43)
This is in full agreement with experimental observations.
In the instant form of dynamics, space translations are interaction-independent
as well
r
1
(a) = e
Pa
r
1
e
i
Pa
= e
(p
1
+p
2
)a
r
1
e
i
(p
1
+p
2
)a
= e
p
1
a
r
1
e
i
p
1
a
= r
1
a
Again this result is supported by experimental observations and our common
experience in various physical systems and in a wide range of values of the
transformation parameter a.
However, the point-form generator of space translations (15.37) does de-
pend on interaction, thus translations of the observer have a non-trivial eect
on measured positions of interacting particles. For example, the action of a
translation along the x-axis on the x-component of position of the particle 1
is
r
1x
(a) = e
Pxa
r
1x
e
i
Pxa
= e
(p
1x
+p
2x
+Ux)a
r
1x
e
i
(p
1x
+p
2x
+Ux)a
r
1x
a[(p
1x
+ U
x
), r
1x
]
P
+ . . .
= r
1x
a a[U
x
, r
1x
]
P
+ . . . (15.44)
where the last term on the right hand side is the interaction correction. Such
a correction has not been seen in experiments in spite of the fact that there
is no diculty in arranging observations from reference frames displaced by
large distances a. So, there is a good reason to believe that interaction
dependence (15.44) has not been seen because it is non-existent.
Thus we conclude that the eect of space translations and rotations must
be independent on interactions in the system. This means that these trans-
18
Rotation matrix R
) and p(, t
)
in the moving reference frame O
, where time t
) and p(, t
) r
1x
(0, t) ct (15.47)
r
1y
(, t
) = r
1y
(0, t) (15.48)
r
1z
(, t
) = r
1z
(0, t) (15.49)
p
1x
(, t
) p
1x
(0, t)
1
c
h
1
(0, t) (15.50)
p
1y
(, t
) = p
1y
(0, t) (15.51)
p
1z
(, t
) = p
1z
(0, t) (15.52)
t
t
c
r
1x
(0, t) (15.53)
We can rewrite equation (15.47) without aecting the rst order accuracy
level in t
and
r
1x
(, t
) = r
1x
_
0, t
+
r
1x
(0, t)
c
_
_
t
+
r
1x
(t)
c
_
c
r
1x
(0, t
) +
dr
1x
(0, t
)
cdt
r
1x
(0, t) ct
r
1x
(0, t
) +
dr
1x
(0, t
)
cdt
r
1x
(0, t
) ct
(15.54)
Next we use the second method (i.e., the direct application of interacting
time translations and boosts)
r
1x
(, t
) = e
ic
Kx
e
i
Ht
e
ic
Kx
e
ic
Kx
r
1x
(0, 0)e
ic
Kx
e
ic
Kx
e
Ht
e
ic
Kx
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS549
= e
i
Ht
cosh
e
ic
Pxt
sinh
e
ic
Kx
r
1x
(0, 0)e
ic
Kx
e
ic
Pxt
sinh
e
Ht
cosh
e
i
Ht
ic
Pxt
(r
1x
(0, 0) c[r
1x
(0, 0), K
x
]
P
)e
ic
Pxt
Ht
e
i
Ht
(r
1x
(0, 0) ct
c[r
1x
(0, 0), K
x
]
P
)e
Ht
= r
1x
(0, t
) c[r
1x
(0, t
), K
x
(t
)]
P
ct
(15.55)
p
1x
(, t
) = e
i
Ht
cosh
e
ic
Pxt
sinh c
e
ic
Kx
p
1x
(0, 0)e
ic
Kx
e
ic
Pxt
sinh c
e
Ht
cosh
e
i
Ht
ic
Pxt
(p
1x
(0, 0) c[p
1x
(0, 0), K
x
]
P
)e
ic
Pxt
Ht
e
i
Ht
(p
1x
(0, 0) c[p
1x
(0, 0), K
x
]
P
)e
Ht
= p
1x
(0, t
) c[p
1x
(0, t
), K
x
(t
)]
P
(15.56)
If Lorentz transformations were exact, then results of both methods would
be identical and comparing (15.54) and (15.55) we would obtain
1
c
dr
1x
(0, t)
dt
r
1x
(0, t) = c[r
1x
(0, t), K
x
(t)]
P
or using
dr
1x
dt
= [r
1x
, H]
P
=
H
p
1x
and [r
1x
, K
x
]
P
=
Kx
p
1x
c
2
K
x
p
1x
= r
x
H
p
1x
Similar arguments lead us to the general case (i, j = 1, 2, 3)
c
2
K
j
p
1i
= r
j
H
p
1i
(15.57)
Comparing equations (15.56) and (15.50) we get
p
1x
(0, t)
h
1
(0, t)
c
= p
1x
_
0, t
r
1x
(0, t)
c
_
c[p
1x
(0, t
), K
x
(t
)]
P
p
1x
(0, t)
r
1x
(0, t)
c
p
1x
(0, t)
t
c[p
1x
(0, t
), K
x
(t
)]
P
p
1x
(0, t)
r
1x
(0, t)
c
[p
1x
(0, t), H]
P
c[p
1x
(0, t), K
x
(t)]
P
and
550 CHAPTER 15. PARTICLES AND RELATIVITY
c
2
[p
1x
, K
x
]
P
= r
1x
[p
1x
, H]
P
+ h
1
In the general case (i, j = 1, 2, 3) we have
c
2
K
j
r
1i
= r
1j
H
r
1i
+
ij
h
1
(15.58)
Putting equations (15.57) - (15.58) together, we conclude that if tra-
jectories of interacting particles transform by Lorentz, then the following
equations must be valid
c
2
K
k
p
1
= r
1k
H
p
1
(15.59)
c
2
K
k
p
2
= r
2k
H
p
2
(15.60)
c
2
K
k
r
1i
= r
1k
H
r
1i
+
ik
h
1
(15.61)
c
2
K
k
r
2i
= r
2k
H
r
2i
+
ik
h
2
(15.62)
Our next goal is to show that these equations lead to a contradiction. Taking
derivatives of (15.59) by p
2
and (15.60) by p
1
and subtracting them we obtain
2
H
p
2
p
1
= 0
In a similar way we get
24
2
H
r
2
r
1
=
2
H
r
2
p
1
=
2
H
p
2
r
1
= 0
The only non-zero cross-derivatives are
24
Here we used h
2
/r
1
=
_
m
2
c
4
+p
2
2
c
2
/r
1
= 0 and h
2
/p
1
= h
1
/r
2
=
h
1
/p
2
= 0.
15.2. INERTIAL TRANSFORMATIONS IN MULTIPARTICLE SYSTEMS551
2
H
p
1
r
1
,= 0
2
H
p
2
r
2
,= 0
Therefore, only pairs of arguments (p
1
, r
1
) and (p
2
, r
2
) are allowed to be
together in H and we can represent the full Hamiltonian in the form
H = H
1
(p
1
, r
1
) + H
2
(p
2
, r
2
)
This result means that the force acting on the particle 1 does not depend on
the state (position and momentum) of the particle 2.
f
1
=
p
1
t
= [p
1
, H]
P
= [p
1
, H
1
(p
1
, r
1
)]
P
and vice versa. Therefore, both particles move independently, i.e., there
is no interaction. This already proves the statement of the Theorem. A
stronger result can be obtained if we disregard the possibility of non-cluster
separable (or long-range) interactions. From the Poisson bracket with the
total momentum we obtain
0 = [P, H]
P
= [p
1
+p
2
, H]
P
=
H
1
(p
1
, r
1
)
r
1
H
2
(p
2
, r
2
)
r
2
(15.63)
Since two terms on the right hand side of (15.63) depend on dierent vari-
ables, we must have
H
1
(p
1
, r
1
)
r
1
=
H
2
(p
2
, r
2
)
r
2
= C
where C is a constant vector. Then the Hamiltonian can be written in the
form
H = H
1
(p
1
) + H
2
(p
2
) +C (r
1
r
2
)
552 CHAPTER 15. PARTICLES AND RELATIVITY
To ensure the cluster separability (=short range) of the interaction we must
set C = 0. Then the resulting form of the Hamiltonian H = H
1
(p
1
)+H
2
(p
2
)
implies that the force acting on the particle 1 vanishes
f
1
=
p
1
t
= [p
1
, H]
P
= [p
1
, H
1
(p
1
)]
P
= 0
The same is true for the force acting on the particle 2.
The above theorem shows us that if particles have Lorentz-invariant
worldlines, then they are not interacting. In special relativity, Lorentz
transformations are assumed to be exactly and universally valid (see As-
sertion I.1). Then the theorem leads to the conclusion that inter-particle
interactions are simply impossible. This justies the common name the
no-interaction theorem. Of course, it is absurd to think that there are no
interactions in nature. So, in current literature there are two interpretations
of this result. One interpretation is that the Hamiltonian dynamics cannot
properly describe interactions. Then a variety of non-Hamiltonian versions of
dynamics were suggested [Kei, DW65, Pol85]. Another view is that variables
r and p do not describe real observables of particle positions and momenta,
or even that the notion of particles themselves becomes irrelevant in quantum
eld theory. Quite often the Currie-Jordan-Sudarshan theorem is considered
as an evidence that particle-based description of nature is not adequate and
one should seek a eld-based approach [Boy08].
However, we reject both these explanations. The non-Hamiltonian ver-
sions of particle dynamics contradict fundamental postulates of relativistic
quantum theory, which were formulated and analyzed throughout this book.
We also would like to stick to the idea that physical world is described by
particles with well-dened positions, momenta, spins, etc. These physical
particles do interact via instantaneous potentials obtained in the dressed
particle approach to QFT. So, for us the only way out of the paradox is to
admit that Lorentz transformations of special relativity are not applicable to
observables of interacting particles. Then from our point of view, it is more
appropriate to call Theorem 15.3 the no-Lorentz-transformation theorem.
This result simply conrms our earlier conclusion that boost transformations
are not kinematical and not universal. In contrast to the special-relativistic
Assertion I.1, boost transformations of observables of individual particles
should depend on the observed system, its state and on interactions acting
in the system. So, boost transformations are dynamical.
15.3. COMPARISON WITH SPECIAL RELATIVITY 553
15.3 Comparison with special relativity
In this section we would like to discuss the physical signicance of our con-
clusion about the dynamical character of boosts and its contradiction with
Einsteins special relativity. In subsection 15.3.1 we will analyze existing
proofs of Lorentz transformations and show that these proofs do not ap-
ply to transformations of observables of interacting particles. In subsection
15.3.2 we are going to discuss experimental verications of special relativity.
We will see that in most cases these experiments are dealing with free parti-
cles, for which our theory makes the same predictions as the standard special
relativity. When genuine interacting systems are observed (such as decaying
particles) the measurements are not accurate enough to register deviations
from Einsteins theory. So, it is not surprising that this theory has withstood
all experimental tests so far. In subsections 15.3.3 - 15.3.6 we will suggest
that such fundamental assertions of relativistic theories as the manifest co-
variance and the 4-dimensional Minkowski space-time continuum should be
rejected.
15.3.1 On derivations of Lorentz transformations
Einstein based his special relativity [Ein05] on two postulates. One of them
was the principle of relativity. The other was the independence of the speed
of light on the velocity of the source and/or observer. Both these statements
remain true in our theory as well (see our Postulate 2.1 and Statement 5.1).
Then Einstein discussed a series of thought experiments with measuring rods,
clocks and light rays, which demonstrated the relativity of simultaneity, the
length contraction of moving rods and the slowing-down of moving clocks.
These observations were formalized in Lorentz formulas (15.5) - (15.8), which
supposedly connected times and positions of a localized event in dierent
moving reference frames. As we demonstrated in Theorem 15.1, our approach
leads to exactly the same transformation laws for events associated with
non-interacting particles. So far our approach and special relativity are in
complete agreement.
Note that although Einsteins relativity postulate had universal appli-
cability to all kinds of events and processes, his invariance of the speed of
light postulate is only relevant to freely propagating light pulses. So, strictly
speaking, all conclusions made in [Ein05] can be applied only to space and
time coordinates of events (such as intersections of light pulses) related in
554 CHAPTER 15. PARTICLES AND RELATIVITY
some way to the propagation of light. Nevertheless, in his work Einstein
tacitly assumed
25
that the same conclusions should be extended to all other
events independent on their physical nature and on involved interactions.
26
There is a large number of publications [Sch84, Fie97, LK75, LL76, Sar82,
Pol], which claim that Lorentz transformation formulas (15.5) - (15.8) can
be derived even without using the Einsteins second postulate. However,
these works do not look conclusive. There are two common features in these
derivations, which we nd troublesome. First, they assume an abstract (i.e.,
independent on real physical processes and interactions) nature of events oc-
cupying space-time points (t, x, y, z). Second, they postulate the isotropy
and homogeneity of space around these points [Sch84, Fie97, LK75, LL76].
The main problem with these approaches is that in physics we should be
interested in transformations of observables of real interacting particles, not
of abstract space-time points. One cannot make an assumption that trans-
formations of these observables are completely independent of what occurs in
the space surrounding the particle and what are interactions of this particle
with the rest of the observed system.
One can reasonably assume that all directions in space are exactly equiv-
alent for a single isolated particle [Sar82], but this is not at all obvious when
the particle participates in interactions with other particles. Suppose that
we have two interacting particles 1 and 2 at some distance from each other.
Suppose that we want to derive boost transformations for observables of the
particle 1. Clearly, for this particle dierent directions in space are not equiv-
alent: For example, the direction pointing to the particle 2 is dierent from
other directions. So, the assumption of spatial isotropy cannot be applied in
this case.
Sometimes the following argument is presented in order to justify the ap-
plicability of (15.5) - (15.8) even for interacting systems. Suppose that we
have two events A and B having the same coordinates (r, t) in the frame at
rest. Suppose also that event A is related to light pulses (therefore, Lorentz
formulas are exactly applicable to it), but event B is associated with some
interacting system. If space-time coordinates of A and B transform by dif-
ferent formulas, then we will have a seemingly intolerable situation in which
events A and B coincide in the frame at rest, but they occur at dierent
space-time points if observed from a moving frame. Therefore, the argument
25
and this assumption was being repeated in all relativity textbooks ever since
26
see Assertion I.1
15.3. COMPARISON WITH SPECIAL RELATIVITY 555
goes, all events, independent on their physical nature, must transform by the
same universal formulas (15.5) - (15.8). Though seemingly reasonable, the
above argument is not convincing. There is absolutely no experimental or
theoretical support for the above coincidence postulate (i.e., that events,
overlapping in one frame, overlap in all other frames as well).
Thus we conclude that there are no compelling theoretical reasons to
believe in the universal validity of Lorentz transformations (15.5) - (15.8).
Note that special relativity rst postulates these transformations and then
tries to formulate dynamical (interacting) theories, which conform with this
assumption. Our approach to relativistic dynamics is fundamentally dier-
ent, in fact, opposite. We start with formulation of relativistic (=Poincare
invariant) interacting theory. Then we derive boost transformations of par-
ticle observables using standard formulas of quantum theory and see
27
that
they are dierent from universal Lorentz formulas (15.5) - (15.8). Correct
boost transformations depend on the state of the observed multiparticle sys-
tem and on interactions acting there.
We also see
28
that geometric universality of boosts contradicts the (well-
established) dynamical character of time translations. A theory in which time
translations are dynamical while space translations, rotations and boost are
kinematical cannot be invariant with respect to the Poincare group. So,
ironically, the assumptions of kinematical boosts, universal Lorentz trans-
formations and invariant worldlines are in conict with the principle of
relativity. This contradiction is the main reason for the no interaction
Theorem 15.3.
15.3.2 On experimental tests of special relativity
Supporters of special relativity usually invoke an argument that predictions of
this theory were conrmed by experiment with astonishing precision. This
is, indeed, true. However, at a closer inspection it appears that existing
experiments cannot distinguish between special relativity and the approach
presented in this book. In some cases, this is because two theories really
agree. In other cases, the disagreement is so small that the required precision
is out of reach for modern technology.
From the preceding discussion it should be clear that Lorentz formulas
27
in section 15.2
28
in subsection 15.2.7
556 CHAPTER 15. PARTICLES AND RELATIVITY
of special relativity are exactly applicable to observables of non-interacting
particles and to total observables of any physical system, whether interacting
or not. It appears that almost all experimental tests of special relativity
are concerned with these kinds of measurements: they either look at non-
interacting (free) particles or at total observables in a compound system.
Below we briey discuss several major classes of such experiments [NFRS78,
Mac86, Sch, Rob00].
One class of experiments is related to measurements of the frequency
(energy) of light and its dependence on the movement of the source and/or
observer. These Doppler eect experiments [IS38, IS41, KPR85, HMS79] can
be formulated either as measurements of the photons energy dependence on
the velocity of the source (or observer) or as velocity dependence of the energy
level separation in the source. These two interpretations were discussed in
subsections 5.3.4 and 6.4.2, respectively. In the former interpretation, one
is measuring the energy (or frequency) of a free particle - the photon. In
the latter interpretation, measurements of the total energy (dierences) in
an interacting system are performed. In both these formulations, predictions
of our theory exactly coincide with special relativity.
Another class of experiments is concerned with measuring the speed of
light and conrming its independence on the movement of the source and/or
observer. This class includes interference experiments of Michelson-Morley
and Kennedy-Thorndike as well as direct speed measurements [AFKW64].
These experiments are performed with free photons or light rays, so, again,
our theory and special relativity make exactly the same predictions for such
non-interacting systems. The same is true for tests of relativistic kinemat-
ics, which include relationships between velocities, momenta and energies of
free massive particles as well as changes of these parameters after particle
collisions or decays.
An exceptional type of experiment where one can, at least in principle,
observe the dierences between our theory and special relativity is the decay
of fast moving unstable particles. In this case we are dealing with a physical
system in which the interaction acts during a long time interval (of the order
of particles lifetime) and there is a clearly observable time-dependent pro-
cess (the decay) which is controlled by the strength of the interaction. The
relativistic time dilation in decays of moving particles has been considered
in chapter 13.
15.3. COMPARISON WITH SPECIAL RELATIVITY 557
15.3.3 Poincare invariance vs. manifest covariance
From our above discussion it should be clear that there are two rather dier-
ent approaches to constructing relativistic theories. One is the traditional ap-
proach pioneered by Einstein and Minkowski and used in theoretical physics
ever since. This approach accepts without proof the validity of Assertion I.1
(the universality of Lorentz transformations) and its various consequences,
like Assertions I.2 (no superluminal signaling) and I.3 (manifest covariance).
It also assumes the existence of space-time, its 4-dimensional geometry and
universal tensor transformations of space-time coordinates of events. The
distinguishing feature of this approach is that boost transformations of ob-
servables are interaction-independent and universal. We will call it the man-
ifestly covariant approach.
In this book we take a somewhat dierent viewpoint on relativity. We
would like to call it the Poincare invariant approach. It is built on two fun-
damental Postulates: the principle of relativity (Postulate 2.1) and the laws
of quantum mechanics from sections 1.5 and 1.6. From these Postulates we
found that transformations of any observable F induced by inertial transfor-
mations of the observer can be obtained by applying an unitary representa-
tion of the Poincare group. For example, if F is an operator of observable in
the reference frame at rest, then in the moving frame the same observable is
represented by the transformed operator
F
= e
ic
Fe
ic
T(r, t) = t(r, t)
According to our postulate 1.2, the (assumed) existence of such an observable
implies existence of states (eigenstates of
T) in which time acquires a denite
xed value t
0
. The wave function of the particle in such a state is zero at all
times except (small neighborhood of the) time t
0
. Physically this means that
the particle was created spontaneously out of nothing, existed for a short
31
Recall that in our denition (see Introduction) observers are instantaneous: each
observer is characterized by a denite time label.
15.3. COMPARISON WITH SPECIAL RELATIVITY 561
time interval around t
0
and then disappeared. Such states violate all kinds
of conservation laws and they are clearly unphysical.
15.3.5 Is geometry 4-dimensional?
Our position in this book is that there is no symmetry between space and
time coordinates. So, there is no need for a 4-dimensional background
continuum of special relativity. All we care about (in both experiment and
in theory) are particle observables (e.g., positions and momenta) and how
they transform with respect to inertial transformations (e.g., time transla-
tions and boosts) of observers. Particle observables are given by Hermitian
operators in the Hilbert space of the system. Inertial transformations enter
the theory through the unitary representation of the Poincare group in the
same Hilbert space. Once these ingredients are known, one can calculate
the eect of any transformation on any observable. To do that, there is no
need to make assumptions about the symmetry between space and time
coordinates and to introduce a 4-dimensional spacetime geometry. The clear
evidence for non-universal, non-geometrical and interaction-dependent char-
acter of boost transformations was obtained in section 15.2. So, we suggest
that 4D Minkowski space-time should not be used in physical theories at all.
Historical and philosophical discussion of the idea that relativistic eects
(such as length contraction and time dilation) result from dynamical behavior
of individual physical systems rather than from kinematical properties of the
universal space-time continuum can be found in the book [Bro05]. In our
work we go further and claim that the dierence between dynamical and
kinematical approaches is not just philosophical one. It has real observable
consequences. We have shown in section 15.2 that boost transformations are
interaction-dependent and that they cannot be reduced to simple universal
Lorentz formulas or pseudo-rotations in the Minkowski space-time. Then
the ideas of the universal pseudo-Euclidean space-time continuum and of
the manifest covariance of physical laws
32
can be accepted only as approx-
imations. Additional physical arguments against the notion of Minkowski
space-time can be found in [Bac04].
We cannot deny that the Minkowski space-time idea turned out to be very
fruitful in the formalism of quantum eld theory. However, in section 15.5
we will take a viewpoint that quantum elds are just formal mathematical
32
see Assertions I.1 and I.3
562 CHAPTER 15. PARTICLES AND RELATIVITY
objects and that the 4-dimensional manifold on which the elds are dened
has nothing to do with real physical space and time.
15.3.6 Dynamical relativity
From the dynamical character of boosts advocated in this book one can pre-
dict some curious eects which, nevertheless, do not contradict any experi-
mental observations. For example, our approach implies that two measuring
rods made from dierent materials (e.g., wood and tungsten) may contract
in slightly dierent ways when viewed from a moving frame of reference. An-
other consequence is that Einsteins time dilation formula (I.21) may be not
accurate. See section 13.
The most signicant dierence between our approach and special rela-
tivity concerns the eect of boosts on interacting systems. Let us see how
an isolated system is seen by time-translated and boosted observers. In our
cartoon 15.3 we placed images of the same system on the plane t , where
t is the time parameter of the observer and is its rapidity. Our approach
and special relativity agree about the eect of time translations: As the time
parameter increases, the system may undergo some dramatic changes (e.g.,
an explosion) caused by internal forces acting in the system. These changes
result from the presence of interaction V in the Hamiltonian (the generator
of time translations H = H
0
+ V ) that describes the system.
The fundamental disagreement is about the eects of boosts. From the
point of view of special relativity, the boosted observer sees only simple
kinematical changes in the system. They include the change of the systems
velocity and relativistic contraction. These eects also take place in our
approach. However, in addition to them, we expect non-trivial changes,
which result from the presence of interaction Z in the generator of boost
transformations K = K
0
+ Z. For example, it is quite possible that for
suciently high boost parameters the system may look completely dierent,
e.g., exploded (the image in the upper left corner of Fig. 15.3). For this
reason our approach can be characterized as dynamical relativity in contrast
to kinematical relativity of Einsteins special theory.
15.3. COMPARISON WITH SPECIAL RELATIVITY 563
tt
TNT TNT
TNT
TNT
vv
vv
Figure 15.3: Non-trivial dynamics of an isolated interacting system as a
function of time (t) and boost parameter (). Three images on the t-axis
illustrate the usual time sequence of events associated with the piece of ex-
plosive and attached burning fuse. Three images on the axis show how
the unexploded device is perceived by a moving observer. If the observers
velocity (v = c tanh ) is low, then the observer sees a moving (unexploded)
device whose length is contracted along the velocitys direction. This trivial
(kinematical) change is predicted also by special relativity. At higher speeds
the moving observer may notice more signicant (dynamical) changes, e.g.,
the device may be seen as exploded. Such non-trivial changes result from
the interaction-dependence of boost generators predictions by RQD.
564 CHAPTER 15. PARTICLES AND RELATIVITY
15.4 Action-at-a-distance and causality
We saw in chapter 12 that RQD describes interactions between particles in
terms of instantaneous potentials. However, textbooks teach us that inter-
actions cannot propagate faster than light:
In non-relativistic quantum mechanics, it is straightforward to
construct Hamiltonians which describe particles interacting via
long-range forces (for a simple example, consider two charged par-
ticles interacting via a Coulomb force). However, the concept of a
long-range interaction prima facie requires some sort of preferred
reference frame, which seems to cast doubt upon the possibility
of constructing such an interaction in a relativistically covariant
way. D. Wallace [Walc]
The traditional viewpoint is that interactions between particles ought to be
retarded, i.e., they should propagate with the speed of light. The usual ar-
gument in favor of this hypothesis is the observation that faster-than-light
interactions violate the special relativistic ban on superluminal signals.
33
If
one accepts the validity of this ban, then logically there is no other choice,
but to accept also a eld-based approach, rather than the picture of directly
interacting particles advocated in this book. Indeed, interactions are al-
ways accompanied by redistribution of the momentum and energy between
particles. If we assume that interactions are retarded, then the transferred
momentum-energy must exist in some form while en route from one par-
ticle to another. This implies existence of some interaction carriers and
corresponding degrees of freedom not directly related to particle observables.
These degrees of freedom are usually associated with elds, e.g., the electro-
magnetic eld of Maxwells theory. In other words
...the interaction is a result of energy momentum exchanges be-
tween the particles through the eld, which propagates energy and
momentum and can transfer them to the particles by contact. F.
Strocchi [Str04]
The eld concept came to dominate physics starting with the work
of Faraday in the mid-nineteenth century. Its conceptual advan-
tage over the earlier Newtonian program of physics, to formulate
33
see Appendix I.3
15.4. ACTION-AT-A-DISTANCE AND CAUSALITY 565
the fundamental laws in terms of forces among atomic particles,
emerges when we take into account the circumstance, unknown to
Newton (or, for that matter, Faraday) but fundamental in special
relativity, that inuences travel no farther than a nite limiting
speed. For then the force on a given particle at a given time
cannot be deduced from the positions of other particles at that
time, but must be deduced in a complicated way from their pre-
vious positions. Faradays intuition that the fundamental laws
of electromagnetism could be expressed most simply in terms of
elds lling space and time was of course brilliantly vindicated in
Maxwells mathematical theory. F. Wilczek [Wil99]
In this section we will argue against the above logic. Our point is that if the
dynamical character of boosts is properly taken into account, then instan-
taneous action-at-a-distance does not contradict the principle of causality in
all reference frames.
15.4.1 Retarded interactions in Maxwells theory
Let us consider two charged classical particles 1 and 2 repelling each other in
two scenarios. In scenario I, the particles are propagating without inuence
of any external force, subject to only their mutual interaction. Their classical
trajectories are shown by dashed lines CAG and DBEF in Fig. 15.4(b). In
scenario II, at time t = 0 (point A on the trajectory of particle 1) particle 1
experiences some external force. For example, this could be an impact from
a third particle,
34
which changes the trajectory of the particle 1 to the one
shown by a full line CAG
H
d
t
r
1
e
H
d
t
p
1
(t) = e
i
H
d
t
p
1
e
H
d
t
r
2
(t) = e
i
H
d
t
r
2
e
H
d
t
p
2
(t) = e
i
H
d
t
p
2
e
H
d
t
and the force acting on particle 2
f
2
(t) =
d
dt
p
2
(t) =
i
[p
2
(t), H
d
]
f
2
(r
1
(t), p
1
(t); r
2
(t), p
2
(t)) (15.64)
depends on positions and momenta of both particles at the same time instant
t. Applying these formulas to the scenario II, we obtain trajectory BEF
shown in Fig. 15.4(a): The eect of the collision at point A is felt by the
particle 2 immediately and this particle changes its course at time t = 0, i.e.,
right after point B.
The instantaneous potential described above is only one part of the total
interaction between charged particles in RQD. There is also an additional
568 CHAPTER 15. PARTICLES AND RELATIVITY
retarded interaction whose origin can be explained as follows. The impact on
the particle 1 at point A creates bremsstrahlung photons.
35
These photons,
being real massless particles, propagate with the speed c away from the
particle 1 (thin dotted line in Fig. 15.4(a)). There is a chance that such a
photon will reach particle 2 at point E of its trajectory and force 2 to change
its course again (EF
and DBEF
in Fig. 15.4(a).
15.4.3 Does action-at-a-distance violate causality?
The instantaneous propagation of interactions in RQD is in sharp contradic-
tion with Assertion I.2 of special relativity, which says that no signal may
propagate faster than light. So, we need to explain this paradox.
The impossibility of superluminal signals is usually proven by applying
Lorentz transformations to space-time coordinates of two causally related
events and claiming that there exists a moving frame in which the temporal
order of these events is reversed.
37
However, we know from subsection 15.2.9
that for systems with interactions Lorentz transformations are no longer ex-
act. So, the special-relativistic ban on superluminal propagation of interac-
tions may not be valid as well.
Consider again the two-particle interacting system discussed in subsection
15.4.2, this time from the point of view of a moving reference frame O
.
Trajectories of particles 1 and 2 in this frame are
38
r
1
(, t
) = e
ic
e
i
H
d
t
r
1
e
H
d
t
e
ic
p
1
(, t
) = e
ic
e
i
H
d
t
p
1
e
H
d
t
e
ic
r
2
(, t
) = e
ic
e
i
H
d
t
r
2
e
H
d
t
e
ic
p
2
(, t
) = e
ic
e
i
H
d
t
p
2
e
ic
H
d
t
e
i
35
See section 13.3.
36
For similar ideas about electromagnetic interactions being composed of both instan-
taneous and retarded parts see [CSR96, Fie97, Fie06, Kho06].
37
see Appendix I.3
38
Here t
is
H
d
() = e
ic
H
d
e
ic
(15.65)
therefore the force acting on the particle 2 in this frame
f
2
(, t
) =
d
dt
p
2
(, t
) =
i
[p
2
(, t
), H
d
()]
=
ic
[e
e
i
H
d
t
p
2
e
H
d
t
e
ic
, e
Kc
H
d
e
ic
]
=
i
ic
[e
i
H
d
t
p
2
e
H
d
t
, H
d
]e
ic
=
i
ic
[p
2
(0, t
), H
d
]e
ic
= e
ic
f
2
(0, t
)e
ic
= e
ic
f
2
(r
1
(0, t
), p
1
(0, t
); r
2
(0, t
), p
2
(0, t
))e
ic
= f
2
(r
1
(, t
), p
1
(, t
); r
2
(, t
), p
2
(, t
)) (15.66)
is a function of positions and momenta of both particles at the same time
instant t
AA
Figure 15.5: A beam of light impinging on the glass-air interface. (a) If
the incidence angle is greater than the Brewster angle, then all light is
reected at the surface. The region of evanescent light is shown by vertical
dashed lines. (b) If a second piece of glass is placed near the interface, then
evanescent light is converted to the normal light propagating into the
second piece of glass (G2).
572 CHAPTER 15. PARTICLES AND RELATIVITY
15.5 Are quantum elds necessary?
The general idea of RQD is that particles are the most fundamental ingredi-
ents of nature and that everything we know in physics can be explained as
manifestations of quantum behavior of particles interacting with each other
at a distance. If this idea is correct, then the notion of elds becomes re-
dundant. On the other hand, it is also true that (quantum) elds are in the
center of all modern relativistic quantum theories and we actually started
our formulation of RQD from the quantum eld version of QED in section
9.1. This surely looks like a contradiction. Then we are pressed to answer the
following question: what is the role of quantum elds in relativistic quantum
theory?
15.5.1 Dressing transformation in a nutshell
Before discussing the meaning of quantum elds, let us now review the pro-
cess by which we arrived to the nite dressed particle Hamiltonian H
d
=
H
0
+ V
d
in sections 12.1 and 13.6. We started with the QED Hamiltonian
H = H
0
+V in subsection 9.1.2 (the upper left box in Fig. 15.6) and demon-
strated some of its good properties, such as the Poincare invariance and the
cluster separability. However, when we used this Hamiltonian to calculate
the S-operator beyond the lowest non-vanishing perturbation order (arrow
(1) in Fig. 15.6) we obtained meaningless innite results. The solution to this
problem was given by renormalization theory in chapter 10 (arrow (2)): in-
nite counterterms were added to the Hamiltonian H and a new Hamiltonian
was obtained H
c
= H
0
+V
c
. Although the Hamiltonian H
c
was innite, these
innities canceled in the process of calculation of the S-operator (dashed ar-
row (3)), and very accurate values for observable scattering cross-sections
and energies of bound states were obtained (arrow (4)). As a result of the
renormalization procedure, the divergences were swept under the rug, and
this rug was the Hamiltonian H
c
. This Hamiltonian was not satisfactory:
First, in the limit of innite cuto matrix elements of H
c
on bare particle
states were innite. Second, the Hamiltonian H
c
contained unphys terms
like a
and a
(x, 0)
=
_
dp
_
2(2)
p
e
i
(px)
p
+
_
dp
_
2(2)
p
e
(px)
p
The annihilation part
+
(x, 0) of this expression cannot be regarded as an
operator annihilating a particle at the space point x. The correct expression
for such an operator would be
47
(x) =
1
(2)
3/2
_
dpe
i
(px)
p
where the denominator
p
is absent. From the annihilation operator (x)
and its conjugate creation operator
(x, 0)
+
(x, 0) does not
have such a simple interpretation [WWS
+
12].
As can be seen from formulas for scattering operators in subsection 7.1.2
and from equations (9.9) - (9.10), the parameters t and x are just integration
variables and they are not present in the nal expression for the fundamental
measurable quantity calculated in QFT - the S-matrix.
We certainly agree with the following two quotes:
Every physicist would easily convince himself that all quantum cal-
culations are made in the energy-momentum space and that the
Minkowski x
Gh
1
h
2
c
4
r
Gh
2
p
2
1
h
1
c
2
r
Gh
1
p
2
2
h
2
c
2
r
+
7G(p
1
p
2
)
2c
2
r
+
G(p
1
r)(p
2
r)
2c
2
r
3
+
G
2
h
1
h
2
(h
1
+ h
2
)
2c
8
r
2
. . . (16.1)
where r r
1
r
2
and G is the gravitational constant. The remarkable
analogy with the spin-independent part of the Darwin-Breit Hamiltonian
1
or, more generally, in the Fock space with variable number of particles
2
Interactions in (16.1) are supposed to be accurate within (v/c)
2
approximation. Note
also that these interactions are symmetric with respect to the particle interchange 1 2.
The ellipsis indicates yet unknown interaction terms of higher orders in (v/c).
16.1. TWO-BODY PROBLEM 583
(14.1) + (14.2) from classical electrodynamics is quite obvious [Ker62]. In-
terestingly, equation (16.1) indicates that the source of gravity is particles
energy h
i
(m
2
i
c
4
+ p
2
i
c
2
)
1/2
. This is dierent from both electrodynamics
and Newtonian gravity where the roles of coupling constants are played by
charges and masses, respectively. This will be important in our discussion of
the equivalence principle in section 16.2. Another remarkable feature is that
Hamiltonian (16.1) is well-dened for both massive and massless particles.
We will see later that the same Hamiltonian describes well both the orbit of
Mercury and the bending of light near the Sun. Relativistic invariance of our
Hamiltonian theory is proved in Appendix N.4.
For massive (m
i
> 0) particles whose velocities are small in comparison
with the speed of light (p
i
m
i
c) we can use approximation
h
i
m
i
c
2
+
p
2
i
2m
i
p
4
i
8m
3
i
c
2
+ . . .
Then, omitting inconsequential rest energies m
i
c
2
, the two-particle Hamilto-
nian takes the form
H
p
2
1
2m
1
+
p
2
2
2m
2
Gm
1
m
2
r
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
3Gm
2
p
2
1
2m
1
c
2
r
3Gm
1
p
2
2
2m
2
c
2
r
+
7G(p
1
p
2
)
2c
2
r
+
G(p
1
r)(p
2
r)
2c
2
r
3
+
G
2
m
1
m
2
(m
1
+ m
2
)
2c
2
r
2
(16.2)
This expression coincides with the famous Einstein-Infeld-Homann Hamil-
tonian for interacting point masses, which was derived in the (v/c)
2
approx-
imation to general relativity.
3
The rst three terms in (16.2) are recognized
as the usual Hamiltonian of classical Newtonian gravity.
Unlike textbook presentations, we will not assume that general relativity
is the exact theory of gravity and that Hamiltonian (16.1) is only a rough
approximation to this theory. Instead, we will maintain that the true com-
plete and rigorous formulation of gravitational dynamics has a Hamiltonian
form with instantaneous interaction potentials, like in (16.1). We will remain
3
see [EIH38] and 106 in [LL73]
584 CHAPTER 16. QUANTUM THEORY OF GRAVITY
agnostic about the fundamental origin of (16.1). Nevertheless, we will ap-
preciate the fact that this Hamiltonian can successfully describe experiments
and observations,
4
and that it is fully appropriate as a basis for relativistic
quantum theory of gravity.
16.1.3 Precession of the Mercurys perihelion
Explanation of the Mercurys perihelion shift was the rst success of Ein-
steins general theory of relativity. Let us see if this eect can be repro-
duced within our approach. In this subsection we will use the Einstein-
Infeld-Homann Hamiltonian (16.2) and apply usual procedures of classical
Hamiltonian mechanics in order to calculate the Mercurys orbit.
As we mentioned earlier, the Hamiltonian (16.2) applies to structureless
elementary particles, so, strictly speaking, it is not valid for the system of
macroscopic bodies Sun-Mercury. However, within desired accuracy we can
safely neglect such things as 3-body forces,
5
tidal eects and spinning mo-
tions. In this case, it becomes reasonable to represent Mercury and Sun as
material points with masses m
1
and m
2
, respectively. We will use index 1 to
denote variables referring to the light body - Mercury and index 2 will refer to
the heavy body - Sun. Since Sun is much heavier than Mercury (m
2
m
1
),
we can simplify the Hamiltonian (16.2). In the center-of-mass reference frame
we have the relationship p
1
= p
2
, which implies v
2
p
2
/m
2
v
1
, so the
speed of Sun can be neglected, when compared with the speed of Mercury.
In a reasonable approximation, the Sun can be assumed to rest in the ori-
gin of the coordinate system r
2
= 0. Moreover, the terms having m
2
in
their denominators can be omitted in (16.2), thus leading to the following
approximate Hamiltonian
H
p
2
1
2m
1
Gm
1
m
2
r
p
4
1
8m
3
1
c
2
3Gm
2
p
2
1
2m
1
c
2
r
+
7G(p
1
p
2
)
2c
2
r
+
G(p
1
r)(p
2
r)
2c
2
r
3
+
G
2
m
1
m
2
2
2c
2
r
2
(16.3)
The rst Hamiltons equation of motion yields
4
Most observable relativistic gravitational eects appear in the order (v/c)
2
. So, we
will be working in this approximation.
5
see subsection 16.2.3
16.1. TWO-BODY PROBLEM 585
dp
1
dt
=
H
r
1
Gm
1
m
2
r
r
3
3Gm
2
p
2
1
r
2m
1
c
2
r
3
+
7G(p
1
p
2
)r
2c
2
r
3
G(p
2
r)p
1
2c
2
r
3
G(p
1
r)p
2
2c
2
r
3
+
3G(p
1
r)(p
2
r)r
2c
2
r
5
+
G
2
m
1
m
2
2
r
c
2
r
4
(16.4)
dp
2
dt
=
H
r
2
=
dp
1
dt
(16.5)
Mercurys velocity is obtained from the second Hamiltons equation of motion
dr
1
dt
=
H
p
1
p
1
m
1
p
2
1
p
1
2m
3
1
c
2
3Gm
2
p
1
m
1
c
2
r
+
7Gp
2
2c
2
r
+
G(p
2
r)r
2c
2
r
3
(16.6)
To obtain Mercurys acceleration we dierentiate equation (16.6) on time
and use results (16.4) - (16.6) to obtain
d
2
r
1
dt
2
p
1
m
1
p
2
1
p
1
2m
3
1
c
2
(p
1
p
1
)p
1
m
3
1
c
2
3Gm
2
p
1
m
1
c
2
r
+
3Gm
2
(r r)p
1
m
1
c
2
r
3
+
7G p
2
2c
2
r
7G(r r)p
2
2c
2
r
+
G( p
2
r)r
2c
2
r
3
+
G(p
2
r)r
2c
2
r
3
+
G(p
2
r) r
2c
2
r
3
3G(p
2
r)(r r) r
2c
2
r
5
Gm
2
r
r
3
Gm
2
p
2
1
r
m
2
1
c
2
r
3
+
4Gm
2
(p
1
r)p
1
m
2
1
c
2
r
3
+
4G
2
m
2
2
r
c
2
r
4
(16.7)
We can make farther simplications by noticing that the total angular mo-
mentum vector
J
0
[r
1
p
1
] + [r
2
p
2
] (16.8)
has zero Poisson bracket with the total Hamiltonian (16.3) and therefore does
not change with time. Hence the vector
586 CHAPTER 16. QUANTUM THEORY OF GRAVITY
j
J
0
m
1
=
[r p
1
]
m
1
is constant, both p
1
and r r
1
are orthogonal to the xed direction of j and
the orbit is conned within a plane perpendicular to j. Furthermore, we will
assume that the planets orbit is nearly circular. Then we can write
(p
1
r) 0 (16.9)
j
rp
1
m
1
(16.10)
and the third term on the right hand side of (16.7) can be omitted. So, the
equation of motion, which describes the Mercurys orbit, takes the form
d
2
r
dt
2
Gm
2
r
r
3
Gm
2
p
2
1
r
m
2
1
c
2
r
3
+
4G
2
m
2
2
r
c
2
r
4
(16.11)
In the plane of the orbit we can introduce polar coordinates (r, ) and
two mutually orthogonal unit vectors r r/r and
, such that
r = rr
r = rr + r
r = ( r r
2
)r + (2 r
+ r
[r
] = j/j
[r r] = r
2
j/j (16.12)
The acceleration (16.11) is directed along the vector r, so we can write
r r
2
=
Gm
2
r
2
Gm
2
p
2
1
m
2
1
c
2
r
2
+
4G
2
m
2
2
c
2
r
3
(16.13)
In order to evaluate the rate of change of the angle , we use (16.6) and
(16.10) to write
16.1. TWO-BODY PROBLEM 587
[r r] r
_
p
1
m
1
p
2
1
p
1
2m
3
1
c
2
3Gm
2
p
1
m
1
c
2
r
_
= j
_
1
p
2
1
2m
2
1
c
2
3Gm
2
c
2
r
_
j
_
1
j
2
2c
2
r
2
3Gm
2
c
2
r
_
A comparison with equation (16.12) yields
j
r
2
_
1
j
2
2c
2
r
2
3Gm
2
c
2
r
_
Substituting this result in (16.13) we obtain
0 r
j
2
r
3
_
1
j
2
2c
2
r
2
3Gm
2
c
2
r
_
2
+
Gm
2
r
2
+
Gm
2
j
2
c
2
r
2
4G
2
m
2
2
c
2
r
3
r
j
2
r
3
+
Gm
2
r
2
+
7Gm
2
j
2
c
2
r
4
4G
2
m
2
2
c
2
r
3
+
j
4
c
2
r
5
(16.14)
For our purposes it is more convenient to nd the dependence of r on the
angle rather than the time dependence r(t). To do that, we introduce a new
variable u 1/r and obtain the following relationship between t-derivatives
and -derivatives
d
dt
=
d
d
ju
2
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
d
d
Then we can express the second time derivative of r in the form
r =
_
ju
2
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
d
d
_ _
ju
2
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
d
d
_
1
u
= j
2
u
2
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
d
d
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
du
d
= j
2
u
2
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
_
_
j
2
u
c
2
3Gm
2
c
2
__
du
d
_
2
+
_
1
j
2
u
2
2c
2
3Gm
2
u
c
2
_
d
2
u
d
2
_
j
2
u
2
_
_
j
2
u
c
2
3Gm
2
c
2
__
du
d
_
2
+
_
1
j
2
u
2
c
2
6Gm
2
u
c
2
_
d
2
u
d
2
_
588 CHAPTER 16. QUANTUM THEORY OF GRAVITY
and equation (16.14) can be rewritten as
0 =
d
2
u
d
2
j
2
u
2
_
1
j
2
u
2
c
2
6Gm
2
u
c
2
_
+
_
du
d
_
2
j
2
u
2
_
j
2
u
c
2
+
3Gm
2
c
2
_
j
2
u
3
+ Gm
2
u
2
+
7Gm
2
j
2
u
4
c
2
4G
2
m
2
2
u
3
c
2
+
j
4
u
5
c
2
=
d
2
u
d
2
_
1
j
2
u
2
c
2
6Gm
2
u
c
2
_
+
_
du
d
_
2
_
j
2
u
c
2
+
3Gm
2
c
2
_
u +
Gm
2
j
2
+
7Gm
2
u
2
c
2
4G
2
m
2
2
u
c
2
j
2
+
j
2
u
3
c
2
(16.15)
To solve this equation let us rst consider the non-relativistic case in
which all terms proportional to c
2
are omitted
0
d
2
u
d
2
u +
Gm
2
j
2
(16.16)
It is easy to show that this equation is satised by orbits of elliptical shape
6
u() = L
1
+ Acos (16.17)
Substituting (16.17) in (16.16) we obtain
L
1
=
Gm
2
j
2
(16.18)
Now let us seek an approximate solution of the full relativistic equation
(16.15) in the form
u() = L
1
+ Acos (16.19)
6
Parameters L (semilatus rectum) and A of the orbit can be expressed through the
semimajor axis f and eccentricity e of the ellipse: L = f(1 e
2
) and A = eL
1
.
16.1. TWO-BODY PROBLEM 589
L
Sun
Mercury
2
Figure 16.2: Mercurys orbit shown schematically. The planets orbital move-
ment is counterclockwise. Full line - rst year; broken line - second year. 2
is the yearly precession angle.
In this ansatz we assume that the orbit basically keeps its elliptical shape
(16.17) and that relativistic
7
perturbation terms in equation (16.15) lead
to constant precession of the orbit with the rate (see Fig. 16.2). This
precession rate is the quantity we are interested in. To nd it, we substitute
(16.19) and
du
d
= A(1 ) sin
d
2
u
d
2
= A(1 )
2
cos
in (16.15)
8
0 A(1 )
2
cos
_
1
j
2
L
2
c
2
6Gm
2
Lc
2
_
L
1
Acos
+
Gm
2
j
2
+
7Gm
2
(L
2
+ 2L
1
Acos )
c
2
4G
2
m
2
2
(L
1
+ Acos )
c
2
j
2
7
proportional to c
2
8
We also take into account that for the nearly circular orbit A L
1
, A
2
A.
Likewise, the term (du/d)
2
can be ignored because it is proportional to the small quantity
A
2
.
590 CHAPTER 16. QUANTUM THEORY OF GRAVITY
+
j
2
(L
3
+ 3L
2
Acos )
c
2
The right hand side of this equation contains terms of two types: those
independent on and those proportional to cos . These two types of terms
should vanish independently, thus leading to two equations
0 = L
1
+
Gm
2
j
2
+
7Gm
2
c
2
L
2
4G
2
m
2
2
c
2
j
2
L
+
j
2
c
2
L
3
0 = (1 )
2
_
1
j
2
L
2
c
2
6Gm
2
Lc
2
_
1 +
14Gm
2
c
2
L
4G
2
m
2
2
c
2
k
2
+
3j
2
c
2
L
2
From the rst equation we can obtain relativistic corrections to the relation-
ship (16.18), which determines the size of the orbit. This is not important
for our purposes. From the second equation and (16.18) we nd the desired
precession rate
(1 )
2
=
1 13j
2
/(L
2
c
2
)
1 7j
2
/(L
2
c
2
)
_
1
6j
2
L
2
c
2
_
3Gm
2
Lc
2
This means that the perihelion advances by the angle of
2
6Gm
2
Lc
2
(radian) after each full revolution, which agrees with astronomical observa-
tions and with predictions of general relativity. See, for example, equation
(8.6.11) in [Wei72] as well as [LL73, DD85].
16.1.4 Photons and gravity
As we mentioned earlier, the Hamiltonian (16.1) can describe interactions
between both massive bodies and massless particles. Let us consider the
case when the massive body 2 is very heavy (e.g., Sun) and particle 1 is
massless, e.g. a photon whose momentum satises inequality p
1
m
2
c.
Then in (16.1) we can take the limit m
1
0, replace h
1
p
1
c, h
2
m
2
c
2
,
16.1. TWO-BODY PROBLEM 591
Actual star position
Visible star position
x
z
Sun
Earth
R
Figure 16.3: Light bending and slowing in the Suns gravitational poten-
tial. Ticks on the photons trajectory indicate segments passed in equal time
intervals.
ignore the inconsequential rest energy m
2
c
2
of the massive body and obtain
a Hamiltonian accurate to the order (1/c)
H = p
1
c
2Gm
2
p
1
cr
(16.20)
which can be used to evaluate the motion of photons in the Suns gravitational
eld. The time derivative of the photons momentum is found from the rst
Hamiltons equation
dp
1
dt
=
H
r
1
=
2Gm
2
p
1
r
cr
3
(16.21)
This is a small correction of the order c
1
. In the c
0
approximation p
1
=
const, so we can assume that the photon moves with the speed c along the
straight line ( r
1x
= ct, r
1y
= 0, r
1z
= R) having the impact parameter R
equal to the radius of the Sun, which is located in the origin r
2x
= r
2y
=
r
2z
= 0. See Fig. 16.3. Then the accumulated momentum in the z-direction
is obtained by integrating the z-component of (16.21) on time
p
1z
_
2Gm
2
p
1
Rdt
c(R
2
+ c
2
t
2
)
3/2
=
4Gm
2
p
1
c
2
R
592 CHAPTER 16. QUANTUM THEORY OF GRAVITY
The negative sign of this expression indicates that the photon was being
attracted to the Sun. The deection angle
tan =
[p
1z
[
p
1
=
4Gm
1
c
2
R
coincides with the observed bending of starlight by the Suns gravity [Wil06].
The second Hamiltons equation
dr
1
dt
=
H
p
1
=
p
1
p
1
_
c
2Gm
2
cr
_
(16.22)
can be interpreted as gravitational reduction of the speed of light.
9
This
means that in the presence of gravity it takes photons an extra time to travel
the same path. Let us nd the time delay for a photon traveling from the
Suns surface to the observer on Earth. Denoting d the distance Sun - Earth
and taking into account that R d we obtain
t
1
c
_
d/c
0
2Gm
2
dt
c(R
2
+ c
2
t
2
)
1/2
=
2Gm
2
c
3
ln
_
2d
R
_
which agrees with the leading general-relativistic contribution to the Shapiro
time delay of radar signals near the Sun [Wil06].
16.2 Principle of equivalence
16.2.1 Free fall universality
Without losing the (v/c)
2
accuracy we can rewrite acceleration formula (16.7)
as
9
The limit h
1
p
1
c that was used to derive this result is also applicable to high-
energy massive particles (when their kinetic energy is much higher than the rest energy).
For this reason, the gravitationally reduced speed of light remains the limiting speed for
massive particles as well. I.e., no particle can move faster than c(1 2Gm
2
/(c
2
r)) in
the gravitational eld.
16.2. PRINCIPLE OF EQUIVALENCE 593
d
2
r
1
dt
2
Gm
2
r
r
3
Gm
2
v
2
1
r
c
2
r
3
+
4Gm
2
(v
1
r)v
1
c
2
r
3
+
4G
2
m
2
2
r
c
2
r
4
(16.23)
This demonstrates the remarkable fact that acceleration of the light particle
1 does not depend on its mass m
1
. In other words, in the gravitational eld
of the heavy mass m
2
all massive bodies move with the same acceleration
independent on their mass. This property is called the universality of free
fall.
Let us check that massless photons also have the same acceleration. Tak-
ing the time derivative of (16.22) we obtain photons acceleration
d
2
r
1
dt
2
=
p
1
c
p
1
c(p
1
p
1
)p
1
p
3
1
2Gm
2
p
1
cp
1
r
+
2Gm
2
(p
1
p
1
)p
1
cp
3
1
r
+
2Gm
2
(r r)p
1
cp
1
r
3
2Gm
2
r
r
3
+
2Gm
2
(p
1
r)p
1
p
2
1
r
3
+
4G
2
m
2
2
r
c
2
r
4
4G
2
m
2
2
(p
1
r)p
1
c
2
p
2
1
r
4
+
2Gm
2
(r p)p
1
p
2
1
r
3
4G
2
m
2
2
(p
1
r)p
1
c
2
p
2
1
r
4
=
2Gm
2
r
r
3
+
4Gm
2
(p
1
r)p
1
p
2
1
r
3
+
4G
2
m
2
2
r
c
2
r
4
8G
2
m
2
2
(p
1
r)p
1
c
2
p
2
1
r
4
(16.24)
In realistic situations the ratio Gm
2
/(c
2
r) is very small, so the last term
on the right hand side of (16.24) is much smaller than the 2nd term there.
Then, taking into account that (with the accuracy required here) for photons
v
1
c and p
1
/p
1
v
1
/v
1
, we can see that photons acceleration (16.24) is the
same as formula (16.23) for massive particles. So, we conclude that within
the (1/c)
2
approximation all particles (massive and massless) have the same
accelerations in the gravitational eld of a massive body m
2
. This conrms
the free fall universality principle.
16.2.2 Composition invariance of gravity
Having discussed two elementary particles, now we would like to address
gravitational interactions in multi-particle systems. Our conclusion about
594 CHAPTER 16. QUANTUM THEORY OF GRAVITY
the universal gravitational acceleration was based on the facts that in the
gravitational Hamiltonian
H
gr
h
1
+ h
2
Gh
1
h
2
c
4
r
12
(16.25)
both bodies were treated as material points and that their (gravitational )
masses m
1
, m
2
present in the potential energy term were exactly the same as
(inertial ) masses present in the kinetic energy terms h
i
=
_
m
2
i
c
4
+ p
2
i
c
2
.
It is well-established experimentally that the free fall universality also holds
for macroscopic material bodies composed of many elementary particles [Wil06,
DV96, Nor03, TWKN
+
04]. Therefore, the equality of the inertial and grav-
itational masses should hold for such compound bodies as well. We then
conclude that the gravitational Hamiltonian of a 2-body system should fol-
low the same pattern as the Hamiltonian of the 2-particle system (16.25)
10
H
gr
H
1
+ H
2
GH
1
H
2
c
4
R
12
(16.26)
=
_
2
1
c
4
+P
1
c
2
+
_
2
2
c
4
+P
2
c
2
G
_
2
1
c
4
+P
1
c
2
_
2
2
c
4
+P
2
c
2
c
4
R
12
Due to the defect of mass eect, the energy of a compound body H
i
11
is
not equal to the sum of energies of constituent particles. In particular,
this means that gravitational interaction energy cannot be represented as
2-particle terms summed over all particle pairs in the system. There must
be signicant contributions of 3-particle, 4-particle, etc. potentials.
12
Let us demonstrate this point on an example of a 3-particle system. We
will assume that the Hamiltonian for each 2-particle subsystem is
10
Here H
i
is the full Hamiltonian of the i-th body, which includes all particle interactions
(including gravitational ones) inside that body and R
12
is the separation between two
centers of mass. This expression is approximate not only due to omission of higher-order
terms in G and (1/c). Here we also ignore tidal gravitational eects characteristic for
multiparticle bodies.
11
which serves the role of charge in the gravitational interaction
12
In this respect, gravitational interaction is dierent from the electromagnetic one. In
the latter case, two-particle potentials provide a dominant contribution to the total energy
of a multiparticle system.
16.2. PRINCIPLE OF EQUIVALENCE 595
H
ij
= h
i
+ h
j
Gh
i
h
j
c
4
[r
i
r
j
[
+ V (r
i
, r
j
)
where V (r
i
, r
j
) is a non-gravitational (e.g., electromagnetic) interaction po-
tential between the two particles. Then let us attempt to write the Hamil-
tonian for the 3-particle system by simply adding pairwise interactions
H = h
1
+ h
2
+ h
3
Gh
1
h
2
c
4
[r
1
r
2
[
Gh
1
h
3
c
4
[r
1
r
3
[
Gh
2
h
3
c
4
[r
2
r
3
[
+V (r
1
, r
2
) + V (r
1
, r
3
) + V (r
2
, r
3
) (16.27)
Let us further assume that particles 2 and 3 form a charge-neutral (q
2
+q
3
=
0) bound system and that the distance between these two particles [r
2
r
3
[
is much smaller than their separation from particle 1, which we denote by R
R [r
1
r
2
[ [r
1
r
3
[ [r
2
r
3
[ (16.28)
Then the Hamiltonian (16.27) can be simplied
H h
1
+
_
h
2
+ h
3
Gh
2
h
3
c
4
[r
2
r
3
[
+ V (r
2
, r
3
)
_
Gh
1
(h
2
+ h
3
)
c
4
R
h
1
+ H
23
Gh
1
(h
2
+ h
3
)
c
4
R
(16.29)
This result is dierent from our expectation (16.26), because the non-interacting
energy (h
2
+h
3
) of the system 2+3 is present in the numerator of the gravi-
tational potential instead of the total energy H
23
. Therefore, our assumption
of simple pairwise gravitational forces violates the universality of free fall.
16.2.3 n-particle gravitational potentials
We have just established that our guessed Hamiltonian (16.27) with pairwise
gravitational potentials between elementary constituents is not accurate. In
order to comply with the free fall universality we need to assume that gravity
couples not only kinetic energies (h
i
) of particles, but also their interaction
596 CHAPTER 16. QUANTUM THEORY OF GRAVITY
energies (both electromagnetic and gravitational). This implies the presence
of signicant n-particle gravitational forces (where n > 2).
Let us now try to formulate the Hamiltonian of a 3-particle system, which
conforms with the principle of free fall universality. Suppose that in the
absence of gravitational forces the Hamiltonian of this system includes only
pairwise electromagnetic potentials V
ij
V (r
i
, r
j
)
H
em
= h
1
+ h
2
+ h
3
+ V
12
+ V
13
+ V
23
Then our claim is that one can include gravity by the following approach:
First, rearrange terms in H
em
as
H
em
= H
1
+ H
2
+ H
3
H
1
h
1
+
1
2
V
12
+
1
2
V
13
H
2
h
2
+
1
2
V
12
+
1
2
V
23
H
3
h
3
+
1
2
V
13
+
1
2
V
23
Then the desired full Hamiltonian is
13
H = H
em
+ H
gr
(16.30)
H
gr
=
GH
1
H
2
c
4
r
12
GH
1
H
3
c
4
r
13
GH
2
H
3
c
4
r
23
Note that gravitational interactions in H
gr
include 3-particle potentials. For
example, the product H
1
H
2
contains terms like V
12
V
23
, depending on posi-
tions of three particles.
To see that the claimed property of the free fall universality does, indeed,
hold, we will simplify the problem and assume that only particles 2 and 3
have non-zero charges, interact electromagnetically and form a bound state.
Gravitational interaction between these particles is much weaker, so it will be
neglected. We will characterize the subsystem 2+3 by its total momentum,
energy and center-of-mass position
13
The principle of construction of the gravitational interaction H
gr
can be easily ex-
tended to systems with arbitrary number of particles n.
16.2. PRINCIPLE OF EQUIVALENCE 597
P
23
= p
2
+p
3
H
23
= h
2
+ h
3
+ V
23
= H
2
+ H
3
R
23
=
h
2
r
2
+ h
3
r
3
+ q
2
q
3
V
23
(r
2
+r
3
)/2
h
2
+ h
3
+ V
23
=
H
2
r
2
+ H
3
r
3
H
23
We will further assume that particle 1 has zero charge and interacts with
2+3 by gravitational forces only. Then the full Hamiltonian (16.30) takes
the form
H = h
1
+ h
2
+ h
3
+ V
23
Gh
1
H
2
c
4
r
12
Gh
1
H
3
c
4
r
13
h
1
+ H
23
Gh
1
H
23
c
4
[r
1
R
23
[
H
2
= h
2
+
1
2
V
23
H
3
= h
3
+
1
2
V
23
which agrees with the desired pattern (16.26). Let us now study the dynamics
of the center-of-mass R
23
. The rst time derivative is
dR
23
dt
= [R
23
, H]
P
[R
23
, H
23
]
P
_
1
Gh
1
c
4
[r
1
R
23
[
_
c
2
P
23
H
23
If we further assume that the subsystem 2+3 is at rest (P
23
0), then its
acceleration
d
2
R
23
dt
2
=
_
c
2
P
23
H
23
, H
_
P
=
_
c
2
P
23
H
23
,
Gh
1
H
23
c
4
[r
1
R
23
[
_
P
Gh
1
(r
1
R
23
)
c
2
[r
1
R
23
[
3
Gm
1
(r
1
R
23
)
[r
1
R
23
[
3
does not depend on its energy or mass M
23
= H
23
/c
2
. This result agrees
with the principles of free fall universality and composition invariance. For
example, if particle 1 represents the gravitational force of Earth, and the
598 CHAPTER 16. QUANTUM THEORY OF GRAVITY
bound state 2+3 represents a falling atom, then the falls acceleration does
not depend on the atoms binding energy or on the mass defect M
23
m
2
+m
3
M
23
. Likewise, the free fall of any compound object is independent
on its composition (chemical or nuclear). This property was conrmed with
great accuracy in modern experiments.
16.2.4 Gravitational red shift
So far in this section we discussed how non-gravitational interactions (electro-
magnetic, nuclear) modify energies (masses) of bodies and, therefore, change
the gravitational attraction of the bodies. This modication manifested itself
in the universality of free fall. There is another side of this coin: Gravita-
tional eld aects spectra of bound systems (nuclei, atoms, etc.) and lead to
such remarkable phenomena as gravitational red shift and time dilation.
Let us rst consider an isolated multi-particle quantum physical system
(e.g., an atom, molecule, stable nucleus, etc), which can be regarded as a
source of electromagnetic radiation. In the absence of gravity, this system is
described by its Hamiltonian H. Here it will be convenient to represent this
Hamiltonian by its spectral decomposition (1.28)
H =
k
E
k
P
k
(16.31)
where index k labels energy eigenvalues E
k
and P
k
are projections on energy
eigensubspaces. If the system was prepared originally in an unstable level
E
i
, then, eventually, emits a photon and nds itself in a lower energy level
E
f
.
14
The photons energy is
E = E
i
E
f
(16.32)
Now let us place this source of radiation in the Earths gravitational
eld. In a reasonable approximation, the full Hamiltonian of the system
source+Earth is (16.26)
H
Mc
2
+ H
GMH
c
2
R
(16.33)
14
See section 13.3.
16.2. PRINCIPLE OF EQUIVALENCE 599
where M and R are the Earths mass and radius, respectively. Ignoring the
constant term Mc
2
, we see that the Hamiltonian of the light source in the
gravitational eld is
H
H
_
1
GM
c
2
R
_
H (16.34)
The factor 1GM/(c
2
R) < 1 can be regarded as a numerical multiplier,
which does not aect eigenvectors of the Hamiltonian. This means that
energy eigenvalues of H
i
= E
i
_
1
GM
c
2
R
_
= E
i
(16.35)
Therefore, the energies of emitted photons also scale by the same factor
E
E (16.36)
So, all emitted photon experience a red shift to lower energies. Gravi-
tational red shift experiments [CSW60, PJ60, PS65, Sni72, KR81, PSS
+
92,
Wil06] conrmed this formula to a high precision. For example, in the fa-
mous Pound-Rebka experiment [PJ60] two identical samples of
57
Fe nuclei
were used in a Mossbauer setup. One sample was used as a source of gamma
radiation and the other as a detector. If the source and the detector were
at dierent elevations (dierent gravitational potentials), then the mismatch
in their energy level separations (16.36) made the resonant absorption im-
possible. The radiation emitted by a source at a lower altitude appeared as
red-shifted to the detector at a higher altitude.
Note that during its travel from the source to the detector, the photons
kinetic energy (cp) varies so as to ensure conservation of energy (16.20).
Should we take this variation into account when determining the condition
of resonant absorption in the Pound-Rebka experiment? The answer is no.
When the photon gets absorbed by the detector it disappears completely, so
its total energy (kinetic energy plus potential energy) gets transferred to the
detector rather than the kinetic energy alone. The photons total energy
15
15
it would be more correct to talk about the total energy of the system photon+Earth
600 CHAPTER 16. QUANTUM THEORY OF GRAVITY
(and its frequency) remains constant during its travel, so the attraction of
photons to massive bodies (16.20) does not play any role in the gravitational
red shift [OST00]. The true origin of the red shift is the variation of energy
levels (16.35) in the source and/or in the detector when they are placed in
the gravitational potential.
16.2.5 Gravitational time dilation
Gravitational time dilation experiments [HK72, BL77, VL79, VLM
+
80, KAC90,
KMA93, FRS
+
02, Wil06] are fundamentally similar to red shift experiments
discussed above. The main dierence is that, unlike red shift experiments
dealing with the eect of gravity on energy levels, time dilation experiments
focus on the speed of processes in gravitational potentials. Any clock (or
any time-dependent process for that matter) is a quantum system that can
be described by the Hamiltonian (16.31) in the absence of gravity. The time
dependence (= the clocks ticking) means that the quantum state of the clock
is not an eigenstate of this Hamiltonian. The initial state [(0) of the clock
is prepared as a superposition of two or more energy eigenstates [
k
[(0) =
k
C
k
[
k
Then the time evolution in the absence of the gravitational eld is generated
by the Hamiltonian (16.31)
16
[(t) = e
Ht
[(0) =
k
e
E
k
t
C
k
[
k
(t) = e
t
[
(0) =
k
e
k
t
C
k
[
k
=
k
e
E
k
[1GM/(c
2
R)]t
C
k
[
k
= [([1 GM/(c
2
R)]t) = [(t)
16
see equation (6.81)
17
Here we assumed that the initial state [
) a = a
for all a, a
, 90
, 180
, 90
90
180
90
90
180
90
90
90
180
90
180
180
90
90
90
90
90
180
Table A.2: Inversion table for the symmetry group of the square
element inverse element
0
90
90
180
180
90
90
example, a 90
rotation
about the y-axis is a transformation that is dierent from these two rotations
performed in the reverse order.
Group homomorphism is a mapping h : G
1
G
2
which preserves group
operations. A mapping which is a bijection and a homomorphism at the
same time is called isomorphism.
A.3 Vector spaces
A vector space H is a set of objects (called vectors and further denoted by
boldface letters x) with two operations: addition of two vectors and multi-
plication of a vector by scalars. In this book we are interested only in vector
spaces whose scalars are either complex (C) or real (R) numbers. If x and y
are two vectors and a and b are two scalars, then
ax + by
is also a vector. A vector space forms an Abelian group with respect to vector
additions. This means associativity
(x +y) +z = x + (y +z),
612 APPENDIX A. SETS, GROUPS AND VECTOR SPACES
existence of the group unity (denoted by 0 and called zero vector)
x +0 = 0 +x = x
and existence of the opposite (additive inverse) element denoted by x
x + (x) = 0,
In addition, the following properties are postulated in the vector space:
The associativity of scalar multiplication
a(bx) = (ab)x
The distributivity of scalar sums:
(a + b)x = ax + bx
The distributivity of vector sums:
a(x +y) = ax + ay
The scalar multiplication identity:
1x = x
We leave it to the reader to prove from these axioms the following useful
results for an arbitrary scalar a and a vector x
0x = a0 = 0
(a)x = a(x) = (ax)
ax = 0 a = 0 or x = 0
An example of a vector space is the set of all columns of n numbers
2
2
If x
i
are real (complex) numbers then this vector space is denoted by R
n
(C
n
).
A.3. VECTOR SPACES 613
_
_
x
1
x
2
.
.
.
x
n
_
_
The sum of two columns is
_
_
x
1
x
2
.
.
.
x
n
_
_
+
_
_
y
1
y
2
.
.
.
y
n
_
_
=
_
_
x
1
+ y
1
x
2
+ y
2
.
.
.
x
n
+ y
n
_
_
The multiplication of a column by a scalar is
_
x
1
x
2
.
.
.
x
n
_
_
=
_
_
x
1
x
2
.
.
.
x
n
_
_
A set of nonzero vectors x
i
is called linearly independent if from
i
a
i
x
i
= 0
it follows that a
i
= 0 for each i. A set of linearly independent vectors x
i
is
called basis if by adding arbitrary nonzero vector y to this set it is no longer
linearly independent. If x
i
is a basis and y is an arbitrary nonzero vector,
then equation
a
0
y +
i
a
i
x
i
= 0
has a solution in which a
0
,= 0.
3
This means that we can express an arbitrary
vector y as a linear combination of basis vectors
3
because otherwise we would have a
i
= 0 for all i, meaning that the full set x
i
, y is
linearly independent in disagreement with our assumption.
614 APPENDIX A. SETS, GROUPS AND VECTOR SPACES
y =
i
a
i
a
0
x
i
=
i
y
i
x
i
(A.4)
Note that any vector y has unique components y
i
with respect to the basis
x
i
. Indeed, suppose we found another set of components y
i
, so that
y =
i
y
i
x
i
(A.5)
Then subtracting (A.5) from (A.4) we obtain
0 =
i
(y
i
y
i
)x
i
and y
i
= y
i
since x
i
are linearly independent.
One can choose many dierent bases in the same vector space. However,
the number of vectors in any basis is the same and this number is called
the dimension of the vector space V (denoted dimV ). The dimension of the
space of n-member columns is n. An example of a basis set in this space is
given by n vectors
_
_
1
0
.
.
.
0
_
_
,
_
_
0
1
.
.
.
0
_
_
, . . . ,
_
_
0
0
.
.
.
1
_
_
A linear subspace is a subset of vectors in H which is closed with respect
to addition and multiplication by scalars. For any set of vectors x
1
, x
2
, . . .
there is a spanning subspace (or simply span) Sp(x
1
, x
2
, . . .) which is the set
of all linear combinations
i
a
i
x
i
with arbitrary coecients a
i
. A span of a
non-zero vector Sp(x) is also called ray.
Appendix B
The delta function and useful
integrals
Diracs delta function (x) is dened by the property of the integral
a
_
a
f(x)(x)dx = f(0)
where f(x) is any smooth function and a > 0. The delta function can be
also dened by its integral representation
1
2
e
i
ax
da = (x)
Another useful property is
(ax) =
1
a
(x)
The delta function of a vector argument is dened as
(r) = (x)(y)(z)
or
615
616APPENDIX B. THE DELTA FUNCTION AND USEFUL INTEGRALS
1
(2)
3
_
e
i
kr
dk = (r) (B.1)
It has the property
2
r
2
1
4r
= (r) (B.2)
The step function (t) is dened as
(t)
_
1, if t 0
0, otherwise
(B.3)
It has the following integral representation
(t) =
1
2i
ds
e
ist
s + i
(B.4)
Consider integral
1
_
dr
r
e
i
pr
=
_
0
sin d
2
_
0
d
_
0
r
2
dr
e
i
pr cos
r
= 2
1
_
1
dz
_
0
drre
i
prz
= 2
_
0
rdr
e
i
pr
e
pr
ipr
=
4
p
_
0
dr sin
_
pr
_
=
4
2
p
2
_
0
d sin() =
4
2
p
2
(cos() cos(0)) =
4
2
p
2
(B.5)
Next consider integral
1
In this derivation one can set cos() = 0 because in applications the plane wave e
i
pr
in the integrand does not have innite extension. Typically it has a smooth damping
factor that makes it tend to zero at large values of r, so that cos() can be eectively
taken as zero.
617
K =
_
dxdy
e
i
(px+qy)
[x y[
First we change the integration variables
x =
1
2
(z +t)
y =
1
2
(z t)
x y = t
x +y = z
The Jacobian of this transformation is
J det
(x, y)
(z, t)
= 1/8
Then, using integrals (B.1) and (B.5), we obtain
K =
1
8
_
dtdz
e
i
2
(p(z+t)+q(zt))
t
=
1
8
_
dtdz
e
i
2
(z(p+q)+t(pq))
t
= (2)
3
(p +q)
_
dt
e
i
2
t(pq)
t
=
(2)
6
2
2
(p +q)
p
2
(B.6)
Other useful integrals are
_
dk
k
2
e
i
kr
=
(2)
3
4
2
r
(B.7)
_
dkk
k
2
e
i
kr
= i
r
_
dk
k
2
e
i
kr
=
i(2)
3
4
r
_
1
r
_
=
i(2)
3
r
4r
3
(B.8)
_
dkq [k p]
k
2
e
i
kr
=
i(2)
3
q [r p]
4r
3
(B.9)
618APPENDIX B. THE DELTA FUNCTION AND USEFUL INTEGRALS
_
dk(q k)(p k)
k
4
e
i
kr
=
(2)
3
8
2
r
_
(q p)
(q r)(p r)
r
2
_
(B.10)
_
dk(p k)(q k)
k
2
e
i
kr
=
(2)
3
4r
3
_
(p q) 3
(p r)(q r)
r
2
_
+
1
3
(p q)(r)
(B.11)
_
dk
k
4
e
i
kr
= c
(2)
3
r
8
4
(B.12)
where c is an innite constant (see [Wei64a]).
_
dre
ar
2
+br
= (/a)
3/2
e
b
2
/(4a)
(B.13)
Lemma B.1 (Riemann-Lebesgue [GR00]) Fourier image of a smooth
function tends to zero at innity.
When talking about smooth functions in this book we will presume that these
functions are continuous, can be dierentiated as many times as needed and
do not contain singularities.
Appendix C
Some lemmas for
orthocomplemented lattices.
From axioms of orthocomplemented lattices
1
one can prove a variety of useful
results
Lemma C.1
z x y z x (C.1)
Proof. From Postulate 1.8 we have x y x, hence z x y x and by
the transitivity Lemma 1.5 we obtain z x.
Lemma C.2
x y x y = x (C.2)
Proof. From x y and x x it follows by Postulate 1.9 that x x y.
On the other hand, x y x (1.8). Lemma 1.4 then implies x y = x. The
reverse statement follows from Postulate 1.8 written in the form
x y y (C.3)
If xy = x, then we can replace the left hand side of (C.3) with x and obtain
the left hand side of (C.2)
1
They are summarized in Table 1.1 as statements 1.3 - 1.21.
619
620APPENDIX C. SOME LEMMAS FOR ORTHOCOMPLEMENTED LATTICES.
Lemma C.3 For any proposition z
x y x z y z (C.4)
Proof. This follows from x z x y and x z z by using Postulate
1.9.
One can also prove equations
x x = x (C.5)
x = (C.6)
J x = x (C.7)
= J (C.8)
which are left as an exercise for the reader.
Proofs of lemmas and theorems for orthocomplemented lattices are facili-
tated by the following observation: Given an expression with lattice elements
we can form a dual expression by the following rules:
1) change places of and signs;
2) change the direction of the implication signs ;
3) change to J and change J to .
Then it is easy to see that all axioms in Table 1.1 have the property of duality:
Each axiom is either self-dual or its dual is also a valid axiom. Therefore, for
each logical (in)equality, its dual is also a valid (in)equality. For example, by
duality we have from (C.1), (C.2) and (C.4) - (C.8)
x y z x z
x y x y = y
y x y z x z
x x = x
J x = J
x = x
J
=
Appendix D
Rotation group
D.1 Basics of the 3D space
Let us now consider the familiar 3D position space. This space consists of
points. We can arbitrarily select one such point 0 and call it origin. Then
we can draw a vector a from the origin to any other point in space. We
can also dene a sum of two vectors (by the parallelogram rule as shown
in Fig. D.1) and the multiplication of a vector by a real scalar. There is a
natural denition of the length of a vector [a[ (also denoted by a) and the
angle (a, b) between two vectors a and b. Then the dot product (or scalar
product ) of two vectors is dened by formula
a b = b a = ab cos (a, b) (D.1)
Two non-zero vectors are called perpendicular or orthogonal if their dot prod-
uct is zero.
We can build an orthonormal basis of 3 mutually perpendicular vectors of
unit length i, j and k along x, y and z axes respectively.
1
Then each vector
a can be represented as a linear combination
a = a
x
i + a
y
j + a
z
k
1
Let us agree that the triple of basis vectors (i, j, k) forms a right-handed system as
shown in Fig. D.1. Such a system is easy to recognize by the following rule of thumb: If
we point a corkscrew in the direction of k and rotate it in the clockwise direction (from i
to j), then the corkscrew will move in the direction of vector k.
621
622 APPENDIX D. ROTATION GROUP
bb
ii
j
kk
aa
a+b
00
xx
yy
zz
Figure D.1: Some objects in the vector space R
3
: the origin 0, the basis
vectors i, j, k, a sum of two vectors a +b via the parallelogram rule.
or as a column of its components or coordinates
2
a =
_
_
a
x
a
y
a
z
_
_
The transposed vector can be represented as a row
a
T
= [a
x
, a
y
, a
z
]
One can easily verify that the dot product (D.1) can be written in several
equivalent forms
b a =
3
i=1
b
i
a
i
= b
x
a
x
+ b
y
a
y
+ b
z
a
z
= [b
x
, b
y
, b
z
]
_
_
a
x
a
y
a
z
_
_
= b
T
a
2
So, physical space can be identied with the vector space R
3
of all triples of real
numbers (see subsection A.3). We will mark vector indices either by letters x, y, z or by
numbers 1,2,3, as convenient.
D.2. SCALARS AND VECTORS 623
where b
T
a denotes the usual row by column product of the row b
T
and
column a.
The length of the vector a can be written as a [a[ =
a a
a
2
, and
the distance between two points (or vectors) a and b is dened as d = [ab[.
D.2 Scalars and vectors
There are two approaches to rotations, as well as to any inertial transfor-
mation: active and passive. An active rotation rotates all objects around
the origin while keeping the orientation of basis vectors. A passive rota-
tion simply changes the directions of the basis vectors and thus aects only
components of real vectors but not the vectors themselves. Unless noted
otherwise, we will use the passive representation of rotations.
We call a quantity / 3-scalar if it is not aected by rotations. Denoting
/
= /
Examples of scalars are distances and angles.
Let us now nd how rotations change the coordinates of vectors in R
3
.
By denition, rotations preserve the origin and linear combinations of vec-
tors, so the action of a rotation on a column vector can be represented as
multiplication by a 3 3 matrix R
a
i
=
3
j=1
R
ij
a
j
(D.2)
or in the matrix form
a
= Ra (D.3)
b
T
= (Rb)
T
= b
T
R
T
(D.4)
where R
T
denotes the transposed matrix.
624 APPENDIX D. ROTATION GROUP
D.3 Orthogonal matrices
Since rotations preserve distances and angles, they also preserve the dot
product:
b a = b
T
a = (Rb)
T
(Ra) = b
T
R
T
Ra (D.5)
The validity of equation (D.5) for any a and b implies that rotation matrices
satisfy condition
R
T
R = I (D.6)
where I denotes the unit matrix
I =
_
_
1 0 0
0 1 0
0 0 1
_
_
Multiplying by the inverse matrix R
1
from the right, equation (D.6) can be
also written as
R
T
= R
1
(D.7)
This implies a useful property
Rb a = b
T
R
T
a = b
T
R
1
a = b R
1
a (D.8)
In the coordinate notation, condition (D.6) takes the form
3
j=1
R
T
ij
R
jk
=
3
j=1
R
ji
R
jk
=
ik
(D.9)
where
ij
is the Kronecker delta symbol
D.3. ORTHOGONAL MATRICES 625
ij
=
_
1, if i = j
0, if i ,= j
(D.10)
Matrices satisfying condition (D.7) are called orthogonal. Thus, any rotation
has a unique representative in the set of orthogonal matrices.
However, not every orthogonal matrix R corresponds to a rotation. To
see that, we can write
1 = det(I) = det(R
T
R) = det(R
T
) det(R) = (det(R))
2
which implies that if R is orthogonal then det(R) = 1. Any rotation
can be connected by a continuous path with the trivial rotation which is
represented, of course, by the unit matrix with unit determinant. Since
continuous transformations cannot abruptly change the determinant from 1
to -1, only matrices with
det(R) = 1 (D.11)
have a chance to represent rotations.
3
We conclude that rotations are in one-
to-one correspondence with orthogonal matrices having a unit determinant.
The notion of a vector is more general than just an arrow directed to
a point in space. We will call any triple of quantities
/ = (/
x
, /
y
, /
z
) a
3-vector if it transforms under rotations in the same way as vector arrows
(D.2).
Let us consider examples of rotation matrices. Any rotation around the
z-axis does not change z-components of 3-vectors. The most general matrix
satisfying this property can be written as
R
z
=
_
_
a b 0
c d 0
0 0 1
_
_
and condition (D.11) translates to ad bc = 1. The inverse matrix is
3
Matrices with det(R) = 1 describe rotations coupled with inversion (see subsection
2.2.4).
626 APPENDIX D. ROTATION GROUP
R
1
z
=
_
_
d b 0
c a 0
0 0 1
_
_
According to the property (D.7) we must have
a = d
b = c
therefore
R
z
=
_
_
a b 0
b a 0
0 0 1
_
_
The condition det(R
z
) = a
2
+b
2
= 1 implies that matrix R
z
depends on one
parameter such that a = cos and b = sin
R
z
=
_
_
cos sin 0
sin cos 0
0 0 1
_
_
(D.12)
Obviously, parameter is just the rotation angle.
4
The matrices for rotations
around the x- and y-axes are
R
x
=
_
_
1 0 0
0 cos sin
0 sin cos
_
_
(D.13)
and
R
y
=
_
_
cos 0 sin
0 1 0
sin 0 cos
_
_
(D.14)
respectively.
4
Note that positive values of correspond to a clockwise rotation (from i to j) of the
basis vectors which drives the corkscrew in the positive z-direction.
D.4. INVARIANT TENSORS 627
D.4 Invariant tensors
Tensor of the second rank
5
/
ij
is dened as a set of 9 quantities which
depend on two indices and transform as a vector with respect to each index
/
ij
=
3
kl=1
R
ik
R
jl
/
kl
(D.15)
Similarly, one can also dene tensors of higher rank, e.g., /
ijk
.
There are two invariant tensors which play a special role because they
have the same components independent on the orientation of the basis vec-
tors. The rst invariant tensor is the Kronecker delta
ij
.
6
Its invariance
follows from the orthogonality of R-matrices (D.9).
ij
=
3
kl=1
R
ik
R
jl
kl
=
3
k=1
R
ik
R
jk
=
ij
Another invariant tensor is the Levi-Civita symbol
ijk
, which is dened as
xyz
=
zxy
=
yzx
=
xzy
=
yxz
=
zyx
= 1 and all other components of
ijk
are zero. We show its invariance by applying an arbitrary rotation R to
ijk
. Then
ijk
=
3
lmn=1
R
il
R
jm
R
kn
lmn
= R
i1
R
j2
R
k3
+ R
i3
R
j1
R
k2
+ R
i2
R
j3
R
k1
R
i2
R
j1
R
k3
R
i3
R
j2
R
k1
R
i1
R
j3
R
k2
(D.16)
The right hand side has the following properties:
1. it is equal to zero if any two indices coincide: i = j or i = k or j = k;
2. it does not change after cyclic permutation of indices ijk.
3.
123
= det(R) = 1.
5
Scalars and vectors are sometimes called tensors of rank 0 and 1, respectively.
6
see equation (D.10)
628 APPENDIX D. ROTATION GROUP
These are the same properties as those used to dene the Levi-Civita symbol
above. So, the right hand side of (D.16) must have the same components as
ijk
ijk
=
ijk
Using invariant tensors
ij
and
ijk
we can convert between scalar, vector
and tensor quantities, as shown in Table D.1. For example, any antisymmet-
ric 3-tensor has 3 independent components, so it can be always represented
as
/
ij
=
3
k=1
ijk
V
k
where V
k
are components of some 3-vector.
Table D.1: Converting between quantities of dierent rank using invariant
tensors
Scalar S S
ij
(tensor)
Scalar S S
ijk
(antisymmetric tensor)
Vector V
i
3
k=1
ijk
V
k
(antisymmetric tensor)
Tensor T
ij
3
ij=1
ij
T
ji
(scalar)
Tensor T
ij
3
jk=1
ijk
T
kj
(vector)
Using invariant tensors one can also build a scalar or a vector from two
independent vectors A and B. The scalar is constructed by using the Kro-
necker delta
A B =
3
ij=1
ij
A
i
B
j
A
1
B
1
+ A
2
B
2
+ A
3
B
3
This is the usual dot product (D.1). The vector can be constructed using
the Levi-Civita tensor
D.5. VECTOR PARAMETERIZATION OF ROTATIONS 629
[AB]
i
=
3
jk=1
ijk
A
j
B
k
This vector is called the cross product (or vector product ) of A and B. It
has the following components
[AB]
x
= A
y
B
z
A
z
B
y
[AB]
y
= A
z
B
x
A
x
B
z
[AB]
z
= A
x
B
y
A
y
B
x
and properties
[AB] = [BA]
[A[BC]] = B(A C) C(A B) (D.17)
The mixed product is a scalar which can be build from three vectors with the
help of the Levi-Civita invariant tensor
[AB] C =
3
ijk=1
ijk
A
i
B
j
C
k
Its properties are
[AB] C = [BC] A = [CA] B (D.18)
[AB] B = 0
D.5 Vector parameterization of rotations
The matrix representation of rotations (D.2) is useful for describing trans-
formations of vector and tensor components. However, sometimes it is more
convenient to characterize rotation in a more physical way by the rotation
axis and the rotation angle. In other words, a rotation can be described by a
630 APPENDIX D. ROTATION GROUP
PP
PP
||
PP
P
nn
Figure D.2: Transformation of vector components under active rotation
through the angle .
single vector
=
x
i +
y
j +
z
k, such that its direction represents the axis
of the rotation and its length [
=
x
,
y
,
z
.
7
Let us now make a link between the matrix and vector representations
of rotations. First, we nd the matrix R
. Each vector P in R
3
can be decomposed into two parts:
P = P
+ P
(P
= PP
= P
(D.19)
If P
,= 0, we
denote
n =
[P
7
This characterization is not unique: there are many vectors describing the same rota-
tion (see Appendix H.4).
D.5. VECTOR PARAMETERIZATION OF ROTATIONS 631
the vector which is orthogonal to both
and P
, n,
= P
= P
+P
=
_
P
(1 cos ) + Pcos
_
P
_
sin (D.21)
or in the component notation
P
x
= (P
x
x
+ P
y
y
+ P
z
z
)
2
(1 cos ) + P
x
cos (P
y
z
P
z
y
)
sin
y
= (P
x
x
+ P
y
y
+ P
z
z
)
2
(1 cos ) + P
y
cos (P
z
x
P
x
z
)
sin
z
= (P
x
x
+ P
y
y
+ P
z
z
)
2
(1 cos ) + P
z
cos (P
x
y
P
y
x
)
sin
= R
1
P = R
P
where the orthogonal matrix R
)
ij
= cos
ij
+
3
k=1
ijk
sin
+
i
j
1 cos
2
R
=
_
_
cos + m
2
x
(1 cos ) m
x
m
y
(1 cos ) m
z
sin m
x
m
z
(1 cos ) + m
y
sin
m
x
m
y
(1 cos ) + m
z
sin cos + m
2
y
(1 cos ) m
y
m
z
(1 cos ) m
x
sin
m
x
m
z
(1 cos ) m
y
sin m
y
m
z
(1 cos ) + m
x
sin cos + m
2
z
(1 cos )
_
_
(D.22)
632 APPENDIX D. ROTATION GROUP
and m
/.
Inversely, let us start from an arbitrary orthogonal matrix R
and try
to nd the corresponding rotation vector
. Obviously, this vector is not
changed by the transformation R
, so
R
) = 1 + e
i
+ e
i
= 1 + 2 cos
Therefore, we can dene the function
(R
) =
(which maps from the set
of rotation matrices to corresponding rotation angles) by the following rules:
the direction of the rotation vector
coincides with the direction of
the eigenvector of R
with eigenvalue 1;
the length of the rotation angle is given by
= cos
1
Tr(R
) 1
2
(D.23)
As expected, this formula is basis-independent, because the trace of a matrix
does not depend on the basis (see Lemma F.7).
D.6 Group properties of rotations
One can see that rotations form a group. If we perform rotation
1
followed
by rotation
2
, then the resulting transformation preserves the origin, the
linear combinations of vectors and their dot product, so it is another rotation.
8
One can check this result by using the explicit representation (D.22)
D.6. GROUP PROPERTIES OF ROTATIONS 633
The identity element in the rotation group is the rotation through zero
angle
0, which leaves all vectors intact and is represented by the unit matrix
R
0
= I. For each rotation
such that
0
The inverse rotation is represented by the inverse matrix R
= R
1
= R
T
.
The associativity law
1
(
3
) = (
2
)
= 1 +
3
i=1
i
t
i
+
1
2
3
ij=1
j
t
ij
+ . . .
At small values of we have simply
1 +
3
i=1
i
t
i
Quantities t
i
are called generators or innitesimal rotations. Generators
can be formally represented as derivatives of elements in one-parameter sub-
groups with respect to parameters
i
, e.g.,
t
i
= lim
0
d
d
i
For example, in the matrix notation, the generator of rotations around the
z-axis is given by the matrix
z
= lim
0
d
d
R
z
() = lim
0
d
d
_
_
cos sin 0
sin cos 0
0 0 1
_
_
=
_
_
0 1 0
1 0 0
0 0 0
_
_
(D.25)
Similarly, for generators of rotations around x- and y-axes we obtain from
(D.13) and (D.14)
x
=
_
_
0 0 0
0 0 1
0 1 0
_
_
,
y
=
_
_
0 0 1
0 0 0
1 0 0
_
_
(D.26)
10
Here we denote 1
= lim
N
_
N
N
_
= lim
N
_
N
_
N
= lim
N
_
1 +
3
i=1
i
N
t
i
_
N
= exp
_
3
i=1
i
t
i
_
(D.27)
Let us verify this formula in the case of a rotation around the z-axis
e
Jz
= 1 +
z
+
1
2!
2
z
+ . . .
=
_
_
1 0 0
0 1 0
0 0 1
_
_
+
_
_
0 0
0 0
0 0 0
_
_
+
_
_
2
2
0 0
0
2
2
0
0 0 0
_
_
+ . . .
=
_
_
1
2
2
+ . . . + . . . 0
+ . . . 1
2
2
+ . . . 0
0 0 1
_
_
=
_
_
cos sin 0
sin cos 0
0 0 1
_
_
= R
z
Exponent of any linear combination of generators t
i
also results in an or-
thogonal matrix with unit determinant, i.e., represents a rotation. There-
fore, objects t
i
form a basis in the vector space of generators of the rotation
group. This vector space is referred to as the Lie algebra of the rotation
group. General properties of Lie algebras will be discussed in Appendix E.2.
636 APPENDIX D. ROTATION GROUP
Appendix E
Lie groups and Lie algebras
E.1 Lie groups
In general, a group
1
can be thought of as a set of points (elements) with
a multiplication law such that the product of any two points gives you a
third element in the set. In addition, there is an inversion law that map
each point to an inverse point. For some groups the corresponding sets of
points are discrete. The symmetry groups of molecules and crystals are good
examples of discrete groups.
2
Here we would like to discuss a special class of
groups that are called Lie groups.
3
The characteristic feature of a Lie group
is that its set of points is continuous and smooth and that multiplication and
inversion laws are described by smooth functions. This set of points can be
visualized as a multi-dimensional hypersurface, which is called the group
manifold.
We saw in the Appendix D.5
4
that elements of the rotation group are in
isomorphic correspondence with points
in a certain topological space or
smooth manifold. The multiplication and inversion laws dene two smooth
mappings between points in this space. Thus, the rotation group is an exam-
ple of a Lie group. Similar to the rotation group, elements in a general Lie
group can be parameterized by n continuous parameters
i
, where n is the di-
mension of the Lie group. We will join these parameters in one n-dimensional
1
see subsection A.2
2
See also example in Appendix A.2.
3
Lie groups and algebras were named after Norwegian mathematician Sophus Lie who
rst developed their theory.
4
see also Appendix H.4
637
638 APPENDIX E. LIE GROUPS AND LIE ALGEBRAS
vector
and denote a general group element as
=
1
,
2
, . . .
n
, so
that the group multiplication and inversion laws are smooth functions of
these parameters.
It appears that similar to the rotation group, in a general Lie group it
is also possible to choose a parameterization
1
,
2
, . . .
n
such that the
following properties are satised
the unit element has parameters (0,0,...,0);
1
=
;
if elements
and
We will always assume that group parameters satisfy these properties. Then,
similar to what we did in subsection D.7 for the rotation group, we can
introduce innitesimal transformations or generators t
a
(a = 1, 2, . . . , n) for
a general Lie group and express group elements in the vicinity of the unit
element as exponential functions of generators
5
= exp
_
n
a=1
a
t
a
_
= 1 +
n
a=1
a
t
a
+
1
2!
n
bc=1
c
t
bc
+ . . . (E.1)
Let us introduce function g(
= g(
) (E.2)
Function g(
0,
) = g(
0) =
(E.3)
g(
) =
0
which follow from the property (A.2) of the unit element and property (A.3)
of the inverse element. To ensure agreement with equation (E.3), the Taylor
expansion of g up to the 2nd order in parameters must look like
g
a
(
) =
a
+
a
+
n
bc=1
f
a
bc
c
+ . . . (E.4)
where f
a
bc
are real coecients. Now we substitute expansions (E.1) and (E.4)
into (E.2)
_
1 +
n
a=1
a
t
a
+
1
2
n
bc=1
c
t
bc
+ . . .
__
1 +
n
a=1
a
t
a
+
1
2
n
bc=1
c
t
bc
+ . . .
_
= 1 +
n
a=1
_
a
+
a
+
n
bc=1
f
a
bc
c
+ . . .
_
t
a
+
1
2
n
ab=1
(
a
+
a
+ . . .)(
b
+
b
+ . . .)t
ab
+ . . .
Factors multiplying 1, , ,
2
,
2
are exactly the same on both sides of this
equation, but the factor in front of produces a non-trivial condition
1
2
(t
bc
+ t
cb
) = t
b
t
c
a=1
f
a
bc
t
a
The left hand side is symmetric with respect to the interchange of indices b
and c. Therefore the right hand side must by symmetric as well
t
b
t
c
a=1
f
a
bc
t
a
t
c
t
b
+
n
a=1
f
a
cb
t
a
= 0 (E.5)
If we dene the commutator of two generators by formula
[t
b
, t
c
] t
b
t
c
t
c
t
b
640 APPENDIX E. LIE GROUPS AND LIE ALGEBRAS
then, according to (E.5), this commutator is a linear combination of genera-
tors
[t
b
, t
c
] =
n
a=1
C
a
bc
t
a
(E.6)
where real parameters C
a
bc
= f
a
bc
f
a
cb
are called structure constants of the
Lie group.
Theorem E.1 Generators of a Lie group satisfy the Jacobi identity
[t
a
, [t
b
, t
c
]] + [t
b
, [t
c
, t
a
]] + [t
c
, [t
a
, t
b
]] = 0 (E.7)
Proof. Let us rst write the associativity law (A.1) in the form
6
0 = g
a
(
, g(
, )) g
a
(g(
), )
a
+ g
a
(
, ) + f
a
bc
b
g
c
(
, ) g
a
(
)
a
f
a
bc
g
b
(
)
c
a
+
a
+
a
+ f
a
bc
c
+ f
a
bc
b
(
c
+
c
+ f
c
xy
y
)
a
f
a
xy
a
f
a
bc
(
b
+
b
+ f
b
xy
y
)
c
= f
a
bc
c
+ f
a
bc
c
+ f
a
bc
c
+ f
a
bc
f
c
xy
y
f
a
xy
y
f
a
bc
b
f
a
bc
b
f
a
bc
f
b
xy
y
= f
a
bc
f
c
xy
y
f
a
bc
f
b
xy
c
= (f
a
bc
f
c
xy
f
a
cy
f
c
bx
)
b
y
Since elements
k=1
ijk
k
In the vicinity of the unit element, any Lie group element can be repre-
sented as exponent exp(x) of a Lie algebra element x (see equation (E.1)).
8
Note that this representation of the Lie bracket as a dierence of two products can
be used only when the generators are identied with matrices. This formula (as well as
(E.11)) does not apply to abstract Lie algebras, because the product of two elements AB
is not dened there.
E.2. LIE ALGEBRAS 643
As product of two group elements is another group element, we must have
for any two Lie algebra elements x and y
exp(x) exp(y) = exp(z) (E.12)
where z is also an element from the Lie algebra. Then there should exist a
mapping in the Lie algebra which associates with any two elements x and y a
third element z, such that equation (E.12) is satised. The Baker-Campbell-
Hausdor theorem [WM62] gives us the explicit form of this mapping
z = x + y +
1
2
[x, y] +
1
12
[[x, y], y] +
1
12
[[y, x], x]
+
1
24
[[[y, x], x], y]
1
720
[[[[x, y], y], y], y] +
1
360
[[[[x, y], y], y], x]
+
1
360
[[[[y, x], x], x], y]
1
120
[[[[x, y], y], x], y]
1
120
[[[[y, x], x], y], x] . . .
This means that commutation relations in the Lie algebra contain full infor-
mation about the group multiplication law in the vicinity of the unit element.
In many cases, it is much easier to deal with generators and their commuta-
tors than directly with group elements and their multiplication law.
In applications one often nds useful the following identity
exp(ax)y exp(ax) = y + a[x, y] +
a
2
2!
[x, [x, y]] +
a
3
3!
[x, [x, [x, y]]] . . . (E.13)
where a R. This formula can be proved by noticing that both sides are
solutions of the same dierential operator equation
dy(a)
da
= [x, y(a)]
with the same initial condition y(a) = y.
There is a unique Lie algebra A
G
corresponding to each Lie group G.
However, there are many Lie groups with the same Lie algebra. These groups
have the same structure in the vicinity of the unit element, but their global
topological properties can be dierent.
644 APPENDIX E. LIE GROUPS AND LIE ALGEBRAS
A Lie subalgebra B of a Lie algebra A is a subspace in A which is closed
with respect to commutator, i.e., if x, y B, then [x, y] B. If H is a
subgroup of a Lie group G, then its Lie algebra A
H
is a Lie subalgebra of
A
G
.
Appendix F
Hilbert space
F.1 Inner product
An inner product space H is dened as a complex vector space
1
which has a
mapping from ordered pairs of vectors to complex numbers. This mapping
is called the inner product ([y, [x) and it satises the following properties
([x, [y) = ([y, [x)
(F.1)
([z, [x + [y) = ([z, [x) + ([z, [y) (F.2)
([x, [x) R (F.3)
([x, [x) 0 (F.4)
([x, [x) = 0 [x = 0 (F.5)
where and are complex numbers. Given inner product we can dene the
distance between two vectors by formula d([x, [y)
_
([x y, [x y).
The inner product space H is called complete if any Cauchy sequence
2
of vectors in H converges to a vector in H. Analogously, a subspace in H is
called a closed subspace if any Cauchy sequence of vectors belonging to the
subspace converges to a vector in this subspace. The Hilbert space is simply
a complete inner product space.
3
1
See Appendix A.3. Vectors in H will be denoted by [x.
2
Cauchy sequence is an innite sequence of vectors [x
i
in which the distance between
two vectors [x
n
and [x
m
tends to zero when their indices tend to innity n, m .
3
The notions of completeness and closedness are rather technical. Finite dimensional
645
646 APPENDIX F. HILBERT SPACE
F.2 Orthonormal bases
Two vectors [x and [y are called orthogonal if ([x, [y) = 0. Vector [x
is called unimodular if ([x, [x) = 1. In Hilbert space we can consider
orthonormal bases consisting of mutually orthogonal unimodular vectors [e
i
which satisfy
([e
i
, [e
i
) =
_
1, if i = j
0, if i ,= j
or, using the Kronecker delta symbol
([e
i
, [e
i
) =
ij
(F.6)
Suppose that vectors [x and [y have components x
i
and y
i
, respectively,
in this basis
[x = x
1
[e
1
+ x
2
[e
2
+ . . . + x
n
[e
n
[y = y
1
[e
1
+ y
2
[e
2
+ . . . + y
n
[e
n
Then using (F.1), (F.2) and (F.6) we can express the inner product through
vector components
([x, [y) = (x
1
[e
1
+ x
2
[e
2
+ . . . + x
n
[e
n
, y
1
[e
1
+ y
2
[e
2
+ . . . + y
n
[e
n
)
= x
1
y
1
+ x
2
y
2
+ . . . + x
n
y
n
=
i
x
i
y
i
inner product spaces are always complete and their subspaces are always closed. Although
in quantum mechanics we normally deal with innite-dimensional spaces, most properties
having relevance to physics do not depend on the number of dimensions. So, we will
ignore the dierence between nite- and innite-dimensional spaces and freely use nite
n-dimensional examples in our proofs and demonstrations. In particular, we will tacitly
assume that every subspace A is closed or forced to be closed by adding all vectors which
are limits of Cauchy sequences in A.
F.3. BRA AND KET VECTORS 647
F.3 Bra and ket vectors
The notation ([x, [y) for the inner product is rather cumbersome. We will
use instead a more convenient bra-ket formalism suggested by Dirac, which
greatly simplies manipulations with objects in the Hilbert space. Let us
call vectors in the Hilbert space ket vectors. We dene a linear functional
f[ : H C as a function (denoted by f[x) which maps each ket vector
[x in H to complex numbers in such a way that linearity is preserved, i.e.
f[([x + [y) = f[x + f[y
Since any linear combination f[ +g[ of functionals f[ and g[ is again a
functional, then all functionals form a vector space (denoted H
). The vectors
in H
so
that it becomes a Hilbert space. To do that, let us choose an orthonormal
basis [e
i
in H. Then each functional f[ denes a set of complex numbers
f
i
which are values of this functional on the basis vectors
f
i
= f[e
i
These numbers dene the functional uniquely, i.e., if two functionals f[ and
g[ are dierent, then their values are dierent for at least one basis vector
[e
k
: f
k
,= g
k
.
4
Now we can dene the inner product of bra vectors f[ and
g[ by formula
(f[, g[) =
i
f
i
g
i
and verify that it satises all properties of the inner product listed in (F.1)
- (F.5). The Hilbert space H
by
formula
y[x ([y, [x) (F.7)
4
Otherwise, using linearity we would be able to prove that the values of functionals
f[ and g[ are equal on all vectors in H, i.e., f[ = g[.
648 APPENDIX F. HILBERT SPACE
for each [x H. This bra vector y[ will be called the dual of the ket vector
[y. Equation (F.7) tells us that in order to calculate the inner product of [y
and [x we should nd the bra vector (functional) dual to [y and then nd
its value on [x. So, the inner product is obtained by coupling bra and ket
vectors x[y, thus forming a closed bra(c)ket expression, which is a complex
number.
Clearly, if [x and [y are dierent kets then their dual bras x[ and y[
are dierent as well. We may notice that just like vectors in H
dene linear
functionals on vectors in H, any vector [x H also denes an antilinear
functional on bra vectors by formula y[x, i.e.,
(y[ + z[)[x =
y[x +
z[x
Then applying the same arguments as above, we see that if y[ is a bra vector,
then there is a unique ket [y such that for any x[ H
we have
x[y = (x[, y[) (F.8)
Thus we established an isomorphism of two Hilbert spaces H and H
. This
statement is known as the Riesz theorem.
Lemma F.1 If kets [e
i
form an orthonormal basis in H, then dual bras e
i
[
also form an orthonormal basis in H
.
Proof. Suppose that e
i
[ do not form a basis. Then there is a nonzero vector
z[ H
i
[e
i
x
i
=
i
[e
i
e
i
[x (F.9)
The bra vector y[ dual to the ket [y has complex conjugate components in
the dual basis
y[ =
i
y
i
e
i
[ (F.10)
This can be veried by checking that the functional on the right hand side
being applied to any vector [x H yields
i
y
i
e
i
[x =
i
y
i
x
i
= ([y, [x) = y[x
F.4 Tensor product of Hilbert spaces
Given two Hilbert spaces 1
1
and 1
2
one can construct a third Hilbert space
1 which is called the tensor product of 1
1
and 1
2
and denoted by 1 =
1
1
1
2
. For each pair of basis ket vectors [i 1
1
and [j 1
2
there is
exactly one basis ket in 1 which is denoted by [i [j. All other vectors in
1 are linear products of the basis kets [i [j with complex coecients.
The inner product of two basis vectors [a
1
[a
2
1 and [b
1
[b
2
1
is dened as a
1
[b
1
a
2
[b
2
. This inner product is extended to linear combi-
nations of basis vectors by linearity.
F.5 Linear operators
Linear transformations of vectors in the Hilbert space (also called operators)
play a very important role in quantum formalism. Such transformations
650 APPENDIX F. HILBERT SPACE
T[x = [x
[e
i
=
j
t
ij
[e
j
Coecients t
ij
of this expansion are called the matrix elements of the op-
erator T in the basis [e
i
. In the bra-ket notation we can nd a convenient
expression for the matrix elements
e
j
[(T[e
i
) = e
j
[e
i
= e
j
[
k
t
ik
[e
k
=
k
t
ik
e
j
[e
k
=
k
t
ik
jk
= t
ij
Knowing matrix elements of the operator T and components of vector [x in
the basis [e
i
one can always nd the components of the transformed vector
[x
= T[x
x
i
= e
i
[x
= e
i
[(T[x) = e
i
[
j
(T[e
j
)x
j
=
jk
e
i
[e
k
t
kj
x
j
=
jk
ik
t
kj
x
j
=
j
t
ij
x
j
(F.11)
In the bra-ket notation, the operator T has the form
F.6. MATRICES AND OPERATORS 651
T =
ij
[e
i
t
ij
e
j
[ (F.12)
Indeed, by applying the right hand side of equation (F.12) to arbitrary vector
[x we obtain
ij
[e
i
t
ij
e
j
[x =
ij
[e
i
t
ij
x
j
=
i
x
i
[e
i
= [x
= T[x
F.6 Matrices and operators
Sometimes it is convenient to represent vectors and operators in the Hilbert
space 1 in a matrix notation. Let us x an orthonormal basis [e
i
1 and
represent each ket vector [y by a column of its components
[y =
_
_
y
1
y
2
.
.
.
y
n
_
_
The bra vector x[ will be represented by a row
x[ = [x
1
, x
2
, . . . , x
n
]
of complex conjugate components in the dual basis e
i
[. Then the inner
product is obtained by the usual row by column matrix multiplication
rule.
x[y = [x
1
, x
2
, . . . , x
n
]
_
_
y
1
y
2
.
.
.
y
n
_
_
=
i
x
i
y
i
652 APPENDIX F. HILBERT SPACE
Matrix elements of the operator T in (F.12) can be conveniently arranged in
the matrix
T =
_
_
t
11
t
12
. . . t
1n
t
21
t
22
. . . t
2n
.
.
.
.
.
.
.
.
.
.
.
.
t
n1
t
n2
. . . t
nn
_
_
Then the action of the operator T on a vector [x
_
x
1
x
2
.
.
.
x
n
_
_
=
_
_
t
11
t
12
. . . t
1n
t
21
t
22
. . . t
2n
.
.
.
.
.
.
.
.
.
.
.
.
t
n1
t
n2
. . . t
nn
_
_
_
_
x
1
x
2
.
.
.
x
n
_
_
=
_
j
t
1j
x
j
j
t
2j
x
j
.
.
.
j
t
nj
x
j
_
_
So, each operator has a unique matrix and each n n matrix denes a
unique linear operator. This establishes an isomorphism between operators
and matrices. In what follows we will often use the terms operator and matrix
interchangeably.
The matrix corresponding to the identity operator is
ij
, i.e., the unit
matrix
I =
_
_
1 0 . . . 0
0 1 . . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . 1
_
_
A diagonal operator has diagonal matrix d
i
ij
D =
_
_
d
1
0 . . . 0
0 d
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . d
n
_
_
5
compare with (F.11)
F.6. MATRICES AND OPERATORS 653
The action of operators in the dual space 1
1
, y
2
, . . . , y
n
] = [y
1
, y
2
, . . . , y
n
]
_
_
s
11
s
12
. . . s
1n
s
21
s
22
. . . s
2n
.
.
.
.
.
.
.
.
.
.
.
.
s
n1
s
n2
. . . s
nn
_
_
or symbolically
y
i
=
j
y
j
s
ji
y
[ = y[S
Suppose that operator T with matrix t
ij
in the ket space H transforms
vector [x to [y, i.e.,
y
i
=
j
t
ij
x
j
(F.13)
What is the matrix of operator S in the bra space H
i
=
j
x
j
s
ji
(F.14)
On the other hand, taking complex conjugate of equation (F.13) we obtain
y
i
=
j
t
ij
x
j
Comparing this with (F.14) we have
s
ij
= t
ji
654 APPENDIX F. HILBERT SPACE
This means that the matrix representing the action of the operator T in the
dual space H
. In
particular, we can write
x[(T[y) = (x[T
)[y (F.15)
det(T
) = (det(T))
(F.16)
F.7 Functions of operators
The sum of two operators A and B and the multiplication of an operator A
by a complex number are easily expressed in terms of matrix elements
(A + B)
ij
= a
ij
+ b
ij
(A)
ij
= a
ij
We can dene the product AB of two operators as the transformation ob-
tained by a sequential application of B and then A. This product is also a
linear transformation, i.e., an operator. The matrix of the product AB is the
row-by-column product of their matrices a
ij
and b
ij
(AB)
ij
=
k
a
ik
b
kj
Lemma F.2 Adjoint of a product of operators is equal to the product of
adjoint operators in the opposite order.
(AB)
= B
6
This is equivalent to the reection of the matrix with respect to the main diagonal.
Such matrix operation is called transposition.
F.7. FUNCTIONS OF OPERATORS 655
Proof.
(AB)
ij
= (AB)
ji
=
k
a
jk
b
ki
=
k
b
ki
a
jk
=
k
(B
)
ik
(A
)
kj
= (B
)
ij
The inverse operator A
1
is dened by its two properties
A
1
A = AA
1
= I
The corresponding matrix is the inverse of the matrix A.
Using the basic operations of addition, multiplication and inversion we
can dene various functions f(A) of the operator A. For example, the expo-
nential function is dened by Taylor series
e
F
= 1 + F +
1
2!
F
2
+ . . . (F.17)
For any two operators A and B the expression
[A, B] AB BA (F.18)
is called the commutator. We say that two operators A and B commute
with each other if [A, B] = 0. Clearly, any two powers of A commute:
[A
n
, A
m
] = 0; and [A, A
1
] = 0. Consequently, any two functions of A
commute as well: [f(A), g(A)] = 0.
Trace of a matrix is dened as a sum of its diagonal elements
Tr(A) =
i
A
ii
Lemma F.3 Trace of a product of operators is invariant with respect to any
cyclic permutation of factors.
656 APPENDIX F. HILBERT SPACE
Proof. Take for example a trace of the product of three operators
Tr(ABC) =
ijk
A
ij
B
jk
C
ki
Then
Tr(BCA) =
ijk
B
ij
C
jk
A
ki
Changing in this expression summation indices k i, i j and j k, we
obtain
Tr(BCA) =
ijk
B
jk
C
ki
A
ij
= Tr(ABC)
We can dene two classes of operators (and their matrices) which play
important roles in quantum mechanics (see Table F.1). These are Hermitian
and unitary operators. We call operator T Hermitian or self-adjoint if
T = T
(F.19)
For a Hermitian T we can write
t
ii
= t
ii
t
ij
= t
ji
i.e., diagonal matrix elements are real and non-diagonal matrix elements
symmetrical with respect to the main diagonal are complex conjugates of
each other. Moreover, from equations (F.15) and (F.19) we can calculate the
inner product of vectors x[ and T[y with a Hermitian T
x[(T[y) = (x[T
(A
)
ij
= A
ij
Transposition A A
T
(A
T
)
ij
= A
ji
Hermitian conjugation A A
= (A
)
T
(A
)
ij
= A
ji
Inversion A A
1
inverse eigenvalues
Determinant det(A) product of eigenvalues
Trace Tr(A)
i
A
ii
Types of operators
Identity I I
ij
=
ij
Diagonal D D
ij
= d
i
ij
Hermitian A = A
A
ij
= A
ji
AntiHermitian A = A
A
ij
= A
ji
Unitary A
1
= A
unimodular eigenvalues
Projection A = A
, A
2
= A eigenvalues 0 and 1 only
From this symmetric notation it is clear that a Hermitian T can act either
to the right (on [y) or to the left (on x[)
Operator U is called unitary if
T
1
= T
or, equivalently
T
T = TT
= I
A unitary operator preserves the inner product of vectors, i.e.,
Ua[Ub (a[U
)(U[b) = a[U
1
U[b = a[I[b = a[b (F.20)
Lemma F.4 If F is an Hermitian operator then U = e
iF
is unitary.
658 APPENDIX F. HILBERT SPACE
Proof.
U
U = (e
iF
)
(e
iF
) = e
iF
e
iF
= e
iF
e
iF
= e
iF+iF
= e
0
= I
Lemma F.5 Determinant of a unitary matrix U is unimodular.
Proof. We use equation (F.16) to write
[ det(U)[
2
= det(U)(det(U))
= det(U) det(U
) = det(UU
) = det(I) = 1
Operator A is called antilinear if A([x+[y) =
A[x+
is called antiunitary.
F.8 Linear operators in dierent orthonor-
mal bases
So far, we have been working with matrix elements of operators in a xed or-
thonormal basis [e
i
. However, in a dierent basis the operator is represented
by a dierent matrix. Then we may ask if the properties of operators de-
ned above remain valid in other orthonormal basis sets? In other words, we
would like to demonstrate that all above denitions are basis-independent.
Theorem F.6 [e
i
and [e
i
are two orthonormal bases if and only if there
exists a unitary operator U such that
U[e
i
= [e
i
(F.21)
F.8. LINEAR OPERATORS IN DIFFERENT ORTHONORMAL BASES659
Proof. The basis [e
i
obtained by applying a unitary transformation U to
the orthonormal basis [e
i
is orthonormal, because unitary transformations
preserve inner products of vectors (F.20). To prove the reverse statement let
us form a matrix
_
_
e
1
[e
1
e
1
[e
2
. . . e
1
[e
e
2
[e
1
e
2
[e
2
. . . e
2
[e
.
.
.
.
.
.
.
.
.
.
.
.
e
n
[e
1
e
n
[e
2
. . . e
n
[e
_
with matrix elements
u
ji
= e
j
[e
jk
[e
j
u
jk
e
k
[ =
jk
[e
j
e
j
[e
k
e
k
[
So, acting on the vector [e
i
U[e
i
=
jk
[e
j
e
j
[e
k
e
k
[e
i
=
jk
[e
j
e
j
[e
ki
=
j
[e
j
e
j
[e
= [e
it makes vector [e
i
as required. Moreover, this operator is unitary because
7
(UU
)
ij
=
k
u
ik
u
jk
=
k
e
i
[e
k
e
j
[e
k
k
e
i
[e
k
e
k
[e
j
= e
i
[e
=
ij
= I
ij
7
Here we use the following representation of the identity operator
I =
i
[e
i
e
i
[ (F.22)
660 APPENDIX F. HILBERT SPACE
If F is operator with matrix elements f
ij
in the basis [e
k
, then its matrix
elements f
ij
in basis [e
k
= U[e
k
can be obtained by formula
f
ij
= e
i
[F[e
j
= (e
i
[U
)F(U[e
j
) = e
i
[U
FU[e
j
= e
i
[U
1
FU[e
j
(F.23)
Equation (F.23) can be viewed from two equivalent perspectives. One can
regard (F.23) either as matrix elements of F in the new basis set U[e
i
(a
passive view) or as matrix elements of the transformed operator U
1
FU in
the original basis set [e
i
(an active view).
When changing basis, the matrix of the operator changes, but its type
remains the same. If operator F is Hermitian, then in the new basis (adopting
active view and omitting symbols for basis vectors)
(F
= (U
1
FU)
= U
(U
1
)
= U
1
FU = F
it is Hermitian as well.
If V is unitary, then for the transformed operator V
we have
(V
V
= (U
1
V U)
V
= U
(U
1
)
V
= U
1
V
UV
= U
1
V
UU
1
V U
= U
1
V
V U = U
1
U = I
so, V
is also unitary.
Lemma F.7 Trace of an operator is independent on the basis.
Proof. From Lemma F.3 we obtain
Tr(U
1
AU) = Tr(AUU
1
) = Tr(A)
F.9. DIAGONALIZATION OF HERMITIAN ANDUNITARY MATRICES661
F.9 Diagonalization of Hermitian and unitary
matrices
We see that the choice of basis in the Hilbert space is a matter of conve-
nience. So, when performing calculations it is always a good idea to choose
a basis in which operators have the simplest form, e.g., diagonal. It appears
that Hermitian and unitary operators can always be made diagonal by an
appropriate choice of basis. Suppose that vector [x satises equation
F[x = [x
where is a complex number called eigenvalue of the operator F. Then [x
is called eigenvector of the operator F.
Theorem F.8 (spectral theorem) For any Hermitian or unitary operator
F there is an orthonormal basis [e
i
such that
F[e
i
= f
i
[e
i
(F.24)
where f
i
are complex numbers.
For the proof of this theorem see ref. [Rud91].
Equation (F.24) means that the matrix of the operator F is diagonal in
the basis [e
i
F =
_
_
f
1
0 . . . 0
0 f
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . f
n
_
_
and according to (F.12) each Hermitian or unitary operator can be expressed
through its eigenvectors and eigenvalues
F =
i
[e
i
f
i
e
i
[ (F.25)
662 APPENDIX F. HILBERT SPACE
Lemma F.9 Eigenvalues of a Hermitian operator are real.
Proof. This follows from the fact that diagonal matrix elements of an Her-
mitian matrix are real.
Lemma F.10 Eigenvalues of an unitary operator are unimodular.
Proof. Using representation (F.25) we can write
I = UU
=
_
i
[e
i
f
i
e
i
[
__
j
[e
j
f
j
e
j
[
_
=
ij
f
i
f
j
[e
i
e
i
[e
j
e
j
[ =
ij
f
i
f
j
[e
i
ij
e
j
[ =
i
[f
i
[
2
[e
i
e
i
[
Since all eigenvalues of the identity operator are 1, we obtain [f
i
[
2
= 1.
One benet of diagonalization is that functions of operators are easily
dened in the diagonal form. If operator A has diagonal form
A =
_
_
a
1
0 . . . 0
0 a
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . a
n
_
_
then operator f(A) (in the same basis) has the form
f(A) =
_
_
f(a
1
) 0 . . . 0
0 f(a
2
) . . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . f(a
n
)
_
_
For example, the matrix of the inverse operator is
8
8
Note that inverse operator A
1
is dened only if all eigenvalues of A are nonzero.
F.9. DIAGONALIZATION OF HERMITIAN ANDUNITARY MATRICES663
A
1
=
_
_
a
1
1
0 . . . 0
0 a
1
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . a
1
n
_
_
From Lemma F.10, there is a basis in which the matrix of unitary operator
U is diagonal
U =
_
_
e
if
1
0 . . . 0
0 e
if
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . e
ifn
_
_
with real f
i
. It then follows that each unitary operator can be represented
as
U = e
iF
where F is Hermitian
F =
_
_
f
1
0 . . . 0
0 f
2
. . . 0
.
.
.
.
.
.
.
.
.
.
.
.
0 0 . . . f
n
_
_
Together with Lemma F.4 this establishes an isomorphism between the sets
of Hermitian and unitary operators.
Lemma F.11 Unitary transformation of a Hermitian or unitary operator
does not change its spectrum.
Proof. If [
k
is eigenvector of M with eigenvalue m
k
M[
k
= m
k
[
k
=
UMU
1
with the same eigenvalue
M
(U[
k
) = UMU
1
(U[
k
) = UM[
k
= Um
k
[
k
= m
k
(U[
k
)
Appendix G
Subspaces and projection
operators
G.1 Projections
Two subspaces A and B in the Hilbert space 1are called orthogonal (denoted
A B) if any vector from A is orthogonal to any vector from B. The span
of all vectors which are orthogonal to A is called the orthogonal complement
to the subspace A and denoted A
.
For a subspace A (with dim(A) = m) in the Hilbert space 1 (with
dim(1) = n > m) we can select an orthonormal basis [e
i
such that rst
m vectors with indices i = 1, 2, . . . , m belong to A and vectors with indices
i = m + 1, m + 2, . . . , n belong to the orthogonal complement A
. Then for
each vector [y we can write
[y =
n
i
[e
i
e
i
[y =
m
i=1
[e
i
e
i
[y +
n
i=m+1
[e
i
e
i
[y
The rst sum lies entirely in A and is denoted by [y
and is denoted [y
and
[y
1
.
1
We will also say that Hilbert space H is represented as a direct sum ( H = AA
) of
orthogonal subspaces A and A
.
665
666 APPENDIX G. SUBSPACES AND PROJECTION OPERATORS
[y = [y
+[y
[y
A
[y
i=1
[e
i
e
i
[
so that in the above basis [e
i
the operator P
A
has diagonal matrix with
rst m diagonal entries equal to 1 and all others equal to 0. From this, it
immediately follows that
P
A
= 1 P
A
A set of projections P
is called
decomposition of unity if
1 =
or, equivalently
H =
Thus P
A
and P
A
provide an example of the decomposition of unity.
G.2. COMMUTING OPERATORS 667
Theorem G.1 Operator P is a projection if and only if P is Hermitian and
P
2
= P.
Proof. For Hermitian P, there is a basis [e
i
in which this operator is
diagonal.
P =
i
[e
i
p
i
e
i
[
Then
0 = P
2
P =
_
i
[e
i
p
i
e
i
[
__
j
[e
j
p
j
e
j
[
_
i
[e
i
p
i
e
i
[
=
ij
[e
i
p
i
p
j
ij
e
j
[
i
[e
i
p
i
e
i
[ =
i
[e
i
_
p
2
i
p
i
_
e
i
[
Therefore p
2
i
p
i
= 0 and either p
i
= 0 or p
i
= 1. From this we conclude that
P is a projection on the subspace spanning eigenvectors with eigenvalue 1.
To prove the inverse statement we note that any projection operator is
Hermitian because it has real eigenvalues 1 and 0. Furthermore, for any
vector [y
P
2
[y = P[y
= [y
= P[y
which proves that P
2
= P.
G.2 Commuting operators
Lemma G.2 Subspaces A and B are orthogonal if and only if P
A
P
B
=
P
B
P
A
= 0.
Proof. Assume that
P
A
P
B
= P
B
P
A
= 0 (G.1)
668 APPENDIX G. SUBSPACES AND PROJECTION OPERATORS
and suppose that there is vector [y B such that [y is not orthogonal to
A. Then P
A
[y = [y
A
, = 0. From these properties we obtain
P
A
P
B
[y = P
A
[y = [y
A
= P
A
[y
A
P
B
P
A
[y = P
B
[y
A
P
A
P
B
[y
A
= P
A
P
A
[y
A
= P
A
[y
A
, = 0
So, we found a vector [y
A
for which P
A
P
B
[y
A
, = 0 in disagreement with our
original assumption (G.1).
The inverse statement is proven as follows. For each vector [x, the pro-
jection P
A
[x is in the subspace A. If A and B are orthogonal, then the
second projection P
B
P
A
[x yields zero vector. The same arguments show
that P
A
P
B
[x = 0 and P
A
P
B
= P
B
P
A
.
Lemma G.3 If A B then P
A
+ P
B
is the projection on the direct sum
AB.
Proof. If we build an orthonormal basis [e
i
in AB such that rst dim(A)
vectors belong to A and next dim(B) vectors belong to B, then
P
A
+ P
B
=
dim(A)
i=1
[e
i
e
i
[ +
dim(B)
j=1
[e
j
e
j
[ = P
AB
Lemma G.4 If A B (A is a subspace of B) then
P
A
P
B
= P
B
P
A
= P
A
G.2. COMMUTING OPERATORS 669
Proof. If A B then there exists a subspace C in B such that C A and
B = A C.
2
According to Lemmas G.2 and G.3
P
A
P
C
= P
C
P
A
= 0
P
B
= P
A
+ P
C
P
A
P
B
= P
A
(P
A
+ P
C
) = P
2
A
= P
A
P
B
P
A
= (P
A
+ P
C
)P
A
= P
A
If there exist three mutually orthogonal subspaces X, Y and Z, such that
A = X Y and B = X Z, then subspaces A and B (and projections P
A
and P
B
) are called compatible.
Lemma G.5 Subspaces A and B are compatible if and only if their corre-
sponding projections commute
[P
A
, P
B
] = 0
Proof. Let us rst show that if [P
A
, P
B
] = 0 then P
A
P
B
= P
B
P
A
= P
AB
is
the projection on the intersection of subspaces A and B.
First we nd that
(P
A
P
B
)
2
= P
A
P
B
P
A
P
B
= P
2
A
P
2
B
= P
A
P
B
and that operator P
A
P
B
is Hermitian, because
(P
A
P
B
)
= P
B
P
A
= P
B
P
A
= P
A
P
B
Therefore, P
A
P
B
is a projection by Theorem G.1. If A B, then the direct
statement of the Lemma follows from Lemma G.2. Suppose that A and B
are not orthogonal and denote C = A B (C can be empty, of course). We
can always represent A = C X and B = C Y , therefore
2
This subspace is composed of vectors in B, which are orthogonal to A.
670 APPENDIX G. SUBSPACES AND PROJECTION OPERATORS
P
A
= P
C
+ P
X
P
B
= P
C
+ P
Y
[P
C
, P
X
] = 0
[P
C
, P
Y
] = 0
We are left to show that X and Y are orthogonal. This follows from the
commutator
0 = [P
A
, P
B
] = [P
C
+ P
X
, P
C
+ P
Y
]
= [P
C
, P
C
] + [P
C
, P
Y
] + [P
X
, P
C
] + [P
X
, P
Y
] = [P
X
, P
Y
]
Let us now prove the inverse statement. From the compatibility of A and
B it follows that
P
A
= P
X
+ P
Y
P
B
= P
X
+ P
Z
P
X
P
Y
= P
X
P
Z
= P
Y
P
Z
= 0
[P
A
, P
B
] = [P
X
+ P
Y
, P
X
+ P
Z
] = 0
Lemma G.6 If projection P is compatible with all other projections in the
Hilbert space, then either P = 0 or P = 1.
Proof. Suppose that P ,= 0 and P ,= 1. Then P has a non-empty range A,
which is dierent from 1. So, the orthogonal complement A
is not empty as
well. Choose an arbitrary vector y with non-zero components [y
and [y
f
fP
f
(G.2)
where index f now runs over all distinct eigenvalues of F and P
f
are referred
to as spectral projections of F. This means that each Hermitian operator
denes an unique decomposition of unity I =
f
P
f
. Inversely, if P
f
is a
decomposition of unity and f are real numbers then equation (G.2) denes
an unique Hermitian operator.
Lemma G.7 If two Hermitian operators F and G commute then all spectral
projections of F commute with G.
Proof. Consider operator P which is a spectral projection of F. Take any
vector [x in the range of P, i.e.,
P[x = [x
F[x = f[x
for some real f. Let us rst prove that the vector G[x also lies in the range
of P. Indeed, using the commutativity of F and G we obtain
FG[x = GF[x = Gf[x = fG[x
This means that operator G leaves all eigensubspaces of F invariant. Then
for any vector [x the vectors P[x and GP[x lie in the range of P. Therefore
PGP = GP (G.3)
Taking adjoint of both sides we obtain
672 APPENDIX G. SUBSPACES AND PROJECTION OPERATORS
PGP = PG (G.4)
Now subtracting (G.4) from (G.3) we obtain
[G, P] = GP PG = 0
Theorem G.8 Two Hermitian operators F and G commute if and only if
all their spectral projections commute.
Proof. We write
F =
i
f
i
P
i
(G.5)
G =
j
g
j
Q
j
(G.6)
If [P
i
, Q
j
] = 0 for all i, j, then obviously [F, G] = 0. To prove the reverse
statement we notice that from Lemma G.7 each spectral projection P
i
com-
mutes with G. From the same Lemma it follows that each spectral projection
of G commutes with P
i
.
Theorem G.9 If two Hermitian operators F in (G.5) and G in (G.6) com-
mute then there is a basis [e
i
in which both F and G are diagonal, i.e., [e
i
i
P
i
I =
j
Q
j
I = I I =
_
i
P
i
__
j
Q
j
_
=
ij
P
i
Q
j
G.2. COMMUTING OPERATORS 673
where P
i
and Q
j
are spectral projections of operators F and G, respectively.
Since F and G commute, the operators P
i
Q
j
with dierent i and/or j are
projections on mutually orthogonal subspaces. So, these projections form a
spectral decomposition of unity and the desired basis is obtained by coupling
bases in the subspaces P
i
Q
j
.
674 APPENDIX G. SUBSPACES AND PROJECTION OPERATORS
Appendix H
Representations of groups
A representation of a group G is a homomorphism between the group G and
the group of linear transformations in a vector space. In other words, to each
group element g there corresponds a matrix U
g
with non-zero determinant.
1
The group multiplication is represented by the matrix product and
U
g
1
U
g
2
= U
g
1
g
2
U
g
1 = U
1
g
U
e
= I
Each group has a trivial representation in which each group element is rep-
resented by the identity operator. If the linear space of the representation is
a Hilbert space 1, then we can dene a particularly useful class of unitary
representations. These representations are made of unitary operators.
H.1 Unitary representations of groups
Two representations U
g
and U
g
in the Hilbert space 1 are called unitarily
equivalent if there exists a unitary operator V such that
U
g
= V U
g
V
1
(H.1)
1
Matrices with zero determinant cannot be inverted, so they cannot represent group
elements.
675
676 APPENDIX H. REPRESENTATIONS OF GROUPS
Having two representations U
g
and V
g
in Hilbert spaces H
1
and H
2
respec-
tively, we can always build another representation W
g
in the Hilbert space
H = H
1
H
2
by joining two matrices in the block diagonal form.
W
g
=
_
U
g
0
0 V
g
_
(H.2)
This is called the direct sum of two representations. The direct sum is de-
noted by the sign
W
g
= U
g
V
g
A representation is called reducible if there is a unitary transformation
(H.1) that brings representation matrices to the block diagonal form (H.2)
for all g. Otherwise, the representation is called irreducible.
Casimir operators are operators which commute with all representatives
of group elements.
Lemma H.1 (Schurs rst lemma [Hsi00]) Casimir operators of an uni-
tary irreducible representation of any group are constant multiples of the unit
matrix.
From Appendix E.1 we know that elements of any Lie group in the vicinity
of the unit element can be represented as
g = e
A
where A is an element from the Lie algebra of the group. Correspondingly,
any matrix of the unitary group representation in 1 can be written as
U
g
= e
F
A
where F
A
is a Hermitian operator and is a real constant.
2
Operators F
A
form a representation of the Lie algebra in the Hilbert space 1. If the Lie
2
Here we use the Plancks constant, but any other nonzero real constant will do as well.
H.2. STONES THEOREM 677
bracket of two Lie algebra elements is [A, B] = C, then the commutator of
their Hermitian representatives is
[F
A
, F
B
] = iF
c
H.2 Stones theorem
Stones theorem provides a valuable information about unitary representa-
tions of 1-dimensional Lie groups. Such groups are called also one-parameter
Lie groups, because all their elements g(z) can be parameterized with one
real parameter z R, so that
g(0) = e
g(z
1
)g(z
2
) = g(z
1
+ z
2
)
g(z)
1
= g(z)
Theorem H.2 (Stone [Sto32]) If U
g
is a unitary representation of a 1-
dimensional Lie group in the Hilbert space 1, then there exists an Hermitian
operator T in 1, such that
U
g(z)
= e
Tz
(H.3)
This theorem is useful not only for 1-dimensional Lie groups, but also
for Lie groups of arbitrary dimension. The reason is that in any Lie group
one can nd multiple one-parameter subgroups, for which the theorem can be
applied.
3
For example consider an arbitrary Lie group G and a basis vector
t from its Lie algebra. Consider a set of group elements of the form
g(z) = e
z
t
(H.4)
where parameter z runs through all real numbers z R. It is easy to see
that the set (H.4) forms a one-parameter subgroup in G. Indeed, this set
contains the unit element (when z = 0); the group product is dened as
3
See Appendix E.1
678 APPENDIX H. REPRESENTATIONS OF GROUPS
g(z
1
)g(z
2
) = e
z
1
t
e
z
2
t
= e
(z
1
+z
2
)
t
= g(z
1
+ z
2
)
and the inverse element is
g(z)
1
= e
z
t
= g(z)
From the Stones theorem we can then conclude that in any unitary rep-
resentation of G representatives of g(z) have the form (H.3) with some xed
Hermitian operator T.
H.3 Heisenberg Lie algebra
The Heisenberg Lie algebra h
2n
of dimension 2n has basis elements T
i
and
1
i
(i = 1, 2, . . . , n) with Lie brackets
[T
i
, T
j
] = [1
i
, 1
j
] = 0
[1
i
, T
j
] =
ij
The following theorem is applicable
Theorem H.3 (Stone-von Neumann [vN31]) If (P
i
, R
i
) (i = 1, 2, . . . , n)
is a Hermitian representation
4
of the Heisenberg Lie algebra h
2n
in the Hilbert
space 1, then
1. representatives P
i
and R
i
have continuous spectra from to .
4
This means that Hermitian operators P
i
and R
i
satisfy commutation relations
[P
i
, P
j
] = [R
i
, R
j
] = 0
[R
i
, P
j
] = i
ij
where is a real constant.
H.4. DOUBLE-VALUEDREPRESENTATIONS OF THE ROTATION GROUP679
2. any irreducible representation of h
2n
is unitary equivalent to the so-
called Schrodinger representation. In the physically relevant case n = 3,
the Schrodinger representation is the one described in subsection 5.2.3:
Vectors in the Hilbert space are represented by complex functions on R
3
;
operator R multiplies these functions by r; operator P is dierentiation
id/dr.
H.4 Double-valued representations of the ro-
tation group
The rotation group
5
has a peculiar non-trivial topology: Results of two ro-
tations around the same axis by angles and + 2n (with integer n) are
physically indistinguishable. Then the region of independent rotation vec-
tors
6
in R
3
can be described as the interior of the sphere of radius with
opposite points on the surface of the sphere identied. This set of points
will be referred to as ball (see Fig. H.1). The unit element
0 is in the
center of the ball. We will be interested in one-parameter families of group
elements which form continuous curves in the group manifold . Since the
opposite points on the surface of the ball are identied in our topology, any
continuous path that crosses the surface must reappear on the opposite side
of the sphere (see Fig. H.1(a)).
A topological space is simply connected if every loop can be continuously
deformed to a single point. An example of a simply connected topological
space is the surface of a sphere. However, the manifold of the rotation
parameters is not simply connected. The loop shown in Fig. H.1(a) crosses
the sphere once and can not be shrunk to a single point. However, the loop
shown in Fig. H.1(b) can be continuously deformed to a point, because it
crosses the sphere twice. It appears that for any rotation R there are two
classes of paths from the groups unit element
0, reaches
the surface of the sphere at point A and then continues from the opposite
point A
back to
0,
because it crosses the surface an odd number of times (1); (b) a loop
0
A A
B B
0 back to
0
we will end up with the identity operator I again. However, if we travel the
long path
0 A A
0
A A
0 A A
x
,
y
,
z
. See Appendix D.7.
682 APPENDIX H. REPRESENTATIONS OF GROUPS
and an explicit form of the three generators of the representation.
Table H.1: Unitary irreducible representations of SU(2)
Spin: s = 0 s = 1/2 s = 1 s = 3/2, 2, . . .
dimension 1 2 3 2s + 1
< S
2
> 0
3
4
2
2
2
2
s(s + 1)
s
x
or s
y
or s
z
0 /2, /2 , 0, s, (s + 1), . . . ,
(s 1), s
S
x
0
_
0 /2
/2 0
_
_
_
0 0 0
0 0 i
0 i 0
_
_
S
y
0
_
0 i/2
i/2 0
_
_
_
0 0 i
0 0 0
i 0 0
_
_
see, e.g., ref. [Ros57]
S
z
0
_
/2 0
0 /2
_
_
_
0 i 0
i 0 0
0 0 0
_
_
Representations characterized by integer spin s are single-valued. Half-
integer spin representations are double-valued.
8
For example, in the repre-
sentation with s = 1/2, the rotation through the angle 2 around the z-axis
is represented by negative unity
e
Sz2
= exp
_
2i
_
/2 0
0 /2
__
=
_
e
i
0
0 e
i
_
=
_
1 0
0 1
_
= I
while a 4 rotation is represented by the unit matrix
e
Sz4
=
_
1 0
0 1
_
= I
8
see Appendix H.4
H.6. PAULI MATRICES 683
H.6 Pauli matrices
Generators of the spin 1/2 representation of the rotation group (see Table
H.1) can be conveniently expressed through Pauli matrices
i
(i = x, y, z)
S
i
=
2
i
(H.5)
where
t
0
=
_
1 0
0 1
_
(H.6)
x
1
=
_
0 1
1 0
_
(H.7)
y
2
=
_
0 i
i 0
_
(H.8)
z
3
=
_
1 0
0 1
_
(H.9)
For reference we list here some properties of the Pauli matrices
[
i
,
j
] = 2i
3
i=1
ijk
i
,
j
= 2
ij
2
i
= 1
and for arbitrary numerical 3-vectors a and b
( a) = a
0
+ i[ a] (H.10)
( a) = a
0
i[ a] (H.11)
( a)( b) = (a b)
0
+ i [a b] (H.12)
[
pr
a] [
el
a] = [[
el
a]
pr
] a
= (a(
el
pr
)
pr
(
el
a)) a
= a
2
(
el
pr
) (
pr
a)(
el
a)
684 APPENDIX H. REPRESENTATIONS OF GROUPS
Appendix I
Special relativity
In this Appendix we present major assertions of Einsteins special relativity
[Ein05]. In chapter 15 we argued that this theory is approximate. We also
suggested there an alternative rigorous approach, which ensures the validity
of the relativity principle in interacting systems.
I.1 4-vector representation of the Lorentz group
The Lorentz group is a 6-dimensional subgroup of the Poincare group, which
is formed by rotations and boosts. Linear (tensor) representations of the
Lorentz group play a signicant role in many physical problems.
The 4-vector representation of the Lorentz group forms the mathematical
framework of special relativity discussed in this Appendix. This represen-
tation resembles the 3-vector representation of the rotation group.
1
Let us
rst dene the vector space where this representation is acting. This is a
4-dimensional real vector space / whose vectors are denoted by
2
=
_
_
ct
x
y
z
_
_
1
see Appendix D.2
2
Here c is the speed of light. Also in this book we always denote 4-vectors by the tilde.
685
686 APPENDIX I. SPECIAL RELATIVITY
and the pseudoscalar product of any two 4-vectors
1
and
2
can be written
in a number of equivalent forms
3
1
2
c
2
t
1
t
2
x
1
x
2
y
1
y
2
z
1
z
2
=
3
=0
(
1
)
(
2
)
= [ct
1
, x
1
, y
1
, z
1
]
_
_
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
_
_
_
_
ct
2
x
2
y
2
z
2
_
_
=
T
1
g
2
(I.1)
where g
_
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
_
_
For compact notation it is convenient to dene a vector with raised
index and Einsteins convention to sum over repeated indices
=0
g
= (ct, x, y, z)
Then, the pseudoscalar product can be rewritten as
1
2
(
1
)
(
2
)
= (
1
)
(
2
)
(I.2)
The tilde notation allows us to distinguish the pseudoscalar square (or 4-
square) of the 4-vector
2
=
=
2
0
2
1
2
2
2
3
=
2
0
2
3
Here indices and run from 0 to 3:
0
= ct,
1
= x,
2
= y,
3
= z.
I.1. 4-VECTOR REPRESENTATION OF THE LORENTZ GROUP 687
from the square of its 3-vector part
2
( ) =
2
1
+
2
2
+
2
3
A 4-vector (
0
, ) is called space-like if
2
>
2
0
. Time-like 4-vectors have
2
<
2
0
and for null 4-vectors the condition is
2
=
2
0
.
The 4-vector representation of the Lorentz group is dened as a repre-
sentation by linear transformations in the vector space / that conserve the
pseudoscalar product of 4-vectors. In other words, representation matrices
must satisfy
2
1
2
=
T
1
T
g
2
=
T
1
g
2
=
1
2
which means that matrices must have the property
g =
T
g (I.3)
One useful implication of this result is
1
2
=
T
1
T
g
2
=
T
1
g
1
2
=
1
1
2
(I.4)
Another property of can be obtained by taking the determinant of both
sides of (I.3)
1 = det(g) = det(
T
g) = det(
T
) det(g) det() = det()
2
which implies det() = 1. Writing equation (I.3) for the g
00
component we
also get
1 = g
00
=
3
=0
0
g
0
=
2
10
2
20
2
30
+
2
00
It then follows that
2
00
1, which means that either
00
1 or
00
1.
The unit element of the group is represented by the identity transforma-
tion I, which obviously has det(I) = 1 and I
00
= 1. As we are interested
688 APPENDIX I. SPECIAL RELATIVITY
only in rotations and boosts which can be continuously connected to the unit
element, we must choose
det() = 1 (I.5)
00
1 (I.6)
The matrices satisfying equation (I.3) with additional conditions (I.5) - (I.6)
will be called pseudoorthogonal. Thus we can say that 44 pseudoorthogonal
matrices form a representation of the Lorentz group.
Boost transformations can be written as
_
_
ct
_
= B(
)
_
_
ct
x
y
z
_
_
(I.7)
where general pseudoorthogonal matrix of boost is
4
B(
) =
_
_
cosh
x
sinh
y
sinh
z
sinh
sinh 1 +
2
x
x
y
x
sinh
x
y
1 +
2
y
y
sinh
x
z
y
z
1 +
2
z
_
_
(I.8)
and we denoted = (cosh 1)
2
. In particular, boosts along x, y and z
axes are represented by the following 4 4 matrices
B(, 0, 0) =
_
_
cosh sinh 0 0
sinh cosh 0 0
0 0 1 0
0 0 0 1
_
_
(I.9)
B(0, , 0) =
_
_
cosh 0 sinh 0
0 1 0 0
sinh 0 cosh 0
0 0 0 1
_
_
(I.10)
4
compare with equations (2.50) and (2.51)
I.1. 4-VECTOR REPRESENTATION OF THE LORENTZ GROUP 689
B(0, 0, ) =
_
_
cosh 0 0 sinh
0 1 0 0
0 0 1 0
sinh 0 0 cosh
_
_
(I.11)
Conservation of the pseudoscalar product by these transformations can be
easily veried.
Rotations are represented by 4 4 matrices
R(
) =
_
1 0
0 R
_
.
where R is a 3 3 rotation matrix (D.22). A general element of the Lorentz
group can be represented as (rotation) (boost),
5
so its matrix
= R(
)B(
) (I.12)
preserves the pseudoscalar product just as R(
) and B(
) do.
So far we discussed the matrix representation of nite Lorentz transfor-
mations. Let us now nd the matrix representation of the corresponding
Lie algebra. According to our discussion in Appendix H.1, the matrix of a
general Lorentz group element can be represented in the exponential form
= e
aF
where F is an element of the Lie algebra and a is a real constant. Condition
(I.3) then can be rewritten as
0 =
T
g g = e
aF
T
ge
aF
g = (1 + aF
T
+ . . .)g(1 + aF + . . .) g
= a(F
T
g gF) + . . .
where the ellipsis indicates terms proportional to a
2
, a
3
, etc. This sets the
following restriction on the matrices F
F
T
g gF = 0.
We can easily nd 6 linearly independent 4 4 matrices satisfying this con-
dition. Three generators of rotations are
6
5
This order of factors agrees with our convention (2.47).
6
Note that matrices (D.25) - (D.26) are 33 submatrices of (I.13).
690 APPENDIX I. SPECIAL RELATIVITY
x
=
_
_
0 0 0 0
0 0 0 0
0 0 0 1
0 0 1 0
_
_
,
y
=
_
_
0 0 0 0
0 0 0 1
0 0 0 0
0 1 0 0
_
_
,
z
=
_
_
0 0 0 0
0 0 1 0
0 1 0 0
0 0 0 0
_
_
(I.13)
Three generators of boosts can be obtained by dierentiating explicit repre-
sentation of boosts (I.9) - (I.11)
/
x
=
1
c
_
_
0 1 0 0
1 0 0 0
0 0 0 0
0 0 0 0
_
_
, /
y
=
1
c
_
_
0 0 1 0
0 0 0 0
1 0 0 0
0 0 0 0
_
_
, /
z
=
1
c
_
_
0 0 0 1
0 0 0 0
0 0 0 0
1 0 0 0
_
_
(I.14)
These six matrices form a basis of the Lie algebra of the Lorentz group
R(
) = e
B(
) = e
c
, x
) of this event
from the point of view of O
moves with the speed v = c tanh along the x-axis, then the
matrix B(
) is (I.9)
7
For denition of event see subsection 15.2.1.
I.3. BAN ON SUPERLUMINAL SIGNALING 691
B(, 0, 0) =
_
_
cosh sinh 0 0
sinh cosh 0 0
0 0 1 0
0 0 0 1
_
_
(I.15)
and Lorentz transformation (I.7) can be written in a more familiar form
t
= y (I.18)
z
= z (I.19)
It is important to note that special relativity makes the following assertion
Assertion I.1 (the universality of Lorentz transformations) Lorentz trans-
formations (I.16) - (I.19) are exact and universal: they are valid for all kinds
of events; they do not depend on the composition of the physical system and
on interactions acting in the system.
In the main body of this book
8
we explain why Assertion I.1 does not hold
in relativistic theory (RQD) developed here. The key dierence between our
approach and the standard logic of special relativity is that in RQD boost
transformations of space-time coordinates of events involving interacting par-
ticles have a more complicated form, which depends on interaction and on
the state of the physical system. So, from our standpoint all consequences of
the Assertion I.1 described in the rest of this Appendix are neither rigorous
nor accurate.
I.3 Ban on superluminal signaling
Special relativity says that if some physical process occurs at point A at time
t = 0, then it can have absolutely no eect on physical processes occurring
at point B during times less than t = R
AB
/c, where R
AB
is the distance
between points A and B. In other words
8
See, especially, chapter 15.
692 APPENDIX I. SPECIAL RELATIVITY
Assertion I.2 (no superluminal signaling) No signal may propagate faster
than the speed of light.
The proof of this Assertion goes like this [Rus05]. Consider a superluminal
signal propagating between A and B in the reference frame O, so that event
A can be described as the cause and event B is the eect. These two events
have space-time coordinates (t
A
, x
A
) and (t
B
, x
B
), where
t
B
< t
A
+[x
A
x
B
[/c
t
B
t
A
The fact of superluminal signal propagation, by itself, does not contradict
any sacred physical principle. The problem arises in the moving frame of
reference O
in which, according
to Lorentz transformations (I.16) - (I.19), the time order of events (t
A
and
t
B
) changes, i.e., instead of event B being later than A, it actually occurs
earlier that A (t
B
< t
A
). This means that for observer O
moves away
from O with speed v < c. Suppose that both frames contain devices that can
send superluminal signals. To simplify our discussion, we will consider the
extreme case of signals propagating with innite velocity. On the space-time
diagram (Fig. I.1) world-lines of the two devices are shown by bold lines. At
time t = 0 (measured by the clock in O) all events located on the horizontal
x-axis appear simultaneous from the point of view of O. On the other hand,
space-time events on the axis x
at point B of her
world-line. Upon the arrival of the signal, O
, t
,
x
, y
, z
of a coordinate system K
= l/ cosh (I.20)
from the point of view of a moving reference frame. Another well-known
result is that the duration of time intervals between any two events increases
by the same factor cosh
I.5. DECAY OF MOVING PARTICLES IN SPECIAL RELATIVITY 695
t
= t cosh (I.21)
One experimentally veriable consequence of this time dilation formula will
be discussed in the next section.
I.5 Decay of moving particles in special rela-
tivity
Suppose that from the viewpoint of observer O the unstable particle is pre-
pared at rest in the origin x = y = z = 0 at time t = 0 in the non-decayed
state, so that (0, 0) = 1.
9
Then observer O may associate the space-time
point
(t, x, y, z)
prep
= (0, 0, 0, 0) (I.22)
with the event of preparation. We know that the non-decay probability
decreases with time by (almost) exponential decay law
10
(0, t) exp
_
0
_
(I.23)
At time t =
0
the non-decay probability is exactly (0,
0
) = e
1
. This one
lifetime event has space-time coordinates
(t, x, y, z)
life
= (
0
, 0, 0, 0) (I.24)
according to the observer O.
Let us now take the point of view of the moving observer O
. Ac-
cording to special relativity, this observer will also see the preparation
9
Here we follow notation from chapter 13 by writing (, t) the non-decay probability
observed from the reference frame O
).
10
Actually, as we saw in subsection 13.2.3, the decay law is not exactly exponential, but
this is not important for our derivation of equation (I.25) here.
696 APPENDIX I. SPECIAL RELATIVITY
and the one lifetime events, when the non-decay probabilities are 1 and
e
1
, respectively. However, observer O
, the preparation
event has coordinates (0, 0, 0, 0) and the lifetime event has coordinates
(
0
cosh , c
0
sinh , 0, 0). Therefore, the time elapsed between these two
events is cosh times longer than in the reference frame O. This also means
that the decay law is exactly cosh slower from the point of view of the mov-
ing observer O
0
=
_
_
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
_
_
=
_
0
0
0
0
_
=
_
1 0
0 1
_
(J.1)
x
=
_
_
0 0 0 1
0 0 1 0
0 1 0 0
1 0 0 0
_
_
=
_
0
x
x
0
_
1
On the right hand sides each 2 2 block is expressed in terms of Pauli matrices from
Appendix H.6
697
698 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
y
=
_
_
0 0 0 i
0 0 i 0
0 i 0 0
i 0 0 0
_
_
=
_
0
y
y
0
_
z
=
_
_
0 0 1 0
0 0 0 1
1 0 0 0
0 1 0 0
_
_
=
_
0
z
z
0
_
=
_
0
0
_
(J.2)
These matrices satisfy the following properties
2
0
=
0
=
0
(J.3)
= 2g
(J.4)
0
= 1 (J.5)
i
= 1 (J.6)
Tr(
) = 0 (J.7)
Tr(
) = 4g
(J.8)
=
x
z
+
0
0
= 4 (J.9)
+ 2g
= 4
+ 2
= 2
(J.10)
If A, B, C are any linear combinations of gamma-matrices, then
= 2A (J.11)
AB
ABC
= 2CBA (J.13)
J.2 Bispinor representation of the Lorentz group
In this section, we would like to consider bispinor representation T() of the
Lorentz group. Similar to the 4-vector representation from Appendix I.1, the
bispinor representation is realized by 4 4 matrices.
2
The indices take values , = 0, 1, 2, 3, i = 1, 2, 3.
J.2. BISPINOR REPRESENTATION OF THE LORENTZ GROUP 699
The boost and rotation generators of the bispinor representation of the
Lorentz group are dened through commutators of gamma matrices
/ =
i
4c
[
0
, ] =
i
2c
_
0
0
_
(J.14)
x
=
i
4
[
y
,
z
] =
2
_
x
0
0
x
_
(J.15)
y
=
i
4
[
z
,
x
] =
2
_
y
0
0
y
_
(J.16)
z
=
i
4
[
x
,
y
] =
2
_
z
0
0
z
_
(J.17)
Using properties of Pauli matrices from Appendix H.6, it is not dicult
to verify that these generators indeed satisfy commutation relations of the
Lorentz algebra (3.53), (3.54) and (3.56). For example,
[
x
,
y
] =
2
4
_
[
x
,
y
] 0
0 [
x
,
y
]
_
=
i
2
2
_
z
0
0
z
_
= i
z
[
x
, /
y
] =
i
2
4c
__
x
0
0
x
_ _
0
y
y
0
_
_
0
y
y
0
_ _
x
0
0
x
__
=
2
2c
_
0
z
z
0
_
= i/
z
[/
x
, /
y
] =
2
4c
2
__
0
x
x
0
_ _
0
y
y
0
_
_
0
y
y
0
_ _
0
x
x
0
__
=
2
4c
2
_
[
x
,
y
] 0
0 [
x
,
y
]
_
=
i
2
2c
2
_
z
0
0
z
_
=
i
c
2
z
We also get the following representation of nite boosts
3
T
ij
(e
ic
) = exp
_
1
2
_
0
0
__
= 1 +
1
2
_
0
0
_
+
1
2!
_
2
_
2
_
1 0
0 1
_
+ . . .
3
Note that this representation is not unitary.
700 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
= I cosh
2
+
2c
i
sinh
2
(J.18)
This equation allows us to prove another important property of gamma ma-
trices
T
1
()
T() =
(J.19)
where is any Lorentz transformation and
is a 4 4 matrix (I.12)
realizing the 4-vector representation of the Lorentz group. Indeed, let us
consider a particular case of this formula with = 0 and being a boost
with rapidity along the x-axis. Then
T
1
()
0
T()
=
_
I cosh
2
2c
i
/
x
sinh
2
_
0
_
I cosh
2
+
2c
i
/
x
sinh
2
_
=
_
cosh
2
_
1 0
0 1
_
sinh
2
_
0
x
x
0
___
1 0
0 1
_
_
cosh
2
_
1 0
0 1
_
+ sinh
2
_
0
x
x
0
__
= cosh
2
2
_
1 0
0 1
_
2 sinh
2
cosh
2
_
0
x
x
0
_
+ sinh
2
2
_
1 0
0 1
_
=
0
cosh +
x
sinh
In agreement with formula for the boost matrix
(I.9).
One can also check for pure boosts
0
T(e
ic
)
0
= 1 +
1
2
0
_
0
0
_
0
+
1
2!
_
2
_
2
_
1 0
0 1
_
+ . . .
= 1
1
2
_
0
0
_
+
1
2!
_
2
_
2
_
1 0
0 1
_
+ . . .
= T
_
e
ic
_
= T
1
_
e
ic
_
J.3. CONSTRUCTION OF THE DIRAC FIELD 701
A similar calculation for rotations should convince us that for a general trans-
formation from the Lorentz group
0
T()
0
= T
1
() (J.20)
Another useful formula is
D()
0
D() = D()
0
D()
0
0
= D()D
1
()
0
=
0
(J.21)
It will be convenient to introduce a slash notation for pseudoscalar prod-
ucts of
with 4-vectors
k
,k k
0
k
0
k (J.22)
,k
2
=
= 1/2(
)k
= g
=
k
2
(J.23)
(,k mc
2
)(,k + mc
2
) = ,k ,k m
2
c
4
=
k
2
m
2
c
4
(J.24)
,k+ ,k
= 2k
(J.25)
J.3 Construction of the Dirac eld
According to the Step 1 in subsection 9.1.1, in order to construct relativistic
interaction operators, we need to associate with each particle type a nite-
dimensional representation of the Lorentz group and a quantum eld. In this
section we are going to build the quantum eld for electrons and positrons.
We postulate that this Dirac eld has 4 components that transform by means
of the representation T() constructed above. The explicit formula for the
eld is
4
( x)
(x, t)
=
_
dp
(2)
3/2
mc
2
_
e
p x
u
(p, )a
p,
+ e
i
p x
v
(p, )b
p,
_
(J.26)
4
This form (apart from the overall normalization of the eld) can be uniquely estab-
lished from the properties (I) - (IV) in Step 1 of subsection 9.1.1. The bispinor index
takes values 1,2,3,4.
702 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
Here a
p,
is the electron annihilation operator and b
p,
is the positron creation
operator. For brevity, we denote p (
p
, cp
x
, cp
y
, cp
z
) the energy-momentum
4-vector and x (t, x/c, y/c, z/c) the 4-vector in the Minkowski space-time.
5
The pseudo-scalar product of the 4-vectors is denoted by dot: p x p
px
p
t and
p
_
m
2
c
4
+ p
2
c
2
. Numerical factors u
(p, ) and v
(p, )
will be discussed in Appendix J.4. Note that according to equations (8.36)
and (8.37)
(x, t) = e
H
0
t
(x, 0)e
i
H
0
t
(J.27)
so, the t-dependence demanded by equation (8.52) for regular operators is
satised in our denition (J.26).
The Dirac eld can be represented by a 4-component column of operator
functions
( x) =
_
1
( x)
2
( x)
3
( x)
4
( x)
_
_
We will also need the conjugate eld
( x) =
_
dp
(2)
3/2
mc
2
_
e
i
p x
u
(p, )a
p,
+ e
p x
v
(p, )b
p,
_
usually represented as a row
= [
1
,
2
,
3
,
4
]
The adjoint eld
5
As discussed in section 15.5, the only purpose for introducing quantum elds is to
build interaction operators as in (9.13) - (9.14). In these formulas eld arguments x, y, z
are integration variables. Therefore, they should not be identied with positions in space.
Moreover, in applications bispinor labels serve as dummy summation indices, so no
physical meaning should be assigned to them as well.
J.4. PROPERTIES OF FACTORS U AND V 703
( x)
( x)
0
(J.28)
is also represented as a row
=
0
= [
1
,
2
,
3
,
4
]
_
_
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
_
_
= [
1
,
2
,
3
,
4
]
The quantum eld for the proton-antiproton system is built similarly to
(J.26)
( x) =
_
dp
(2)
3/2
Mc
2
_
e
p x
w(p, )d
p,
+ e
i
p x
s(p, )f
p,
_
(J.29)
where
p
=
_
M
2
c
4
+ p
2
c
2
, M is the proton mass,
P x px
p
t and
functions w(p, ) and s(p, ) are the same as u(p, ) and v(p, ) but with
the electron mass m replaced by the proton mass M.
J.4 Properties of factors u and v
The key components of the quantum eld formula (J.26) are numerical func-
tions u
(p, ) and v
_
0 1
1 0
0 0
0 0
_
_
, v(0) =
_
_
0 0
0 0
0 1
1 0
_
_
704 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
Sometimes it is convenient to represent these matrices as four vectors-columns
u(0, 1/2) =
_
_
0
1
0
0
_
_
(J.30)
u(0, 1/2) =
_
_
1
0
0
0
_
_
(J.31)
v(0, 1/2) =
_
_
0
0
0
1
_
_
(J.32)
v(0, 1/2) =
_
_
0
0
1
0
_
_
(J.33)
We will get more compact formulas if we introduce 2-component quantities
1/2
=
_
1
0
_
,
1/2
=
_
0
1
_
,
1/2
= (1, 0),
1/2
= (0, 1) (J.34)
Then we can write
u(0, ) =
_
0
_
, v(0, ) =
_
0
_
Let us verify that matrix u(0) has the following property
(R)u
(0, ) =
(0, )D
1/2
(R) (J.35)
where T is the bispinor representation of the Lorentz group,
6
D
1/2
is the
2-dimensional unitary irreducible representation of the rotation group,
7
and
6
see Appendix J.1
7
see Table H.1
J.4. PROPERTIES OF FACTORS U AND V 705
R is any rotation. By denoting
k
the generators of rotations in the rep-
resentation T
(R) and S
k
the generators of rotations in the representation
D
1/2
(
k
)
(R)u
(0, ) =
(0, )(S
k
)
(R)
Let us check that this equation is satised for rotations around the x-axis.
Acting with the 4 4 matrix (J.15)
x
=
2
_
_
0 1 0 0
1 0 0 0
0 0 0 1
0 0 1 0
_
_
on the index in u
(0, ) we obtain
x
u(0) =
2
_
_
0 1 0 0
1 0 0 0
0 0 0 1
0 0 1 0
_
_
_
_
0 1
1 0
0 0
0 0
_
_
=
2
_
_
1 0
0 1
0 0
0 0
_
_
This has the same eect as acting with 2 2 matrix (see Table H.1)
S
x
=
2
_
0 1
1 0
_
on the index in u
(0, )
u(0)J
x
=
2
_
_
0 1
1 0
0 0
0 0
_
_
_
0 1
1 0
_
=
2
_
_
1 0
0 1
0 0
0 0
_
_
This proves equation (J.35). Similarly, one can show
706 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
(R)v
(0, ) =
(0, )D
1/2
(R)
The corresponding formula for the adjoint factor u is obtained as follows:
take the Hermitian conjugate of (J.35), multiply it by
0
from the right and
take into account equations (J.5) and (J.20)
u
(0, )
0
0
T
(R)
0
=
(0, )
0
D
1/2
(R)
u(0, )T
(R) =
u(0, )D
1/2
(R) (J.36)
So far, we have discussed zero-momentum values of functions u and v.
The values of u
(p, ) and v
(p, )
(
p
)u
(0, ) (J.37)
v
(p, )
(
p
)v
(0, ) (J.38)
Taking a Hermitian conjugate of (J.37) and multiplying by
0
from the right
we obtain factors in adjoint elds
u(p, ) u
(p, )
0
= u
(0, )T
(
p
)
0
= u
(0, )
0
0
T(
p
)
0
= u
(0, )
0
T
1
(
p
) = u(0, )T
1
(
p
) (J.39)
v(p, ) = v(0, )T
1
(
p
)
J.5 Explicit formulas for u and v
Now let us nd explicit expressions for factors u, v, u and v for all momenta.
Using formulas (5.3), (J.18), (J.14) and
J.5. EXPLICIT FORMULAS FOR U AND V 707
= tanh
1
(v/c)
tanh
2
=
tanh
1 +
_
1 tanh
2
=
v/c
1 +
_
1 v
2
/c
2
=
pc
p
+ mc
2
cosh
2
=
1
_
1 tanh
2
2
=
_
p
+ mc
2
2mc
2
sinh
2
= tanh
2
cosh
2
we obtain
T(
p
) = e
ic
K
p
p
p
= I cosh
p
2
+
2c
i
/ p
p
sinh
p
2
= cosh
p
2
_
1 0
0 1
_
+ sinh
p
2
_
0
p
p
p
p
0
_
= cosh
p
2
_
1 + tanh
p
2
_
0
p
p
p
p
0
__
=
_
p
+ mc
2
2mc
2
_
1 +
pc
p
+ mc
2
_
0
p
p
p
p
0
__
=
_
p
+ mc
2
2mc
2
_
1
pc
p+mc
2
pc
p+mc
2
1
_
Then, inserting this result in (J.37) we obtain
u(p, ) =
_
p
+ mc
2
2mc
2
_
1
pc
p+mc
2
pc
p+mc
2
1
_
_
1
0
_
=
_ _
p
+ mc
2
_
p
mc
2
_
p
p
_
_
2mc
2
(J.40)
Similarly, the explicit expressions for v, u and v are
708 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
v(p, ) =
_ _
p
mc
2
(
p
p
)
_
p
+ mc
2
_
2mc
2
(J.41)
u(p, ) =
2mc
2
_
_
p
+ mc
2
,
_
p
mc
2
_
p
p
__
(J.42)
v(p, ) =
2mc
2
_
_
p
mc
2
_
p
p
_
,
_
p
+ mc
2
_
(J.43)
These functions are normalized to unity in the sense that
8
u(p, )u(p,
)
=
_
_
p
+ mc
2
,
_
p
mc
2
_
p
p
__
_ _
p
+ mc
2
p
mc
2
_
p
p
_
_
1
2mc
2
=
p
+ mc
2
(
p
mc
2
)
(p )(p )
p
2
_
1
2mc
2
=
=
,
(J.44)
Let us also calculate the sum
1/2
=1/2
u(p, )u
1/2
u(0, )u
(0, ) =
_
_
1 0 0 0
0 0 0 0
0 0 0 0
0 0 0 0
_
_
+
_
_
0 0 0 0
0 1 0 0
0 0 0 0
0 0 0 0
_
_
=
_
_
1 0 0 0
0 1 0 0
0 0 0 0
0 0 0 0
_
_
=
1
2
_
1 +
0
_
To generalize this formula for arbitrary momentum, we use (J.37), (J.39),
the Hermiticity of the matrix T(
p
) and properties (J.5), (J.19) - (J.21)
1/2
=1/2
u(p, )u
(p, ) = T(
p
)
_
_
1/2
=1/2
u(0, )u
(0, )
_
_
T
(
p
)
8
Here we used (H.12).
J.6. CONVENIENT NOTATION 709
=
1
2
T(
p
)
_
1 +
0
_
T(
p
) =
1
2
_
T(
p
)T(
p
) +
0
_
=
1
2
_
T(
p
)
0
0
T(
p
)
0
0
+
0
_
=
1
2
_
T(
p
)
0
T
1
(
p
)
0
+
0
_
=
1
2
_
T(
p
)
0
T
1
(
p
) + 1
_
0
=
1
2
_
0
cosh +
sinh + 1
_
0
=
1
2mc
2
_
p
pc + mc
2
_
0
=
1
2mc
2
_
,p + mc
2
_
0
(J.45)
Similarly we can derive a number of useful formulas
9
1/2
=1/2
u(p, )u(p, ) =
1
2mc
2
_
,p + mc
2
_
(J.46)
1/2
=1/2
u
(p, )u
(p, ) =
1
2mc
2
Tr
_
p
cp + mc
2
= 2
1/2
=1/2
v(p, )v
(p, ) =
1
2mc
2
_
,p mc
2
_
0
(J.47)
1/2
=1/2
v(p, )v(p, ) =
1
2mc
2
_
,p mc
2
_
(J.48)
1/2
=1/2
v
(p, )v
(p, ) = 2
J.6 Convenient notation
To simplify QED calculations we introduce the following combinations of
particle operators
A
(p) =
mc
2
(p, )a
p,
(J.49)
9
Here we used the facts that the trace of any gamma-matrix is zero and that the trace
of the unit 44 matrix is 4.
710 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
A
(p) =
mc
2
(p, )a
p,
(J.50)
B
(p) =
mc
2
(p, )b
p,
(J.51)
B
(p) =
mc
2
(p, )b
p,
(J.52)
D
(p) =
Mc
2
(p, )d
p,
(J.53)
D
(p) =
Mc
2
(p, )d
p,
(J.54)
F
(p) =
Mc
2
(p, )f
p,
(J.55)
F
(p) =
Mc
2
(p, )f
p,
(J.56)
In this notation, indices , = 1, 2, 3, 4 are those corresponding to the
bispinor representation of the Lorentz group, and index = 1/2 enumer-
ates two spin projections of fermions.
With the above conventions, the electron/positron and proton/antiproton
quantum elds can be written compactly
( x) = (2)
3/2
_
dp
_
e
p x
A
(p) + e
i
p x
B
(p)
_
(J.57)
( x) = (2)
3/2
_
dp
_
e
P x
D
(p) + e
i
P x
F
(p)
_
(J.58)
J.7 Transformation laws
Operators (J.49)-(J.56) have simple boost transformation laws. For example,
we can use (3.59), (8.37), (5.16) and (J.35) to obtain
U
0
(; 0)A(p)U
1
0
(; 0) = e
ic
K
0
A(p)e
ic
K
0
mc
2
u(p, )e
ic
K
0
a
p,
e
ic
K
0
mc
2
p
_
u(p, )
D
1/2
W
)a
p,
mc
2
p
_
p
T(
p
)
u(0, )
D
1/2
W
)a
p,
mc
2
p
_
p
T(
p
)T(
W
)
u(0, )a
p,
=
mc
2
p
_
p
T(
p
)T(
1
p
1
p
)
u(0, )a
p,
=
mc
2
p
_
p
T(
1
)T(
p
)
u(0, )a
p,
=
_
mc
2
p
T(
1
)
u(p, )a
p,
=
p
p
T(
1
)A(p) (J.59)
Similarly, using (J.36)
U
0
(; 0)A
(p)U
1
0
(; 0) =
mc
2
u(p, )e
ic
K
0
p,
e
ic
K
0
mc
2
p
_
u(p, )
(D
1/2
)
W
)a
p,
mc
2
p
_
u(0,
)(D
1/2
)
W
)T
1
(
p
)a
p,
mc
2
p
_
u(0, )T(
1
p
p
)T
1
(
p
)a
p,
=
mc
2
p
_
u(p, )T()a
p,
712 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
=
p
p
A
(p)T() (J.60)
Let us show that quantum eld
( x)U
1
0
(; a) =
j
T
(
1
)
(( x + a)) (J.61)
Transformations with respect to translations are
U
0
(1; a)
( x)U
1
0
(1; a)
=
_
dp
(2)
3/2
_
e
p x
U
0
(1; a)A
(p)U
1
0
(1; a) + e
i
p x
U
0
(1; a)B
(p)U
1
0
(1; a)
_
=
_
dp
(2)
3/2
_
e
p( x+ a)
A
(p) + e
i
p( x+ a)
B
(p)
_
=
( x + a)
For transformations with respect to boosts we use equations (J.59), (J.60),
(5.26) and (I.4)
U
0
(; 0)( x)U
1
0
(; 0)
= (2)
3/2
_
dp
_
e
p x
U
0
(; 0)A(p)U
1
0
(; 0) + e
i
p x
U
0
(; 0)B
(p)U
1
0
(; 0)
_
= (2)
3/2
T(
1
)
_
dp
p
_
e
p x
A(p) + e
i
p x
B
(p)
_
= (2)
3/2
T(
1
)
_
dq
_
e
(
1
q x)
A(q) + e
i
(
1
q x)
B
(q)
_
= (2)
3/2
T(
1
)
_
dq
_
e
( q x)
A(q) + e
i
( q x)
B
(q)
_
= T(
1
)( x)
We leave to the reader the proof of equation (J.61) in the case of rotations.
Thus we conclude that in agreement with Step 1(II) in subsection 9.1.1,
the Dirac eld transforms according to the 4D bispinor representation of the
Lorentz group.
J.8. FUNCTIONS U
AND W
. 713
J.8 Functions U
and W
.
In QED calculations one often meets products like u
u and w
w. It is
convenient to introduce special symbols for them
U
(p, ; p
) u(p, )
u(p
) (J.62)
W
(p, ; p
) w(p, )
w(p
) (J.63)
Note that quantities U
and W
_
p, R
1
(p, ); p
, R(p
, )
_
u
_
p, R
1
(p, )
_
u (p
, R(p
, )
)
= u
_
0, R
1
(p, )
_
T
1
(
p
)
T(
p
)u (0, R(p
, )
)
= u
_
0, R
1
(p, )
_
T
1
_
p
R
1
(p, )
_
T
_
p
R
1
(p
, )
_
u (0, R(p
, )
)
= u
_
0, R
1
(p, )
_
T
_
R
1
(p, )
_
T
1
(
p
) T
1
()
T()T(
p
) T
_
R
1
(p
, )
_
u (0, R(p
, )
)
= u(0, )T
1
(
p
)T
1
()
T()T(
p
)u(0,
)
= u(p, )T
1
()
T()u(p
)
= u(p, ) (
) u(p
)
=
(p, ; p
)
J.9 (v/c)
2
approximation
Often it is useful to obtain QED results in a weakly-relativistic or non-
relativistic case, when momenta of electrons are much less than mc and
momenta of protons are much less than Mc. In these case, with reasonable
accuracy we can represent all quantities as series in powers of v/c and leave
only terms having orders not higher than (v/c)
2
. First, we can use (8.101)
to write
10
Such a transformation acts by matrix (rotationboost) on momentum arguments
and by the corresponding Wigner rotation R on spin components. See subsection 9.2.2.
714 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
_
p
+ mc
2
_
mc
2
+
p
2
2m
+ mc
2
=
_
2mc
2
+
p
2
2m
=
2mc
2
_
1 +
p
2
4m
2
c
2
2mc
2
_
1 +
p
2
8m
2
c
2
_
_
p
mc
2
_
mc
2
+
p
2
2m
mc
2
=
p
2m
(q + k q)
2
= (
q+k
q
)
2
c
2
k
2
c
2
k
2
(J.64)
Mmc
4
_
pk
q+k
1
_
1 +
(pk)
2
2M
2
c
2
1
_
1 +
p
2
2M
2
c
2
1
_
1 +
(q+k)
2
2m
2
c
2
1
_
1 +
q
2
2m
2
c
2
1
(p k)
2
4M
2
c
2
p
2
4M
2
c
2
(q +k)
2
4m
2
c
2
q
2
4m
2
c
2
= 1
p
2
2M
2
c
2
+
pk
2M
2
c
2
k
2
4M
2
c
2
q
2
2m
2
c
2
qk
2m
2
c
2
k
2
4m
2
c
2
(J.65)
To obtain the (v/c)
2
approximation for expressions (J.62), (J.63) we use
equations (J.40) - (J.43) and (H.10) - (H.12)
U
0
(p, ; p
) = u(p, )
0
u(p
) = u
(p, )u(p
)
=
_
_
p
+ mc
2
,
_
p
mc
2
_
p
p
__
_
_
p
+ mc
2
_
p
mc
2
(
p
)
_
1
2mc
2
=
_
_
p
+ mc
2
_
p
+ mc
2
+
_
p
mc
2
_
p
mc
2
(p )(p
)
pp
1
2mc
2
__
1 +
p
2
8m
2
c
2
__
1 +
(p
)
2
8m
2
c
2
_
+
pp
4m
2
c
2
(p )(p
)
pp
_
1 +
p
2
+ (p
)
2
+ 2p p
+ 2i [p p
]
8m
2
c
2
_
J.9. (V/C)
2
APPROXIMATION 715
=
_
1 +
(p +p
)
2
+ 2i [p p
]
8m
2
c
2
_
(J.66)
W
0
(p, ; p
) = w(p, )
0
w(p
_
1 +
(p +p
)
2
+ 2i [p p
]
8M
2
c
2
_
(J.67)
U(p, ; p
) = u(p, )u(p
)
=
_
_
p
+ mc
2
,
_
p
mc
2
p
p
_ _
0
0
_
_
_
p
+ mc
2
_
p
mc
2
p
1
2mc
2
=
_
_
p
+ mc
2
,
_
p
mc
2
p
p
_
_
_
p
mc
2
(p
)
p
p
+ mc
2
_
1
2mc
2
=
_
_
p
+ mc
2
_
p
mc
2
(p
)
p
+
_
p
mc
2
_
p
+ mc
2
(p )
p
_
1
2mc
2
2mc
2
p
2m
(p
)
p
2mc
2
p
2m
(p )
p
_
1
2mc
2
=
(( p) +( p
))
1
2mc
=
(p + i[ p] +p
i[ p
])
1
2mc
=
(p +p
+ i[ (p p
)])
1
2mc
(J.68)
W(p, ; p
(p +p
+ i[ (p p
)])
1
2Mc
(J.69)
In the non-relativistic limit c , all formulas are further simplied
lim
c
p
= mc
2
lim
c
p
= Mc
2
lim
c
Mmc
4
_
pk
q+k
q
= 1
lim
c
U
0
(p, ; p
) =
=
,
(J.70)
lim
c
W
0
(p, ; p
) =
,
lim
c
U(p, ; p
) = 0
716 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
lim
c
W(p, ; p
) = 0
J.10 Anticommutation relations
To check the anticommutation relations (9.3) we calculate, for example,
11
(x, 0),
(y, 0)
=
_
dp
(2)
3/2
mc
2
p
dp
(2)
3/2
mc
2
1/2
=1/2
_
e
px
u
(p, )a
p,
+ e
i
px
v
(p, )b
p,
_
,
_
e
i
y
u
(p
)a
+ e
y
v
(p
)b
p
=
_
dpdp
(2)
3
mc
2
1/2
=1/2
_
e
px+
i
y
u
(p, )u
(p
)a
p,
, a
+e
i
px
i
y
v
(p, )v
(p
)b
p,
, b
p
,
_
=
_
dpdp
mc
2
(2)
3
p
1/2
=1/2
_
e
p(xy)
u
(p, )u
(p
)(p p
)
,
+e
i
p(xy)
v
(p, )v
(p
)(p p
)
,
_
= (2)
3
_
dpmc
2
p
1/2
=1/2
_
e
p(xy)
u
(p, )u
(p, ) + e
i
p(xy)
v
(p, )v
(p, )
_
=
_
dpmc
2
(2)
3
p
e
p(xy)
1/2
=1/2
_
u
(p, )u
(p, ) + v
(p, )v
(p, )
_
=
_
dpmc
2
(2)
3
p
e
p(xy)
p
mc
2
(
0
0
)
= (x y)
(J.71)
11
Here we used (J.45) and (J.47).
J.11. DIRAC EQUATION 717
We will also nd useful the following anticommutators
A
(p), A
(p
) =
mc
2
(p, )u
(p
)a
p,
, a
=
mc
2
p
_
(p, )u
(p, )
_
(p p
)
=
1
2
p
(
0
p
pc + mc
2
)
(p p
) (J.72)
(p), A
(p
) = 2(p p
) (J.73)
B
(p), B
(p
) =
1
2
p
(
0
p
pc mc
2
)
(p p
) (J.74)
(p), B
(p
) = 2(p p
) (J.75)
J.11 Dirac equation
We can write the electron-positron quantum eld (J.26) as a sum of two
terms
( x) =
+
( x) +
( x)
( x)
_
dp
(2)
3/2
mc
2
p
e
p x
u
(p, )a
p,
( x)
_
dp
(2)
3/2
mc
2
p
e
i
p x
v
(p, )b
p,
Let us now act on the component
+
( x) by operator in parentheses
12
_
0
t
+ c
x
imc
2
+
( x)
12
Here we use explicit denitions of gamma matrices from (J.1) and (J.2) as well as
equation (J.40).
718 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
=
_
0
t
+ c
x
imc
2
_
dp
(2)
3/2
mc
2
p
e
px+
i
pt
u(p, )a
p,
=
i
_
dp
(2)
3/2
mc
2
p
(
0
p
c p mc
2
)u(p, )e
px+
i
pt
a
p,
For the product on the right hand side we obtain
(
0
p
c p mc
2
)u(p, )
=
p
_ _
p
+ mc
2
p
mc
2
(
p
p
)
_
2mc
2
_ _
p
mc
2
p
p
+ mc
2
( p)
_
2mc
2
mc
2
u(p, )
=
_
p
_
p
+ mc
2
(
p
mc
2
)
_
p
+ mc
2
mc
2
_
p
+ mc
2
p
_
p
mc
2
(
p
p
) +
_
p
+ mc
2
(
p
p
)pc mc
2
_
p
mc
2
(
p
p
)
_
2mc
2
=
_
(
p
mc
2
)
_
p
+ mc
2
(
p
mc
2
)
_
p
+ mc
2
((
p
+ mc
2
)
_
p
mc
2
+ (
p
+ mc
2
)
_
p
mc
2
)(
p
p
)
_
2mc
2
= 0 (J.76)
This leads to the Dirac equation for the eld component
+
( x)
_
0
t
+ c
x
imc
2
+
( x) = 0 (J.77)
The same equation is satised by the component
0
t
+ c
x
imc
2
_
(x) = 0 (J.78)
The equation conjugate to (J.76) is
0 = u
(p, )
_
(
0
)
p
c()
p mc
2
_
= u
(p, )
_
p
+ c p mc
2
_
= u
(p, )
0
0
_
p
+ c p mc
2
_
= u(p, )
0
_
p
+ c p mc
2
_
= u(p, )
_
p
c p mc
2
_
0
(J.79)
Therefore, the equation satised by the conjugated eld is
J.11. DIRAC EQUATION 719
( x)
0
c
x
( x)
+
imc
2
( x) = 0
or multiplying from the right by
0
and using (J.3)
t
( x)
0
+ c
x
( x) +
imc
2
( x) = 0 (J.80)
It should be emphasized that in our approach to QFT Dirac equation
appears as a rather unremarkable property of the electron-positron quan-
tum eld ( x) . This equation does not play a fundamental role assigned
to it in many textbooks. Denitely, Dirac equation cannot be regarded as a
relativistic analog of the Schr odinger equation for electrons.
13
The correct
electron wave functions and corresponding relativistic Schr odinger equations
should be constructed by using Wigner-Dirac theory of unitary representa-
tions of the Poincare group. For free electrons such derivations are performed
in chapter 5. The relativistic analog of the Schr odinger equation for an in-
teracting electron-proton system is constructed in chapter 12.
In the slash notation (J.22) the momentum-space Dirac equations (J.76)
and (J.79) take compact forms
(,p mc
2
)u(p, ) = 0 (J.81)
u(p, )(,p mc
2
) = 0 (J.82)
If we denote ,k ,p
(p, ; p
)k
= u(p, ) ,ku(p
) = u(p, )[,p
u(p
)] [u(p, ) ,p]u(p
)
= (mc
2
mc
2
)u(p, )u(p
) = 0 (J.83)
W
(p, ; p
)k
= 0 (J.84)
We will also need the Gordon identity
14
13
A point of view similar to ours is adopted also in textbook [Wei95].
14
See Problem 3.2 in [PS95b].
720 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
u(p, )(
,k ,k
)u(p
) = u(p, )(
(,p
,p) (,p
,p)
)u(p
)
= u(p, )(
(mc
2
,p) (,p
mc
2
)
)u(p
)
= u(p, )(2
mc
2
,p ,p
)u(p
)
= u(p, )(2
mc
2
+ ,p
2p
,p
2p
)u(p
)
= u(p, )(2
mc
2
+ mc
2
2p
+ mc
2
2p
)u(p
)
= u(p, )(4mc
2
2p
2p
)u(p
) (J.85)
J.12 Fermion propagator
Let us calculate the electron propagator, which is frequently used in Feynman-
Dyson perturbation theory
T
ab
( x
1
, x
2
) 0[T(
a
( x
1
)
b
( x
2
))[0
if t
1
> t
2
we can omit the time ordering sign and use (J.46)
T
ab
( x
1
, x
2
) = 0[
a
( x
1
)
b
( x
2
)[0 0[(a + b
)(a
+ b)[0 0[aa
[0
= 0[
_
_
dp
(2)
3/2
mc
2
p x
1
u
a
(p, )a
p,
_
_
_
dq
(2)
3/2
mc
2
e
i
q x
2
u
b
(q, )a
q,
_
0
[0
=
_
dpdq
(2)
3
mc
2
p x
1
u
a
(p, )e
i
q x
2
u
b
(q, )(p q)
=
_
dp
(2)
3
mc
2
p
e
i
p( x
2
x
1
)
u
a
(p, )u
b
(p, )
=
_
dp
(2)
3
e
i
(p(t
2
t
1
)p(x
2
x
1
))
1
2
p
_
p
pc + mc
2
_
ab
if t
1
< t
2
we use (J.48) to obtain
15
15
Note that for the anticommuting fermion eld the denition of the time ordered prod-
J.12. FERMION PROPAGATOR 721
T
ab
( x
1
, x
2
) = 0[
b
( x
2
)
a
( x
1
)[0 0[(a
+ b)(a + b
)[0 0[bb
[0
= 0[(2)
3/2
_
dpe
p x
2
B
b
(p)2)
3/2
_
dqe
i
q x
1
B
a
(q)[0
= (2)
3
_
dp
mc
2
p
e
i
p( x
1
x
2
)
v
b
(p, )v
a
(p, )
=
_
dp
(2)
3
e
i
(p(t
1
t
2
)p(x
1
x
2
))
1
2
p
_
p
pc mc
2
_
ab
The sum of these two terms gives
T
ab
( x
1
, x
2
) = (t
1
t
2
)
_
dp
(2)
3
e
i
(p(t
2
t
1
)p(x
2
x
1
))
1
2
p
P
ab
(p,
p
)
+ (t
2
t
1
)
_
dp
(2)
3
e
i
(p(t
1
t
2
)p(x
1
x
2
))
1
2
p
P
ab
(p,
p
)
(J.87)
where we denoted
P
ab
(p,
p
) =
_
p
pc + mc
2
_
ab
and (t) is the step function dened in (B.3). Our next goal is to rewrite
equation (J.87) so that integration goes by 4 independent components of the
4-vector of momentum (p
0
, p
x
, p
y
, p
z
). We use integral representation (B.4)
for the step function to obtain
T
ab
( x
1
, x
2
)
=
1
2i
_
dp
(2)
3
_
ds
e
is(t
1
t
2
)
s + i
e
(p(t
1
t
2
)p(x
1
x
2
))
1
2
p
P
ab
(p,
p
)
uct involves change of sign (compare with (7.16))
T[
a
( x
1
)
b
( x
2
)] =
_
a
( x
1
)
b
( x
2
), if t
1
> t
2
b
( x
2
)
a
( x
1
), if t
1
< t
2
(J.86)
722 APPENDIX J. QUANTUM FIELDS FOR FERMIONS
1
2i
_
dp
(2)
3
_
ds
e
is(t
1
t
2
)
s + i
e
i
(p(t
1
t
2
)p(x
1
x
2
))
1
2
p
P
ab
(p,
p
)
=
1
2i
_
dp
(2)
3
_
ds
1
s + i
1
2
p
_
e
((
p+s
)(t
1
t
2
)p(x
1
x
2
))
P
ab
(p,
p
) + e
i
((p+s)(t
1
t
2
)p(x
1
x
2
))
P
ab
(p,
p
)
_
=
1
2i
_
dp
(2)
3
_
dp
0
1
p
0
p
+ i
1
2
p
_
e
(p
0
(t
1
t
2
)p(x
1
x
2
))
P
ab
(p,
p
) + e
i
(p
0
(t
1
t
2
)p(x
1
x
2
))
P
ab
(p,
p
)
_
=
1
2i
_
dp
(2)
3
_
dp
0
1
p
0
p
+ i
1
2
p
_
e
(p
0
(t
1
t
2
)p(x
1
x
2
))
P
ab
(p, p
0
) + e
(p
0
(t
1
t
2
)+p(x
1
x
2
))
P
ab
(p, p
0
)
_
=
1
2i
_
dp
(2)
3
_
dp
0
e
(p
0
(t
1
t
2
)p(x
1
x
2
))
1
2
p
_
P
ab
(p, p
0
)
p
0
p
+ i
+
P
ab
(p, p
0
)
p
0
p
+ i
_
=
1
2i
_
dp
(2)
3
_
dp
0
e
(p
0
(t
1
t
2
)p(x
1
x
2
))
1
2
p
P
ab
(p, p
0
)
2
p
p
2
0
2
p
+ i
=
1
2i(2)
3
_
d
4
pe
( p x)
P
ab
(p, p
0
)
p
2
0
c
2
p
2
m
2
c
4
+ i
=
1
2i(2)
3
_
d
4
pe
( p x)
(
0
p
0
pc + mc
2
)
ab
p
2
c
2
m
2
c
4
+ i
=
1
2i(2)
3
_
d
4
pe
( p x)
(,p + mc
2
)
ab
p
2
c
2
m
2
c
4
+ i
(J.88)
Appendix K
Quantum eld for photons
K.1 Construction of the photons quantum
eld
Let us now construct a quantum eld based on creation (c
p,
) and annihila-
tion (c
p,
) operators for photons.
1
Our goal is to satisfy conditions listed in
Step 1. in subsection 9.1.1.
We will postulate that Lorentz transformations (9.1) of the photon eld
A
( x)U
1
0
(; a) =
(( x + a)) (K.1)
with indices and taking values 0,1,2,3. Then we attempt to dene a
4-component quantum eld for photons as
A
( x) A
(x, t)
=
c
(2)
3/2
_
dp
2p
_
e
p x
e
(p, )c
p,
+ e
i
p x
e
(p, )c
p,
_
(K.2)
where p x pxcpt and the coecient functions e
(k, ) =
1
2
_
_
0
1
i
0
_
_
(K.3)
For all other photon momenta p we dene
3
e(p, ) =
p
e(k, ) (K.4)
e
(p, ) = e
(k, )
p
(K.5)
where
p
is a boost transformation which takes the particle from standard
momentum k to p.
p
= R
p
B
p
(K.6)
where B
p
is a boost along the z-axis and R
p
is a pure rotation, as in equation
(5.60).
K.2 Explicit formula for e
(p, )
Note that the boost B
p
in equation (5.59) has no eect on the 4-vector (K.3).
The 0-th component of this vector is not aected by rotations R
p
as well.
Therefore, we conclude that for all p and x
e
0
(p, ) = 0 (K.7)
A
0
(x, t) = 0 (K.8)
Let us now nd the 3-vector part of e
2e(p, )
=
_
_
cos + n
2
x
(1 cos ) n
x
n
y
(1 cos ) n
z
sin n
x
n
z
(1 cos ) + n
y
sin
n
x
n
y
(1 cos ) + n
z
sin cos + n
2
y
(1 cos ) n
y
n
z
(1 cos ) n
x
sin
n
x
n
z
(1 cos ) n
y
sin n
y
n
z
(1 cos ) + n
x
sin cos + n
2
z
(1 cos )
_
_
_
_
1
i
0
_
_
=
_
_
pz
p
+
p
2
y
(ppz)
p(p
2
x
+p
2
y
)
pxpy(ppz)
p(p
2
x
+p
2
y
)
px
p
pxpy(ppz)
p(p
2
x
+p
2
y
)
pz
p
+
p
2
x
(ppz)
p(p
2
x
+p
2
y
)
py
p
px
p
py
p
pz
p
_
_
_
_
1
i
0
_
_
=
_
_
pzp
2
x
+pp
2
y
p(p
2
x
+p
2
y
)
pxpy(ppz)
p(p
2
x
+p
2
y
)
px
p
pxpy(ppz)
p(p
2
x
+p
2
y
)
pzp
2
y
+pp
2
x
p(p
2
x
+p
2
y
)
py
p
px
p
py
p
pz
p
_
_
_
_
1
i
0
_
_
=
1
p(p
2
x
+ p
2
y
)
_
_
p
z
p
2
x
+ pp
2
y
ip
x
p
y
(p p
z
)
p
x
p
y
(p p
z
) + i(p
z
p
2
y
+ pp
2
x
)
p
x
(p
2
x
+ p
2
y
) ip
y
(p
2
x
+ p
2
y
)
_
_
Therefore
e(p, ) =
1
2p(p
2
x
+ p
2
y
)
_
_
0
p
z
p
2
x
+ pp
2
y
ip
x
p
y
(p p
z
)
p
x
p
y
(p p
z
) + i(p
z
p
2
y
+ pp
2
x
)
p
x
(p
2
x
+ p
2
y
) ip
y
(p
2
x
+ p
2
y
)
_
_
(K.9)
One can easily see that the 3-vector part of e(p, ) is orthogonal to the
momentum vector p = (p
x
, p
y
, p
z
) and that
e
(p, )p
= 0 (K.10)
K.3 Useful commutator
For our derivations in subsection 9.2.1 we need the following expression
726 APPENDIX K. QUANTUM FIELD FOR PHOTONS
C
(p) =
2p
(p, )c
p,
(K.11)
and the commutator
_
C
(p), C
(p
)
_
=
2
c
2
pp
(p, )e
(p
)[c
p,
, c
p
,
]
=
2
c
2p
(p, )e
(p
)(p p
)
,
2
c
2p
(p, )e
(p, )(p p
)
=
2
c
2p
(p)(p p
) (K.12)
where
h
(p)
(p, )e
(p, )
is a sum frequently met in calculations. First we calculate this sum at the
standard momentum k = (0, 0, 1) with the help of (K.3)
h
(k) =
1
2
_
_
0
1
i
0
_
_
_
0 1 i 0
+
1
2
_
_
0
1
i
0
_
_
_
0 1 i 0
=
1
2
_
_
0 0 0 0
0 1 i 0
0 i 1 0
0 0 0 0
_
_
+
1
2
_
_
0 0 0 0
0 1 i 0
0 i 1 0
0 0 0 0
_
_
=
_
_
0 0 0 0
0 1 0 0
0 0 1 0
0 0 0 0
_
_
K.4. EQUAL TIME COMMUTATOR OF PHOTON FIELDS 727
which can be also expressed in terms of components of the standard vector
k
h
0
(k) = h
0
(k) = 0
h
ij
(k) =
ij
k
i
k
j
k
2
At arbitrary momentum p we use formulas (K.4), (K.5) and (K.6)
h
(p) =
(p, )e
(p, )
= R
p
B
p
_
_
0 0 0 0
0 1 0 0
0 0 1 0
0 0 0 0
_
_
B
1
p
R
1
p
= R
p
_
_
0 0 0 0
0 1 0 0
0 0 1 0
0 0 0 0
_
_
R
1
p
It then follows that h
0
(p) = h
0
(p) = 0, that the 3 3 submatrix is
h
ij
(p) = R
p
_
ij
k
i
k
j
k
2
_
R
1
p
=
ij
p
i
p
j
p
2
(K.13)
and the nal formula for h
(p) is
h
(p) =
_
_
0 0 0 0
0 1
p
2
x
p
2
pxpy
p
2
pxpz
p
2
0
pxpy
p
2
1
p
2
y
p
2
pypz
p
2
0
pxpz
p
2
pzpy
p
2
1
p
2
z
p
2
_
_
(K.14)
K.4 Equal time commutator of photon elds
The photon quantum eld (K.2) commutes with itself at space-like intervals
(x ,= y), as required in equation (9.4)
728 APPENDIX K. QUANTUM FIELD FOR PHOTONS
[A
(x, 0), A
(y, 0)]
=
2
c
2(2)
3
_
dpdp
pp
__
e
px
e
(p, )c
p,
+ e
i
px
e
(p, )c
p,
_
,
_
e
i
y
e
(p
)c
+ e
y
e
(p
)c
p
__
=
2
c
2(2)
3
_
dpdp
pp
_
e
px
e
i
y
e
(p, )e
(p
)[c
p,
, c
]
+e
i
px
e
x
e
(p, )e
(p
)[c
p,
, c
p
,
]
_
=
2
c
2(2)
3
_
dpdp
p
(p p
_
e
p(xy)
e
(p, )e
(p
) e
i
p(xy)
e
(p, )e
(p
)
_
=
2
c
2(2)
3
_
dp
p
_
e
p(xy)
e
(p, )e
(p, ) e
i
p(xy)
e
(p, )e
(p, )
_
=
2
c
2(2)
3
_
dp
p
_
e
p(xy)
e
i
p(xy)
_
h
(p)
=
i
2
c
2(2)
3
_
dp
p
sin(p(x y))h
(p) (K.15)
= 0
because the integrand in (K.15) is an odd function of p.
K.5 Photon propagator
Next we need to calculate the photon propagator. We use the integral rep-
resentation (B.4) of the step function to write
0[T[A
( x
1
)A
( x
2
)][0
=
2
c
_
dp
2(2)
3
p
h
(p)
_
e
i
p( x
1
x
2
)
(t
1
t
2
) + e
i
p( x
2
x
1
)
(t
2
t
1
)
_
K.5. PHOTON PROPAGATOR 729
=
2
c
2i
ds
_
dp
2(2)
3
p
h
(p)
_
e
i
p( x
1
x
2
)
e
is(t
1
t
2
)
s + i
+ e
i
p( x
2
x
1
)
e
is(t
1
t
2
)
s + i
_
=
2
c
2i
ds
_
dp
2(2)
3
p
h
(p)
1
s + i
_
e
i
(cp(t
1
t
2
)p(x
1
x
2
))
e
is(t
1
t
2
)
+ e
i
(cp(t
1
t
2
)+p(x
1
x
2
))
e
is(t
1
t
2
)
_
=
2
c
2i
ds
_
dp
2(2)
3
p
h
(p)e
p(x
1
x
2
)
_
e
i
(cps)(t
1
t
2
)
s + i
+
e
(cps)(t
1
t
2
)
s + i
_
Next we change variables: in the rst integral p
0
= cp s; in the second
integral p
0
= cp +s
0[T[A
( x
1
)A
( x
2
)][0
=
2
c
2i
dp
0
_
dp
2(2)
3
p
h
(p)e
p(x
1
x
2
)
_
e
i
p
0
(t
1
t
2
)
cp p
0
+ i
+
e
i
p
0
(t
1
t
2
)
cp + p
0
+ i
_
=
2
c
2
2i
dp
0
_
dp
(2)
3
h
(p)e
i
p
0
(t
1
t
2
)
e
ic
p(x
1
x
2
)
1
p
2
+ i
=
2
c
2
2i
_
d
4
p
(2)
3
h
(p)e
i
p( x
1
x
2
)
1
p
2
+ i
(K.16)
where we denoted d
4
p dp
0
dp.
The matrix h
(p) = g
. Then we
obtain our nal propagator formula
0[T[A
( x
1
)A
( x
2
)][0 =
2
c
2
2i
_
d
4
p
(2)
3
e
i
p( x
1
x
2
)
g
p
2
+ i
(K.17)
730 APPENDIX K. QUANTUM FIELD FOR PHOTONS
K.6 Poincare transformations of the photon
eld
Now we need to determine transformations of the photon eld with respect
to the non-interacting representation of the Poincare group. Note that we
have dened coecient functions e
(x, t)U
1
0
(1; r, ) = A
(x +r, t + ) (K.18)
However, transformations with respect to boosts disagree with our expecta-
tion [Wei64a]
U
0
(; 0)A
( x)U
1
0
(; 0) =
( x) (K.19)
To demonstrate this disagreement we rst use equations (8.38) and (8.39) to
write
U
0
(; 0)A
( x)U
1
0
(; 0)
=
c
(2)
3/2
_
dp
2p
_
e
p x
e
(p, )U
0
(; 0)c
p,
U
1
0
(; 0)
+e
i
p x
e
(p, )U
0
(; 0)c
p,
U
1
0
(; 0)
_
=
c
(2)
3/2
_
dp
2p
[p[
p
_
e
p x
e
(p, )e
i
W
(p,)
c
p,
+e
i
p x
e
(p, )e
i
W
(p,)
c
p,
_
(K.20)
Next we take equation (K.4) for vector p
K.6. POINCAR
1
e(p, ) =
p
(
1
p
1
p
)e(k, )
The term in parentheses
1
p
1
p
is a member of the little group
4
which
corresponds to a Wigner rotation through the angle
W
, so we can use
representation (5.54)
1
p
1
p
e(k, ) = S(X
1
, X
2
,
W
)e(k, )
=
_
_
1 + (X
2
1
+ X
2
2
)/2 X
1
X
2
(X
2
1
+ X
2
2
)/2
X
1
cos
W
X
2
sin
W
cos
W
sin
W
X
1
cos
W
+ X
2
sin
W
X
1
sin
W
+ X
2
cos
W
sin
W
cos
W
X
1
sin
W
X
2
cos
W
(X
2
1
+ X
2
2
)/2 X
1
X
2
1 (X
2
1
+ X
2
2
)/2
_
_
_
_
0
1
i
0
_
_
= e
i
W
(p,)
_
_
0
1
i
0
_
_
+ (X
1
+ iX
2
)
_
_
1
0
0
1
_
_
= e
i
W
(p,)
e(k, ) +
X
1
+ iX
2
c
k
where k
= (c, 0, 0, c) and X
1
, X
2
are certain functions of and p. Our next
goal is to eliminate these unknown functions from our formulas. Denoting
X
(p, ) =
X
1
+ iX
2
c
(K.21)
we obtain
3
=0
(p, ) = e
i
W
(p,)
p
e
(k, ) + X
(p, )
p
k
= e
i
W
(p,)
e
(p, ) + X
(p, )
p
p
(K.22)
4
see subsection 5.3.2
732 APPENDIX K. QUANTUM FIELD FOR PHOTONS
where p
= (p, p
x
, p
y
, p
z
) is the energy-momentum 4-vector corresponding to
the 3-momentum p. By letting = 0 and taking into account (K.7) we also
obtain
3
=0
1
0
e
(p, ) = e
i
W
(p,)
e
0
(p, ) + X
(p, )
p
0
p
= X
(p, )
e
i
W
(p,)
e
(p, ) =
3
=0
(p, ) X
(p, )
p
p
=
3
=0
(p, )
p
p
3
=0
1
0
e
(p, )
=
3
=0
_
1
0
p
p
_
e
(p, )
The complex conjugate of this equation is
e
i
W
(p,)
e
(p, ) =
3
=0
_
1
0
p
p
_
e
(p, )
Then using (5.25) and (I.4) we can rewrite equation (K.20) as
U
0
(; 0)A
( x)U
1
0
(; 0)
=
2(2)
3/2
_
dp
[p[
p
1
=1
3
=0
_
e
p x
_
1
0
p
p
_
e
(p, )c
p,
+e
i
p x
_
1
0
p
p
_
e
(p, )c
p,
_
=
2(2)
3/2
3
=0
_
d(p)
[p[
_
[p[
1
=1
_
e
p x
e
(p, )c
p,
+ e
i
p x
e
(p, )c
p,
_
c
(2)
3/2
_
d(p)
[p[
_
[p[p
=1
3
=0
1
0
p
_
e
p x
e
(p, )c
p,
+ e
i
p x
e
(p, )c
p,
_
=
3
=0
2(2)
3/2
_
dp
p
1
=1
_
e
p x
e
(p, )c
p,
+ e
i
p x
e
(p, )c
p,
_
_
K.6. POINCAR
c
(2)
3/2
_
dp
p
1
=1
(
1
p)
[
1
p[
3
=0
1
0
_
e
1
p x
e
(p, )c
p,
+ e
i
1
p x
e
(p, )c
p,
_
=
3
=0
( x) +
( x, ) (K.23)
Thus we see that property (K.19) is not satised. In addition to the desired
covariant transformation
1
A( x), there is an extra term
( x, ) =
c
(2)
3/2
_
dp
2p
1
=1
(
1
p)
[
1
p[
3
=0
1
0
_
e
1
p x
e
(p, )c
p,
+ e
i
1
p x
e
(p, )c
p,
_
(K.24)
in the boost transformation law. The presence of this extra term is the rea-
son why QED with massless photons cannot be formulated via simple steps
outlined in subsection 9.1.1. A more elaborate construction is required in
order to maintain the relativistic invariance of QED as detailed in subsection
9.1.2 and in Appendix N.2.
From
lim
0
3
=0
1
0
e
(p, ) =
3
=0
0
e
(p, ) = e
0
(p, ) = 0 (K.25)
we obtain the following useful property
( x, 1) = 0 (K.26)
734 APPENDIX K. QUANTUM FIELD FOR PHOTONS
Appendix L
QED interaction in terms of
particle operators
L.1 Current density
In QED an important role is played by the operator of current density which
is dened as a sum of the electron/positron J
( x) and proton/antiproton
( x) current densities
j
( x) = J
( x) +
( x)
ec( x)
( x) + ec( x)
( x) (L.1)
where e is the absolute value of the electron charge, gamma matrices
are
dened in equations (J.1) - (J.2) and quantum elds ( x), ( x), ( x), ( x)
are dened in Appendix J.3.
1
Let us consider the electron/positron part
J
( x)U
1
0
(; 0)
1
Note that ( x) is a 4-component bispinor-column, ( x) is a 4-component bispinor-row
and
( x) are similar.
735
736APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
= ecU
0
(; 0)
( x)
0
( x)U
1
0
(; 0)
= ecU
0
(; 0)
( x)U
1
0
(; 0)
0
U
0
(; 0)( x)U
1
0
(; 0)
= ec
( x)T
(
1
)
0
T(
1
)( x)
= ec
( x)T(
1
)
0
T(
1
)T()
T(
1
)( x)
= ec
( x)
0
T()
T(
1
)( x)
= ec
3
=0
( x)
0
(
1
)
( x)
=
3
=0
(
1
)
( x) (L.2)
From this we obtain a useful commutator
[K
0z
, J
0
( x)]
=
i
c
lim
0
d
d
e
ic
K
0z
J
0
( x)e
ic
K
0z
=
i
c
lim
0
d
d
_
J
0
_
x, y, z cosh ct sinh , t cosh
z
c
sinh
_
cosh
+J
z
_
x, y, z cosh ct sinh , t cosh
z
c
sinh
_
sinh
_
= i
_
z
c
2
d
dt
+ t
d
dz
_
J
0
( x)
i
c
J
z
( x) (L.3)
Space-time translations act by shifting the argument of the current
U
0
(0; a)J
( x)U
1
0
(0; a) = J
( x + a) (L.4)
Second, the current density satises the continuity equation which can be
proven by using Dirac equations (J.78), (J.80) and property (J.3)
t
J
0
( x) = ec
t
(( x)
0
( x))
= ec
_
t
( x)
_
0
( x) + ( x)
_
0
t
( x)
_
= ec
_
c
x
( x)
+
i
mc
2
( x)
_
0
( x)
L.1. CURRENT DENSITY 737
+ec( x)
_
c
x
( x)
i
mc
2
( x)
_
= ec
2
x
( x)( x) + ec
2
( x)
x
( x)
= ec
2
x
(( x)( x))
= c
x
J( x) (L.5)
Third, from equation (J.71) it follows that current components commute at
spacelike separations
[j
(x, t), j
(y, t)] = 0, if x ,= y
Using expressions for elds (J.57) and (J.58), we can also write the current
density operator (L.1) in the normally ordered form
3
j
( x) = ec( x)
( x) + ec( x)
( x)
= ec(2)
3
_
dpdp
_
[e
i
p x
A
(p) + e
p x
B
(p)]
[e
x
A
(p
) + e
i
x
B
(p
)]
+ [e
i
P x
D
(p) + e
P x
F
(p)]
[e
x
D
(p
) + e
i
x
F
(p
)]
_
= ec(2)
3
_
dpdp
_
A
(p)A
(p
)e
( p
p) x
A
(p)B
(p
)e
i
( p
+ p) x
B
(p)A
(p
)e
( p
+ p) x
B
(p)B
(p
)e
i
( p
p) x
+ D
(p)D
(p
)e
(
P
P) x
+ D
(p)F
(p
)e
+
i
(
P
+
P) x
+ F
(p)D
(p
)e
(
P
+
P) x
+ F
(p)F
(p
)e
i
(
P
P) x
_
= ec(2)
3
_
dpdp
_
A
(p)A
(p
)e
( p
p) x
A
(p)B
(p
)e
i
( p
+ p) x
3
Summation on bispinor indices and is assumed.
738APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
B
(p)A
(p
)e
( p
+ p) x
+ B
(p
)B
(p)e
i
( p
p) x
+ D
(p)D
(p
)e
(
P
P) x
+ D
(p)F
(p
)e
+
i
(
P
+
P) x
+ F
(p)D
(p
)e
(
P
+
P) x
F
(p
)F
(p)e
i
(
P
P) x
B
(p), B
(p
)e
i
( p
p) x
+F
(p), F
(p
)e
i
(
P
P) x
_
Let us show that the two last terms vanish. We use anticommutator (J.74)
and properties of gamma matrices to rewrite these two terms as
ec(2)
3
_
dpdp
_
1
2
p
(
0
p
+pc mc
2
)
(p p
)e
i
( p
p)x
+
1
2
p
(
0
p
+pc Mc
2
)
(p p
)e
i
(
P
P)x
_
= ec(2)
3
_
dp
_
mc
2
2
p
Mc
2
2
p
_
+ ec(2)
3
_
dp(
0
)
1
2
+
1
2
_
(L.6)
+ ec(2)
3
_
dp(
pc
2
p
+
pc
2
p
_
= ec(2)
3
Tr(
)
_
dp
_
mc
2
2
p
Mc
2
2
p
_
+ ec
2
(2)
3
Tr(
)
_
dpp
_
1
2
p
+
1
2
p
_
The rst term vanishes due to the property (J.7). The second integral is
zero, because the integrand is an odd function of p.
4
So, nally, the normally
ordered form of the current density is
j
( x) = ec(2)
3
_
dpdp
4
Note that cancelation in (L.6) was possible only because our theory contains two
particle types (electrons and protons) with opposite electric charges.
L.2. FIRST-ORDER INTERACTION IN QED 739
_
A
(p)A
(p
)e
( p
p) x
A
(p)B
(p
)e
i
( p
+ p) x
B
(p)A
(p
)e
( p
+ p) x
+ B
(p
)B
(p)e
i
( p
p) x
+ D
(p)D
(p
)e
(
P
P) x
+ D
(p)F
(p
)e
+
i
(
P
+
P) x
+ F
(p)D
(p
)e
(
P
+
P) x
F
(p
)F
(p)e
i
(
P
P) x
_
(L.7)
L.2 First-order interaction in QED
Inserting (L.7) and (K.11) in (9.13) we obtain the 1st order interaction ex-
pressed in terms of creation and annihilation operators
V
1
=
e
(2)
9/2
_
dxdpdp
dk
_
A
(p)A
(p
)e
(p
p)x
+ . . .
_
_
e
kx
C
(k) + e
i
kx
C
(k)
_
=
e
(2)
3/2
_
dkdp
_
A
(p +k)A
(p)C
(k) A
(p k)A
(p)C
(k)
+ D
(p +k)D
(p)C
(k) + D
(p k)D
(p)C
(k)
+ B
(p +k)B
(p)C
(k) + B
(p k)B
(p)C
(k)
F
(p +k)F
(p)C
(k) F
(p k)F
(p)C
(k)
A
(p +k)B
(p)C
(k) A
(p k)B
(p)C
(k)
A
(p +k)B
(p)C
(k) A
(p k)B
(p)C
(k)
+ D
(p +k)F
(p)C
(k) + D
(p k)F
(p)C
(k)
+ D
(p +k)F
(p)C
(k) + D
(p k)F
(p)C
(k)
_
(L.8)
This operator is of the pure unphys type.
L.3 Second-order interaction in QED
The second order interaction Hamiltonian (9.14) has rather long expression
in terms of particle operators
740APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
V
2
=
1
c
2
_
dxdyj
0
(x, 0)
1
8[x y[
j
0
(y, 0)
= e
2
(2)
6
_
dxdy
_
dpdp
dqdq
1
8[x y[
_
A
(p)A
(p
)e
(p
p)x
A
(p)B
(p
)e
i
(p
+p)x
B
(p)A
(p
)e
(p
+p)x
B
(p)B
(p
)e
i
(p
p)x
+ D
(p)D
(p
)e
(p
p)x
+ D
(p)F
(p
)e
+
i
(p
+p)x
+ F
(p)D
(p
)e
(p
+p)x
+ F
(p)F
(p
)e
i
(p
p)x
_
_
A
(q)A
(q
)e
(q
q)y
A
(q)B
(q
)e
i
(q
+q)y
B
(q)A
(q
)e
(q
+q)y
B
(q)B
(q
)e
i
(q
q)y
+ D
(q)D
(q
)e
(q
q)y
+ D
(q)F
(q
)e
+
i
(q
+q)y
+ F
(q)D
(q
)e
(q
+q)y
+ F
(q)F
(q
)e
i
(q
q)y
_
= e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
_
+ A
(p)A
(p
)A
(q)A
(q
)e
(q
q)y
e
(p
p)x
+ A
(p)A
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
(p
p)x
+ A
(p)A
(p
)B
(q)A
(q
)e
(q
+q)y
e
(p
p)x
+ A
(p)A
(p
)B
(q)B
(q
)e
i
(q
q)y
e
(p
p)x
A
(p)A
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
p)x
A
(p)A
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
A
(p)A
(p
)F
(q)D
(q
)e
(q
+q)y
e
(p
p)x
A
(p)A
(p
)F
(q)F
(q
)e
i
(q
q)y
e
(p
p)x
+ A
(p)B
(p
)A
(q)A
(q
)e
(q
q)y
e
i
(p
+p)x
+ A
(p)B
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
i
(p
+p)x
+ A
(p)B
(p
)B
(q)A
(q
)e
(q
+q)y
e
i
(p
+p)x
L.3. SECOND-ORDER INTERACTION IN QED 741
+ A
(p)B
(p
)B
(q)B
(q
)e
i
(q
q)y
e
i
(p
+p)x
A
(p)B
(p
)D
(q)D
(q
)e
(q
q)y
e
i
(p
+p)x
A
(p)B
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
+p)x
A
(p)B
(p
)F
(q)D
(q
)e
(q
+q)y
e
i
(p
+p)x
A
(p)B
(p
)F
(q)F
(q
)e
i
(q
q)y
e
i
(p
+p)x
+ B
(p)A
(p
)A
(q)A
(q
)e
(q
q)y
e
(p
+p)x
+ B
(p)A
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
(p
+p)x
+ B
(p)A
(p
)B
(q)A
(q
)e
(q
+q)y
e
(p
+p)x
+ B
(p)A
(p
)B
(q)B
(q
)e
i
(q
q)y
e
(p
+p)x
B
(p)A
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
+p)x
B
(p)A
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
+p)x
B
(p)A
(p
)F
(q)D
(q
)e
(q
+q)y
e
(p
+p)x
B
(p)A
(p
)F
(q)F
(q
)e
i
(q
q)y
e
(p
+p)x
+ B
(p)B
(p
)A
(q)A
(q
)e
(q
q)y
e
i
(p
p)x
+ B
(p)B
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
i
(p
p)x
+ B
(p)B
(p
)B
(q)A
(q
)e
(q
+q)y
e
i
(p
p)x
+ B
(p)B
(p
)B
(q)B
(q
)e
i
(q
q)y
e
i
(p
p)x
B
(p)B
(p
)D
(q)D
(q
)e
(q
q)y
e
i
(p
p)x
B
(p)B
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
p)x
B
(p)B
(p
)F
(q)D
(q
)e
(q
+q)y
e
i
(p
p)x
B
(p)B
(p
)F
(q)F
(q
)e
i
(q
q)y
e
i
(p
p)x
D
(p)D
(p
)A
(q)A
(q
)e
(q
q)y
e
(p
p)x
D
(p)D
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
(p
p)x
D
(p)D
(p
)B
(q)A
(q
)e
(q
+q)y
e
(p
p)x
D
(p)D
(p
)B
(q)B
(q
)e
i
(q
q)y
e
(p
p)x
+ D
(p)D
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
p)x
742APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
+ D
(p)D
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
+ D
(p)D
(p
)F
(q)D
(q
)e
(q
+q)y
e
(p
p)x
+ D
(p)D
(p
)F
(q)F
(q
)e
i
(q
q)y
e
(p
p)x
D
(p)F
(p
)A
(q)A
(q
)e
(q
q)y
e
+
i
(p
+p)x
D
(p)F
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
+
i
(p
+p)x
D
(p)F
(p
)B
(q)A
(q
)e
(q
+q)y
e
+
i
(p
+p)x
D
(p)F
(p
)B
(q)B
(q
)e
i
(q
q)y
e
+
i
(p
+p)x
+ D
(p)F
(p
)D
(q)D
(q
)e
(q
q)y
e
+
i
(p
+p)x
+ D
(p)F
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
+
i
(p
+p)x
+ D
(p)F
(p
)F
(q)D
(q
)e
(q
+q)y
e
+
i
(p
+p)x
+ D
(p)F
(p
)F
(q)F
(q
)e
i
(q
q)y
e
+
i
(p
+p)x
F
(p)D
(p
)A
(q)A
(q
)e
(q
q)y
e
(p
+p)x
F
(p)D
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
(p
+p)x
F
(p)D
(p
)B
(q)A
(q
)e
(q
+q)y
e
(p
+p)x
F
(p)D
(p
)B
(q)B
(q
)e
i
(q
q)y
e
(p
+p)x
+ F
(p)D
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
+p)x
+ F
(p)D
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
+p)x
+ F
(p)D
(p
)F
(q)D
(q
)e
(q
+q)y
e
(p
+p)x
+ F
(p)D
(p
)F
(q)F
(q
)e
i
(q
q)y
e
(p
+p)x
F
(p)F
(p
)A
(q)A
(q
)e
(q
q)y
e
i
(p
p)x
F
(p)F
(p
)A
(q)B
(q
)e
i
(q
+q)y
e
i
(p
p)x
F
(p)F
(p
)B
(q)A
(q
)e
(q
+q)y
e
i
(p
p)x
F
(p)F
(p
)B
(q)B
(q
)e
i
(q
q)y
e
i
(p
p)x
+ F
(p)F
(p
)D
(q)D
(q
)e
(q
q)y
e
i
(p
p)x
+ F
(p)F
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
p)x
+ F
(p)F
(p
)F
(q)D
(q
)e
(q
+q)y
e
i
(p
p)x
L.3. SECOND-ORDER INTERACTION IN QED 743
+ F
(p)F
(p
)F
(q)F
(q
)e
i
(q
q)y
e
i
(p
p)x
_
(L.9)
We need to convert this expression to the normal order, i.e., move all creation
operators in front of annihilation operators. After this is done we will obtain
a sum of phys, unphys and renorm terms. It can be shown that the renorm
terms are innite. This is an indication of renormalization troubles with
QED. In the rest of this section we will simply ignore the renorm part of
interaction.
There are some cancelations among unphys terms. To see how they work,
let us convert to the normal order the 12th term in (L.9)
e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
A
(p)B
(p
)B
(q)B
(q
)e
i
(q
q)y
e
i
(p
+p)x
= e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
A
(p)B
(p
)B
(q
)B
(q)e
i
(q
q)y
e
i
(p
+p)x
+ e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
A
(p)B
(p
)B
(q), B
(q
)e
i
(q
q)y
e
i
(p
+p)x
Now we denote the second term on the right hand side of this expression by
I and use (J.74), (J.7) - (J.8) and (B.6)
I = e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
A
(p)B
(p
)
1
2
q
(
0
q
+qc mc
2
)
(q
q)e
i
(q
q)y
e
i
(p
+p)x
= e
2
(2)
6
_
dxdy
_
dpdp
dq
8[x y[
A
(p)B
(p
)
1
2
q
(
q
Tr(
0
0
) +qcTr(
0
) mc
2
Tr(
0
))e
i
(p
+p)x
744APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
= 2e
2
(2)
6
_
dxdy
_
dpdp
dq
8[x y[
A
(p)B
(p
)e
i
(p
+p)x
=
2e
2
2
(2)
3
_
dpdp
dq
0
(p)B
(p
)
(p
+p)
(p
+p)
2
(L.10)
This term is innite. However there are three other innite terms in (L.9)
that arise in a similar manner from A
FF
+ BB
FF
.
These terms cancel exactly with (L.10). Similar to (L.6), this cancelation is
possible only because of the condition q
electron
+ q
proton
= 0.
Taking into account the above results and using anticommutators like
(J.72) and (J.74) we can bring the second order interaction (L.9) to the
normal order
V
2
= e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
( A
(p)A
(q)A
(p
)A
(q
)e
(q
q)y
e
(p
p)x
A
(p)A
(q)A
(p
)B
(q
)e
i
(q
+q)y
e
(p
p)x
+ A
(p)A
(p
)A
(q
)B
(q)e
(q
+q)y
e
(p
p)x
A
(p)A
(p
)B
(q
)B
(q)e
i
(q
q)y
e
(p
p)x
A
(p)A
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
p)x
A
(p)A
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
A
(p)A
(p
)D
(q
)F
(q)e
(q
+q)y
e
(p
p)x
+ A
(p)A
(p
)F
(q
)F
(q)e
i
(q
q)y
e
(p
p)x
+ A
(p)A
(q)A
(q
)B
(p
)e
(q
q)y
e
i
(p
+p)x
+ A
(p)A
(q)B
(p
)B
(q
)e
i
(q
+q)y
e
i
(p
+p)x
+ A
(p)A
(q
)B
(p
)B
(q)e
(q
+q)y
e
i
(p
+p)x
A
(p)B
(p
)B
(q
)B
(q)e
i
(q
q)y
e
i
(p
+p)x
A
(p)B
(p
)D
(q)D
(q
)e
(q
q)y
e
i
(p
+p)x
A
(p)B
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
+p)x
L.3. SECOND-ORDER INTERACTION IN QED 745
A
(p)B
(p
)D
(q
)F
(q)e
(q
+q)y
e
i
(p
+p)x
+ A
(p)B
(p
)F
(q
)F
(q)e
i
(q
q)y
e
i
(p
+p)x
A
(q)A
(p
)A
(q
)B
(p)e
(q
q)y
e
(p
+p)x
+ A
(q)A
(p
)B
(q
)B
(p)e
i
(q
+q)y
e
(p
+p)x
+ A
(p
)A
(q
)B
(p)B
(q)e
(q
+q)y
e
(p
+p)x
+ A
(p
)B
(q
)B
(p)B
(q)e
i
(q
q)y
e
(p
+p)x
A
(p
)B
(p)D
(q)D
(q
)e
(q
q)y
e
(p
+p)x
A
(p
)B
(p)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
+p)x
A
(p
)B
(p)D
(q
)F
(q)e
(q
+q)y
e
(p
+p)x
+ A
(p
)B
(p)F
(q
)F
(q)e
i
(q
q)y
e
(p
+p)x
A
(q)A
(q
)B
(p
)B
(p)e
(q
q)y
e
i
(p
p)x
+ A
(q)B
(p
)B
(q
)B
(p)e
i
(q
+q)y
e
i
(p
p)x
A
(q
)B
(p
)B
(p)B
(q)e
(q
+q)y
e
i
(p
p)x
B
(p
)B
(q
)B
(p)B
(q)e
i
(q
q)y
e
i
(p
p)x
+ B
(p
)B
(p)D
(q)D
(q
)e
(q
q)y
e
i
(p
p)x
+ B
(p
)B
(p)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
p)x
+ B
(p
)B
(p)D
(q
)F
(q)e
(q
+q)y
e
i
(p
p)x
B
(p
)B
(p)F
(q
(q)e
i
(q
q)y
e
i
(p
p)x
A
(q)A
(q
)D
(p)D
(p
)e
(q
q)y
e
(p
p)x
A
(q)B
(q
)D
(p)D
(p
)e
i
(q
+q)y
e
(p
p)x
A
(q
)B
(q)D
(p)D
(p
)e
(q
+q)y
e
(p
p)x
+ B
(q
)B
(q)D
(p)D
(p
)e
i
(q
q)y
e
(p
p)x
D
(q)D
(p)D
(q
)D
(p
)e
(q
q)y
e
(p
p)x
D
(p)D
(q)D
(p
)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
+ D
(p)D
(p
)D
(q
)F
(q)e
(q
+q)y
e
(p
p)x
D
(p)D
(p
)F
(q
)F
(q)e
i
(q
q)y
e
(p
p)x
746APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
A
(q)A
(q
)D
(p)F
(p
)e
(q
q)y
e
+
i
(p
+p)x
A
(q)B
(q
)D
(p)F
(p
)e
i
(q
+q)y
e
+
i
(p
+p)x
A
(q
)B
(q)D
(p)F
(p
)e
(q
+q)y
e
+
i
(p
+p)x
+ B
(q
)B
(q)D
(p)F
(p
)e
i
(q
q)y
e
+
i
(p
+p)x
+ D
(p)D
(q)D
(q
)F
(p
)e
(q
q)y
e
+
i
(p
+p)x
+ D
(p)D
(q)F
(p
)F
(q
)e
+
i
(q
+q)y
e
+
i
(p
+p)x
+ D
(p)D
(q
)F
(p
)F
(q)e
(q
+q)y
e
+
i
(p
+p)x
D
(p)F
(p
)F
(q
)F
(q)e
i
(q
q)y
e
+
i
(p
+p)x
A
(q)A
(q
)D
(p
)F
(p)e
(q
q)y
e
(p
+p)x
A
(q)B
(q
)D
(p
)F
(p)e
i
(q
+q)y
e
(p
+p)x
A
(q
)B
(q)D
(p
)F
(p)e
(q
+q)y
e
(p
+p)x
+ B
(q
)B
(q)D
(p
)F
(p)e
i
(q
q)y
e
(p
+p)x
D
(q)D
(p
)D
(q
)F
(p)e
(q
q)y
e
(p
+p)x
+ D
(q)D
(p
)F
(q
)F
(p)e
+
i
(q
+q)y
e
(p
+p)x
+ D
(p
)D
(q
)F
(p)F
(q)e
(q
+q)y
e
(p
+p)x
+ D
(p
)F
(q
)F
(p)F
(q)e
i
(q
q)y
e
(p
+p)x
+ A
(q)A
(q
)F
(p
)F
(p)e
(q
q)y
e
i
(p
p)x
+ A
(q)B
(q
)F
(p
)F
(p)e
i
(q
+q)y
e
i
(p
p)x
+ A
(q
)B
(q)F
(p
)F
(p)e
(q
+q)y
e
i
(p
p)x
B
(q
)B
(q)F
(p
)F
(p)e
i
(q
q)y
e
i
(p
p)x
D
(q)D
(q
)F
(p
)F
(p)e
(q
q)y
e
i
(p
p)x
+ D
(q)F
(p
)F
(q
)F
(p)e
+
i
(q
+q)y
e
i
(p
p)x
D
(q
)F
(p
)F
(p)F
(q)e
(q
+q)y
e
i
(p
p)x
F
(p
)F
(q
)F
(p)F
(q)e
i
(q
q)y
e
i
(p
p)x
)
Next we switch summation labels and integration variables x y
and p q to simplify
L.3. SECOND-ORDER INTERACTION IN QED 747
V
2
= e
2
(2)
6
_
dxdy
_
dpdp
dqdq
8[x y[
( A
(p)A
(q)A
(p
)A
(q
)e
(q
q)y
e
(p
p)x
+ 2A
(p)A
(p
)A
(q
)B
(q)e
(q
+q)y
e
(p
p)x
2A
(p)A
(p
)B
(q
)B
(q)e
i
(q
q)y
e
(p
p)x
2A
(p)A
(p
)D
(q)D
(q
)e
(q
q)y
e
(p
p)x
2A
(p)A
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
2A
(p)A
(p
)D
(q
)F
(q)e
(q
+q)y
e
(p
p)x
+ 2A
(p)A
(p
)F
(q
)F
(q)e
i
(q
q)y
e
(p
p)x
+ 2A
(p)A
(q)A
(q
)B
(p
)e
(q
q)y
e
i
(p
+p)x
+ A
(p)A
(q)B
(p
)B
(q
)e
i
(q
+q)y
e
i
(p
+p)x
+ 2A
(p)A
(q
)B
(p
)B
(q)e
(q
+q)y
e
i
(p
+p)x
2A
(p)B
(p
)B
(q
)B
(q)e
i
(q
q)y
e
i
(p
+p)x
2A
(p)B
(p
)D
(q)D
(q
)e
(q
q)y
e
i
(p
+p)x
2A
(p)B
(p
)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
+p)x
2A
(p)B
(p
)D
(q
)F
(q)e
(q
+q)y
e
i
(p
+p)x
+ 2A
(p)B
(p
)F
(q
)F
(q)e
i
(q
q)y
e
i
(p
+p)x
+ A
(p
)A
(q
)B
(p)B
(q)e
(q
+q)y
e
(p
+p)x
+ 2A
(p
)B
(q
)B
(p)B
(q)e
i
(q
q)y
e
(p
+p)x
2A
(p
)B
(p)D
(q)D
(q
)e
(q
q)y
e
(p
+p)x
2A
(p
)B
(p)D
(q)F
(q
)e
+
i
(q
+q)y
e
(p
+p)x
2A
(p
)B
(p)D
(q
)F
(q)e
(q
+q)y
e
(p
+p)x
+ 2A
(p
)B
(p)F
(q
)F
(q)e
i
(q
q)y
e
(p
+p)x
B
(p
)B
(q
)B
(p)B
(q)e
i
(q
q)y
e
i
(p
p)x
+ 2B
(p
)B
(p)D
(q)D
(q
)e
(q
q)y
e
i
(p
p)x
748APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
+ 2B
(p
)B
(p)D
(q)F
(q
)e
+
i
(q
+q)y
e
i
(p
p)x
+ 2B
(p
)B
(p)D
(q
)F
(q)e
(q
+q)y
e
i
(p
p)x
2B
(p
)B
(p)F
(q
)F
(q)e
i
(q
q)y
e
i
(p
p)x
D
(p)D
(q)D
(p
)D
(q
)e
(q
q)y
e
(p
p)x
2D
(p)D
(q)D
(p
)F
(q
)e
+
i
(q
+q)y
e
(p
p)x
+ 2D
(p)D
(p
)D
(q
)F
(q)e
(q
+q)y
e
(p
p)x
2D
(p)D
(p
)F
(q
)F
(q)e
i
(q
q)y
e
(p
p)x
+ D
(p)D
(q)F
(p
)F
(q
)e
+
i
(q
+q)y
e
+
i
(p
+p)x
+ 2D
(p)D
(q
)F
(p
)F
(q)e
(q
+q)y
e
+
i
(p
+p)x
2D
(p)F
(p
)F
(q
)F
(q)e
i
(q
q)y
e
+
i
(p
+p)x
+ D
(p
)D
(q
)F
(p)F
(q)e
(q
+q)y
e
(p
+p)x
+ 2D
(p
)F
(q
)F
(p)F
(q)e
i
(q
q)y
e
(p
+p)x
F
(p
)F
(q
)F
(p)F
(q)e
i
(q
q)y
e
i
(p
p)x
)
Integrals over x and y can be evaluated by using formula (B.6)
V
2
=
e
2
2
2(2)
3
_
dpdp
dqdq
_
A
(p)A
(q)A
(p
)A
(q
)(q
q +p
p)
1
[q
q[
2
+ 2A
(p)A
(p
)A
(q
)B
(q)(q
+q +p
p)
1
[q
+q[
2
2A
(p)A
(p
)B
(q
)B
(q)(q
q p
+p)
1
[q
q[
2
2A
(p)A
(p
)D
(q)D
(q
)(q
q +p
p)
1
[q
q[
2
2A
(p)A
(p
)D
(q)F
(q
)(q
+q p
+p)
1
[q
+q[
2
2A
(p)A
(p
)D
(q
)F
(q)(q
+q +p
p)
1
[q
+q[
2
L.3. SECOND-ORDER INTERACTION IN QED 749
+ 2A
(p)A
(p
)F
(q
)F
(q)(q
q p
+p)
1
[q
q[
2
+ 2A
(p)A
(q)A
(q
)B
(p
)(q
q p
p)
1
[q
q[
2
+ A
(p)A
(q)B
(p
)B
(q
)(q
+q +p
+p)
1
[q
+q[
2
+ 2A
(p)A
(q
)B
(p
)B
(q)(q
+ q p
p)
1
[q
+q[
2
2A
(p)B
(p
)B
(q
)B
(q)(q
q +p
+p)
1
[q
q[
2
2A
(p)B
(p
)D
(q)D
(q
)(q
q p
p)
1
[q
q[
2
2A
(p)B
(p
)D
(q)F
(q
)(q
+q +p
+p)
1
[q
+q[
2
2A
(p)B
(p
)D
(q
)F
(q)(q
+q p
p)
1
[q
+q[
2
+ 2A
(p)B
(p
)F
(q
)F
(q)(q
q +p
+p)
1
[q
q[
2
+ A
(p
)A
(q
)B
(p)B
(q)(q
+q +p
+p)
1
[q
+q[
2
+ 2A
(p
)B
(q
)B
(p)B
(q)(q
q p
p)
1
[q
q[
2
2A
(p
)B
(p)D
(q)D
(q
)(q
q +p
+p)
1
[q
q[
2
2A
(p
)B
(p)D
(q)F
(q
)(q
+q p
p)
1
[q
+q[
2
2A
(p
)B
(p)D
(q
)F
(q)(q
+q +p
+p)
1
[q
+q[
2
+ 2A
(p
)B
(p)F
(q
)F
(q)(q
q p
p)
1
[q
q[
2
B
(p
)B
(q
)B
(p)B
(q)(q
q +p
p)
1
[q
q[
2
+ 2B
(p
)B
(p)D
(q)D
(q
)(q
q p
+p)
1
[q
q[
2
750APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
+ 2B
(p
)B
(p)D
(q)F
(q
)(q
+q +p
p)
1
[q
+q[
2
+ 2B
(p
)B
(p)D
(q
)F
(q)(q
+q p
+p)
1
[q
+q[
2
2B
(p
)B
(p)F
(q
)F
(q)(q
q +p
p)
1
[q
q[
2
D
(p)D
(q)D
(p
)D
(q
)(q
q +p
p)
1
[q
q[
2
2D
(p)D
(q)D
(p
)F
(q
)(q
+q p
+p)
1
[q
+q[
2
+ 2D
(p)D
(p
)D
(q
)F
(q)(q
+q +p
p)
1
[q
+q[
2
2D
(p)D
(p
)F
(q
)F
(q)(q
q p
+p)
1
[q
q[
2
+ D
(p)D
(q)F
(p
)F
(q
)(q
+q +p
+p)
1
[q
+q[
2
+ 2D
(p)D
(q
)F
(p
)F
(q)(q
+q p
p)
1
[q
+q[
2
2D
(p)F
(p
)F
(q
)F
(q)(q
q +p
+p)
1
[q
q[
2
+ D
(p
)D
(q
)F
(p)F
(q)(q
+q +p
+p)
1
[q
+q[
2
+ 2D
(p
)F
(q
)F
(p)F
(q)(q
q p
p)
1
[q
q[
2
F
(p
)F
(q
)F
(p)F
(q)(q
q +p
p)
1
[q
q[
2
_
Finally, we integrate this expression on q
and divide V
2
into phys and unphys
parts
V
2
= V
phys
2
+ V
unphys
2
V
phys
2
=
e
2
2
2(2)
3
_
dpdp
dq
0
(p)A
(q)A
(p
)A
(q p
+p)
1
[p
p[
2
2A
(p)A
(p
)B
(q +p
p)B
(q)
1
[p
p[
2
2A
(p)A
(p
)D
(q)D
(q p
+p)
1
[p
p[
2
+ 2A
(p)A
(p
)F
(+q +p
p)F
(q)
1
[p
p[
2
+ 2A
(p)A
(q +p
+p)B
(p
)B
(q)
1
[p
+p[
2
2A
(p)B
(p
)D
(q +p
+p)F
(q)
1
[p
+p[
2
2A
(p
)B
(p)D
(q)F
(q +p
+p)
1
[p
+p[
2
2A
(p
)B
(p)D
(q p
p)F
(q)
1
[p
+p[
2
B
(p
)B
(q p
+p)B
(p)B
(q)
1
[p
p[
2
+ 2B
(p
)B
(p)D
(q)D
(q +p
p)
1
[p
p[
2
2B
(p
)B
(p)F
(q p
+p)F
(q)
1
[p
p[
2
D
(p)D
(q)D
(p
)D
(q p
+p)
1
[p
p[
2
2D
(p)D
(p
)F
(q +p
p)F
(q)
1
[p
p[
2
+ 2D
(p)D
(q +p
+p)F
(p
)F
(q)
1
[p
+p[
2
F
(p
)F
(q p
+p)F
(p)F
(q)
1
[p
p[
2
_
(L.11)
V
unphys
2
=
e
2
2
2(2)
3
_
dpdp
dq
0
_
+ 2A
(p)A
(p
)A
(q p
+p)B
(q)
1
[p
p[
2
752APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
2A
(p)A
(p
)D
(q)F
(q +p
p)
1
[p
p[
2
2A
(p)A
(p
)D
(q p
+p)F
(q)
1
[p
p[
2
+ 2A
(p)A
(q)A
(q +p
+p)B
(p
)
1
[p
+p[
2
+ A
(p)A
(q)B
(p
)B
(q p
p)
1
[p
+p[
2
2A
(p)B
(p
)B
(q p
p)B
(q)
1
[p
+p[
2
2A
(p)B
(p
)D
(q)D
(q) +p
+p)
1
[p
+p[
2
2A
(p)B
(p
)D
(q)F
(q p
p)
1
[p
+p[
2
+ 2A
(p)B
(p
)F
(q p
p)F
(q)
1
[p
+p[
2
+ A
(p
)A
(q p
p)B
(p)B
(q)
1
[p
+p[
2
+ 2A
(p
)B
(q +p
+p)B
(p)B
(q)
1
[p
+p[
2
2A
(p
)B
(p)D
(q)D
(q p
p)
1
[p
+p[
2
2A
(p
)B
(p)D
(q p
p)F
(q)
1
[p
+p[
2
+ 2A
(p
)B
(p)F
(q +p
+p)F
(q)
1
[p
+p[
2
+ 2B
(p
)B
(p)D
(q)F
(q p
+ p)
1
[p
p[
2
+ 2B
(p
)B
(p)D
(q +p
p)F
(q)
1
[p
p[
2
2D
(p)D
(q)D
(p
)F
(q +p
p)
1
[p
p[
2
+ 2D
(p)D
(p
)D
(q p
+p)F
(q)
1
[p
p[
2
L.3. SECOND-ORDER INTERACTION IN QED 753
+ D
(p)D
(q)F
(p
)F
(q p
p)
1
[p
+p[
2
2D
(p)F
(p
)F
(q p
p)F
(q)
1
[p
+p[
2
+ D
(p
)D
(q p
p)F
(p)F
(q)
1
[p
+p[
2
+ 2D
(p
)F
(q +p
+p)F
(p)F
(q)
1
[p
+p[
2
_
(L.12)
754APPENDIX L. QED INTERACTION IN TERMS OF PARTICLE OPERATORS
Appendix M
Loop integrals in QED
M.1 4-dimensional delta function
In covariant Feynman-Dyson perturbation theory one often needs 4-dimensional
delta function of 4-momentum (p
0
, p
x
, p
y
, p
z
)
4
( p) (p
0
)(p
x
)(p
y
)(p
z
) = (p
0
)(p) (M.1)
which has the following integral representation
1
(2)
4
_
e
i
( p x)
d
4
x =
4
( p) (M.2)
In our notation
x = (t, x)
p = (p
0
, p)
p x = p
0
t p x
d
4
x dtdx
M.2 Feynmans trick
In QED loop calculations one often meets integrals on the loop 4-momentum
k. The calculations become much simpler if one can replace the integrand
1/(abc . . .) with an expression in which a, b, c, . . . are present in the denomi-
nator in a linear form. This can be achieved using a trick rst introduced by
Feynman [Fey49].
The simplest example of such a trick is given by the integral representation
of the product 1/(ab)
1
_
0
dx
(ax + b(1 x))
2
=
1
(b a)(ax + b(1 x))
1
0
=
1
(b a)a
1
(b a)b
=
1
ab
(M.3)
The denominator on the left hand side is a square of a function linear in a and
b. In spite of adding one more integral (on x), the overall integration task is
greatly simplied, as we will see in many examples in this Appendix. Using
this result, we can convert to the linear form more complex expressions, e.g.,
1
a
2
b
=
d
da
_
1
ab
_
=
d
da
1
_
0
dx
(ax + b(1 x))
2
=
1
_
0
2xdx
(ax + b(1 x))
3
(M.4)
These two results can be used to get an integral representation for 1/(abc)
1
abc
=
_
1
bc
_
1
a
=
_
_
1
_
0
dy
(by + c(1 y))
2
_
_
1
a
=
1
_
0
dy
1
_
0
2xdx
1
[(by + c(1 y))x + a(1 x)]
3
= 2
1
_
0
xdx
1
_
0
dy
[byx + cx(1 y) + a(1 x)]
3
(M.5)
M.3. SOME BASIC 4D INTEGRALS 757
Another useful formula is
1
1
abc
= 2
1
_
0
dx
1
_
0
dy
1
_
0
dz
(x + y + z 1)
[ax + by + cz]
3
= 2
1
_
0
dx
1x
_
0
dy
1
[ax + by + c(1 x y)]
3
(M.6)
Next dierentiate equation (M.4) on a
1
a
3
d
=
1
2
d
da
_
1
a
2
d
_
=
d
da
1
_
0
zdz
[az + d(1 z)]
3
= 3
1
_
0
z
2
dz
[az + d(1 z)]
4
This results in
1
abcd
=
_
_
2
1
_
0
xdx
1
_
0
dy
1
[a(1 x) + bxy + cx(1 y)]
3
_
_
1
d
= 6
1
_
0
xdx
1
_
0
dy
1
_
0
z
2
dz
[az(1 x) + bxyz + cxz(1 y) + d(1 z)]
4
(M.7)
Obviously these calculations can be continued for expressions with larger
numbers of factors in denominators. See, e.g., the last formula on page 520
of [Sch61] and equation (11.A.1) in [Wei95].
M.3 Some basic 4D integrals
In our studies of loop integrals we will follow Feynmans approach [Fey49]
and begin with the following simple integral
1
equation (131.2) in [BLP01]
758 APPENDIX M. LOOP INTEGRALS IN QED
K =
_
d
4
k
(
k
2
L)
3
_
dk
0
dk
(k
2
0
c
2
k
2
L + i)
3
(M.8)
The integral on k
0
has two 3rd order poles at k
0
=
c
2
k
2
+ L. We can
rotate
2
the integration contour on k
0
, so that it goes along the imaginary
axis and then change the integration variables ik
0
= m
4
and ck = m. Then
the integral is
K =
1
c
3
i
_
i
dk
0
_
dm
(k
2
0
m
2
L)
3
=
i
c
3
dm
4
_
dm
(m
2
4
m
2
L)
3
Next we introduce 4-dimensional spherical coordinates [Blu60] with r
2
=
m
2
4
+ m
2
and the area of a unit sphere
3
is
_
d = 2
2
K =
2
2
i
c
3
_
0
r
3
dr
(r
2
+ L)
3
=
2
i
c
3
_
L
(t L)dt
t
3
=
2
2ic
3
L
(M.9)
From symmetry properties we also get
_
d
4
k
k
k
2
L)
3
= 0 (M.10)
Replacing
k
k p in (M.8) and calling L p
2
= we get
2
2i( p
2
+ )c
3
=
_
d
4
k
((
k p)
2
L)
3
=
_
d
4
k
(
k
2
2 p
k + p
2
L)
3
=
_
d
4
k
(
k
2
2 p
k )
3
(M.11)
Making the same substitutions in (M.10) we obtain
2
This step is known as Wick rotation [PS95b].
3
See equation (7.81) in [PS95b].
M.3. SOME BASIC 4D INTEGRALS 759
0 =
_
d
4
k(k
)
((
k p)
2
L)
3
=
_
d
4
k(k
)
(
k
2
2 p
k )
3
Then
_
d
4
kk
k
2
2 p
k )
3
=
_
d
4
kp
k
2
2 p
k )
3
=
2
p
2i( p
2
+ )c
3
(M.12)
Dierentiating both sides of (M.11) either by or by p
we obtain
_
d
4
k
(
k
2
2 p
k )
4
=
2
6i( p
2
+ )
2
c
3
(M.13)
_
d
4
kk
k
2
2 p
k )
4
=
2
p
6i( p
2
+ )
2
c
3
(M.14)
Next dierentiate both sides of (M.12) by p
. If ,= then
_
d
4
kk
k
2
2 p
k )
4
=
2
p
6i( p
2
+ )
2
c
3
(M.15)
If =
_
d
4
kk
k
2
2 p
k )
4
=
2
p
6i( p
2
+ )
2
c
3
2
12i( p
2
+ )c
3
(M.16)
Combining (M.15) and (M.16) yields
_
d
4
kk
k
2
2 p
k )
4
=
2
(p
1
2
( p
2
+ ))
6i( p
2
+ )
2
c
3
Next we use (M.4) and (M.11) to calculate
760 APPENDIX M. LOOP INTEGRALS IN QED
_
d
4
k
(
k
2
2 p
1
k
1
)
2
(
k
2
2 p
2
k
2
)
=
1
_
0
2xdx
_
d
4
k
[(
k
2
2 p
1
k
1
)x + (
k
2
2 p
2
k
2
)(1 x)]
3
=
1
_
0
2xdx
_
d
4
k
[
k
2
x 2 p
1
kx
1
x +
k
2
2 p
2
k
2
k
2
x + 2 p
2
kx +
2
x]
3
=
1
_
0
2xdx
_
d
4
k
[
k
2
2 p
x
k
x
]
3
=
2
ic
3
1
_
0
xdx
p
2
x
+
x
(M.17)
where p
1
, p
2
are two arbitrary 4-vectors,
1
,
2
are numerical constants and
p
x
= x p
1
+ (1 x) p
2
x
= x
1
+ (1 x)
2
Similarly, we use (M.12) to obtain
_
d
4
kk
k
2
2 p
1
k
1
)
2
(
k
2
2 p
2
k
2
)
=
1
_
0
2xdx
_
d
4
kk
k
2
2 p
x
k
x
]
3
=
2
ic
3
1
_
0
p
x
xdx
p
2
x
+
x
(M.18)
Three more integrals are obtained by dierentiating (M.17) with respect to
2
and p
2
and by dierentiating (M.18) with respect to p
2
_
d
4
k
(
k
2
2 p
1
k
1
)
2
(
k
2
2 p
2
k
2
)
2
=
2
ic
3
1
_
0
x(1 x)dx
( p
2
x
+
x
)
2
(M.19)
_
d
4
kk
k
2
2 p
1
k
1
)
2
(
k
2
2 p
2
k
2
)
2
=
2
ic
3
1
_
0
p
x
x(1 x)dx
( p
2
x
+
x
)
2
M.4. ELECTRON SELF-ENERGY INTEGRAL 761
(M.20)
_
d
4
kk
k
2
2 p
1
k
1
)
2
(
k
2
2 p
2
k
2
)
2
=
2
ic
3
1
_
0
(p
x
p
x
1/2
( p
2
x
+
x
))x(1 x)dx
( p
2
x
+
x
)
2
(M.21)
M.4 Electron self-energy integral
The loop integral in square brackets in (10.18) can be represented in the
form
4
J
ad
(,p) =
(,p + mc
2
)I
= (2 ,p + 4mc
2
)I + 2
(M.22)
where
I
_
d
4
k
[( p
k)
2
m
2
c
4
]
k
2
I
_
d
4
kk
[( p
k)
2
m
2
c
4
]
k
2
The factor 1/
k
2
in the integrand is a source of both ultraviolet and infrared
divergences. So, the integrals need to be regularized, as described in subsec-
tion 10.1.1. To do that, we introduce two parameters: the ultraviolet cuto
and the infrared cuto .
5
Then we replace the troublesome factor 1/
k
2
by the integral
1/
k
2
2
c
4
_
2
c
4
dL
(
k
2
L)
2
(M.23)
4
Here we used equations (J.9), (J.10) and (J.11).
5
and have the dimensionality of (mass)
762 APPENDIX M. LOOP INTEGRALS IN QED
In the end of calculations we should take limits and 0. In this
limit the integral reduces to 1/
k
2
, as expected
_
0
dL
(
k
2
L)
2
=
k
2
dx
x
2
=
1
k
2
Then we can use (M.17) and (M.18) with parameters
1
= L; p
1
= 0;
2
= m
2
c
4
p
2
; p
2
= p (M.24)
p
x
= (1 x) p;
x
= xL + (1 x)(m
2
c
4
p
2
) (M.25)
to rewrite our integrals
6
I =
2
c
4
_
2
c
4
dL
_
d
4
k
(
k
2
2 p
k + p
2
m
2
c
4
)(
k
2
L)
2
=
2
ic
3
2
c
4
_
2
c
4
dL
1
_
0
xdx
( p
2
x
+
x
)
=
2
ic
3
2
c
4
_
2
c
4
dL
1
_
0
xdx
(1 x)
2
p
2
+ xL + (1 x)(m
2
c
4
p
2
)
=
2
ic
3
1
_
0
dxln
_
(1 x)
2
p
2
+ xL + (1 x)(m
2
c
4
p
2
)
_
L=
2
c
4
L=
2
c
4
=
2
ic
3
1
_
0
dxln
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
=
2
ic
3
1
_
0
dxln
x
2
c
4
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
(M.26)
6
we have assumed that
2
m
2
c
4
M.4. ELECTRON SELF-ENERGY INTEGRAL 763
I
2
c
4
_
2
c
4
dL
_
d
4
k
k
k
2
2 p
k + p
2
m
2
c
4
)(
k
2
L)
2
=
2
ic
3
2
c
4
_
2
c
4
dL
1
_
0
p
x(1 x)dx
(1 x)
2
p
2
+ xL + (1 x)(m
2
c
4
p
2
)
=
2
ic
3
1
_
0
dx(1 x)p
ln
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
=
2
ic
3
1
_
0
dx(1 x)p
ln
x
2
c
4
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
(M.27)
Inserting (M.26) and (M.27) in (M.22) we obtain
J
ad
(,p) =
2
ic
3
(2 ,p + 4mc
2
)
1
_
0
dxln
x
2
c
4
(1 x)
2
p
2
+ (1 x)(m
2
c
4
p
2
)
+
2
2
,p
ic
3
1
_
0
dx(1 x) ln
x
2
c
4
(1 x)
2
p
2
+ (1 x)(m
2
c
4
p
2
)
=
2
ic
3
1
_
0
dx(4mc
2
2 ,px) ln
x
2
c
4
(1 x)
2
p
2
+ (1 x)(m
2
c
4
p
2
)
(M.28)
For our discussion in subsections 10.2.1 and 10.2.2 it would be convenient
to represent J
ad
in the form of Taylor expansion around the on-mass-shell
value of 4-momentum ,p = mc
2
J
ad
(,p) = C
0
ad
+ C
1
(,p mc
2
)
ad
+ R(,p) (M.29)
where C
0
is a constant (independent on p
) term, C
1
(,p mc
2
)
ad
is linear
in ,p mc
2
and R(,p) combines all other terms (quadratic, cubic, etc. in
764 APPENDIX M. LOOP INTEGRALS IN QED
,p mc
2
). Coecients of the Taylor expansion, as usual, can be obtained by
dierentiation
7
at ,p = mc
2
C
0
= J
ad
p=mc
2
C
1
=
dJ
ad
d ,p
p=mc
2
To calculate C
0
we simply set p
2
= m
2
c
4
in (M.28)
C
0
=
2
2
mc
2
ic
3
1
_
0
dx(2 x) ln
x
2
c
4
(1 x)
2
m
2
c
4
2
2
mc
2
ic
3
1
_
0
(2 x)dxln
2
m
2
+
2
2
mc
2
ic
3
1
_
0
(2 x)dxln
x
(1 x)
2
=
3
2
mc
2
ic
3
ln
2
m
2
+
3
2
mc
2
2ic
3
=
3
2
mc
2
2ic
3
_
4 ln
m
+ 1
_
(M.30)
For the coecient C
1
we obtain
8
C
1
=
dJ
ad
d ,p
p=mc
2
=
2
2
ic
3
1
_
0
xdxln
x
2
c
4
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
p=mc
2
2
2
ic
3
mc
2
1
_
0
(2 x)dx
(1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
)
x
2
c
4
x
2
c
4
(2(1 x)
2
,p 2(1 x) ,p)
((1 x)
2
p
2
+ x
2
c
4
+ (1 x)(m
2
c
4
p
2
))
2
p=mc
2
7
Note that p
2
is a function of ,p due to (J.23). When doing calculations with slash
symbols ,p and -matrices it is convenient to use properties (J.3) - (J.13)
8
Here we used integral
1
_
0
dxxln(1/(1 x)
2
) = 5/4.
M.5. INTEGRAL FOR THE VERTEX RENORMALIZATION 765
=
2
2
ic
3
1
_
0
xdxln
x
2
(1 x)
2
m
2
2
2
mc
2
ic
3
1
_
0
dx(2 x)
2(1 x)
2
mc
2
2(1 x)mc
2
(1 x)
2
m
2
c
4
+ x
2
c
4
=
2
2
ic
3
1
_
0
xdxln
x
2
(1 x)
2
m
2
4
2
ic
3
1
_
0
dx
(2 x)(x
2
x)
(1 x)
2
+ x
2
/m
2
=
2
2
ic
3
1
_
0
xdxln
x
(1 x)
2
2
ic
3
ln
2
m
2
2
2
ic
3
_
1 + ln
2
m
2
_
=
2
2
ic
3
_
ln
m
+ 2 ln
m
+
9
4
_
(M.31)
Then the residual term
R(,p) = J
ad
(,p) C
0
ad
C
1
(,p mc
2
)
ad
is ultraviolet-nite, because all -dependent terms there cancel out
2
ic
3
ln
2
1
_
0
dx(4mc
2
2 ,px)
2
2
mc
2
ic
3
ln
2
1
_
0
dx(2 x)
+ (,p mc
2
)
2
2
ic
3
ln
2
1
_
0
xdx = 0
It can be said that J
bc
(,p), as a function of ,p, is innite at the point ,p = mc
2
and has an innite rst derivative at this point. However, the 2nd and higher
derivatives are all nite.
M.5 Integral for the vertex renormalization
Let us calculate the integral in square brackets in equation (10.40)
9
9
We used equation (M.23) and took into account that q and q
)
2
= m
2
c
4
.
766 APPENDIX M. LOOP INTEGRALS IN QED
I
( q, q
) =
_
d
4
h
,h+ ,q + mc
2
(
h q)
2
m
2
c
4
,h+ ,q
+ mc
2
(
h q
)
2
m
2
c
4
h
2
2
c
4
_
2
c
4
dL
_
d
4
h
( ,h+ ,q + mc
2
)
( ,h+ ,q
+ mc
2
)
h
2
2 q
h)(
h
2
2 q
h)(
h
2
L)
2
The numerator can be rewritten as
( ,h+ ,q + mc
2
)
( ,h+ ,q
+ mc
2
)
(,q + mc
2
)
(,q
+ mc
2
)
,h
(,q
+ mc
2
)
(,q + mc
2
)
,h
,h
,h
( q, q
) =
(,q + mc
2
)
(,q
+ mc
2
)
(,q
+ mc
2
)
(,q + mc
2
)
(M.32)
where
10
J =
1
_
0
dy
2
c
4
_
2
c
4
dL
_
d
4
h
[
h
2
2
h q
y
]
2
[
h
2
L]
2
(M.33)
J
=
1
_
0
dy
2
c
4
_
2
c
4
dL
_
d
4
hh
h
2
2
h q
y
]
2
[
h
2
L]
2
(M.34)
J
=
1
_
0
dy
2
c
4
_
2
c
4
dL
_
d
4
hh
h
2
2
h q
y
]
2
[
h
2
L]
2
(M.35)
These are particular cases of integrals (M.19) - (M.21) with parameters p
1
=
q
y
,
1
= 0, p
2
= 0,
2
= L, p
x
= x q
y
,
x
= (1 x)L
10
The denominators were combined using (M.3) and q
y
y q + (1 y) q
.
M.5. INTEGRAL FOR THE VERTEX RENORMALIZATION 767
J =
2
ic
3
1
_
0
dy
2
c
4
_
2
c
4
dL
1
_
0
x(1 x)dx
(x
2
q
2
y
+ (1 x)L)
2
(M.36)
J
2
ic
3
1
_
0
dy
2
c
4
_
2
c
4
dL
1
_
0
q
y
x
2
(1 x)dx
(x
2
q
2
y
+ (1 x)L)
2
(M.37)
J
2
ic
3
1
_
0
dy
2
c
4
_
2
c
4
dL
1
_
0
[x
2
q
y
q
y
1/2
(x
2
q
2
y
+ (1 x)L)]x(1 x)dx
(x
2
q
2
y
+ (1 x)L)
2
(M.38)
In the limit we obtain for (M.36)
J =
2
ic
3
1
_
0
dx
1
_
0
dy
x
x
2
q
2
y
+ (1 x)L
L=
L=
2
c
4
=
2
ic
3
1
_
0
dx
1
_
0
dy
x
x
2
q
2
y
+ (1 x)
2
c
4
2
2ic
3
1
_
0
dy
q
2
y
ln(x
2
q
2
y
(1 x)
2
c
4
)
x=1
x=0
=
2
2ic
3
1
_
0
dy
q
2
y
_
ln( q
2
y
) ln(
2
c
4
)
_
=
2
2ic
3
1
_
0
dy
1
q
2
y
ln
q
2
y
2
c
4
(M.39)
To proceed further with this integral we introduce the 4-vector of trans-
ferred momentum
k q
q (M.40)
then from ( q
)
2
= ( q +
k)
2
and ( q
)
2
= q
2
= m
2
c
4
it follows that
2 q
k =
k
2
q
y
= q + (1 y)
k
q
2
y
= m
2
c
4
(1 y)y
k
2
768 APPENDIX M. LOOP INTEGRALS IN QED
Instead of
k
2
and integration variable y it is convenient to introduce two new
variables and , such that
11
k
2
4m
2
c
4
sin
2
(M.41)
y =
1
2
_
1 +
tan
tan
_
1 y =
1
2
_
1
tan
tan
_
q
2
y
= m
2
c
4
4m
2
c
4
sin
2
1
2
_
1 +
tan
tan
_
1
2
_
1
tan
tan
_
= m
2
c
4
m
2
c
4
cos
2
(tan
2
tan
2
) = m
2
c
4
cos
2
cos
2
dy =
d
2 tan
d
d
_
sin
cos
_
=
d
2 tan cos
2
dy
q
2
y
=
d
2m
2
c
4
cos
2
tan
=
d
m
2
c
4
sin(2)
Integral J is infrared-divergent
12
J =
2
2ic
3
d
m
2
c
4
sin(2)
ln
_
m
2
cos
2
2
cos
2
_
=
2
2
ic
3
m
2
c
4
sin(2)
ln
m
2
2ic
3
m
2
c
4
sin(2)
dln
cos
2
cos
2
=
2
2
ic
3
m
2
c
4
sin(2)
ln
m
2
2
ic
3
m
2
c
4
sin(2)
_
0
d
_
ln(cos ) ln(cos )
_
=
2
2
ic
3
m
2
c
4
sin(2)
ln
m
2
2
ln(cos )
ic
3
m
2
c
4
sin(2)
+
2
2
ic
3
m
2
c
4
sin(2)
_
0
dln(cos )
11
Note that, by denition, 0
k
2
4m
2
c
4
.
12
Here we took the following integral by parts
_
0
dln(cos ) = ln(cos ) +
_
0
tan d
M.5. INTEGRAL FOR THE VERTEX RENORMALIZATION 769
= 2A
_
_
ln
m
_
0
tan d
_
_
(M.42)
where we dened
A
2
ic
3
m
2
c
4
sin(2)
Next we calculate (M.37) using variables and introduced above. Taking
the limits 0, we obtain a nite result (both infrared and
ultraviolet divergences are absent)
J
=
2
ic
3
1
_
0
dx
1
_
0
dy
x
2
q
y
x
2
q
2
y
+ (1 x)L
L=
2
c
4
L=
2
c
4
2
ic
3
1
_
0
dy
q
y
q
2
y
=
2
ic
3
d
m
2
c
4
sin(2)
_
q
+
k
2
_
1
tan
tan
__
=
2
ic
3
2
m
2
c
4
sin(2)
_
q
+
k
2
_
+
2
k
2ic
3
m
2
c
4
sin(2) tan
dtan
= A(q
+ q
) (M.43)
Next we need to calculate (M.38)
13
J
2
ic
3
2
c
4
_
2
c
4
dL
1
_
0
dx
1
_
0
dy
x
3
(1 x)q
y
q
y
(x
2
q
2
y
+ (1 x)L)
2
+
2
2ic
3
2
c
4
_
2
c
4
dL
1
_
0
dx
1
_
0
dy
x(1 x)
x
2
q
2
y
+ (1 x)L
13
Here we assumed that
2
c
4
q
2
y
2
c
4
and used integral
1
_
0
dxxln
_
(1 x)/x
2
_
=
_
(x
2
/2) ln
_
(1 x)/x
2
_
+x
2
/4 x/2 1/2 ln(1 x)
1
0
= 1/4.
770 APPENDIX M. LOOP INTEGRALS IN QED
2
ic
3
1
_
0
dx
1
_
0
dy
_
x
3
q
y
q
y
x
2
q
2
y
+ (1 x)
2
c
4
xq
y
q
y
q
2
y
_
+
2
2ic
3
1
_
0
dx
1
_
0
dy
x[ln((1 x)
2
c
4
) ln(x
2
q
2
y
)]
2
2ic
3
1
_
0
dy
q
y
q
y
q
2
y
+
2
2ic
3
1
_
0
dx
1
_
0
dyxln
(1 x)
2
c
4
x
2
q
2
y
=
2
2ic
3
1
_
0
dy
q
y
q
y
q
2
y
+
2
2ic
3
1
_
0
dxxln
(1 x)
x
2
+
2
4ic
3
1
_
0
dy ln
2
c
4
q
2
y
=
2
2ic
3
1
_
0
dy
q
y
q
y
q
2
y
+
2
4ic
3
1
_
0
dy ln
2
c
4
q
2
y
8ic
3
Integrations on y are performed using variables and
14
1
_
0
dy
q
y
q
y
q
2
y
=
_
q
+
1
2
k
tan
2 tan
__
q
+
1
2
k
tan
2 tan
_
d
m
2
c
4
sin(2)
=
_
q
+
1
2
k
__
q
+
1
2
k
_
d
m
2
c
4
sin(2)
+
tan
2
4 tan
2
d
m
2
c
4
sin(2)
=
2m
2
c
4
sin(2)
(q
+ q
)(q
+ q
) +
k
cos
4m
2
c
4
sin
3
_
0
tan
2
d
=
2m
2
c
4
sin(2)
(q
+ q
)(q
+ q
) +
k
k
2
(1 cot )
1
_
0
dy ln
2
c
4
q
2
y
=
_
0
d
tan cos
2
ln
2
cos
2
m
2
cos
2
14
Here we used integrals
_
tan
2
(x)dx = tan(x) x +C,
_
cos
2
(x)dx = tan(x) +C and
_
ln(cos
2
(x))/ cos
2
(x)dx = 2x + 2 tan(x) + tan(x) ln(cos
2
(x)) +C.
M.5. INTEGRAL FOR THE VERTEX RENORMALIZATION 771
= ln
2
m
2
cos
2
+
1
tan
(2 + ln(cos
2
) tan + 2 tan)
= 2 ln
m
+ 2(1 cot )
Then we see that integral J
is ultraviolet-divergent
J
=
A
4
(q
+ q
)(q
+ q
) + Dk
+ E
(M.44)
D
2
(1 cot )
2ic
3
k
2
E
2
2ic
3
_
ln
m
+
3
4
cot
_
Using results (M.42), (M.43), (M.44), we obtain for (M.32)
I
( q, q
) = J
(,q + mc
2
)
(,q
+ mc
2
)
(,q+ ,q
(,q
+ mc
2
)
(,q + mc
2
)
(,q+ ,q
+
A
4
(,q+ ,q
(,q+ ,q
+ D
,k
,k
+ 4E
= JT
1
AT
2
AT
3
+
A
4
T
4
+ D
,k
,k
+ 4E
(M.45)
Let us now use (J.11) - (J.13) and process these terms one-by-one
T
1
=
(,q + mc
2
)
(,q
+ mc
2
)
,q
,q
+ mc
2
,q
+ mc
2
,q
+ m
2
c
4
= 2 ,q
,q + 2mc
2
,q
+ 2mc
2
,q
+ 2mc
2
,q
+ 2mc
2
,q 2m
2
c
4
= 2(,q+ ,k)
(,q
,k) + 2mc
2
,q
+ 2mc
2
(,q+ ,k)
+ 2mc
2
,q
+2mc
2
(,q
,k) 2m
2
c
4
= 2 ,q
,q
2 ,k
,q
+ 2 ,q
,k + 2 ,k
,k + 2mc
2
,q
+2mc
2
,q
+ 2mc
2
,k
+ 2mc
2
,q
+ 2mc
2
,q
2mc
2
,k 2m
2
c
4
( q, q
2mc
2
,k
+ 2mc
2
,k + 2 ,k
,k + 2m
2
c
4
+2m
2
c
4
+ 2mc
2
,k
+ 2m
2
c
4
+ 2m
2
c
4
2mc
2
,k 2m
2
c
4
= 2 ,k
,k + 4m
2
c
4
,k =
+ 2
,k
2
+ 2 ,kk
k
2
+ 2(,q
,q)k
The last term vanishes when sandwiched between u(q, ) and u(q
). So,
we can set ,k
,k =
k
2
. Then
T
1
= (2
k
2
+ 4m
2
c
4
)
We use the same techniques to obtain the 2nd, 3rd and 4th terms in (M.45)
T
2
=
(,q+ ,q
(,q
+ mc
2
)
,q
,q
,q
,q
+ mc
2
,q
+ mc
2
,q
= 2 ,q
,q 2 ,q
,q
+ 2mc
2
,q
+ 2mc
2
,q + 2mc
2
,q
+ 2mc
2
,q
= 2(,q+ ,k)
(,q
,q
+ 2mc
2
,q
+ 2mc
2
(,q
,k)
+2mc
2
(,q+ ,k)
+ 2mc
2
,q
= 2 ,q
,q
2 ,k
,q
+ 2 ,q
,k + 2 ,k
,k 2 ,q
,q
2 ,k
,q
+ 2mc
2
,q
+ 2mc
2
,q
2mc
2
,k
+2mc
2
,q
+ 2mc
2
,k
+ 2mc
2
,q
= 2m
2
c
4
2mc
2
,k
+ 2mc
2
,k + 2 ,k
,k 2m
2
c
4
2mc
2
,k
+ 2m
2
c
4
+ 2m
2
c
4
2mc
2
,k
+2m
2
c
4
+ 2mc
2
,k
+ 2m
2
c
4
= 2 ,k
,k 2mc
2
,k
+ 4m
2
c
4
k
2
+ 4m
2
c
4
)
2mc
2
,k
T
3
=
(,q + mc
2
)
(,q+ ,q
,q
,q
mc
2
,q
,q
,q
mc
2
,q
= 2 ,q
,q + 2mc
2
,q + 2mc
2
,q
2 ,q
,q + 2mc
2
,q
+ 2mc
2
,q
= 2 ,q
(,q
,k) + 2mc
2
(,q
,k) + 2mc
2
,q
2(,q+ ,k)
(,q
,k)
+2mc
2
,q
+ 2mc
2
(,q+ ,k)
= 2 ,q
,q
+ 2 ,q
,k + 2mc
2
,q
2mc
2
,k + 2mc
2
,q
2 ,q
,q
2 ,k
,q
+ 2 ,q
,k + 2 ,k
,k
+2mc
2
,q
+ 2mc
2
,q
+ 2mc
2
,k
= 2m
2
c
4
+ 2mc
2
,k + 2m
2
c
4
2mc
2
,k + 2m
2
c
4
2m
2
c
4
2mc
2
,k
+ 2mc
2
,k + 2 ,k
,k
+2m
2
c
4
+ 2m
2
c
4
+ 2mc
2
,k
= (4m
2
c
4
2
k
2
)
+ 2mc
2
,k
T
4
=
(,q+ ,q
(,q+ ,q
,q
,q
,q
,q
,q
,q
,q
,q
= 2 ,q
,q 2 ,q
,q
2 ,q
,q 2 ,q
,q
= 2 ,q
(,q
,k) 2 ,q
,q
2(,q+ ,k)
(,q
,q
= 2 ,q
,q
+ 2 ,q
,k 2 ,q
,q
2 ,q
,q
2 ,k
,q
+2 ,q
,k + 2 ,k
,k 2 ,q
,q
2 ,k
,q
= 2m
2
c
4
+ 2mc
2
,k 2m
2
c
4
2m
2
c
4
2mc
2
,k
+2mc
2
,k + 2 ,k
,k 2m
2
c
4
2mc
2
,k
= 8m
2
c
4
+ 4mc
2
,k 4mc
2
,k
k
2
(q, q
) = J(2
k
2
+ 4m
2
c
4
)
A((2
k
2
+ 4m
2
c
4
)
2mc
2
,k
) A((4m
2
c
4
2
k
2
)
+ 2mc
2
,k)
+
A
4
(8m
2
c
4
+ 4mc
2
,k 4mc
2
,k
k
2
) + 2D
k
2
+ 4E
=
_
2J
k
2
+ 2D
k
2
+ 4Jm
2
c
4
+ 4E 10Am
2
c
4
+
7
2
A
k
2
_
15
Here we used (J.85) to write
,k ,k
= 4mc
2
2(q
+q
.
774 APPENDIX M. LOOP INTEGRALS IN QED
Amc
2
(
,k ,k
)
=
_
2J
k
2
+ 2D
k
2
+ 4Jm
2
c
4
+ 4E 14Am
2
c
4
+
7A
k
2
2
_
+ 2Amc
2
(q
+ q
)
The coecient in front of
is
2
2
(2
k
2
+ 4m
2
c
4
)
ic
3
m
2
c
4
sin(2)
_
_
ln
m
_
0
tand
_
_
2
(1 cot )
ic
3
+
2
2
ic
3
_
ln
m
+ (1 cot )
1
4
_
+
14
2
ic
3
sin(2)
7
2
k
2
2ic
3
m
2
c
4
sin(2)
=
2
2
(8m
2
c
4
sin
2
+ 4m
2
c
4
)
ic
3
m
2
c
4
sin(2)
_
_
ln
m
_
0
tan d
_
_
+
2
(1 cot )
ic
3
+
2
2
ic
3
ln
m
2
2ic
3
+
14
2
ic
3
sin(2)
14
2
m
2
c
4
sin
2
ic
3
m
2
c
4
sin(2)
=
8
2
ic
3
tan(2)
_
_
ln
m
_
0
tand
_
_
+
2
(1 cot )
ic
3
+
2
2
ic
3
ln
m
2
2ic
3
+
7
2
cot
ic
3
Therefore, nally
I
( q, q
)
=
2
ic
3
_
8
tan(2)
ln
m
+
8
tan(2)
_
0
tan d +
1
2
+ 6 cot + 2 ln
m
_
2
2
(q + q
imc
5
sin(2)
(M.46)
M.6. INTEGRAL FOR THE LADDER DIAGRAM 775
M.6 Integral for the ladder diagram
For the integral (10.54)
b(p, q, k) =
_
d
4
h
[
h
2
+ 2( q
h)][
h
2
2( p
h)][
h
2
2
c
4
][
h
2
+ 2(
k) +
k
2
2
c
4
]
we follow the calculation technique from [Red53]. First use equation (M.7)
and notation
a =
h
2
+ 2(
h) +
k
2
2
c
4
b =
h
2
+ 2( q
h)
c =
h
2
2( p
h)
d =
h
2
2
c
4
to write
b(p, q, k)
= 6
_
d
4
h
1
_
0
dx
1
_
0
dy
1
_
0
xz
2
dz[(
h
2
+ 2(
h) + k
2
2
c
4
)z(1 x) + (
h
2
+ 2( q
h))xyz
+ (
h
2
2( p
h))xz(1 y) + (
h
2
2
c
4
)(1 z)]
4
= 6
_
d
4
h
1
_
0
dx
1
_
0
dy
1
_
0
xz
2
dz[
h
2
2
h (
h
2
2(
h p
x
)z ]
4
where
2
c
4
(1 zx)
k
2
z(1 x)
p
x
=
k(1 x) + x p
y
p
y
= p(1 y) qy
776 APPENDIX M. LOOP INTEGRALS IN QED
From (M.13) we obtain
b(p, q, k) =
2
ic
3
1
_
0
dx
1
_
0
dy
1
_
0
xz
2
dz
(z
2
p
2
x
+ )
2
We have q
= q
k and p
= p +
k. Taking squares of both sides of these
equations and using q
2
= ( q
)
2
= m
2
c
4
and p
2
= ( p
)
2
= M
2
c
4
we obtain
( q
k) =
k
2
/2 (M.47)
( p
k) =
k
2
/2 (M.48)
(
k p
y
) = (
k p)(1 y) (
k q)y =
k
2
2
(1 y)
k
2
2
y =
k
2
2
p
2
x
= (x p
y
k(1 x))
2
= x
2
p
2
y
+
k
2
(1 x)
2
2x(1 x)( p
y
k)
= x
2
p
2
y
+
k
2
2
k
2
x +
k
2
x
2
+
k
2
x
k
2
x
2
= x
2
p
2
y
+
k
2
(1 x)
b(p, q, k) =
2
ic
3
1
_
0
dx
1
_
0
dy
1
_
0
xz
2
dz
[z
2
(x
2
p
2
y
+
k
2
(1 x)) +
2
c
4
(1 zx)
k
2
z(1 x)]
2
Even though is small, the term
2
c
4
(1 zx) cannot be neglected
16
when
x 0, z 0, when x 1, z 0 and when x 0, z 1. Therefore, we
are going to break the region of integration on x into three parts 0 < x < ,
< x < 1 and 1 < x < 1, where and are small, but large enough,
so that in the interval < x < 1 the term
2
c
4
(1 zx) can be neglected.
Integrations on x in these three regions split our integral into three parts
b(p, q, k) = L
I
+ L
II
+ L
III
In the second region we neglect the -term
L
II
2
ic
3
1
_
dx
1
_
0
dy
1
_
0
xdz
[z(x
2
p
2
y
+
k
2
(1 x))
k
2
(1 x)]
2
16
because other terms in the denominator can be even smaller
M.6. INTEGRAL FOR THE LADDER DIAGRAM 777
use table integrals
_
dz
(az + b)
2
=
1
a(ax + b)
+ const
_
dx
x(1 x)
= ln(x) ln(x 1) + const
and obtain
L
II
=
2
ic
3
1
_
dx
1
_
0
dy
x
[x
2
p
2
y
+
k
2
(1 x)][z(x
2
p
2
y
+
k
2
(1 x))
k
2
(1 x)]
z=1
z=0
=
2
ic
3
1
_
dx
1
_
0
dy
x
x
2
p
2
y
+
k
2
(1 x)
_
1
x
2
p
2
y
+
1
k
2
(1 x)
_
=
2
ic
3
k
2
1
_
dx
1
_
0
dy
1
x p
2
y
(1 x)
=
2
ic
3
k
2
1
_
0
dy
1
p
2
y
(ln(x) ln(x 1))
x=1
x=
2
ic
3
k
2
1
_
0
dy
1
p
2
y
(ln() ln(1) ln() + ln(1))
=
2
ln()
ic
3
k
2
1
_
0
dy
p
2
y
In the third integral we replace x 1 x
L
III
=
2
ic
3
0
_
dx
1
_
0
dy
1
_
0
(1 x)z
2
dz
[z
2
((1 x)
2
p
2
y
+
k
2
x) +
2
c
4
(1 z(1 x))
k
2
zx]
2
2
ic
3
0
_
dx
1
_
0
dy
1
_
0
z
2
dz
[z
2
p
2
y
+ z
2
k
2
x +
2
c
4
(1 z)
k
2
zx]
2
778 APPENDIX M. LOOP INTEGRALS IN QED
=
2
ic
3
k
2
1
_
0
dy
1
_
0
zdz
(z 1)
_
1
z
2
p
2
y
+ z
2
k
2
x +
2
c
4
(1 z)
k
2
zx
_
x=0
x=
=
2
ic
3
k
2
1
_
0
dy
1
_
0
zdz
(z 1)
_
1
z
2
p
2
y
+
2
c
4
(1 z)
1
z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z
_
=
2
ic
3
k
2
1
_
0
dy
1
_
0
zdz
(z 1)
z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z z
2
p
2
y
2
c
4
(1 z)
(z
2
p
2
y
+
2
c
4
(1 z))(z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z)
=
2
ic
3
1
_
0
dy
1
_
0
z
2
dz
[z
2
p
2
y
+
2
c
4
(1 z)][z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z]
We now break the z integration into two regions 0 z < z
c
and z
c
z < 1,
where z
c
is chosen such that
2
c
4
z
2
c
p
2
y
k
2
z
c
. We also use table integrals
_
dz
z(az + b)
=
1
b
[ln(z) ln(az + b)] + const
_
dz
a + bz
a + cz
2
=
b
2c
ln(a + cz
2
) +
_
a
c
tan
1
_
cz
a
_
+ const
b
2c
ln(a + cz
2
) + const (M.49)
_
dz
a
a + cz
=
a
c
ln(a + cz) + const
Then
L
III
= L
IIIa
+ L
IIIb
L
IIIb
=
2
ic
3
1
_
0
dy
1
_
zc
z
2
dz
[z
2
p
2
y
+
2
c
4
(1 z)][z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z]
ic
3
1
_
0
dy
1
_
zc
dz
p
2
y
z[z( p
2
y
+
k
2
)
k
2
]
=
ic
3
1
_
0
dy
p
2
y
1
k
2
_
ln(z) ln(z( p
2
y
+
k
2
)
k
2
)
_
z=1
z=zc
M.6. INTEGRAL FOR THE LADDER DIAGRAM 779
=
ic
3
1
_
0
dy
p
2
y
1
k
2
_
ln( p
2
y
) ln(z
c
) + ln[z
c
( p
2
y
+
k
2
)
k
2
]
_
2
ic
3
k
2
1
_
0
dy
p
2
y
ln
_
k
2
p
2
y
z
c
_
(M.50)
L
IIIa
=
2
ic
3
1
_
0
dy
zc
_
0
z
2
dz
[z
2
p
2
y
+
2
c
4
(1 z)][z
2
p
2
y
+ z
2
k
2
+
2
c
4
(1 z)
k
2
z]
ic
3
1
_
0
dy
zc
_
0
z
2
dz
(z
2
p
2
y
+
2
c
4
)(
2
c
4
k
2
z)
=
ic
3
1
_
0
dy
zc
_
0
dz
1
p
2
y
2
c
4
+
k
4
2
_
2
c
4
+
k
2
z
z
2
p
2
y
+
2
c
4
2
c
4
2
c
4
k
2
z
_
=
ic
3
1
_
0
dy
1
p
2
y
2
c
4
+
k
4
2
_
k
2
2 p
2
y
ln(
2
c
4
+ z
2
p
2
y
) +
2
c
4
k
2
ln(
2
c
4
k
2
z)
_
z=zc
z=0
=
ic
3
1
_
0
dy
1
p
2
y
2
c
4
+
k
4
2
_
k
2
2 p
2
y
ln(
2
c
4
+ z
2
c
p
2
y
) +
2
c
4
k
2
ln(
2
c
4
k
2
z
c
)
k
2
2 p
2
y
ln(
2
c
4
)
2
c
4
k
2
ln(
2
c
4
)
_
ic
3
1
_
0
dy
1
p
2
y
2
c
4
+
k
4
2
_
k
2
2 p
2
y
ln(z
2
c
p
2
y
) +
2
c
4
k
2
ln(
k
2
z
c
)
k
2
2 p
2
y
ln(
2
c
4
)
2
c
4
k
2
ln(
2
c
4
)
_
2
ic
3
2
k
2
1
_
0
dy
1
p
2
y
ln
_
z
2
c
p
2
y
2
c
4
_
(M.51)
Adding together (M.50) and (M.51) we obtain
780 APPENDIX M. LOOP INTEGRALS IN QED
L
III
= L
IIIa
+ L
IIIb
=
2
ic
3
k
2
1
_
0
dy
p
2
y
ln
_
k
2
p
2
y
z
c
_
2
ic
3
2
k
2
1
_
0
dy
1
p
2
y
ln
_
z
2
c
p
2
y
2
c
4
_
2
ic
3
2
k
2
1
_
0
dy
p
2
y
ln
_
k
4
2
p
4
y
z
2
c
_
2
ic
3
2
k
2
1
_
0
dy
1
p
2
y
ln
_
z
2
c
p
2
y
2
c
4
_
=
2
ic
3
2
k
2
1
_
0
dy
p
2
y
ln
_
k
4
2
p
2
y
2
c
4
_
In the integral L
I
we replace z 1 z
L
I
=
2
ic
3
_
0
dx
1
_
0
dy
1
_
0
x(1 z)
2
dz
[(1 z)
2
(x
2
p
2
y
+
k
2
(1 x)) +
2
c
4
(1 (1 z)x)
k
2
(1 z)(1 x)]
2
and break z-integration into two regions 0 z < z
c
and z
c
z 1, where
z
c
is small, but large enough, so that in the second region we can neglect the
-term. Then
L
Ia
2
ic
3
_
0
dx
1
_
0
dy
zc
_
0
xdz
[(1 2z)(x
2
p
2
y
+
k
2
(1 x)) +
2
c
4
k
2
(1 z)(1 x)]
2
=
2
ic
3
_
0
dx
1
_
0
dy
zc
_
0
xdz
[(x
2
p
2
y
+
2
c
4
) (2x
2
p
2
y
+
k
2
(1 x))z]
2
=
2
ic
3
_
0
xdx
1
_
0
dy
1
[2x
2
p
2
y
+
k
2
(1 x)][(x
2
p
2
y
+
2
c
4
) + (2x
2
p
2
y
+
k
2
(1 x))z]
z=zc
z=0
=
2
ic
3
_
0
xdx
1
_
0
dy
1
(2x
2
p
2
y
+
k
2
(1 x))
_
1
(x
2
p
2
y
+
2
c
4
) + (2x
2
p
2
y
+
k
2
(1 x))z
c
M.6. INTEGRAL FOR THE LADDER DIAGRAM 781
+
1
x
2
p
2
y
+
2
c
4
_
2
ic
3
_
0
dx
1
_
0
dy
x
k
2
(1 x)(x
2
p
2
y
+
2
c
4
)
2
ic
3
k
2
_
0
dx
1
_
0
dy
x(1 + x)
x
2
p
2
y
+
2
c
4
2
ic
3
k
2
_
0
dx
1
_
0
dy
_
x
x
2
p
2
y
+
2
c
4
+
1
p
2
y
_
The last term in parentheses can be neglected when integrated on x. Using
integral (M.49) with a =
2
c
4
, b = 1, c = p
2
y
we obtain
L
Ia
2
ic
3
k
2
1
_
0
dy
2 p
2
y
ln(x
2
p
2
y
+
2
c
4
)
x=
x=0
=
2
ic
3
k
2
1
_
0
dy
2 p
2
y
ln
_
2
p
2
y
2
c
4
_
In the second part L
Ib
we neglect the -term
L
Ib
2
ic
3
_
0
dx
1
_
0
dy
1
_
zc
x(1 z)
2
dz
[(1 z)
2
(x
2
p
2
y
+
k
2
(1 x))
k
2
(1 z)(1 x)]
2
2
ic
3
_
0
dx
1
_
0
dy
1
_
zc
xdz
[x
2
p
2
y
+ (x
2
p
2
y
+
k
2
(1 x))z]
2
=
2
ic
3
_
0
xdx
1
_
0
dy
1
[x
2
p
2
y
+
k
2
(1 x)][x
2
p
2
y
+ (x
2
p
2
y
+
k
2
(1 x))z]
z=1
z=zc
=
2
ic
3
_
0
dx
1
_
0
dy
x
x
2
p
2
y
+
k
2
(1 x)
_
1
k
2
(1 x)
1
x
2
p
2
y
+ (x
2
p
2
y
+
k
2
(1 x))z
c
_
2
ic
3
k
2
_
0
dx
1
_
0
dyx
_
1
k
2
k
2
z
c
_
782 APPENDIX M. LOOP INTEGRALS IN QED
0
Collecting all contributions we obtain
L = L
Ia
+ L
II
+ L
III
2
ic
3
k
2
1
_
0
dy
1
2 p
2
y
ln
_
2
p
2
y
2
c
4
_
+
2
ln()
ic
3
k
2
1
_
0
dy
p
2
y
2
ic
3
2
k
2
1
_
0
dy
p
2
y
ln
_
k
4
2
p
2
y
2
c
4
_
=
2
ic
3
k
2
1
_
0
dy
1
2 p
2
y
ln
_
2
p
2
y
2
c
4
k
4
2
p
2
y
2
c
4
_
=
2
ic
3
k
2
ln
_
k
2
2
c
4
_
1
_
0
dy
1
p
2
y
(M.52)
This is equation (A20) in [Red53].
M.7 Integral in (13.114)
Here we will calculate the 3D integral
D(q, q
) =
_
ds
[(q s)
2
+
2
c
2
][s
2
q
2
+ i][(s q
)
2
+
2
c
2
]
in formula (13.114) for the 4th order commutator term in the electron-proton
interaction. We are interested in leading terms surviving in the limits 0,
0. The calculation method was adopted from 121 in [BLP01].
17
First we use (M.6) and the elastic scattering condition (q
)
2
= q
2
to write
D(q, q
)
= 2
1
_
0
dx
1x
_
0
dy
_
ds
[((q s)
2
+
2
c
2
)x + ((s q
)
2
+
2
c
2
)y + (s
2
q
2
i)(1 x y)]
3
= 2
1
_
0
dx
1x
_
0
dy
_
ds
[s
2
2(qs)x 2(q
s)y +
2
c
2
(x + y) + q
2
(2x + 2y 1) i]
3
17
see also [Kac59]
M.7. INTEGRAL IN (??) 783
Next we shift the integration variable s h s xq
) = 2q
2
k
2
, where the vector of transferred momentum
is dened as k = q
q
D(q, q
)
= 2
1
_
0
dx
1x
_
0
dy
_
dh
[h
2
+ q
2
(x
2
y
2
+ 2x + 2y 1) 2(qq
)xy +
2
c
2
(x + y) i]
3
= 2
1
_
0
dx
1x
_
0
dy
_
dh
[h
2
q
2
(x + y 1)
2
+ k
2
xy +
2
c
2
(x + y) i]
3
=
i
2
2
1
_
0
dx
1x
_
0
dy
1
[q
2
(x + y 1)
2
k
2
xy
2
c
2
(x + y) i]
3/2
Change integration variables = x + y, = x y
D(q, q
) =
i
2
2
1
_
0
d
_
0
d
1
(q
2
( 1)
2
k
2
2
/4 + k
2
2
/4
2
c
2
i)
3/2
=
i
2
2
1
_
0
d
(q
2
( 1)
2
k
2
2
/4
2
c
2
2
i)
_
q
2
( 1)
2
2
c
2
i
Let us now introduce parameter , such that 1
2
c
2
/q
2
, and split the
integration range into two parts
D(q, q
) = D
1
(q, q
) + D
2
(q, q
)
D
1
(q, q
) =
i
2
2
1
_
0
. . . d
D
2
(q, q
) =
i
2
2
1
_
1
. . . d
784 APPENDIX M. LOOP INTEGRALS IN QED
In the rst integral we ignore the -term
D
1
(q, q
i
2
2q
3
1
_
0
d
[( 1)
2
k
2
2
/(4q
2
) i]( 1)
=
i
2
2q
3
2q
2
k
2
ln
_
k
2
2
/(4q
2
) +
2
2 + 1
(1 )
2
_
1
0
i
2
qk
2
ln
_
k
2
4q
2
2
_
In the second integral we change the integration variable y = x 1
D
2
(q, q
i
2
2
0
_
dy(y + 1)
(q
2
y
2
k
2
/4)
_
q
2
y
2
2
c
2
2i
2
k
2
_
0
dy
_
q
2
y
2
2
c
2
=
2i
2
qk
2
ln(q
_
q
2
y
2
2
c
2
+ q
2
y)
0
=
2i
2
qk
2
ln
_
q
_
q
2
2
c
2
+ q
2
iqc
_
i
2
qk
2
ln
_
4q
2
2
c
2
_
Putting both parts of the integral together we nally obtain
D(q, q
)
i
2
qk
2
ln
_
k
2
4q
2
2
4q
2
2
c
2
_
=
i
2
qk
2
ln
_
k
2
2
c
2
_
(M.53)
which is equation (121.16) in [BLP01].
Appendix N
Relativistic invariance of RQD
N.1 Relativistic invariance of simple QFT
Here we would like to verify that interacting theory presented in subsection
9.1.1 is, indeed, relativistically invariant [Wei95, Wei64b]. In other words,
we are going to prove the validity of Poincare commutators (6.22) - (6.26)
for the interacting energy and boost operators
V =
_
dxV (x, 0)
Z =
1
c
2
_
dxxV (x, 0) (N.1)
satisfying conditions from subsection 9.1.1.
Equation (6.22) follows from property (9.7) in the case of space transla-
tions and rotations. The potential boost Z in (N.1) is a 3-vector by construc-
tion, so equation (6.24) is valid as well. Let us now prove the commutator
(6.23)
[P
0i
, Z
j
] =
i
c
2
V
ij
Consider the case i = j = z. Then, using equation (9.7) with = 1, we
obtain
785
786 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
[P
0z
, Z
z
] =
i
c
2
lim
a0
d
da
_
dxe
i
P
0z
a
zV (x, 0)e
P
0z
a
=
i
c
2
lim
a0
d
da
_
dxzV (x, y, z + a, 0)
=
i
c
2
lim
a0
d
da
_
dx(z a)V (x, y, z, 0)
=
i
c
2
_
dxV (x, y, z, 0) =
i
c
2
V (N.2)
which is exactly equation (6.23).
The proof of equation (6.26) is more challenging. Let us consider the case
i = z and attempt to prove
1
[K
0z
, V (t)] + [Z
z
(t), H
0
] [V (t), Z
z
(t)] = 0 (N.3)
For the rst term on the left hand side we use (9.7) and
lim
0
d
d
V ( x) = lim
0
d
d
V (x, y, z cosh ct sinh , t cosh
z
c
sinh )
= lim
0
V
z
(z sinh ct cosh ) +
V
t
(t sinh
z
c
cosh )
= ct
V
z
z
c
V
t
where is matrix (I.11). Then
[K
0z
, V (t)] =
i
c
lim
0
d
d
e
ic
K
0z
_
dxV ( x)e
ic
K
0z
=
i
c
lim
0
d
d
_
dxV ( x)
=
i
c
_
dx
_
ct
V (x, t)
z
z
c
V (x, t)
t
_
(N.4)
1
In this calculation it is convenient to write condition (6.26) in a t-dependent form, i.e.,
multiply this equation by exp(
i
H
0
t) from the left and exp(
i
H
0
t) from the right, as in
(7.10). At the end of calculations we can set t = 0.
N.2. RELATIVISTIC INVARIANCE OF QED 787
For the third term we obtain
[Z
z
(t), H
0
] = i
t
Z
z
(t) =
i
c
t
_
dxzV (x, t) (N.5)
The last term in (N.3) vanishes due to (9.8). Now we can set t = 0 and see
that (N.4) and (N.5) cancel each other, which proves (N.3).
Derivation of the last remaining nontrivial commutation relation
[K
0i
, Z
j
] + [Z
i
, K
0j
] + [Z
i
, Z
j
] = 0
is left as an exercise for the reader.
N.2 Relativistic invariance of QED
In this Appendix we are going to prove the relativistic invariance of the eld-
theoretical formulation of QED presented in subsection 9.1.2. In other words,
we are going to prove the validity of Poincare commutators (6.22) - (6.26).
2
The proof presented here is taken from Weinbergs works [Wei95, Wei64b]
and, especially, Appendix B in [Wei65].
The interaction operator V (t) in (9.12) clearly commutes with operators
of the total momentum and total angular momentum, so equation (6.22) is
easily veried. The potential boost Z in (9.17) is a 3-vector by construction,
so equation (6.24) is valid as well. Let us now prove the commutator (6.23)
[P
0i
, Z
j
(t)] =
i
c
2
V (t)
ij
Consider the case i = j = x and denote
V (x, t)
c
j(x, t)A(x, t) +
1
2c
2
_
dyj
0
(x, t)((x y)j
0
(y, t)
((x)
1
4x
2
We write conditions (6.22) - (6.26) in a t-dependent form. See footnote on page 786.
788 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
so that
V (t) =
_
dxV (x, t)
Z(t) =
1
c
2
_
dxxV (x, t) +
c
5/2
_
dxj
0
(x, t)C(x, t) (N.6)
where
C( x)
i
2
c
_
2(2)
3
_
dp
p
3/2
_
e
p x
e(p, )c
p,
e
i
p x
e
(p, )c
p,
_
(N.7)
Then, using equations (L.4) and (8.38) - (8.39) we obtain
[P
0x
, Z
x
(t)]
= i lim
a0
d
da
e
i
P
0x
a
Z
x
(t)e
P
0x
a
=
i
c
2
lim
a0
d
da
_
dxe
i
P
0x
a
_
xV (x, t) +
c
j
0
(x, t)C
x
(x, t)
_
e
P
0x
a
=
i
c
2
lim
a0
d
da
_
dx
_
xV (x + a, y, z, t) +
c
j
0
(x + a, y, z, t)C
x
(x + a, y, z, t)
_
=
i
c
2
lim
a0
d
da
_
dx
_
(x a)V (x, y, z, t) +
c
j
0
(x, y, z, t)C
x
(x, y, z, t)
_
=
i
c
2
_
dxV (x, y, z, t) =
i
c
2
V (t) (N.8)
which is exactly equation (6.23).
The proof of equation (6.26) is more challenging. Let us consider the case
i = z and attempt to prove
[K
0z
, V
1
(t)] + [K
0z
, V
2
(t)] i
d
dt
Z
z
(t) [V (t), Z
z
(t)] = 0 (N.9)
where we took into account that [Z
z
(t), H
0
] = i
d
dt
Z
z
(t). We will calculate
all four terms on the left hand side of (N.9) separately. Consider the rst
term and use equations (L.3), (K.23), (K.26), (I.4)
N.2. RELATIVISTIC INVARIANCE OF QED 789
[K
0z
, V
1
(t)]
=
i
c
lim
0
d
d
e
ic
K
0z
V
1
(t)e
ic
K
0z
=
i
2
c
3/2
lim
0
d
d
e
ic
K
0z
_
dx
j( x)
A( x)e
ic
K
0z
=
i
2
c
3/2
lim
0
d
d
_
dx(
1
j( x)
1
A( x) +
1
j( x) ( x, ))
=
i
2
c
3/2
lim
0
d
d
_
dx(
j( x)
A( x) +
1
j( x) ( x, ))
=
i
2
c
3/2
lim
0
_
dx
_
d
d
j( x)
A( x) +
j( x)
d
d
A( x) +
_
d
d
1
_
j( x) ( x, 1)
+
d
d
j( x) ( x, 1) +
j( x)
d
d
( x, )
_
=
i
2
c
3/2
lim
0
_
dx
_
d
d
j( x)
A( x) +
j( x)
d
d
A( x) +
j( x)
d
d
( x, )
_
(N.10)
where ( x, ) is given by equation (K.24) and is matrix (I.11). Here we
use the following results
lim
0
d
d
j( x) = lim
0
d
d
j
z
(z sinh ct cosh ) +
j
t
(t sinh
z
c
cosh )
= ct
j
z
z
c
j
t
(N.11)
lim
0
d
d
A( x) = ct
A
z
z
c
A
t
(N.12)
Calculation of the d/d term is more involved
lim
0
d
d
( x, )
=
c
(2)
3/2
lim
0
3
=0
d
d
_
dp
2p
1
=1
(
1
p)
[
1
p[
790 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
3
=0
1
0
_
e
1
p x
e
(p, )c
p,
+ e
i
1
p x
e
(p, )c
p,
_
(N.13)
The quantities dependent on are lambda matrices. Therefore, taking the
derivative on the right hand side of equation (N.13) we will obtain four
terms, those containing
d
d
,
d
d
1
0
,
d
d
[
1
p[
1
and
d
d
exp(i
1
p x).
After taking the derivative we must set 0. It follows from equation
(K.25) that the only non-zero term is that containing
lim
0
d
d
1
0
= lim
0
d
d
(cosh , 0, 0, sinh)
= lim
0
(sinh , 0, 0, cosh)
= (0, 0, 0, 1)
Thus
lim
0
d
d
( x, )
=
c
(2)
3/2
_
dpp
2p
3/2
1
=1
_
e
p x
e
z
(p, )c
p,
+ e
i
p x
e
z
(p, )c
p,
_
=
i
2
c
_
2(2)
3
_
dp
p
3/2
1
=1
_
e
p x
e
z
(p, )c
p,
e
i
p x
e
z
(p, )c
p,
_
=
C
z
( x) (N.14)
where
(
1
c
t
,
x
,
y
,
z
). So, using (N.14) and the continuity equation
(L.5), we obtain that the last term on the right hand side of equation (N.10)
is
3
3
due to the property (9.2) all functions f and g of quantum elds vanish at innity,
therefore we can take integrals by parts ( (t, x, y, z))
dx
_
d
dx
f()
_
g() =
dx
d
dx
(f()g())
dxf()
d
dx
g()
N.2. RELATIVISTIC INVARIANCE OF QED 791
i
2
c
3/2
lim
0
_
dx
j( x)
d
d
( x, )
=
i
2
c
3/2
_
dxj
(x, t)g
C
z
(x, t)
=
i
2
c
5/2
_
dxj
0
(x, t)
C
z
(x, t)
t
+
i
2
c
3/2
_
dxj(x, t)
C
z
(x, t)
x
=
i
2
c
5/2
_
dxj
0
(x, t)
C
z
(x, t)
t
i
2
c
3/2
_
dx
j(x, t)
x
C
z
(x, t)
=
i
2
c
5/2
_
dxj
0
(x, t)
C
z
(x, t)
t
+
i
2
c
5/2
_
dx
j
0
(x, t)
t
C
z
(x, t)
=
i
2
c
5/2
t
_
dxj
0
(x, t)C
z
(x, t) (N.15)
Substituting results (N.11), (N.12), (N.14) and (N.15) in equation (N.10) and
setting t = 0 we obtain
[K
0z
, V
1
(t)]
=
i
2
c
3/2
_
dx
_
z
c
j
t
A( x)
j( x)
z
c
A
t
1
c
2
t
(j
0
( x)C
z
( x))
_
=
i
2
c
5/2
t
_
dx
_
z(
j( x)
A( x)) j
0
( x)C
z
( x)
_
(N.16)
For the second term on the left hand side of (N.9) we use equation (L.3)
[K
0z
, V
2
(t)]
=
1
2c
2
_
dxdx
[K
0z
, j
0
( x)]((x x
)j
0
( x
) +
1
2c
2
_
dxdx
j
0
( x)((x x
)[K
0z
, j
0
( x
)]
= f(x = )g(x = ) f(x = )g(x = )
dxf()
d
dx
g()
=
dxf()
d
dx
g()
792 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
=
1
c
2
_
dxdx
[K
0z
, j
0
( x)]((x x
)j
0
( x
)
=
i
c
2
_
dxdx
_
z
c
2
j
0
( x)
t
1
c
j
z
( x)
_
((x x
)j
0
( x
)
=
i
2c
4
_
dxdx
j
0
( x)
t
(z z
)((x x
)j
0
( x
) +
i
2c
4
_
dxdx
j
0
( x)
t
z((x x
)j
0
( x
)
+
i
2c
4
_
dxdx
j
0
( x)
t
z
((x x
)j
0
( x
)
i
c
3
_
dxdx
j
z
( x)((x x
)j
0
( x
)
=
i
2c
3
_
dxdx
j( x)
x
(z z
)((x x
)j
0
( x
) +
i
2c
4
_
dxdx
j
0
( x)
t
z((x x
)j
0
( x
)
+
i
2c
4
_
dxdx
j
0
( x)z((x x
)
j
0
( x
)
t
i
c
3
_
dxdx
j
z
( x)((x x
)j
0
( x
)
=
i
2c
3
_
dxdx
j( x)
((z z
)((x x
))
x
j
0
( x
) +
i
2c
4
t
_
dxdx
j
0
( x)z((x x
)j
0
( x
i
c
3
_
dxdx
j
z
( x)((x x
)j
0
( x
) (N.17)
Using expression (N.6) for Z(t) we obtain for the third term on the left hand
side of equation (N.9)
i
t
Z
z
(t) =
i
2
c
5/2
t
_
dxzj(x, t)A(x, t)
i
2
c
5/2
t
_
dxj
0
(x, t)C
z
(x, t)
i
2c
4
t
_
dxdyj
0
(x, t)z((x y)j
0
(y, t) (N.18)
In order to calculate the last term in (N.9), we notice that the only term in
Z(t) which does not commute with V (t) is that containing C, therefore
[V (t), Z
z
(t)] =
2
c
3
_
dxdx
j( x)j
0
( x
)[A( x), C
z
( x
)] (N.19)
To calculate the commutator, we set t = 0 and use equation (B.12)
[A
i
(x, 0), C
z
(x
, 0)]
= i(2)
3
_
dpdq
2
_
q
3
p
px
+ e
i
(p, )c
p,
e
px
_
,
_
e
z
(q, )c
q,
e
i
qx
e
z
(q, )c
q,
e
qx
__
= i(2)
3
_
dpdq
2
_
q
3
p
e
i
(p, )e
z
(q, )
_
(p q)
,
e
i
px
i
qx
(p q)
,
e
px+
i
qx
_
= i(2)
3
_
dp
2p
2
e
i
(p, )e
z
(p, )
,
_
e
i
p(xx
)
+ e
p(xx
)
_
= i(2)
3
_
dp
2p
2
_
iz
p
i
p
z
p
2
_
_
e
i
p(xx
)
+ e
p(xx
)
_
= i(2)
3
_
dp
p
2
_
iz
p
i
p
z
p
2
_
e
i
p(xx
)
=
i
iz
((x x
) +
i(i)
2
(2)
3
x
i
z
_
dp
p
4
e
i
p(xx
)
=
i
iz
((x x
) + i
3
x
i
z
[x x
[
8
4
=
i
iz
((x x
) +
i
2
x
i
((z z
)((x x
)) (N.20)
Then
[V (t), Z
z
(t)]
=
i
c
3
3
i=1
_
dxdx
j
i
( x)
_
iz
((x x
)j
0
( x
)
1
2
x
i
[(z z
)((x x
)]j
0
( x
)
_
(N.21)
Now we can set t = 0, add four terms (N.16), (N.17), (N.18) and (N.21)
together and see that the rst two terms in (N.18) cancel with the two terms
on the right hand side of (N.16); the third term in (N.18) cancels the sec-
ond term on the right hand side of (N.17); and (N.21) exactly cancels the
remaining rst and third terms on the right hand side of (N.17). This proves
equation (N.9).
The proof of the last remaining commutation relation
[K
0i
, Z
j
] + [Z
i
, K
0j
] + [Z
i
, Z
j
] = 0 (N.22)
is left as an exercise for the reader.
794 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
N.3 Relativistic invariance of classical elec-
trodynamics
In this Appendix we will prove the relativistic invariance of the classical limit
of RQD constructed in subsections 12.1.2 and 14.1.1.
From our derivation in chapter 12 it follows that the Darwin-Breit Hamil-
tonian (12.10) is a part of a relativistically invariant theory in the instant form
of dynamics. This means that there exists an interacting boost operator K,
which satises all commutation relations of the Poincare Lie algebra together
with the Darwin-Breit Hamiltonian H. In principle, it should be possible to
nd the explicit form of the operator K by applying the unitary dressing
transformation
4
to the boost operator (9.16) - (9.17) of QED. However, here
we will choose a dierent route. Together with [CV68, CO70, KF74] we will
simply postulate the form of K and verify that Poincare commutators are,
indeed, satised in the (v/c)
2
approximation.
Let us rst write the non-interacting generators of the Poincare group for
a two-particle system as sums of one-particle generators
5
P
0
= p
1
+p
2
(N.23)
J
0
= [r
1
p
1
] +s
1
+ [r
2
p
2
] +s
2
(N.24)
H
0
= h
1
+ h
2
m
1
c
2
+ m
2
c
2
+
p
2
1
2m
1
+
p
2
2
2m
2
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
(N.25)
K
0
=
h
1
r
1
c
2
[p
1
s
1
]
m
1
c
2
+ h
1
h
2
r
2
c
2
[p
2
s
2
]
m
2
c
2
+ h
2
m
1
r
1
m
2
r
2
p
2
1
r
1
2m
1
c
2
p
2
2
r
2
2m
2
c
2
+
1
2c
2
_
[s
1
p
1
]
m
1
+
[s
2
p
2
]
m
2
_
(N.26)
The full interacting generators are
H = H
0
+ V
K = K
0
+Z (N.27)
4
see subsection 11.2.8
5
see equations (15.17) - (15.20)
N.3. RELATIVISTIC INVARIANCE OF CLASSICAL ELECTRODYNAMICS795
The potential energy V is given by (14.2) and the potential boost is postu-
lated as [CV68, CO70, KF74]
Z
q
1
q
2
(r
1
+r
2
)
8c
2
r
(N.28)
The non-trivial Poisson brackets of the Poincare Lie algebra (3.52) - (3.58)
that need to be veried are those involving interacting generators H and K
[J
0i
, K
j
]
P
=
k=x,y,z
ijk
K
k
(N.29)
[J
0
, H]
P
= [P
0
, H]
P
= 0 (N.30)
[K
i
, K
j
]
P
=
1
c
2
k=x,y,z
ijk
J
0k
(N.31)
[K
i
, P
0j
]
P
=
1
c
2
H
ij
(N.32)
[K, H]
P
= P
0
(N.33)
where i, j, k = (x, y, z).
The proof of (N.29) - (N.30) follows easily from the Poisson brackets of
particle observables (15.12) - (15.16) and formula (6.96) for brackets involving
complex expressions. This proof is left as an exercise for the reader. For the
less trivial brackets (N.31) - (N.33), it will be convenient to write H and K
as series in powers of (v/c)
2
(the superscript in parentheses is the power of
(v/c)
2
)
H H
(1)
+ H
(0)
+ H
(1)
orb
+ H
(1)
spinorb
+ H
(1)
spinspin
K K
(0)
+K
(1)
orb
+K
(1)
spinorb
where
H
(1)
= m
1
c
2
+ m
2
c
2
H
(0)
=
p
2
1
2m
1
+
p
2
2
2m
2
+
q
1
q
2
4r
796 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
H
(1)
orb
=
p
4
1
8m
3
1
c
2
p
4
2
8m
3
2
c
2
q
1
q
2
8m
1
m
2
c
2
r
_
(p
1
p
2
) +
(p
1
r)(p
2
r)
r
2
_
H
(1)
spinorb
=
q
1
q
2
[r p
1
] s
1
8m
2
1
c
2
r
3
+
q
1
q
2
[r p
2
] s
2
8m
2
2
c
2
r
3
+
q
1
q
2
[r p
2
] s
1
4m
1
m
2
c
2
r
3
q
1
q
2
[r p
1
] s
2
4m
1
m
2
c
2
r
3
H
(1)
spinspin
=
(s
1
s
2
)
4m
1
m
2
c
2
r
3
3(s
1
r)(s
2
r)
4m
1
m
2
c
2
r
5
K
(0)
= m
1
r
1
m
2
r
2
K
(1)
orb
=
p
2
1
r
1
2m
1
c
2
p
2
2
r
2
2m
2
c
2
q
1
q
2
(r
1
+r
2
)
8c
2
r
K
(1)
spinorb
=
1
2c
2
_
[s
1
p
1
]
m
1
+
[s
2
p
2
]
m
2
_
Then we nd that the following relationships need to be proven
1
c
2
H
(1)
ij
= [K
(0)
i
, P
0j
]
P
(N.34)
0 = [K
(0)
i
, H
(1)
]
P
(N.35)
P
0i
= [K
(0)
i
, H
(0)
]
P
(N.36)
0 = [K
(0)
i
, K
(0)
j
]
P
(N.37)
1
c
2
H
(0)
ij
= [K
(1)
iorb
, P
0j
]
P
+ [K
(1)
ispinorb
, P
0j
]
P
(N.38)
0 = [K
(1)
iorb
, H
(0)
]
P
+ [K
(1)
ispinorb
, H
(0)
]
P
+ [K
(0)
i
, H
(1)
orb
]
P
+[K
(0)
i
, H
(1)
spinorb
]
P
+ [K
(0)
i
, H
(1)
spinspin
]
P
(N.39)
1
c
2
3
k=1
ijk
J
0k
= [K
(1)
iorb
, K
(0)
j
]
P
+ [K
(1)
ispinorb
, K
(0)
j
]
P
+ [K
(0)
i
, K
(1)
jorb
]
P
+[K
(0)
i
, K
(1)
jspinorb
]
P
(N.40)
Again, we skip the easy-to-prove (N.34), (N.35), (N.36) and (N.37). For
equation (N.38) we obtain
[K
(1)
xorb
+ K
(1)
xspinorb
, P
0x
]
P
N.3. RELATIVISTIC INVARIANCE OF CLASSICAL ELECTRODYNAMICS797
=
_
p
2
1
r
1x
2m
1
c
2
, p
1x
_
P
_
p
2
2
r
2x
2m
2
c
2
, p
2x
_
P
_
q
1
q
2
8c
2
r
1x
+ r
2x
r
, p
1x
_
P
_
q
1
q
2
8c
2
r
1x
+ r
2x
r
, p
2x
_
P
=
p
2
1
2m
1
c
2
p
2
2
2m
2
c
2
q
1
q
2
4c
2
r
=
1
c
2
H
(0)
Individual terms on the right hand side of (N.39) are
[K
(1)
xorb
, H
(0)
]
P
=
p
2
1
4m
2
1
c
2
[r
1x
, p
2
1
]
P
q
1
q
2
r
1x
8m
1
c
2
_
p
2
1
,
1
r
_
P
p
2
2
4m
2
2
c
2
[r
2x
, p
2
2
]
P
q
1
q
2
r
2x
8m
2
c
2
_
p
2
2
,
1
r
_
P
q
1
q
2
16m
1
c
2
r
[r
1x
, p
2
1
]
P
q
1
q
2
r
1x
16m
1
c
2
_
1
r
, p
2
1
_
P
q
1
q
2
r
2x
16m
1
c
2
_
1
r
, p
2
1
_
P
q
1
q
2
r
1x
16m
2
c
2
_
1
r
, p
2
2
_
P
q
1
q
2
16m
2
c
2
r
[r
2x
, p
2
2
]
P
q
1
q
2
r
2x
16m
2
c
2
_
1
r
, p
2
2
_
P
=
p
2
1
p
1x
2m
2
1
c
2
q
1
q
2
(r
1x
r
2x
)
8m
1
c
2
(p
1
r)
r
3
p
2
2
p
2x
2m
2
2
c
2
q
1
q
2
(r
1x
r
2x
)
8m
2
c
2
(p
2
r)
r
3
q
1
q
2
p
1x
8m
1
c
2
r
q
1
q
2
p
2x
8m
2
c
2
r
(N.41)
[K
(1)
xspinorb
, H
(0)
]
P
=
1
2c
2
_
1
m
1
[s
1
p
1
]
x
+
1
m
2
[s
2
p
2
]
x
,
q
1
q
2
4r
_
P
=
q
1
q
2
[s
1
r]
x
8m
1
c
2
r
3
q
1
q
2
[s
2
r]
x
8m
2
c
2
r
3
(N.42)
[K
(0)
x
, H
(1)
orb
]
P
=
1
8m
2
1
c
2
[r
1x
, p
4
1
]
P
+
q
1
q
2
8m
2
c
2
r
_
r
1x
,
_
(p
1
p
2
) +
(p
1
r)(p
2
r)
r
2
__
P
+
1
8m
2
2
c
2
[r
2x
, p
4
2
]
P
+
q
1
q
2
8m
1
c
2
r
_
r
2x
,
_
(p
1
p
2
) +
(p
1
r)(p
2
r)
r
2
__
P
=
p
2
1
p
1x
2m
2
1
c
2
+
q
1
q
2
8m
2
c
2
r
_
p
2x
+
(r
1x
r
2x
)(p
2
r)
r
2
_
+
p
2
2
p
2x
2m
2
2
c
2
+
q
1
q
2
8m
1
c
2
r
_
p
1x
+
(p
1
r)(r
1x
r
2x
)
r
2
_
(N.43)
798 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
[K
(0)
x
, H
(1)
spinorb
]
P
=
_
m
1
r
1x
m
2
r
2x
,
q
1
q
2
[r p
1
] s
1
8m
2
1
c
2
r
3
+
q
1
q
2
[r p
2
] s
2
8m
2
2
c
2
r
3
+
q
1
q
2
[r p
2
] s
1
4m
1
m
2
c
2
r
3
q
1
q
2
[r p
1
] s
2
4m
1
m
2
c
2
r
3
_
P
=
q
1
q
2
[s
2
r]
x
8m
2
c
2
r
3
+
q
1
q
2
[s
1
r]
x
8m
1
c
2
r
3
+
q
1
q
2
[s
2
r]
x
4m
2
c
2
r
3
q
1
q
2
[s
1
r]
x
4m
1
c
2
r
3
=
q
1
q
2
[s
1
r]
x
8m
1
cr
3
+
q
1
q
2
[s
2
r]
x
8m
2
cr
3
(N.44)
[K
(0)
x
, H
(1)
spinspin
]
P
= 0 (N.45)
Summing up the right hand sides of equations (N.41) - (N.45) we see that
equation (N.39) is, indeed, satised. For equation (N.40) we obtain
[K
(1)
xorb
, K
(0)
y
]
P
+ [K
(0)
x
, K
(1)
yorb
]
P
+ [K
(1)
xspinorb
, K
(0)
y
]
P
+ [K
(0)
x
, K
(1)
yspinorb
]
P
=
r
1x
2c
2
[p
2
1
, r
1y
]
P
+
r
2x
2c
2
[p
2
2
, r
2y
]
P
+
r
1y
2c
2
[r
1x
, p
2
1
]
P
+
r
2y
2c
2
[r
2x
, p
2
2
]
P
1
2c
2
_
1
m
1
s
1z
p
1y
1
m
2
s
2z
p
2y
, m
1
r
1y
+ m
2
r
2y
_
P
1
2c
2
_
m
1
r
1x
+ m
2
r
2x
,
1
m
1
s
1z
p
1x
+
1
m
2
s
2z
p
2x
_
P
=
1
c
2
[r
1
p
1
]
z
1
c
2
[r
2
p
2
]
z
1
c
2
(s
1z
+ s
2z
) =
1
c
2
J
0z
(N.46)
N.4 Relativistic invariance of gravity theory
Let us prove that the theory of gravity developed in section 16.1 is relativisti-
cally invariant. To do that, we postulate the following form of the interacting
boost generator
K = k
1
+k
2
+
Gh
1
h
2
(r
1
+r
2
)
2c
6
r
+ . . .
=
h
1
r
1
c
2
h
2
r
2
c
2
+
Gh
1
h
2
(r
1
+r
2
)
2c
6
r
+ . . . (N.47)
Then we would like to demonstrate that Poincare Poisson brackets (N.29)
- (N.33) are satised by operators (16.1), (N.47) and usual non-interacting
N.4. RELATIVISTIC INVARIANCE OF GRAVITY THEORY 799
P
0
, J
0
. The calculations are similar to those performed for classical elec-
trodynamics in Appendix N.3. We will use standard Poisson brackets of
one-particle observables (15.12) - (15.16) and general formula (6.96) for the
Poisson bracket of two complex expressions. The theory developed in chapter
16 is valid only up to the order (v/c)
2
. Therefore, we will omit all higher
orders of (v/c) in our calculations.
6
For the Poisson bracket (N.32) we get
[K
x
, P
0x
]
P
r
1x
h
1
c
2
r
2x
h
2
c
2
+
Gh
1
h
2
(r
1x
+ r
2x
)
2c
6
r
, p
1x
+ p
2x
_
P
=
1
c
2
[p
1x
, r
1x
h
1
]
P
+
1
c
2
[p
2x
, r
2x
h
2
]
P
Gh
1
h
2
2c
6
__
p
1x
,
(r
1x
+ r
2x
)
r
_
P
+
_
p
2x
,
(r
1x
+ r
2x
)
r
_
P
_
=
1
c
2
h
1
1
c
2
h
2
Gh
1
h
2
2c
6
_
2
r
+
(r
1x
+ r
2x
)r
x
r
3
(r
1x
+ r
2x
)r
x
r
3
_
=
1
c
2
_
h
1
+ h
2
Gh
1
h
2
c
4
r
_
The right hand side diers from the desired expression c
2
H only by terms
of the order (v/c)
4
or smaller, which are beyond the accuracy of our approx-
imation.
For the left hand side of (N.33) we obtain
[K
x
, H]
P
_
h
1
r
1x
c
2
h
2
r
2x
c
2
+
Gh
1
h
2
(r
1x
+ r
2x
)
2c
6
r
,
h
1
+ h
2
Gh
1
h
2
c
4
r
Gh
2
p
2
1
h
1
c
2
r
Gh
1
p
2
2
h
2
c
2
r
+
7G(p
1
p
2
)
2c
2
r
+
G(p
1
r)(p
2
r)
2c
2
r
3
+
G
2
m
1
m
2
(m
1
+ m
2
)
2c
2
r
2
_
P
We rst evaluate the following individual contributions
6
To calculate the orders of terms in (16.1) - (N.47) one should take into account that
h (v/c)
2
and p, r, k (v/c)
0
800 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
_
h
1
r
1x
c
2
, h
1
_
P
=
h
1
c
2
p
1x
c
2
h
1
= p
1x
(N.48)
_
h
2
r
2x
c
2
, h
2
_
P
= p
2x
(N.49)
_
h
1
r
1x
c
2
,
Gh
1
h
2
c
4
r
_
P
=
Gh
2
c
6
_
h
1
p
1x
c
2
h
1
r
+
p
1x
c
2
r
1x
h
1
h
1
r
x
r
3
+
p
1y
c
2
r
1x
h
1
h
1
r
y
r
3
+
p
1z
c
2
r
1x
h
1
h
1
r
z
r
3
_
=
Gh
2
c
6
_
p
1x
c
2
r
+
(p
1
r)r
1x
c
2
r
3
_
=
Gh
2
c
4
_
p
1x
r
+
(p
1
r)r
1x
r
3
_
(N.50)
_
h
2
r
2x
c
2
,
Gh
1
h
2
c
4
r
_
P
=
Gh
1
c
4
_
p
2x
r
(p
2
r)r
2x
r
3
_
(N.51)
_
h
1
r
1x
c
2
,
Gh
2
p
2
1
h
1
c
2
r
_
P
=
Gh
2
c
4
_
h
1
2p
1x
h
1
r
h
1
p
2
1
p
1x
c
2
h
3
1
r
+
p
1x
c
2
r
1x
h
1
p
2
1
r
x
h
1
r
3
+
p
1y
c
2
r
1x
h
1
p
2
1
r
y
h
1
r
3
+
p
1z
c
2
r
1x
h
1
p
2
1
r
z
h
1
r
3
_
=
Gh
2
c
4
_
2p
1x
r
p
2
1
p
1x
c
2
h
2
1
r
+
(p
1
r)c
2
r
1x
p
2
1
h
2
1
r
3
_
Gh
2
c
4
2p
1x
r
(N.52)
_
h
2
r
2x
c
2
,
Gh
1
p
2
2
h
2
c
2
r
_
P
=
Gh
1
c
4
_
2p
2x
r
p
2
2
p
2x
c
2
h
2
2
r
(p
2
r)c
2
r
2x
p
2
2
h
2
2
r
3
_
Gh
1
c
4
2p
2x
r
(N.53)
_
h
1
r
1x
c
2
,
Gh
1
p
2
2
h
2
c
2
r
_
P
=
Gp
2
2
h
2
c
4
_
h
1
p
1x
c
2
h
1
r
+
p
1x
c
2
r
1x
h
1
h
1
r
x
r
3
+
p
1y
c
2
r
1x
h
1
h
1
r
y
r
3
+
p
1z
c
2
r
1x
h
1
h
1
r
z
r
3
_
=
Gp
2
2
h
2
c
2
_
p
1x
r
+
(p
1
r)r
1x
r
3
_
0 (N.54)
_
h
2
r
2x
c
2
,
Gh
2
p
2
1
h
1
c
2
r
_
P
0 (N.55)
N.4. RELATIVISTIC INVARIANCE OF GRAVITY THEORY 801
_
h
1
r
1x
c
2
,
7G(p
1
p
2
)
2c
2
r
_
P
=
7G
2c
4
_
h
1
p
2x
r
+
p
1x
c
2
r
1x
h
1
(p
1
p
2
)r
x
r
3
+
p
1y
c
2
r
1x
h
1
(p
1
p
2
)r
y
r
3
+
p
1z
c
2
r
1x
h
1
(p
1
p
2
)r
z
r
3
_
=
7G
2c
4
_
h
1
p
2x
r
+
(p
1
r)(p
1
p
2
)c
2
r
1x
h
1
r
3
_
7G
2c
4
h
1
p
2x
r
(N.56)
_
h
2
r
2x
c
2
,
7G(p
1
p
2
)
2c
2
r
_
P
7G
2c
4
h
2
p
1x
r
(N.57)
_
h
1
r
1x
c
2
,
G(p
1
r)(p
2
r)
2c
2
r
3
_
P
=
G
2c
4
_
h
1
r
x
(p
2
r)
r
3
r
1x
(3p
1
r)c
2
(p
1
r)(p
2
r)
h
1
r
5
+r
1x
(p
1
p
1
)c
2
(p
2
r)
h
1
r
3
+ r
1x
(p
1
p
2
)c
2
(p
1
r)
h
1
r
3
_
G
2c
4
h
1
r
x
(p
2
r)
r
3
(N.58)
_
h
2
r
2x
c
2
,
G(p
1
r)(p
2
r)
2c
2
r
3
_
P
G
2c
4
h
2
r
x
(p
1
r)
r
3
(N.59)
_
h
1
r
1x
c
2
,
G
2
m
1
m
2
(m
1
+ m
2
)
2c
2
r
2
_
P
h
2
r
2x
c
2
,
G
2
m
1
m
2
(m
1
+ m
2
)
2c
2
r
2
_
P
0 (N.60)
_
Gh
1
h
2
(r
1x
+ r
2x
)
2c
6
r
, h
1
_
P
=
Gh
1
h
2
2c
6
_
p
1x
c
2
h
1
r
(r
1x
+ r
2x
)(r p
1
)c
2
h
1
r
3
_
(N.61)
_
Gh
1
h
2
(r
1x
+ r
2x
)
2c
6
r
, h
2
_
P
=
Gh
1
h
2
2c
6
_
p
2x
c
2
h
2
r
+
(r
1x
+ r
2x
)(r p
2
)c
2
h
2
r
3
_
(N.62)
_
Gh
1
h
2
(r
1x
+ r
2x
)
2c
6
r
,
Gh
1
h
2
c
4
r
_
P
=
G
2
2c
10
_
h
1
h
2
p
1x
c
2
h
2
h
1
r
2
h
1
h
2
c
2
(r
1x
+ r
2x
)(r p
1
)h
2
r
4
+
h
1
h
2
p
2x
c
2
h
1
h
2
r
2
+
h
1
h
2
c
2
(r
1x
+ r
2x
)(r p
2
)h
1
r
4
c
2
h
2
(r
1x
+ r
2x
)(r p
1
)h
1
h
2
h
1
r
4
+
c
2
h
1
(r
1x
+ r
2x
)(r p
2
)h
1
h
2
h
2
r
4
_
0 (N.63)
Then the desired Poisson bracket is obtained by summing up all contributions
(N.48) - (N.63)
802 APPENDIX N. RELATIVISTIC INVARIANCE OF RQD
[K
x
, H]
P
p
1x
p
2x
+
Gh
2
p
1x
c
4
r
+
Gh
2
(p
1
r)r
1x
c
4
r
3
+
2Gh
2
p
1x
c
4
r
7Gh
1
p
2x
2c
4
r
Gh
1
(r
1x
r
2x
)(p
2
r)
2c
4
r
3
+
Gh
1
p
2x
c
4
r
Gh
1
(p
2
r)r
2x
c
4
r
3
+
2Gh
1
p
2x
c
4
r
7Gh
2
p
1x
2c
4
r
Gh
2
(r
1x
r
2x
)(p
1
r)
2c
4
r
3
+
Gh
2
p
1x
2c
4
r
Gh
2
(r
1x
+ r
2x
)(p
1
r)
2c
4
r
3
+
Gh
1
p
2x
2c
4
r
+
Gh
1
(r
1x
+ r
2x
)(p
2
r)
2c
4
r
3
= p
1x
p
2x
= P
0x
The proof of the Poisson bracket (N.31) is the same as in (N.46). Verication
of other Poisson brackets of the Poincare Lie algebra is left as an exercise for
the reader.
Appendix O
Dimensionality checks
In our formulas in this book we chose to show explicitly all fundamental con-
stants, like c and , rather than adopt the usual convention = c = 1. This
makes our expressions slightly lengthier, but has the benet of easier control
of dimensions and checking correctness at each calculation step. In this sub-
section we are going to suggest a few rules for such dimension estimates in
formulas involving quantum elds.
From the familiar formula
_
dp(p) = 1
it follows that the dimension of the delta function is
1
(p) =
1
p
3
p,
, a
p
,
= (p p
)
,
[c
p,
, c
] = (p p
)
,
1
Angle brackets A denote the dimension of an observable A, as it has been introduced
in subsection 2.3.1. For example, p = mv = E/v denotes the dimension of
momentum. Note that dimension of the 4D delta function (M.1) is E
1
p
3
.
803
804 APPENDIX O. DIMENSIONALITY CHECKS
suggests that dimensions of these operators are
a
p,
= a
p
,
= c
p,
= c
=
1
p
3/2
(O.1)
In the denition of the Diracs quantum eld (J.26)
(x, t) =
_
dp
(2)
3/2
mc
2
_
e
p x
u(p, )a
p,
+ e
i
p x
v(p, )b
p,
_
4-vectors p and x have dimensions of energy p = E and time x = t,
respectively. The dimension of the Plancks constant is = pr =
Et, which implies that arguments
i
3/2
p
3/2
=
p
3/2
3/2
=
1
r
3/2
r
3/2
p
1/2
=
p
1/2
c
1/2
r
1/2
(O.2)
current density operator (L.1)
j = ec =
ec
r
3
2
see (J.40) - (J.43)
3
In dierent texts one can nd various denitions of quantum elds, which can dier
from denitions adopted here by their numerical factors and dimensions. However, as we
stress in subsection 15.5.2, quantum elds do not correspond to any observable quantities.
They are just formal mathematical objects, whose role is to provide convenient building
blocks for interaction operators (9.13), (9.14) and (9.17). So, there is a signicant freedom
in choosing concrete forms of quantum elds. All these choices should lead to the same
forms of physically meaningful interaction operators V
1
, V
2
and Z.
805
and potential energy (9.13)
4
V
1
=
1
c
r
3
ec
r
3
p
1/2
c
1/2
r
1/2
=
ep
1/2
c
1/2
r
1/2
=
e
1/2
c
1/2
r
=
e
2
r
This is exactly the dimension of energy, as one can expect from the Coulomb
law V = e
2
/(4r). The 2nd order QED potential (9.14) also has the dimen-
sion of energy
V
2
=
1
c
2
r
3
r
3
r
ec
r
3
ec
r
3
=
e
2
r
By following the same rules it is easy to establish that all three terms in the
potential boost (9.17) have the dimension mr, as expected.
Let us illustrate the dimensionality checks on the example of the scatter-
ing amplitude (9.33). The S-operator is a dimensionless quantity and particle
creation-annihilation operators have the dimension p
3/2
. Therefore, the
dimension of the matrix element 0[a
q,
d
p,
S
2
d
[0 is expected to be
p
6
. Turning to the nal result (9.35) we may note that according to (9.29)
4
(p) =
1
Ep
3
Ep
3
E
=
c
3
E
3
p
3
=
1
p
6
d
4
p = Ep
3
4
This expression was simplied by using e
2
= c, which follows from the fact that
e
2
/(4c) 1/137 is the dimensionless ne structure constant.
806 APPENDIX O. DIMENSIONALITY CHECKS
Bibliography
[AB59] Y. Aharonov and D. Bohm. Signicance of electromagnetic
potentials in quantum mechanics. Phys. Rev., 115:485, 1959.
[ACG
+
71] D. S. Ayres, A. M. Cormack, A. J. Greenberg, R. W. Kenney,
D. O. Cladwell, V. B. Elings, W. P. Hesse, and R. J. Morrison.
Measurements of the lifetime of positive and negative pions.
Phys. Rev. D., 3:1051, 1971.
[AD78a] D. Aerts and I. Daubechies. About the structure-preserving
maps of a quantum mechanical propositional system. Helv.
Phys. Acta, 51:637, 1978.
[AD78b] D. Aerts and I. Daubechies. Physical justication for using the
tensor product to describe two quantum systems as one joint
system. Helv. Phys. Acta, 51:661, 1978.
[AFKW64] T. Alv ager, F. J. M. Farley, J. Kjellman, and I. Wallin. Test
of the second postulate of special relativity in the GeV region.
Phys. Lett., 12:260, 1964.
[AHR04] J. M. Aguirregabiria, A. Hernandez, and M. Rivas. Linear mo-
mentum density in quasistatic electromagnetic systems, 2004.
http://arxiv.org/abs/physics/0404139.
[AP65] A. B. Arons and M. B. Peppard. Einsteins proposal of the
photon concept - a translation of the Annalen der Physik paper
of 1905. Am. J. Phys., 33:367, 1965.
[APV88] Y. Aharonov, P. Pearle, and L. Vaidman. Comment on
Proposed Aharonov-Casher eect: Another example of an
807
808 BIBLIOGRAPHY
Aharonov-Bohm eect arising from a classical lag. Phys. Rev.
A, 37:4052, 1988.
[Are72] I. Ya. Arefeva. Renormalized scattering theory for the Lee
model. Theor. Math. Phys., 12:859, 1972.
[AW75] S. M. W. Ahmad and E. P. Wigner. Invariant theoretic deriva-
tion of the connection between momentum and velocity. Nuovo
Cimento A, 28:1, 1975.
[AZ91] G. Alber and P. Zoller. Laser excitation of electronic wave
packets in Rydberg atoms. Physics Reports, 5:231, 1991.
[Bac89] H. Bacry. The notions of localizability and space: from Eugene
Wigner to Alain Connes. Nucl. Phys. Proc. Suppl., 6:222, 1989.
[Bac04] H. Bacry. The foundations of the Poincare group and the va-
lidity of general relativity. Rep. Math. Phys., 53:443, 2004.
[Bak61] B. Bakamjian. Relativistic particle dynamics. Phys. Rev.,
121:1849, 1961.
[Bal98] L. E. Ballentine. Quantum Mechanics: A Modern Development.
World Scientic, Singapore, 1998.
[Bar12] S. J. Barnett. On electromagnetic induction and relative mo-
tion. Phys. Rev., 35:323, 1912.
[BBC
+
77] J. Bailey, K. Borer, F. Combley, H. Drumm, F. Kreinen,
F. Lange, E. Picasso, W. von Ruden, F. J. M. Farley, J. H.
Field, W. Flegel, and P. M. Hattersley. Measurements of rel-
ativistic time dilatation for positive and negative muons in a
circular orbit. Nature, 268:301, 1977.
[BCOR] S. Blanes, F. Casas, J.A. Oteo, and J. Ros. The
Magnus expansion and some of its applications.
http://arxiv.org/abs/0810.5488v1.
[BD64] J. D. Bjorken and S. D. Drell. Relativistic quantum mechanics.
McGraw-Hill, New York, 1964.
BIBLIOGRAPHY 809
[BD65] J. D. Bjorken and S. D. Drell. Relativistic quantum elds.
McGraw-Hill, New York, 1965.
[Ber65] R. A. Berg. Position and intrinsic spin operators in quantum
theory. J. Math. Phys., 6:34, 1965.
[BF62] B. Barsella and E. Fabri. Angular momenta in relativistic
many-body theory. Phys. Rev., 128:451, 1962.
[BL77] L. Briatore and S. Leschiutta. Evidence for the earth gravi-
tational shift by direct atomic-time-scale comparison. Nuovo
Cim. B, 37:219, 1977.
[BLP01] V. B. Berestetski, E. M. Livshitz, and L. P. Pitaevski. Quan-
tum electrodynamics. Fizmatlit, Moscow, 2001. (in Russian).
[Blu60] L. E. Blumenson. A derivation of n-dimensional spherical co-
ordinates. Am. Math. Monthly, 67:63, 1960.
[Boy05] T. H. Boyer. The paradoxical forces for the classical electro-
magnetic lag associated with the Aharonov-Bohm phase shift,
2005. http://arxiv.org/abs/physics/0506180v1.
[Boy06] T. H. Boyer. Darwin-Lagrangian analysis for the interaction
of a point charge and a magnet: Considerations related to
the controversy regarding the Aharonov-Bohm and Aharonov-
Casher phase shifts. J. Phys. A:Math. Gen., 39:3455, 2006.
http://arxiv.org/abs/physics/0506181v1.
[Boy07a] T. H. Boyer. Comment on experiments re-
lated to the Aharonov-Bohm phase shift, 2007.
http://arxiv.org/abs/0708.3194v1.
[Boy07b] T. H. Boyer. Unresolved classical electromagnetic
aspects of the Aharonov-Bohm phase shift, 2007.
http://arxiv.org/abs/0709.0661v1.
[Boy08] T. H. Boyer. Illustrating some implications of the
conservation laws in relativistic mechanics, 2008.
http://arxiv.org/abs/0812.1017v1.
810 BIBLIOGRAPHY
[Bre68] E. Breitenberger. Magnetic interactions between charged par-
ticles. Am. J. Phys., 36:505, 1968.
[Bro05] H. R. Brown. Physical relativity: Space-time structure from a
dynamical perspective. Oxford University Press, Oxford, 2005.
[BT53] B. Bakamjian and L. H. Thomas. Relativistic particle dynam-
ics. II. Phys. Rev., 92:1300, 1953.
[But69] J. W. Butler. A proposed electromagnetic momentum-energy
4-vector for charged bodies. Am. J. Phys., 37:1258, 1969.
[BvN36] G. Birkho and J. von Neumann. The logic of quantum me-
chanics. Ann. Math., 37:823, 1936.
[Can65] D. J. Candlin. Physical operators and the representations of
the inhomogeneous Lorentz group. Nuovo Cim., 37:1396, 1965.
[Car05] R. Carroll. Remarks on photons and the aether, 2005.
http://arxiv.org/abs/physics/0507027v1.
[CBB07] A. Caprez, B. Barwick, and H. Batelaan. A macro-
scopic test of the Aharonov-Bohm eect, 2007.
http://arxiv.org/abs/0708.2428v1.
[CCTS13] G. Cavalleri, E. Cesaroni, E. Tonni, and G. Spavieri. Interpre-
tation of the longitudinal forces detected in a recent experiment
of electrodynamics. Eur. Phys. J. D, 26:221, 20013.
[CdSF
+
12] A. Calcaterra, R. de Sangro, G. Finnochiaro, P. Patteri, M. Pic-
colo, and G. Pizzella. Measuring propagation speed of Coulomb
elds, 2012. http://arxiv.org/abs/1211.2913v1.
[Cha60] R. G. Chambers. Shift of an electron interference pattern by
enclosed magnetic ux. Phys. Rev. Lett., 5:3, 1960.
[Cha64] A. J. Chakrabarti. On the canonical relativistic kinematics of
N-particle systems. J. Math. Phys., 5:922, 1964.
[CJS63] D. G. Currie, T. F. Jordan, and E. C. G. Sudarshan. Relativis-
tic invariance and Hamiltonian theories of interacting particles.
Rev. Mod. Phys., 35:350, 1963.
BIBLIOGRAPHY 811
[CO70] F. E. Close and H. Osborn. Relativistic center-of-mass mo-
tion and the electromagnetic interaction of systems of charged
particles. Phys. Rev. D, 2:2127, 1970.
[Com96] E. Comay. Exposing hidden momentum. Am. J. Phys.,
64:1028, 1996.
[Com97] E. Comay. Decomposition of electromagnetic elds into radia-
tion and bound components. Am. J. Phys., 65:862, 1997.
[Com00] E. Comay. Lorentz transformation of a system carrying hidden
momentum. Am. J. Phys., 68:1007, 2000.
[CP82] F. Coester and W. N. Polyzou. Relativistic quantum mechanics
of particles with direct interactions. Phys. Rev. D, 26:1348,
1982.
[CR] P. Caban and J. Rembielinski. Photon polarization and
Wigners little group. http://arxiv.org/abs/quant-ph/0304120.
[CSR96] A. E. Chubykalo and R. Smirnov-Rueda. Action at a distance
as a full-value solution of Maxwell equations: basis and appli-
cation of separated potentials method. Phys. Rev. E, 53:5373,
1996.
[CSW60] T. E. Cranshaw, J. P. Schier, and A. B. Whitehead. Measure-
ment of the gravitational red shift using the Mossbauer eect
in Fe
57
. Phys. Rev. Lett., 4:163, 1960.
[Cul52] E. G. Cullwick. Electromagnetic momentum and Newtons
third law. Nature, 170:425, 1952.
[CV68] S. Coleman and J. H. Van Vleck. Origin of hidden momentum
forces on magnets. Phys. Rev., 171:1370, 1968.
[CZJW00] J. J. Carey, J. Zawadzka, D. A. Jaroszynski, and K. Wynne.
Noncausal time response in frustrated total internal reection?
Phys. Rev. Lett., 84:1431, 2000.
[CZJW01] J. J. Carey, J. Zawadzka, D. A. Jaroszynski, and K. Wynne.
Reply to Comment on.... Phys. Rev. Lett., 84:119102, 2001.
812 BIBLIOGRAPHY
[DB94] J. Yngvason D. Buchholz. There are no causality problems
with Fermis two atom system. Phys. Rev. Lett., 73:613, 1994.
http://arxiv.org/abs/hep-th/9403027.
[DD85] T. Damour and N. Deruelle. General relativistic celestial me-
chanics of binary systems. I. The post-Newtonian motion. Ann.
Inst. Henri Poincare A, 43:107, 1985.
[DG73] G. Dillon and M. M. Giannini. On the clothing transformation
in the Lee model. Nuovo Cim., 18A:31, 1973.
[DG75] G. Dillon and M. M. Giannini. On the potential description of
the V NNN sector of the Lee model. Nuovo Cim., 27A:106,
1975.
[Dir49] P. A. M. Dirac. Forms of relativistic dynamics. Rev. Mod.
Phys., 21:392, 1949.
[dlTa] A. C. de la Torre. Understanding light quanta: Construction
of the free electromagnetic eld. http://arxiv.org/abs/quant-
ph/0503023v2.
[dlTb] A. C. de la Torre. Understanding light quanta:
First quantization of the free electromagnetic eld.
http://arxiv.org/abs/quant-ph/0410171.
[dlTc] A. C. de la Torre. Understanding light quanta: The photon.
http://arxiv.org/abs/quant-ph/0410179.
[Dol76] J. D. Dollard. Interpretation of Katos invariance principle in
scattering theory. J. Math. Phys., 17:46, 1976.
[DV96] T. Damour and D. Vokrouhlick y. The equivalence principle
and the Moon. Phys. Rev. D, 53:4177, 1996.
[DW65] H. Van Dam and E. P. Wigner. Classical relativistic mechanics
of interacting point particles. Phys.Rev., B138:1576, 1965.
[Dys51] F. J. Dyson. The renormalization method in quantum electro-
dynamics. Proc. Roy. Soc., A207:395, 1951.
BIBLIOGRAPHY 813
[EIH38] A. Einstein, L. Infeld, and B. Homann. The gravitational
equations and the problem of motion. Ann. Math., 39:65, 1938.
[Ein05] A. Einstein. Zur Electrodynamik bewegter K orper. Annalen
der Physik, 17:891, 1905.
[Ein20] A. Einstein. Relativity: The Special and General Theory.
Methuen and Co, 1920.
[Ein49] A. Einstein. in Albert Einstein: Philosopher-Scientist. Open
Court, Peru, 1949.
[EKL76] W. F. Edwards, C. S. Kenyon, and D. K. Lemon. Continu-
ing investigation into possible electric elds arising from steady
conductor currents. Phys. Rev. D, 14:922, 1976.
[Eks60] H. Ekstein. Equivalent Hamiltonians in scattering theory. Phys.
Rev., 117:1590, 1960.
[EKU62] H. Ezawa, K. Kikkawa, and H. Umezawa. Potential represen-
tation in quantum eld theory. Nuovo Cim., 23:751, 1962.
[EL08] A. Einstein and J. Laub. Uber die electromagnetischen Grund-
gleichungen f ur bewegte K orper. Ann. Phys. (Leipzig), 26:532,
1908.
[EN93] A. Enders and G. Nimtz. Evanescent-mode propagation and
quantum tunneling. Phys. Rev. E, 48:632, 1993.
[Eng03] W. Engelhardt. Relativistic Doppler eect and the principle of
relativity. Apeiron, 10:29, 2003.
[Ess95] H. Essen. A study of lattice and magnetic interactions of con-
duction electrons. Phys. Scr., 52:388, 1995.
[Ess96] H. Essen. Darwin magnetic interaction energy and its macro-
scopic consequences. Phys. Rev. E, 53:5228, 1996.
[Ess99] H. Essen. Magnetism of matter and phase space energy of
charged particle systems. J. Phys. A: Math. Gen., 32:2297,
1999.
814 BIBLIOGRAPHY
[Ess07] H. Essen. Circulating electrons, superconductivity, and the
Darwin-Breit interaction, 2007. http://arxiv.org/abs/cond-
mat/0002096.
[Fad63] L. D. Faddeev. On the separation of self-interaction and scat-
tering eects in perturbation theory. Dokl. Akad. Nauk SSSR,
152:573, 1963.
[Far92] F. J. M. Farley. The CERN (g-2) measurements. Z. Phys. C,
56:S88, 1992.
[Fey49] R. P. Feynman. Space-time approach to quantum electrody-
namics. Phys. Rev., 76:769, 1949.
[Fey85] R. P. Feynman. Q.E.D. Princeton University Press, 1985.
[FGR78] L. Fonda, G. C. Ghirardi, and A. Rimini. Decay theory of
unstable quantum systems. Rep. Prog. Phys., 41:587, 1978.
[Fiea] J. H. Field. On the relationship of quantum mechanics to
classical electromagnetism and classical relativistic mechanics.
http://arxiv.org/abs/physics/0403076.
[Fieb] J. H. Field. Space-time transformation properties of
inter-charge forces and dipole radiation: Breakdown of
the classical eld concept in relativistic electrodynamics.
http://arxiv.org/abs/physics/0604089.
[Fie97] J. H. Field. A new kinematical derivation of the Lorentz trans-
formation and the particle description of light. Helv. Phys.
Acta, 70:542, 1997. http://arxiv.org/abs/physics/0410062.
[Fie06] J. H. Field. Classical electromagnetism as a consequence
of Coulombs law, special relativity and Hamiltons principle
and its relationship to quantum electrodynamics. Phys. Scr.,
74:702, 2006. http://arxiv.org/abs/physics/0501130v5.
[Fiv70] D. I. Fivel. Solutions of the Lee model in all sectors by dynam-
ical algebra. J. Math. Phys., 11:699, 1970.
BIBLIOGRAPHY 815
[FK03] E. B. Fomalont and S. M. Kopeikin. The measurement of the
light deection from Jupiter: Experimental results. Astrophys.
J., 598:704, 2003. http://arxiv.org/abs/astro-ph/0302294v2.
[Fla98] T. Van Flandern. The speed of gravity - what the experiments
say. Phys. Lett. A, 250:1, 1998.
[FM96] G. Fiore and G. Modanese. General properties of the decay
amplitudes for massless particles. Nucl. Phys., B477:623, 1996.
[FN94] W. I. Fushchich and A. G. Nikitin. Symmetries of equations of
quantum mechanics. New York, 1994.
[Fol61] L. L. Foldy. Relativistic particle systems with interaction. Phys.
Rev., 122:275, 1961.
[Fou] D. J. Foulis. A half century of quan-
tum logic. What have we learned?
http://www.quantonics.com/Foulis On Quantum Logic.html .
[Fra07] J. Franklin. The nature of electromagnetic energy, 2007.
http://arxiv.org/abs/0707.3421v2.
[Fri94] A. Friedman. Nonstandard extension of quantum logic and
Diracs bra-ket formalism of quantum mechanics. Int. J. Theor.
Phys., 33:307, 1994.
[FRS
+
02] S. Francis, B. Ramsey, S. Stein, J. Leitner, M. Moreau,
R. Burns, R. A. Nelson, T. R. Bartholomew, and A. Giord.
Timekeeping and time dissemination in a distributed space-
based clock ensemble. 2002. in Proceedings of the 34
th
Annual
Precise Time and Time Interval (PTTI) Systems and Applica-
tions Meeting, Reston, Virginia, USA (3 - 5 December 2002)
tycho.usno.navy.mil/ptti/ptti2002/paper20.pdf.
[FS64] R. Fong and J. Sucher. Relativistic particle dynamics and the
S-matrix. J. Math. Phys., 5:456, 1964.
[FS73] V. A. Fateev and A. S. Shvarts. Dressing operators in quantum
eld theory. Dokl. Akad. Nauk SSSR, 209:66, 1973. [English
translation in Sov. Phys. Dokl. 18 (1973), 165.].
816 BIBLIOGRAPHY
[FS88] G. Feinberg and J. Sucher. Two-photon-exchange force between
charged systems: Spinless particles. Phys. Rev. D, 38:3763,
1988.
[Fur69] W. H. Furry. Examples of momentum distributions in the elec-
tromagnetic eld and in matter. Am. J. Phys., 37:621, 1969.
[FV02] T. Van Flandern and J. P. Vigier. Experimental repeal of the
speed limit for gravitational, electromagnetic, and quantum
eld interactions. Found. Phys., 32:1031, 2002.
[Gal01] Galileo Galilei. Dialogues Concerning the Two Chief World
Systems. Modern Library Science Series, New York, 2001.
[Gal05] E. A. Galapon. Theory of conned quantum time of arrivals,
2005. http://arxiv.org/abs/quant-ph/0504174.
[Gau03] N. Gauthier. What happens to energy and momentum when
two oppositely-moving wave pulses overlap? Am. J. Phys.,
71:787, 2003.
[GI91a] G. C. Giakos and T. K. Ishii. Anomalous microwave propaga-
tion in open space. Microwave and Optical Technology Letters,
4:79, 1991.
[GI91b] G. C. Giakos and T. K. Ishii. Rapid pulsed microwave prop-
agation. IEEE Microwave and Guided Wave Letters, 1:374,
1991.
[GJR01] N. Graneau, T. Phipps Jr., and D. Roscoe. An experimen-
tal conrmation of longitudinal electrodynamic forces. Europ.
Phys. J. D, 15:87, 2001.
[GL] C. Giunti and M. Laveder. Neutrino mixing. in Develop-
ments in Quantum Physics, edited by F. H. Columbus and V.
Krasnoholovets, (Nova Science, New York, 2004) pp. 197-254,
http://arxiv.org/abs/hep-ph/0310238.
[G la] St. D. G lazek. Similarity renormalization group ap-
proach to boost invariant Hamiltonian dynamics.
http://arxiv.org/abs/hep-th/9712188.
BIBLIOGRAPHY 817
[Gle57] A. M. Gleason. Measures on the closed subspaces of a Hilbert
space. J. Math. Mech., 6:885, 1957.
[GR80] S. N. Gupta and S. F. Radford. Quantum eld-theoretical elec-
tromagnetic and gravitational two-particle potentials. Phys.
Rev. D, 21:2213, 1980.
[GR00] I. S. Gradshteyn and I. M. Ryzhik. Tables of Integrals, Series,
and Products. Academic Press, San Diego, 2000.
[GRI89] S. N. Gupta, W. W. Repko, and C. J. Suchyta III. Muonium
and positronium potentials. Phys. Rev. D, 40:4100, 1989.
[GRT96] N. Grot, C. Rovelli, and R. S. Tate. Time-of-arrival in quantum
mechanics, 1996. http://arxiv.org/abs/quant-ph/9603021v1.
[GS58] O. W. Greenberg and S. S. Schweber. Clothed particle oper-
ators in simple models of quantum eld theory. Nuovo Cim.,
8:378, 1958.
[Gup63] A. K. Das Gupta. Unipolar machines, association of the mag-
netic eld with the eld-producing magnet. Am. J. Phys.,
31:428, 1963.
[GW93] St. D. G lazek and K. G. Wilson. Renormalization of hamilto-
nians. Phys. Rev. D, 48:5863, 1993.
[GZL] T. P. Gill, W. W. Zachary, and J. Lindesay. The classical
electron problem. http://arxiv.org/abs/physics/0405131v1.
[HBH
+
01] J. B. Hertzberg, S. R. Bickman, M. T. Hummon, Jr. D. Krause,
S. K. Peck, and L. R. Hunter. Measurement of the relativistic
potential dierence across a rotating magnetic dielectric cylin-
der. Am. J. Phys., 69:648, 2001.
[HC] H. Halvorson and R. Clifton. No place for particles in
relativistic quantum theories? http://arxiv.org/abs/quant-
ph/0103041.
[Heg98] G. C. Hegerfeldt. Instantaneous spreading and Einstein causal-
ity in quantum theory. Ann. Phys. (Leipzig), 7:716, 1998.
http://arxiv.org/abs/quant-ph/9809030.
818 BIBLIOGRAPHY
[Hei58] W. Heisenberg. Physics and Philosophy. Harper and Brothers,
New York, 1958.
[HK72] J. C. Hafele and R. E. Keating. Around-the-world atomic
clocks: Observed relativistic time gains. Science, 177:168,
1972.
[HMS79] D. Hasselkamp, E. Mondry, and A. Scharmann. Direct obser-
vation of the transversal Doppler-shift. Z. Physik A, 289:151,
1979.
[HN] A. Haibel and G. Nimtz. Universal tunneling time in photonic
barriers. http://arxiv.org/abs/physics/0009044.
[HN08] G. C. Hegerfeldt and J. T. Neumann. The Aharonov-Bohm
eect: the role of tunneling and associated forces, 2008.
http://arxiv.org/abs/arXiv:0801.0799v2.
[Hni04] V. Hnizdo. On linear momentum in quasistatic electromagnetic
systems, 2004. http://arxiv.org/abs/physics/0407027v1.
[Hob12] A. Hobson. There are no particles, there are only elds, 2012.
http://arxiv.org/abs/1204.4616.
[Hol04] B. R. Holstein. Eective interactions and the hydrogen atom.
Am. J. Phys., 72:333, 2004.
[Hov55] L. Van Hove. Energy corrections and persistent perturbation
eects in continuous spectra. Physica, 21:901, 1955.
[Hov56] L. Van Hove. Energy corrections and persistent perturbation ef-
fects in continuous spectra. II. The perturbed stationary states.
Physica, 22:343, 1956.
[How44] G. W. O. Howe. A problem of two electrons and Newtons third
law. Wireless Engineer, 21:105, 1944.
[HP02] A. Hache and L. Poirier. Long-range superluminal pulse propa-
gation in a coaxial photonic crystal. Appl. Phys. Lett., 80:518,
2002.
BIBLIOGRAPHY 819
[Hsi00] W. Y. Hsiang. Lectures on Lie Groups. World Scientic, Sin-
gapore, 2000.
[IS38] H. E. Ives and G. R. Stilwell. An experimental study of the
rate of a moving clock. J. Opt. Soc. Am, 28:215, 1938.
[IS41] H. E. Ives and G. R. Stilwell. An experimental study of the
rate of a moving clock. II. J. Opt. Soc. Am, 31:369, 1941.
[Ito65] T. Itoh. Derivation of nonrelativistic Hamiltonian for electrons
from quantum electrodynamics. Rev. Mod. Phys., 37:159, 1965.
[Jac99] J. D. Jackson. Classical electrodynamics. J. Wiley and Sons,
3rd edition, 1999.
[Jac04] J. D. Jackson. Torque or no torque? Simple charged particle
motion observed in dierent inertial frames. Am. J. Phys.,
72:1484, 2004.
[Jau71] J. M. Jauch. Projective representation of the Poncare group
in a quaternionic Hilbert space. in Group theory and its ap-
plications, edited by E.M. Loebl. Academic Press, New York,
1971.
[Jef99a] O. D. Jemenko. A relativistic paradox seemingly violating
conservation of momentum law in electromagnetic systems.
Eur. J. Phys., 20:39, 1999.
[Jef99b] O. D. Jemenko. The Trouton-Noble paradox. J. Phys. A:
Math. Gen., 32:3755, 1999.
[Joh96] L. Johansson. Longitudinal electrodynamic forces -
and their possible technological applications, 1996.
MSc Thesis, Lund Institute of Technology, Sweden,
http://www.df.lth.se/ snorkelf/LongitudinalMSc.pdf.
[Jor77] T. F. Jordan. Identication of the velocity operator for an
irreducible unitary representation of the Poincare group. J.
Math. Phys., 18:608, 1977.
[Jor80] T. F. Jordan. Simple derivation of the Newton-Wigner position
operator. J. Math. Phys., 21:2028, 1980.
820 BIBLIOGRAPHY
[Kac59] C. Kacser. Higher Born approximations in non-relativistic
Coulomb scattering. Nuovo Cim., 13:303, 1959.
[KAC90] T. P. Krisher, J. D. Anderson, and J. K. Campbell. Test of
the gravitational redshift eect at Saturn. Phys. Rev. Lett.,
64:1322, 1990.
[Kaz71] E. Kazes. Analytic theory of relativistic interactions. Phys.
Rev. D, 4:999, 1971.
[Kei] B. D. Keister. Forms of relativistic dynamics: What are the
possibilities? http://arxiv.org/abs/nucl-th/9406032.
[Kel42] J. M. Keller. Newtons third law and electrodynamics. Am. J.
Phys., 10:302, 1942.
[Ken17] E. H. Kennard. On unipolar induction: Another experiment
and its signicance as evidence for the existence of the aether.
Phil. Mag., 33:179, 1917.
[Ker62] E. H. Kerner. Electromagnetism and gravitation: an action-at-
a-distance conuence. Phys. Rev., 125:2184, 1962.
[KF74] R. A. Krajcik and L. L. Foldy. Relativistic center-of-mass
variables for composite systems with arbitrary internal inter-
actions. Phys. Rev. D, 10:1777, 1974.
[Kha97] L. A. Khaln. Quantum theory of unstable particles
and relativity, 1997. Preprint of Steklov Mathemati-
cal Institute, St. Petersburg Department, PDMI-6/1997
http://www.pdmi.ras.ru/preprint/1997/97-06.html.
[Kho] A. L. Kholmetskii. The authors collection of
relativistic paradoxes in classical electrodynam-
ics. http://www.space-lab.ru/les/pages/PIRT VII-
XII/pages/text/PIRT IX/Kholmetskii 1.pdf.
[Kho03] A.L. Kholmetskii. One century later: Remarks on the Barnett
experiment. Am. J. Phys., 71:558, 2003.
BIBLIOGRAPHY 821
[Kho05] A. L. Kholmetskii. On momentum and energy
of a non-radiating electromagnetic eld, 2005.
http://arxiv.org/abs/physics/0501148v2.
[Kho06] A. L. Kholmetskii. Momentum-energy of the non-radiating
electromagnetic eld: open problems? Phys. Scr., 73:620,
2006.
[Kit66] H. Kita. A non-trivial example of a relativistic quantum theory
of particles without divergence diculties. Progr. Theor. Phys.,
35:934, 1966.
[Kit68] H. Kita. Another convergent relativistic model theory of inter-
acting particles. Progr. Theor. Phys., 39:1333, 1968.
[Kit70] H. Kita. Structure of the state space in a convergent relativis-
tic model theory of interacting particles. Progr. Theor. Phys.,
43:1364, 1970.
[Kit72a] H. Kita. A model of relativistic quantum mechanics of inter-
acting particles. Progr. Theor. Phys., 48:2422, 1972.
[Kit72b] H. Kita. Vertex functions in convergent relativistic model the-
ories. Progr. Theor. Phys., 47:2140, 1972.
[Kit73] H. Kita. A realistic model of convergent relativistic quan-
tum mechanics of interacting particles. Progr. Theor. Phys.,
49:1704, 1973.
[KMA93] T. P. Krisher, D. D. Morabito, and J. D. Anderson. The Galileo
solar redshift experiment. Phys. Rev. Lett., 70:2213, 1993.
[KMSR
+
07] A. L. Kholmetskii, O. V. Missevitch, R. Smirnov-Rueda,
R. I. Tzonchev, A. E. Chubykalo, and I. Moreno. Exper-
imental evidence on non-applicability of the standard retar-
dation condition to bound magnetic elds and on new gen-
eralized Biot-Savart law. J. Appl. Phys., 101:023532, 2007.
http://arxiv.org/abs/physics/0601084v1.
[Kop03] S. M. Kopeikin. The measurement of the light de-
ection from Jupiter: Theoretical interpretation, 2003.
http://arxiv.org/abs/astro-ph/0302462v1.
822 BIBLIOGRAPHY
[KP91] B. D. Keister and W. N. Polyzou. Relativistic Hamiltonian
dynamics in nuclear and particle physics. in Advances in Nu-
clear Physics vol. 20, edited by J. W. Negele and E. W. Vogt.
Plenum Press, 1991. http://www.physics.uiowa.edu/ wpoly-
zou/papers/rev.pdf.
[KPR85] M. Kaivola, O. Poulsen, and E. Riis. Measurement of the rel-
ativistic Doppler shift in neon. Phys. Rev. Lett., 54:255, 1985.
[KR81] T. Katila and K. Riski. Measurement of the interaction between
electromagnetic radiation and gravitational eld using Zn-67
Mossbauer spectroscopy. Phys. Lett, 83A:51, 1981.
[KSO97] M. Kobayashi, T. Sato, and H. Ohtsubo. Eective interactions
for mesons and baryons in nuclei. Progr. Theor. Phys., 98:927,
1997.
[KV02] A. Kislev and L. Vaidman. Relativistic causality and conserva-
tion of energy in classical electromagnetic theory. Am. J. Phys.,
70:1216, 2002. http://arxiv.org/abs/physics/0201042v1.
[KvB] J. Koer and
C. Brukner. Classical world because of quantum
physics. http://arxiv.org/abs/quant-ph/0609079.
[KY07a] A. L. Kholmetskii and T. Yarman. Apparent paradoxes in clas-
sical electrodynamics: relativistic transformation of force. Eur.
J. Phys., 28:537, 2007.
[KY07b] A. L. Kholmetskii and T. Yarman. Relativistic transforma-
tion of force: resolution of apparent paradoxes. Eur. J. Phys.,
28:1081, 2007.
[KY08] A. L. Kholmetskii and T. Yarman. Energy ow in a bound
electromagnetic eld: resolution of apparent paradoxes. Eur.
J. Phys., 29:1135, 2008.
[LEK92] D. K. Lemon, W. F. Edwards, and C. S. Kenyon. Phys. Lett.
A, 162:105, 1992.
[Liv47] I. M. Livshitz. JETP, 11:1017, 1947.
BIBLIOGRAPHY 823
[LK75] A. R. Lee and T. M. Kalotas. Lorentz transformations from
the rst postulate. Am. J. Phys., 43:434, 1975.
[LL73] L. Landau and E. Lifshitz. Course of theoretical physics, Vol-
ume 2, Field theory. Moscow, Nauka, 6th edition (in Russian),
1973.
[LL76] J.-M. Levy-Leblond. One more derivation of the Lorentz trans-
formation. Am. J. Phys., 44:271, 1976.
[LL77] L. Landau and E. Lifshitz. Course of theoretical physics, Vol-
ume 3, Quantum mechanics, Non-relativistic theory. Perga-
mon, 1977.
[LLB90] J.-M. Levy-Leblond and F. Balibar. Quantics. North Holland,
1990.
[Mac63] G. W. Mackey. The mathematical foundations of quantum me-
chanics. W. A. Benjamin, New York, 1963. see esp. Section
2-2.
[Mac86] D. W. MacArthur. Special relativity: Understanding experi-
mental tests and formulations. Phys. Rev. A, 33:1, 1986.
[Mac00] G. W. Mackey. The theory of unitary group representations.
University of Chicago Press, Chicago, 2000.
[Mag54] W. Magnus. On the exponential solution of dierential equa-
tions for a linear operator. Commun. Pure Appl. Math., 7:649,
1954.
[Mal96] D. B. Malament. In defence of dogma: Why there cannot be
a relativistic quantum mechanics of (localizable) particles. in
Perspectives on quantum reality, edited by R. Clifton. Kluwer,
1996.
[Mat75] T. Matolcsi. Tensor product of Hilbert lattices and free or-
thodistributive product of orthomodular lattices. Acta Sci.
Math. (Szeged), 37:263, 1975.
824 BIBLIOGRAPHY
[MB01] W. L. Mochan and V. L. Brudny. Comment on Noncausal
time response in frustrated total internal reection?. Phys.
Rev. Lett., 87:119101, 2001.
[McDa] K. T. McDonald. Cullwicks paradox: Charged particle on the
axis of a toroidal magnet. http://puhep1.princeton.edu/ mc-
donald/examples/cullwick.pdf.
[McDb] K. T. McDonald. Limits on the applicability of classical
electromagnetic elds as inferred from the radiation reaction.
http://arxiv.org/abs/physics/0003062.
[McDc] K. T. McDonald. The Wilson-Wilson experiment.
http://128.112.100.2/ mcdonald/examples/wilson.pdf.
[McD06] K. T. McDonald. Onoochins paradox,
2006. http://puhep1.princeton.edu/ mcdon-
ald/examples/onoochin.pdf.
[Mer09] N. D. Mermin. Whats bad about this habit. Physics Today,
May:8, 2009.
[MIB03] G. Matteucci, D. Iencinella, and C. Beeli. The Aharonov-Bohm
phase shift and Boyers critical considerations: New experimen-
tal result but still an open subject? Found. Phys., 33:577, 2003.
[Mit] P. Mittelstaedt. Quantum physics and classical physics -
in the light of quantum logic. http://arxiv.org/abs/quant-
ph/0211021.
[MKSR11] O. V. Missevitch, A. L. Kholmetskii, and R. Smirnov-Rueda.
Anomalously small retardation of bound (force) electromag-
netic elds in antenna near zone. Europhys. Lett., 93:64004,
2011.
[MM97] A. H. Monahan and M. McMillan. Lorentz boost of the Newton-
Wigner-Pryce position operator. Phys. Rev. D, 56:2563, 1997.
[MRR00] D. Mugnai, A. Ranfagni, and R. Ruggeri. Observation of su-
perluminal behaviors in wave propagation. Phys. Rev. Lett.,
21:4830, 2000.
BIBLIOGRAPHY 825
[Mut78] U. Mutze. A no-go theorem concerning the cluster decomposi-
tion property of direct interaction scattering theories. J. Math.
Phys., 19:231, 1978.
[NFRS78] D. Newman, G. W. Ford, A. Rich, and E. Sweetman. Precision
experimental verication of special relativity. Phys. Rev. Lett.,
21:1355, 1978.
[Nik] H. Nikolic. Time in relativistic and nonrelativistic quantum
mechanics. http://arxiv.org/abs/0811.1905v1.
[Nor03] K. Nordtvedt. Lunar laser ranging - a comprehensive probe
of post-Newtonian gravity, 2003. http://arxiv.org/abs/gr-
qc/0301024v1.
[NS07] G. Nimtz and A. A. Stahlhofen. Universal tunneling time for
all elds, 2007. http://arxiv.org/abs/0709.0921.
[NW49] T. D. Newton and E. P. Wigner. Localized states for elementary
systems. Rev. Mod. Phys., 21:400, 1949.
[OMK
+
86] N. Osakabe, T. Matsuda, T. Kawasaki, J. Endo, A. Tono-
mura, S. Yano, and H. Yamada. Experimental conrmation
of Aharonov-Bohm eect using a toroidal magnetic eld con-
ned by a superconductor. Phys. Rev. A: Math. Gen., 34:815,
1986.
[opu59] J. L opusza` nski. The Ruijgrok-van Hove model of eld theory
in terms of dressed operators. Physica, 25:745, 1959.
[ORU98] J. Oppenheim, B. Reznik, and W.G. Unruh. Time-of-arrival
states, 1998. http://arxiv.org/abs/quant-ph/9807043.
[Osb68] H. Osborn. Relativistic center-of-mass variables for two-particle
system with spin. Phys. Rev., 176:1514, 1968.
[OST00] L. B. Okun, K. G. Selivanov, and V. L. Telegdi. On the inter-
pretation of the redshift in a static gravitational eld. Am. J.
Phys., 68:115, 2000. http://arxiv.org/abs/physics/9907017v2.
[Par] S. Parrott. Variant forms of Eliezers theorem.
http://arxiv.org/abs/gr-qc/0505042.
826 BIBLIOGRAPHY
[Par02] S. Parrott. Radiation from a uniformly accelerated charge
and the equivalence principle. Found. Phys., 32:407, 2002.
http://arxiv.org/abs/gr-qc/9303025.
[Pin04] M. J. Pinheiro. Do Maxwells equations need
a revision? - A methodological note, 2004.
http://arxiv.org/abs/physics/0511103v3.
[Pir64] C. Piron. Axiomatique quantique. Helv. Phys. Acta, 37:439,
1964.
[Pir76] C. Piron. Foundations of Quantum Physics. W. A. Benjamin,
Reading, 1976.
[PJ60] R. V. Pound and G. A. Rebka Jr. Apparent weight of photons.
Phys. Rev. Lett., 4:337, 1960.
[PL66] P. Pechukas and J. C. Light. On the exponential form of time-
displacement operators in quantum mechanics. J. Chem. Phys.,
44:3897, 1966.
[PN45] L. Page and N. I. Adams Jr. Action and reaction between
moving charges. Am. J. Phys., 13:141, 1945.
[Pol] R. Polishchuk. Derivation of the Lorentz transformations.
http://arxiv.org/abs/physics/0110076.
[Pol85] W. N. Polyzou. Manifestly covariant, Poincare invariant quan-
tum theories of directly interacting particles. Phys. Rev. D,
32:995, 1985.
[Pol03] W. N. Polyzou. Relativistic quantum mechanics - particle pro-
duction and cluster properties. Phys. Rev. C, 68:015202, 2003.
http://arxiv.org/abs/nucl-th/0302023.
[Pry48] M. H. L. Pryce. The mass-centre in the restricted theory of
relativity and its connexion with the quantum theory of ele-
mentary particles. Proc. Royal Soc. London, Ser. A, 195:62,
1948.
[PS] A. Pineda and J. Soto. Potential NRQED: The positronium
case. http://arxiv.org/abs/hep-ph/9805424.
BIBLIOGRAPHY 827
[PS65] R. V. Pound and J. L. Snider. Eect of gravity on gamma
radiation. Phys. Rev., 140:B788, 1965.
[PS95a] G. N. Pellegrini and A. R. Swift. Maxwells equations in a
rotating medium: Is there a problem? Am. J. Phys., 63:694,
1995.
[PS95b] M. E. Peskin and D. V. Schroeder. An introduction to quantum
eld theory. Westview Press, 1995.
[PS98] A. Pineda and J. Soto. The Lamb shift in dimensional regular-
ization. Phys. Lett. B, 420:391, 1998.
[PSS
+
92] W. Potzel, C. Schafer, M. Steiner, H. Karzel, W. Schiessl,
M. Peter, G. M. Kalvius, T. Katila, E. Ikonen, P. Helist o, J. Hi-
etaniemi, and K. Riski. Gravitational redshift experiments with
the high-resolution Mossbauer resonance in
67
Zn. Hyperne In-
teractions, 72:195, 1992.
[RB99] F. Richman and D. Bridges. A constructive proof of Gleasons
theorem. J. Funct. Anal., 162:287, 1999.
[Red53] M. L. G. Redhead. Radiative corrections to the scattering
of electrons and positrons by electrons. Proc. Roy. Soc. A,
220:219, 1953.
[RF69] R. A. Reck and D. L. Fry. Orbital and spin magnetization in
Fe-Co, Fe-Ni, and Ni-Co. Phys. Rev., 184:492, 1969.
[RGR91] H. Rubio, J. M. Getino, and O. Rojo. The Aharonov-Bohm
eect as a classical electromagnetic eect using electromagnetic
potentials. Nuovo Cim. B, 106:407, 1991.
[RH41] B. Rossi and D. B. Hall. Variation of the rate of decay of
mesotron with momentum. Phys. Rev., 59:223, 1941.
[RI96] T. G. Rolling and T. K. Ishii. Comparative propagation study
of pulse-modulated microwaves. Microwave and Optical Tech-
nology Letters, 13:202, 1996.
828 BIBLIOGRAPHY
[Rit61] V. I. Ritus. Transformations of the inhomogeneous Lorentz
group and the relativistic kinematics of polarized states. Soviet
Physics JETP, 13:240, 1961.
[RM96] A. Ranfagni and D. Mugnai. Anomalous pulse delay in mi-
crowave propagation: A case of superluminal behavior. Phys.
Rev. E, 54:5692, 1996.
[RMM74] T. N. Rescigno, C. W. McCurdy, and V. McKoy. Discrete basis
set approach to nonspherical scattering. Chem. Phys. Lett.,
27:401, 1974.
[RMR
+
80] C. E. Roos, J. Marrano, S. Reucroft, J. Waters, M. S. Web-
ster, E. G. H. Williams, A. Manz, R. Settles, and G. Wolf.