of 689 /689
arXiv:physics/0504062v10 [physics.gen-ph] 5 Nov 2008  R E L A T I    V I    S  T I    C   Q  U A T  U Y A I    C  S  E  u   g  e n  e  V  S   t   e f    a n  o  v i    c h  2   0   0   8  

Relativistic Quantum

Embed Size (px)

Text of Relativistic Quantum

Page 1: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 1/687

arXiv:physics/0504062v10 [physics.gen-ph] 5

RE L

A T I VI S T I C Q UA

NT UM

DYNA MI C S

E u g e n e V. S t

e f an o vi c h

2 0 0 8

Page 2: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 2/687

ii

Page 3: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 3/687

iii

Draft, 2nd edition

RELATIVISTIC QUANTUM DYNAMICS:

A Non-Traditional Perspective on Space, Time,Particles, Fields, and Action-at-a-Distance

Eugene V. Stefanovich 1

Mountain View, California

Copyright c2004 - 2008 Eugene V. Stefanovich

1Present address: 2255 Showers Dr., Apt. 153, Mountain View, CA 94040, USA.e-mail: eugene st[email protected] address: http://www.geocities.com/meopemuk,

http://www.arxiv.org/abs/physics/0504062

Page 4: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 4/687

iv

Page 5: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 5/687

v

To Regina

Page 6: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 6/687

vi

Abstract

This book is an attempt to build a consistent relativistic quantum theoryof interacting particles. In the first part of the book “Quantum electrody-namics” we present traditional views on theoretical foundations of particlephysics. Our discussion proceeds systematically from the principle of rel-ativity and postulates of measurements to the renormalization in quantumelectrodynamics. In the second part of the book “The quantum theory of particles” the traditional approach is reexamined. We find that formulas of special relativity should be modified to take into account interparticle inter-actions. We also suggest to reinterpret quantum field theory in the languageof physical “dressed” particles. In this new formulation the fundamental ob-

jects are particles rather than fields. This approach eliminates the need forrenormalization and opens up a new way for studying dynamical and boundstate properties of quantum interacting systems. The developed theory isapplied to realistic physical objects and processes including the hydrogenatom, the decay law of moving unstable particles, the dynamics of interact-ing charges, and boost transformations of observables. These results forceus to take a fresh look at some core issues of modern particle theories, inparticular, the Minkowski space-time unification, the role of quantum fieldsand renormalization, and the alleged impossibility of action-at-a-distance. Anew perspective on these issues is suggested that can help to solve the biggestproblem of modern theoretical physics – a consistent unification of relativity

and quantum mechanics.

Page 7: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 7/687

Contents

PREFACE xvii

INTRODUCTION xxv

I QUANTUM ELECTRODYNAMICS 1

1 THE POINCARE GROUP 31.1 Inertial observers . . . . . . . . . . . . . . . . . . . . . . . . . 3

1.1.1 The principle of relativity . . . . . . . . . . . . . . . . 31.1.2 Inertial transformations . . . . . . . . . . . . . . . . . 5

1.2 The Galilei group . . . . . . . . . . . . . . . . . . . . . . . . . 71.2.1 The multiplication law of the Galilei group . . . . . . . 7

1.2.2 Lie algebra of the Galilei group . . . . . . . . . . . . . 81.2.3 Transformations of generators under rotations . . . . . 111.2.4 Space inversions . . . . . . . . . . . . . . . . . . . . . . 14

1.3 The Poincare group . . . . . . . . . . . . . . . . . . . . . . . . 151.3.1 Lie algebra of the Poincare group . . . . . . . . . . . . 161.3.2 Transformations of translation generators under boosts 21

2 QUANTUM MECHANICS 252.1 Why do we need quantum mechanics? . . . . . . . . . . . . . 26

2.1.1 The corpuscular theory of light . . . . . . . . . . . . . 26

2.1.2 The wave theory of light . . . . . . . . . . . . . . . . . 292.1.3 Low intensity light and other experiments . . . . . . . 312.2 Physical foundations of quantum mechanics . . . . . . . . . . 33

2.2.1 Ensembles and experiments . . . . . . . . . . . . . . . 332.2.2 Measurements in classical mechanics . . . . . . . . . . 34

vii

Page 8: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 8/687

viii CONTENTS

2.2.3 The quantum case . . . . . . . . . . . . . . . . . . . . 36

2.3 The lattice of propositions . . . . . . . . . . . . . . . . . . . . 372.3.1 Propositions and states . . . . . . . . . . . . . . . . . . 392.3.2 Partial ordering . . . . . . . . . . . . . . . . . . . . . . 422.3.3 Meet . . . . . . . . . . . . . . . . . . . . . . . . . . . . 432.3.4 Join . . . . . . . . . . . . . . . . . . . . . . . . . . . . 442.3.5 Orthocomplement . . . . . . . . . . . . . . . . . . . . . 442.3.6 Atomic propositions . . . . . . . . . . . . . . . . . . . 48

2.4 Classical logic . . . . . . . . . . . . . . . . . . . . . . . . . . . 492.4.1 Truth tables and the distributive law . . . . . . . . . . 492.4.2 Atomic propositions in classical logic . . . . . . . . . . 522.4.3 Atoms and pure classical states . . . . . . . . . . . . . 542.4.4 The classical phase space . . . . . . . . . . . . . . . . . 56

2.5 Quantum logic . . . . . . . . . . . . . . . . . . . . . . . . . . 592.5.1 Compatibility of propositions . . . . . . . . . . . . . . 592.5.2 The logic of quantum mechanics . . . . . . . . . . . . . 62

2.6 Quantum observables and states . . . . . . . . . . . . . . . . . 662.6.1 Observables . . . . . . . . . . . . . . . . . . . . . . . . 662.6.2 States . . . . . . . . . . . . . . . . . . . . . . . . . . . 682.6.3 Commuting and compatible observables . . . . . . . . 702.6.4 Expectation values . . . . . . . . . . . . . . . . . . . . 722.6.5 Basic rules of classical and quantum mechanics . . . . 73

2.7 Is quantum mechanics a complete theory? . . . . . . . . . . . 742.7.1 Quantum unpredictability in microscopic systems . . . 742.7.2 The Copenhagen interpretation . . . . . . . . . . . . . 762.7.3 The realistic interpretation . . . . . . . . . . . . . . . . 772.7.4 The statistical interpretation of quantum mechanics . . 78

3 QUANTUM MECHANICS AND RELATIVITY 813.1 Inertial transformations in quantum mechanics . . . . . . . . . 81

3.1.1 Wigner theorem . . . . . . . . . . . . . . . . . . . . . . 823.1.2 Inertial transformations of states . . . . . . . . . . . . 85

3.1.3 The Heisenberg and Schrodinger pictures . . . . . . . . 873.2 Unitary representations of the Poincare group . . . . . . . . . 883.2.1 Projective representations of groups . . . . . . . . . . . 883.2.2 Elimination of central charges in the Poincare algebra . 893.2.3 Single- and double-valued representations . . . . . . . . 98

Page 9: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 9/687

CONTENTS ix

3.2.4 The fundamental statement of relativistic quantum the-

ory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1014 OPERATORS OF OBSERVABLES 105

4.1 Basic observables . . . . . . . . . . . . . . . . . . . . . . . . . 1064.1.1 Energy, momentum, and angular momentum . . . . . . 1064.1.2 The operator of velocity . . . . . . . . . . . . . . . . . 107

4.2 Casimir operators . . . . . . . . . . . . . . . . . . . . . . . . . 1084.2.1 4-vectors . . . . . . . . . . . . . . . . . . . . . . . . . . 1094.2.2 The operator of mass . . . . . . . . . . . . . . . . . . . 1094.2.3 The Pauli-Lubanski 4-vector . . . . . . . . . . . . . . . 110

4.3 Operators of spin and position . . . . . . . . . . . . . . . . . . 113

4.3.1 Physical requirements . . . . . . . . . . . . . . . . . . 1134.3.2 The spin operator . . . . . . . . . . . . . . . . . . . . . 1154.3.3 The position operator . . . . . . . . . . . . . . . . . . 1174.3.4 An alternative set of basic operators . . . . . . . . . . 1214.3.5 Canonical form and “power” of operators . . . . . . . . 1234.3.6 The uniqueness of the spin operator . . . . . . . . . . . 1264.3.7 The uniqueness of the position operator . . . . . . . . 1284.3.8 Boost transformations of the position operator . . . . . 129

5 SINGLE PARTICLES 1335.1 Massive particles . . . . . . . . . . . . . . . . . . . . . . . . . 135

5.1.1 Irreducible representations of the Poincare group . . . 1355.1.2 Momentum-spin basis . . . . . . . . . . . . . . . . . . 1385.1.3 Action of Poincare transformations . . . . . . . . . . . 141

5.2 Momentum and position representations . . . . . . . . . . . . 1445.2.1 Spectral decomposition of the identity operator . . . . 1455.2.2 Wave function in the momentum representation . . . . 1485.2.3 The position representation . . . . . . . . . . . . . . . 150

5.3 The classical limit of quantum mechanics . . . . . . . . . . . . 1545.3.1 Quasiclassical states . . . . . . . . . . . . . . . . . . . 1555.3.2 The Heisenberg uncertainty relation . . . . . . . . . . . 156

5.3.3 Spreading of quasiclassical wave packets . . . . . . . . 1575.3.4 Classical observables and the Poisson bracket . . . . . 158

5.4 Massless particles . . . . . . . . . . . . . . . . . . . . . . . . . 1635.4.1 Spectra of momentum, energy, and velocity . . . . . . 1635.4.2 The Doppler effect and aberration . . . . . . . . . . . . 164

Page 10: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 10/687

x CONTENTS

5.4.3 Representations of the little group . . . . . . . . . . . . 167

5.4.4 Massless representations of the Poincare group . . . . . 1706 INTERACTION 173

6.1 The Hilbert space of many-particle system . . . . . . . . . . . 1746.1.1 The tensor product theorem . . . . . . . . . . . . . . . 1746.1.2 Particle observables in a multiparticle system . . . . . 1766.1.3 Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . 177

6.2 Relativistic Hamiltonian dynamics . . . . . . . . . . . . . . . . 1796.2.1 Non-interacting representation of the Poincare group . 1806.2.2 Dirac’s forms of dynamics . . . . . . . . . . . . . . . . 1816.2.3 Total observables in a multiparticle system . . . . . . . 183

6.3 The instant form of dynamics . . . . . . . . . . . . . . . . . . 1846.3.1 General instant form interaction . . . . . . . . . . . . . 1846.3.2 The Bakamjian-Thomas construction . . . . . . . . . . 1856.3.3 Bakamjian’s construction of the point form dynamics . 1886.3.4 Non-Bakamjian-Thomas instant forms of dynamics . . 1896.3.5 Cluster separability . . . . . . . . . . . . . . . . . . . . 1926.3.6 Non-separability of the Bakamjian-Thomas dynamics . 1946.3.7 Cluster separable 3-particle interaction . . . . . . . . . 196

6.4 Dynamics and bound states . . . . . . . . . . . . . . . . . . . 2006.4.1 Mass and energy spectra . . . . . . . . . . . . . . . . . 2016.4.2 The Doppler effect revisited . . . . . . . . . . . . . . . 2026.4.3 Time evolution . . . . . . . . . . . . . . . . . . . . . . 206

6.5 Scattering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2076.5.1 The scattering operator . . . . . . . . . . . . . . . . . 2086.5.2 S-operator in perturbation theory . . . . . . . . . . . . 2116.5.3 Scattering equivalence of Hamiltonians . . . . . . . . . 2146.5.4 Scattering equivalence of the point and instant forms . 216

7 THE FOCK SPACE 2217.1 Annihilation and creation operators . . . . . . . . . . . . . . . 222

7.1.1 Sectors with fixed numbers of particles . . . . . . . . . 222

7.1.2 Non-interacting representation of the Poincare group . 2247.1.3 Creation and annihilation operators. Fermions . . . . . 2257.1.4 Anticommutators of particle operators . . . . . . . . . 2277.1.5 Creation and annihilation operators. Photons . . . . . 2297.1.6 Particle number operators . . . . . . . . . . . . . . . . 229

Page 11: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 11/687

CONTENTS xi

7.1.7 Continuous spectrum of momentum . . . . . . . . . . . 230

7.1.8 Generators of the non-interacting representation . . . . 2327.1.9 Poincare transformations of particle operators . . . . . 2357.2 Interaction potentials . . . . . . . . . . . . . . . . . . . . . . . 236

7.2.1 Conservation laws . . . . . . . . . . . . . . . . . . . . . 2377.2.2 Normal ordering . . . . . . . . . . . . . . . . . . . . . 2387.2.3 The general form of the interaction operator . . . . . . 2397.2.4 Five types of regular potentials . . . . . . . . . . . . . 2427.2.5 Products and commutators of potentials . . . . . . . . 2457.2.6 Adiabatic switching and t-integrals . . . . . . . . . . . 2487.2.7 Two-particle potentials . . . . . . . . . . . . . . . . . . 2517.2.8 Cluster separability in the Fock space . . . . . . . . . . 255

7.3 A toy model theory . . . . . . . . . . . . . . . . . . . . . . . . 2587.3.1 Fock space and Hamiltonian . . . . . . . . . . . . . . . 2587.3.2 Drawing a diagram in the toy model . . . . . . . . . . 2607.3.3 Reading a diagram in the toy model . . . . . . . . . . 2647.3.4 Electron-electron scattering . . . . . . . . . . . . . . . 265

7.4 Diagrams in general theory . . . . . . . . . . . . . . . . . . . . 2677.4.1 Properties of products and commutators . . . . . . . . 2677.4.2 Cluster separability of the S -operator . . . . . . . . . . 2717.4.3 Divergence of loop integrals . . . . . . . . . . . . . . . 275

7.5 Particle decays . . . . . . . . . . . . . . . . . . . . . . . . . . 2787.5.1 Quantum mechanics of particle decays . . . . . . . . . 2787.5.2 Non-interacting representation of the Poincare group . 2827.5.3 Normalized eigenvectors of momentum . . . . . . . . . 2837.5.4 Interacting representation of the Poincare group . . . . 2847.5.5 The non-decay law . . . . . . . . . . . . . . . . . . . . 2887.5.6 Solution of the eigenvalue problem . . . . . . . . . . . 2907.5.7 The Breit-Wigner formula . . . . . . . . . . . . . . . . 294

8 QUANTUM ELECTRODYNAMICS 3038.1 Interaction in QED . . . . . . . . . . . . . . . . . . . . . . . . 304

8.1.1 Construction of simple quantum field theories . . . . . 305

8.1.2 Current density . . . . . . . . . . . . . . . . . . . . . . 3078.1.3 Interaction operators in QED . . . . . . . . . . . . . . 3098.1.4 Poincare invariance of QED . . . . . . . . . . . . . . . 311

8.2 S -operator in QED . . . . . . . . . . . . . . . . . . . . . . . . 3168.2.1 S -operator in the second order . . . . . . . . . . . . . 316

Page 12: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 12/687

xii CONTENTS

8.2.2 S -operator in the non-relativistic approximation . . . 322

8.3 Renormalization . . . . . . . . . . . . . . . . . . . . . . . . . . 3258.3.1 Regularization . . . . . . . . . . . . . . . . . . . . . . . 3268.3.2 The mass renormalization condition . . . . . . . . . . . 3268.3.3 Counterterms . . . . . . . . . . . . . . . . . . . . . . . 3298.3.4 Electron-electron scattering . . . . . . . . . . . . . . . 3348.3.5 Charge renormalization . . . . . . . . . . . . . . . . . . 3418.3.6 Renormalization in QED . . . . . . . . . . . . . . . . . 344

II A RELATIVISTIC QUANTUM THEORY OFPARTICLES 347

9 THE DRESSED PARTICLE APPROACH 3519.1 Troubles with renormalized QED . . . . . . . . . . . . . . . . 352

9.1.1 Renormalization in QED revisited . . . . . . . . . . . . 3529.1.2 Time evolution in QED . . . . . . . . . . . . . . . . . 3549.1.3 Unphys and renorm operators in QED . . . . . . . . . 356

9.2 Dressing transformation . . . . . . . . . . . . . . . . . . . . . 3579.2.1 Physical particles . . . . . . . . . . . . . . . . . . . . . 3579.2.2 The main idea of the dressed particle approach . . . . 3609.2.3 The unitary dressing transformation . . . . . . . . . . 3619.2.4 Dressing in the first perturbation order . . . . . . . . . 3649.2.5 Dressing in the second perturbation order . . . . . . . 3659.2.6 Dressing in an arbitrary order . . . . . . . . . . . . . . 3679.2.7 The infinite momentum cutoff limit . . . . . . . . . . . 3689.2.8 Poincare invariance of the dressed particle approach . . 3699.2.9 General properties of physical particle interactions . . . 3709.2.10 Comparison with other dressed particle approaches . . 373

9.3 The Coulomb potential and beyond . . . . . . . . . . . . . . . 3759.3.1 Electron-proton interaction in the 2nd order . . . . . . 3759.3.2 Electron-proton potential in the momentum space . . . 3779.3.3 The Breit potential . . . . . . . . . . . . . . . . . . . . 379

9.3.4 The hydrogen atom . . . . . . . . . . . . . . . . . . . . 382

10 INTERACTIONS AND RELATIVITY 38710.1 Localized events in relativistic quantum theory . . . . . . . . . 388

10.1.1 Localized particles . . . . . . . . . . . . . . . . . . . . 389

Page 13: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 13/687

CONTENTS xiii

10.1.2 Localized states in the moving reference frame . . . . . 390

10.1.3 Spreading of well-localized states . . . . . . . . . . . . 39210.2 Inertial transformations in multiparticle systems . . . . . . . . 39410.2.1 Non-interacting particles . . . . . . . . . . . . . . . . . 39510.2.2 Lorentz transformations for non-interacting particles . 39710.2.3 Interacting particles . . . . . . . . . . . . . . . . . . . 39810.2.4 Time translations in interacting systems . . . . . . . . 39910.2.5 Boost transformations in interacting systems . . . . . . 40110.2.6 Spatial translations and rotations . . . . . . . . . . . . 40310.2.7 Physical inequivalence of forms of dynamics . . . . . . 40510.2.8 The ”no interaction” theorem . . . . . . . . . . . . . . 406

10.3 Comparison with special relativity . . . . . . . . . . . . . . . . 412

10.3.1 On “derivations” of Lorentz transformations . . . . . . 41210.3.2 On experimental tests of special relativity . . . . . . . 41410.3.3 Poincare invariance vs. manifest covariance . . . . . . . 41610.3.4 Is geometry 4-dimensional? . . . . . . . . . . . . . . . 418

10.4 Quantum theory of gravity . . . . . . . . . . . . . . . . . . . . 42110.4.1 The two-body Hamiltonian . . . . . . . . . . . . . . . . 42210.4.2 Relativistic invariance . . . . . . . . . . . . . . . . . . 42410.4.3 Photons . . . . . . . . . . . . . . . . . . . . . . . . . . 42810.4.4 Universality of the free fall . . . . . . . . . . . . . . . . 42910.4.5 Composition invariance of interactions . . . . . . . . . 43110.4.6 n-body gravitational potentials . . . . . . . . . . . . . 43510.4.7 Red shift and time dilation . . . . . . . . . . . . . . . . 43610.4.8 Principle of equivalence . . . . . . . . . . . . . . . . . . 439

10.5 Particle decays and relativity . . . . . . . . . . . . . . . . . . 44010.5.1 General formula for the non-decay law . . . . . . . . . 44110.5.2 Decays of states with definite momentum . . . . . . . . 44310.5.3 Non-decay law in the moving reference frame . . . . . . 44510.5.4 Decays of states with definite velocity . . . . . . . . . . 44610.5.5 Numerical results . . . . . . . . . . . . . . . . . . . . . 44710.5.6 Decays caused by boosts . . . . . . . . . . . . . . . . . 45010.5.7 Particle decays in different forms of dynamics . . . . . 452

11 PARTICLES VS. FIELDS 45511.1 Fields, particles, and action-at-a-distance . . . . . . . . . . . . 457

11.1.1 Maxwell’s theory . . . . . . . . . . . . . . . . . . . . . 45711.1.2 Is Maxwell’s theory exact? . . . . . . . . . . . . . . . . 459

Page 14: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 14/687

xiv CONTENTS

11.1.3 Retarded interactions in Maxwell’s theory . . . . . . . 461

11.1.4 The Kislev-Vaidman “paradox” . . . . . . . . . . . . . 46311.1.5 Interactions of particles in RQD . . . . . . . . . . . . . 466

11.1.6 Interaction between charges and magnetic dipoles . . . 467

11.1.7 The Aharonov-Bohm effect . . . . . . . . . . . . . . . . 472

11.1.8 Does action-at-a-distance violate causality? . . . . . . . 476

11.1.9 Superluminal propagation of evanescent waves . . . . . 478

11.2 Are quantum fields necessary? . . . . . . . . . . . . . . . . . 481

11.2.1 Dressing transformation in a nutshell . . . . . . . . . . 481

11.2.2 What is the reason for having quantum fields? . . . . 484

11.2.3 Haag’s theorem . . . . . . . . . . . . . . . . . . . . . . 487

11.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 489

III MATHEMATICAL APPENDICES 491

A Sets, groups, and vector spaces 493

A.1 Sets and mappings . . . . . . . . . . . . . . . . . . . . . . . . 493

A.2 Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 493

A.3 Vector spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 495

B The delta function and useful integrals 499

C Some theorems for orthocomplemented lattices. 503

D The rotation group 505

D.1 Basics of the 3D space . . . . . . . . . . . . . . . . . . . . . . 505

D.2 Scalars and vectors . . . . . . . . . . . . . . . . . . . . . . . . 507

D.3 Orthogonal matrices . . . . . . . . . . . . . . . . . . . . . . . 508

D.4 Invariant tensors . . . . . . . . . . . . . . . . . . . . . . . . . 511

D.5 Vector parameterization of rotations . . . . . . . . . . . . . . 513

D.6 Group properties of rotations . . . . . . . . . . . . . . . . . . 516

D.7 Generators of rotations . . . . . . . . . . . . . . . . . . . . . . 518

E Lie groups and Lie algebras 521

E.1 Lie groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 521

E.2 Lie algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . 525

Page 15: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 15/687

CONTENTS xv

F The Hilbert space 529

F.1 Inner product . . . . . . . . . . . . . . . . . . . . . . . . . . . 529F.2 Orthonormal bases . . . . . . . . . . . . . . . . . . . . . . . . 530F.3 Bra and ket vectors . . . . . . . . . . . . . . . . . . . . . . . . 531F.4 The tensor product of Hilbert spaces . . . . . . . . . . . . . . 533F.5 Linear operators . . . . . . . . . . . . . . . . . . . . . . . . . . 534F.6 Matrices and operators . . . . . . . . . . . . . . . . . . . . . . 535F.7 Functions of operators . . . . . . . . . . . . . . . . . . . . . . 539F.8 Linear operators in different orthonormal bases . . . . . . . . 543F.9 Diagonalization of Hermitian and unitary matrices . . . . . . . 547

G Subspaces and projection operators 551

G.1 Pro jections . . . . . . . . . . . . . . . . . . . . . . . . . . . . 551G.2 Commuting operators . . . . . . . . . . . . . . . . . . . . . . . 553

H Representations of groups 561H.1 Unitary representations of groups . . . . . . . . . . . . . . . . 561H.2 The Heisenberg Lie algebra . . . . . . . . . . . . . . . . . . . 563H.3 Unitary irreducible representations of the rotation group . . . 563H.4 Pauli matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . 564

I Lorentz group and its representations 567I.1 4-vector representation of the Lorentz group . . . . . . . . . . 567I.2 Spinor representation of the Lorentz group . . . . . . . . . . . 573

J Special relativity 577J.1 Lorentz transformations for time and position . . . . . . . . . 577J.2 The ban on superluminal signaling . . . . . . . . . . . . . . . 578J.3 Minkowski space-time and manifest covariance . . . . . . . . . 580J.4 Decay of moving particles in special relativity . . . . . . . . . 582

K Quantum fields for fermions 585K.1 Construction of the fermion field . . . . . . . . . . . . . . . . 585

K.2 Properties of factors u and v . . . . . . . . . . . . . . . . . . . 587K.3 Explicit formulas for u and v . . . . . . . . . . . . . . . . . . . 591K.4 Convenient notation . . . . . . . . . . . . . . . . . . . . . . . 592K.5 Transformation laws . . . . . . . . . . . . . . . . . . . . . . . 594K.6 Anticommutation relations . . . . . . . . . . . . . . . . . . . . 595

Page 16: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 16/687

xvi CONTENTS

K.7 The Dirac equation . . . . . . . . . . . . . . . . . . . . . . . . 598

K.8 Electron propagator . . . . . . . . . . . . . . . . . . . . . . . . 599L Quantum fields for photons 601

L.1 Construction of the photon quantum field . . . . . . . . . . . 601L.2 Photon propagator . . . . . . . . . . . . . . . . . . . . . . . . 602L.3 Equal time commutator of the photon fields . . . . . . . . . . 605L.4 Poincare transformations of the photon field . . . . . . . . . . 606

M QED interaction in terms of particle operators 611M.1 Current density . . . . . . . . . . . . . . . . . . . . . . . . . 611M.2 First-order interaction . . . . . . . . . . . . . . . . . . . . . 613

M.3 Second-order interaction . . . . . . . . . . . . . . . . . . . . 614

Page 17: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 17/687

PREFACE

Looking back at theoretical physics of the 20th century, we see two mon-umental achievements that changed forever the way we understand space,

time, and matter – the special theory of relativity and quantum mechanics.These theories covered two sides of the natural world that are not normallyaccessible to human’s senses and experience. Special relativity was appliedto observers and objects moving with extremely high speeds (and high ener-gies). Quantum mechanics was essential to describe the properties of matterat microscopic scale: nuclei, atoms, molecules, etc. The challenge remainsin the unification of these two theories, i.e., in the theoretical description of high-energy microscopic particles and their interactions.

It is commonly accepted that the most promising candidate for suchan unification is the local quantum field theory (QFT). Indeed, this theoryachieved astonishing accuracy in calculations of certain physical observables,

e.g., scattering cross-sections. In some instances, the discrepancies betweenpredictions of quantum electrodynamics (QED) and experiment are less than0.000000001%. It is difficult to find such an accuracy anywhere else in science!However, in spite of its success, quantum field theory cannot be regarded asthe ultimate unification of relativity and quantum mechanics. Just too manyfundamental questions remain unanswered and too many serious problemsare left unsolved.

It is fair to say that everyone trying to learn QFT is struck by enormouscomplexity of this formalism and its detachment from physically intuitiveideas. A successful physical theory is expected to operate, as much as possi-

ble, with objects having clear interpretation in terms of observed quantities.This is often not the case in QFT, where such physically transparent con-cepts of quantum mechanics as the Hilbert space, wave functions, particleobservables, Hamiltonian, and the time evolution, were substituted (thoughnot completely discarded) by more formal and obscure notions of quantum

xvii

Page 18: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 18/687

xviii PREFACE

fields, ghosts, propagators, and Lagrangians. It was even declared that the

concept of a particle is not fundamental anymore and must be abandoned infavor of the field description of nature:

In its mature form, the idea of quantum field theory is that quan-tum fields are the basic ingredients of the universe, and particles are just bundles of energy and momentum of the fields. S. Wein-berg [1]

The most notorious failure of QFT is the problem of ultraviolet diver-gences: To obtain sensible results from QFT calculations one must drop cer-tain infinite terms. Although rules for doing such tricks are well-established,

they cannot be considered a part of a mathematically sound theory. As Diracremarked

This is just not sensible mathematics. Sensible mathematics in-volves neglecting a quantity when it turns out to be small – not neglecting it because it is infinitely large and you do not want it! P. A. M. Dirac

In modern QFT the problem of ultraviolet infinities is not solved, it is “sweptunder the rug”. The prevailing opinion is that ultraviolet divergences arerelated to our lack of understanding of physics at short distances. It isoften argued that QFT is a low energy approximation to some yet unknowntruly fundamental theory, and that in this final theory the small distanceor high energy (ultraviolet) behavior will be tamed somehow. There arevarious guesses about what this ultimate theory may be. Some think thatfuture theory will reveal a non-trivial, probably discrete, or non-commutativestructure of space at distances comparable to the “Planck scale” of 10−33 cm.Others hope that paradoxes will go away if we substitute point-like particleswith tiny extended objects, like strings or membranes.

Many researchers agree that the most fundamental obstacle on the wayforward is the deep contradiction between quantum theory and Einstein’s

relativity theory (both special and general). In a more general sense, thebasic question is “what is space and time?” The answers given by Einstein’sspecial relativity and by quantum mechanics are quite different. In specialrelativity position and time are treated on an equal footing, both of thembeing coordinates in the 4-dimensional Minkowski space-time. However in

Page 19: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 19/687

xix

quantum mechanics position and time play very different roles. Position is

an observable described (as any other physical observable) by an Hermitianoperator, whereas time is a numerical parameter, which cannot be cast intothe operator form without contradictions.

Traditionally, throughout centuries, theoretical physics was a mixture of deep physical insights, “what if” speculations, and lucky heuristic guesses.For example, Maxwell based his equations of electrodynamics on both carefulfitting of available experiments and on arcane ideas about inner workings of the “aether”. Dirac “derived” his famous equation by an ingenious trick of taking a “square root” of the relativistic energy-momentum-mass relation-ship E 2 = p2c2 + m2c4. In spite of their questionable origins, both Maxwelland Dirac equations lie in the foundation of empirically successful modern

quantum field theories of particles and their interactions. It would be fool-ish to think that further progress in physics can be achieved by waiting forsimilar “revelations”. What we need is a rigorous axiomatic approach thatwould establish a basic framework for description of space, time, matter, andinteractions. For example, it is clear that such branches of physics as Newto-nian mechanics, quantum mechanics, Maxwell’s electrodynamics, quantumfield theory, and Einstein’s theory of gravity use very different basic math-ematical structures. They are, respectively, the phase space, the Hilbertspace, classical fields, quantum fields, and the curved space-time. These dif-ferent theories try to describe basically the same set of physical phenomena– particles, their interactions and dynamics. So, it is reasonable to expecta common theme in their mathematical formulations. We are still far fromputting entire physics in a rigorous axiom/theorem setup, as demanded bythe Hilbert’s 6th problem. However, there are at least two such commonthemes, which are the focus of this book.

The two basic postulates of our approach is the principle of relativity(the equivalence of inertial frames of reference) and the laws of quantum me-chanics. From mathematical perspective, the former postulate is embodiedin the idea of the Poincare group, and the latter postulate leads to the alge-bra of operators in the Hilbert space. When combined, these two postulatesinevitably imply the idea of unitary representations of the Poincare group in

the Hilbert space. This is the central idea of our book. Our goal here is todemonstrate that most of known physics fits nicely into this mathematicalframework.

Although the ideas that will be presented here have rather general na-ture, most calculations will be performed only for systems of charged particles

Page 20: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 20/687

xx PREFACE

and photons and electromagnetic forces acting between them.2 Traditionally,

these systems were described by quantum electrodynamics. However, ourapproach will lead us to a different version of QED called relativistic quan-tum dynamics or RQD. Our theory being a logical continuation of QED,is exactly equivalent to QED as far as properties related to the S -matrix(scattering cross-sections, lifetimes, energies of bound states, etc.) are con-cerned. However, it differs from the traditional approach in two importantaspects: recognition of the dynamical character of boosts and the primary role of particles rather than fields.

The dynamical character of boosts. The second postulate of Ein-stein’s special relativity (the invariance of the speed of light) applies only

to a limited class of phenomena associated with light particles – photons.Therefore special relativistic formulas for Lorentz transformations can beproved only for specific events such as intersections of light beams or tra-

jectories of non-interacting particles. Special relativity tacitly assumes thatthese Lorentz formulas can be extended to general events with interactingparticles. We will show that this assumption is actually wrong. We will de-rive boost transformations of particle observables by using Wigner’s theoryof unitary representations of the Poincare group [2] and Dirac’s approachto relativistic interactions [3]. It will then follow that these transformationsshould be interaction-dependent. Usual Lorentz transformations of specialrelativity are thus approximations that neglect the presence of interactions.

The Einstein-Minkowski 4-dimensional spacetime is an approximate conceptas well.

Particles rather than fields. Presently accepted quantum field the-ories (e.g., renormalized QED) cannot describe the time evolution even of simplest systems, such as vacuum and single-particle states. Direct applica-tion of the QED time evolution operator to these states leads to spontaneouscreation of extra particles, which has not been observed in experiments. Theproblem is that bare particles of QED have rather remote relationship tophysically observed electrons, positrons, etc., and the rules connecting bareand physical particles are not well established. We solve this problem byusing the “dressed particle” approach, which is the cornerstone of RQD. The“dressed” Hamiltonian of RQD is obtained by applying an unitary dress-ing transformation to the QED Hamiltonian. This transformation does not

2In section 10.4 we will discuss the gravitational interaction as well.

Page 21: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 21/687

xxi

change the S -operator, therefore the perfect agreement with experimental

data is preserved. The RQD Hamiltonian describes electromagnetic phe-nomena in systems of interacting physical particles (electrons, photons, etc.)without reference to spurious bare particles. In addition to accurate scat-tering amplitudes, our approach allows us to obtain the time evolution of interacting particles and offers a rigorous way to find both energies and wavefunctions of bound states. All calculations with the RQD Hamiltonian canbe done by using standard recipes of quantum mechanics without encoun-tering embarrassing divergences and without the need for artificial cutoffs,regularization, renormalization, etc. The main ingredients of this formalismare physical particles, while quantum fields play only an auxiliary role.

This book is divided into three parts. Part I: QUANTUM ELEC-TRODYNAMICS consists of eight chapters 1 - 8 where we tried to stickto traditionally accepted views on relativistic quantum theories and, in par-ticular, on QED. In chapter 1 (The Poincare group) we introduce thePoincare group as a set of transformations that relate different (but equiv-alent) inertial reference frames. In chapter 2 (Quantum mechanics) thebasic laws of quantum mechanics are derived from simple axioms of measure-ments (quantum logic). Chapter 3 unifies these two pieces and establishesunitary representations of the Poincare group in the Hilbert space of states asthe most general mathematical description of any isolated physical system.In the next two chapters 4 and 5 we explore some immediate consequencesof this formalism. In chapter 4 (Operators of observables) we find thecorrespondence between known physical observables (such as mass, energy,momentum, spin, position, etc.) and concrete Hermitian operators in theHilbert space. In chapter 5 (Single particles) we show how Wigner’s the-ory of irreducible representations of the Poincare group provides a completedescription of basic properties and dynamics of isolated stable elementaryparticles. The next important step is to consider multi-particle systems. Inchapter 6 (Interaction) we discuss major properties of relativistically invari-ant interactions in such systems. In chapter 7 (The Fock space) we extendthis discussion to the general class of systems in which particles can be cre-

ated and annihilated. Chapter 8 (Quantum electrodynamics) concludesthis first “traditional” part of the book. In that chapter we apply all theabove ideas to the description of systems of charged particles and photonsin the formalism of QED. A particular emphasis is made on the problem of renormalization in QED.

Page 22: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 22/687

xxii PREFACE

Part II of the book A RELATIVISTIC QUANTUM THEORY OF

PARTICLES (chapters 9 - 11) examines the new RQD approach, its connec-tion to the basic QED from part I, and its advantages. This “non-traditional”part begins with chapter 9 (The dressed particle approach) and a deeperanalysis of renormalization and the bare particle picture in quantum fieldtheories. The dressed particle approach is formulated there, and QED istotally rewritten in terms of dressed particles, rather than quantum fields.In chapter 10 (Interactions and relativity) we discuss real and imaginaryparadoxes usually associated with quantum relativistic description in termsof particles. In particular, deviations from predictions of special relativityare discussed and a novel approach to quantum gravity is suggested. Inchapter 11 (Particles vs. fields) we argue that fields (either classical or

quantum) are not suitable for describing electromagnetic systems. We an-alyze several contradictions in Maxwell’s field electrodynamics and presentan alternative formulation of electromagnetic theory in terms of physical (ordressed) particles and instantaneous potentials acting between them. Someuseful mathematical facts and more technical derivations are collected in thePart III: MATHEMATICAL APPENDICES.

Remarkably, the development of RQD did not require introduction of radically new physical ideas. Actually, all key ingredients of this study wereformulated a long time ago, but for some reason they have not attractedthe attention they deserve. For example, the fact that either translations orrotations or boosts must have dynamical dependence on interactions was firstestablished in Dirac’s work [3]. These ideas were further developed in “directinteraction” theories by Bakamjian and Thomas [4], Foldy [5], Sokolov [6, 7],Coester and Polyzou [8], and many others. The primary role of particles informulation of quantum field theories was emphasized in an excellent book byWeinberg [9]. The “dressed particle” approach was advocated by Greenbergand Schweber [10]. First indications that this approach can solve the problemof ultraviolet divergences in QFT are contained in papers by Ruijgrok [11]and Visinesku and Shirokov [12]. The “similarity renormalization” idea of Glazek and Wilson [13, 14] is also an important tool for taming divergences

in QED. The formulation of RQD presented in this book just combined allthese good ideas into one comprehensive approach, which, we believe, is astep toward consistent unification of quantum mechanics and the principleof relativity.

The new material contained in this book was partially covered in seven

Page 23: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 23/687

xxiii

articles [15, 16, 17, 18, 19, 20, 21].

I would like to express my gratitude to Peter Enders, Theo Ruijgrok andBoris Zapol for reading this book and making valuable critical comments.I also would like to thank William Klink, Vladimir Korda, Wayne Polyzou,Alexander Shebeko, and Mikhail Shirokov for enlightening conversations aswell as Bilge, Bernard Chaverondier, Wolfgang Engelhardt, Bill Hobba, IgorKhavkine, Mike Mowbray, Arnold Neumaier, and Dan Solomon for onlinediscussions and fresh ideas that allowed me to improve the quality of thismanuscript over the years. This does not imply any direct or indirect en-dorsements of my work by these distinguished researchers. All errors andmisconceptions contained in this book are mine and only mine.

Page 24: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 24/687

xxiv PREFACE

Page 25: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 25/687

INTRODUCTION

It is wrong to think that the task of physics is to find out how nature is. Physics concerns what we can say about nature...

Niels Bohr

In this Introduction, we will try to specify more exactly what is thepurpose of theoretical physics and what are the fundamental notions, con-cepts, and their relationships studied by this branch of science. Some of thedefinitions and statements made here may look self-evident or even trivial.Nevertheless, it seems important to spell out these definitions explicitly, inorder to avoid misunderstandings in later parts of the book.

We obtain all information about physical world from measurements , andthe most fundamental goal of theoretical physics is to describe and predictthe results of these measurements. Generally speaking, the act of measure-ment requires the presence of at least three objects (see Fig. 1): a preparation device , a physical system , and a measuring apparatus . The preparation de-vice P prepares the physical system S in a certain state . The state of thesystem has some attributes or properties. If an attribute or property can beassigned a numerical value it will be called observable F . The observables aremeasured by bringing the system into contact with the measuring apparatus.The result of the measurement is a numerical value of the observable, whichis a real number f . We assume that every measurement of F gives some value f , so that there is no misfiring of the measuring apparatus.

This was just a brief list of relevant notions. Let us now look at all these

ingredients in more detail.

Physical system. Loosely speaking, physical system is any object thatcan trigger a response (measurement) in the measuring apparatus. As phys-ical system is the most basic concept in physics, it is difficult to give a more

xxv

Page 26: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 26/687

xxvi INTRODUCTION

preparat ion

dev ice

measur ing

appar at usphysical

system

preparation measurement

value of observable F

state

Figure 1: Schematic representation of the preparation/measurement process.

precise definition. An intuitive understanding will be sufficient for our pur-poses. For example, an electron, a hydrogen atom, a book, a planet are allexamples of physical systems.

Physical systems can be either elementary (also called particles ) or com-pound , i.e., consisting of more than one particle.

In this book we will limit our discussion to isolated systems , which do notinteract with the rest of the world or with any external potential. By doingthis we exclude some interesting physical systems and effects, like atomsin external electric and magnetic fields. However, this does not limit thegenerality of our treatment. Indeed, one can always combine the atom andthe field-creating device into one unified system that can be studied withinthe “isolated system” approach.

States. Any physical system may exist in a variety of different states:a book can be on your desk or in the library; it can be open or closed; itcan be at rest or fly with a high speed. The distinction between differentsystems and different states of the same system is sometimes far from ob-vious. For example, a separated pair of particles (electron + proton) does

not look like the hydrogen atom. So, one may conclude that these are twodifferent systems. However, in reality these are two different states of thesame compound system.

Observables. Theoretical physics is inclined to study simplest physical

Page 27: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 27/687

xxvii

systems and their most fundamental observable properties (mass, velocity,

spin, etc.). Clearly the measured values of observables must depend on thekind of the system being measured and on the state of this system. Weemphasize this fact because there are numerical quantities in physics whichare not associated with any physical system, and therefore they are not calledobservables. For example, the number of space dimensions (3) is not anobservable, because we do not regard the space as an example of a physicalsystem. Time is also not an observable. We cannot say that time is aproperty of a physical system. It is rather an attribute of the measuringapparatus, or, more exactly, an attribute of the clock associated with themeasuring apparatus or observer. The clock assigns a certain time label toeach measurement and this label does not depend on the state of the observed

system. One can “measure” time even in the absence of any physical system.We will assume exact measurability of any observable. Of course, this

claim is an idealization. Clearly, there are precision limits for all real mea-suring apparatuses. However, we will suppose that with certain efforts onecan always make more and more precise measurements, so the precision is,in principle, unlimited. As this statement is important for future discussionsin chapter 2, we formulate it as a postulate3

Postulate 0.1 (unlimited precision of measurements) Each observable can be measured with any prescribed precision.

Some observables can take a value anywhere on the real axis R. The Carte-sian components of position Rx, Ry, and Rz are good examples of such ob-servables. However there are also observables for which this is not true andthe allowed values form only a subset of the real axis. Such a subset is calledthe spectrum of the observable. For example, it is known (see Chapter 5)that each component of particle’s velocity cannot exceed the speed of lightc, so the spectrum of the velocity components V x, V y, and V z is [−c, c]. Bothposition and velocity have continuous spectra. However, there are many ob-servables having a discrete spectrum. For example, the number of particlesin the system (which is also a valid observable) can only take integer values 0,

3In this book we will distinguish Postulates , Statements , and Assertions . Postulatesform a foundation of our theory. In most cases they follow undoubtedly from experiments.Statements follow logically from Postulates and we believe them to be true. Assertions arecommonly accepted in the literature, but they do not have a place in the RQD approachdeveloped in this book.

Page 28: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 28/687

xxviii INTRODUCTION

1, 2, ... Later we will also meet observables whose spectrum is a combination

of discrete and continuous parts, e.g., the energy spectrum of the hydrogenatom.

Observers. We will call observer O a collection of measuring apparatuseswhich are designed to measure all possible observables. Laboratory (P, O) isa full experimental setup, i.e., a preparation device P plus observer O withall his measuring devices. Generally, preparation and measuring devices canbe rather sophisticated.4 It would be hopeless to include in our theoreticalframework a detailed description of their design, and how they interact withthe physical system. Instead, we will use an idealized representation of boththe preparation and measurement acts. In particular, we will assume that

the measuring apparatus is a black box whose job is to produce just one realnumber - the value of some observable - upon interaction with the physicalsystem.

The minimal set of measuring devices associated with an observer includea yardstick for measuring distances, a clock for registering time, a fixed pointof origin and three mutually perpendicular axes erected from this point. Inaddition to measuring properties of physical systems, our observers can alsosee their fellow observers. With the measuring kit described above each ob-server O can characterize another observer O′ by ten parameters φ, v, r, t:the time difference t between the clocks of O and O′, the position vector rthat connects the origin of O with the origin of O

′, the rotation angle5 φ that

relates the orientation of axes in O′ to the orientation of axes in O, and thevelocity v of O′ relative to O.

In this book we consider only inertial observers (= inertial frames of ref-erence) or inertial laboratories. These are observers that move uniformlywithout acceleration and rotation, i.e., observers whose velocity and orienta-tion of axes does not change with time. The importance of choosing inertialobservers will become clear in subsection 1.1.1 where we will see that mea-surements performed by these observers must obey the principle of relativity.

It is convenient to introduce the notion of inertial transformations of observers and laboratories. The transformations of this kind include

• rotations,

4e.g., accelerators, bubble chambers, etc.5The vector parameterization of rotations is discussed in Appendix D.5.

Page 29: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 29/687

xxix

• space translations,

• time translations,

• changes of velocity or boosts .

There are three independent rotations (around x, y, and z axes), three inde-pendent translations, and three independent boosts. So, totally, there are 10distinct basic types of inertial transformations. More general inertial trans-formations can be made by performing two (or more) basic transformationsin succession. We will postulate that for any pair of inertial observers Oand O′ one can always find an inertial transformation g, such that O′ = gO.

Conversely, the application of any inertial transformation g to any inertialobserver O leads to a different inertial observer O′ = gO.One of the most important tasks of physics is to establish the relation-

ship between measurements performed by two different observers on the samephysical system. These relationships will be referred to as inertial transfor-mations of observables . In particular, if values of observables measured byO are known and the inertial transformation connecting O with O′ is knownas well, then we should be able to say exactly what are the values of (thesame) observables measured by O′. Probably the most important and chal-lenging task of this kind is the description of dynamics or time evolution . Inthis case, observers O′ and O are connected by a time translation. Another

important task is to compare measurements performed by observers movingwith respect to each other.6

An important comment should be made about the definition of “observer”used in this book. It is common to understand observer as a person (or ameasuring apparatus) that exists and performs measurements for (infinitely)long time. For example, it is not unusual to find discussions of the time evo-lution of a physical system “from the point of view” of this or that observer.However, this colloquial definition does not fit our purposes. The problemis that with this definition the time translations obtain a status that is dif-ferent from other inertial transformations: space translations, rotations, andboosts. The time translations become associated with the observer herself.The central idea of our approach to relativity is the treatment of all ten typesof inertial transformations on equal footing. Therefore, we will use a slightlydifferent definition of observer. In our definition observers are “short-lived”.

6e.g., Lorentz transformations

Page 30: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 30/687

xxx INTRODUCTION

They exist and perform measurements in a short time interval and they can

see only a snapshot of the world around them. Individual observers cannot“see” the time evolution of a physical system. In our approach the timeevolution is described as a succession of measurements performed by a seriesof instantaneous observers related to each other by time translations. Thenthe colloquial “observer” is actually a series of our “short-lived” observersdisplaced in time.

The goals of physics. The discussion above can be summarized byindicating five essential goals of theoretical physics:

• provide a classification of physical systems;

• for each physical system give a list of observables and their spectra;

• for each physical system give a list of possible states.

• for each state of the system describe the results of measurements of relevant observables.

• given a description of the state of the system by one observer find outhow other observers see the same state.

Page 31: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 31/687

Part I

QUANTUMELECTRODYNAMICS

1

Page 32: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 32/687

Page 33: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 33/687

Chapter 1

THE POINCARE GROUP

There are more things in Heaven and on earth, dear Horacio,than are dreamed of in your philosophy.

Hamlet

In this chapter we will analyze group properties of inertial transformationsof observers. In this introductory discussion, nothing will be said aboutphysical systems, observables, and their transformations to different inertialreference frames. A complete relativistic physical theory will be approachedonly in chapter 10.

1.1 Inertial observers

1.1.1 The principle of relativity

In this book we will consider only inertial laboratories. What is so specialabout them? The answer is that one can apply the powerful principle of rel-ativity to such laboratories. The essence of this principle was best explainedby Galileo more than 370 years ago [22]:

Shut yourself up with some friend in the main cabin below

decks on some large ship, and have with you there some flies,butterflies, and other small flying animals. Have a large bowl of water with some fish in it; hang up a bottle that empties dropby drop into a wide vessel beneath it. With the ship standing still, observe carefully how the little animals fly with equal speed

3

Page 34: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 34/687

4 CHAPTER 1. THE POINCAR E GROUP

to all sides of the cabin. The fish swim indifferently in all di-

rections; the drops fall into the vessel beneath; and, in throwing something to your friend, you need to throw it no more strongly in one direction than another, the distances being equal; jumping with your feet together, you pass equal spaces in every direction.When you have observed all of these things carefully (though there is no doubt that when the ship is standing still everything must happen this way), have the ship proceed with any speed you like,so long as the motion is uniform and not fluctuating this way and that. You will discover not the least change in all the effects named, nor could you tell from any of them whether the ship was moving or standing still. In jumping, you will pass on the floor

the same spaces as before, nor will you make larger jumps toward the stern than towards the prow even though the ship is moving quite rapidly, despite the fact that during the time that you are in the air the floor under you will be going in a direction opposite toyour jump. In throwing something to your companion, you will need no more force to get it to him whether he is in the direc-tion of the bow or the stern, with yourself situated opposite. The droplets will fall as before into the vessel beneath without drop-ping towards the stern, although while the drops are in the air the ship runs many spans. The fish in the water will swim towards the front of their bowl with no more effort than toward the back,and will go with equal ease to bait placed anywhere around the edges of the bowl. Finally the butterflies and flies will continue their flights indifferently toward every side, nor will it ever hap-pen that they are concentrated toward the stern, as if tired out

from keeping up with the course of the ship, from which they will have been separated during long intervals by keeping themselves in the air.

These observations can be translated into the statement that all inertial lab-oratories cannot be distinguished from the laboratory at rest by performing

experiments confined to those laboratories. Any experiment performed inone laboratory, will yield exactly the same result as an identical experimentin any other laboratory. The results will be the same independent on howfar apart the laboratories are, and what are their relative orientations andvelocities. Moreover, we may repeat the same experiment at any time, to-

Page 35: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 35/687

1.1. INERTIAL OBSERVERS 5

morrow, or many years later, still results will be the same. This allows us to

formulate one of the most important and deep postulates in physics

Postulate 1.1 (the principle of relativity) In all inertial laboratories,the laws of nature are the same: they do not change with time, they do not depend on the position and orientation of the laboratory in space and on its velocity. The laws of physics are invariant against inertial transformations of laboratories.

1.1.2 Inertial transformations

Our next goal is to study inertial transformations between laboratories or

observers in more detail. To do this we do not need to consider physicalsystems at all. It is sufficient to think about a world inhabited only byobservers. The only thing these observers can do is to measure parameters φ; v; r; t of their fellow observers. It appears that even in this oversimplifiedworld we can learn quite a few useful things about properties of inertialobservers and their relationships to each other.

Let us first introduce a convenient labeling of inertial observers and in-ertial transformations. We choose an arbitrary observer O as our referenceobserver, then other examples of observers are

(i) an observer

0; 0; 0; t1

O displaced in time by the amount t1;

(ii) an observer 0; 0; r1; 0O shifted in space by the vector r1;

(iii) an observer 0; v1; 0; 0O moving with velocity v1;

(iv) an observer φ1; 0; 0; 0O rotated by the vector φ1.

Suppose now that we have three different inertial observers O, O′, and O′′.There is an inertial transformation φ1; v1; r1; t1 which connects O and O′

O′ = φ1; v1; r1; t1O (1.1)

where parameters φ1, v1, r1, and t1 are measured by the ruler and clockbelonging to the reference frame O with respect to its basis vectors. Similarly,there is an inertial transformation that connects O′ and O′′

Page 36: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 36/687

6 CHAPTER 1. THE POINCAR E GROUP

O′′ = φ2; v2; r2; t2O′ (1.2)

where parameters φ2, v2, r2, and t2 are defined with respect to the basisvectors, ruler, and clock of the observer O′. Finally, there is a transformationthat connects O and O′′

O′′ = φ3; v3; r3; t3O (1.3)

with all transformation parameters referring to O. We can represent thetransformation (1.3) as a composition or product of transformations (1.1)

and (1.2)

φ3; v3; r3; t3 = φ2; v2; r2; t2 φ1; v1; r1; t1 (1.4)

Apparently, this product has the property of associativity. There exists a triv-ial (identity ) transformation 0; 0; 0; 0 that leaves all observers unchanged,

and for each inertial transformation φ; v; r; t there is the inverse transfor-

mation φ; v; r; t−1 such that their product is the identity transformation

φ; v; r; t φ; v; r; t−1

= φ; v; r; t−1

φ; v; r; t = 0; 0; 0; 0 (1.5)

In other words, the set of inertial transformations forms a group (see Ap-pendix A.2). Moreover, since these transformations smoothly depend ontheir parameters, this is a Lie group (see Appendix E). The Lie group of inertial transformations of observers plays a central role in physics, as wewill see time and again throughout this book. The main goal of the presentchapter is to study the properties of this group in some detail. In particular,we will need expressions for the multiplication (1.4) and inversion (1.5) laws.

First we notice that a general inertial transformation φ; v; r; t can berepresented as a product of basic transformations (i) - (iv). As these basic

transformations generally do not commute, we need to agree on a canonicalorder in this product. For our purposes the following choice is convenient

φ; v; r; tO = φ; 0; 0; 0 0; v; 0; 0 0; 0; r; 00; 0; 0; tO (1.6)

Page 37: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 37/687

1.2. THE GALILEI GROUP 7

This means that in order to obtain observer O′ = φ; v; r; tO we first shift

observer O in time by the amount t, then shift the time-translated observer bythe vector r, then give it velocity v, and finally rotate the obtained observerby the angle φ.

1.2 The Galilei group

In this section we begin our study of the group of inertial transformationsby considering a non-relativistic world in which observers move with lowspeeds. This is a relatively easy task, because in these derivations we canuse our everyday experience and “common sense”. The relativistic group of

transformations will be approached in section 1.3 as a formal generalizationof the Galilei group derived here.

1.2.1 The multiplication law of the Galilei group

Let us first consider four examples of eq. (1.4) in which φ1; v1; r1; t1 is a

general inertial transformation and φ2; v2; r2; t2 is one of the basic trans-formations from the list (i) - (iv). Applying a time translation to a general

reference frame φ1; v1; r1; t1O will change its time label and change itsposition in space according to equation

0; 0; 0; t2 φ1; v1; r1; t1O = φ1; v1; r1 + v1t2; t1 + t2O (1.7)

Space translations affect the position

0; 0; r2; 0 φ1; v1; r1; t1O = φ1; v1; r1 + r2; t1O (1.8)

Boosts change the velocity

0; v2; 0; 0 φ1; v1; r1; t1O = φ1; v1 + v2; r1; t1O (1.9)

Rotations affect all vector parameters1

1For definition of rotation matrices R φ and function Φ see Appendix D.5.

Page 38: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 38/687

8 CHAPTER 1. THE POINCAR E GROUP

φ2; 0; 0; 0 φ1; v1; r1; t1O = Φ(R φ2R φ1); R φ2v1; R φ2r1; t1O (1.10)

Now we can calculate the product of two general inertial transformations in(1.4) by using (1.6) - (1.10)

φ2; v2; r2; t2 φ1; v1; r1; t1= φ2; 0; 0; 0 0; v2; 0; 0 0; 0; r2; 0 0; 0; 0; t2 φ1; v1; r1; t1= φ2; 0; 0; 0 0; v2; 0; 0 0; 0; r2; 0 φ1; v1; r1 + v1t2; t1 + t2=

φ2; 0; 0; 0

0; v2; 0; 0

φ1; v1; r1 + v1t2 + r2; t1 + t2

= φ2; 0; 0; 0 φ1; v1 + v2; r1 + v1t2 + r2; t1 + t2= Φ(R φ2

R φ1); R φ2

(v1 + v2); R φ2(r1 + v1t2 + r2); t1 + t2 (1.11)

By direct substitution to equation (1.5) it is easy to check that the inverse

of a general inertial transformation φ; v; r; t is

φ; v; r; t−1 = − φ; −v; −r + vt; −t (1.12)

Eqs. (1.11) and (1.12) are multiplication and inversion laws which fullydetermine the structure of the Lie group of inertial transformations in non-

relativistic physics. This group is called the Galilei group.

1.2.2 Lie algebra of the Galilei group

According to our discussion in Appendix E, we can obtain the basis (H, P , K, J )in the Lie algebra of generators of the Galilei group by taking derivatives withrespect to parameters of one-parameter subgroups. For example, the gener-ator of time translations is

H= lim

t→0

d

dt 0; 0; 0; t

For generators of space translations and boosts along the x-axis we obtain2

2Note that vector parameters are denoted either by vector symbol (e.g. 0 or 0) or bytheir components (e.g., (x,0,0)).

Page 39: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 39/687

1.2. THE GALILEI GROUP 9

P x = limx→0

ddx

0; x, 0, 0; 0; 0

Kx = limv→0

d

dv 0; 0; v, 0, 0; 0

The generator of rotations around the x-axis is

J x = limφ→0

d

dφφ, 0, 0; 0; 0; 0

Similar formulas are valid for y- and z -components. According to (E.1) we

can also express finite transformations as exponents of generators

0; 0; 0; t = eHt ≈ 1 + Ht (1.13)

0; 0; r; 0 = e P r ≈ 1 + P r (1.14)

0; v; 0; t = e Kv ≈ 1 + Kv (1.15)

φ; 0; 0; 0 = e J φ ≈ 1 + J φ

Then each group element can be represented in its canonical form (1.6) asthe following function of parameters

φ; v; r; t ≡ φ; 0; 0; 0 0; v; 0; 0 0; 0; r; 00; 0; 0; t= e

J φe Kve

P reHt (1.16)

Let us now find the commutation relations between generators, i.e., thestructure constants of the Galilei Lie algebra . Consider, for example, trans-lations in time and space. From eq. (1.11) we have

0; 0; 0; t 0; 0; x, 0, 0; 0 = 0; 0; x, 0, 0; 0 0; 0; 0; t

This implies

eHteP xx = eP xxeHt

1 = eP xxeHte−P xxe−Ht

Page 40: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 40/687

10 CHAPTER 1. THE POINCAR E GROUP

Using eqs. (1.13) and (1.14) for the exponents we can write to the first order

in x and to the first order in t

1 ≈ (1 + P xx)(1 + Ht)(1 − P xx)(1 − Ht)

≈ 1 + P xHxt − P xHxt − HP xxt + P xHxt

= 1 − HP xxt + P xHxt

hence

[P x, H] ≡ P xH−HP x = 0

So, generators of space and time translations commute. Similarly we obtaincommutators

[H, P i] = [P i, P j] = [Ki, K j] = 0

for any i, j = x,y,z (or i, j = 1, 2, 3). The composition of a time translationand a boost is more interesting since they do not commute. We calculatefrom eq. (1.11)

eKxveHte−Kxv = 0; v, 0, 0; 0, 0 0; 0; 0; t 0; −v, 0, 0; 0; 0= 0; v, 0, 0; 0; 0 0;−v, 0, 0; −vt, 0, 0; t= 0; 0, 0, 0; −vt, 0, 0; t= eHte−P xvt

Therefore, using eqs (1.13), (1.15), and (E.13) we obtain

Ht + [

Kx,

H]vt =

Ht

− P xvt

and

[Kx, H] = −P x

Page 41: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 41/687

1.2. THE GALILEI GROUP 11

Proceeding in a similar fashion for other pairs of transformations we obtain

the full set of commutation relations for the Lie algebra of the Galilei group.

[J i, P j] =3

k=1

ǫijkP k (1.17)

[J i, J j] =3

k=1

ǫijkJ k (1.18)

[J i, K j] =3

k=1

ǫijkKk (1.19)

[J i, H] = 0 (1.20)[P i, P j] = [P i, H] = 0 (1.21)

[Ki, K j] = 0 (1.22)

[Ki, P j] = 0 (1.23)

[Ki, H] = −P i (1.24)

From these commutators one can identify several important sub-algebras of the Galilei Lie algebra and, therefore, subgroups of the Galilei group. Inparticular, there is an Abelian subgroup of space and time translations (with

generators P and H, respectively), a subgroup of rotations (with generators

J ), and an Abelian subgroup of boosts (with generators K).

1.2.3 Transformations of generators under rotations

Consider two reference frames O and O′ connected to each other by the groupelement g:

O′ = gO

Suppose that observer O performs an inertial transformation with the groupelement h (e.g., h is a translation along the x-axis). We want to find atransformation h′ which is related to the observer O′ in the same way as his related to O (i.e., h′ is the translation along the x′-axis belonging to theobserver O′). As seen from the example in Fig. 1.1,3 the transformation

3in which g = exp J zφ is a rotation around the z-axis that is perpendicular to the page

Page 42: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 42/687

12 CHAPTER 1. THE POINCAR E GROUP

xx

yy

x’

y’

gg

gggg

hh

h’

AA

OO O’

11

Figure 1.1: Connection between similar transformations in different referenceframes.

h′ of the object A can be obtained by first going from O′ to O, performingtranslation h there, and then returning back to the reference frame O′

h′ = ghg−1

Similarly, if A is a generator of an inertial transformation in the referenceframe O, then

A′ = gAg−1 (1.25)

is “the same” generator in the reference frame O′ = gO.Let us consider the effect of rotation around the z -axis on generators of

the Galilei group. We can write

A′x ≡ Ax(φ) = eJ zφAxe−J zφ

A′y ≡ Ay(φ) = eJ zφ

Aye−J zφ

A′z ≡ Az(φ) = eJ zφAze−J zφ

where A is any of the generators P , J or K. From commutators (1.17) -(1.19) we obtain

Page 43: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 43/687

1.2. THE GALILEI GROUP 13

∂ ∂φ

Ax(φ) = eJ zφ(J zAx − AxJ z)e−J zφ = eJ zφAye−J zφ = Ay(φ) (1.26)

∂φAy(φ) = eJ zφ(J zAy − AyJ z)e−J zφ = −eJ zφAxe−J zφ = −Ax(φ)

∂φAz(φ) = eJ zφ(J zAz − AzJ z)e−J zφ = 0 (1.27)

Taking a derivative of eq. (1.26) by φ we obtain a second order differentialequation

∂ 2

∂ 2φAx(φ) = ∂

∂φAy(φ) = −Ax(φ)

with the general solution

Ax(φ) = B cos φ + D sin φ

From the initial conditions we obtain

B = Ax(0) = Ax

D =d

dφAx(φ)|φ=0 = Ay

so that finally

Ax(φ) = Ax cos φ + Ay sin φ (1.28)

Similar calculations show that

Ay(φ) = −Ax sin φ + Ay cos φ (1.29)

Az(φ) = Az (1.30)

Comparing (1.28) - (1.30) with eq. (D.10), we see that

Page 44: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 44/687

14 CHAPTER 1. THE POINCAR E GROUP

00 SS

S’

φφ

φφxx

xx

vv

vv

Figure 1.2: Transformation of generators under space inversion.

A′i = eJ zφAie

−J zφ =3

j=1

(Rz)ijA j (1.31)

where Rz is the rotation matrix. As shown in eq. (D.19), we can find the

result of application of a general rotation

φ; 0; 0; 0

to generators

A A′ = e

J φ Ae− J φ

= A cos φ + φ

φ( A ·

φ

φ)(1 − cos φ) − A ×

φ

φsin φ

= R φ A

This means that P , J , and K are 3-vectors.4 The commutator (1.20) obvi-ously means that H is a 3-scalar.

1.2.4 Space inversionsWe will not consider physical consequences of discrete transformations (in-version and time reversal) in this book. It is physically impossible to prepare

4see Appendix D.2

Page 45: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 45/687

1.3. THE POINCAR E GROUP 15

an exact mirror image or a time-reversed image of a laboratory, so the rel-

ativity postulate has nothing to say about such transformations. Indeed, ithas been proven by experiment that these discrete symmetries are not ex-act. Nevertheless, we will find it useful to know how generators behave withrespect to space inversions. Suppose we have a classical system S and its in-version image S ′ (see Fig. 1.2) with respect to the origin 0. The question is:how the image S ′ will transform if we apply a certain inertial transformationto S ?

Apparently, if we shift S by vector x, then S ′ will be shifted by −x.This can be interpreted as the change of sign of the generator of translation P under inversion. The same with boost: the inverted image S ′ acquiresvelocity

−v if the original was boosted by v. So, inversion changes the sign

of the boost generator as well

K → − K (1.32)

Vectors, such as P and K, changing their sign after inversion are calledtrue vectors . However, the generator of rotation J is not a true vector.Indeed, if we rotate S by angle φ, then the image S ′ is also rotated bythe same angle (see fig. 1.2). So, J does not change the sign after inver-sion. Such vectors are called pseudovectors . Similarly we can introduce thenotions of true scalars/pseudoscalars and true tensors/pseudotensors. It isconventional to define their properties in a way opposite to those of true vec-tors/pseudovectors. In particular, true scalars and true tensors (of rank 2) donot change their sign after inversion. For example, H is a true scalar. Pseu-doscalars and pseudotensors (of rank 2) do change their sign after inversion.

1.3 The Poincare group

It appears that the Galilei group described above is valid only for observers

moving with low speeds. In the general case a different multiplication lawshould be used and the group of inertial transformations is, in fact, thePoincare group (also known as the inhomogeneous Lorentz group). This is themost important lesson of the theory of relativity developed in the beginningof the 20th century by Einstein, Lorentz, and Poincare.

Page 46: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 46/687

16 CHAPTER 1. THE POINCAR E GROUP

Derivation of the relativistic group of inertial transformations is a diffi-

cult task, because we lack experience of dealing with fast-moving objects inour everyday life. So, we will use some more formal mathematical argumentsinstead. In this section we will find that there is basically a unique way toobtain the Lie algebra of the Poincare group by generalizing the commuta-tion relations of the Galilei Lie algebra (1.17) - (1.24), so that they remaincompatible with some simple physical requirements.

1.3.1 Lie algebra of the Poincare group

When we derived commutation relations of the Galilei algebra in the preced-ing section, we used some “common sense” ideas about how inertial trans-

formations may affects parameters (positions, velocities, etc.) of observers.We can be confident about the validity of Galilei commutators involving gen-erators of space and time translations and rotations, because properties of these transformations have been verified in everyday life and in physical ex-periments over a wide range of involved parameters (distances, times, andangles). The situation with respect to boosts is quite different. Normally, wedo not experience high speeds in our life, and we lack any physical intuitionthat was so helpful in deriving the Galilei Lie algebra. Therefore the argu-ments that lead us to the commutation relations involving boost generators5

may be not exact, and these formulas may be just approximations that canbe tolerated only for low-speed observers. So, we will base our derivation of

the relativistic group of inertial transformations on the following ideas.

(I) Just as in the non-relativistic world, the set of inertial transformationsshould remain a 10-parameter Lie group. However, commutation rela-tions in the exact (Poincare) Lie algebra are expected to be differentfrom the Galilei commutators (1.17) - (1.24).

(II) The Galilei group does a good job in describing the low-speed trans-formations, and the speed of light c is a natural measure of speed.Therefore we may guess that the correct commutators should include cas a parameter, and they must tend to the Galilei commutators in the

limit c → ∞.65for example, the formula for addition of velocities in eq. (1.9)6Note that here we do not assume that c is a limiting speed or that the speed of light is

invariant. These facts will come out as a result of application of our approach in subsection5.4.1.

Page 47: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 47/687

1.3. THE POINCAR E GROUP 17

(III) We will assume that only commutators involving boosts may be subject

to revision.(IV) We will further assume that relativistic generators of boosts K still form

components of a true vector, so eqs. (1.19) and (1.32) remain valid.

Summarizing requirements (I) - (IV), we can write the following relativisticgeneralizations of commutators (1.22) - (1.24)

[Ki, P j] = U ij (1.33)

[Ki, K j] = T ij[

Ki,

H] =

−P i +

V i (1.34)

where T ij, U ij , and V ij are some yet unknown linear combinations of genera-tors. The coefficients of these linear combinations must be selected in such away that all Lie algebra properties (in particular, the Jacobi identity ( E.10))are preserved. Let us try to satisfy these conditions step by step.

First note that commutator [Ki, P j] is a 3-tensor. Indeed, using eq. (1.31)we obtain the tensor transformation law

e J φ[Ki, P j ]e− J φ = [

3

k=1Rik( φ)Kk,

3

k=1R jl( φ)P l]

=3

kl=1

Rik( φ)R jl( φ)[Kk, P l]

that can be compared with the definition of 3-tensor in (D.13). Since both K and P change the sign upon inversion, this a true tensor. Therefore U ijmust be a true tensor as well. This tensor should be constructed as a linearfunction of generators among which we have a true scalar H, a pseudovector J , and two true vectors P and K. According to our discussion in Appendix

D.4, the only way to make a true tensor is by using formulas in the firstand third rows in table D.1. Therefore, the most general expression for the

commutator (1.33) is

[Ki, P j ] = −β Hδ ij + γ 3

k=1

ǫijkJ k

Page 48: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 48/687

18 CHAPTER 1. THE POINCAR E GROUP

where β and γ are yet unspecified real constants.

Similar arguments suggest that T ij is also a true tensor. Due to therelationship

[Ki, K j] = −[K j , Ki]

this tensor must be antisymmetric with respect to indices i and j. Thisexcludes the term proportional to δ ij , hence

[Ki, K j] = α

3k=1

ǫijkJ k,

where α is, again, an undefined constant.The quantity V i in eq. (1.34) must be a true vector, so, the most general

form of the commutator (1.34) is

[Ki, H] = −(1 + σ)P i + κKi.

So, we have reduced the task of generalization of Galilei commutators tofinding just five real parameters α, β , γ , κ, and σ. To do that, let us first usethe following Jacobi identity

0 = [P x

, [Kx

,H

]] + [Kx

, [H

,P x

]] + [H

, [P x

,Kx

]]

= κ[P x, Kx]

= βκHwhich implies

βκ = 0 (1.35)

Similarly,

0 = [

Kx, [

Ky,

P y]] + [

Ky, [

P y,

Kx]] + [

P y, [

Kx,

Ky]]

= −β [Kx, H] − γ [Ky, J z] + α[P y, J z]= β (1 + σ)P x − βκKx − γ Kx + αP x= (α + β + βσ)P x − (βκ + γ )Kx

= (α + β + βσ)P x − γ Kx

Page 49: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 49/687

1.3. THE POINCAR E GROUP 19

implies

α = −β (1 + σ) (1.36)

γ = 0 (1.37)

The system of equations (1.35) - (1.36) has two possible solutions (in bothcases σ remains undefined)

(i) If β = 0, then α = −β (1 + σ) and κ = 0.

(ii) If β = 0, then α = 0 and κ is arbitrary.

From the condition (II) we know that parameters α,β,σ,κ must depend onc and tend to zero as c → ∞

limc→∞

κ = limc→∞

σ = limc→∞

α = limc→∞

β = 0 (1.38)

Additional insight into the values of these parameters may be obtained byexamining their dimensions. To keep the arguments of exponents in (1.16)dimensionless we must assume the following dimensionalities (denoted byangle brackets) of the generators

< H > = < time >−1

< P > = < distance >−1

< K > = < speed >−1

< J > = < angle >−1= dimensionless

It then follows that

< α > =< K >2

< J >

=< speed >−2

< β > =< K >< P >

< H >=< speed >−2

< κ > = < H >=< time >−1

< σ > = dimensionless

Page 50: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 50/687

20 CHAPTER 1. THE POINCAR E GROUP

and we can satisfy condition (1.38) only by setting κ = σ = 0 (i.e., the

choice (i) above) and assuming β = −α ∝ c−2

. This approach does notspecify the coefficient of proportionality between β (and −α) and c−2. To bein agreement with experimental data we must choose this coefficient equalto 1.

β = −α =1

c2

Then the resulting commutators are

[J i, P j] =

3k=1

ǫijkP k (1.39)

[J i, J j] =3

k=1

ǫijkJ k (1.40)

[J i, K j] =3

k=1

ǫijkKk (1.41)

[J i, H] = 0 (1.42)

[P i, P j] = [P i, H] = 0 (1.43)

[Ki, K j] = − 1c2

3k=1

ǫijkJ k (1.44)

[Ki, P j] = − 1

c2Hδ ij (1.45)

[Ki, H] = −P i (1.46)

This set of commutators is called the Poincare Lie algebra and it differs fromthe Galilei algebra (1.17) - (1.24) only by small terms on the right hand sidesof commutators (1.44) and (1.45). The general element of the correspondingPoincare group has the form7

e J φe

Kc θe P xeHt (1.47)

7Note that here we adhere to the conventional order of basic transformations adoptedin (1.6); from right to left: time translations - space translations - boosts - rotations.

Page 51: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 51/687

1.3. THE POINCAR E GROUP 21

In eq. (1.47) we denoted the parameter of boost by c θ, where θ = | θ| is

a dimensionless quantity called rapidity .

8

Its relationship to the velocity of boost v is

v( θ) = θ

θc tanh θ

cosh θ = (1 − v2/c2)−1/2

In spite of their simplicity, eqs (1.39) - (1.46) are among the most importantequations in physics, and they have such an abundance of experimental con-firmations that one cannot doubt their validity. We therefore accept that thePoincare group is the true mathematical expression of relationships between

different inertial laboratories.

Postulate 1.2 (the Poincare group) Transformations between inertial lab-oratories form the Poincare group.

Even a brief comparison of the Poincare (1.39) - (1.46) and Galilei (1.17)- (1.24) commutators reveals a number of important new features in therelativistic theory. For example, due to commutator (1.44), boosts no longerform a subgroup. However, boosts together with rotations do form a 6-dimensional subgroup of the Poincare group which is called the Lorentz group.

1.3.2 Transformations of translation generators underboosts

Poincare commutators allow us to derive transformation properties of gener-ators P and H with respect to boosts. Using Eq. (1.25) and commutators(1.45) - (1.46) we find that if P x and H are generators in the reference frameat rest O, then their counterparts P x(θ) and H(θ) in the reference frame O′

moving along the x-axis are

H(θ) = eKxcθHe−Kxcθ

P x(θ) = eKxcθP xe−Kxcθ

8The reason for introducing this new quantity is that rapidities of successive boosts inone direction are additive, while velocities are not; see eq. (4.5).

Page 52: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 52/687

22 CHAPTER 1. THE POINCAR E GROUP

Taking derivatives of these equations with respect to the parameter θ

∂θH(θ) = ceKxcθ(KxH−HKx)e−Kxcθ

= −ceKxcθP xe−Kxcθ

= −cP x(θ)

∂θP x(θ) = ceKxcθ(KxP x − P xKx)e−Kxcθ

= −1

ceKxcθHe−Kxcθ

= −1

cH(θ) (1.48)

and taking a derivative of eq. (1.48) again, we obtain a differential equation

∂ 2

∂ 2θP x(θ) = −1

c

∂θH(θ) = P x(θ)

with the general solution

P x(θ) = A cosh θ + B sinh θ

From the initial conditions we obtain

A = P x(0) = P xB =

∂θP x(θ)|θ=0 = −1

cH

and finally

P x(θ) = P x cosh θ − Hc

sinh θ

Similar calculation shows that

H(θ) = H cosh θ − cP x sinh θP y(θ) = P yP z(θ) = P z

Page 53: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 53/687

1.3. THE POINCAR E GROUP 23

Similar to our discussion of rotations in subsection D.5, we can find the

transformation of P and H corresponding to a general boost vector

θ in thecoordinate-independent form. First we decompose P into sum of two vectors

P = P + P ⊥. The vector P = ( P · θθ

) θθ

is parallel to the direction of the

boost and vector P ⊥ = P − P is perpendicular to the direction of the boost.

The perpendicular part P ⊥ remains unchanged under the boost, while P transforms according to exp( Kc θ) P exp(− Kc θ) = P cosh θ − c−1H sinh θ

θθ

.Therefore

P ′ = e

Kc θ

P e− Kc θ =

P +

θ

θ[(

P ·

θ

θ)(cosh θ

−1)

1

cH

sinh θ] (1.49)

H′ = e Kc θHe− Kc θ = H cosh θ − c( P ·

θ

θ)sinh θ (1.50)

It is clear from (1.49) and (1.50) that boosts perform linear transformations

of components c P and H. These transformations can be represented in amatrix form if four generators (H, c P ) are arranged in a column 4-vector9

H′

cP ′xcP ′ycP ′z

= B( θ)

HcP xcP ycP z

.

From eq. (1.49) and (1.50) we can find the matrix B( θ) corresponding to an

arbitrary boost θ

B( θ) =

cosh θ −θxθ sinh θ −θy

θ sinh θ −θzθ sinh θ

−θxθ sinh θ 1 + χθ2x χθxθy χθxθz

−θyθ

sinh θ χθxθy 1 + χθ2y χθyθz

−θzθ

sinh θ χθxθz χθyθz 1 + χθ2z

(1.51)

where we denoted χ = (cosh θ − 1)θ−2. In particular, the boosts along x, y,and z axes are represented by the following 4 × 4 matrices

9see Appendix I.1

Page 54: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 54/687

24 CHAPTER 1. THE POINCAR E GROUP

B(θ, 0, 0) =

cosh θ − sinh θ 0 0

− sinh θ cosh θ 0 00 0 1 00 0 0 1

(1.52)

B(0, θ, 0) =

cosh θ 0 − sinh θ 00 1 0 0

− sinh θ 0 cosh θ 00 0 0 1

(1.53)

B(0, 0, θ) = cosh θ 0 0 − sinh θ

0 1 0 0

0 0 1 0− sinh θ 0 0 cosh θ

(1.54)

Page 55: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 55/687

Chapter 2

QUANTUM MECHANICS

The nature of light is a subject of no material importance to the concerns of life or to the practice of the arts, but it is in many other respects extremely interesting.

Thomas Young.

In the preceding chapter we described a world, which was inhabited byinertial observers, and all these observers could do was to measure parametersof each other. Now we are going to add preparation devices and physicalsystems (see Fig. 1) to our picture of the world. We will ask what kind of information about the physical system can be obtained by the observer, andhow this information depends on the state of the system?

Until the end of the 19th century these questions were answered by clas-sical mechanics which, basically, said that in each state the physical systemhas a number of observables (e.g, position, momentum, mass, etc) whosevalues can be measured simultaneously, accurately, and reproducibly. Thesedeterministic views were fundamental not only for classical mechanics, butthroughout classical physics.

A dissatisfaction with the classical theory started to grow at the end of the 19th century when this theory was found inapplicable to microscopicphenomena, such as the radiation spectrum of heated bodies, the discretespectrum of atoms, and the photo-electric effect. Solutions of these and manyother problems were found in quantum mechanics whose creation involved

25

Page 56: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 56/687

26 CHAPTER 2. QUANTUM MECHANICS

joint efforts and passionate debates of such outstanding scientists as Bohr,

Born, de Broglie, Dirac, Einstein, Fermi, Fock, Heisenberg, Pauli, Planck,Schrodinger, Wigner, and many others. The picture of the physical worldemerged from these efforts was weird, paradoxical, and completely differentfrom the familiar classical picture. However, despite this apparent weirdness,predictions of quantum mechanics are tested countless times everyday inphysical and chemical laboratories around the world, and not a single timewere these “weird” predictions found wrong. This makes quantum mechanicsthe most successful and accurate physical theory of all times.

2.1 Why do we need quantum mechanics?

The inadequacy of classical concepts is best seen by analyzing the debatebetween the corpuscular and wave theories of light. Let us demonstrate theessence of this centuries-old debate on an example of a thought experimentwith pinhole camera.

2.1.1 The corpuscular theory of light

You probably saw or heard about a simple device called camera obscura orpinhole camera. You can easily make this device yourself: Take a light-tightbox, put a photographic plate inside the box, and make a small hole in thecenter of the side opposite to the photographic plate (see Fig. 2.1). The lightpassing through the hole inside the box creates a sharp inverted image on thephotographic plate. You will get even sharper image by decreasing the sizeof the hole, though the brightness of the image will become lower, of course.This behavior of light was well known for centuries (a drawing of the cameraobscura is present in Leonardo da Vinci’s papers). One of the earliest scien-tific explanations of this and other properties of light (reflection, refraction,etc.) was suggested by Newton. In modern language, his corpuscular theory would explain the formation of the image like this:

Corpuscular theory: Light is a flow of tiny particles (photons)propagating along classical trajectories (light rays). Each particlein the ray carries a certain amount of energy which gets releasedupon impact in a very small volume corresponding to one grainof the photographic emulsion and produces a small dot image.

Page 57: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 57/687

2.1. WHY DO WE NEED QUANTUM MECHANICS? 27

aperture

photographic plate

AA

A’

Figure 2.1: The image in the camera obscura with a pinhole aperture iscreated by straight light rays: the image at point A′ on the photographicplate is created only by light rays emitted from point A and passed straightthrough the hole.

When intensity of the source is high, there are so many particles,that we cannot distinguish their individual dots. All these dotsmerge into one continuous image and the intensity of the image isproportional to the number of particles hitting the photographicplate during the exposure time.

Let us continue our experiment with the pinhole camera and decreasethe size of the hole even further. The corpuscular theory would insist thatsmaller size of the hole must result in a sharper image. However this is notwhat experiment shows! For a very small hole the image on the photographicplate will be blurred. If we further decrease the size of the hole, the detailedpicture will completely disappear and the image will look like one diffusespot (see Fig. 2.2), independent on the form and shape of the light sourceoutside the camera. It appears as if light rays scatter in all directions whenthey pass through a small aperture or near a small object. This effect of thelight spreading is called diffraction and was discovered by Grimaldi in the

middle of the 17th century.The diffraction is rather difficult to reconcile with the corpuscular theory.

For example, we can try to save this theory by assuming that light raysdeviate from their straight paths due to some interaction with the materialsurrounding the hole. However this is not a satisfactory explanation. The

Page 58: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 58/687

28 CHAPTER 2. QUANTUM MECHANICS

AA BB

AA BB

(a) (b)

Figure 2.2: (a) Image in the pinhole camera with a very small aperture; (b)the density of the image along the line AB

most striking evidence of the fallacy of the naive corpuscular theory is theeffect of light interference discovered by Young in 1802 [23]. To observe theinterference we can slightly modify our pinhole camera by making two smallholes close to each other, so that their diffraction spots on the photographicplate overlap. We already know that when we leave the left hole open andclose the right hole we get a diffuse spot L (see Fig. 2.3(a)). When we leaveopen the right hole and close the left hole we get another spot R. Let us tryto predict what kind of image we will get if we open both holes.

Following the corpuscular theory and simple logic we might concludethat particles reaching the photographic plate are of two sorts: those passedthrough the left hole and those passed through the right hole. When the twoholes are opened at the same time, the density of the “left hole“ particlesshould add to the density of the “right hole” particles and the resulting imageshould be a superposition L+R of the two images (full line in Fig. 2.3(a)).Right? Wrong! This “reasonable” explanation disagrees with experiment.The actual image has new features (brighter and darker regions) called theinterference picture (full line in Fig. 2.3(b)).

Can the corpuscular theory explain the interference picture? We couldassume, for example, that some kind of interaction between light corpusclesis responsible for the interference. However, this idea must be rejected. Forexample, in an interference experiment conducted by Taylor in 1909 [24], theintensity of light was so low that no more than one photon was present at any

Page 59: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 59/687

2.1. WHY DO WE NEED QUANTUM MECHANICS? 29

(a) (b)

LL RR

L+R

Figure 2.3: (a) The density of the image in a two-hole camera accordingto naive corpuscular theory is a superposition of images created by the left(L) and right (R) holes; (b) Actual interference picture: In some places thedensity of the image is higher than L+R (constructive interference); in otherplaces the density is lower than L+R (destructive interference).

time instant, so that any photon-photon interaction was impossible. Another“explanation” that the photon somehow splits and passes through both holesand then rejoins again at the point of collision with the photographic platedoes not stand criticism as well: One photon never creates two dots on thephotographic plate, so it is unlikely that it can split during propagation. Canwe assume that the particle passing through the right hole somehow “knows”whether the left hole is open or closed and adjusts its trajectory accordingly?Of course, there is some difference of force acting on the photon near the lefthole depending on whether the right hole is open or not. However by allestimates this difference is negligibly small.

2.1.2 The wave theory of light

The inability to explain such basic effects of light propagation as diffraction

and interference was a major embarrassment for the Newtonian corpusculartheory. These effects as well as all other properties of light known beforequantum era (reflection, refraction, polarization, etc.) were brilliantly ex-plained by the wave theory of light advanced by Grimaldi, Huygens, Young,Fresnel, and others. The wave theory gradually substituted the corpuscular

Page 60: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 60/687

30 CHAPTER 2. QUANTUM MECHANICS

theory in the 19th century. It found its strongest support from Maxwell’s

electromagnetic theory which unified optics with electric and magnetic phe-nomena. This theory explained that the light wave is actually an oscillatingfield of electric E(x, t) and magnetic B(x, t) vectors – a sinusoidal wave prop-agating with the speed of light. According to the Maxwell’s theory, the energyof the wave, and consequently the intensity of light I , is proportional to thesquare of the amplitude of the field vector oscillations, e.g., I ∝ E2. Thenthe wave theory would explain the formation of the photographic image asfollows:

Wave theory: Light is a continuous wave or field propagat-

ing in space in an undulatory fashion. When the light wavemeets molecules of the photo-emulsion, the charged parts of themolecules start to vibrate under the influence of the light’s elec-tric and magnetic field vectors. The places with higher ampli-tudes of the fields, and consequently higher vibration amplitudesof molecules, will have the denser image.

The diffraction and interference are quite naturally explained by the wavetheory. Diffraction simply means that the light waves can deviate from the

straight path and go around corners, just like sound waves do.1

To explainthe interference, we just need to note that when two portions of the wavethat passed through different holes meet each other on the photographicplate, their electric vectors add up. However the intensity of the combinedwave is not necessarily equal to the sum of intensities of the waves: I ∝(E1 + E2)

2 = E21 + 2E1 · E2 + E22 = E21 + E22 ∝ I 1 + I 2. It follows fromsimple geometric considerations that in the two-hole configuration there areplaces on the photographic plate where the two waves always come in phase(E1(t) ↑↑ E2(t) and E1 · E2 > 0, which means constructive interference), andthere are other places where the two waves always come in opposite phase(E1(t)

↑↓E2(t) and E1

·E2 < 0, i.e., destructive interference).

1The wave corresponding to the visible light has a very small wavelength between 0.4micron for the violet light and 0.7 micron for the red light, so for large obstacles or holes,the deviations from the straight path are very small, and the corpuscular theory of lightworks fine.

Page 61: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 61/687

2.1. WHY DO WE NEED QUANTUM MECHANICS? 31

2.1.3 Low intensity light and other experiments

In the 19th century physics, the wave-particle debate was decided in favorof the wave theory. However, further experimental evidence showed thatthe victory was declared prematurely. To see what goes wrong with thewave theory, let us continue our thought experiment with the interferencepicture created by two holes, and gradually tune down the intensity of thelight emitted by the source. At first, nothing interesting will happen: wewill see that the density of the image predictably decreases. However, aftersome point we will recognize that the image is not uniform and continuousas before. It consists of small bright dots as if some grains of photo-emulsionwere exposed to light and some not. This observation is very difficult to

reconcile with the wave theory. How a continuous wave can produce thisdotty image? However this is exactly what the corpuscular theory wouldpredict. Apparently the dots are created by particles hitting the photographicplate one at a time.

A number of other effects were discovered at the end of the 19th centuryand in the beginning of the 20th century which could not be explained bythe wave theory of light. One of them was the photo-electric effect: It wasobserved that when the light is shined on a piece of metal, electrons canescape from the metal into the vacuum. This observation was not surprisingby itself. However it was rather puzzling how the number of emitted elec-

trons depended on the frequency and intensity of the incident light. It wasfound that only light waves with the frequency above some threshold ω0 werecapable of knocking out the electrons from the metal. The radiation with thefrequency below ω0 could not produce the electron emission even if a highintensity light was used. According to the wave theory “explanation” above,we may assume that the electrons are knocked out of the metal due to somekind of force exerted on them by the wave. The larger wave amplitude (=larger light intensity) naturally means the larger force and the larger chanceof the electron emission. Then why the low frequency but high intensity lightcould not do the job?

In 1905 Einstein explained the photo-electric effect by bringing back New-tonian corpuscles in the form of “light quanta” later called photons. He de-scribed the light as “consisting of finite number of energy quanta which are localized at points in space, which move without dividing, and which can only be produced and absorbed as complete units” [25]. According to the Einstein’s

Page 62: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 62/687

32 CHAPTER 2. QUANTUM MECHANICS

explanation, each photon carries the energy of ω, where ω is the frequency2

of the light wave and

is the Planck constant. Each photon has a chance tocollide with and pass its energy to just one electron in the metal. Only highenergy photons (those corresponding to high frequency light) are capable of passing enough energy to the electron to overcome certain energy barrier E bbetween the metal and the vacuum. No matter what is the amplitude (= thenumber of photons) in the low-frequency (= low energy) light, the photonsof such light are just too “weak” to kick the electrons with sufficient energy.3

In the Compton’s experiment (1923) the interaction of light with electronscould be studied with much greater detail. And indeed, this interaction moreresembled a collision of two particles rather than shaking of the electron bya periodic electromagnetic wave.

These observations clearly lead to the conclusion that the light is a flowof particles after all. But what about the interference? We already agreedthat the corpuscular theory had no logical explanation of this effect. So,young quantum theory had an almost impossible task to reconcile two ap-parently contradicting classes of experiments with light: Some experiments(diffraction, interference) were easily explained with the wave theory whilethe corpuscular theory had serious difficulties. Other experiments (photo-electric effect, Compton scattering) could not be explained from the waveproperties and clearly showed that light consists of particles. Just adding tothe confusion, de Broglie in 1924 advanced a hypothesis that such particle-wave duality is not specific to photons. He proposed that all particles of matter – like electrons – have wave-like properties. This “crazy” idea wassoon confirmed by Davisson and Germer who observed the diffraction andinterference of electron beams in 1927.

Certainly, in the first quarter of the 20th century, physics faced the great-est challenge in its history. This is how Heisenberg described the situation:

I remember discussions with Bohr which went through many hours till very late at night and ended almost in despair; and when at the end of the discussion I went alone for a walk in the neighbor-ing park I repeated to myself again and again the question: Can

2ω is the so-called circular frequency (measured in radians per second) which is relatedto the usual frequency ν (measured in oscillations per second) by the formula ω = 2πν .

3Actually, the low-frequency light may lead to the electron emission when two or morelow-energy photons collide simultaneously with the same electron, but such events havevery low probability and become observable only at very high light intensities.

Page 63: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 63/687

2.2. PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 33

nature possibly be as absurd as it seemed to us in those atomic

experiments? W. Heisenberg [26]

2.2 Physical foundations of quantum mechan-

ics

In this section we will formulate two basic postulates which make the foun-dation of quantum mechanics, and explain the main difference between clas-sical and quantum views of the world. To understand quantum mechanics,we must accept that certain concepts which were taken for granted in clas-sical physics can not be applied to micro-objects like photons and electrons.

To see what is different, we must revisit basic ideas about what is physicalsystem, how its states are prepared, and how its observables are measured.

2.2.1 Ensembles and experiments

In our discussion of the experimental setup in Introduction, we considered just one act of preparation and measurement. However, releasing one pho-ton will not make a photograph and firing just one bullet is not the way toperform a shooting exercise. Every experimental study requires repetition of the preparation-measurement cycle. In classical physics, multiple measure-

ments are desirable, e.g., to determine the statistical distribution of resultsand estimate experimental errors. However, as we will see later, quantumphysics is an inherently probabilistic approach, and it requires accumulationof statistics as a necessary part of each experiment. We will call experiment the preparation of an ensemble (= a set of identical copies of the systemprepared in the same conditions) and performing measurements of the sameobservable in each member of the ensemble.4

Suppose we had N inertial laboratories in which we prepared an ensem-ble of N identical copies of the system and measured the same observable N

4It is worth noting here that in this book we are not considering repeated measurements

performed on the same copy of the physical system. We will work under assumption thatafter the measurement has been performed, the used copy of the system is discarded. Eachmeasurement requires a fresh copy of the physical system. This means, in particular, thatwe are not interested in the state to which the system may have “collapsed” after themeasurement. The description of repetitive measurements is an interesting subject, but itis beyond the scope of this book.

Page 64: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 64/687

34 CHAPTER 2. QUANTUM MECHANICS

times. We cannot say a priori that all these measurements will always pro-

duce the same results. However, it seems reasonable to assume the existenceof ensembles in which measurements of one observable can be repeated withthe same result infinite number of times. Indeed, there is no point to talkabout an observable, if there are no ensembles in which it can be preparedand observed with certainty.

Postulate 2.1 (the reproducibility of measurements in an ensemble)For any observable F and any value f from its spectrum, we can always pre-pare an ensemble in such a state that measurements of this observable are reproducible, i.e., always yield the value f .

The postulates 0.1 (unlimited precision of measurements) and 2.1 arevalid in both classical and quantum mechanics. However, we are interestedin finding situations where these two disciplines are different. It appearsthat the fundamental difference is in two additional assertions of classicalmechanics: simultaneous measurability and determinism . These assertionslook innocent for familiar objects in the macroscopic domain, but they areno longer valid in the micro-world.5 They will be discussed in the nextsubsection.

2.2.2 Measurements in classical mechanics

The first additional statement of classical mechanics enhances the postulate0.1 about the unlimited precision of measurements and asserts that we canaccurately measure several observables simultaneously.

Assertion 2.2 (simultaneous measurability) It is possible, in principle,to measure simultaneously any combination of observables with any prescribed precision.

In classical mechanics this assertion is so obvious that it is rarely mentionedat all, though it lies in the basis of the entire mathematical formalism of the

theory. Consider for example a bullet fired from a riffle. If we disregard the5 Actually, it is fair to say that they are equally wrong in the macro-world as well.

Simply the deviations from classical behavior are rather small and not easy to observein macro-systems on the background of relatively large experimental errors, e.g., due tonon-zero temperature.

Page 65: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 65/687

2.2. PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 35

spinning motion and the orientation of the bullet in space, then there are six

parameters describing the state of the bullet: its position r and momentump. According to Assertion 2.2, these observables can be measured accuratelyand simultaneously. Therefore an instantaneous state of the bullet can bedescribed as a point r, p in a 6-dimensional phase space , and the timedevelopment of this state can be described by a trajectory , r(t), p(t).

Assertion 2.2 refers to an individual preparation-measurement act. Whatcan we say about many measurements in the ensemble? We already men-tioned that often results of measurements performed in the ensemble are notreproducible, even if each individual measurement can be infinitely precise.For example, even if we do our best in taking the aim, still different bulletsfired from the riffle will hit different places on the target with different mo-

menta. Each bullet has its own trajectory in the phase space. Then we willsay that the ensemble of bullets is in a mixed state described by a probability distribution or probability density ρ(r, p) on the phase space.

In classical physics, these probabilities, are not a cause of concern, be-cause we believe that the uncertainties simply result from our insufficientcontrol over the preparation of the states of the bullets. We believe thatwith certain efforts we can always control the alignment of the riffle’s barrel,the amount of powder in the cartridge, wind, etc., so that each shot will hitexactly the same place on the target with the same momentum. This meansthat the ensemble can be prepared in a state (which will be called a pure

classical state ) whose probability density is squeezed into a delta-functionρ0(r, p) = δ (r − r0)δ (p − p0), and all bullets in the ensemble will followthe same trajectory. This belief in orderly definiteness and predictabilityof physical events is captured in the second additional postulate of classicalmechanics, which is a stronger version of the Postulate 2.1

Assertion 2.3 (determinism) We can always prepare an ensemble in a classical pure state in which measurements of all observables are reproducible.

Classical mechanics is a theory based on two Assertions 2.2 and 2.3. Thisis a fully deterministic theory in which one can, in principle, obtain a fullinformation about the system at any given time and knowing the rules of dynamics predict exactly its development in the future. This belief was bestexpressed by Laplace:

Page 66: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 66/687

36 CHAPTER 2. QUANTUM MECHANICS

An intelligence that would know at a certain moment all the forces

existing in nature and the situations of the bodies that compose nature, and if it would be powerful enough to analyze all these data, would be able to grasp in one formula the movements of the biggest bodies of the Universe as well as of the lightest atom. P.-S.Laplace

2.2.3 The quantum case

Quantum mechanics can be defined as a theory which respects two Postu-lates 0.1 (the unlimited precision of individual measurements) and 2.1 (thereproducibility of some measurements in an ensemble), but does not assume

the validity of classical Assertions 2.2 (the simultaneous measurability of ob-servables) and 2.3 (the determinism). The idea that 2.2 may be invalid, andfor some pairs of observables accurate simultaneous measurements could beimpossible was first advanced by Heisenberg. In particular, he discussed theimpossibility of an accurate measurement of both position and momentumof a micro-particle with a powerful microscope. He put forward the followingarguments. In order to determine the position of the particle we must registerat least one photon scattered by it. Then the accuracy of the measurementcannot be better than the wavelength of the used light. The measurements of momentum are not perfect either, because collisions with photons would defi-nitely disturb the particle and alter its momentum (e.g., due to the Comptoneffect). If we want to get more accurate results for the position we must usethe light with a shorter wavelength whose photons are more energetic, andtherefore perturb the particle’s momentum even stronger. So, better accu-racy for the position means worse accuracy for the momentum and vice versa .It is impossible to obtain accurate values of both position and momentum inone measurement. Such pairs of observables are called incompatible .

This discussion suggests that we must reject Assertion 2.2 in our con-struction of quantum theory and that

Statement 2.4 (incompatibility of observables) There is an accuracy

limit with which certain pairs of incompatible observables can be measured simultaneously.

This is the first major point where quantum mechanics deviates from classicalmechanics.

Page 67: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 67/687

2.3. THE LATTICE OF PROPOSITIONS 37

If we reject the simultaneous measurability of all observables, how can we

be sure that the system really has definite values of all these observables?In classical mechanics, we may still rely on Assertion 2.3 which allows us toknow simultaneous values of all observables in each individual system evenif the simultaneous measurements were not performed. To achieve that, wecan prepare an ensemble of identical particles in a pure classical state andfor each copy perform a measurement of any one observable from the set.E.g., one time we measure position and next time we measure momentum,etc. This is a kind of cheating because, in fact, we do not measure differentobservables simultaneously, but since the ensemble is pure we can be sure thatall measurements of any given observable will yield exactly the same answereach time, which in turn strongly suggests that each individual system in the

ensemble has definite values of all observables.A striking fact about nature is that she does not allow such a cheating!

It appears that, strictly speaking, classical pure ensembles do not exist. Inthe experiment described above, at least one observable will not show thesame value in repeated measurements. So, Assertion 2.3 must be rejected.Then the second major statement which distinguishes quantum mechanicsfrom classical mechanics follows:

Statement 2.5 (indeterminism) It is impossible to prepare ensemble in which measurements of all observables are exactly reproducible.

Quantum mechanics says that with certain efforts we can prepare an ensemblein such a state that measurements of the position would yield the same resulteach time, but then results for the momentum would be different all the time.We can also prepare (another) ensemble in a state with certain momentum,then the position will be all over the place. We cannot prepare an ensemblein which the uncertainties of both position ∆r and momentum ∆p are zero.6

2.3 The lattice of propositions

Having described the two fundamental statements of quantum mechanics inthe preceding section, we now need to turn these statements into workingmathematical formalism. This is the goal of the present section and nexttwo sections.

6see discussion of the Heisenberg uncertainty relation in subsection 5.3.2

Page 68: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 68/687

38 CHAPTER 2. QUANTUM MECHANICS

Physical theory is a set of statements about physical systems and their

properties. The language in which these statements are formulated and an-alyzed is provided by logic . The rules of classical logic were formulated longago, and they express relationships characteristic to macroscopic classical ob-

jects. In the preceding section we found that microscopic objects have ratherstrange properties which do not have counterparts in the macroscopic world.Can we still apply classical logical relationships when formulating statementsabout such quantum systems? The answer is ’no’. The lack of simultaneousmeasurability and the indeterminism are such significant deviations from thefamiliar classical behavior that the rules of logics should be modified whendealing with microscopic objects.

The initial idea that the fundamental difference between classical and

quantum mechanics lies in their logical structures belongs to Birkhoff and vonNeumann. They suggested to substitute classical logic of Aristotle and Booleby the quantum logic . The formalism presented in this section summarizestheir seminal work [27] as well as further research most notably by Mackey[28] and Piron [29, 30]. Even for those who do not accept the necessity of such radical change in our views on logic, the study of quantum logic mayprovide a desirable bridge between intuitive concepts of classical mechanicsand abstract formalism of quantum theory.

In introductory quantum physics classes (especially in the United States), students are informed ex cathedra that the state of a

physical system is represented by a complex-valued wavefunction ψ, that observables correspond to self-adjoint operators, that the temporal evolution of the system is governed by a Schr¨ odinger equation, and so on. Students are expected to accept al l this un-critically, as their professors probably did before them. Any ques-tion of why is dismissed with an appeal to authority and an injunc-tion to wait and see how well it all works. Those students whose curiosity precludes blind compliance with the gospel according toDirac and von Neumann are told that they have no feeling for physics and that they would be better off studying mathematics or philosophy. A happy alternative to teaching by dogma is provided by basic quantum logic, which furnishes a sound and intellectually satisfying background for the introduction of the standard notions of elementary quantum mechanics. D. J. Foulis [31]

The main purpose of our sections 2.3 - 2.5 is to demonstrate that pos-

Page 69: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 69/687

2.3. THE LATTICE OF PROPOSITIONS 39

tulates of quantum mechanics are robust and inevitable consequences of the

laws of probability, and simple properties of measurements. Basic axioms of quantum logic which are common for both classical and quantum mechanicsare presented in section 2.3. The close connection between the classical logicand the phase space formalism of classical mechanics is discussed in section2.4. In section 2.5, we will note a remarkable fact that the only difference be-tween classical and quantum logics (and, thus, between classical and quantumphysics in general) is in a single obscure distributivity postulate. This classi-cal postulate must be replaced by the orthomodularity postulate of quantumtheory. We will also demonstrate how postulates of quantum logic lead (viaPiron’s theorem) directly to the standard formalism of quantum mechanicswith Hilbert spaces, Hermitian operators, wave functions, etc.

2.3.1 Propositions and states

The key elements of logic (either classical or quantum) are special observ-ables called propositions . A proposition x is a statement about the physicalsystem which can be either true or false . So, propositions can be also definedas observables7 whose spectrum consists of just two points: 1 (if x is true)and 0 (if x is false). As an example, we can consider propositions x aboutone observable F . These propositions are of the form ”the value of observ-able F is inside the interval X of the real line R.”8 When a measurement of the observable F is performed, the proposition x may be found either true

(if the value of F was found inside the subset X ) or false (otherwise). Wecan imagine the apparatus measuring F as a mechanism (whose exact con-struction is of no relevance to us) connected to an arrow. The measurementresults in the arrow pointing to a certain spot on the dial. According to thePostulate 0.1, a single measurement of one observable F can be done witharbitrary precision, so for any interval X on the dial we can say with absolutecertainty whether the arrow pointed inside this interval (the proposition xwas true) or not (the proposition x was false). Therefore, there is no uncer-tainty associated with a single measurement of any proposition about oneobservable.

As we will see later, there exist sets of observables F 1, F 2, . . . , F n whichare simultaneously measurable with arbitrary precision. For such sets of

7Propositions are sometimes called “yes-no experiments”.8Intervals X are not necessarily limited to contiguous intervals in R. All results remain

valid for more complex subsets of R, such as unions of any number of disjoint intervals.

Page 70: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 70/687

40 CHAPTER 2. QUANTUM MECHANICS

observables one can define propositions corresponding to subsets in the (n-

dimensional) common spectrum of these observables. For example, the propo-sition “particle’s position is inside volume V ”9 can be realized using a Geigercounter occupying the volume V . The counter clicks (the proposition is true)when the particle passes through the counter’s chamber and does not click(the proposition is false) when the particle is outside of V .

The above discussion refers to the single measurement performed on onecopy of the physical system. Let us now prepare multiple copies (an en-semble) of the system and perform the same measurement on all of them.According to Statement 2.5, there is no guarantee that the results of all thesemeasurements will be the same. So, for some members in the ensemble theproposition x will be found ’true’, while for other members it will be ’false’,

even if every effort is made to ensure that the state of the system is the samein all cases. For this reason, it is not useful to introduce the notion of the“state of the system” in quantum mechanics. It makes more sense to talkabout the “state of the ensemble of identically prepared system”. For brevityand keeping with tradition, we will still say “state of the system” when, infact, we will always have in mind the “state of the ensemble”.

In what follows we will denote the set of all propositions about the phys-ical system (also called the propositional system ) by L and the set of allpossible states of the system by Φ. Our goal in this chapter is to study themathematical relationships between elements x ∈ L and φ ∈ Φ in these twosets.

Using results of measurements as described above, we can introduce afunction (φ|x) called the probability measure which assigns to each state φand to each proposition x the probability of x to be true in the state φ. Thevalue of this function (a real number between 0 and 1) is obtained by thefollowing recipe:

(i) prepare a copy of the system in the state φ;

(ii) determine whether x is true or false;

(iii) repeat steps (i) and (ii) N times, then

(φ|x) = limN →∞

M

N 9This is a proposition about simultaneous measurement of three observables - the x, y,

and z components of position.

Page 71: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 71/687

2.3. THE LATTICE OF PROPOSITIONS 41

where M is the number of times when the proposition x was found to be

true.Two states φ and ψ of the same system are said to be equal (φ = ψ) if for any proposition x we have

(φ|x) = (ψ|x)

Indeed, there is no physical difference between these two states as all experi-ments yield the same results (probabilities). For the same reason we will saythat two propositions x and y are identical (x = y) if for all states φ of thesystem

(φ|x) = (φ|y) (2.1)

It follows from the above discussion that the probability measure (φ|x) con-sidered as a function on the set of all states Φ is a unique representative of proposition x (different propositions have different representatives). So, wecan gain some insight into the properties of different propositions by studyingproperties of corresponding probability measures.

There are propositions which are always true independent on the state

of the system. For example, the proposition “the value of the observable F 1is somewhere on the real line” is always true.10 For any other observableF 2, the proposition “the value of the observable F 2 is somewhere on the realline” is also true for all states. Therefore, according to (2.1), we will say thatthese two propositions are identical. So, we can define a unique maximum proposition I ∈ L which is always true. Inversely, the proposition “the valueof observable belongs to the empty set of the real line” is always false and willbe called the minimum proposition. There is just one minimum proposition∅ in the set L, and for each state φ we can write

(φ|I ) = 1 (2.2)(φ|∅) = 0 (2.3)

10Measurements of observables always yield some value, since we agreed in Introductionthat an ideal measuring apparatus never misfires.

Page 72: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 72/687

42 CHAPTER 2. QUANTUM MECHANICS

2.3.2 Partial ordering

Suppose that we found two propositions x and y, such that their measuressatisfy (φ|x) ≤ (φ|y) everywhere on the set of states Φ. Then we will say thatproposition x is less than or equal to proposition y and denote this relationby x ≤ y. If x and y are propositions about the same observable, then x ≤ ywhen the subset X is inside the subset Y , i.e., X ⊆ Y . In this case, therelation x ≤ y is associated with logical implication , i.e., if x is true then yis definitely true as well; x IMPLIES y.

If x ≤ y and x = y we will say that x is less than y and denote it by x < y .Example: x = “This is a butterfly”, y = “This is an insect”. Obviously,x implies y, and the set of objects for which x is true is less than the set of objects for which y is true.

The relation ≤ has three obvious properties.

Lemma 2.6 (reflectivity) x ≤ x.

Proof. Since for any φ it is true that (φ|x) ≤ (φ|x), then each propositionx is less than or equal to itself.

Lemma 2.7 (symmetry) If x ≤ y and y ≤ x then x = y.

Proof. If two propositions x and y are less than or equal to each other, then(φ|x) ≤ (φ|y) and (φ|y) ≤ (φ|x) for each state φ which implies (φ|x) = (φ|y)

and, according to (2.1), x = y.Lemma 2.8 (transitivity) If x ≤ y and y ≤ z , then x ≤ z .

Proof. If x ≤ y and y ≤ z , then (φ|x) ≤ (φ|y) ≤ (φ|z ) for every state φ,and consequently (φ|x) ≤ (φ|z )

Properties 2.6, 2.7, and 2.8 tell us that ≤ is a partial ordering relation.It is ordering because it tells which proposition is “smaller” and which is“larger”. It is partial , because it doesn’t apply to all pairs of propositions.There could be propositions x and y, such that for some states (φ|x) > (φ|y),while for other states (φ

|x) < (φ

|y). Thus, the set

Lof all propositions is a

partially ordered set . From eqs. (2.2) and (2.3) we also conclude that

Lemma 2.9 (definition of I ) x ≤ I for any x ∈ L.

Lemma 2.10 (definition of ∅) ∅ ≤ x for any x ∈ L.

Page 73: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 73/687

2.3. THE LATTICE OF PROPOSITIONS 43

2.3.3 Meet

For two propositions x and y, suppose that we found a third proposition z such that

z ≤ x (2.4)

z ≤ y (2.5)

There could be more than one proposition satisfying these properties. Wewill assume that we can always find one maximum proposition z in this setwhich will be called a meet of x and y and denoted by x

∧y. Example: x

= “This is a butterfly”, y = “This is red”, x ∧ y = “This is a red butterfly”.The existence of a unique meet is obvious in the case when x and y are

propositions about the same observable and correspond to two subsets of thereal line R: X and Y , respectively. Then the meet z = x ∧ y is a propositioncorresponding to the intersection of these two subsets Z = X ∩ Y .11 In thisone-dimensional case the operation meet can be identified with the logicaloperation AND: proposition x ∧ y is true only when both x AND y are true.

The above definition of meet can be formalized as

Postulate 2.11 (definition of ∧) x ∧ y ≤ x and x ∧ y ≤ y.

Postulate 2.12 (definition of ∧) If z ≤ x and z ≤ y then z ≤ x ∧ y.

One can easily show that the order in which meet operations are performedis not relevant

Lemma 2.13 (commutativity of ∧) x ∧ y = y ∧ x.

Lemma 2.14 (associativity of ∧) (x ∧ y) ∧ z = x ∧ (y ∧ z ).

11If X and Y do not intersect, then z = ∅.

Page 74: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 74/687

44 CHAPTER 2. QUANTUM MECHANICS

2.3.4 Join

Quite similarly, we assume that for any two propositions x and y there alwaysexists a unique join x ∨ y, such that both x and y are less or equal thanx∨y, and x∨y is the minimum proposition with such a property. Example:x = “This is an Asian elephant”, y = “This is an African elephant”, x ∨ y =“This is an elephant”.

In the case of propositions about the same observable, the join of x andy is a proposition z = x ∨ y whose subset of the real line is a union of thesubsets corresponding to x and y: Z = X ∪ Y . The proposition z is truewhen either x OR y is true. So, the join can be identified with the logicaloperation OR.12

The formal version of the above definition of join is

Postulate 2.15 (definition of ∨ ) x ≤ x ∨ y and y ≤ x ∨ y.

Postulate 2.16 (definition of ∨ ) If x ≤ z and y ≤ z then x ∨ y ≤ z .

Similar to Lemmas 2.13 and 2.14 we see that the order of join operationsis irrelevant

Lemma 2.17 (commutativity of ∨) x ∨ y = y ∨ x.

Lemma 2.18 (associativity of ∨

) (x∨

y)∨

z = x∨

(y∨

z ).

The properties of propositions listed so far (partial ordering, meet, and join) mean that the set of propositions L is what mathematicians call acomplete lattice .

2.3.5 Orthocomplement

There is one more operation on the set of propositions that we need toconsider. This operation is called orthocomplement . For any propositionx its orthocomplement is denoted by x⊥. In the case of propositions about

one observable, the orthocomplement has the meaning of the logical negation(operation NOT). If proposition x corresponds to the subset X of the real

12It is important to note that in quantum logic from x ∨ y being true it does notnecessarily follow that either x or y are true. The differences between quantum andclassical logics are discussed in subsection 2.5.2.

Page 75: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 75/687

2.3. THE LATTICE OF PROPOSITIONS 45

line, then its orthocomplement x⊥ corresponds to the relative complement of

X with respect toR

(denoted byR

\ X ). When the value of observable F isfound inside X , i.e., the value of x is 1, we immediately know that the valueof x⊥ is zero. Inversely, if x is false then x⊥ is necessarily true. Example:x = “This is big”, x⊥ = “This is small”.

More formally, the orthocomplement x⊥ is defined as a proposition whoseprobability measure for each state φ is

(φ|x⊥) = 1 − (φ|x) (2.6)

Lemma 2.19 (non-contradiction) x ∧ x⊥ = ∅.

Proof. Let us prove this Lemma in the case when x is a proposition aboutone observable F . Suppose that x∧x⊥ = y = ∅, then, according to Postulate2.1, there exists a state φ such that (φ|y) = 1, and, from Postulate 2.11,

y ≤ x

y ≤ x⊥

1 = (φ|y) ≤ (φ|x)

1 = (φ|y) ≤ (φ|x⊥)

It then follows that (φ|x) = 1 and (φ|x⊥) = 1, which means that any mea-surement of the observable F in the state φ will result in a value inside bothX and R \ X simultaneously, which is impossible. This contradiction shouldconvince us that x ∧ x⊥ = ∅.

Lemma 2.20 (double negation) (x⊥)⊥ = x.

Proof. From eq. (2.6) we can write for any state φ

(φ|(x⊥)⊥) = 1 − (φ|x⊥)= 1 − (1 − (φ|x))

= (φ|x).

Page 76: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 76/687

46 CHAPTER 2. QUANTUM MECHANICS

Lemma 2.21 (contraposition) If x ≤ y then y⊥ ≤ x⊥.

Proof. If x ≤ y then (φ|x) ≤ (φ|y) and (1−(φ|x)) ≥ (1−(φ|y)) for all statesφ. But according to our definition (2.6), the two sides of this inequality areprobability measures for propositions x⊥ and y⊥, which proves the Lemma.

Propositions x and y are said to be disjoint if x ≤ y⊥ or, equivalently,y ≤ x⊥. Example: x = “This is a planet” and y = “This is a butterfly” aredisjoint propositions.

When x and y are disjoint propositions about the same observable, theircorresponding subsets do not intersect: X ∩ Y = ∅. For such mutuallyexclusive propositions the probability of either x OR y being true (i.e., the

probability corresponding to the proposition x∨y) is the sum of probabilitiesfor x and y. It seems natural to generalize this property to all pairs of disjointpropositions

Postulate 2.22 (probabilities for mutually exclusive propositions) If x and y are disjoint, then for any state φ

(φ|x ∨ y) = (φ|x) + (φ|y)

The following Lemma establishes that either proposition x or its orthocom-

plement x⊥ is definitely true.

Lemma 2.23 (non-contradiction) x ∨ x⊥ = I .Proof. From Lemmas 2.6 and 2.20 it follows that x ≤ x = (x⊥)⊥ and thatpropositions x and x⊥ are disjoint. Then, by Postulate 2.22, for any state φwe obtain

(φ|x ∨ x⊥) = (φ|x) + (φ|x⊥) = (φ|x) + (1 − (φ|x)) = 1

which proves the Lemma.Adding the orthocomplement to the properties of the propositional sys-tem (complete lattice) L, we obtain that L is an orthocomplemented lattice .Axioms of orthocomplemented lattices are collected in the upper part of Ta-ble 2.1 for easy reference.

Page 77: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 77/687

2.3. THE LATTICE OF PROPOSITIONS 47

Table 2.1: Axioms of quantum logic

Property Postulate/LemmaAxioms of orthocomplemented lattices

Reflectivity 2.6 x ≤ xSymmetry 2.7 x ≤ y & y ≤ x ⇒ x = y

Transitivity 2.8 x ≤ y & y ≤ z ⇒ x ≤ z Definition of I and ∅ 2.9 x ≤ I

2.10 ∅ ≤ xDefinition of ∧ and ∨ 2.11 x ∧ y ≤ x

2.15 x ≤ x ∨ yDefinition of ∧ and ∨ 2.12 z ≤ x & z ≤ y ⇒ z ≤ x ∧ y

2.16 x≤

z & y≤

z ⇒

x∨

y≤

z Commutativity 2.13 x ∨ y = y ∨ x

2.17 x ∧ y = y ∧ xAssociativity 2.14 (x ∨ y) ∨ z = x ∨ (y ∨ z )

2.18 (x ∧ y) ∧ z = x ∧ (y ∧ z )Non-contradiction 2.19 x ∧ x⊥ = ∅

2.23 x ∨ x⊥ = I Double negation 2.20 (x⊥)⊥ = xContraposition 2.21 x ≤ y ⇒ y⊥ ≤ x⊥

Atomicity 2.24 existence of logical atomsAdditional assertions of classical logic

Distributivity 2.28 x ∨ (y ∧ z ) = (x ∨ y) ∧ (x ∨ z )2.29 x ∧ (y ∨ z ) = (x ∧ y) ∨ (x ∧ z )

Additional postulate of quantum logicOrthomodularity 2.39 x ≤ y ⇒ x ↔ y

Page 78: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 78/687

48 CHAPTER 2. QUANTUM MECHANICS

2.3.6 Atomic propositions

One says that proposition y covers proposition x if the following two state-ments are true:

1) x < y

2) If x ≤ z ≤ y, then either z = x or z = y

This means that there are no propositions “intermediate” between x and y.Example: x = “This is 10 dollars”, y = “This is 10 dollars and 1 cent”

If x is a proposition about single observable corresponding to the intervalX

⊆R, then the interval corresponding to the covering proposition y can be

obtained by adding just one extra point to the interval X .A proposition covering ∅ is called an atomic proposition or simply an

atom . So, atoms are smallest non-vanishing propositions. They unambigu-ously specify properties of the system in the most exact way. Example: x= “This is black spider named Albert sitting in the south-west corner of myroom.”

We will say that the atom p is contained in the proposition x if p ≤ xand assume the following

Postulate 2.24 (atomicity) The propositional system L is an atomic lat-

tice. This means that 1. If x = ∅, then there exists at least one atom p contained in x.2. Each proposition x is a join of all atoms contained in it:

x = ∨ p≤x p

3. If p is an atom and p ∧ x = ∅, then p ∨ x covers x.

There are three simple Lemmas that follow directly from this Postulate.

Lemma 2.25 If p is an atom and x is any non-zero proposition then either

p ∧ x = ∅ or p ∧ x = p.

Proof. We know that ∅ ≤ p ∧ x ≤ p and that p covers ∅. Then, accordingto the definition of covering, either p ∧ x = ∅ or p ∧ x = p.

Page 79: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 79/687

2.4. CLASSICAL LOGIC 49

Lemma 2.26 x ≤ y if and only if all atoms contained in x are contained in

y as well.Proof. If x ≤ y then for each atom p contained in x we have p ≤ x ≤ y and

p ≤ y by the transitivity property 2.8. To prove the inverse statement wenotice that if we assume that all atoms in x are also contained in y then byPostulate 2.24(2)

y = ∨ p≤y p= (∨ p≤x p) ∨ (∨ p≤x p)

= x ∨ (∨ p≤x p)

≥ x

Lemma 2.27 The meet x∧y of two propositions x and y is a union of atoms contained in both x and y.

Proof. If p is an atom contained in both x and y ( p ≤ x and p ≤ y), then p ≤ x ∧ y. Conversely, if p ≤ x ∧ y, then p ≤ x and p ≤ y by Lemma C.1

2.4 Classical logic

2.4.1 Truth tables and the distributive law

One may notice that the theory constructed above is similar to classicallogic. Indeed if we make substitutions (see Table 2.2) ’less than or equal to’→ IF...THEN..., join → OR, meet → AND, orthocomplement → NOT, etc.,then properties described in Postulates and Lemmas 2.6 - 2.24 exactly matchaxioms of classical Boolean logic. For example, the transitivity property inLemma 2.8 allows us to make syllogisms, like the one analyzed by Aristotle

If all humans are mortal,and all Greeks are humans,then all Greeks are mortal.

Page 80: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 80/687

50 CHAPTER 2. QUANTUM MECHANICS

Table 2.2: Four operations and two special elements of lattice theory and

logicName Name Meaning Symbol

in lattice theory in logic in classical logicless or equal implication IF x THEN y x ≤ y

meet injunction x AND y x ∧ y join disjunction x OR y x ∨ y

orthocomplement negation NOT x x⊥

maximum element tautology always true I minimum element absurdity always false ∅

Lemma 2.19 tells that a proposition and its negation cannot be true at thesame time. Lemma 2.23 is the famous tertium non datur law of logic: eithera proposition or its negation is true with no third possibility.

Note, however, that properties 2.6 - 2.24 are not sufficient to build acomplete theory of mathematical logic: Boolean logic has two additionalaxioms, which are called distributive laws

Assertion 2.28 x ∨ (y ∧ z ) = (x ∨ y) ∧ (x ∨ z ).

Assertion 2.29 x ∧ (y ∨ z ) = (x ∧ y) ∨ (x ∧ z ).

These laws, unlike axioms of orthocomplemented lattices, cannot be justifiedby using our approach with probability measures. This is the reason whywe call them Assertions. We will not assume their validity in the generalcase. However, these two Assertions can be proven if we use the fundamentalAssertions of classical mechanics: 2.2 and 2.3. In particular, Assertion 2.3says that in classical pure states all measurements yield the same results,i.e., reproducible. Then for a given classical pure state φ each proposition xis either always true or always false, and the probability measure can haveonly two values: (φ|x) = 1 or (φ|x) = 0. Such classical probability measureis called the truth function . In the double-valued (true-false) Boolean logic,

the job of performing logical operations with propositions is greatly simpli-fied by analyzing their truth functions. For example, to show the equality of two propositions it is sufficient to demonstrate that the values of their truthfunctions are the same for all classical pure states. Let us consider an exam-ple. Given two propositions x and y, there are at most four possible values

Page 81: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 81/687

2.4. CLASSICAL LOGIC 51

for the pair of their truth functions (φ|x) and (φ|y): (1,1), (1,0), (0,1), and

(0,0). To analyze these possibilities it is convenient to put the values of thetruth functions in a truth table . Table 2.3 is the truth table for propositionsx, y, x∧y, x∨y, x⊥, and y⊥.13 The first row in table 2.3 refers to all classicalpure states in which both propositions x and y are false. The second rowrefer to states in which x is false and y is true, etc.

Table 2.3: Truth table for basic logical operationsx y x ∧ y x ∨ y x⊥ y⊥

0 0 0 0 1 10 1 0 1 1 01 0 0 1 0 11 1 1 1 0 0

Table 2.4: Demonstration of the distributive law using truth tablex y z y ∧ z x ∨ (y ∧ z ) x ∨ y x ∨ z (x ∨ y) ∧ (x ∨ z )0 0 0 0 0 0 0 00 0 1 0 0 0 1 00 1 0 0 0 1 0 00 1 1 1 1 1 1 1

1 0 0 0 1 1 1 11 0 1 0 1 1 1 11 1 0 0 1 1 1 11 1 1 1 1 1 1 1

Another example is shown in Table 2.4. It demonstrates the validity of the distributive law14 in the classical case. As this law involves three differentpropositions, we need to consider 8 = 23 different cases. In all these cases thevalues of the truth functions in columns 5 and 8 are identical which meansthat

x ∨ (y ∧ z ) = (x ∨ y) ∧ (x ∨ z )

13Here we assume that all these propositions are non-empty.14Assertion 2.28

Page 82: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 82/687

52 CHAPTER 2. QUANTUM MECHANICS

Assertion 2.29 can be derived in a similar way.

Thus we have shown that in the world of classical mechanics the set of propositions L is an orthocomplemented atomic lattice with distributive laws2.28 and 2.29. Such a lattice will be called a classical propositional system or, shortly, classical logic . Study of classical logics and its relationship toclassical mechanics is the topic of the present section.

2.4.2 Atomic propositions in classical logic

Our next step is to demonstrate that classical logic provides the entire math-ematical framework of classical mechanics, i.e., the description of observablesand states in the phase space . First, we prove four Lemmas.

Lemma 2.30 In classical logic, if x < y, then there exists an atom p such that p ∧ x = ∅ and p ≤ y.

Proof. Clearly, y ∧ x⊥ = ∅, because otherwise we would have

y = y ∧ I = y ∧ (x ∨ x⊥)

= (y

∧x)

∨(y

∧x⊥)

= (y ∧ x) ∨ ∅= y ∧ x

≤ x

and, by Lemma 2.7, x = y in contradiction with the condition of the Lemma.Since y ∧ x⊥ is non-zero, then by Postulate 2.24(1) there exists an atom psuch that p ≤ y ∧ x⊥. It then follows that p ≤ x⊥, and by Lemma C.3

p ∧ x ≤ x⊥ ∧ x = ∅.

Lemma 2.31 In classical logic, the orthocomplement x⊥ of a proposition x(where x = I ) is a join of all atoms not contained in x.

Proof. First, it is clear that there should exist al least one atom p thatis not contained in x. If it were not true, then we would have x = I in

Page 83: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 83/687

2.4. CLASSICAL LOGIC 53

contradiction to the condition of the Lemma. Let us now prove that the

atom p is contained in x⊥. Indeed, using the distributive law 2.29 we canwrite

p = p ∧ I = p ∧ (x ∨ x⊥)

= ( p ∧ x) ∨ ( p ∧ x⊥)

According to Lemma 2.25 we now have four possibilities:

1. p ∧ x = ∅ and p ∧ x⊥ = ∅; then p ≤ x ∧ x⊥ = ∅, which is impossible;

2. p ∧ x = p and p ∧ x⊥ = p; then p = p ∧ p = ( p ∧ x) ∧ ( p ∧ x⊥) = p ∧ (x ∧ x⊥) = p ∧ ∅ = ∅, which is impossible;

3. p ∧ x = p and p ∧ x⊥ = ∅; from Postulate 2.11 it follows that p ≤ x,which contradicts our assumption and should be dismissed;

4. p ∧ x = ∅ and p ∧ x⊥ = p; from this we have p ≤ x⊥, i.e., p is containedin x⊥.

This shows that all atoms not contained in x are contained in x⊥. Further,from Lemmas 2.19 and 2.27 it follows that all atoms contained in x⊥ are notcontained in x. The statement of the Lemma then follows from Postulate2.24(2).

Lemma 2.32 In classical logic, two different atoms p and q are always dis- joint: q ≤ p⊥.

Proof. By Lemma 2.31, p⊥ is a join of all atoms different from p, including

q , thus q ≤ p⊥.

Lemma 2.33 In classical logic, the join x ∨ y of two propositions x and yis a join of atoms contained in either x or y.

Page 84: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 84/687

54 CHAPTER 2. QUANTUM MECHANICS

Proof. If p ≤ x or p ≤ y then p ≤ x ∨ y. Conversely, suppose that p ≤ x ∨ y

and p ∧ x = ∅, p ∧ y = ∅, then

p = p ∧ (x ∨ y)

= ( p ∧ x) ∨ ( p ∧ y)

= ∅ ∨ ∅= ∅

which is absurd.

Now we are ready to prove the important fact that in classical mechanics

propositions can be interpreted as subsets of a set S , which is called thephase space .

Theorem 2.34 For any classical logic L, there exists a set S and an iso-morphism f (x) between elements x of L and subsets of the set S such that

x ≤ y ⇔ f (x) ⊆ f (y) (2.7)

f (x ∧ y) = f (x) ∩ f (y) (2.8)

f (x ∨ y) = f (x) ∪ f (y) (2.9)

f (x⊥) = S \ f (x) (2.10)

where ⊆, ∩, ∪, and \ are usual set-theoretical operations of inclusion, inter-section, union and relative complement.

Proof. The statement of the theorem follows immediately if we choose S tobe the set of all atoms. Then property (2.7) follows from Lemma 2.26, eq.(2.8) follows from Lemma 2.27. Lemmas 2.33 and 2.31 imply eqs. (2.9) and(2.10), respectively.

2.4.3 Atoms and pure classical statesLemma 2.35 In classical logic, if p is an atom and φ is a pure state such that (φ| p) = 1,15 then for any other atom q we have (φ|q ) = 0.

15such a state always exists due to Postulate 2.1.

Page 85: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 85/687

2.4. CLASSICAL LOGIC 55

Proof. According to Lemma 2.32, q ≤ p⊥, and due to eq. (2.6)

(φ|q ) ≤ (φ| p⊥) = 1 − (φ| p) = 0

Lemma 2.36 In classical logic, if p is an atom and φ and ψ are two pure states such that (φ| p) = (ψ| p) = 1, then φ = ψ.

Proof. There are propositions of two kinds: those containing the atom p

and those not containing the atom p. For any proposition x containing theatom p we denote by q the atoms contained in x and obtain using Postulate2.24(2), Lemma 2.32, Postulate 2.22, and Lemma 2.35

(φ|x) = (φ| ∨q≤x q )

=q≤x

(φ|q )

= (φ| p)

= 1

The same equation holds for the state ψ. Similarly we can show that for anyproposition y not containing the atom p

(φ|y) = (ψ|y) = 0

Since probability measures of φ and ψ are the same for all propositions, thesetwo states are equal.

Theorem 2.37 In classical logic, there is an isomorphism between atoms pand pure states φ p such that

(φ p| p) = 1 (2.11)

Page 86: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 86/687

56 CHAPTER 2. QUANTUM MECHANICS

Proof. From Postulate 2.1 we know that for each atom p there is a state φ p

in which eq. (2.11) is valid. From Lemma 2.35 this state is unique. To provethe reverse statement we just need to show that for each pure state φ p thereis a unique atom p such that (φ p| p) = 1. Suppose that for each atom p wehave (φ p| p) = 0. Then, taking into account that I is a join of all atoms, thatall atoms are mutually disjoint, and using Postulate 2.22, we obtain

1 = (φ p|I )= (φ p| ∨ p≤I p)

=

p≤I (φ p| p)

= 0

which is absurd. Therefore, for each state φ p one can always find at leastone atom p such that eq. (2.11) is valid. Finally, we need to show that if pand q are two such atoms, then p = q . This follows from the fact that foreach pure classical state the probability measures (or the truth functions)corresponding to propositions p and q are exactly the same. For the state φ pthe truth function is equal to 1, for all other pure states the truth functionis equal to 0.

2.4.4 The classical phase space

Now we are fully equipped to discuss the phase space representation in clas-sical mechanics. Suppose that the physical system under consideration hasobservables A , B , C , . . . with corresponding spectra S A, S B, S C , ... Accordingto Theorem 2.37, for each atom p of the propositional system we can find itscorresponding pure state φ p. Due to Assertion 2.2, all observables A , B , C , . . .can be measured simultaneously in this state. Such measurements will assignto the state φ p a set of real numbers A p, B p, C p, . . . - the values of observ-ables. Let us suppose that the full set of observables A , B , C , . . . containsa minimal subset of observables

X , Y , Z , . . .

whose values

X p, Y p, Z p, . . .

uniquely enumerate all pure states φ p and therefore all atoms p. So, there is aone-to-one correspondence between number sets X p, Y p, Z p, . . . and atoms

p. Then the set of all atoms can be identified with the direct product16 of

16See Appendix A for definition of the direct product.

Page 87: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 87/687

2.4. CLASSICAL LOGIC 57

spectra of the minimal set of observables S = S X ×S Y ×S Z × . . .. This direct

product is called the phase space of the system. The values X p, Y p, Z p, . . .of the independent observables X , Y , Z , . . . in each point s ∈ S provide thephase space with “coordinates”. Other (dependent) observables A , B , C , . . .can be represented as real functions A(s), B(s), C (s), . . . on S or as functionsof independent observables X , Y , Z , . . ..

In this representation, propositions can be viewed as subsets of the phasespace. Another way is to consider propositions as special cases of observables(= real functions on S ): The function corresponding to the proposition aboutthe subset T of the phase space is the characteristic function of this subset

ξ (s) = 1, if s ∈ T 0, if s /∈ T

Atomic propositions correspond to single-point subsets of the phase spaceS .

Let us consider the above statements on a simple example which will bestudied in more detail in section 5.3. We will see there that in the case of asingle massive spinless particle one can choose six independent observables,namely three components of the particle’s position r and three componentsof the momentum p. All other one-particle observables (the energy, angularmomentum, velocity, etc.) can be expressed as real functions of r and p.The spectrum of each component of the position and momentum is the realline R. Thus the one-particle phase space is a direct product of six infiniteintervals (−∞, ∞), i.e., the 6-dimensional space R6. Let us consider twoexamples of propositions in R6. The proposition R = “position of the par-ticle is r0” is represented in the phase space by a 3-dimensional hyperplanewith fixed position r = r0 and arbitrary momentum p. The proposition P = “momentum of the particle is p0” is represented by another 3-dimensionalhyperplane in which the value of momentum is fixed, while position is arbi-trary. The meet of these two propositions is represented by the intersectionof the hyperplanes s = R∩P which is a point (r0, p0) in the phase space and

an atom in the propositional system.Probability measures have a simple interpretation in the classical phase

space. Each state φ (not necessarily a pure state) defines probabilities (φ| p)for all atoms p (= all points s in the phase space). Each proposition x is a

join of disjoint atoms contained in x.

Page 88: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 88/687

58 CHAPTER 2. QUANTUM MECHANICS

x = ∨q≤xq

Then, by Postulates 2.22, 2.24, and Lemma 2.32 the probability of the propo-sition x being true in the state φ is

(φ|x) = (φ| ∨q≤x q ) (2.12)

=q≤x

(φ|q ) (2.13)

So, the value of the probability measure for all propositions x is uniquelydetermined by its values on atoms. In many important cases (e.g., in thecase of a single particle discussed above), the phase space is continuous, andinstead of considering probabilities (φ|q ) at points in the phase space (=atoms) it is convenient to consider the probability density which is a functionΦ(s) on the phase space such that

1) Φ(s) ≥ 0;

2) S

Φ(s)ds = 1.

Then the value of the probability measure (φ|x) is obtained by the integral

(φ|x) =

X

Φ(s)ds

over the subset X corresponding to the proposition x.For a pure classical state φ, the probability density is localized at one point

s0 in the phase space and has the form of the delta function Φ(s) = δ (s−s0).For such states, the probability measure may only have values 0 or 1 for eachproposition x. The value of (φ|x) is 0 if the point s0 does not belong to the

subset X corresponding to the proposition x, and the value is 1 otherwise.

(φ|x) =

X

δ (s − s0)ds =

1, if s0 ∈ X 0, if s0 /∈ X

Page 89: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 89/687

2.5. QUANTUM LOGIC 59

This shows that for pure classical states the probability measure degener-

ates to the two-valued truth function, in agreement with our discussion insubsection 2.4.1.The states whose probability density is nonzero at more than one point

(i.e., is different from the delta function) are called classical mixed states .

2.5 Quantum logic

The above derivations of properties of classical logic and phase spaces reliedheavily on two Assertions of classical mechanics 2.2 and 2.3 and on the valid-ity of distributive laws (Assertions 2.28 and 2.29). In quantum mechanics we

are not allowed to use these Assertions. In this section we will build quantum logic , in which the distributive laws are not necessarily valid. Quantum logicis a foundation of the entire mathematical formalism of quantum theory, aswe will see in the rest of this chapter.

2.5.1 Compatibility of propositions

Propositions x and y are said to be compatible (denoted x ↔ y) if

x = (x ∧ y) ∨ (x ∧ y⊥) (2.14)

y = (x ∧ y) ∨ (x⊥ ∧ y) (2.15)

The notion of compatibility has a great importance for quantum theory. Insubsection 2.6.3 we will see that two propositions can be measured simulta-neously if and only if they are compatible.

Theorem 2.38 In an orthocomplemented lattice all propositions are com-patible if and only if the lattice is distributive.

Proof. If the lattice is distributive then for any two propositions x and y

(x ∧ y) ∨ (x ∧ y⊥) = x ∧ (y ∨ y⊥) = x ∧ I = x

and, changing places of x and y

Page 90: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 90/687

60 CHAPTER 2. QUANTUM MECHANICS

(x ∧ y) ∨ (x⊥ ∧ y) = y

These formulas coincide with our definitions of compatibility (2.14) and(2.15) which proves the direct statement of the theorem.

The proof of the inverse statement (compatibility → distributivity) ismore lengthy. We assume that all propositions in our lattice are compatiblewith each other and choose three arbitrary propositions x, y, and z . Now weare going to prove that the distributive laws17

(x∧

z )∨

(y∧

z ) = (x∨

y)∧

z (2.16)

(x ∨ z ) ∧ (y ∨ z ) = (x ∧ y) ∨ z (2.17)

are valid. First we prove that the following 7 propositions (some of themmay be empty) are mutually disjoint (see Fig. 2.4)

q 1 = x ∧ y ∧ z

q 2 = x⊥ ∧ y ∧ z

q 3 = x ∧ y⊥ ∧ z

q 4 = x∧

y∧

z ⊥

q 5 = x ∧ y⊥ ∧ z ⊥

q 6 = x⊥ ∧ y ∧ z ⊥

q 7 = x⊥ ∧ y⊥ ∧ z

For example, to show that propositions q 3 and q 5 are disjoint we notice thatq 3 ≤ z and q 5 ≤ z ⊥ (by Postulate 2.11). Then by Lemma 2.21 z ≤ q ⊥5 andq 3 ≤ z ≤ q ⊥5 . Therefore by Lemma 2.8 q 3 ≤ q ⊥5 .

Since by our assumption both x ∧ z and x ∧ z ⊥ are compatible with y, weobtain

x ∧ z = (x ∧ z ∧ y) ∨ (x ∧ z ∧ y⊥) = q 1 ∨ q 3

x ∧ z ⊥ = (x ∧ z ⊥ ∧ y) ∨ (x ∧ z ⊥ ∧ y⊥) = q 4 ∨ q 5

17Assertions 2.28 and 2.29

Page 91: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 91/687

2.5. QUANTUM LOGIC 61

xx yy

zz

qq11

qq55

qq44 qq

66

qq33

qq22

qq77

Figure 2.4: To the proof of Theorem 2.38.

so that

x = (x ∧ z ) ∨ (x ∧ z ⊥) = q 1 ∨ q 3 ∨ q 4 ∨ q 5

Similarly we show

y ∧ z = q 1 ∨ q 2

y = q 1∨

q 2∨

q 4∨

q 6

z = q 1 ∨ q 2 ∨ q 3 ∨ q 7

Then denoting Q = q 1 ∨ q 2 ∨ q 3 we obtain

(x ∧ z ) ∨ (y ∧ z ) = (q 1 ∨ q 3) ∨ (q 1 ∨ q 2) = q 1 ∨ q 2 ∨ q 3 = Q (2.18)

From Postulate 2.12 and y ∨ x = Q ∨ q 4 ∨ q 5 ∨ q 6 it follows that

Q ≤ (Q ∨ q 7) ∧ (Q ∨ q 4 ∨ q 5 ∨ q 6) = (x ∨ y) ∧ z (2.19)

On the other hand, from q 4 ∨ q 5 ∨ q 6 ≤ q ⊥7 , Lemma C.3, and the definition of compatibility it follows that

(x ∨ y) ∧ z = (Q ∨ q 4 ∨ q 5 ∨ q 6) ∧ (Q ∨ q 7) ≤ (Q ∨ q ⊥7 ) ∧ (Q ∨ q 7) = Q(2.20)

Page 92: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 92/687

62 CHAPTER 2. QUANTUM MECHANICS

Therefore, applying the symmetry property 2.7 to eq. (2.19) and (2.20), we

obtain

(x ∨ y) ∧ z = Q (2.21)

Comparing eqs. (2.18) and (2.21) we see that the distributive law (2.16) isvalid. The other distributive law (2.17) is obtained from eq. (2.16) by duality(see Appendix C).

2.5.2 The logic of quantum mechanics

In quantum mechanics we are not allowed to use Assertions 2.2 and 2.3and therefore we must abandon the distributive law. However, in order toget a non-trivial theory we need some substitute for this property. Thisadditional postulate should be specific enough to yield sensible physics andgeneral enough to be non-empty and to include the distributive law as aparticular case. The latter requirement is justified by our desire to haveclassical mechanics as a particular case of more general quantum mechanics.

To find such a generalization we will use the following arguments. FromTheorem 2.38 we know that the compatibility of all propositions is a char-acteristic property of classical Boolean lattices. We also mentioned that this

property is equivalent to the simultaneous measurability of propositions. Weknow that in quantum mechanics not all propositions are simultaneouslymeasurable, therefore they cannot be compatible as well. This suggests thatwe may try to find a generalization of classical theory by limiting the set of propositions that are mutually compatible. In quantum mechanics, we willpostulate that two propositions are definitely compatible if one implies theother, and leave it to mathematics to tell us about the compatibility of otherpairs.

Postulate 2.39 (orthomodularity) Propositions about physical systems obey the orthomodular law

a ≤ b ⇒ a ↔ b. (2.22)

Page 93: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 93/687

2.5. QUANTUM LOGIC 63

Orthocomplemented lattices with additional Postulate 2.39 are called ortho-

modular lattices .The center of the lattice is the set of elements compatible with all others.Obviously ∅ and I are in the center. A propositional system in which thereare only two elements in the center (∅ and I ) is called irreducible . Otherwiseit is called reducible . Any Boolean lattice having more than two elements(∅ and I are present in any lattice, of course) is reducible and its centercoincides with the entire lattice. Orthomodular atomic irreducible latticesare called quantum propositional systems or quantum logics . The rank of a propositional system is defined as the maximum number of mutually dis-

joint atoms. For example, the rank of the classical propositional system of one massive spinless particle described in subsection 2.4.4 is the “number of

points in the phase space R6”.The most fundamental conclusion of our study in this section is that

Statement 2.40 (quantum logic) Experimental propositions form a quan-tum propositional system.

In principle, it should be possible to perform all constructions and calcu-lations in quantum theory by using the formalism of orthomodular latticesbased on just described postulates. Such an approach would have certainadvantages because all its components have clear physical meaning: propo-sitions x are realizable in laboratories and probabilities (φ|x) can be directlymeasured in experiments. However, this approach meets tremendous diffi-culties mainly because lattices are rather exotic mathematical objects andwe lack intuition when dealing with lattice operations.

We saw that in classical mechanics the happy alternative to obscure latticetheory is provided by Theorem 2.34 which proves the isomorphism betweenthe language of distributive orthocomplemented atomic lattices and the phys-ically transparent language of phase spaces. Is there a similar equivalencetheorem in the quantum case? To answer this question, we may notice thatthere is a striking similarity between algebras of projections on closed sub-spaces in a complex Hilbert space H (see Appendices F and G) and quantumpropositional systems discussed above. In particular, if operations between

projections (or subspaces) in the Hilbert space are translated to the latticeoperations according to Table 2.5, then all axioms of quantum logic can bedirectly verified. For example, the validity of the Postulate 2.39 follows fromLemmas G.4 and G.5. Atoms can be identified with one-dimensional sub-spaces or rays in H. The irreducibility follows from Lemma G.6.

Page 94: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 94/687

64 CHAPTER 2. QUANTUM MECHANICS

Table 2.5: Translation of terms, symbols, and operations used for subspaces

and projections in the Hilbert space, and propositions in quantum logics.Subspaces Projections Propositions

X ⊆ Y P X P Y = P Y P X = P X x ≤ yX ∩ Y P X ∩Y x ∧ y

Span(X, Y ) P Span(X,Y ) x ∨ yX ′ 1 − P X x⊥

X and Y are compatible [P X , P Y ] = 0 x ↔ yX ⊥ Y P X P Y = P Y P X = 0 x and y are disjoint

0 0 ∅H 1 I

ray x|x

x|

x is an atom

One can also verify directly that distributive laws 2.28 and 2.29 are notvalid for subspaces in the Hilbert space H. To show that, it is sufficient todemonstrate that some consequences of these Assertions are not true. Themost apparent is the violation of Lemma 2.31. This Lemma establishes thateach atom in the lattice belongs either to x or to x⊥ for any x. In the latticeof subspaces in H this is not true18 as there are atoms belonging neither tox nor to x⊥ (see Fig. 2.5). So, the logic represented by subspaces in theHilbert space is different from classical Boolean logic.19

Thus we established that the set of closed subspaces (or projections) inany complex Hilbert space H is a representation of some quantum proposi-tional system. The next question is: can we find a Hilbert space representa-tion for each quantum propositional system? The answer to this question isgiven by the famous Piron theorem [29, 30] which allows us to claim the iso-morphism between the Hilbert space formalism and the logico-probabilisticapproach.20

Theorem 2.41 (Piron) Any irreducible quantum propositional system L18Unless x = ∅ or x = I .19

An interesting question follows from this discussion: “Does this mean that we cannotuse classical logic of Aristotle and Boole in our reasoning?” I think that the answer is“yes”. The Boolean logic seems so natural to us because in our everyday life we dealexclusively with classical objects. This logic, in particular its distributive law, does notapply to reasoning about quantum objects.

20The proof of the Piron theorem is beyond the scope of this book.

Page 95: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 95/687

2.5. QUANTUM LOGIC 65

XX

X’

YYHH

Figure 2.5: Subspaces X and X ′ are orthogonal to each other and theirspan is entire Hilbert space H. One-dimensional subspace (atom) Y belongsneither to X nor to X ′. However Y belongs to Span(X, X ′) = H. Thiscontradicts Lemma 2.31 of classical logic.

of rank 4 or higher 21 is isomorphic to the lattice of closed subspaces in a complex 22 Hilbert space H such that the correspondences shown in Table 2.5 are true.

This theorem forms the foundation of the mathematical formalism of quan-tum physics. In particular, it allows us to express important notions of theobservable and state in the new language of Hilbert spaces. This will be donein the next section. Orthomodular lattices of quantum logic, phases spacesof classical mechanics, and Hilbert spaces of quantum mechanics are justdifferent languages for describing relationships between states, observables,and their measured values. Table 2.6 can be helpful in translation between

21 All propositional systems of interest to physics have infinite (even uncountable) rank,so the condition ’rank ≥ 4’ is not a significant restriction.

22The original Piron’s theorem still leaves the freedom of choosing any division ringwith involutive antiautomorphism as the set of scalars in H. We can greatly reduce thisfreedom if we remember the important role played by real numbers in physics (values of

observables are always inR

). Therefore, it makes physical sense to consider only thoserings which include R as a subring. In 1877 Frobenius proved that there are only threesuch rings. They are real numbers R, complex numbers C, and quaternions H. Althoughthere is vast literature on real and, especially, quaternionic quantum mechanics [32, 33],the relevance of these theories to physics remains uncertain. Therefore, we will stick withcomplex numbers in this book.

Page 96: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 96/687

66 CHAPTER 2. QUANTUM MECHANICS

these languages.

Table 2.6: Glossary of terms used in general quantum logic, in classical phasespace and in the Hilbert space of quantum mechanics.

Nature Quantum logic Phase space Hilbert space

Statement proposition subset closed subspaceUnambiguous Atom Point Ray

statementAND meet intersection intersectionOR join union linear span

NOT orthocomplement relative orthogonal

complement complementIF...THEN implication inclusion inclusionof subsets of subspaces

Observable proposition-valued real function Hermitianmeasure on R operator

jointly compatible all observables commutingmeasurable propositions are compatible operatorsobservables

mutually exclusive disjoint propositions non-intersecting orthogonal subspacestatements subsetsPure state Probability measure delta function Ray

Mixed state Probability measure Probability function Density operator

2.6 Quantum observables and states

2.6.1 Observables

Each observable F naturally defines a mapping (called a proposition-valued measure ) from the set of intervals of the real line R to propositions F E in Lthat can be described in words: ”the value of the observable F is inside theinterval E of the real line R”. We already discussed properties of propositionsabout one observable. They can be summarized as follows:

• The proposition corresponding to the intersection of intervals E 1 and

Page 97: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 97/687

2.6. QUANTUM OBSERVABLES AND STATES 67

E 2 is the meet of propositions corresponding to these intervals

F E 1∩E 2 = F E 1 ∧ F E 2 (2.23)

• The proposition corresponding to the union of intervals E 1 and E 2 isthe join of propositions corresponding to these intervals

F E 1∪E 2 = F E 1 ∨ F E 2 (2.24)

• The proposition corresponding to the complement of interval E is theorthocomplement of the proposition corresponding to E

F R\E = F ⊥E (2.25)

• The minimum proposition corresponds to the empty subset of the realline

F ∅ = ∅ (2.26)

• The maximum proposition corresponds to the real line itself.

F R = I (2.27)

Intervals E of the real line form a Boolean (distributive) lattice withrespect to set theoretical operations ⊆, ∩, ∪, and \. Due to the isomorphism(2.23) - (2.27), the corresponding propositions F E also form a Boolean lattice,which is a sublattice of our full propositional system. Therefore, accordingto Theorem 2.38, all propositions about the same observable are compatible.Due to the isomorphism “propositions”↔“subspaces”, we can use the samenotation F E for subspaces (projections) in H corresponding to intervals E .Then, according to Lemma G.5, all F E commute with each other.

Each point f in the spectrum of observable F is called an eigenvalue of this observable (operator). The subspace F f ⊂ H corresponding to the

eigenvalue f is called eigensubspace and projection P f onto this subspace iscalled a spectral projection . Each vector in the eigensubspace is called eigen-vector or eigenstate . The observable F has definite values (=eigenvalues)in its eigenstates. This means that eigenstates are examples of states whoseexistence was guaranteed by Postulate 2.1.

Page 98: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 98/687

68 CHAPTER 2. QUANTUM MECHANICS

Consider two distinct eigenvalues f and g of observable F . The corre-

sponding intervals (=points) of the real line are disjoint. According to (2.23)- (2.27) propositions F f and F g are disjoint too, and corresponding subspacesare orthogonal. The linear span of subspaces F f , where f runs through entirespectrum of F , is the full Hilbert space H. Therefore, spectral projections of any observable form a decomposition of unity .23 So, according to discussionin Appendix G.2, we can associate an Hermitian operator

F =f

f P f (2.28)

with each observable F . In what follows we will often use terms ’observable’and ’Hermitian operator’ as synonyms.

2.6.2 States

As we discussed in subsection 2.3.1, each state φ of the system defines aprobability measure (φ|x) on propositions in quantum logic L. Accordingto the isomorphism ’propositions ↔ subspaces’, the state φ also defines aprobability measure (φ|X ) on subspaces X in the Hilbert space H. Thisprobability measure is a function from subspaces to the interval [0, 1] ⊆ R

whose properties follow directly from eqs. (2.2), (2.3), and Postulate 2.22

• The probability corresponding to the whole Hilbert space is 1 in allstates

(φ|H) = 1 (2.29)

• The probability corresponding to the empty subspace is 0 in all states

(φ|0) = 0 (2.30)

•The probability corresponding to the linear span of orthogonal sub-spaces is the sum of probabilities for each subspace

(φ|X ⊕ Y ) = (φ|X ) + (φ|Y ), if X ⊥ Y (2.31)

23see Appendix G.1

Page 99: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 99/687

2.6. QUANTUM OBSERVABLES AND STATES 69

The following important theorem provides a classification of all such proba-

bility measures (= all states of the physical system).Theorem 2.42 (Gleason [34]) If (φ|X ) is a probability measure on closed subspaces in the Hilbert space H with properties ( 2.29 ) - ( 2.31), then there exists a non-negative 24 Hermitian operator ρ25 in H such that 26

T r(ρ) = 1 (2.32)

and for any subspace X with projection P X the value of the probability mea-sure is

(φ|X ) = T r(P X ρ) (2.33)

The proof of this theorem is far from trivial and we refer interested readerto original works [34, 35]. Here we will focus on the physical interpretationof this result. First, we may notice that, according to the spectral theoremF.8, the operator ρ can be always written as

ρ = i ρi|eiei| (2.34)

where |ei is an orthonormal basis in H. Then the Gleason theorem meansthat

ρi ≥ 0 (2.35)i

ρi = 1 (2.36)

0 ≤ ρi ≤ 1 (2.37)

Among all states satisfying eq. (2.35) - (2.37) there are simple states for which just one coefficient ρi is non-zero. Then, from (2.36) it follows that ρi = 1,

24which means that all eigenvalues are greater than or equal to zero25which is called the density operator or the density matrix 26T r denotes trace of the matrix of the operator ρ, which is defined in Appendix F.7.

Page 100: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 100/687

70 CHAPTER 2. QUANTUM MECHANICS

ρ j = 0 if j = i, and the density operator degenerates to a projection onto the

one-dimensional subspace |eiei|.27

Such states will be called pure quantum states. It is also common to describe a pure state by a unit vector fromits ray. Any unit vector from this ray represents the same state, i.e., in thevector representation of states there is a freedom of choosing an unimodularphase factor of the state vector. In what follows we will often use the terms’pure quantum state’ and ’state vector’ as synonyms.

One can notice an important difference between classical pure states (fromsection 2.4) and quantum pure states. The probability measure correspond-ing to any classical pure state is given by a truth function which can onlytake values 0 or 1. The value of the probability measure corresponding tothe pure quantum state can be any real number between 0 and 1. Thus, un-

like classical pure states, quantum pure states are characterized by statisticaluncertainties. This conclusion agrees with the fundamental statement 2.5 of quantum mechanics.

Mixed quantum states are expressed as weighed sums of pure stateswhose coefficients ρi in eq. (2.34) reflect the probabilities with which thepure states enter in the mixture. Therefore, in quantum mechanics thereare uncertainties of two types. The first type is the uncertainty presentin mixed states. This uncertainty is already familiar to us from classical(statistical) physics. This uncertainty results from our insufficient controlof preparation conditions (like when a bullet is fired from a shaky riffle).The second uncertainty is present even in pure quantum states and is uniqueto quantum mechanics. It does not have a counterpart in classical physics,and it cannot be avoided by tightening the preparation conditions. Thisuncertainty is a reflection of the mysterious unpredictability of microscopicphenomena.

2.6.3 Commuting and compatible observables

In subsection 2.5.1 we defined the notion of compatible propositions. InAppendix G.2 we showed that the compatibility of propositions is equiva-lent to the commutativity of corresponding projections. The importance of

these definitions for physics comes from the fact that for a pair of compatiblepropositions (=projections=subspaces) there are states in which both thesepropositions are certain, i.e., simultaneously measurable. A similar state-

27One-dimensional subspaces are also called rays .

Page 101: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 101/687

2.6. QUANTUM OBSERVABLES AND STATES 71

ment can be made for two compatible (=commuting) Hermitian operators of

observables. According to Theorem G.9, these two operators have a commonbasis of eigenvectors (=eigenstates). In these eigenstates both observableshave definite (eigen)values.

We will assume that for any physical system there always exists a minimalset of mutually compatible (= commuting) observables F , G , H , . . ..28 Then,according to Appendix G.2 we can build an orthonormal basis of commoneigenvectors |ei such that each basis vector is uniquely labeled by eigenvaluesf i, gi, hi, . . . of operators F , G , H , . . ., i.e., if |ei and |e j are two eigenvectorsthen there is at least one different eigenvalue in the sets f i, gi, hi, . . . andf j , g j, h j, . . ..

Each state vector

can be represented as a linear combination of these

basis vectors

|φ =i

φi|ei (2.38)

where in the bra-ket notation (see Appendix F.3)

φi = ei|φ (2.39)

The set of coefficients φi can be viewed as a function φ(f , g , h , . . .) on the

common spectrum of observables F , G , H , . . .. In this form, the coefficientsφi are referred to as the wave function of the state |φ in the representa-tion defined by observables F , G , H , . . .. When the spectrum of operatorsF , G , H , . . . is continuous, the index i is, actually, a continuous variable. Forexample, wave functions in the momentum representation are denoted byφ( px, py, pz) = φ(p).29

As there are many different complete sets of mutually commuting op-erators, the same state can be represented by different wave functions in

28The set is called minimal if no observable from the set can be expressed as a functionof other observables from the same set. Any function of observables from the minimal

commuting set also commutes with F , G , H , . . . and with any other such function. In fact,there are many minimal sets of mutually commuting observables that do not commutewith each other. We will see in section 5.2 that for one massive spinless particle the threecomponents of position (Rx, Ry, Rz) and the three components of momentum (P x, P y, P z )are two examples of such minimal commuting sets.

29see subsection 5.2.2

Page 102: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 102/687

72 CHAPTER 2. QUANTUM MECHANICS

different representations. In chapter 5 we will construct wave functions of a

single particle in the position and momentum representations and learn howthese wave functions transform from one basis to another.

2.6.4 Expectation values

Eq. (2.28) defines a spectral decomposition for each observable F , whereindex f runs over all distinct eigenvalues of F . Then for each pure state |φwe can find the probability of measuring a value f of the observable F inthis state by using formula30

ρf =

mi=1

|ef i |φ|2 (2.40)

where |ef i are basis vectors in the range of the projection P f and m is thedimension of the corresponding subspace. This formula defines the probabil-ity distribution for values of the observable F in the state |φ. Sometimeswe also need to know the weighed average of values f which is called theexpectation value of the observable F in the state |φ and denoted F

F = f ρf f

Substituting here eq. (2.40) we obtain

F =n

j=1

|e j|φ|2f j

≡n

j=1

|φ j|2f j

where the summation is carried out over the entire basis|e j

of eigenvectors

of the operator F with eigenvalues f j . By using decompositions (2.38) and(2.28) we obtain a more compact formula for the expectation value

30This is simply the value of the probability measure (φ|P f ) (see subsection 2.3.1) cor-responding to the spectral pro jection P f .

Page 103: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 103/687

2.6. QUANTUM OBSERVABLES AND STATES 73

φ|F |φ = (i

φ∗i ei|)(

j

|e jf je j |)(k

φk|ek)

=ijk

φ∗if jφkei|e je j|ek

=ijk

φ∗if jφkδ ijδ jk

= j

|φ j|2f j

= F (2.41)

2.6.5 Basic rules of classical and quantum mechanics

Results obtained in this chapter can be summarized as follows. If we wantto calculate the probability ρ for measuring the value of the observable F inside the interval E of the real axis for a system prepared in a pure state φ,then we need to perform the following steps:

In classical mechanics:

1. Define the phase space S of the physical system;

2. Find a real function f : S → R corresponding to the observable F ;

3. Find the subset U of S corresponding to the subset E of the spectrum of the observable F (U is the set of all points s ∈ S such that f (s) ∈ E .);

4. Find the point sφ ∈ S representing the pure classical state φ;

5. The probability ρ is equal to 1 if sφ ∈ U and ρ = 0 otherwise.

In quantum mechanics:

1. Define the Hilbert space H of the physical system;

2. Find the Hermitian operator F in H corresponding to the observable;

3. Find the eigenvalues and eigenvectors of the operator F ;

Page 104: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 104/687

74 CHAPTER 2. QUANTUM MECHANICS

4. Find a spectral projection P E corresponding to the subset E of the

spectrum of the operator F .5. Find the unit vector |φ (defined up to an arbitrary unimodular factor)

representing the state of the system.

6. Use formula ρ = φ|P E |φAt this point, there seems to be no connection between the classical andquantum rules. However, we will see in subsection 5.3 that in the macroscopicworld with massive objects and poor resolution of instruments, the classicalrules emerge as a decent approximation to the quantum ones.

2.7 Is quantum mechanics a complete the-

ory?

In sections 2.3 - 2.6 of this chapter we focused on the mathematical formalismof quantum mechanics. Now it is time to discuss the physical meaning andinterpretation of these formal rules.

2.7.1 Quantum unpredictability in microscopic systems

Let us first review conclusions made in previous sections. In classical mechan-ics, we have Assertions of simultaneous measurability 2.2 and determinism2.3. These Assertions express the idea that each pure state of the physi-cal system has a set of well-defined properties, attributes, and observablesattached to it. Therefore, in classical mechanics, each pure state can be de-scribed by the full collection of its properties, e.g., as a point in the phasespace. With time, this point moves along a well-defined and predictabletrajectory. This is what can be called a complete description of the system.

It appears that for micro-systems Assertions 2.2 and 2.3 are no longervalid and we cannot guarantee the simultaneous measurability and deter-minism. Therefore we cannot say that the system has all its properties at

a given time. The complete classical description by points and trajectoriesin the phase space should be substituted by certain incomplete probabilisticdescription. This is the essence of quantum mechanics.

Let us illustrate this by two examples. We know from experience thateach photon passing through the hole in the camera obscura will hit the

Page 105: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 105/687

2.7. IS QUANTUM MECHANICS A COMPLETE THEORY? 75

photographic plate at some point on the photographic plate. Quantum me-

chanics allows us to calculate the probability density for these points, butapart from that, the behavior of each individual photon appears to be com-pletely random. Quantum mechanics does not even attempt to predict whereeach individual photon will hit the target.

Another example of such an apparently random behavior is the decay of unstable nuclei. The nucleus of 232T h has the lifetime of 14 billion years.This means that in any sample containing thorium, approximately half of all232T h nuclei will decay after 14 billion years. In principle, quantum mechan-ics can calculate the probability of the nuclear decay as a function of time bysolving the corresponding Schrodinger equation.31 However, quantum me-chanics cannot even approximately guess when any given nucleus will decay.

It could happen today, or it could happen 100 billion years from now.Although, such unpredictability is certainly a hallmark of microscopic

systems it would be wrong to think that it is not affecting our macroscopicworld. It appears that chance plays much bigger role in nature than we arewilling to recognize. Quite often the effect of random microscopic processescan be amplified to produce a sizable equally random macroscopic effect.Thus, looking at marks left by photons on the photographic plate or hearingclicks of a Geiger counter, we are witnessing truly random events whose exactdescription and prediction is beyond capabilities of modern science. Anotherfamous example of the amplification of quantum uncertainties is the thought

experiment with the “Schrodinger cat” [36].We see that quantum mechanics does not describe what actually hap-pens; it describes the full range of possibilities of what might have happenedand the probability of each possible outcome. Each time nature chooses justone possibility from this range, while obeying the probabilities predicted byquantum mechanics. QM cannot say anything about which particular choicewill be made by nature in each particular instance. These choices are com-pletely random and beyond explanation by modern science. This observationis a bit disturbing and embarrassing. Indeed, we have real physically mea-surable effects (the choices made by nature) for which we have no controland no power to predict the outcome. These are facts without an explana-

tion, effects without a cause. It seems that microscopic particles obey somemysterious random force. Then it is appropriate to ask what is the reason

31though our current knowledge of nuclear forces is insufficient to make a reliable cal-culation of that sort for thorium.

Page 106: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 106/687

76 CHAPTER 2. QUANTUM MECHANICS

for such stochastic behavior of micro-systems? Is it truly random or it just

seems to be random? If quantum mechanics cannot explain this randombehavior, maybe there is a deeper theory which can? In other words, isquantum mechanics a complete theory? There are basically two conflictinganswers widely discussed in literature: the “Copenhagen interpretation” andthe “realistic interpretation”.

2.7.2 The Copenhagen interpretation

The Copenhagen interpretation was advanced primarily by Bohr and Heisen-berg. The logic behind their approach is as follows. As we saw in the examplewith the “Heisenberg’s microscope” in subsection 2.2.3, each act of measure-

ment requires an “interaction” between the observed system and the mea-suring apparatus. This interaction necessarily disturbs the system, therefore,we cannot know exactly which properties the system “really” had before themeasurement. We can only know the properties of the system in contactwith the measuring apparatus. The Copenhagen school postulated that thisinteraction is universal, unavoidable, and has a random unpredictable effecton the result of measurement.

As real intrinsic properties of the system always escape observation, theclassical idea that each individual system possesses a certain collection of properties or definite values of observables should not be in the foundationof our theory. According to the Copenhagen interpretation, the correct way isto describe the state of each individual system as a mix or superposition of allpossible properties, each with its own probability. The electron or photon orunstable nucleus can be imagined as a cloud of probability or a wave function evolving in time according to certain rules, e.g., the Schrodinger equation.The act of measurement consists of interaction between the measuring ap-paratus and this probability cloud. Such interaction always selects just onepossibility among multiple choices present in the wave function. The cloud“collapses” and the probability becomes a certainty due to an unpredictableinteraction between the quantum particle and the classical measuring device.So, observables do not have certain values before the measurement. Their

values are created during the measurement process.In a short phrase, the Copenhagen interpretation claims that the proba-

bilistic behavior is an intrinsic characteristic of each individual system. Theprobabilistic behavior of the ensemble of identically prepared systems is just areflection of this uncontrollable probabilistic behavior of individual systems.

Page 107: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 107/687

2.7. IS QUANTUM MECHANICS A COMPLETE THEORY? 77

2.7.3 The realistic interpretation

Einstein was the most outspoken critic of the Copenhagen interpretation anda proponent of an alternative “realistic interpretation”. He wrote:

I think that a particle must have a separate reality independent of the measurements. That is an electron has spin, location and so

forth even when it is not being measured. I like to think that the moon is there even if I am not looking at it. A. Einstein

In the “realistic” approach to quantum mechanics, each system is sup-posed to have definite values of all its observables at each time whether wemeasure them or not, just as in good old classical mechanics. Measure-ments just reveal these values. In the realistic approach, each member of the ensemble moves along some deterministic trajectory in the phase space,and the quantum mechanical probabilistic description applies to the ensemble rather than to individual system. According to these views, it is wrong todescribe each individual system by the wave function. The wave function isan abstract mathematical concept that allows us to predict chances of dif-ferent outcomes of measurements performed in the ensemble. The collapse

of the wave function is not more mysterious than the collapse of a classicalprobability distribution when a bullet hits a target.

Then why ensembles do not behave predictably? Researchers from thisschool attribute the apparently random behavior of microsystems to some yetunknown “hidden” variables, which are currently beyond our observation andcontrol. According to these “realistic” views, put somewhat simplistically,each photon in camera obscura has a guiding mechanism which directs itto a certain predetermined spot on the photographic plate. Each unstablenucleus has some internal “alarm clock” ticking inside. The nucleus decayswhen the alarm goes off. The behavior of quantum systems just appears to

be random to us because so far we don’t have a clue about these “guidingmechanisms” and “alarm clocks”.

According to the “realistic” interpretation, quantum mechanics is not thefinal word, and future theory will be able to fully describe the properties of individual systems and predict events without relying on chance.

Page 108: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 108/687

78 CHAPTER 2. QUANTUM MECHANICS

2.7.4 The statistical interpretation of quantum mechan-

icsBoth Copenhagen and realistic interpretations of quantum mechanics haveserious problems. If we adopt the Copenhagen interpretation and describean individual particle by its wave function, then it is not clear how this wavefunction “knows” when it can undergo a continuous evolution described bythe Schrodinger equation and when it should make the sudden “collapse”?Where is the boundary between the measuring device and the quantum sys-tem? For example, it is customary to say that the photon is the quantum sys-tem and the photographic plate is the measuring device. However, we mightadopt a different approach and include the photographic plate together with

the photon in our quantum system. Then, we could, in principle, describeboth the photon and the photographic plate by a joint wave function. Whendoes this wave function collapse? Where is the measuring apparatus in thiscase? Human’s eye? Does it mean that while we are not looking, the entiresystem (photon + photographic plate) remains in a superposition state? If we stick with the Copenhagen interpretation to the end, we may easily reachan absurd conclusion that the ultimate measuring device is human’s brainand all events remain potentialities until they are registered by mind.

The realistic interpretation has its own problems. So far nobody wasable to build a convincing theory of hidden variables, and there are reasonsto believe that a simple theory of that sort does not exist. One serious

reason to be unsatisfied with the realistic approach is provided by the waywe formulated quantum mechanical Statements 2.4 and 2.5 as generalizationsof classical Assertions 2.2 and 2.3. Quantum mechanics simply abandonsthe unsupported claims of simultaneous measurability and determinism. So,quantum mechanics with its probabilities is a more general mathematicalconstruction, and classical mechanics with its determinism can be representedas a particular case of this general theory. As Mittelstaedt put it [37]

...classical mechanics is loaded with metaphysical hypotheses which clearly exceed our everyday experience. Since quantum mechanics

is based on strongly relaxed hypotheses of this kind, classical me-chanics is less intuitive and less plausible than quantum mechan-ics. Hence classical mechanics, its language and its logic cannot be the basis of an adequate interpretation of quantum mechanics.P. Mittelstaedt

Page 109: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 109/687

2.7. IS QUANTUM MECHANICS A COMPLETE THEORY? 79

It is impossible to deny one simple, and yet mysterious fact. The fact is

that if we prepare N absolutely identical physical systems and measure thesame observable in each of them, we may find N different results. Nobodyknows why it happens, why physical systems have this random unpredictablebehavior. This is just a fact of life. Quantum mechanics simply accepts thisfact and does not attempt to explain it.

This discussion leads to a philosophical viewpoint which is somewhat in-termediate between the realistic and Copenhagen interpretations and whichis called “statistical interpretation” or “ensemble interpretation”[38]. On theone hand, in agreement with Einstein, we can admit that each individual sys-tem has certain values of all observables with or without measurement.32 Onthe other hand, we can agree with Bohr and Heisenberg that full informa-

tion about these values cannot be obtained in experiment, and that realityis often unpredictable. This position was best explained (though not shared)by Einstein himself:

I now imagine a quantum theoretician who may even admit that the quantum-theoretical description refers to ensembles of systems and not to individual systems, but who, nevertheless, clings to the idea that the type of description of the statistical quantum theory will, in its essential features, be retained in the future. He may argue as follows: True, I admit that the quantum-theoretical de-scription is an incomplete description of the individual system. I

even admit that a complete theoretical description is, in principle,thinkable. But I consider it proven that the search for such a com-plete description would be aimless. For the lawfulness of nature is thus constructed that the laws can be completely and suitably

formulated within the framework of our incomplete description.To this I can only reply as follows: Your point of view - taken as theoretical possibility - is incontestable. A. Einstein [39]

In the rest of this book we will try to stay out of this fascinating philosoph-ical debate and follow the fourth approach to quantum mechanics attributedto Feynman: “Shut up and calculate!”

32A better way to think about it is to say that we just do not want to discuss whatproperties the system may or may not have while we are not measuring. As we discussedin the Introduction, physics is a science about results of measurements. So, the questionslike ”does the Moon exist when nobody is looking?” should be answered by philosophers,not by physicists.

Page 110: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 110/687

80 CHAPTER 2. QUANTUM MECHANICS

Page 111: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 111/687

Chapter 3

QUANTUM MECHANICSAND RELATIVITY

There must be no barriers for freedom of inquiry. There is noplace for dogma in science. The scientist is free, and must be

free to ask any question, to doubt any assertion, to seek for any evidence, to correct any errors.

J. Robert Oppenheimer

The main goal of this book is to try and build a consistent relativistic quan-tum theory. Two preceding chapters discussed the ideas of relativity andquantum mechanics separately. Now is the time to unify them in one theory.The major contribution to such an unification was made by Wigner who for-mulated and proved the famous Wigner theorem and developed the theoryof unitary representations of the Poincare group in the Hilbert space.1

3.1 Inertial transformations in quantum me-

chanicsThe relativity Postulate 1.1 tells us that any inertial laboratory L is physi-cally equivalent to any other laboratory L′ = gL obtained from L by applying

1see chapter 5

81

Page 112: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 112/687

82 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

an inertial transformation g.2 This means that for identically arranged ex-

periments in these two laboratories the corresponding probability measures(φ|X ) are the same. As shown in Fig. 1, laboratories are composed of twomajor parts: the preparation device P and the observer O. The inertialtransformation g of the laboratory results in changes of both the preparationdevice and observer. The change of the preparation device can be interpretedas a change of the state of the system. We can formally denote this changeby φ → gφ. The change of the observer (or measuring apparatus) can beviewed as a change of the experimental proposition X → gX . Then, themathematical expression of the relativity principle is that for any g, φ, andX

(gφ|gX ) = (φ|X ) (3.1)

In the rest of this chapter (and in chapters 4 – 6) we will develop a mathe-matical formalism for representing transformations gφ and gX in the Hilbertspace. This is the formalism of unitary representations of the Poincare group,which is a cornerstone of any relativistic approach in quantum physics.

3.1.1 Wigner theorem

Let us first focus on inertial transformations of propositions X → gX .3 Sup-pose that two observers O and O′ are related by an inertial transformation:O′ = gO. The experimental propositions attributed to the observer O forma propositional lattice L(H) which is realized as a set of closed subspaces inthe Hilbert space H. Observer O′ also represents her propositions as sub-spaces in the same Hilbert space H. As these two observers are equivalent,we may expect that their propositional systems have exactly the same math-ematical structures, i.e., they are isomorphic. This means that there existsa one-to-one mapping

2There is a hidden controversy in this statement. Here we assume that parameters φ;v; r; t of the transformation g are precisely specified. In particular, the position r

and the velocity v of the laboratory gL with respect to the laboratory L have definite

values. Strictly speaking, this is not consistent with quantum mechanics. The laboratoriesare, after all, quantum objects, and simultaneous exact determination of their positions,velocities, and orientations is not possible [40]. Frankly, I do not know how to avoid thisinconsistency here. The only justification is that laboratories are massive macroscopicobjects for which quantum uncertainties are ridiculously small (see section 5.3).

3We will turn to transformations of states φ → gφ in the next subsection.

Page 113: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 113/687

3.1. INERTIAL TRANSFORMATIONS IN QUANTUM MECHANICS 83

K g : L(H) → L(H)

that connects propositions of observer O with propositions of observer O′,such that all lattice relations between propositions remain unchanged. Inparticular, we will require that K g transforms atoms to atoms; K g mapsminimum and maximum propositions of O to the minimum and maximumpropositions of O′, respectively

K g( I ) = I (3.2)

K g(∅) = ∅ (3.3)

and for any X, Y ∈ L(H)

K g(X ∨ Y ) = K g(X ) ∨ K g(Y ) (3.4)

K g(X ∧ Y ) = K g(X ) ∧ K g(Y ) (3.5)

K g(X ⊥) = K g(X )⊥ (3.6)

As discussed in subsection 2.5.2, working with propositions is rather in-convenient. It would be better to translate conditions (3.2) - (3.6) into thelanguage of vectors in the Hilbert space. In other words, we would like to finda vector-to-vector transformation kg : H → H which generates the subspace-to-subspace transformation K g. More precisely, we demand that for eachsubspace X , if K g(X ) = Y , then the generator kg maps all vectors in X intovectors in Y , so that Span(kg(x)) = Y , where x runs through all vectors inX .

The problem with finding generators kg is that there are just too manyof them. For example, if a ray p goes to the ray K g( p), then the generator kgmust map each vector |x ∈ p somewhere inside K g( p), but the exact valueof kg|x remains undetermined. Actually, we can multiply each image vector

kg|x by an arbitrary nonzero factor η(|x), and still have a valid generator.Factors η(|x) can be chosen independently for each |x ∈ H. This freedomis very inconvenient from the mathematical point of view.

This problem is solved by the celebrated Wigner theorem [41] which statesthat we can always select factors η(|x) in such a way that the vector-to-

Page 114: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 114/687

84 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

vector mapping η(|x)kg becomes either linear and unitary or antilinear and

antiunitary.

4

Theorem 3.1 (Wigner) For any isomorphic mapping K g of a proposi-tional lattice L(H) onto itself, one can find either unitary or antiunitary transformation kg of vectors in the Hilbert space H, which generates K g. For a given K g only one of these two possibilities can be realized.

In this formulation, the Wigner theorem has been proven in ref. [42] (see also[43]). The significance of this theorem comes from the fact that there is a

powerful mathematical apparatus for working with unitary and antiunitarytransformations, so that their properties (and, thus, properties of subspacetransformations K g) can be studied in great detail.

From our study of inertial transformations in chapter 1, we know thatthere is always a continuous path from the identity transformation e = 0, 0, 0, 0 to any other element g = φ, v, r, t in the Poincare group. Theidentity transformation e is represented in the Hilbert space by the identityoperator (up to an arbitrary unimodular factor) which is, of course, unitary.It seems reasonable to demand that the mappings g → K g and g → kg arecontinuous, so, the representative kg cannot suddenly switch from unitaryto antiunitary along the path connecting e with g. Then we can reject the

antiunitary transformations as representatives of K g.5

Although Wigner’s theorem reduces the freedom of choosing generators,it does not eliminate this freedom completely: Two unitary transformationskg and βkg (where β is any unimodular constant) generate the same subspacemapping. Therefore, for each K g there is a set of generating unitary trans-formations U g differing from each other by a multiplicative constant. Such aset is called a ray of transformations [U g].

Results of this subsection can be summarized as follows: each inertialtransformation g of the observer can be represented by a unitary opera-tor U g in H defined up to an arbitrary unimodular factor: ket vectors are

transformed according to |x → U g|x and bra vectors are transformed as4See Appendix F.7 for definitions of antilinear and antiunitary operators.5 The antiunitary operators may still represent discrete transformations, e.g., time

inversion, but we agreed not to discuss such transformations in this book, because theydo not correspond to exact symmetries.

Page 115: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 115/687

3.1. INERTIAL TRANSFORMATIONS IN QUANTUM MECHANICS 85

x| → x|U −1g . If X =

i |eiei|6 is a projection (proposition) associated

with the observer O, then observer O′ = gO represents the same propositionby the projection

X ′ =i

U g|eiei|U −1g

= U gXU −1g

Similarly, if F =

i f i|eiei| is an operator of observable associated withthe observer O then

F ′ =i

f iU g|eiei|U −1g

= U gF U −1g (3.7)

is operator of the same observable from the point of view of the observerO′ = gO.

3.1.2 Inertial transformations of states

In the preceding subsection we analyzed the effect of an inertial transforma-

tion g on observers, measuring apparatuses, propositions, and observables.Now we are going to examine the effect of g on preparation devices and states.We will try to answer the following question: if |Ψ is a vector describing apure state prepared by the preparation device P , then which state vector |Ψ′describes the state prepared by the transformed preparation device P ′ = gP ?

To find the connection between |Ψ and |Ψ′ we will use the relativityprinciple. According to eq. (3.1), for every observable F , its expectationvalue should not change after inertial transformation of the entire laboratory(= both the preparation device and the observer). Mathematically, thiscondition can be written as

Ψ|F |Ψ = Ψ′|F ′|Ψ′= Ψ′|U gF U −1g |Ψ′ (3.8)

6Here |ei is an orthonormal basis in the subspace X , see Appendix G.

Page 116: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 116/687

86 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

This equation should be valid for any choice of observable F . Let us choose

F = |ΨΨ|, i.e., the projection onto the ray containing vector |Ψ. Then eq.(3.8) takes the form

Ψ|ΨΨ|Ψ = Ψ′|U g|ΨΨ|U −1g |Ψ′= Ψ′|U g|ΨΨ′|U g|Ψ∗

= |Ψ′|U g|Ψ|2

The left hand side of this equation is equal to 1. So, for each |Ψ, thetransformed vector |Ψ′ is such that

|Ψ′|U g|Ψ|2 = 1

Since both U g|Ψ and |Ψ′ are unit vectors, we must have

|Ψ′ = σ(g)U g|Ψ

where σ(g) is an unimodular factor. Operator U g is defined up to a unimod-ular factor,7 therefore, we can absorb the factor σ(g) into the uncertainty of U g and finally write the action of the inertial transformation g on states

|Ψ → |Ψ′ = U g|Ψ (3.9)

Then, taking into account the transformation law for observables (3.7) wecan check that, in agreement with the relativity principle, the expectationvalues remain the same in all laboratories

F ′ = Ψ′|F ′|Ψ′= (Ψ|U −

1

g )(U gF U −1

g )(U g|Ψ)= Ψ|F |Ψ= F (3.10)

7see subsection 3.1.1

Page 117: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 117/687

3.1. INERTIAL TRANSFORMATIONS IN QUANTUM MECHANICS 87

3.1.3 The Heisenberg and Schrodinger pictures

The conservation of expectation values (3.10) is valid only in the case whenthe inertial transformation g is applied to the laboratory as a whole. Whatwill happen if only observer or only preparation device are transformed?

Let us first consider inertial transformations of observers. If we change theobserver without changing the preparation device (=state) then operators of observables change according to (3.7) while the state vector remains the same|Ψ. As expected, this transformation changes the results of experiments. Forexample, the expectation values of observable F are generally different fordifferent observers O and O′ = gO

F ′ = Ψ|(U gF U −1g )|Ψ= Ψ|F |Ψ= F (3.11)

On the other hand, if the inertial transformation is applied to the preparationdevice and the state of the system changes according to eq. (3.9), then theresults of measurements are also affected

F ′′

= (

Ψ

|U −1g )F (U g

)

= Ψ|F |Ψ= F (3.12)

Formulas (3.11) and (3.12) play a prominent role because many problemsin physics can be formulated as questions about descriptions of the samephysical system by different observers. An important example is dynamics ,i.e., the time evolution of the system. In this case one considers time trans-lation elements of the Poincare group g = 0, 0, 0, t. Then eqs. (3.11) and(3.12) provide two equivalent descriptions of dynamics. Eq. (3.11) describesdynamics in the Heisenberg picture . In this picture the state vector of the

system remains fixed while operators of observables change with time. Eq.(3.12) provides an alternative description of dynamics in the Schr¨ odinger picture . In this description, operators of observables are time-independent,while the state vector of the system depends on time. These two pictures areequivalent because according to (3.1) a shift of the observer by g (forward

Page 118: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 118/687

88 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

time translation) is equivalent to the shift of the preparation device by g−1

(backward time translation).One can realize that the notions of Schrodinger and Heisenberg picturescan be applied not only to time translations. They can be generalized toother types of inertial transformations: space translations, rotations, andboosts.

3.2 Unitary representations of the Poincare

group

In the preceding section we discussed the representation of a single inertial

transformation g by an isomorphism K g of the lattice of propositions and bya ray of unitary operators [U g], which act on states and/or observables in theHilbert space. We know from chapter 1 that inertial transformations form thePoincare group. Then subspace mappings K g1, K g2, K g3, . . . corresponding todifferent group elements g1, g2, g3, . . . cannot be arbitrary. They must satisfyconditions

K g2K g1 = K g2g1 (3.13)

K g−1 = K −1g (3.14)

K g3(K g2K g1) = (K g3K g2)K g1 (3.15)

which reflect the group properties of inertial transformations g.8 Our goal inthis section is to find out which conditions are imposed by (3.13) - (3.15) onthe set of unitary representatives U g of the Poincare group.

3.2.1 Projective representations of groups

For each group element g let us choose an arbitrary unitary representativeU g in the ray [U g]. For example, let us choose the representatives (also calledgenerators) U g1

∈[U g1], U g2

∈[U g2], and U g2g1

∈[U g2g1]. The product U g2U g1

should generate the mapping K g2g1, therefore it can differ from our chosenrepresentative U g2g1 by at most a unimodular constant α(g2, g1). So, we canwrite for any two transformations g1 and g2

8see Appendix A.2

Page 119: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 119/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 89

U g2U g1 = α(g2, g1)U g2g1 (3.16)

The factors α have three properties. First, they are unimodular.

|α(g2, g1)| = 1 (3.17)

Second, from the property (A.2) of the unit element we have for any g

U gU e

= α(g, e)U g

= U g

(3.18)

U eU g = α(e, g)U g = U g (3.19)

which implies

α(g, e) = α(e, g) = 1 (3.20)

Third, the associative law (3.15) implies

U g3(α(g2, g1)U g2g1) = (α(g3, g2)U g3g2)U g1α(g2, g1)α(g3, g2g1)U g3g2g1 = α(g3g2, g1)α(g3, g2)U g3g2g1

α(g2, g1)α(g3, g2g1) = α(g3g2, g1)α(g3, g2) (3.21)

The mapping U g from group elements to unitary operators in H is called aprojective group representation of the group if it satisfies eqs (3.16), (3.17),(3.20), and (3.21).

3.2.2 Elimination of central charges in the Poincarealgebra

In principle, we could keep the arbitrarily chosen unitary representatives of the subspace transformations U g1, U g2, . . ., as discussed above, and work withthus obtained projective representation of the Poincare group, but this wouldresult in a rather complicated mathematical formalism. The theory would

Page 120: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 120/687

90 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

be significantly simpler if we could judiciously choose the representatives9 in

such a way that the factors α(g2, g1) in (3.16) are simplified or eliminatedaltogether. Then we would have a much simpler linear unitary group repre-sentation (see Appendix H) instead of the projective group representation.In this subsection we are going to demonstrate that in any projective repre-sentation of the Poincare group such elimination of factors α(g2, g1) is indeedpossible [44].

The proof of the last statement is significantly simplified if conditions(3.17), (3.20), and (3.21) are expressed in the Lie algebra notation. In the

vicinity of the unit element of the group we can use vectors ζ from thePoincare Lie algebra to identify other group elements (see eq. (E.1)), i.e.

g = e ζ

= exp(10a=1

ζ ata)

where ta is the basis of the Poincare Lie algebra (H, P , K, J ) from subsection1.3.1. Then we can write unitary representatives U g of inertial transforma-tions g in the form

U ζ = exp(−i

10a=1

ζ aF a) (3.22)

where is a real constant which will be left unspecified at this point,10 andF a are ten Hermitian operators in the Hilbert space H called the generators of the unitary projective representation. Then we can write eq. (3.16) in theform

U ζ U ξ = α( ζ, ξ )U ζ ξ (3.23)

Since α is unimodular we can set α( ζ,

ξ ) = exp[iκ(

ζ,

ξ )], where κ(

ζ,

ξ ) isa real function. The conditions (3.20) and (3.21) then can be rewritten in

terms of κ

9i.e., multiply unitary operators U g by unimodular factors U g → β (g)U g10We will identify with the Planck constant in subsection 4.1.1

Page 121: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 121/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 91

κ( ζ, 0) = κ( 0, ζ ) = 0 (3.24)

κ( ξ, ζ ) + κ( χ, ξ ζ ) = κ( χ ξ, ζ ) + κ( χ, ξ ) (3.25)

Note that we can write the lowest order term in the Taylor series for κ nearthe group identity element in the form (see also eq. (E.4))

κ( ζ, ξ ) =

10ab=1

habζ aξ b (3.26)

The constant term, the terms linear in ζ a and ξ b, as well as the terms propor-

tional to ζ aζ b and ξ aξ b are absent on the right hand of (3.26) as a consequenceof the condition (3.24).

Using the same arguments as during our derivation of eq. (E.6), we can

expand all terms in (3.23) around ζ = ξ = 0

(1 − i

10a=1

ξ aF a − 1

2 2

10bc=1

ξ bξ cF bc + . . .)(1 − i

10a=1

ζ aF a − 1

2 2

10bc=1

ζ bζ cF bc + . . .)

= (1 + i

10

ab=1habζ aξ b + . . .)(1 − i

10

a=1(ζ a + ξ a +

10

bc=1f abcξ bζ c + . . .)F a

− 1

2 2

10ab=1

(ζ a + ξ a + . . .)(ζ b + ξ b + . . .)F ab + . . .)

Equating the coefficients in front of terms with different powers of ξ a and ζ b

on both sides, we obtain commutators of generators F a

F bF c − F cF b = i 10a=1

C abcF a + E bc (3.27)

where C abc

are familiar structure constants of the Poincare Lie algebra (1.39)- (1.46) and E bc = i 2(hcb − hbc) are imaginary constants, which depend onthe choice of representatives U g ∈ [U g].11 These constants are called central

11To be exact, we must write E bc on the right hand side of eq. (3.27) multiplied by theidentity operator I . However, we will omit the symbol I here for brevity.

Page 122: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 122/687

92 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

charges . Our main task in this subsection is to prove that representatives

U g can be chosen in such a way that E bc = 0, i.e., the central charges geteliminated.First we will choose an arbitrary set of representatives U g . In accordance

with our notation in section 1.3 we will use symbols

(H, P, J, K) (3.28)

to denote the generators F a of the projective representation U g. These gener-ators correspond to time translation, space translation, rotations, and boosts,respectively. Then using the structure constants C abc of the Poincare Lie al-

gebra from eqs (1.39) - (1.46) we obtain the full list of commutators (3.27).

[J i, P j] = i 3

k=1

ǫijk P k + E (1)ij (3.29)

[J i, J j] = i 3

k=1

ǫijk(J k + iE (2)k ) (3.30)

[J i, K j] = i 3

k=1ǫijkK k + E

(3)ij (3.31)

[P i, P j] = E (4)ij (3.32)

[J i, H ] = E (5)i (3.33)

[P i, H ] = E (6)i (3.34)

[K i, K j] = −i

c2

3k=1

ǫijk (J k + iE (7)k ) (3.35)

[K i, P j] = −i

c2Hδ ij + E

(8)ij , (3.36)

[K i, H ] = −i P i + E (9)i (3.37)

Here we arranged E bc into nine sets of central charges E (1) . . . E (9). In eq.(3.30) and (3.35) we took into account that their left hand sides are an-tisymmetric tensors. So, the central charges must form an antisymmetrictensor as well, and, according to Table D.1, they can be represented as

Page 123: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 123/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 93

− 3k=1 ǫijkE

(2)k and c−2

3k=1 ǫijkE

(7)k , respectively, where E

(2)k and E

(7)k

are 3-vectors.Next we will use the requirement that commutators (3.29) - (3.37) mustsatisfy the Jacobi identity.12 This will allow us to make some simplifications.For example, using P 3 = − i

[J 1, P 2] +

i

E (1)12 , and the fact that all constants

E commute with generators of the group, we obtain

[P 3, P 1] = − i

[([J 1, P 2] + E

(1)12 ), P 1]

= − i

[[J 1, P 2], P 1]

= −i

[[P 1, P 2], J 1] −i

[[J 1, P 1], P 2]

= − i

[E (4)12 , J 1] − i

[E (1)11 , P 2]

= 0

so E (4)31 = 0. Similarly, we can show that E

(4)ij = E

(5)i = E

(6)i = 0 for all values

of i, j = 1, 2, 3.Using the Jacobi identity we further obtain

i [J 3, P 3] = [[J 1, J 2], P 3]

= [[P 3, J 2], J 1] + [[J 1, P 3], J 2]

= i [J 1, P 1] + i [J 2, P 2] (3.38)

and, similarly,

i [J 1, P 1] = i [J 2, P 2] + i [J 3, P 3] (3.39)

By adding eqs. (3.38) and (3.39) we see that

[J 2, P 2] = 0 (3.40)

Similarly, we obtain [J 1, P 1] = [J 3, P 3] = 0, which means that

E (1)ii = 0 (3.41)

12eq. (E.10), which is equivalent to the associativity conditions (3.15) and (3.25)

Page 124: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 124/687

94 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

Using the Jacobi identity again, we obtain

i [J 2, P 3] = [[J 3, J 1], P 3]

= [[P 3, J 1], J 3] + [[J 3, P 3], J 1]

= −i [J 3, P 2]

This antisymmetry property is also true in the general case (for any i, j =1, 2, 3; i = j)

[J i, P j] = −[J j , P i] (3.42)

Putting together (3.40) and (3.42) we see that tensor [J i, P j] is antisymmetric.

This implies that we can introduce a vector E (1)k such that

E (1)ij = −

3i=1

ǫijkE (1)k

and

[J i, P j] = i 3

i=1ǫijk (P k + iE

(1)k ) (3.43)

Similarly, we can show that E (3)ii = 0 and

[J i, K j ] = i 3i=1

ǫijk(K k + iE (3)k )

Taking into account the above results, commutation relations (3.29)-(3.37) now take the form

[J i, P j] = i 3

k=1

ǫijk(P k + iE (1)k ) (3.44)

[J i, J j] = i 3

k=1

ǫijk(J k + iE (2)k ) (3.45)

Page 125: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 125/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 95

[J i, K j ] = i 3

k=1 ǫijk(K k + iE (3)k ) (3.46)

[P i, P j ] = [J i, H ] = [P i, H ] = 0 (3.47)

[K i, K j ] = −i

c2

3k=1

ǫijk(J k + iE (7)k ) (3.48)

[K i, P j ] = −i

c2Hδ ij + E

(8)ij , (3.49)

[K i, H ] = −i P i + E (9)i (3.50)

where E on the right hand sides are certain imaginary constants. The next

step in elimination of the central charges E is to use the freedom of choos-ing unimodular factors β (g) in front of operators of the representation U g:

Two unitary operators U ζ and β ( ζ )U ζ differing by a unimodular factor β ( ζ )generate the same subspace transformation. Correspondingly, the choice of generators F a has some degree of arbitrariness as well. Since β ( ζ ) are uni-modular, we can write

β ( ζ ) = exp(iγ ( ζ )) ≈ 1 + i

10

a=1Raζ a

Therefore, in the first order, the presence of factors β ( ζ ) results in addingsome real constants Ra to generators F a. We would like to show that byadding such constants we can make all central charges equal to zero.

Let us now add constants R to the generators P j , J j, and K j and denotethe redefined generators as

P j = P j + R(1) j

J j = J j + R(2) j

K j = K j + R(3) j

Then commutator (3.45) takes the form

Page 126: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 126/687

96 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

[J i, J j] = [J i + R(2)i , J j + R(2) j ]

= [J i, J j]

= i 3

k=1

ǫijk(J k + iE (2)k )

So, if we choose R(2)k = iE

(2)k , then

[J i, J j ] = i

3

k=1 ǫijkJ k

and central charges are eliminated from this commutator.Similarly, central charges can be eliminated from commutators

[J i, P j] = i 3

k=1

ǫijkP k

and

[J i, K j] = i 3

k=1

ǫijkK k (3.51)

by choosing R(1)k = iE

(1)k and R

(3)k = iE

(3)k . From eq. (3.51) we then obtain

[K 1, K 2] = − i

[[J 2, K 3], K 2]

= −i

[[J 2, K 2], K 3] −i

[[K 2, K 3], J 2]

= − i

[−i

c2(J 1 + iE

(7)1 ), J 2]

= −i

c2J 3

Page 127: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 127/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 97

so, our choice of the constants R(1)k , R

(2)k , and R

(3)k eliminates the central

charges E

(7)

i .From eq. (3.51) we also obtain

[K 3, H ] = − i

[[J 1, K 2], H ]

= − i

[[H, K 2], J 1] − i

[[J 1, H ], K 2]

= −[J 1, P 2]

= −i P 3

which implies that the central charge E (9) is canceled as well. Finally

[K 1, P 2] = − i

[[J 2, K 3], P 2]

= − i

[[J 2, P 2], K 3] +

i

[[K 3, P 2], J 3]

= 0

and

[K 1, P 1] = −i

[[J 2, K 3], P 1]

= − i

[[J 2, P 1], K 3] +

i

[[K 3, P 1], J 3]

= [K 3, P 3]

This implies that E (8)ij = 0 if i = j, and we can introduce a real scalar E (8)

such that

E (8)11 = E

(8)22 = E

(8)33 ≡ −i

c2E (8)

and

[K i, P i] = −i

c2δ ij(H + E (8))

Page 128: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 128/687

98 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

Finally, by redefining the generator of time translations H = H + E (8) we

eliminate all central charges from the commutation relations of the PoincareLie algebra.

[J i, P j ] = i 3

k=1

ǫijkP k (3.52)

[J i, J j ] = i 3

k=1

ǫijkJ k (3.53)

[J i, K j ] = i 3

k=1 ǫijkK k (3.54)

[P i, P j ] = [J i, H ] = [P i, H ] = 0 (3.55)

[K i, K j ] = −i

c2

3k=1

ǫijkJ k (3.56)

[K i, P j ] = −i

c2Hδ ij (3.57)

[K i, H ] = −i P i (3.58)

Thus Hermitian operators H , P, J, and K provide a representation of thePoincare Lie algebra and the redefined unitary operators β (g)U g form a

unique unitary representation of the Poincare group that corresponds to thegiven projective representation U g in the vicinity of the group identity. Wehave proven that each projective representation is equivalent to a certain uni-tary representation, which are much easier objects for study (see AppendixH).

Commutators (3.52) - (3.58) are probably the most important equationsof relativistic quantum theory. In the rest of this book we will have manyopportunities to appreciate a deep physical content of these formulas.

3.2.3 Single- and double-valued representations

In the preceding subsection we eliminated the phase factors α(g2, g1) fromeq. (3.16) by resorting to Lie algebra arguments. However, these argumentswork only in the vicinity of the group’s unit element. There is a possibilitythat non-trivial phase factors may reappear in the multiplication law ( 3.16)

Page 129: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 129/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 99

when the group manifold has a non-trivial topology and group elements are

considered which are far from the unit element.The simplest example of a group with such a non-trivial topology is therotation group (see Appendix D). Note that two rotations around the sameaxis by angles φ and φ+2πn (with integer n) are physically indistinguishable.Then the region of independent rotation vectors13 in R

3 can be described asthe interior of the sphere of radius π with opposite points on the surface of the sphere identified. This set of points will be referred to as ball Π (see Fig.3.1). The unit element 0 is in the center of the ball. We will be interestedin one-parameter families of group elements which form continuous curves inthe group manifold Π. Since the opposite points on the surface of the ballare identified in our topology, any continuous path that crosses the surface

must reappear on the opposite side of the sphere (see Fig. 3.1(a)).A topological space is simply connected if every loop can be continuously

deformed to a single point. An example of a simply connected topologicalspace is the surface of a sphere. However, the manifold Π of the rotationparameters is not simply connected. The loop shown in Fig. 3.1(a) crossesthe sphere once and can not be shrunk to a single point. However, the loopshown in Fig. 3.1(b) can be continuously deformed to a point. It appearsthat for any rotation R there are two classes of paths from the group’s unitelement 0 to R. They are also called the homotopy classes . These twoclasses consist of paths that cross the surface of the sphere Π even and odd

number of times, respectively. Two paths from different classes cannot becontinuously deformed to each other.

Just as in the Poincare algebra, the central charges can be eliminatedin the Lie algebra of the rotation group by a proper choice of numericalconstants added to generators. Then a unitary representation of the rotationgroup can be constructed in which the identity rotation is represented bythe identity operator, and by traveling a small loop in the group manifoldfrom the identity element 0 back to 0 we will end up with the identityoperator I again. However, if we travel the long path 0 → A → A′ → 0 in Fig. 3.1(a), there is no guarantee that in the end we will find thesame representative of the identity transformation. We can get some other

equivalent unitary operator from the ray containing I , so the representativeof 0 may acquire a phase factor eiφ after travel along such a loop. On

13Recall from Appendix D.5 that direction of the rotation vector φ coincides with theaxis of rotation, and its length φ is the rotation angle.

Page 130: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 130/687

100 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

00

AABB

B’A’

00

AA

A’

(a) (b)

ΠΠ ΠΠ

Figure 3.1: The space of parameters of the rotation group is not simplyconnected: (a) a loop which starts from the center of the ball 0, reachesthe surface of the sphere Π at point A, and then continues from the oppositepoint A′ back to 0; this loop cannot be continuously collapsed to 0,because it crosses the surface an odd number of times (1); (b) a loop 0 →A → A′ → B → B′ → 0 which crosses the surface of the sphere Π twicecan be deformed to the point 0. This can be achieved by moving the pointsA′ and B (and, correspondingly the points A and B′) close to each other, sothat the segment A′ → B of the path disappears.

Page 131: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 131/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 101

the other hand, making two passes on the loop 0 → A → A′ → 0 →

A → A′ → 0 we obtain a loop which crosses the surface of the spheretwice, and hence can be deformed to a point. Therefore e2iφ = 1, and eiφ =

±1. This demonstrates that there are two types of unitary representationsof the rotation group: single-valued and double-valued. For single-valued representations, the representative of the identity rotation is always I . Fordouble-valued representations, the identity rotation has two representativesI and −I , and the product of two operators in (3.16) may have a non-trivialsign factor

U g1U g2 = ±U g1g2

Since the rotation group is a subgroup of the Poincare group, the lattergroup also has a non-trivial global topology. Then, one should also takeinto account both single- and double-valued representations of the Poincaregroup.

3.2.4 The fundamental statement of relativistic quan-tum theory

The most important result of this chapter is the connection between relativityand quantum mechanics summarized in the following statement (see, e.g., [9])

Statement 3.2 (Unitary representations of the Poincare group) In a relativistic quantum description of a physical system, inertial transforma-tions are represented by unitary operators which furnish a unitary (single- or double-valued) representation of the Poincare group in the Hilbert space of the system.

It is important to note that this statement is completely general. The Hilbertspace of any isolated physical system (no matter how complex) must carry aunitary representation of the Poincare group. Construction of Hilbert spaces

and Poincare group representations in them is the major part of theoreticaldescription of physical systems. The rest of this book is primarily devotedto performing these difficult tasks.

Basic inertial transformations from the Poincare group are represented in

the Hilbert space by unitary operators: e− iPr for spatial translations, e− i

J φ

Page 132: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 132/687

102 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

for rotations, e− iKc θ for boosts, and e

iHt for time translations,14 A gen-

eral inertial transformation g = φ, v(

θ), r, t is represented by the unitaryoperator15

U g = e− iJ φe− i

Kc θe− i

Pre

iHt (3.59)

Then, in the Schrodinger picture state vectors transform between differentinertial reference frames according to

|Ψ′ = U g|Ψ (3.60)

In the Heisenberg picture inertial transformations of observables have theform

F ′ = U gF U −1g (3.61)

For example, the equation describing the time evolution of the observable F in the Heisenberg picture16

F (t) = eiHtF e− i

Ht (3.62)

= F +i

[H, F ]t −1

2 [H, [H, F ]]t2 + . . . (3.63)

can be written in the differential form

dF (t)

dt=

i

[H, F ]

which is the familiar Heisenberg equation .Note also that analogous “Heisenberg equations” can be written for trans-

formations of observables with respect to space translations, rotations, andboosts

14The exponential form of the unitary group representatives follows from eq. (3.22). Weuse a different sign for the time translations, because, unlike space translations, rotations,and boosts, they are normally considered as active transformations.

15compare with eq. (1.16)16see eq. (E.13)

Page 133: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 133/687

3.2. UNITARY REPRESENTATIONS OF THE POINCAR E GROUP 103

dF (r)dr

= − i

[P, F ]

dF ( φ)

d φ= − i

[J, F ]

dF ( θ)

d θ= −ic

[K, F ]

We already discussed the point17 that transformations of observables withrespect to inertial transformations of observers cover many interesting prob-lems in physics (the time evolution, Lorentz transformations, etc.). From the

above formulas we see that solution of these problems requires the knowl-edge of commutators between observables F and generators of the Poincaregroup representation H, P, J, and K. In the next chapter we will discussdefinitions of various observables, their connections to Poincare generators,and their commutation relations.

17see subsection 3.1.3

Page 134: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 134/687

104 CHAPTER 3. QUANTUM MECHANICS AND RELATIVITY

Page 135: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 135/687

Chapter 4

OPERATORS OFOBSERVABLES

Throwing pebbles into the water, look at the ripples they form on the surface, otherwise, such occupation becomes an idle pastime.

Kozma Prutkov

In chapters 2 and 3 we established that in quantum theory any physical sys-tem is described by a complex Hilbert space H, pure states are representedby rays in H, observables are represented by Hermitian operators in H, andthere is a unitary representation U g of the Poincare group in H which de-termines how state vectors and operators of observables change when thepreparation device or the measuring apparatus undergoes an inertial trans-formation. Our next goal is to clarify the structure of the set of observables.In particular, we wish to find which operators correspond to such familiarobservables as velocity, momentum, energy, mass, position, etc, what aretheir spectra, and what are the relationships between these operators?

In this chapter we will focus on observables whose operators can be ex-pressed as functions of generators (P, J, K, H ) of the Poincare group rep-resentation U g. In chapter 7, we will meet other observables, such as thenumber of particles. They cannot be expressed through ten generators of thePoincare group.

105

Page 136: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 136/687

106 CHAPTER 4. OPERATORS OF OBSERVABLES

4.1 Basic observables

4.1.1 Energy, momentum, and angular momentum

The generators of the Poincare group representation in the Hilbert space of any system are Hermitian operators H , P, J, and K, and we might suspectthat they are related to certain observables pertinent to this system. Whatare these observables? In order to get a hint, let us now postulate that theconstant introduced in subsection 3.2.2 is the Planck constant

= 6.626 · 10−34kg · m2

s(4.1)

whose dimension can be also expressed as < >=< mass >< speed ><distance >. Then the dimensions of generators can be found from the con-dition that the arguments of exponents in (3.59) must be dimensionless

• < H >= <><time>

=< mass >< speed >2;

• < P >= <><distance>

=< mass >< speed >;

• < J >=< >=< mass >< speed >< distance >

• < K >= <><speed> =< mass >< distance >;

Based on these dimensions we can guess that these are observables of energy

(or Hamiltonian ) H , momentum P, and angular momentum J of the sys-tem.1 We will call them basic observables . Operators H , P, and J generatetransformations of the system as a whole, so we will assume that these areobservables for the entire system, i.e., the total energy, the total momentum,and the total angular momentum. Of course, these dimensionality consider-ations are not a proof. The justification of these choices will become moreclear later, when we consider properties of operators and relations betweenthem.

Using this interpretation and commutators in the Poincare Lie algebra(3.52) - (3.58), we immediately obtain commutation relations between oper-

ators of observables. Then we know which pairs of observables can be simul-taneously measured. For example, we see from (3.55) that energy is simul-taneously measurable with the momentum and angular momentum. From

1There is no common observable directly associated with the boost generator K, butwe will see later that K is related to observables of position and spin.

Page 137: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 137/687

4.1. BASIC OBSERVABLES 107

(3.53) it is clear that different components of the angular momentum cannot

be measured simultaneously. These facts are well-known in non-relativisticquantum mechanics. Now we have them as direct consequences of the prin-ciple of relativity.

From commutators (3.52) - (3.58) we can also find formulas for transfor-mations of operators H , P, J, and K from one inertial frame to another. Forexample, each vector observable F = P, J or K transforms under rotationsas2

F( φ) = e− iJ φFe

iJ φ = F cos φ +

φ

φ(F ·

φ

φ)(1 − cos φ) − F ×

φ

φsin φ(4.2)

The boost transformation law for generators of translations is3

P(θ) = e− iKc θPe

iKc θ = P +

θ

θ(P ·

θ

θ)(cosh θ − 1) −

θ

cθH sinh θ(4.3)

H (θ) = e− iKc θHe

iKc θ = H cosh θ − c(P ·

θ

θ) sinh θ (4.4)

It also follows from (3.55) that energy H , momentum P, and angular mo-

mentum J do not depend on time, i.e., they are conserved quantities .

4.1.2 The operator of velocity

The operator of velocity is defined as4 (see, e.g., [45, 46]).

V =Pc2

H

Denoting V(θ) the velocity of the system measured in the frame of referencemoving with the speed v = c tanh θ along the x-axis, we obtain

2see eq. (D.19)3see eqs. (1.49) and (1.50)4The ratio of operators is well-defined here because P and H commute with each other.

Page 138: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 138/687

108 CHAPTER 4. OPERATORS OF OBSERVABLES

V x(θ) = e− iK xcθP xc2

H eiK xcθ

=c2P x cosh θ − cH sinh θ

H cosh θ − cP x sinh θ

=c2P xH −1 − c tanh θ

1 − cP xH −1 tanh θ

=V x − c tanh θ

1 − V xc

tanh θ

=V x − v

1−

V xv

c2

(4.5)

V y(θ) =V y

(1 − V xc

tanh θ) cosh θ=

V y

1 − v2/c2

(1 − V xvc2

), (4.6)

V z(θ) =V z

(1 − V xc tanh θ) cosh θ

=V z

1 − v2/c2

(1 − V xvc2 )

(4.7)

These formulas coincide with the usual relativistic law of addition of veloci-ties . In the limit c → ∞ they reduce to the familiar non-relativistic form

V x(v) = V x−

v

V y(v) = V y

V z(v) = V z

4.2 Casimir operators

Observables H , P, V and J depend on the observer, so they do not representintrinsic fundamental properties of the system. For example, if a system hasmomentum p in one frame of reference, then there are other (moving) framesof reference in which the momentum takes any other value from R

3. The sys-

tem’s momentum depends on both the state of the system and the referenceframe in which the observation is made. Are there observables which reflectsome intrinsic observer-independent properties of the system? If there aresuch observables (they are called Casimir operators ), their operators mustcommute with all generators of the Poincare group. It can be shown that

Page 139: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 139/687

4.2. CASIMIR OPERATORS 109

the Poincare group has only two independent Casimir operators [47]. Any

other Casimir operator of the Poincare group is a function of these two. So,there are two invariant physical properties of any physical system. One suchproperty is mass , which is a measure of the matter content in the system.The corresponding Casimir operator will be considered in subsection 4.2.2.Another invariant property is related to the speed of rotation of the systemaround its own axis or spin . The Casimir operator corresponding to thisinvariant property will be found in subsection 4.2.3.

4.2.1 4-vectors

Before addressing the Casimir operators, let us introduce some useful defi-

nitions. We will call a quadruple of operators (A0, A1, A2, A3) a 4-vector 5

if (A1, A2, A3) is a 3-vector, A0 is a 3-scalar, and their commutators with theboost generators are

c[Ki, A j] = −i A0δ ij (4.8)

c[ K, A0] = −i A (4.9)

Then, it is easy to show that the 4-square A2−A20 of a 4-vector is a 4-scalar ,i.e., it commutes with both rotations and boosts. For example,

[K x, A2 − A20] = [K x, A2x + A2y + A2z − A20]

=i

c(AxA0 + A0Ax − A0Ax − AxA0)

= 0

Therefore, in order to find the Casimir operators of the Poincare group weshould be looking for two functions of the Poincare generators, which are4-vectors and, in addition, commute with H and P. Then 4-squares of these4-vectors are guaranteed to commute with all Poincare generators.

4.2.2 The operator of massIt is easy to see that four operators (H, cP) satisfy all conditions specified insubsection 4.2.1 for 4-vectors. These operators are usually called the energy-

5see also Appendix I.1

Page 140: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 140/687

110 CHAPTER 4. OPERATORS OF OBSERVABLES

momentum 4-vector . Then we can construct the first Casimir invariant called

the mass operator as the 4-square of this 4-vector

M = +1

c2√

H 2 − P2c2 (4.10)

The operator of mass must be Hermitian, therefore we demand that for anyphysical system H 2−P2c2 ≥ 0, i.e., that the spectrum of operator H 2−P2c2

does not contain negative values. Honoring the fact that masses of all knownphysical systems are non-negative we choose the positive value of the squareroot in (4.10). Then the relationship between the energy, momentum, andmass takes the form

H = +√

P2c2 + M 2c4 (4.11)

In the non-relativistic limit (c → ∞) we obtain from eq. (4.11)

H ≈ Mc2 +P2

2M

which is the sum of the famous Einstein’s rest mass energy E = Mc2 andthe usual kinetic energy term P2

2M .

4.2.3 The Pauli-Lubanski 4-vector

The second 4-vector commuting with H and P is the Pauli-Lubanski operator whose components are defined as6

W 0 = (P · J) (4.12)

W =1

cH J − c[P × K] (4.13)

Let us check that all required 4-vector properties are, indeed, satisfied for(W 0, W). We can immediately observe that

6These definitions involve products of Hermitian commuting operators, therefore oper-ators W 0 and W are guaranteed to be Hermitian.

Page 141: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 141/687

4.2. CASIMIR OPERATORS 111

[J, W 0] = 0

so W 0 is a scalar. Moreover, W 0 changes its sign after changing the sign of P so it is a pseudoscalar.7 W is a pseudovector, because it does not changeits sign after changing the signs of K and P and

[J i, W j] = i 3

k=1

ǫijkW k

Let us now check the commutators with boost generators

c[K x, W 0] = c[K x, P xJ x + P yJ y + P zJ z]

= −i (HJ x

c− cP yK z + cP zK y)

= −i W x (4.14)

c[K x, W x] = [K x, HJ x − c2P yK z + c2P zK y]

= i (−P xJ x − P yJ y − P zJ z)

= −i W 0 (4.15)

c[K x, W y] = [K x, HJ y − c

2

P zK x + c

2

P xK z]= i (HK z − P xJ y − HK z + P xJ y)

= 0 (4.16)

[K x, W z] = 0 (4.17)

Putting equations (4.14) - (4.17) together we obtain the characteristic 4-vector relations (4.8) - (4.9)

c[K, W 0] = −i W (4.18)

c[K i, W

j] =

−i δ

ijW 0 (4.19)

Next we need to verify that commutators with generators of translationsare all zero. First, for W 0 we obtain

7For definitions of pseudoscalars, pseudovectors, etc. see subsection 1.2.4.

Page 142: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 142/687

112 CHAPTER 4. OPERATORS OF OBSERVABLES

[W 0, H ] = [P · J, H ] = 0

[W 0, P x] = [J xP x + J yP y + J zP z, P x]

= P y[J y, P x] + P z[J z, P x]

= −i P yP z + i P zP y

= 0

For the vector part W we obtain

[W, H ] = −c[[P × K], H ]= −c[P, H ] × K − cP × [K, H ]

= 0

[W x, P x] =1

c[HJ x, P x] − c[[P × K]x, P x]

= −c[P yK z − P zK y, P x]

= 0

[W x, P y] =1

c[HJ x, P y] − c[[P × K]x, P y]

=i

c

HP z

−c[P yK z

−P zK y, P y]

=i

cHP z − i

cHP z

= 0

So, all components of the 4-vector (W 0, W) commute with all components of (H, P). This proves that the 4-square of the Pauli-Lubanski 4-vector

Σ2 = W2 − W 20

is a Casimir operator. Although operators (W 0

, W) do not have direct phys-ical interpretation, we will find them very useful in the next section for de-riving the operators of position R and spin S. For these calculations we willneed the commutators between components of the Pauli-Lubanski 4-vector.For example,

Page 143: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 143/687

4.3. OPERATORS OF SPIN AND POSITION 113

[W x, W y] = [W x, 1c

HJ y + cP xK z − cP zK x]

= i (1

cHW z − W 0P z)

[W 0, W x] = [W 0,1

cHJ x − cP yK z + cP zK y]

= −i P yW z + i P zW y

= −i [P × W]x

The above equations are easily generalized for all components

[W i, W j] =i

c

3k=1

ǫijk (HW k − cW 0P k) (4.20)

[W 0, W j] = −i [P × W] j (4.21)

4.3 Operators of spin and position

Now we are ready to tackle the problem of finding expressions for spin andposition as functions of generators of the Poincare group [48, 49, 50, 51].

4.3.1 Physical requirements

We will be looking for the total spin operator S and the center-of-mass posi-tion operator R which have the following natural properties:

(I) Owing to the similarity between spin and angular momentum,8 wedemand that S is a pseudovector (just like J)

8It is often stated that spin is a purely quantum-mechanical observable which doesnot have a classical counterpart. We do not share this point of view in this book. From

classical mechanics we know that the total angular momentum of a body is a sum of twoparts. The first part is the angular momentum resulting from the linear movement of thebody as a whole with respect to the observer. The second part is related to the rotationof the body around its own axis, or spin. The only significant difference between classicaland quantum intrinsic angular momenta (spins) is that the latter has a discrete spectrum,while the former is continuous.

Page 144: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 144/687

114 CHAPTER 4. OPERATORS OF OBSERVABLES

[J j , S i] = i 3

k=1

ǫijkS k

(II) and that components of S satisfy the same commutation relations ascomponents of J (3.53)

[S i, S j] = i 3

k=1

ǫijkS k (4.22)

(III) We also demand that spin can be measured simultaneously with mo-mentum

[P, S] = 0;

(IV) and with position

[R, S] = 0 (4.23)

(V) From the physical meaning of R it follows that space translations of the observer simply shift the values of position.

e− iP xaRxe

iP xa = Rx − a

e− iP xaRye

iP xa = Ry

e− iP xaRze

iP xa = Rz

This implies the following commutation relations

[Ri, P j] = i δ ij (4.24)

Page 145: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 145/687

4.3. OPERATORS OF SPIN AND POSITION 115

(VI) Finally, we will assume that position is a true vector (see subsection

1.2.4)

[J i, R j ] = i 3

k=1

ǫijkRk (4.25)

4.3.2 The spin operator

Now we would like to make the following guess about the form of the spinoperator9

S =W

Mc− W 0P

M (Mc2 + H )(4.26)

=H J

Mc2− P × K

M − P(P · J)

(H + Mc2)M (4.27)

which is a pseudovector commuting with P as required by the above condi-tions (I) and (III). Next we can verify that condition (II) is also valid for thisoperator. To calculate the commutators (4.22) between spin components wedenote

F ≡ −M −1(Mc2 + H )−1 (4.28)

use commutators (4.20) and (4.21), the equality

(P · W) =1

cH (P · J) =

1

cHW 0 (4.29)

and eq. (D.15). Then

[S x, S y] = [F W 0P x + W xMc

, F W 0P y + W yMc

]

9Note that operator S has the mass operator M in the denominator, so expressions(4.26) and (4.27) have mathematical sense only for systems with strictly positive massspectrum.

Page 146: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 146/687

116 CHAPTER 4. OPERATORS OF OBSERVABLES

= i (−F P x(P × W)yMc

+F P y(P × W)x

Mc+

HW z − cW 0P zM 2c3

)

= i (−F [P × [P × W]]zMc

+HW z − cW 0P z

M 2c3)

= i (−F (P z(P · W) − W zP2)

Mc+

HW z − cW 0P zM 2c3

)

= i (−F (P zHW 0c−1 − W zP2)

Mc+

HW z − cW 0P zM 2c3

)

= i W z(P2F

Mc+

H

M 2c3) + i P zW 0(− HF

Mc2− 1

M 2c2)

For the expressions in parentheses we obtain

P2F

Mc+

H

M 2c3= − P 2

M 2c(Mc2 + H )+

H

M 2c3

=H (Mc2 + H ) − P 2c2

M 2c3(Mc2 + H )

=H (Mc2 + H ) − (Mc2 + H )(H − Mc2)

M 2c3(Mc2 + H )

=1

Mc

and

− HF

Mc2− 1

M 2c2=

H

M 2c2(Mc2 + H )− 1

M 2c2

=H − (Mc2 + H )

M 2c2(Mc2 + H )

= − 1

M (Mc2 + H )= F

Thus, property (4.22) follows

[S x, S y] = i (W zMc

+ F W 0P z)

= i S z

Page 147: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 147/687

4.3. OPERATORS OF SPIN AND POSITION 117

Let us now prove that spin squared S2 is a function of M 2 and Σ2, i.e., a

Casimir operator

10

S2 = (W

Mc+ W 0PF )2

=W2

M 2c2+

2W 0F P · W

Mc+ W 20P2F 2

=W2

M 2c2+

2W 0F H P · J

Mc2+ W 20P2F 2

=W2

M 2c2+ W 20F (

2H

Mc2+ P2F )

= W

2

M 2c2 + W 20F 2H (Mc

2

+ H ) − P

2

c

2

Mc2(Mc2 + H )

=W2

M 2c2− W 20

H 2 + 2HMc2 + M 2c4

M 2c2(Mc2 + H )2

=W2 − W 20

M 2c2

=Σ2

M 2c2

So far we guessed the form of the spin operator and verified that the requiredproperties are satisfied.11 In subsection 4.3.6 we will demonstrate that S is

the unique operator satisfying all conditions from subsection 4.3.1.

4.3.3 The position operator

Now we are going to switch to the derivation of the position operator. Herewe will follow a similar route: we will first guess the form of the operator Rand then in subsection 4.3.7 we will prove that this is the unique expressionsatisfying all requirements from subsection 4.3.1. Our guess for R is theNewton-Wigner position operator 12 [48, 49, 50, 51, 52]

10The invariance of the absolute value of spin is evident for macroscopic freely moving

objects. Indeed, no matter how we translate, rotate or boost the frame of reference wecannot stop the spinning motion of the system or force it to spin in the opposite direction.

11Property (IV) will be examined in subsection 4.3.3.12Similarly to the operator of spin (see footnote on page 115), the Newton-Wigner

position operator is defined only for systems whose mass spectrum is strictly positive.

Page 148: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 148/687

118 CHAPTER 4. OPERATORS OF OBSERVABLES

R = −c2

2(H −1K + KH −1) − c

2

P × SH (Mc2 + H )

(4.30)

= −c2H −1K − i c2

2

P

H 2− cP × W

MH (Mc2 + H )(4.31)

which is a true vector having the property (V), e.g.,

[Rx, P x] = −c2

2[(H −1K x + K xH −1), P x]

=i

2

(H −1H + HH −1)

= i

[Rx, P y] = −c2

2[(H −1K x + K xH −1), P y]

= 0

Let us now calculate13

J − R × P

= J + c2H −1K × P +c2[P × S] × P

H (Mc2 + H )

= J + c2H −1K × P − c2

H (Mc2 + H )(P(P · S) − SP2)

= J + c2H −1K × P − c2

H (Mc2 + H )(P(P · S) − S

c2(H − Mc2)(H + Mc2))

= J + c2H −1K × P + S − c2P(P · S)

H (Mc2 + H )− Mc2

H S

= J + c2H −1K × P + S − c2P(P · S)

H (Mc2 + H )

13Note that [K xP y

−K yP x, H ] =

−i (P xP y

−P yP x) = 0, therefore K

×P commutes

with H and operator H −1K×P is Hermitian. In this derivation we also use

P · S =P · JH

M c2− P2(P · J)

P 2(H − M c2)

Mc2= P · J

Page 149: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 149/687

4.3. OPERATORS OF SPIN AND POSITION 119

−J +c2P(P · J)

H (Mc2 + H )+

c2

H P × K

= S

Therefore, just as in classical physics, the total angular momentum is a sumof two parts: the orbital angular momentum R×P and the intrinsic angular momentum or spin S

J = R × P + S

Finally, we can check that condition (IV) is satisfied, e.g.,

[S x, Ry] = [J x − [R × P]x, Ry]

= i Rz − [P yRz − P zRy, Ry]

= i Rz − i Rz

= 0

Theorem 4.1 All components of the position operator commute with each other: [Ri, R j ] = 0.

Proof. First, we calculate the commutator [HRx, HRy] which is related to[Rx, Ry] via formula

[HRx, HRy] = [HRx, H ]Ry + H [HRx, Ry]

= H [Rx, H ]Ry + H [H, Ry]Rx + H 2[Rx, Ry]

= i c2(P xRy − RyP x) + H 2[Rx, Ry]

= i c2[P × R]z + H 2[Rx, Ry]

= −i c2J z + i c2S z + H 2[Rx, Ry] (4.32)

Using formula (4.31) for the position operator, we obtain

[HRx, HRy] = [−c2K x − i c2

2

P xH

+ cF [P × W]x, −c2K y − i c2

2

P yH

+ cF [P × W]y]

Page 150: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 150/687

120 CHAPTER 4. OPERATORS OF OBSERVABLES

Non-zero contributions to this commutator are

[−c2K x, −c2K y] = c4[K x, K y]

= −i c2J z

[−i c2

2

P xH

, −c2K y] = −i c2

2[K y,

P xH

]

= −i c4

2i

P yP xH 2

[−c2K x, −i c2

2

P yH

] =i c4

2i

P yP xH 2

[−c2K x, +cF [P × W]y]

=c3

M [K x,

P zW x − P xW zH + Mc2

]

=c3

M (−P zW x − P xW z

(H + Mc2)2[K x, H ] +

P z[K x, W x]

H + Mc2− [K x, P x]W z

H + Mc2)

= i c3(MF 2(P zW x − P xW z)P x + F P zW 0c−1 − F HW zc−2) (4.33)

[cF [P × W]x, −c2K y]

= −c3(−MF 2(P yW z − P zW y)[K y, H ] + F P z[K y, W y] − F [K y, P y]W z)

= i c3(MF 2(P yW z − P zW y)P y + F P zW 0c−1 − F HW zc−2) (4.34)

and

[cF [P × W]x, cF [P × W]y]

= c2F 2[P yW z − P zW y, P zW x − P xW z]

= c2F 2(P zP y[W z, W x] − P 2z [W y, W x] + P xP z[W y, W z])

= i cF 2(P zP y(HW y − cW 0P y) + P 2z (HW z − cW 0P z) + P xP z(HW x − cW 0P x))

= i cF 2(−W 0cP z(P 2x + P 2y + P 2z ) + HP z(P xW x + P yW y + P zW z))

= i c2F 2(−

W 0P zP2 + HP z(P·

W)/c)

= i c2F 2(−W 0P zP2 + H 2P zW 0/c2)

= i F 2W 0(−P z(H 2 − M 2c4) + H 2P z)

=i c4W 0P z

(H + Mc2)2(4.35)

Page 151: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 151/687

4.3. OPERATORS OF SPIN AND POSITION 121

Adding together eqs (4.33) and (4.34) and using (4.29) we obtain

[−c2K x, cF [P × W]y] + [cF [P × W]x, −c2K y]

= i c3(MF 2[P × [P × W]]z + 2F P zW 0c−1 − 2F HW zc−2)

= i c3(MF 2(P z(P · W) − W zP2) + 2F P zW 0c−1 − 2F HW zc−2)

= i c3(MF 2(P zHW 0/c − W zP2) + 2F P zW 0c−1 − 2F HW zc−2)

= i c2MF 2P zW 0(H − 2(H + Mc2)) + i cMF 2W z(−(H − Mc2)(H + Mc2)

+2H (H + Mc2))

= i c3P zW 0(F c−1 − M 2F 2c) +i cW z

M (4.36)

Adding together eqs (4.35) and (4.36) we finally calculate

[HRx, HRy] = −i c2J z + i c3P zW 0(F − M 2F 2c) +i cW z

M + i c4M 2F 2W 0P z

= −i c2J z + i c2(− P zW 0M (H + Mc2)

+W zMc

)

= −i c2J z + i c2S z

Comparing this with eq. (4.32) we obtain

H 2[Rx, Ry] = 0

and

[Rx, Ry] = 0

4.3.4 An alternative set of basic operators

So far, our plan was to construct operators of observables from 10 basicgenerators P, J, K, H . However, this set of operators is often difficultto use in calculations due to rather complicated commutation relations inthe Poincare Lie algebra (3.52) - (3.58). For systems with a strictly positive

Page 152: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 152/687

122 CHAPTER 4. OPERATORS OF OBSERVABLES

spectrum of the mass operator, it is sometimes more convenient to use an

alternative set of basic operators P, R, S, M whose commutation relationsare much simpler

[P, M ] = [R, M ] = [S, M ] = [Ri, R j] = [P i, P j] = 0 (4.37)

[Ri, P j ] = i δ ij

[P, S] = [R, S] = 0

[S i, S j ] = i 3

k=1

ǫijkS k (4.38)

Summarizing our previous discussion, we can express operators in this setthrough generators of the Poincare group

P = P (4.39)

R = −c2

2(H −1K + KH −1) − cP × W

MH (Mc2 + H )(4.40)

S = J − R × P (4.41)

M = +1

c2√

H 2 − P2c2 (4.42)

Conversely, we can express generators of the Poincare group through opera-tors P, R, S, M . The momentum operator P is the same in both sets. Forthe energy and angular momentum we obtain

H = +√

M 2c4 + P2c2 (4.43)

J = R × P + S (4.44)

and the expression for the boost operator is

−1

2c2(RH + H R)

−P × S

Mc2 + H (4.45)

= − 1

2c2(−c2

2(H −1KH + K) − c2P × S

Mc2 + H )

− 1

2c2(−c2

2(K + H KH −1) − c2P × S

Mc2 + H ) − P × S

Mc2 + H

Page 153: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 153/687

4.3. OPERATORS OF SPIN AND POSITION 123

=1

4(H −1KH + K + K + H KH −1)

= K − i 4

(H −1P − PH −1)

= K

These two sets provide equivalent descriptions of Poincare invariant theories.Any function of operators from the set P, J, K, H can be expressed as afunction of operators from the set P, R, S, M and vice versa . We will usethis property in subsections 4.3.5, 6.3.2 and 6.3.3.

4.3.5 Canonical form and “power” of operators

In this subsection, we would like to mention some mathematical facts whichwill be helpful in further calculations. When performing calculations withfunctions of Poincare generators, we meet a problem that the same operatorcan be expressed in many equivalent functional forms. For example, accord-ing to (3.58) K xH and HK x − i P x are two forms of the same operator. Tosolve this non-uniqueness problem, we will agree to write operator factorsalways in the canonical form , i.e., from left to right in the following order14:

C (P x, P y, P z, H ), J x, J y, J z, K x, K y, K z (4.46)

Consider, for example, the product K yP yJ x. This operator is not in thecanonical form. To bring it to the canonical form, first we need to move themomentum factor to the left. Using (3.57) we obtain

K yP yJ x = P yK yJ x + [K yP y]J x

= P yK yJ x − i

c2HJ x (4.47)

The second term in eq. (4.47) is already in the canonical form, but the firstterm is not. We need to switch factors J x and K y there. Then

14Since H, P x, P y, and P z commute with each other, the part of the operator depend-ing on these factors can be written as an ordinary function of commuting argumentsC (P x, P y, P z, H ), whose order is irrelevant.

Page 154: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 154/687

124 CHAPTER 4. OPERATORS OF OBSERVABLES

K yP yJ x = P yJ xK y + P y[K y, J x] − i c2

HJ x

= P yJ xK y − i P yK z − i

c2HJ x (4.48)

Now all terms in (4.48) are in the canonical form.The procedure for bringing a general operator to the canonical form is

not more difficult than in the above example. If we call the original operatorthe primary term , then this procedure can be formalized as the followingsequence of steps: First we transform the primary term to the canonicalform. We do that by switching the order of pairs of neighboring factors if

they occur in the “wrong” order. Let us call them the “left factor” L and the“right factor” R. If R happens to commute with L, then such a change hasno other effect. If R does not commute with L, then the result of the switchis LR → RL + [L, R]. This means that apart from switching we must addanother secondary term to the original expression which is obtained from theprimary term by replacing the product LR with the commutator [L, R].15

At the end of the first step we have all factors in the primary term in thecanonical order. If during this process all commutators [L, R] were zero, thenwe are done. If there were nonzero commutators, then we have a number of additional secondary terms. In the general case, these terms are not yet in

the canonical form, and the above procedure should be repeated for themresulting in tertiary , etc. terms until all terms are in the canonical order.Then, for each operator there is a unique representation as a sum of terms

in the canonical form

F = C 00 +3i=1

C 10i J i +3i=1

C 01i K i +3

ij=1

C 11ij J iK j +3

ij=1

C 02ij K iK j + . . .(4.49)

where C αβ = C αβ (P x, P y, P z, H ) are functions of translation generators.We will also find useful the notion of a power of terms A in (4.49), which

will be written as pow(A)) and defined as the number of factors J and K in the term. For example, the first term on the right hand side of (4.49)has power 0. The second and third terms have power 1, etc. The power of a

15The second and third terms on the right hand side of (4.48) are secondary.

Page 155: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 155/687

4.3. OPERATORS OF SPIN AND POSITION 125

general operator (which is a sum of several terms) is defined as the maximum

power among terms in A. For operators considered earlier in this chapter,we have

pow[H ] = pow[P] = pow[V] = 0

pow[W 0] = pow[W] = pow[S] = pow[R] = 1

Lemma 4.2 If L and R are operators from the list ( 4.46 ) and [L, R] = 0,then

pow([L, R]) = pow(L) + pow(R) − 1

Proof. The commutator [L, R] is non-zero in two cases.1. pow(L) = 1 and pow(R) = 0 (or, equivalently, pow(L) = 0 and

pow(R) = 1). From commutation relations (3.52), (3.55), (3.57), and (3.58),it follows that non-vanishing commutators between Lorentz generators andtranslation generators are functions of translation generators, i.e., have zeropower. The same is true for commutators between Lorentz generators andarbitrary functions of translation generators C (P x, P y, P z, H ).

2. If pow(L) = 1 and pow(R) = 1, then pow([L, R]) = 1 follows directlyfrom commutators (3.53), (3.54), and (3.56).

The primary term for the product of two terms AB has exactly the samenumber of Lorentz generators as the original operator, i.e., pow(A)+ pow(B).

Lemma 4.3 For two terms A and B, either secondary term in the product AB is zero or its power is equal to pow(A) + pow(B) -1.

Proof. Each secondary term results from replacing a product of two gener-ators LR in the primary term with their commutator [L, R]. According to

Lemma 4.2, if [L, R] = 0 such a replacement decreases the power of the termby 1.

The powers of the tertiary and higher order terms are less than the power of the secondary terms. Therefore, for any product AB

Page 156: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 156/687

126 CHAPTER 4. OPERATORS OF OBSERVABLES

pow(AB) = pow(BA) = pow(A) + pow(B)

This implies

Theorem 4.4 16 For two non-commuting terms A and B

pow([A, B]) = pow(A) + pow(B) − 1

Proof. In the commutator AB − BA, the primary term of AB cancelsout the primary term of BA. If [A, B] = 0, then the secondary terms do notcancel. Therefore, there is at least one non-zero secondary term whose poweris pow(A) + pow(B) − 1 according to Lemma 4.3.

Having at our disposal basic operators P, R, S, and M we can forma number of Hermitian scalars, vectors, and tensors which are classified intable 4.1 according to their true/pseudo character and power:

Table 4.1: Scalar, vector, and tensor functions of basic operatorspower 0 power 1 power 2

True scalar P 2; M P · R + R · P R2; S 2

Pseudoscalar P · S R · SPseudovector S; P × RTrue vector P R; P × S R × S

True tensor P iP j3

k=1 ǫijkS k; P iR j + R jP i S iS j + S jS i; RiR jPseudotensor

3k=1 ǫijkP k

3k=1 ǫijkRk; P iS j RiS j

4.3.6 The uniqueness of the spin operator

Let us now prove that (4.26) is the unique spin operator satisfying conditions(I) - (IV) from subsection 4.3.1. Suppose that there is another spin operatorS′ satisfying these conditions. Denoting the power of the spin components

16This theorem was used by Berg in ref. [50].

Page 157: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 157/687

4.3. OPERATORS OF SPIN AND POSITION 127

by p = pow(S ′x) = pow(S ′y) = pow(S ′z) we obtain from (4.22) and Theorem

4.4

pow([S ′x, S ′y]) = pow(S ′z)

2 p − 1 = p

Therefore, the components of S′ must have power 1. The most general formof a pseudovector operator having power 1 can be deduced from Table 4.1

S′ = b(M, P 2)S + f (M, P 2)P

×R + e(M, P 2)(S

·P)P

where b, f , and e are arbitrary real functions. From the condition [S′, P] = 0we obtain f (M, P 2) = 0. Comparing the commutator17

[S ′x, S ′y] = [bS x + e(S · P)P x, bS y + e(S · P)P y]

= b2[S x, S y] − i ebP x[S × P]y + i ebP y[S × P]x

= i b2S z − i eb(P × [S × P])z

= i (b2S z − ebP2S z + eb(S · P)P z)

with the requirement (4.22)

[S ′x, S ′y] = i S ′z= i (bS z + e(S · P)P z)

we obtain the system of equations

b2 − ebP2 = b

eb = e

whose solution is b = 1 and e = 0. Therefore, the spin operator is uniqueS′ = S.

17Here we used equation [S, (S · P)] = i S×P.

Page 158: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 158/687

128 CHAPTER 4. OPERATORS OF OBSERVABLES

4.3.7 The uniqueness of the position operator

Assume that in addition to the Newton-Wigner position operator R thereis another position operator R′ satisfying all properties (IV) - (VI). Thenit follows from condition (V) that R′ has power 1. The most general truevector with this property is

R′ = a(P 2, M )R + d(P 2, M )S × P + g(P 2, M )P

where a, d, and g are arbitrary real functions. From the condition (4.23) itfollows, for example, that

0 = [R′x, S y]

= d(P 2, M )[S yP z − S zP y, S y]

= i d(P 2, M )P yS x

which implies that d(P 2, M ) = 0. From [R′i, P j] = i δ ij it follows that

i = [R′x, P x]

= a(P 2, M )[Rx, P x]

= i a(P 2

, M )

This implies that a(P 2, M ) = 1. Therefore the most general form of theposition operator is

R′ = R + g(P 2, M )P (4.50)

In Theorem 10.1 we will consider boost transformations for times and posi-tions of events in non-interacting systems of particles. If the term g(P2, M )Pin (4.50) were non-zero, we would not get an agreement with Lorentz trans-

formations (J.2) - (J.5) known from Einstein’s special relativity. Therefore,we will assume that the factor g(P2, M ) is zero. So, from now on, we willuse the Newton-Wigner operator R as the representative of the position ob-servable.

It follows from the commutator (4.24) that

Page 159: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 159/687

4.3. OPERATORS OF SPIN AND POSITION 129

[Rx, P nx ] = i nP n−1x (4.51)

so for any function f (P x)

[Rx, f (P x)] = i ∂f (P x)

∂P x

For example,

[R, H ] = [R,√

P 2c2 + M 2c4]

= i ∂ √ P 2c2 + M 2c4

∂ P

=i Pc2√

P 2c2 + M 2c4

= i Pc2

H = i V

Therefore, as expected, for an observer shifted in time by the amount t, the

position of the physical system appears shifted by Vt:

R(t) = exp(i

Ht)R exp(− i

Ht) (4.52)

= R +i

[H, R]t

= R + Vt (4.53)

4.3.8 Boost transformations of the position operator

Let us now find how the vector of position (4.30) transforms with respect toboosts, i.e., we are looking for the connection between position observablesin two inertial reference frame moving with respect to each other. For sim-plicity, we consider a massive system without spin, so that the center-of-massposition in the reference frame at rest O can be written as

Page 160: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 160/687

130 CHAPTER 4. OPERATORS OF OBSERVABLES

R = −c2

2 (KH −1 + H −1K)

First, we need to determine the boost transformation for the operator of boost. For example, the transformation of the component K y with respectto the boost along the x-axis is obtained by using eqs. (E.13), (3.54) and(3.56)

K y(θ)

= e− iK xcθK ye

iK xcθ

= K y −icθ

[K x, K y] −c2θ2

2! 2 [K x, [K x, K y]] +

ic3θ3

3! 3 [K x, [K x, [K x, K y]]] + . . .

= K y − θ

cJ z +

θ2

2!K y − θ3

3!cJ z . . .

= K y cosh θ − 1

cJ z sinh θ

Then the y-component of position in the reference frame O′ moving alongthe x-axis is

Ry(θ) = e− iK xcθRye

iK xcθ

= −c2

2 e−iK xcθ

(K yH −1

+ H −1

K y)e

iK xcθ

= −c2

2(K y cosh θ − J z

csinh θ)(H cosh θ − cP x sinh θ)−1

−c2

2(H cosh θ − cP x sinh θ)−1(K y cosh θ − J z

csinh θ) (4.54)

Similarly, for the x- and z -components

Rx(θ) = −c2

2K x(H cosh θ − cP x sinh θ)−1

−c2

2 (H cosh θ − cP x sinh θ)

−1K x (4.55)

Rz(θ) = −c2

2(K z cosh θ +

J yc

sinh θ)(H cosh θ − cP x sinh θ)−1

−c2

2(H cosh θ − cP x sinh θ)−1(K z cosh θ +

J yc

sinh θ) (4.56)

Page 161: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 161/687

4.3. OPERATORS OF SPIN AND POSITION 131

Furthermore, we can find the time dependence of the position operator in

the moving reference frame O′. We use label t′ to indicate the time measuredin the reference frame O′ by its own clock and notice that the time translationgenerator H ′ in O′ is different from that in O

H ′ = e− iK xcθHe

iK xcθ (4.57)

Then we obtain

R(θ, t′) = eiH ′t′R(θ)e− i

H ′t′

= e

iH ′t′

e−iK xcθ

Re

iK xcθ

e−iH ′t′

= (e− iK xcθe

iHt′e

iK xcθ)e− i

K xcθRe

iK xcθ(e− i

K xcθe− i

Ht′e

iK xcθ)

= e− iK xcθe

iHt′Re− i

Ht′e

iK xcθ

= e− iK xcθ(R + Vt′)e

iK xcθ

= R(θ) + V(θ)t′ (4.58)

where position R(θ) and velocity V(θ) in the reference frame O′ are givenby eqs. (4.54) - (4.56) and (4.5) - (4.7), respectively.

Page 162: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 162/687

132 CHAPTER 4. OPERATORS OF OBSERVABLES

Page 163: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 163/687

Chapter 5

SINGLE PARTICLES

Physics is becoming so unbelievably complex that it is taking longer and longer to train a physicist. It is taking so long, in fact, totrain a physicist to the place where he understands the nature of physical problems that he is already too old to solve them.

Eugene P. Wigner

Our discussion in the preceding chapter could be universally applied to anarbitrary isolated physical system, be it an electron or the Solar System. Wehave not specified how the system was put together and we considered onlytotal observables pertinent to the system as a whole. The results we obtainedare not surprising: the total energy, momentum and angular momentum of any isolated system are conserved, and the center of mass is moving witha constant speed along straight line (4.53). Although the time evolution of these total observables is rather uneventful, the internal structure of com-plex (compound) physical systems may undergo dramatic changes due tocollisions, reactions, decays, etc. The description of such transformations isthe most interesting and challenging part of physics. To address such prob-lems, we need to define how complex physical systems are put together. The

central idea of this book is that all material objects are composed of elemen-tary particles i.e., systems without components and internal structure. Inthis chapter we will study these most fundamental ingredients of nature.

We established in subsection 3.2.4 that the Hilbert space of any physicalsystem carries a unitary representation of the Poincare group. Any unitary

133

Page 164: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 164/687

134 CHAPTER 5. SINGLE PARTICLES

representation of the Poincare group can be decomposed into a direct sum of

irreducible representations. Elementary particles are defined as physical sys-tems for which this sum has only one summand. Therefore, the Hilbert spaceHa of a stable elementary particle a carries an irreducible unitary represen-tation of the Poincare group. The classification of irreducible representationsof the Poincare group and their Hilbert spaces was given by Wigner [2]. FromSchur’s first Lemma H.1 we know that in any irreducible unitary represen-tation of the Poincare group, the two Casimir operators M and S2 act asmultiplication by a constant. So, all different irreducible representations,and, therefore, all elementary particles, can be classified according to thevalues of these two constants - the mass and the spin squared. Of course,there are many other parameters describing elementary particles, such as

charge, magnetic moment, strangeness, etc. But all of them are related tothe manner in which particles participate in interactions. In the world whereall interactions are “turned off”, particles have just two intrinsic properties:mass and spin.

There are only six known stable elementary particles for which the clas-sification by mass and spin applies (see Table 5.1). Some reservations shouldbe made about this statement. First, for each particle in the table (exceptphotons) there is a corresponding antiparticle having the same mass and spinbut opposite values of the electric, baryon, and lepton charges.1 So, if we alsocount antiparticles, there are eleven different stable particle species. Second,there are many more particles, like muons , pions , neutrons , etc., which areusually called elementary but all of them are unstable and eventually decayinto particles shown in Table 5.1. This does not mean that unstable particlesare “made of” stable particles or that they are less elementary. Simply, stableparticles from Table 5.1 have the lowest mass, and there are no lighter speciesto which they could decay without violating conservation laws. Third, wedo not list in Table 5.1 quarks, gluons, Higgs scalars, and other particlespredicted theoretically, but never directly observed in experiment. Fourth,strictly speaking, the photon is not a true elementary particle as it is notdescribed by an irreducible representation of the Poincare group. We will seein subsection 5.4.4 that a photon is described by a reducible representation of

the Poincare group which is a direct sum of two irreducible representationswith helicities +1 and -1. Fifth, neutrinos are not truly stable elementaryparticles. According to recent experiments, three flavors of neutrinos are os-

1see subsection 7.2.1

Page 165: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 165/687

5.1. MASSIVE PARTICLES 135

cillating between each other over time. Finally, it may be true that protons

are not elementary particles as well. They are usually regarded as composedof quarks. This leaves us with just one truly stable and elementary particle,which is the electron.

Table 5.1: Properties of stable elementary particlesParticle Mass Spin/helicity

Electron 0.511 MeV/c2 1/2Proton 938.3 MeV/c2 1/2

Electron neutrino < 1 eV/c2 1/2Muon neutrino < 1 eV/c2 1/2

Tau neutrino < 1 eV/c2 1/2

Photon 0 ±1

In the following we will denote m the value of the mass operator in theHilbert space of elementary particle and consider separately two cases: mas-sive particles (m > 0) and massless particles (m = 0).

5.1 Massive particles

5.1.1 Irreducible representations of the Poincare group

The Hilbert space H of a massive elementary particle carries an unitaryirreducible representation U g of the Poincare group characterized by a singlepositive eigenvalue m of the mass operator M . As discussed in subsection4.3.3, the position operator R is well-defined in this case. The componentsof the position and momentum operators satisfy the commutation relationsof the 6-dimensional Heisenberg Lie algebra .2

[P i, P j ] = [Ri, R j ] = 0

[Ri, P j ] = i δ ij

Then, according to the Stone-von Neumann theorem H.2, operators P x, P y, P zand Rx, Ry, Rz have continuous spectra occupying entire real axis (−∞, ∞).

2see eqs. (3.55), (4.24), and Theorem 4.1

Page 166: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 166/687

136 CHAPTER 5. SINGLE PARTICLES

There exists a decomposition of unity associated with three mutually com-

muting operators P x, P y, P z, and the Hilbert space H can be represented asa direct sum of corresponding eigensubspaces Hp of the momentum operatorP

H = ⊕p∈R3Hp

Let us first focus on the subspace H0 with zero momentum. This subspaceis invariant with respect to rotations, because for any vector |0 from this

subspace the result of rotation e− iJ φ|0 belongs to H0

Pe− iJ φ|0 = e− i

J φe i

J φPe− i

J φ|0

= e− iJ φ((P ·

φ

φ)

φ

φ(1 − cos φ) + Pcosφ − P ×

φ

φsin φ)|0

= 0

This means that representation U g induces a unitary representation V g of therotation group in H0.

The generators of rotations in H are, of course, represented by the angularmomentum vector J. However, in the subspace H0, they can be equivalentlyrepresented by the vector of spin S, because

S z|0 = J z|0 − [R × P]z|0= J z|0 − (RxP y − RyP x)|0= J z|0

We will show later that the representation of the Poincare group is irre-ducible if and only if the representation V g of the rotation group in H0 is irre-ducible. So, we will be interested only in such irreducible representations V g.The classification of unitary irreducible representations of the rotation group(single- and double-valued) depends on one integer or half-integer parameters,3 which we will call the spin of the particle. The trivial one-dimensional rep-resentation is characterized by spin zero (s = 0) and corresponds to a spinless particle. The two-dimensional representation corresponds to particles with

3see Appendix H.3

Page 167: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 167/687

5.1. MASSIVE PARTICLES 137

spin one-half (s = 1/2). The 3-dimensional representation corresponds to

particles with spin one (s = 1), etc.It is customary to choose a basis of eigenvectors of S z in H0 and denotethese vectors by |0, σ, i.e.,

P|0, σ = 0

H |0, σ = mc2|0, σM |0, σ = m|0, σS2|0, σ =

2s(s + 1)|0, σS z|0, σ = σ|0, σ

where σ = −s, −s + 1, . . . , s − 1, s. The action of a rotation on these basisvectors is

e− iJ φ|0, σ = e− i

S φ|0, σ =

sσ′=−s

Dsσ′σ( φ)|0, σ′ (5.1)

where Ds are (2s + 1) × (2s + 1) matrices of the representation V g. Thisdefinition implies that4

sσ′=−s

Dsσ′σ( φ1 φ2)|0, σ′

= e− iJ φ1e− i

J φ2|0, σ

= e− iJ φ1

sσ′′=−s

Dsσ′′σ( φ2)|0, σ′′

=

sσ′′=−s

sσ′=−s

Dsσ′′σ( φ2)Ds

σ′σ′′( φ1)|0, σ′

=

sσ′=−s

sσ′′=−s D

sσ′σ′′( φ1)D

sσ′′σ( φ2)|0, σ′

and

4Here φ1 φ2 denotes the composition of two rotations by vectors φ1 and φ2.

Page 168: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 168/687

138 CHAPTER 5. SINGLE PARTICLES

ΛΛ

pp

p’

00

λλ p ’

λλ p p

σ= σ=

σ= σ=

σ= σ=

1 / 2 1 / 2

1 / 21 / 2

1 / 21 / 2

Figure 5.1: Construction of the momentum-spin basis for a spin one-half par-ticle. Spin eigenvectors (with eigenvalues σ = −1/2, 1/2) at zero momentumare propagated to non-zero momentum p and p′ by using pure boosts λp andλp′. There is a unique pure boost Λ which connects momenta p and p′.

Dsσ′σ( φ1 φ2) =

sσ′′=−s

Dsσ′σ′′( φ1)Ds

σ′′σ( φ2)

which means that matrices Ds

furnish a representation of the rotation group.

5.1.2 Momentum-spin basis

In the preceding subsection we constructed basis vectors |0, σ in the subspaceH0. We also need basis vectors |p, σ in other subspaces Hp with p = 0. Wewill build basis |p, σ by propagating the basis |0, σ to other points in the3D momentum space using pure boost transformations.5 The unique pureboost which transforms momentum 0 to p is (see fig. 5.1)

λp ≡ e−iKc θp

(5.2)5Of course, this choice is rather arbitrary. A different choice of transformations con-

necting momenta 0 and p (e.g., boost coupled with rotation) would result in a differentbut equivalent basis set. However, once the basis set has been fixed, all formulas shouldbe written with respect to it.

Page 169: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 169/687

5.1. MASSIVE PARTICLES 139

where

θp =p

psinh−1 p

mc(5.3)

Therefore we can write

|p, σ = N (p)λp|0, σ= N (p)e− i

Kc θp|0, σ (5.4)

where N (p) is a normalization factor. Here we will just assume that this

factor does not depend on the direction of p. The explicit expression forN (p) will be given in eq. (5.19).

To verify that vector (5.4) is indeed an eigenvector of the momentumoperator with eigenvalue p we use eq. (4.3)

P|p, σ = N (p)Pe− iKc θp|0, σ

= N (p)e− iKc θpe

iKc θpPe− i

Kc θp|0, σ

= N (p)e− iKc θp

θpcθp

H sinh θp|0, σ

= N (p)e− iKc θp

θpθp

mc sinh θp|0, σ

= N (p)pe− iKc θp|0, σ

= p|p, σ

Let us now find the action of the spin component S z on the basis vectors|p, σ6

S z|p, σ = N (p)S ze−iKc θp

|0, σ6Here we take into account that W 0|0, σ = P|0, σ = 0. We also use boost transfor-

mations (4.3) and (4.4) of the energy-momentum 4-vector (H, cP) and similar formulasfor the Pauli-Lubanski 4-vector (W 0,W). For brevity, we denote θz the z-component of

the vector θp and θ its absolute value.

Page 170: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 170/687

140 CHAPTER 5. SINGLE PARTICLES

= N (p)e− iKc θpe

iKc θp(

W zMc

− W 0P zM (Mc2 + H )

)e− iKc θp|0, σ

= N (p)e− iKc θp(

W z + θzθ [(W · θ

θ )(cosh θ − 1) − W 0 sinh θ]

Mc

− (W 0 cosh θ − (W · θθ

)sinh θ)(P z + θzθ

[(P · θθ

)(cosh θ − 1) − 1cH sinh θ])

M (Mc2 + H cosh θ − c(P · θθ ) sinh θ)

)|0,

= N (p)e− iKc θp(

W z + θzθ [(W · θ

θ )(cosh θ − 1)]

Mc

+(W · θ

θ) sinh θ( θzθ [−1cMc2 sinh θ])

M (Mc2 + Mc2 cosh θ)))|0, σ

= N (p)e− iKc θp( W z

Mc+ θz

θ(W · θ

θ)(cosh θ − 1

Mc− sinh2 θ

Mc(1 + cosh θ))|0, σ

= N (p)e− iKc θpS z|0, σ

= N (p)e− iKc θp σ|0, σ

= σ|p, σ

So, |p, σ are eigenvectors of the momentum, energy, and z -component of spin7

P|p, σ = p|p, σH |p, σ = ωp|p, σM |p, σ = m|p, σS2|p, σ =

2s(s + 1)|p, σS z|p, σ = σ|p, σ

where we denoted7Note that eigenvectors of the spin operator S (4.26) are obtained here by applying

pure boosts to vectors at p = 0. Different transformations (involving rotations) connecting

bases in points 0 and p (see footnote on page 138) would result in different momentum-spin basis and in different spin operator S′ (see [53]). Does this contradict our statementabout the uniqueness of the spin operator in subsection 4.3.6? Not really. The point isthat the alternative spin operator S′ (and the corresponding alternative position operatorR′) will not be expressed as a function of basic generators of the Poincare group. Thiscondition was important for our proof of the uniqueness of S (and R) in section 4.3.

Page 171: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 171/687

5.1. MASSIVE PARTICLES 141

ωωp’

ωωpp

pp p’

LLθθ PP

HH

00

m > 0

m = 0

Figure 5.2: Mass hyperboloid in the energy-momentum space for massiveparticles, and the zero-mass cone for m = 0.

ωp ≡

m2c4 + p2c2 (5.5)

the one-particle energyThe common spectrum of the energy-momentum eigenvalues (ωp, p) can

be conveniently represented as points on the mass hyperboloid in the 4-dimensional energy-momentum space (see Fig. 5.2). For massive parti-cles, the spectrum of the velocity operator V = Pc2

H is the interior of a

3-dimensional sphere |v| < c. This spectrum does not include the surface of the sphere therefore massive particles cannot reach the speed of light.8

5.1.3 Action of Poincare transformations

We can now define the action of transformations from the Poincare groupon the basis vectors |p, σ constructed above.9 Translations act by simplemultiplication

8In quantum mechanics, the speed of propagation of particles is not a well-definedconcept. The value of particle’s speed is definite in states having certain momentum.

However, such states are described by infinitely extended plane waves (5.31), and onecannot speak about particle propagation in such states. So, strictly speaking, the speed of the particle cannot be obtained by measuring its positions at two different time instantsand dividing the traveled distance by the time interval. This is a consequence of thenon-commutativity of the operators of position and velocity.

9 We are working in the Schrodinger picture here.

Page 172: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 172/687

142 CHAPTER 5. SINGLE PARTICLES

e− iPa|p, σ = e− ipa|p, σ (5.6)

eiHt |p, σ = e

iωpt|p, σ (5.7)

Let us now apply rotation e− iJ φ to the vector |p, σ and use eq. (5.1)

e− iJ φ|p, σ = N (p)e− i

J φe− i

Kc θp|0, σ

= N (p)e− iJ φe− i

Kc θpe

iJ φe− i

J φ|0, σ

= N (p)e− i(R−1

φK)c θp

s

σ′=−s

Dσ′σ( φ)|0, σ′

= N (p)e− iKcR φ

θp

sσ′=−s

Dσ′σ( φ)|0, σ′

=s

σ′=−sDσ′σ( φ)|R φp, σ′ (5.8)

This means that both momentum and spin of the particle are rotated by theangle φ, as expected.

Applying a boost Λ ≡ e− iKc θ to the vector |p, σ and using (5.4) we

obtain

Λ|p, σ = ΛN (p)λp|0, σ (5.9)

The product of boosts on the right hand side of eq. ( 5.9) is a transformationfrom the Lorentz group, so it can be represented in the form (boost)×(rotation)

Λλp = λp′R φW (p,Λ)(5.10)

Multiplying both sides of eq. (5.10) by λ−1p′ , we obtain

λ−1p′ Λλp = R φW (p,Λ)

(5.11)

Since R φW is a rotation, it keeps invariant the subspace with zero momentum

H0. Therefore, the sequence of boosts on the left hand side of equation (5.11)

Page 173: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 173/687

5.1. MASSIVE PARTICLES 143

returns each vector with zero momentum |0, σ back to the zero momentum

subspace (see Fig. 5.1). The zero momentum vector is mapped to a vectorwith momentum p by the boost λp. Subsequent application of Λ transformsthis vector to

Λp = p + θ

θ[(p ·

θ

θ)(cosh θ − 1) +

ωp

csinh θ]

Then λp′ is the boost connecting zero momentum with momentum Λp,and we may write

R φW (p,Λ)= λ−1

ΛpΛλp (5.12)

The rotation angle on the left hand side of eq. (5.12) is called the Wigner

angle . Explicit formulas for φW (p, Λ) can be found, e.g., in ref. [54]. Then,substituting (5.12) in (5.10), we obtain

e− iKc θ|p, σ = ΛN (p)λp|0, σ

= N (p)λΛpR φW (p,Λ)|0, σ

= N (p)λΛp

sσ′=−s

Dsσ′σ( φW (p, Λ))|0, σ′

=N (p)

N (Λp)

sσ′=−s

Dsσ′σ( φW (p, Λ))|Λp, σ′ (5.13)

Eqs. (5.8) and (5.13) show that rotations and boosts are accompaniedwith turning the spin vector in each subspace Hp by rotation matrices Ds.If the representation of the rotation group Ds were reducible, then eachsubspace Hp would be represented as a direct sum of irreducible components

Hkp

Hp = ⊕kHkp

and each subspace

Page 174: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 174/687

144 CHAPTER 5. SINGLE PARTICLES

Hk = ⊕p∈R3Hkp

would be irreducible with respect to the entire Poincare group. Therefore, inorder to construct an irreducible representation of the Poincare group in H,the representation Ds must be an irreducible unitary representation of therotation group, as was mentioned already in subsection 5.1.1. In this bookwe will be interested in describing interactions between electrons and protonswhich are massive particles with spin 1/2. Then the relevant representationDs of the rotation group is the 2-dimensional representation described insubsection H.3.

Let us now review the above construction of unitary irreducible repre-sentations of the Poincare group for massive particles.10 First we chose astandard momentum vector p = 0 and found a little group, which was asubgroup of the Lorentz group leaving this vector invariant. The little groupturned out to be the rotation group in our case. Then we found that if thesubspace H0 corresponding to the standard vector carries an irreducible rep-resentation of the little group, then the entire Hilbert space is guaranteed tocarry an irreducible representation of the Poincare group. In this represen-tation, translations are represented by multiplication (5.6) - (5.7), rotationsand boosts are represented by formulas (5.8) and (5.13), respectively. It canbe shown that a different choice of the standard vector in the spectrum of

momentum would result in a representation of the Poincare group isomorphicto the one found above.

5.2 Momentum and position representations

So far we discussed the action of inertial transformations on common eigen-vectors |p, σ of the operators P and S z. All other vectors in the Hilbertspace H can be represented as linear combinations of these basis vectors,i.e., they can be represented as wave functions ψ(p, σ) in the momentum-spin representation. Similarly one can construct the position space basis from

common eigenvectors of the (commuting) Newton-Wigner position operatorand operator S z. Then arbitrary states in H can be represented in this basisby their position-spin wave functions ψ(r, σ). In this section we will consider

10This construction is known as the induced representation method [55].

Page 175: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 175/687

5.2. MOMENTUM AND POSITION REPRESENTATIONS 145

the wave function representations of states in greater detail. For simplic-

ity, we will omit the spin label and consider only spinless particles. It isremarkable that formulas for the momentum-space and position-space wavefunctions appear very similar to those in non-relativistic quantum mechanics.

5.2.1 Spectral decomposition of the identity operator

Two basis vectors with different momenta |p and |p′ are eigenvectors of theHermitian operator P with different eigenvalues, so they must be orthogonal

p|p′ = 0 if p = p′

If the spectrum of momentum values p were discrete we could simply nor-malize the basis vectors to unity p|p = 1. However, this normalizationbecomes problematic in the continuous momentum space. We will call eigen-vectors |p improper states and use them to conveniently write arbitrary“proper” normalizable state vectors |Ψ as integrals

|Ψ =

dpψ(p)|p (5.14)

where ψ(q) is called the wave function in the momentum representation . Itis convenient to demand, in analogy with (2.39), that normalizable wave

functions ψ(q) are given by the inner product

ψ(q) = q|Ψ =

dpψ(p)q|p

This implies that the inner product of two basis vectors is given by the Dirac’sdelta function (see Appendix B)

q|p = δ (q − p) (5.15)

Then in analogy with eq. (F.18) we can define the decomposition of theidentity operator

I =

dp|pp| (5.16)

Page 176: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 176/687

146 CHAPTER 5. SINGLE PARTICLES

so that for any normalized state vector |Ψ one can verify that

|Ψ = I |Ψ=

dp|pp|Ψ

=

dp|pψ(p)

= |ΨThe identity operator, of course, must be invariant with respect to Poincare

transformations, i.e.,

I = U (Λ, r, t)IU −1(Λ, r, t)

The invariance of I with respect to translations follows directly from eqs (5.6)and (5.7). The invariance with respect to rotations can be proven as follows

I ′ = e− iJ φIe

iJ φ

= e− iJ φ(

dp|pp|)e

iJ φ

= dp|R φpR φp|=

dq det |dp

dq||qq|

=

dq|qq|

= I

where we used the fact that det |dpdq

| is the Jacobian of transformation fromvariables p to q = R φp, which is equal to the determinant of the rotationmatrix det(R

− φ) = 1.

Let us consider more closely the invariance of I with respect to boosts.Using eq. (5.13) we obtain

I ′ = e− iKc θIe

iKc θ

Page 177: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 177/687

5.2. MOMENTUM AND POSITION REPRESENTATIONS 147

= e− iKc θ( dp|pp|)e

iKc θ

=

dp|ΛpΛp|| N (p)

N (Λp)|2

=

dq det |dΛ−1q

dq||qq||N (Λ−1q)

N (q)|2 (5.17)

where N (q) is the normalization factor introduced in (5.4) and det |dΛ−1q/dq|is the Jacobian of transformation from variables p to q = Λp. This Jacobianshould not depend on the direction of the boost θ, so we can choose thisdirection along the z -axis to simplify calculations. Then

Λ−1q x = q x

Λ−1q y = q y

Λ−1q z = q z cosh θ − 1

c

m2c4 + p2c2 sinh θ

ωΛ−1q =

m2c4 + c2q 2x + c2q 2y + c2(q z cosh θ − 1

c

m2c4 + q 2c2 sinh θ)2

= ωq cosh θ − cq z sinh θ

and

det |dΛ−1qdq

| = det

1 0 0

0 1 0cqx sinh θ√ m2c4+q2c2

cqy sinh θ√ m2c4+q2c2

cosh θ − cqz sinh θ√ m2c4+q2c2

= cosh θ − cq z sinh θ m2c4 + q 2c2

=ωΛ−1q

ωq

(5.18)

Inserting this result in eq. (5.17) we obtain

I ′ =

dp

ωΛ−1pωp

|pp||N (Λ−1p)

N (p)|2

Page 178: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 178/687

148 CHAPTER 5. SINGLE PARTICLES

Thus, to ensure the invariance of I , we should set11

N (p) =1√ ωp

(5.19)

Putting together our results from eqs (5.6) - (5.8), (5.13), and (5.19), wecan define the action of an arbitrary Poincare group element on basis vectors|p, σ. Bearing in mind that in a general Poincare transformation (Λ, r, t) weagreed12 perform first to perform translations (r, t) and then boosts/rotationsΛ, we obtain for the general case of a particle with spin

U (Λ, r, t)|p, σ= U (Λ; 0, 0)e− i

Pre

iHt |p, σ

=

ωΛpωp

e− ip·r+ i

ωpt

sσ′=−s

Dσ′σ( φW (p, Λ))|Λp, σ′ (5.20)

5.2.2 Wave function in the momentum representation

The inner product13 of two normalized vectors |Ψ =

dpψ(p)|p and |Φ =

dpφ(p)|p can we written in terms of their wave functions

Ψ|Φ =

dpdp′ψ∗(p)φ(p′)p|p′

=

dpdp′ψ∗(p)φ(p′)δ (p − p′)

=

dpψ∗(p)φ(p) (5.21)

So, for a state vector |Ψ with unit normalization, the wave function ψ(p)must satisfy the condition

11We could also multiply this expression for N (p) by an arbitrary unimodular factor,but this would not have any effect, because state vectors and their wave functions aredefined up to an unimodular factor anyway.

12 see eq. (1.6)13see Appendix F.1

Page 179: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 179/687

5.2. MOMENTUM AND POSITION REPRESENTATIONS 149

1 = Ψ|Ψ =

dp|ψ(p)|2

This wave function has a direct probabilistic interpretation, e.g., if Ω is aregion in the momentum space, then the integral

Ω

dp|ψ(p)|2 gives the prob-

ability of finding particle’s momentum inside this region.Poincare transformations of the state vector |Ψ can be viewed as trans-

formations of the corresponding momentum-space wave function, e.g., usingeq. (5.20)

e− iKc θψ(p) ≡ p|e− i

Kc θ|Ψ

=

ωΛ−1p

ωp

Λ−1p|Ψ

=

ωΛ−1p

ωp

ψ(Λ−1p) (5.22)

Then the invariance of the inner product (5.21) can be easily proven

Φ′|Ψ′ = dpφ∗(Λ−1

p)ψ(Λ−1

p)

ωΛ−1pωp

=

dΛ−1pωΛ−1p

φ∗(Λ−1p)ψ(Λ−1p)ωΛ−1p

=

dpφ∗(p)ψ(p)

= Φ|Ψ.

where we used property

dpωp

= d(Λp)ωΛp

which is valid for any transformation Λ from the Lorentz group (e.g., boostsand rotations).

Page 180: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 180/687

150 CHAPTER 5. SINGLE PARTICLES

The action of Poincare generators and the Newton-Wigner position op-

erator on the momentum-space wave functions of a massive spinless particlecan be derived from formula (5.20)

P xψ(p) = pxψ(p) (5.23)

Hψ(p) = ωpψ(p) (5.24)

K xψ(p) =i

climθ→0

d

dθe− i

K xcθψ(p)

=i

climθ→0

d

m2c4 + p2c2 cosh θ − cpx sinh θ

m2c4 + p2c2

ψ( px cosh θ − 1c m2c4 + p2c2 sinh θ, py, pz)

= i (−ωp

c2d

dpx− px

2ωp

ψ(p)) (5.25)

Rxψ(p) = −c2

2(H −1K x + K xH −1)ψ(p)

= −i

2(−ω−1

p ωp

d

dpx− ωp

d

dpxω−1p − pxc2

ω2p)ψ(p)

= i d

dpxψ(p) (5.26)

J xψ(p) = (RyP z − RzP y)ψ(p)

= i ( pzd

dpy− py

d

dpz)ψ(p) (5.27)

5.2.3 The position representation

In the preceding section we considered particle’s wave function in the mo-mentum representation, i.e., with respect to common eigenvectors of threecommuting components of momentum P x, P y, and P z. Three components of the position operator Rx, Ry, and Rz also commute with each other,14 andtheir common eigenvectors

|r

also form a basis in the Hilbert space

Hof

one massive spinless particle. In this section we will describe particle’s wavefunction with respect to this basis set, i.e., in the position representation .

First we can expand eigenvectors |r in the momentum basis

14see Theorem 4.1

Page 181: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 181/687

5.2. MOMENTUM AND POSITION REPRESENTATIONS 151

|r = (2π )−3/2 dpψr(p)|p (5.28)

The momentum-space eigenfunctions are

ψr(p) = p|r= (2π )−3/2e− i

pr (5.29)

as can be verified by substitution of (5.26) and (5.29) to the eigenvalue equa-tion

Rψr(p) = (2π )−3/2R e− iqr

= i (2π )−3/2 d

dpe− i

qr

= r(2π )−3/2e− iqr

= rψr(p)

As operator R is Hermitian, its eigenvectors with different eigenvalues rand r′ must be orthogonal. Indeed, using eq. (B.1) we find that the factor(2π )−3/2 in (5.29) ensures the delta-function inner product

r′|r = (2π )−3

dpdp′e− ipr+ i

p′r′p′|p

= (2π )−3

dpdp′e− ipr+ i

p′r′δ (p − p′)

= (2π )−3

dpe− ip(r−r′)

= δ (r − r′) (5.30)

which means that |r are improper states just as |p are. Similarly to (5.14),

a normalized state vector |Ψ can be represented as an integral over theposition space

|Ψ =

drψ(r)|r

Page 182: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 182/687

152 CHAPTER 5. SINGLE PARTICLES

where ψ(r) = r|Ψ is the wave function of the state |Ψ in the position

representation. The absolute square |ψ(r)|2

of the wave function is the prob-ability density for particle’s position. The inner product of two vectors |Ψand |Φ can be expressed through their position-space wave functions

Φ|Ψ =

drdr′φ∗(r)ψ(r′)r|r′

=

drdr′φ∗(r)ψ(r′)δ (r − r′)

=

drφ∗(r)ψ(r)

Using Eqs. (5.28) and (5.29) we find that the position space wave functionof the momentum eigenvector is the usual plane wave

ψp(r) = r|p= (2π )−3/2e

ipr (5.31)

As expected, eigenvectors of position are given by delta-functions in its ownrepresentation (5.30).15

From (5.16) we can also obtain a position-space representation of the

identity operator

dr|rr| = (2π )−3

drdpdp′e− i

p′r|pp′|e i

pr

=

dpdp′|pp′|δ (p − p′)

=

dp|pp|

= I

Similar to momentum-space formulas (5.23) - (5.27) we can representgenerators of the Poincare group by their action on the position-space wavefunctions. For example, it follows from (5.6), (5.28) and (5.29) that

15Note that position eigenfunctions derived in ref. [49] do not satisfy this requirement.

Page 183: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 183/687

5.2. MOMENTUM AND POSITION REPRESENTATIONS 153

e− iPa drψ(r)|r =

drψ(r)|r + a

=

drψ(r − a)|r

Therefore we can write

e− iPaψ(r) = ψ(r − a)

and

Pψ(r) = i lima→0

d

dae− i

Paψ(r)

= i lima→0

d

daψ(r − a)

= −i d

drψ(r) (5.32)

Other operators in the position representation have the following forms16

Hψ(r) =

m2c4 − 2c2

d2

dr2ψ(r)

J xψ(r) = −i

y

d

dz − z

d

dy

ψ(r)

K xψ(r) =1

2

m2c4 − 2c2

d2

dr2x + x

m2c4 − 2c2

d2

dr2

ψ(r)

Rψ(r) = rψ(r)

16

Here we used a formal notation for the Laplacian operator

d2

dr2≡ ∂ 2

∂x2+

∂ 2

∂y2+

∂ 2

∂z2

Page 184: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 184/687

154 CHAPTER 5. SINGLE PARTICLES

The switching between the position-space and momentum-space wave

functions of the same state are achieved by Fourier transformation formulas.To derive them, assume that the state |Ψ has a position-space wave functionψ(r). Then using (5.28) and (5.29) we obtain

|Ψ =

drψ(r)|r

= (2π )−3/2

drψ(r)

dpe− i

pr|p

= (2π )−3/2

dp(

drψ(r)e− i

pr)|p

and the corresponding momentum-space wave function is

ψ(p) = (2π )−3/2

drψ(r)e− ipr (5.33)

Inversely, if the momentum-space wave function is ψ(p), then the position-space wave function is

ψ(r) = (2π )−3/2

dpeiprψ(p) (5.34)

5.3 The classical limit of quantum mechanics

In section 2.5.2 we indicated that distributive (classical) propositional sys-tems are particular cases of orthomodular (quantum) propositional systems.Therefore, we may expect that quantum mechanics includes classical me-chanics as a particular case. However, this is not obvious how exactly the

phase space of classical mechanics is “embedded” in the Hilbert space of quantum mechanics. We expect that there must exist a close link betweenthese two approaches, and we would like to analyze this link in the presentsection. We will use as an example a single spinless particle with non-zeromass m > 0.

Page 185: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 185/687

5.3. THE CLASSICAL LIMIT OF QUANTUM MECHANICS 155

5.3.1 Quasiclassical states

In the macroscopic world we do not meet localized eigenvectors of the positionoperator |r. According to eq. (5.29), such states have infinite uncertaintyof momentum which is rather unusual. Similarly, we do not meet states withsharply defined momentum. Such states are delocalized over entire universe(5.31). The reason why we do not see such states is not well understoodyet. The most plausible hypothesis is that eigenstates of the position oreigenstates of the momentum are susceptible to small perturbations (e.g., dueto temperature or external radiation) and rapidly transform to more robustwave packets or quasiclassical states in which both position and momentumhave good, but not perfect localization.

So, when discussing the classical limit of quantum mechanics, we will notconsider general states allowed by quantum mechanics. We will limit ourattention only to the class of states |Ψr0,p0 that we will call quasiclassical.Wave functions of these states are assumed to be well-localized around apoint r0 in the position space and also well-localized around a point p0 inthe momentum space. Without loss of generality such wave functions can berepresented as

ψr0,p0(r) = r|Ψr0,p0 = ηr0(r)eip0r (5.35)

where ηr0(r) is a smooth (non-oscillating) function with a maximum near thevalue of position r0. As we will see later, in order to discuss the classical limitof quantum mechanics the exact choice of the function ηr0(r) is not important.For example, it is convenient to choose it in the form of a Gaussian

ψr0,p0(r) = Ne−(r−r0)2/d2eip0r (5.36)

where parameter d controls the degree of localization and N is a coefficientrequired for the proper normalization of the wave function

dr|ψr0,p0(r)|2 = 1

The exact magnitude of this coefficient is not important for our discussion,so we will not calculate it here.

Page 186: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 186/687

156 CHAPTER 5. SINGLE PARTICLES

5.3.2 The Heisenberg uncertainty relation

Wave functions like (5.36) cannot possess both sharp position and sharpmomentum at the same time. They are always characterized by non-zerouncertainty of position ∆r and non-zero uncertainty of momentum ∆ p. Theseuncertainties are roughly inversely proportional to each other. To see thenature of this inverse proportionality, we assume, for simplicity, that theparticle is at rest in the origin, i.e., r0 = p0 = 0. Then the position wavefunction is

ψ0,0(r) = Ne−r2/d2 (5.37)

and its counterpart in the momentum space is (see eqs. (5.33) and (B.13))

ψ0,0(p) = (2π )−3/2N

dre−r2/d2e− i

pr

= (2 )−3/2Nd3e−p2d2/(42) (5.38)

Then the product of the uncertainties of the momentum-space (∆ p ≈ 2d )

and position-space (∆r ≈ d) wave functions is

∆r∆ p ≈ 2

This is an example of the famous Heisenberg uncertainty relation which tellsthat the above uncertainties must satisfy the inequality

∆r∆ p ≥ /2 (5.39)

In the language of propositional lattices from section 2.3 this relationshipmeans the following: Let x be a proposition corresponding to the interval ∆rof the position values and y be a proposition corresponding to the interval∆ p of the momentum values. Then their meet is not empty x ∧ y > ∅ onlyif the sizes of the two intervals satisfy the inequality (5.39).

Page 187: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 187/687

5.3. THE CLASSICAL LIMIT OF QUANTUM MECHANICS 157

5.3.3 Spreading of quasiclassical wave packets

Suppose that at time t = 0 the particle was prepared in the state with well-localized wave function (5.37), i.e., the uncertainty of position ∆r ≈ d issmall. The corresponding time-dependent wave function in the momentumrepresentation is

ψ(p, t) = eiHtψ0,0(p, 0)

= (2 )−3/2Nd3e−p2d2/(42)eit

√ m2c4+ p2c2

Returning back to the position representation we obtain the wave function17

ψ(r, t) = (2 )−3π−3/2Nd3

dpe−p2d2/(42)eipre

it

√ m2c4+q2c2

≈ (2 )−3π−3/2Nd3eimc2t

dp exp(− p2(

d2

4 2− it

2 m) +

i

pr)

= (2 )−3π−3/2Nd3(π

d2

42− it2m

)3/2eimc2t exp(− r2

( d2

42− it2m)4 2

)

= (2 )−3Nd3(4 2m

d2m − 2i t)3/2e

imc2t exp(− mr2

d2m − 2i t)

= N ( d2

md2m − 2i t

)3/2e imc2t exp(− mr2

d2m − 2i t)

and the probability density

ρ(r, t) = |ψ(r, t)|2

= |N |2( d4m2

d4m2 + 4 2t2)3/2 exp(− 2r2d2m2

d4m2 + 4 2t2)

Then the size of the wave packet at time t is easily found as

17Due to the factor e−p2d2/(42), only small values of momentum contribute to the

integral, and we can use the non-relativistic approximation

m2c4 + p2c2 ≈ mc2 + p2

2mand eq. (B.13).

Page 188: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 188/687

158 CHAPTER 5. SINGLE PARTICLES

∆r(t) ≈ d4m2 + 4 2t2

2d2m2

At large times t → ∞ this formula simplifies

∆r(t) ≈√

2 t

dm

So, the position-space wave packet is spreading out, and the speed of spread-ing vs is inversely proportional to the uncertainty of position d ≈ ∆r in the

initially prepared state

vs ≈√

2

dm≈ ∆ p

m(5.40)

One can verify that at large times this speed does not depend on the shapeof the initial wave packets. The important parameters are the size of thewave packet d and the particle’s mass m.

A simple estimate demonstrates that for macroscopic objects this spread-ing of the wave packets can be safely neglected. For example, for a particleof mass m = 1 mg and the initial position uncertainty of d = 1 micron, thetime needed for the wave function to spread to 1 cm is more than 1011 years.Therefore, for quasiclassical states of macroscopic particles, their positionsand momenta are well defined at all times, and particles have well-definedtrajectories .

5.3.4 Classical observables and the Poisson bracket

Besides large masses of involved objects and their good localization in bothposition and momentum spaces, there is another feature distinguishing clas-sical world from the quantum one. This is very small value of the Planck

constant (4.1). In most circumstances, the resolution of classical measuringdevices is much poorer than the quantum of action [56]. Thus, from thepoint of view of classical mechanics, all quasiclassical states (5.35), indepen-dent on shapes of their functions ηr0(r), look as approximate eigenstates of both position and momentum operators simultaneously:

Page 189: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 189/687

5.3. THE CLASSICAL LIMIT OF QUANTUM MECHANICS 159

R|Ψr0,p0 ≈ r0|Ψr0,p0 (5.41)

P|Ψr0,p0 ≈ p0|Ψr0,p0 (5.42)

so all of these states can be represented as one point (r0, p0) in the phasespace. All observables for a massive spinless particle can be expressed asfunctions f (P, R) of momentum and position operators.18 Then it followsfrom (5.41) and (5.42) that quasiclassical states are also eigenstates of anysuch observable

f (R, P)|Ψr

0,p0 ≈

f (r0, p0)|Ψr

0,p0

The expectation value of observable f (R, P) in the quasiclassical state |Ψr0,p0is just the value of the corresponding function f (r0, p0)

f (R, P) = f (r0, p0)

and the expectation value of a product of two such observables is equal tothe product of expectation values

f (R, P)g(R, P) = f (r0, p0)g(r0, p0)= f (R, P)g(R, P)

This agrees with our phase space description derived from axioms of theclassical propositional system in subsection 2.4.4.

The above discussion indicates that in the classical limit the Plank con-stant should be set to zero. According to (3.52) - (3.58), all commutatorsare proportional to , so in this limit all operators of observables commutewith each other. There are two important roles played by commutators of observables in quantum mechanics. First, the commutator of two observablesdetermines whether these observables can be measured simultaneously. Van-ishing commutators of classical observables implies that all of them can bemeasured simultaneously in agreement with classical Assertion 2.2. Second,commutators of observables with generators of the Poincare group allow us to

18see subsection 4.3.4

Page 190: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 190/687

160 CHAPTER 5. SINGLE PARTICLES

perform transformations of observables from one reference frame to another,

as in the case of time translations (3.63). However, the zero classical limitof these commutators as → 0 does not mean that the right hand side of eq. (3.63) becomes zero and that the time evolution stops in this limit. Theright hand side of (3.63) does not vanish even in the classical limit, becausethe commutators in this expression are multiplied by factors − i

. In the limit

→ 0 we obtain

F (t) = F − [H, F ]P t +1

2[H, [H, F ]P ]P t

2 + . . . (5.43)

where

[f, g]P ≡ lim→0

−i

[f (R, P), g(R, P)] (5.44)

is called the Poisson bracket . So, even though commutators of observablesare effectively zero in classical mechanics, we can still use the non-vanishingPoisson brackets in calculations of the action of inertial transformations onobservables.

The exact commutator of quantum mechanical operators f (R, P) andg(R, P) can be generally written as a series in powers of

[f, g] = i k1 + i 2k2 + i 3k3 . . .

where ki are Hermitian operators. From eq. (5.44) it is clear that the Poissonbracket is equal to the coefficient of the dominant term of the first order in 19

[f, g]P = k1

As a consequence, the classical Poisson bracket [f, g]P is much easier to calcu-late than the full quantum commutator [f, g]. The following theorem demon-strates that calculation of the Poisson bracket can be reduced to simple dif-

ferentiation.

19This observation suggests, in particular, that there is no unique way to perform quan-

tization , i.e., a construction of a quantum counterpart of a classical theory by switchingfrom classical Poisson brackets to quantum commutators.

Page 191: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 191/687

5.3. THE CLASSICAL LIMIT OF QUANTUM MECHANICS 161

Theorem 5.1 If f (R, P) and g(R, P) are two observables of a massive spin-

less particle, then

[f (R, P), g(R, P)]P =∂f

∂ R· ∂g

∂ P− ∂f

∂ P· ∂g

∂ R(5.45)

Proof. Consider for simplicity the one-dimensional case (the 3D proof issimilar) in which eq. (5.45) becomes

lim→0

−i

[f (R, P ), g(R, P )] =

∂f

∂R· ∂g

∂P − ∂f

∂P · ∂g

∂R(5.46)

First, functions f (R, P ) and g(R, P ) can be represented by their Taylor ex-pansions around the origin (r = 0, p = 0) in the phase space, e.g.,

f (R, P ) = C 00 + C 10R + C 01P + C 11RP + C 20R2 + C 02P 2 + C 21R

2P + . . .

g(R, P ) = D00 + D10R + D01P + D11RP + D20R2 + D02P 2 + D21R

2P + . . .

where C and D are numerical coefficients, and we agreed to write factors Rto the left from factors P . Then it is sufficient to prove formula (5.46) forf and g being monoms of the form RnP m. In particular, we would like to

prove that

[RnP m, RqP s]P =∂ (RnP m)

∂R

∂ (RqP s)

∂P − ∂ (RnP m)

∂P

∂ (RqP s)

∂R= nsRn+q−1P m+s−1 − mqRn+q−1P m+s−1

= (ns − mq )Rn+q−1P m+s−1 (5.47)

We will prove this formula by induction. The result (5.47) holds if f and gare linear in R and P , i.e., when n,m,q,s are either 0 or 1. For example, inthe case n = 1, m = 0, q = 0, s = 1 formula (5.47) yields

[R, P ]P = 1

which agrees with definition (5.44) and quantum result (4.24).

Page 192: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 192/687

162 CHAPTER 5. SINGLE PARTICLES

Suppose that we established the validity of (5.47) for the set of higher

powers n,m,q,s as well as for any set of lower powers n′, m′, q ′, s′, wheren′ ≤ n, m′ ≤ m, q ′ ≤ q , s′ ≤ s. The proof by induction now requires us toestablish the validity of the following equations

[RnP m, Rq+1P s]P = (ns − mq − m)Rn+qP m+s−1

[RnP m, RqP s+1]P = (ns − mq + n)Rn+q−1P m+s

[Rn+1P m, RqP s]P = (ns − mq + s)Rn+qP m+s−1

[RnP m+1, RqP s]P = (ns − mq − q )Rn+q−1P m+s

Let us prove only the first equation. Three others are proved similarly. Usingeqs. (4.51) and (5.47) we, indeed, obtain

[RnP m, Rq+1P s]P = − lim→0

i

[RnP m, Rq+1P s]

= − lim→0

i

[RnP m, R]RqP s − lim

→0i

R[RnP m, RqP s]

= [RnP m, R]P RqP s + R[RnP m, RqP s]P

=

−mRn+qP m+s−1 + (ns

−mq )Rn+qP m+s−1

= (ns − mq − m)Rn+qP m+s−1

Therefore, by induction, eq. (5.46) holds for all values of n,m,q, and s.

Let us apply the above formalism of Poisson brackets to the time evo-lution. We can use formulas (5.43) and (5.45) in the case when F is eitherposition or momentum and obtain

dP(t)

dt=

−[H (R, P), P]

P =

−∂H (R, P)

∂ R(5.48)

dR(t)

dt= −[H (R, P), R]P =

∂H (R, P)

∂ P(5.49)

where one recognizes the classical Hamilton equations of motion .

Page 193: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 193/687

5.4. MASSLESS PARTICLES 163

5.4 Massless particles

5.4.1 Spectra of momentum, energy, and velocity

In the case of massless particles (m = 0), such as photons, the methodused in section 5.1 to construct irreducible unitary representations of thePoincare group does not work. For massless particles the position operator(4.31) cannot be defined. Therefore we cannot apply the Stone-von Neumanntheorem H.2 to figure out the spectrum of the operator P. To find thespectrum of P we will use another argument.

Let us choose a state of the massless particle with arbitrary nonzeromomentum p (we assume that such a state exists in the spectrum of the mo-

mentum operator P). There are two kinds of inertial transformations thatcan affect the momentum: rotations and boosts. Any vector p′ obtainedfrom p by rotations and boosts is also in the spectrum of P. So, we can usethese transformations to explore the spectrum of the momentum operator.Rotations generally change the direction of the momentum vector, but pre-serve its length |p|, so all images of p generated by rotations form a surfaceof a sphere with radius |p|. Boosts along the direction p do not change thedirection of p, but they change its length. To decrease the length of themomentum vector we can use a boost vector θ which points in the direction

that is opposite to the direction p, i.e., θθ

= − p

|p| . Then, using formula (4.3)

and equality20

ωp = cp (5.50)

we can write

p′ = Λ−1p

= p +p

p[ p(cosh θ − 1) − p sinh θ]

= p[cosh θ − sinh θ]

= pe−θ

so the transformed momentum reaches zero only in the limit θ → ∞. Thismeans that the point p = 0 does not belong to the spectrum of the momen-

20This equality follows from eq. (5.5) if m = 0.

Page 194: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 194/687

164 CHAPTER 5. SINGLE PARTICLES

tum of any massless particle21. Then, from the energy-momentum relation-

ship (4.11) we see that for massless particles the mass hyperboloid degener-ates to a cone (5.50) with the point p = 0 deleted (see Fig. 5.2). Therefore,the spectrum of velocity V = Pc2

H is the surface of a sphere |v| = c. This

means that massless particles can move only with the speed of light in anyreference frame.

Statement 5.2 (invariance of the speed of light) The speed of massless particles (e.g., photons) is equal to c independent on the velocity of the source and/or observer.

5.4.2 The Doppler effect and aberrationLet us find the relationship between photon energies in the reference frame

O and in the frame O′ moving with velocity v = c θθ tanh θ relative to O. We

denote H (0) the photon’s energy and P(0) its momentum in the referenceframe O. We also denote H (θ) and P(θ) the photon’s energy and momentumin the reference frame O′. Then using (4.4) and (5.50) we obtain the usualformula for the Doppler effect

H (θ) = H (0) cosh θ−

cP(0)·

θ

θsinh θ

= H (0) cosh θ(1 − 1

c

c2|P(0)|H (0)

P(0)

|P(0)| · θ

θtanh θ)

= H (0) cosh θ(1 − v

ccos φ0) (5.51)

where we denoted φ the angle between the direction of photon’s propagation(seen in the reference frame O) and the direction of movement of the referenceframe O′ with respect to O

cos φ0 ≡ P(0)|P(0)| · θ

θ(5.52)

21 The physical meaning of this result is clear because there are no photons with zeromomentum and energy.

Page 195: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 195/687

5.4. MASSLESS PARTICLES 165

Sometimes the Doppler effect formula is written in another form where

the angle φ between the photon momentum and the reference frame velocityis measured from the point of view of O′

cos φ ≡ P(θ)

|P(θ)| · θ

θ(5.53)

From (4.4) we can write

H (0) = H (θ) cosh θ + cP(θ) · θ

θsinh θ

= H (θ) cosh θ(1 + 1c

c2P (θ)H (θ)

P(θ)P (θ)

· θθ

tanh θ)

= H (θ) cosh θ(1 +v

ccos φ)

Therefore

H (θ) =H (0)

cosh θ(1 + vc cos φ)

(5.54)

The difference between angles φ0 and φ, i.e., the dependence of the direc-

tion of light propagation on the observer is known as the aberration effect.In order to see the same star in the sky observers O and O′ must point theirtelescopes at different directions. These directions make angles φ0 and φ,

respectively, with the direction θθ

of the relative velocity of O and O′. Theconnection between these angles can be found by taking the scalar product of

both sides of (4.3) with θθ

and taking into account eqs. (5.51), (5.52), (5.53),and c|P(θ)| = H (θ)

cos φ =|P(0)|

|P(θ)

|(cosh θ cos φ0 − sinh θ)

=cosh θ cos φ0 − sinh θ

cosh θ(1 − vc cos φ0)

=cos φ0 − v/c

1 − vc

cos φ0

Page 196: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 196/687

166 CHAPTER 5. SINGLE PARTICLES

OOO’

vv

SS

OOO’

vv

SS S’

vv(a) (b)

Figure 5.3: To the discussion of the Doppler effect: (a) observer at rest Oand moving observer O′ measure light from the same source (e.g., star) S ;(b) one observer O measures light from two sources S and S ′ that move withrespect to each other.

Our derivations above referred to the case when there was one source of light and two observers moving with respect to each other (see fig. 5.4(a)).However, this setup is not characteristic for most astronomical observationsof the Doppler effect. In these observations one typically has one observerand two sources of light (stars) that move with respect to each other withvelocity v (see fig. 5.4(b)). Let us assume that the distance between thesestars is much smaller than the distance from the stars to the Earth. Photonsemitted simultaneously by S and S ′ move with the same speed c and arriveto Earth at the same time. Two stars are seen by O in the same region of the sky independent on the velocity v. We also assume that sources S andS ′ are identical, i.e., they emit photons of the same energy in their respectivereference frames. Further, we assume that the energy h(0) of photons arrivingfrom the source S to the observer O is known. Our goal is to find the energy(denoted by h(θ)) of photons emitted by S ′ from the point of view of O.In order to do that, we introduce an imaginary observer O′ whose velocity

v with respect to O is the same as velocity of S ′ with respect to S andapply the principle of relativity. According to this principle, the energy of photons from S ′ registered by O′ is the same as the energy of photons from S registered by O, i.e., h(0). Now, in order to find the energy of photons fromS ′ seen by O we can apply formula (5.54) with the opposite sign of velocity

Page 197: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 197/687

5.4. MASSLESS PARTICLES 167

h(θ) = h(0)cosh θ(1 − v

c cos φ)(5.55)

where φ is the angle between velocity v of the star S ′ and the direction of light arriving from stars S and S ′ from the point of view of O.

The Doppler effect and light aberration discussed in this subsection weremeasured in experiments with high precision. More discussion of experimen-tal verifications of relativistic effects is in subsection 10.3.2.

5.4.3 Representations of the little group

Next we need to construct unitary irreducible representations of the Poincaregroup for massless particles. To do that we can slightly modify the methodof induced representations used for massive particles in section 5.1.

We already discussed in subsection 5.4.1 that vector p = 0 (which waschosen as the standard vector for the construction of induced representa-tions for massive particles in section 5.1) does not belong to the momentumspectrum of a massless particle. We also mentioned that the choice of thestandard vector is arbitrary and representations built on different standardvectors are unitarily equivalent. Therefore, we can choose a different stan-dard momentum vector

k = (0, 0, 1) (5.56)

in the massless case.The next step is to find transformations of the little group which leave

this vector invariant. The energy-momentum 4-vector corresponding to thestandard vector (5.56) is (c|k|, ck) = (c, 0, 0, c). Therefore, in the 4D notationfrom Appendix I.1, the matrices of little group elements must satisfy equation

S c0

0c

= c0

0c

Since the little group is a subgroup of the Lorentz group, we also have (seeeq. (I.5))

Page 198: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 198/687

168 CHAPTER 5. SINGLE PARTICLES

S T gS = g

One can verify that the most general matrix S with these properties has theform [57]

S (X 1, X 2, θ) =

1 + 12

(X 21 + X 22 ) X 1 X 2 −12

(X 21 + X 22 )X 1 cos θ + X 2 sin θ cos θ sin θ −X 1 cos θ − X 2 sin θ

−X 1 sin θ + X 2 cos θ − sin θ cos θ X 1 sin θ − X 2 cos θ12(X 21 + X 22 ) X 1 X 2 1 − 1

2(X 21 + X 22 )

(5.57)

where X 1, X 2, and θ are independent real parameters. The three generatorsof these transformations are obtained by differentiation

M 1 = limX 1,X 2,θ→0

∂X 1S (X 1, X 2, θ) =

0 1 0 01 0 0 −10 0 0 00 1 0 0

= J x − cKy

M 2 = limX 1,X 2,θ→0

∂X 2S (X 1, X 2, θ) =

0 0 1 00 0 0 0

1 0 0 −10 0 1 0

=

J y + c

Kx

R = limX 1,X 2,θ→0

∂θS (X 1, X 2, θ) =

0 0 0 00 0 1 00 −1 0 00 0 0 0

= J z

where J and K are Lorentz group generators (I.8) and (I.9). The commuta-tors are easily calculated

[M x, M y] = 0

[R, M y] = −M x

Page 199: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 199/687

5.4. MASSLESS PARTICLES 169

[R, M x] = M y

These are commutation relations of the Lie algebra of the group of “transla-tions” (M x and M y) and rotations (R) in a 2D plane.

The next step is to find full set of unitary irreducible representations of thelittle group. To achieve that we introduce Hermitian operators Π = (Π1, Π2)and Θ ≡ J z, which correspond to the Lie algebra generators M and R,respectively. So, the little group “translations” and rotations are representedin the Hilbert space Hk by unitary operators e− i

Π1x, e− i

Π2y, and e− i

Θφ.

Suppose that the Hilbert space Hk contains a state vector |π which is

an eigenvector of Π with nonzero “momentum” π = 0

Π|π = (π1, π2)|πthen vector

e− iΘφ|π1, π2 = |π1 cos φ + π2 sin φ, π1 sin φ − π2 cos φ (5.58)

also belongs to the subspace Hk. The vectors (5.58) form a circle π21 +π22 = const in the 2D “momentum” plane. Therefore, the subspace Hk isinfinite-dimensional. If we used such a representation of the little group tobuild the unitary irreducible representation of the Poincare group, we wouldobtain massless particles having an infinite-dimensional subspace of internal(spin) degrees of freedom, or “continuous spin”. Such particles have not beenobserved in nature, so we will not discuss such possibility further. The onlycase having relevance to physics is the “zero-radius” circle π = 0. Then thesubspace Hk is one-dimensional, “translations” are represented trivially

e− i

Πr|π = 0 = |π = 0 (5.59)

and rotations around the z -axis are represented by unimodular factors.

e−iΘφ

|π = 0 ≡ e−iJ zφ

|π = 0 = eiτφ

|π = 0 (5.60)The allowed values of the parameter τ can be obtained from the fact thatthe representation must be single- or double-valued.22 Therefore, the rotation

22see Statement 3.2

Page 200: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 200/687

170 CHAPTER 5. SINGLE PARTICLES

by the angle φ = 2π can be represented by either 1 or -1, and τ must be

either integer or half-integer number: τ = . . . , −1, −1/2, 0, 1/2, 1, . . .. Wewill refer to the parameter τ as to helicity . This parameter distinguishesdifferent massless unitary irreducible representations of the Poincare group,i.e., different types of elementary massless particles.

5.4.4 Massless representations of the Poincare group

Now we want to build a basis in the Hilbert space of a massless particlewith helicity τ . First we choose an arbitrary basis vector |k in the (one-dimensional) subspace Hk with standard momentum. Similarly to what wedid in the massive case, we are going to propagate this vector to all values of

momentum using transformations from the Lorentz group and thus build abasis in the full Hilbert space H. So, we need to define a boost transformationλp which connects the standard momentum k with arbitrary momentum p.We will choose λp to be a Lorentz boost along the z-axis

Bp =

cosh θ 0 0 sinh θ0 1 0 00 0 1 0

sinh θ 0 0 cosh θ

(5.61)

followed by a rotation Rp which brings direction k = (0, 0, 1) to p

p

λp = RpBp (5.62)

(see Fig. 5.4). The rapidity θ = log(|p|) is such that the length of Bpk isequal to |p|. The absolute value of the rotation angle is

cos φ = (k · p

p) = pz/p (5.63)

and the direction of φ is

φ

φ=

[k × p

p]

sin φ

= ( p2x + p2y)−1/2(− py, px, 0)

Page 201: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 201/687

5.4. MASSLESS PARTICLES 171

ppxx

ppzz

00

pp

ΛΛpp

BBΛΛpp

BBpp

RRpp

RRΛΛpp

ΛΛ k =(0,0,1)

Figure 5.4: Each point p (except p = 0) in the momentum space of a masslessparticle can be reached from the standard vector k = (0, 0, 1) by applying aboost Bp along the z -axis followed by a rotation Rp.

The full basis |p in H is obtained by propagating the basis vector |k toother fixed momentum subspaces Hp (compare with eq. (5.4))

|p =1

|p|U (λp, 0, 0)|k (5.64)

Here we used notation from subsection 5.2.1 in which U (λp, 0, 0) denotesthe unitary representative of the Poincare transformation consisting of theLorentz transformation λp and zero translations in space and time. Now wecan consider how general elements of the Poincare group act on these basisvectors. Let us apply a general transformation from the Lorentz subgroupU (Λ, 0, 0) to an arbitrary basis vector |p

U (Λ, 0, 0)|p = U (Λ, 0, 0)U (λp, 0, 0)|k= U (λΛp, 0, 0)U (λ−1

Λp, 0, 0)U (Λ, 0, 0)U (λp, 0, 0)|k

= U (λΛp, 0, 0)U (λ−1

ΛpΛλp, 0, 0)|k= U (λΛp, 0, 0)U (B−1

ΛpR−1ΛpΛRpBp, 0, 0)|k

The product of boost transformations λ−1ΛpΛλp = B−1

ΛpR−1ΛpΛRpBp on the right

hand side brings vector k back to k (see Fig. 5.4), therefore this product

Page 202: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 202/687

172 CHAPTER 5. SINGLE PARTICLES

is an element of the little group. The “translation” part of this element is

irrelevant for us due to eq. (5.59). The relevant angle of rotation around thez -axis is called the Wigner angle φW (p, Λ).23 According to eq. (5.60), thisrotation acts as multiplication by a unimodular factor

U (λ−1ΛpΛλp, 0, 0)|k = eiτφW (p,Λ)|k

Thus, taking into account (5.64) we can write for arbitrary Lorentz transfor-mation Λ

U (Λ, 0, 0)|p= U (λΛp, 0, 0)eiτφW (p,Λ)|k

=

|Λp| |p| eiτφW (p,Λ)|Λp

For a general Poincare group transformation we obtain, similarly to eq.(5.20),

U (Λ, r, t)|p = |Λp|

|p|

e− ip·r+ ic

|p|teiτφW (p,Λ)|Λp (5.65)

As was mentioned in the beginning of this chapter, photons are describedby a reducible representation of the Poincare group which is a direct sumof two irreducible representations with helicities τ = 1 and τ = −1. In theclassical language these two irreducible components correspond to the leftand right circularly polarized light.

23Explicit expressions for the Wigner rotation angle can be found in [ 54, 58].

Page 203: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 203/687

Chapter 6

INTERACTION

I myself, a professional mathematician, on re-reading my own work find it strains my mental powers to recall to mind from the

figures the meanings of the demonstrations, meanings which I my-self originally put into the figures and the text from my mind. But when I attempt to remedy the obscurity of the material by putting in extra words, I see myself falling into the opposite fault of be-coming chatty in something mathematical.

Johannes Kepler

In the preceding chapter we discussed isolated elementary particles movingfreely in space. Starting from this chapter we will focus on compound systemsconsisting of two or more particles. In addition we will allow a redistribution

of energy and momentum between different parts of the system, in otherwords we will allow interactions . In this chapter, we will limit our analysis tothe cases in which creation and/or destruction of particles is not allowed, andwe will consider only few massive spinless particles. Starting from chapter 7we will lift these limitations.

173

Page 204: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 204/687

174 CHAPTER 6. INTERACTION

6.1 The Hilbert space of many-particle sys-

temIn this section we will construct the Hilbert space of a compound system. Inquantum mechanics textbooks it is usually tacitly assumed that this spaceshould be built as a tensor product of Hilbert spaces of the components. Herewe will show how this statement can be proven from postulates of quantumlogic.1 For simplicity, in this section we will work out the simplest caseof a two-particle system. These results will be generalized to the generaln-particle case at the end of this section.

6.1.1 The tensor product theoremLet L1, L2, and L be quantum propositional systems of particles 1, 2, and thecompound system 1+2, respectively. It seems reasonable to assume that eachproposition about subsystem 1 (or 2) is still valid in the combined system,so, it should be represented also as a proposition in L. Let us formulate thisidea as a new postulate

Postulate 6.1 (properties of compound systems) If L1 and L2 are quan-tum propositional systems describing two physical systems 1 and 2, and L is the quantum propositional system describing the compound system 1+2, then there exist two mappings

f 1 : L1 → Lf 2 : L2 → L

which satisfy the following conditions:(I) They preserve all logical relationships between propositions. So that

f 1(∅L1) = ∅Lf 1(

I L1

) =I L

and for any propositions x, y ∈ L11see sections 2.3 - 2.5

Page 205: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 205/687

6.1. THE HILBERT SPACE OF MANY-PARTICLE SYSTEM 175

x ≤ y ⇔ f 1(x) ≤ f 1(x)

f 1(x ∧ y) = f 1(x) ∧ f 1(y)

f 1(x ∨ y) = f 1(x) ∨ f 1(y)

f 1(x⊥) = (f 1(x))⊥

The same properties are valid for the mapping f 2 : L2 → L.(II) The results of measurements on two subsystems are independent.

This means that in the compound system all propositions about subsystem 1 are compatible with all propositions about subsystem 2:

f 1(x1) ↔ f 2(x2)

where x1 ∈ L1, x2 ∈ L1(III) If we have full information about subsystems 1 and 2, then we have

full information about the combined system. Therefore, if x1 ∈ L1 and x2 ∈L2 are atoms then the meet of their images f 1(x1) ∧ f 2(x2) is also an atomic proposition in L.

The following theorem [59, 60] allows us to translate the above propertiesof the compound system from the language of quantum logic to the more

convenient language of Hilbert spaces.

Theorem 6.2 (Matolcsi) Suppose that H1, H2, and H are three complex Hilbert spaces corresponding to the propositional lattices L1, L2, and L, re-spectively. Suppose also that f 1 and f 2 are two mappings satisfying all condi-tions from Postulate 6.1. Then the Hilbert space H of the compound system is either one of the four tensor products 2 H = H1 ⊗ H2, or H = H∗

1 ⊗ H2,or H = H1 ⊗ H∗

2, or H = H∗1 ⊗ H∗

2.

The proof of this theorem is beyond the scope of this book.So we have four ways to couple two one-particle Hilbert spaces into one

two-particle Hilbert space. Quantum mechanics uses only the first possibility

2For definition of the tensor product of two Hilbert spaces see Appendix F.4. The stardenotes a dual Hilbert space as described in Appendix F.3.

Page 206: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 206/687

176 CHAPTER 6. INTERACTION

H = H1⊗H2.3 This means that if particle 1 is in a state |1 ∈ H1 and particle

2 is in a state |2 ∈ H2, then the state of the compound system is describedby the vector |1 ⊗ |2 ∈ H1 ⊗ H2.

6.1.2 Particle observables in a multiparticle system

The mappings f 1 and f 2 from Postulate 6.1 map propositions (projections)from Hilbert spaces of individual particles H1 and H2 into the Hilbert spaceH = H1 ⊗ H2 of the compound system. Therefore, they also map particleobservables from H1 and H2 to H. For example, consider an 1-particle ob-servable G(1) that is represented in the Hilbert space H1 by the Hermitianoperator with a spectral decomposition (2.28)

G(1) =g

gP (1)g

Then the mapping f 1 transforms G(1) into a Hermitian operator f 1(G(1)) inthe Hilbert space H of the compound system

f 1(G(1)) =

ggf 1(P (1)g )

which has the same spectrum g as G(1). Then observables of individualparticles, e.g., P1, R1 in H1 and P2, R2 in H2 have well-defined meaning inthe Hilbert space H of the combined system.

In what follows we will use small letters to denote observables of individualparticles in H.4 For example, the position and momentum of the particle1 in the two-particle system will be denoted as p1 and r1. The operator of energy of the particle 1 will be written as h1 =

m21c4 + p21c

2, etc. Similarly,observables of the particle 2 in H will be denoted as p2, r2, and h2. Accordingto Postulate 6.1(II), spectral projections of observables of different particles

commute with each other in H. Therefore, observables of different particlescommute with each other as well.

3It is not yet clear what is the physical meaning of the other three possibilities.4We will keep using capital letters for the total observables (H , P, J, and K) of the

compound system.

Page 207: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 207/687

6.1. THE HILBERT SPACE OF MANY-PARTICLE SYSTEM 177

Just as in the single-particle case, two-particle states can be also described

by wave functions. From the properties of the tensor product of Hilbertspaces it can be derived that if ψ1(r1) is the wave function of particle 1and ψ2(r2) is the wave function of particle 2, then the wave function of thecompound system is simply a product

ψ(r1, r2) = ψ1(r1)ψ2(r2) (6.1)

In this case, both particles 1 and 2, and the compound system are in purequantum states. However, the most general pure 2-particle state in H1 ⊗H2 is described by a general (normalizable) function of two vector variablesψ(r

1, r2) which is not necessarily expressed in the product form (6.1). In this

case, individual subsystems are in mixed states: the results of measurementsperformed on the particle 1 are correlated with the results of measurementsperformed on the particle 2, even though the particles do not interact witheach other. The existence of such entangled states is a distinctive feature of quantum mechanics which is not present in the classical world.

6.1.3 Statistics

The above construction of the two-particle Hilbert space H = H1 ⊗ H2 isvalid when particles 1 and 2 belong to different species. If particles 1 and 2

are identical, then there are vectors in H1 ⊗ H2 that do not correspond tophysically realizable states, and the Hilbert space of states is “less” than H1⊗H2. Indeed, if two particles 1 and 2 are identical, then no measurable quantitywill change if these particles change places. Therefore, after permutationof two particles, the wave function may at most acquire an insignificantunimodular phase factor β

ψ(r2, r1) = βψ(r1, r2) (6.2)

If we swap the particles again then the original wave function must be re-

stored

ψ(r1, r2) = βψ(r2, r1)

= β 2ψ(r1, r2)

Page 208: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 208/687

178 CHAPTER 6. INTERACTION

Therefore β 2 = 1 which implies

β = ±1

We have shown that the factor β for any state ψ(r1, r2) in H can be either1 or -1. Is it possible that in a system of two identical particles one stateφ1(r1, r2) has factor β equal to 1

φ1(r1, r2) = φ1(r2, r1) (6.3)

and another state φ2(r1, r2) has factor β equal to -1?

φ2(r1, r2) = −φ2(r2, r1) (6.4)

If eqs (6.3) and (6.4) were true, then the linear combination of the states φ1and φ2

ψ(r1, r2) = aφ1(r1, r2) + bφ2(r1, r2)

would not transform like (6.2) after permutation

ψ(r2, r1) = aφ1(r2, r1) + bφ2(r2, r1)

= aφ1(r1, r2) − bφ2(r1, r2)

= ±ψ(r1, r2)

It then follows that the factor β must be the same for all states in the Hilbertspace H of the system of two identical particles. This result implies that allparticles in nature are divided in two categories: bosons and fermions .

For bosons β = 1 and two-particle wave functions are symmetric with re-spect to permutations. Wave functions of two bosons form a linear subspace

H1⊗sym H2 ⊂ H1⊗H2. This means, in particular, that two identical bosonsmay occupy the same quantum state, i.e., the wavefunction ψ(r1)ψ(r2) be-longs to the bosonic subspace H1 ⊗sym H2 ⊆ H1 ⊗ H2.

For fermions, β = −1 and two-particle wave functions are antisymmetric with respect to permutations. The Hilbert space of two identical fermions is

Page 209: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 209/687

6.2. RELATIVISTIC HAMILTONIAN DYNAMICS 179

the subspace of antisymmetric functions H1⊗asymH2 ⊂ H1⊗H2. This means,

in particular, that two identical fermions may not occupy the same quantumstate (this is called the Pauli exclusion principle ), i.e., the wavefunctionψ(r1)ψ(r2) do not belong to the fermionic subspace H1⊗asym H2 ⊆ H1⊗H2.

A remarkable spin-statistics theorem has been proven in the frameworkof quantum field theory. This theorem establishes (in full agreement withexperiment) that the symmetry of two-particle wave functions is related totheir spin: all particles with integer spin (e.g., photons) are bosons andall particles with half-integer spin (e.g., neutrinos, electrons, protons) arefermions.

All results of this section can be immediately generalized to the case of n-particle system, where n > 2. For example, the Hilbert space of n identical

bosons is the symmetrized tensor product H = H1⊗symH2⊗sym . . .⊗symHn,and the Hilbert space of n identical fermions is the antisymmetrized tensorproduct H = H1 ⊗asym H2 ⊗asym . . . ⊗asym Hn.

6.2 Relativistic Hamiltonian dynamics

To complete our description of the 2-particle system initiated in the precedingsection we need to specify an unitary representation U g of the Poincare groupin the Hilbert space H = H1 ⊗ H2.

5 We already know from chapter 4 thatgenerators of this representation (and functions of generators) define total

observables of the compound system. From subsection 6.1.2 we also knowhow to define observables of individual particles in H. If we assume thattotal observables in H may be expressed as functions of particle observablesp1, r1, p2, and r2, then the construction of U g is equivalent to finding 10Hermitian operator functions6

H (p1, r1, p2, r2) (6.5)

P(p1, r1, p2, r2) (6.6)

J(p1, r1, p2, r2) (6.7)

5For simplicity we will assume that particles 1 and 2 are massive, spinless, and notidentical.

6These formulas work fine in systems with fixed number of particles. We will see inchapter 7 that in the case of variable number of particles it will be more convenient toexpress total observables in terms of particle creation and annihilation operators.

Page 210: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 210/687

180 CHAPTER 6. INTERACTION

K(p1, r1, p2, r2) (6.8)

which satisfy commutation relations of the Poincare Lie algebra (3.52) -(3.58). Even in the two-particle case this problem does not have a uniquesolution, and additional physical principles should be applied to make surethat operators (6.5) - (6.8) are selected in agreement with observations. For ageneral multiparticle system, the construction of the representation U g is themost difficult and the most important part of relativistic quantum theories.A large portion of the rest of this book is devoted to the analysis of differentways to construct representation U g. It is important to understand that oncethis step is completed, we get everything we need for a full description of thephysical system and for comparison with experimental data.

6.2.1 Non-interacting representation of the Poincaregroup

There are infinitely many ways to define the representation U g of the Poincaregroup, in the Hilbert space H = H1 ⊗ H2. Let us start our analysis fromone legitimate choice which has a transparent physical meaning. We knowfrom chapter 5 that Hilbert spaces H1 and H2 carry irreducible unitary rep-resentations U 1g and U 2g of the Poincare group. Functions f 1 and f 2 defined insubsection 6.1.1 allow us to map these representations to the Hilbert space H

of the compound system, i.e., to have representations of the Poincare groupf 1(U 1g ) and f 2(U 2g ) in H. We can then define a new representation U 0g of thePoincare group in H by making a (tensor product) composition of f 1(U 1g )and f 2(U 2g ). More specifically, for any vector |1 ⊗ |2 ∈ H we define

U 0g (|1 ⊗ |2) = f 1(U 1g )|1 ⊗ f 2(U 2g )|2 (6.9)

and the action of U 0g on other vectors in H follows by linearity. Represen-tation (6.9) is called the tensor product of unitary representations U 1g andU 2g , and is denoted by U 0g = U 1g ⊗ U 2g . Generators of this representation are

expressed as sums of one-particle generators

H 0 = h1 + h2 (6.10)

P0 = p1 + p2 (6.11)

Page 211: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 211/687

6.2. RELATIVISTIC HAMILTONIAN DYNAMICS 181

J0 = j1 + j2 (6.12)

K0 = k1 + k2 (6.13)The Poincare commutation relations for generators (6.10) - (6.13) follow im-mediately from the fact that one-particle generators corresponding to parti-cles 1 and 2 satisfy Poincare commutation relation separately and commutewith each other, according to Postulate 6.1(II).

With definitions (6.10) - (6.13), the inertial transformations of particleobservables with respect to the representation U 0g are easy to find. For ex-ample, the positions of particles 1 and 2 change with time as

r1(t) = eiH 0tr1e

− iH 0t

= e ih1tr1e

− ih1t

= r1 + v1t

r2(t) = r2 + v2t

Comparing this with eq. (4.53) we conclude that all observables of particles1 and 2 transform independently from each other as if these particles werealone. So, the representation (6.10) - (6.13) corresponds to the absence of interaction and is called the non-interacting representation of the Poincaregroup.

6.2.2 Dirac’s forms of dynamics

Obviously, the simple choice of generators (6.10) - (6.13) is not realistic, be-cause particles in nature do interact with each other. Therefore, to describeinteractions in multi-particle systems one should choose an interacting repre-sentation U g of the Poincare group in H which is different from U 0g . First wewrite the generators (H, P, J, K) of the desired representation U g in the mostgeneral form where all generators are different from their non-interactingcounterparts by the presence of interaction terms denoted by deltas7

7

Our approach to the description of interactions based on eqs (6.14) - (6.17) and theirgeneralizations for multiparticle systems is called the relativistic Hamiltonian dynamics

[53]. For completeness, we should mention that there is a number of other methods fordescribing interactions which can be called non-Hamiltonian. Overviews of these methodsand further references can be found in [61, 62, 63]. We will not discuss the non-Hamiltonianapproaches in this book.

Page 212: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 212/687

182 CHAPTER 6. INTERACTION

H = H 0 + ∆H (r1, p1, r2, p2) (6.14)

P = P0 + ∆P(r1, p1, r2, p2) (6.15)

J = J0 + ∆J(r1, p1, r2, p2) (6.16)

K = K0 + ∆K(r1, p1, r2, p2) (6.17)

It may happen that some of the deltas on the right hand sides of eqs. (6.14)- (6.17) are zero. Then these generators and transformations coincide withthe generators and transformations of the non-interacting representation U 0g .Such generators and transformations are called kinematical . Generators

which contain interaction terms are called dynamical .

Table 6.1: Comparison of three relativistic forms of dynamicsInstant form Point form Front form

Kinematical generators(P x)0 (K x)0 (P x)0(P y)0 (K y)0 (P y)0(P z)0 (K z)0

1√ 2

(H 0 + (P z)0)

(J x)0 (J x)01√ 2

((K x)0 + (J y)0)

(J y)0 (J y)01√ 2

((K y)0−

(J x)0)

(J z)0 (J z)0 (J z)0(K z)0

Dynamical generatorsH H 1√

2(H − P z)

K x P x1√ 2

(K x − J y)

K y P y1√ 2

(K y + J x)

K z P z

The description of interaction by eqs (6.14) - (6.17) generalizes traditional

classical non-relativistic Hamiltonian dynamics in which the only dynamicalgenerator is H . To make sure that our theory reduces to the familiar non-relativistic approach in the limit c → ∞, we will also assume that ∆H = 0.The choice of other generators is restricted by the observation that kinemat-ical transformations should form a subgroup of the Poincare group, so that

Page 213: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 213/687

6.2. RELATIVISTIC HAMILTONIAN DYNAMICS 183

kinematical generators should form a subalgebra of the Poincare Lie algebra.8

The set (P, J, K) does not form a subalgebra. This explains why in the rel-ativistic case we cannot introduce interaction in the Hamiltonian alone. Wemust add interaction terms to some of the other generators P, J, or K inorder to be consistent with relativity. We will say that interacting represen-tations having different kinematical subgroups belong to different forms of dynamics . In his famous paper [3], Dirac provided a classification of forms of dynamics based on this principle. There are three Dirac’s forms of dynamicsmost frequently discussed in the literature. They are shown in Table 6.1.In the case of the instant form dynamics the kinematical subgroup is thesubgroup of spatial translations and rotations. In the case of the point form dynamics the kinematical subgroup is the Lorentz subgroup [64]. In both

mentioned cases the kinematical subgroup has dimension 6. The front form dynamics has the largest number (7) of kinematical generators.

6.2.3 Total observables in a multiparticle system

Once the interacting representation of the Poincare group and its generators(H, P, J, K) are defined, we immediately have expressions for total observ-ables of the physical system considered as a whole. These are the total energy H , the total momentum P, and the total angular momentum J. Other totalobservables of the system (the total mass M , the total spin S, the center-of-

mass position R, etc.) can be obtained as functions of these generators byformulas derived in chapter 4.

Note also that inertial transformations of the total observables (H, P, J, K)coincide with those presented in chapter 4. For example, the total energy H and the total momentum P form a 4-vector whose boost transformations aregiven by eq. (4.3) - (4.4). Boost transformations of the center-of-mass posi-tion R are derived in subsection 4.3.8. Time translations result in a uniformmovement of the center-of-mass with constant velocity along a straight line(4.53). Thus we conclude that inertial transformations of total observablesare completely independent on the form of dynamics and on the details of interactions acting within the multiparticle system.

There are, however, properties that are dependent on the interaction.One such property is the relationships between total and one-particle observ-

8Indeed, if two generators A and B do not contain interaction terms, then their com-mutator [A, B] should be interaction-free as well.

Page 214: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 214/687

184 CHAPTER 6. INTERACTION

ables. For example, in the instant form of dynamics the total momentum

operator is just the sum of momenta of constituent particles (6.11). In thepoint form of dynamics, there is an additional interaction term that dependson one-particle observables

P = p1 + p2 + ∆P(r1, p1, r2, p2)

Inertial transformations of observables of individual particles (r1, p1, r2, p2, . . .)also depend on the form of dynamics. They will be discussed in section 10.2.

6.3 The instant form of dynamics

6.3.1 General instant form interaction

In this book we will work exclusively in the Dirac’s instant form of dynamics.9

Then we can rewrite eqs (6.14) - (6.17) as

H = H 0 + V (6.18)

P = P0 (6.19)

J = J0 (6.20)

K = K0 + Z (6.21)

where we denoted V ≡ ∆H and Z ≡ ∆K. As we discussed in subsections4.1.1 and 6.2.3, the observables H, P, J, and K are total observables thatcorrespond to the compound system as a whole. The total momentum P, andthe total angular momentum J are simply vector sums of the correspondingoperators for individual particles. The total energy H and the boost operatorK are equal to the sum of one-particle operators plus interaction terms. Theinteraction term V in the energy operator is usually called the potential energy operator. Similarly, we will call Z the potential boost . Other totalobservables (e.g., the total mass M , the total spin S, the center-of-mass

position R, and its velocity V, etc.) are defined as functions of generators(6.18) - (6.21) by formulas from chapter 4. For interacting systems, theseobservables may become interaction-dependent as well.

9In section 10.2 and in subsection 10.5.7 we will discuss the reasons why this is a goodchoice.

Page 215: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 215/687

6.3. THE INSTANT FORM OF DYNAMICS 185

Commutation relations (3.52) - (3.58) imply

[J0, V ] = [P0, V ] = 0 (6.22)

[Z i, (P 0) j ] = −i δ ijc2

V (6.23)

[(J 0)i, Z j] = i ǫijkZ k (6.24)

[(K 0)i, Z j] + [Z i, (K 0) j ] + [Z i, Z j] = 0 (6.25)

[Z, H 0] + [K0, V ] + [Z, V ] = 0 (6.26)

So, the task of constructing a Poincare invariant theory of interacting parti-cles has been reduced to finding a non-trivial solution for the set of equations

(6.22) - (6.26) with respect to V and Z. These equations are necessary andsufficient conditions for the Poincare invariance of our theory.

6.3.2 The Bakamjian-Thomas construction

The set of equations (6.22) - (6.26) is rather complicated. The first non-trivialsolution of these equations for multiparticle systems was found by Bakamjianand Thomas [4]. The idea of their approach is as follows. Instead of work-ing with 10 generators (P, J, K, H ), it is convenient to use an alternativeset of operators P, R, S, M .10 Denote P0, R0, S0, M 0 and P0, R, S, M the sets of operators obtained by using formulas (4.39) - (4.42) from the non-interacting (P0, J0, K0, H 0) and interacting (P0, J0, K, H ) generators, respec-tively. In a general instant form dynamics all three operators R, S, and M may contain interaction terms. However, Bakamjian and Thomas decided tolook for a simpler solution in which the position operator remains kinematicalR = R0. It then immediately follows that the spin operator is kinematicalas well

S = J − R × P

= J0

−R0

×P0

= S0

so the interaction U is present in the mass operator only.

10see subsection 4.3.4

Page 216: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 216/687

186 CHAPTER 6. INTERACTION

M = M 0 + U

From commutators (4.37), the interaction term U must satisfy

[P0, U ] = [R0, U ] = [J0, U ] = 0 (6.27)

So, we have reduced our task of finding 4 interaction operators V and Zsatisfying complex equations (6.22) - (6.26) to a simpler problem of find-ing one operator U satisfying conditions (6.27). Indeed, by knowing U andnon-interacting operators M 0, P0, R0, S0, we can restore the interacting

generators using formulas (4.43) - (4.45)

P = P0 (6.28)

H = +

M 2c4 + P20c2 (6.29)

K = − 1

2c2(R0H + H R0) − P0 × S0

Mc2 + H (6.30)

J = J0 = R0 × P0 + S0 (6.31)

Now let us turn to the construction of U in the case of two massive spinless

particles. Suppose that we found two vector operators π and ρ (they will becalled relative momentum and relative position , respectively) such that theyform a 6-dimensional Heisenberg Lie algebra11

[πi, ρ j] = i δ ij (6.32)

[πi, π j] = [ρi, ρ j ] = 0 (6.33)

commuting with the center-of-mass position R0 and the total momentum P0.

[π, P0] = [π, R0] = [ ρ, P0] = [ ρ, R0] = 0 (6.34)Suppose also that these relative operators have the following non-relativistic(c → ∞) limits

11see Appendix H.2

Page 217: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 217/687

6.3. THE INSTANT FORM OF DYNAMICS 187

π → p1 − p2

ρ → r1 − r2

Then any operator in the Hilbert space H can be expressed either as a func-tion of (p1, r1, p2, r2) or as a function of (P0, R0, π, ρ). Moreover, the interac-tion operator U satisfying conditions [U, P0] = [U, R0] = 0 can be expressedas a function of π and ρ only. To satisfy the last condition [U, J0] = 0 wewill simply require U to be a function of rotationally invariant combinationsof the relative position and momentum

U = U (π2, ρ2, π · ρ) (6.35)

In this ansatz , the problem of building a relativistically invariant interactionhas reduced to finding operators of relative positions ρ and momenta π satis-fying eqs (6.32) - (6.34). This problem has been solved in a number of works[4, 65, 66, 67]. We will not need explicit formulas for the operators of relativeobservables, so we will not reproduce them here.

For systems of n massive spinless particles (n > 2) similar argumentsapply, but instead of one pair of relative operators π and ρ we will have n−1pairs,

πr, ρr, r = 1, 2, . . . , n − 1 (6.36)

These operators should form a 6(n−1)-dimensional Heisenberg algebra com-muting with P0 and R0. Explicit expressions for πr and ρr were constructed,e.g., in ref. [68]. As soon as these expressions are found, we can build aBakamjian-Thomas interaction in an n-particle system by defining the inter-action U in the mass operator as a function of rotationally invariant combi-nations of relative operators (6.36)

U = U (π21 , ρ21, π1·

ρ1, π22, ρ22, π2·

ρ2, π1·

ρ2, π2·

ρ1, . . .) (6.37)

This form agrees with the relativistic idea that there are no preferred posi-tions and velocities and that interactions between particles must depend ontheir relative observables.

Page 218: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 218/687

188 CHAPTER 6. INTERACTION

6.3.3 Bakamjian’s construction of the point form dy-

namicsThe Bakamjian-Thomas method of modifying the mass operator can be usedto construct non-trivial Poincare-invariant interactions in non-instant formsof dynamics as well. In this subsection we would like to construct a particularversion of the point form dynamics using the Bakamjian’s prescription [69].We start from non-interacting operators of mass M 0, linear momentum P0,angular momentum J0, position R0, and spin S0 = J0 − [R0 × P0], andintroduce two new operators

Q0 =

P0M 0c2

X0 = M 0c2R0

which satisfy canonical commutation relations

[(X 0)i, (Q0) j] = i δ ij .

Then we can express generators of the non-interacting representation of the Poincare group in the following form (compare with (6.28) - (6.31))

P0 = M 0c2Q0

J0 = [X0 × Q0] + S0

K0 = −1

2(

1 + Q20X0 + X0

1 + Q2

0) − [Q0 × S0]

1 +

1 + Q20

H 0 = M 0c2

1 + Q20.

A point form interaction can be now introduced by modifying the mass op-erator M 0 → M provided that the following conditions are satisfied12

[M, Q0] = [M, X0] = [M, S0] = 0

12Just as in subsection 6.3.2, these conditions can be satisfied by defining M as a functionof relative position and momentum operators commuting with Q0 and X0.

Page 219: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 219/687

6.3. THE INSTANT FORM OF DYNAMICS 189

These conditions, in particular, guarantee that M is Lorentz invariant

[M, K0] = [M, J0] = 0.

This modification of the mass operator introduces interaction in generatorsof the translation subgroup

P = Mc2Q0

H = Mc2

1 + (Q0)2 (6.38)

while Lorentz subgroup generators K0 and J0 remain interaction-free.

6.3.4 Non-Bakamjian-Thomas instant forms of dynam-ics

In the Bakamjian-Thomas construction, it was assumed that R = R0, butthis limitation is rather artificial, and we will see later that realistic particleinteractions do not satisfy this condition. Any other variant of the instantform dynamics has position operator R different from the non-interacting

Newton-Wigner position R0. Let us now establish a connection betweensuch a general instant form interaction and the Bakamjian-Thomas interac-tion. We are going to demonstrate that corresponding representations of thePoincare group are related by a unitary transformation.

Suppose that operators

(P0, J0, K, H ) (6.39)

define a Bakamjian-Thomas dynamics, and W is a unitary operator com-muting with P0 and J0.13 Then a non-Bakamjian-Thomas instant form of

dynamics (P0, J0, K′, H ′) can be build by applying the transformation W tooperators (6.39).

13In the case of two massive spinless particles such an operator must be a function of rotationally invariant combinations of vectors P0, π, and ρ.

Page 220: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 220/687

190 CHAPTER 6. INTERACTION

J0 = W J0W −1 (6.40)

P0 = W P0W −1 (6.41)

K′ = W KW −1 (6.42)

H ′ = W HW −1 (6.43)

Since unitary transformations preserve commutators

W [A, B]W −1 = [W AW −1, W B W −1]

the transformed generators (6.40) - (6.43) satisfy commutation relations of the Poincare Lie algebra in the instant form. However, generally, the newmass operator M ′ = c−2

(H ′)2 − P20c2 does not commute with R0, so (6.40)

- (6.43) are not in the Bakamjian-Thomas form.The above construction provides a way to build a non-Bakamjian-Thomas

instant form representation (P0, J0, K′, H ′) if a Bakamjian-Thomas represen-tation (P0, J0, K, H ) is given. However, this construction does not answer thequestion if all instant form interactions can be connected to the Bakamjian-Thomas dynamics by a unitary transformation? The answer to this questionis “yes”: For any instant form interaction P0, R′, S′, M ′ one can find a

unitary operator W which transforms it to the Bakamjian-Thomas form [8]

W −1P0, R′, S′, M ′W = P0, R0, S0, M (6.44)

To see that, let us consider the simplest two-particle case. Operator

T ≡ R′ − R0

commutes with P0. Therefore, it can be written as a function of P0 and

relative operators π and ρ: T(P0, π, ρ). Then one can show that unitaryoperator14

14The integral in (6.45) can be treated as integral of ordinary function (rather thanoperator) along the segment [0,P0] in the 3D space of variable P0 with arguments π, and ρ being fixed.

Page 221: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 221/687

6.3. THE INSTANT FORM OF DYNAMICS 191

W = eiW

W =

P0

0

T(P0, π, ρ)dP0 (6.45)

performs the desired transformation (6.44). Indeed

W −1P0W = P0

W −1J0W = J0

because W is a scalar, which explicitly commutes with P0. Operator W hasthe following commutators with the center-of-mass position

[W , R0] = −[R0,

P0

0

T(P0, π, ρ)dP0]

= −i ∂

∂ P0(

P0

0

T(P0, π, ρ)dP0)

= −i T(P0, π, ρ)

= −i (R′ − R0)

[W , [W , R0]] = 0

Therefore

W R0W −1 = eiW R0e

− iW

= R0 +i

[W , R0] − 1

2! 2[W , [W , R0]] + . . .

= R0 + (R′ − R0)

= R′

W −1R′W = R0

and

W −1S′W = W −1(J0 − R′ × P0)W

= J0 − R0 × P0

= S0

Page 222: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 222/687

192 CHAPTER 6. INTERACTION

Finally we can apply transformation W to the mass operator M ′ and obtain

operator

M = W −1M ′W

which commutes with R0 and P0. This demonstrates that operators onthe right hand side of (6.44) describe a Bakamjian-Thomas instant form of dynamics.

6.3.5 Cluster separability

As we saw above, the requirement of Poincare invariance imposes rather loose

conditions on interaction (e.g., the interaction term U in the mass operatorM = M 0 + U must be a rotationally invariant function of relative variables).However, there is another physical requirement which limits the admissibleform of interaction. We know from experiment that all interactions betweenparticles vanish when particles are separated by large distances.15 So, if in a 2-particle system we remove particle 2 to infinity by using the spacetranslation operator e

ip2a the interaction (6.35) must tend to zero

lima→∞

e− ip2aU (π2, ρ2, π · ρ)e

ip2a = 0 (6.46)

This condition is not difficult to satisfy in the two-particle case. However, in

the relativistic multi-particle case the mathematical form of this conditionbecomes rather complicated. This is because now there is more than oneway to separate particles in mutually non-interacting groups. The form of the n-particle interaction (6.37) must ensure that each spatially separatedm-particle group (m < n) behaves as if it were alone. This, in particular,implies that we cannot independently choose interactions in systems withdifferent number of particles. The interaction in the n-particle sector of thetheory must be consistent with interactions in all m-particle sectors, wherem < n.

We will say that a multi-particle interaction is cluster separable if for any

division of the n-particle system (n ≥ 2) into two spatially separated groups(or clusters ) of l and m particles (l + m = n)

15We are not considering here a hypothetical potential between quarks which supposedlygrows as a linear function of the distance and results in the confinement of quarks insidehadrons.

Page 223: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 223/687

6.3. THE INSTANT FORM OF DYNAMICS 193

• (1) the interaction separates too, i.e., the clusters move independently

from each other;• (2) the interaction in each cluster is the same as in separate l-particle

and m-particle systems, respectively.

We will postulate that all interaction in nature have these properties

Postulate 6.3 (cluster separability of interactions) : All interactions are cluster separable.

A counterexample of a non-separable interaction can be built in the 4-particle case. The interaction Hamiltonian

V = |r1 − r2|−1|r3 − r4|−1 (6.47)

has the property that no matter how far two pairs of particles (1+2 and 3+4)are from each other, the relative distance between 3 and 4 affects the forceacting between particles 1 and 2. Such infinite-range interactions are notpresent in nature.

In the non-relativistic case the cluster separability is achieved withoutmuch effort. For example, the non-relativistic Coulomb potential energy inthe system of two charged particles is16

V 12 =1| ρ|

=1

|r1 − r2| (6.48)

which clearly satisfies condition (6.46). In the system of three charged par-ticles 1, 2, and 3, the potential energy can be written as a simple sum of two-particle terms

V = V 12 + V 13 + V 23

=1

|r1 − r2| +1

|r2 − r3| +1

|r1 − r3| (6.49)

16Here we are interested just in the general functional form of interaction, so we are notconcerned with putting correct factors in front of the potentials.

Page 224: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 224/687

194 CHAPTER 6. INTERACTION

The spatial separation between particle 3 and the cluster of particles 1+2 can

be increased by applying a large space translation to the particle 3. In agree-ment with Postulate 6.3, such a translation effectively cancels interactionbetween particles in clusters 3 and 1+2, i.e.

lima→∞

eip3a(V 12 + V 13 + V 23)e− i

p3a

= lima→∞

1

|r1 − r2| +1

|r2 − r3 + a| +1

|r1 − r3 + a|=

1

|r1 − r2|

Therefore, interaction (6.49) is cluster separable. As we will see below, inthe relativistic case construction of a general cluster-separable multi-particleinteraction is more difficult.

Let us now make some definitions which will be useful in discussionsof cluster separability. A smooth m-particle potential V (m) is defined asoperator that depends on variables of m particles and tends to zero if anyparticle or a group of particles is removed to infinity.17 For example, thepotential (6.48) is smooth while (6.47) is not. Generally, a cluster separableinteraction in a n-particle system can be written as a sum

V =2

V (2) +3

V (3) + . . . + V (n) (6.50)

where

2 V (2) is a sum of smooth 2-particle potentials over all pairs of

particles;

3 V (3) is a sum of smooth 3-particle potentials over all triples

of particles, etc. An example is given by eq. (6.49).

6.3.6 Non-separability of the Bakamjian-Thomas dy-namics

We expect that Postulate 6.3 applies to both potential energy and potentialboosts. For example, in the relativistic case of 3 massive spinless particleswith interacting generators

17In section 7.4 we will explain why we call such potentials smooth.

Page 225: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 225/687

6.3. THE INSTANT FORM OF DYNAMICS 195

H = H 0 + V (p1, r1, p2, r2, p3, r3)

K = K0 + Z(p1, r1, p2, r2, p3, r3)

the cluster separability requires, in particular, that

lima→∞

eip3aV (p1, r1, p2, r2, p3, r3)e−i i

p3a = V 12(p1, r1, p2, r2) (6.51)

lima→∞

eip3aZ(p1, r1, p2, r2, p3, r3)e− i

p3a = Z12(p1, r1, p2, r2) (6.52)

where V 12 and Z12 are interaction operators for the 2-particle system.

Let us see if these principles are satisfied for interactions built by theBakamjian-Thomas prescription. In this case the potential energy is

V = H − H 0

=

(p1 + p2 + p3)2c2 + (M 0 + U (p1, r1, p2, r2, p3, r3))2c4

(p1 + p2 + p3)2c2 + M 20 c4

By removing particle 3 to infinity we obtain

lima→∞ eip3aV (p1, r1, p2, r2, p3, r3)e−

ip3a

=

(p1 + p2 + p3)2c2 + (M 0 + U (p1, r1, p2, r2, p3, ∞))2c4

(p1 + p2 + p3)2c2 + M 20 c4 (6.53)

According to (6.51) we should require that the right hand side of eq. (6.53)depends only on the variables of particles 1 and 2. Then we must set

U (p1, r1, p2, r2, p3, ∞) = 0

which also means that

V (p1, r1, p2, r2, p3, ∞) = V 12(p1, r1, p2, r2)

= 0

Page 226: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 226/687

196 CHAPTER 6. INTERACTION

Similarly, we can show that interaction V tends to zero when either particle

1 or particle 2 is removed to infinity. Therefore, interaction V is a smooth 3-particle potential. According to the Postulate 6.3, this means that there is nointeraction in any 2-particle subsystem: the interaction turns on only if thereare three or more particles close to each other, which is clearly unphysical.So, we conclude that the Bakamjian-Thomas construction cannot describea non-trivial cluster-separable interaction in many-particle systems (see also[70]).

6.3.7 Cluster separable 3-particle interaction

The problem of constructing relativistic cluster separable many-particle in-

teractions can be solved by allowing non-Bakamjian-Thomas instant form in-teractions. Our goal here is to construct the interacting Hamiltonian H andboost K operators in the Hilbert space H = H1⊗H2⊗H3 of a 3-particle sys-tem such that it satisfies Postulate 6.3, i.e., reduces to a non-trivial 2-particleinteraction when one of the particles is removed to infinity. The same prob-lem can be alternatively formulated as the problem of relativistic addition of interactions. We saw in the example (6.49) that in non-relativistic physics acluster separable 3-particle interaction can be simply constructed by additionof 2-particle terms. The construction below is a generalization of this resultto the relativistic case. In this construction we follow ref. [8].

Let us assume that 2-particle potentials V ij and Zij , i, j = 1, 2, 3 result-

ing from removing particle k = i, j to infinity are known (they depend onvariables of the i-th and j-th particles only). For example, when particle 3is removed to infinity, the interacting operators take the form

lima→∞

eip3aHe− i

p3a = H 0 + V 12 ≡ H 12 (6.54)

lima→∞

eip3aKe− i

p3a = K0 + Z12 ≡ K12 (6.55)

lima→∞

eip3aMe− i

p3a =

1

c2

H 212 − P20c

2 ≡ M 12 (6.56)

lima→∞ e

ip3a

Re−ip3a

= −c2

2 (K12H 12 + H 12K12) −cP0

×W12

M 12H 12(M 12c2 + H 12)

≡ R12 (6.57)

where operators H 12, K12, M 12, and R12 (energy, boost, mass, and center-of-mass position, respectively) will be considered as given. Similar equations

Page 227: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 227/687

6.3. THE INSTANT FORM OF DYNAMICS 197

result from the removal of particles 2 or 3 to infinity. They are obtained

from (6.54) - (6.57) by permutation of indices (1,2,3). Now we want tocombine the two-particle potentials V ij and Zij together in a cluster-separable3-particle interaction in analogy with (6.49). It appears that we cannotform the interactions V and Z in the 3-particle system simply as a sum of 2-particle potentials. One can verify that such a definition would violatePoincare commutators. Therefore

V = V 12 + V 23 + V 13

Z = Z12 + Z23 + Z13

and the relativistic “addition of interactions” should be more complicated.When particles 1 and 2 are split apart, operators V 12 and Z12 must tend

to zero, therefore

lima→∞

eip1aM 12e

− ip1a = M 0

lima→∞

eip2aM 12e

− ip2a = M 0

lima→∞

eip3aM 12e

− ip3a = M 12

The Hamiltonian H 12 and boost K12 define an instant form representationU 12 of the Poincare group in the Hilbert space H. The corresponding positionoperator (6.57) is generally different from the non-interacting Newton-Wignerposition operator

R0 = −c2

2(K0H 0 + H 0K0) − cP0 × W0

M 0H 0(M 0 + H 0)(6.58)

which is characteristic for the Bakamjian-Thomas form of dynamics. How-ever, we can unitarily transform the representation U 12, so that it acquires aBakamjian-Thomas form with operators R0, H 12, K12, M 12.

18 Let us denotesuch an unitary transformation operator by B12. We can repeat the samesteps for two other pairs of particles 1+3 and 2+3, and write in the generalcase i, j = 1, 2, 3; i = j

18see subsection 6.3.4

Page 228: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 228/687

198 CHAPTER 6. INTERACTION

BijRijB−1ij = R0

BijH ijB−1ij = H ij

BijKijB−1ij = Kij

BijM ijB−1ij = M ij

Operators Bij = B12, B13, B23 commute with P0 and J0. Since representa-tion U ij becomes non-interacting when the distance between particles i and

j tends to infinity, we can write

lima→∞

e ip3aB13e− i

p3a = 1 (6.59)

lima→∞

eip3aB23e

− ip3a = 1 (6.60)

lima→∞

eip3aB12e

− ip3a = B12 (6.61)

The transformed Hamiltonians H ij and boosts Kij define Bakamjian-Thomasrepresentations, and their mass operators M ij now commute with R0. So,we can add M ij together to build a new mass operator

M = M 12 + M 13 + M 23 − 2M 0

= B12M 12B−112 + B13M 13B

−113 + B23M 23B−1

23 − 2M 0

which also commutes with R0. Using this mass operator, we can build aBakamjian-Thomas representation with generators.

H =

P20 + M

2(6.62)

K = − 1

2c2(R0H + H R0) − cP0 × W0

MH (Mc2 + H )(6.63)

This representation has interactions between all particles, however, it doesnot satisfy the cluster property yet. For example, by removing particle 3 toinfinity we do not obtain the interaction M 12 characteristic for the subsystemof two particles 1 and 2. Instead, we obtain a unitary transform of M 12

Page 229: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 229/687

6.3. THE INSTANT FORM OF DYNAMICS 199

lima→∞ e ip3aMe− ip3a

= lima→∞

eip3a(B12M 12B

−112 + B13M 13B−1

13 + B23M 23B−123 − 2M 0)e− i

p3a

= B12M 12B−112 − 2M 0 + lim

a→∞(e

ip3aM 13e

− ip3a + e

ip3aM 23e

− ip3a)

= B12M 12B−112 − 2M 0 + 2M 0

= B12M 12B−112 (6.64)

To fix this deficiency, let us perform a unitary transformation of the repre-sentation (6.62) - (6.63) with operator B (which must commute with P0 and

J0, of course, to preserve the instant form of the interaction)

H = B−1HB (6.65)

K = B−1KB (6.66)

M = B−1MB (6.67)

We choose the transformation B from the requirement that it must cancelfactors Bij and B−1

ij in eq. (6.64) as particle k moves to infinity. In otherwords, this operator must be unitary and have the following limits

lima→∞

eip3aBe− i

p3a = B12 (6.68)

lima→∞

eip2aBe− i

p2a = B13 (6.69)

lima→∞

eip1aBe− i

p1a = B23 (6.70)

Otherwise the operator is arbitrary. One can check that one possible choiceis

B = exp(log B12 + log B13 + log B23)

Indeed, using eqs. (6.59) - (6.61) we obtain

lima→∞

eip3aBe− i

p3a

Page 230: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 230/687

200 CHAPTER 6. INTERACTION

= lima→∞

eip3a exp(log B12 + log B13 + log B23)e− i

p3a

= exp(log B12)= B12

Then, it is easy to show that the interacting representation of the Poincaregroup generated by operators (6.65) and (6.66) satisfies cluster separabilityproperties (6.54) - (6.57). For example,

lima→∞

eip3aHe− i

p3a = lim

a→∞eip3aB−1HBe− i

p3a

= lima→∞

B−112 e

ip3a

P20c2 + M

2c4e− i

p3aB12

= B−112

P20c

2 + (B12M 12B−112 )2c4B12

=

P20c2 + M 212c

4

= H 12

Generally, operator B does not commute with the Newton-Wigner positionoperator (6.58). Therefore, the mass operator (6.67) also does not commutewith R0, and the representation generated by operators (P0, J0, K, H ) doesnot belong to the Bakamjian-Thomas form.

6.4 Dynamics and bound states

Suppose we constructed an interacting representation of the Poincare groupU g in the Hilbert space H of multiparticle system either by following theabove Bakamjian-Thomas prescription or by other means.19 What can welearn from this construction about the physical system? We already men-tioned that the knowledge of U g is sufficient for getting any desired phys-ical information about the system. In this section, we would like to makethis statement more concrete by examining two types of information whichcan be compared with experiment: the mass and energy spectra, and the

time evolution of observables. In the next section we will discuss descriptionof scattering experiments, which are currently the most informative way of studying microscopic systems.

19See chapters 8 and 9 where we discuss a field-theoretic construction of relativisticparticle interactions.

Page 231: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 231/687

6.4. DYNAMICS AND BOUND STATES 201

6.4.1 Mass and energy spectra

The mass operator of a non-interacting 2-particle system is

M 0 = +1

c2

H 20 − P20c

2

= +1

c2

(h1 + h2)2 − (p1 + p2)2c2

= +1

c2

(

m21c4 + p21c2 +

m22c4 + p22c

2)2 − (p1 + p2)2c2(6.71)

As particle’s momenta can have any value in the 3D momentum space, the

eigenvalues m of the mass operator have continuous spectrum in the range

m1 + m2 ≤ m < ∞ (6.72)

where the minimum value of mass m1 + m2 is obtained from (6.71) whenboth particles are at rest p1 = p2 = 0. It then follows that the commonspectrum of mutually commuting operators P0 and

H 0 = +

M 20 c4 + P20c

2

is the union of mass hyperboloids20 in the 4-dimensional momentum-energyspace. This spectrum is shown by the hatched region in Fig. 6.1(a).

In the presence of interaction, the eigenvalues µn of the mass operatorM = M 0 + U can be found by solving the stationary Schrodinger equation

M |Ψn = µn|Ψn (6.73)

It is well-known that in the presence of attractive interaction U , new discreteeigenvalues in the mass spectrum may appear below the threshold m1 + m2.The eigenvectors of the interacting mass operator with eigenvalues µn < m1+

m2 are called bound states . The mass eigenvalues µn are highly degenerated.For example, if |Ψn is an eigenvector corresponding to µn, then for anyPoincare group element g the vector U g|Ψn is also an eigenvector with the

20with masses in the interval (6.72)

Page 232: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 232/687

202 CHAPTER 6. INTERACTION

(m11+m

22)c22

PPxxcc

HH

00

(m11+m

22)c22

PPxxcc

HH

00

(a) (b)

Figure 6.1: Typical momentum-energy spectrum of (a) non-interacting and(b) interacting two-particle system.

same mass eigenvalue. To remove this degeneracy (at least partially) one canconsider operators P0 and H , which commute with M and among themselves,so that they define a basis of common eigenvectors

M |Ψ(p)n = µn|Ψ(p)nP0|Ψ(p)n = p|Ψ(p)nH

|Ψ(p)

n =

√ M 2c4 + P2c2

|Ψ(p)

n

= µ2nc4 + p2c2|Ψ(p)nThen sets of common eigenvalues of P0 and H with fixed µn form hyper-boloids

hn =

µ2nc4 + p2c2

which are shown in Fig. 6.1(b) below the continuous part of the commonspectrum of P0 and H . An example of a bound system whose mass spectrumhas both continuous and discrete parts - the hydrogen atom - is consideredin greater detail in subsection 9.3.4.

6.4.2 The Doppler effect revisited

In our discussion of the Doppler effect in subsection 5.4.2 we were interestedin the energy of free photons measured by moving observers or emitted by

Page 233: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 233/687

6.4. DYNAMICS AND BOUND STATES 203

moving sources. There we applied boost transformations to the energy E of

a free massless photon. It is instructive to look at this problem from anotherpoint of view. Photons are usually emitted by compound massive physicalsystems (atoms, ions, nuclei, etc.) in a transition between two discrete energylevels E 2 and E 1, so that the photon’s energy is21

E = E 2 − E 1

When the source is moving with respect to the observer (or observer ismoving with respect to the source), the energies of levels 1 and 2 experienceinertial transformations given by formula (4.4). Therefore, for consistency,we need to prove that the Doppler shift calculated with this formula is exactly

the same as that obtained in subsection 5.4.2.Suppose that the compound system has two bound states characterized

by mass eigenvalues m1 and m2 > m1 (see Fig. 6.2). Suppose also thatinitially the system is in the state with mass m2, total momentum p2, andenergy E 2 =

m22c4 + p22c

2. In the final state we have the same systemwith the lower mass m1, different total momentum p1 and energy E 1 =

m21c4 + p21c2 and a photon with momentum k. From the momentum and

energy conservation laws we can write

p2 = p1 + k (6.74)

E 2 = E 1 + c|k| (6.75) m22c4 + p22c

2 =

m21c4 + p21c

2 + c|k| (6.76)

=

m21c4 + (p2 − k)2c2 + c|k| (6.77)

Taking square of both sides of the equality (6.77) we obtain

m22c4 + p22c2 = m2

1c4 + (p2 − k)2c2 + 2c|k|

m21c4 + (p2 − k)2c2 + c2k2

k m2

1c2 + (p

2 −k)2 =

1

2µ2c2 + p

2k cos φ

−k2

21The transition energy E is actually not well-defined, because the excited state 2 isnot a stationary state. Therefore our discussion in this subsection is approximately validonly for long-living states 2, for which the uncertainty of energy can be neglected. Seealso section 7.5 .

Page 234: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 234/687

204 CHAPTER 6. INTERACTION

PPxxcc

EE

00

mm22

mm11(p

22,E

22))

(p11,E

11))

BB

AAk k

Figure 6.2: Energy level diagram for a bound system with the ground stateof mass m1 and the excited state of mass m2. If the system is at rest, itsexcited state is represented by point A. Note that the energy of emittedphotons (arrows) is less than (m2 − m1)c2. A moving excited state withmomentum p2 is represented by point B. The energies and momenta k of emitted photons depend on the angle between k and p2.

Page 235: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 235/687

6.4. DYNAMICS AND BOUND STATES 205

where µ2 ≡ m22 − m2

1 and φ is the angle between vectors p2 and k.22 From

this we obtain

m21c2k2 + p22k

2 − 2 p2k3 cos φ + k4

=1

4µ4c4 + p22k

2 cos2 φ + k4 + µ2c2 p2k cos φ − µ2c2k2 − 2 p2k3 cos φ

and a quadratic equation

k2(m22c2 + p22 − p22 cos2 φ) − kµ2c2 p2 cos φ − 1

4µ4c4 = 0

with the solution23

k =1

2m22c2 + 2 p22 sin2 φ

µ2c2 p2 cos φ +

µ4c4(m2

2c2 + p22)

Introducing the rapidity θ of the initial state, we obtain p2 = m2c sinh θ and

k =µ2m2c3 sinh θ cos φ + µ2m2c

3 cosh θ

2m22c2 + 2m2

2c2 sinh2 θ sin2 φ

=µ2c

2m2

sinh θ cos φ + cosh θ

cosh2 θ − sinh2 θ cos2 φ

= µ2c2m2

1cosh θ(1 − v

c cos φ)

This formula gives the energy of the photon emitted by a system movingwith the speed v = c tanh θ

E (θ, φ) = ck

=µ2c2

2m2 cosh θ(1 − vc cos φ)

=E (0)

cosh θ(1 −vc cos φ)

22 Note also that the vector k is directed from the light emitting system to the observer,so the angle φ can be interpreted as the angle between the velocity of the source and theline of sight, which is equivalent to the definition of φ in subsection 5.4.2.

23Only positive sign of the square root leads to the physical solution with positive k

Page 236: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 236/687

206 CHAPTER 6. INTERACTION

where

E (0) =µ2c2

2m2

is the energy of the photon emitted by a source at rest. This is in agreementwith our earlier result (5.55).

6.4.3 Time evolution

In addition to the stationary energy spectra discussed above, we are ofteninterested in the time evolution of a compound system. This includes reac-tions, scattering, decays, etc. In quantum theory, the time evolution fromtime t′ to t is described by the time evolution operator

U (t ← t′) = eiH (t−t′) (6.78)

This operator has the following useful properties

U (t ← t′) = exp(i

H (t − t1)) exp(

i

H (t1 − t′))

= U (t ← t1)U (t1 ← t′) (6.79)

for any t1, and

U (t ← t′) = U −1(t′ ← t) (6.80)

In the Schrodinger picture, the time evolution of a state vector is givenby (3.60)

|Ψ(t) = U (t ← t′)|Ψ(t′) (6.81)

|Ψ(t) is also a solution of the non-stationary Schrodinger equation

−i d

dt|Ψ(t) = −i

d

dteiH (t−t′)|Ψ(t)

= HeiH (t−t′)|Ψ(t)

= H |Ψ(t)

Page 237: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 237/687

6.5. SCATTERING 207

In the Heisenberg picture, the time dependence of operators is (see eq. (3.62))

F (t) = U (t ← t′)F (t′)U (t′ ← t) (6.82)

In spite of simple appearance of formulas (6.81) and (6.82), the evaluationof the exponents of the Hamilton operator is an extremely difficult task. Inrare cases when the eigenvalues E n and eigenvectors |Ψn of the Hamiltonianare known

H |Ψn = E n|Ψn

the initial state can be represented as a sum (and/or integral) of basis eigen-vectors

|Ψ(0) =n

C n|Ψn

and the time evolution can be calculated as

|Ψ(t) = eiHt |Ψ(0)

= eiHtn C n

n

=n

C neiE nt|Ψn (6.83)

This approach to the time evolution will be used in our analysis of particledecays in section 7.5.

6.5 Scattering

Unfortunately, formula (6.83) has very limited value. First, the time evolu-

tion of microsystems is very difficult to study experimentally. Second, thefull spectrum of eigenvalues and eigenvectors of the interacting HamiltonianH can be found only for very simple models. However, nature gives us alucky break: there is a very important class of experiments for which the de-scription of dynamics by eq. (6.81) or (6.82) is not needed: this description

Page 238: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 238/687

208 CHAPTER 6. INTERACTION

is just too detailed. These are scattering experiments. They are performed

by preparing free particles (or their bound states, like hydrogen atoms ordeuterons), bringing them into collision and studying the properties of freeparticles or bound states leaving the region of collision. In these experiments,often it is not possible to observe the time evolution during interaction: par-ticle reactions occur almost instantaneously and we can only register thereactants and products which move freely before and after the collision. Insuch situations the theory is not required to describe the actual evolutionof particles during the short interval of collision. It is sufficient to providea mapping of free states before the interaction onto the free states after theinteraction. This mapping is given by the S -operator which we are going todiscuss in this section.

6.5.1 The scattering operator

Let us consider a scattering experiment in which free states of reactants areprepared at time t = −∞.24 The collision occurs during a short time interval[η′, η] around time zero (η′ < 0 < η). The free states of the products areregistered at time t = ∞, so that inequalities −∞ ≪ η′ < 0 < η ≪ ∞ hold.Here we assume that particles do not form bound states neither before norafter the collision. Therefore, at asymptotic times the exact evolution is wellapproximated by non-interacting operators U 0(η′ ← −∞) and U 0(∞ ← η),respectively, where U 0(t

←t′) = exp(iH 0(t

−t′)). Then we can write for the

time evolution from the infinite past to the infinite future25

U (∞ ← − ∞)

≈ U 0(∞ ← η)U (η ← η′)U 0(η′ ← −∞)

= U 0(∞ ← η)U 0(η ← 0)[U 0(0 ← η)U (η ← η′)U 0(η′ ← 0)]U 0(0 ← η′)U 0(η′ ← −∞)

= U 0(∞ ← η)U 0(η ← 0)S η,η′U 0(0 ← η′)U 0(η′ ← −∞)

= U 0(∞ ← 0)S η,η′U 0(0 ← −∞) (6.84)

where

S η,η′ ≡ U 0(0 ← η)U (η ← η′)U 0(η′ ← 0) (6.85)

24Infinite time here means time much greater than the time of collision.25Here we use properties (6.79) and (6.80).

Page 239: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 239/687

6.5. SCATTERING 209

Eq. (6.84) means that a simplified description of the time evolution in scat-

tering events is possible in which the evolution is free at all times except sud-den change at t = 0 described by the unitary operator S η,η′ : Approximation(6.84) becomes more accurate if we increase the time interval [η′, η] duringwhich the exact time evolution is taken into account, i.e., η′ → −∞, η → ∞.Therefore, the exact formula for the time evolution from −∞ to ∞ can bewritten as

U (∞ ← − ∞) = U 0(∞ ← 0)SU 0(0 ← −∞) (6.86)

where the S -operator (or scattering operator ) is defined by formula26

S = limη′→−∞,η→∞

S η,η′

= limη′→−∞,η→∞

U 0(0 ← η)U (η ← η′)U 0(η′ ← 0)

= limη′→−∞,η→∞

e− iH 0ηe

iH (η−η′)e

iH 0η

(6.87)

= limη→∞

S (η)

where

S (η) = limη′→−∞

e− iH 0ηe

iH (η−η′)e

iH 0η

(6.88)

An important property of the S -operator is its “Poincare invariance”, i.e.,zero commutators with generators of the non-interacting representation of the Poincare group [9, 71]

[S, H 0] = [S, P0] = [S, J0] = [S, K0] (6.89)

This, in particular means that in (6.86) one can change places of U 0 and S ,

so that the interacting time evolution operator can be written as the freetime evolution operator times the S -operator

26It can be shown that for properly prepared initial states and cluster separable inter-actions these limits exist. See also subsection 7.2.6 for discussions of the validity domainof the S -operator.

Page 240: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 240/687

210 CHAPTER 6. INTERACTION

U U 0 0

U U

U U

U U 0 0

00SS

time, t

State

AA

BB

CC DD

Figure 6.3: A schematic representation of the scattering process.

U (∞ ← − ∞) = SU 0(∞ ← −∞) (6.90)

= U 0(∞ ← −∞)S (6.91)

A better understanding of how scattering theory describes time evolutioncan be obtained from fig. 6.3. In this figure we plot the state of the scatteringsystem (represented abstractly as a point on the vertical axis) as a functionof time (the horizontal axis). The exact evolution of the state is governedby the full time evolution operator U and is shown by the thick line A →D. In asymptotic regions (when t is large negative or large positive) theinteraction between parts of the scattering system is weak, and the exacttime evolution can be well approximated by the free time evolution (describedby the operator U 0 and shown in the figure by two thin lines with arrows,one for large positive times CD and one for large negative times AB). Thisapproximation is better for time points far from the interaction region t =

0: The thick line asymptotically approaches thin lines in the remote past(around A) and in the remote future (around D). The past and future freeevolutions can be extrapolated to time t = 0, and there is a gap B−C betweenthese extrapolated states. The S -operator (shown by the dashed line) isdesigned to bridge this gap. It provides a mapping between extrapolated

Page 241: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 241/687

6.5. SCATTERING 211

free states. Thus, the exact time evolution A → D can be approximated by

three steps. In this approximation, the system first evolves freely until timet = 0, i.e., from A to B. Then there is a sudden jump B → C representedby the S -operator. Finally, the time evolution is free again C → D. As seenfrom the figure, this description of the scattering process is perfectly exact,as long as we are interested only in the mapping from asymptotic states inthe remote past A to asymptotic states in the remote future D. However,it is also clear that scattering theory does not provide a good descriptionof the time evolution in the interacting region around t = 0, because inthe scattering operator S the information about particle interactions entersintegrated over the infinite time interval t ∈ [−∞, ∞]. In order to describethe time evolution in the interaction region (t

≈0) the S -matrix approach

is not suitable. The full interacting time evolution operator U is needed forthis purpose.

In applications we are mostly interested in matrix elements of the S -operator,

S i→f = f |S |i (6.92)

where |i is a state of non-interacting initial particles and |f is a state of non-interacting final particles. Such matrix elements are called the S -matrix . Theformulas relating the S -matrix to observable quantities, such as scattering

cross-sections, can be found in any textbook on scattering theory.

6.5.2 S-operator in perturbation theory

There are various techniques available for calculations of the S -operator.Currently, the perturbation theory is the most powerful and effective one.To derive the perturbation expansion for the S -operator, first note that theoperator S (t) in (6.88) satisfies equation

d

dtS (t) =

d

dtlim

t′→−∞e− i

H 0te

iH (t−t′)e

iH 0t′

= limt′→−∞

(e− iH 0t(− i

H 0)e

iH (t−t′)e

iH 0t′ + e− i

H 0t(

i

H )e

iH (t−t′)e

iH 0t′)

= limt′→−∞

i

e− i

H 0t(H − H 0)e

iH (t−t′)e

iH 0t′

Page 242: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 242/687

212 CHAPTER 6. INTERACTION

= limt′

→−∞

i

e− i

H 0tV e

iH (t−t′)e

iH 0t′

= limt′→−∞

i

e− i

H 0tV e

iH 0te− i

H 0te

iH (t−t′)e

iH 0t′

= limt′→−∞

i

V (t)e− i

H 0te

iH (t−t′)e

iH 0t′

=i

V (t)S (t) (6.93)

where we denoted27

V (t) = e−iH 0t

V e

iH 0t

(6.94)

One can directly check that a solution of eq. (6.93) with the naturalinitial condition S (−∞) = 1 is given by the “old-fashioned” perturbationexpansion

S (t) = 1 +i

t−∞

V (t′) dt′ − 1

2

t−∞

V (t′) dt′ t′

−∞V (t′′) dt′′ + . . . ,

Therefore

S = 1 +i

+∞

−∞V (t) dt − 1

2

+∞

−∞V (t) dt

t−∞

V (t′) dt′ + . . . (6.95)

We will avoid discussion of the non-trivial convergence properties of theseries on the right hand side of eq. (6.95). Throughout this book we willtacitly assume that all infinite perturbation series do converge.

In this book we will often use the following convenient symbols for t-integrals

27Note that the t-dependence of V (t) does not mean that we are considering time-

dependent interactions. The argument t has very little to do with actual time dependence ,because real time dependence must be generated by the full interacting Hamiltonian H and not by the free Hamiltonian H 0 as in eq. (6.94). Moreover, the t-dependence in(6.94) is reverse with respect to the normal time dependence of operators in eq. (3.62).The t-dependence in eq. (6.94) is just added for the convenience of calculation of theS -operator.

Page 243: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 243/687

6.5. SCATTERING 213

Y (t) ≡ i

t−∞

Y (t′)dt′ (6.96)

Y (t) ≡ i

+∞

−∞Y (t′)dt′ (6.97)

In this notation the perturbation expansion of the S -operator (6.95) can bewritten compactly as

S = 1 + Σ(t)

(6.98)

where

Σ(t)

= V (t) + V (t)V (t) + V (t)V (t)V (t) + V (t)V (t)V (t)V (t) + . . .(6.99)

Formula (6.95) is not the only way to write the perturbation expansion forthe S -operator, and, perhaps, not the most convenient one. In most books onquantum field theory the covariant Feynman–Dyson perturbation expansion[9] is used which involves a time ordering of operators in the integrands.However, for our purposes (especially in chapter 9) we found more useful yetanother equivalent perturbative expression suggested by Magnus [72, 73, 74]

S = exp(F (t) ) (6.100)

where Hermitian operator F (t) will be referred to as the scattering phase operator. It is represented as a series of multiple commutators with timeintegrals

F (t) = V (t) − 1

2[V (t), V (t)] +

1

6[V (t), [V (t), V (t)]]

+1

6[[V (t), V (t)], V (t)]

−1

12[V (t), [[V (t), V (t)], V (t)]]

− 1

12[[V (t), [V (t), V (t)]], V (t)]

− 1

12[[V (t), V (t)], [V (t), V (t)]] + . . . (6.101)

Page 244: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 244/687

214 CHAPTER 6. INTERACTION

One important advantage of this representation is that expression (6.100) for

the S -operator is manifestly unitary at each perturbation order.It follows from equation

S (t) = 1 + Σ(t) = exp(F (t))

that operators Σ and F are related to each other

Σ(t) =d

dtexp(F (t)) (6.102)

F (t) =d

dt

log(1 + Σ) (6.103)

so finding F or Σ are equivalent tasks.

6.5.3 Scattering equivalence of Hamiltonians

The S -operator and the Hamiltonian provide two different ways to describedynamics. The Hamiltonian completely describes the time evolution for alltime intervals, large or small. On the other hand, the S -operator representstime evolution in the “integral” form, i.e., knowing the state of the systemin the infinite past |Ψ(−∞), the free Hamiltonian H 0, and the scatteringoperator S , we can find the state in the infinite future.28

|Ψ(∞) = U (∞ ← − ∞)|Ψ(−∞)= U 0(∞ ← − ∞)S |Ψ(−∞)

Calculations of the S -operator are much easier than those of the full timeevolution, and yet they fully satisfy the needs of current experiments in highenergy physics. This situation created an impression that a comprehensivetheory can be constructed which uses the S -operator as the fundamentalquantity rather than the Hamiltonian and wave functions. However, the S -operator description is not complete, and such a theory would be applicableonly to a limited class of experiments.

In particular, the knowledge of the S -operator is sufficient to calcu-late scattering cross-sections as well as energies and lifetimes of stable and

28see eq. (6.91)

Page 245: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 245/687

6.5. SCATTERING 215

metastable bound states.29 However, in order to describe the time evolution

and wavefunctions of bound states, the knowledge of the S -operator is notenough: the full interacting Hamiltonian H is needed.Knowing the full interacting Hamiltonian H , we can calculate the S -

operator by formula (6.95). However, the inverse is not true: the same S -operator can be obtained from many different Hamiltonians. Suppose thattwo Hamiltonians H and H ′ are related to each other by a unitary transfor-mation eiΦ

H ′ = eiΦHe−iΦ

Then they yield the same scattering (and Hamiltonians H and H ′ arecalled scattering equivalent ) as long as condition

limt→±∞

e− iH 0tΦe

iH 0t = 0 (6.104)

is satisfied.30 Indeed, in the limit t → +∞, t′ → −∞ we obtain [75]

S ′ = limη′→−∞,η→∞

e− iH 0ηe

iH ′(η−η′)e

iH 0η′

= limη′→−∞,η→∞

e− iH 0η(eiΦe

iH (η−η′)e−iΦ)e

iH 0η

= limη′→−∞,η→∞

(e− iH 0ηeiΦe

iH 0η)e− i

H 0ηe

iH (η−η′)e

iH 0η′(e− i

H 0η′e−iΦe

iH 0η′)

= limη′→−∞,η→∞

e− iH 0ηe

iH (η−η′)e

iH 0η′

= S (6.105)

Note that due to Lemma F.11, the energy spectra of two scattering equiv-alent Hamiltonians H and H ′ are identical. However, their eigenvectors aredifferent and corresponding descriptions of dynamics (e.g., via eq. (6.83)) aredifferent too. Therefore scattering-equivalent theories may be not physically

equivalent.29The two latter quantities are represented by positions of poles of the S -operator on

the complex energy plane.30A rather general class of operators Φ that satisfy this condition will be found in

Theorem 9.2 from subsection 9.2.3.

Page 246: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 246/687

216 CHAPTER 6. INTERACTION

6.5.4 Scattering equivalence of the point and instant

formsThe S -matrix equivalence of Hamiltonians established in the preceding sub-section remains valid even if the transformation eiΦ changes the relativisticform of dynamics [6, 7]. Here we would like to demonstrate this equivalenceon an example of Dirac’s point and instant forms of dynamics [6]. We willuse definitions and notation from subsections 6.3.2 and 6.3.3.

Here we will prove that the unitary operator

Θ = ζ 0ζ −1

where

ζ 0 = exp(−i log(M 0c2)Z 0)

ζ = exp(−i log(Mc2)Z 0)

Z 0 =1

2 (Q0 · X0 + X0 · Q0)

transforms from a point form dynamics31 to the Bakamjian-Thomas instantform dynamics,32 without altering the S -operator. In other words, we willassume that a point form representation of the Poincare group is given, whichis built on operators33

M = M 0

P = Q0Mc2

J = J0

R =X0

Mc2

and we will demonstrate that the set of operators ΘM Θ−1, ΘPΘ−1, ΘJΘ−1,

and ΘRΘ−1

generates a representation of the Poincare group in the instantform. Moreover, the S -operators computed with the point-form Hamiltonian

31from subsection 6.3.332 from subsection 6.3.233in the sense described in subsection 4.3.4

Page 247: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 247/687

6.5. SCATTERING 217

H =√

M 2c4 + P2c2 and the instant form Hamiltonian H ′ = ΘH Θ−1 are the

same.Let us denote

Q0(b) = eibZ 0Q0e−ibZ 0 , b ∈ R

From the commutator

[Z 0, Q0] = iQ0

it follows that

d

dbQ0(b) = i[Z 0, Q0]

= −Q0

and

Q0(b) = e−bQ0

This formula remains valid even if b is a Hermitian operator commuting withboth Q0 and X0. For example, if b = log(M 0c

2), then

ei log(M 0c2)Z 0Q0e−i log(M 0c2)Z 0 = e− log(M 0c2)Q0

= M −10 c−2Q0

Similarly, one can prove

ei log(Mc2)Z 0Q0

e−i log(Mc2)Z 0 = M −1c−2Q0

ei log(M 0c2)Z 0X0e−i log(M 0c2)Z 0 = M 0c2X0

ei log(Mc2)Z 0X0e−i log(Mc2)Z 0 = Mc2X0

which imply

Page 248: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 248/687

218 CHAPTER 6. INTERACTION

ΘPΘ−1 = e−i log(M 0c2)Z 0ei log(Mc2)Z 0Q0Mc2e−i log(Mc2)Z 0ei log(M 0c2)Z 0

= e−i log(M 0c2)Z 0Q0ei log(M 0c2)Z 0

= Q0M 0c2

= P0

ΘJ0Θ−1 = J0

ΘRΘ−1 = e−i log(M 0c2)Z 0ei log(Mc2)Z 0X0M −1c−2e−i log(Mc2)Z 0ei log(M 0c

2)Z 0

= e−i log(M 0c2)Z 0X0ei log(M 0c2)Z 0

= X0M −10 c−2 = R0

From these formulas it is clear that the transformed dynamics correspondsto the Bakamjian-Thomas instant form.Let us now demonstrate that the operator S computed with the point

form Hamiltonian H (6.38) is the same as the operator S ′ computed withthe instant form Hamiltonian H ′ = ΘH Θ−1. Note that we can write eq.(6.87) as

S = Ω+(H, H 0)Ω−(H, H 0)

where operators

Ω±(H, H 0) = limη→±∞

e− iH 0ηe

iHη

are called Møller wave operators . Now we can use the Birman-Kato invarianceprinciple [76] which states that Ω±(H, H 0) = Ω±(f (H ), f (H 0)) where f isany smooth function with positive derivative. Using the following connectionbetween the point form mass operator M and the instant form mass operatorM ′

M = ζ −1Mζ

= ζ −1Θ−1M ′Θζ = ζ −1ζζ −10 M ′ζ 0ζ −1ζ

= ζ −10 M ′ζ 0

Page 249: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 249/687

6.5. SCATTERING 219

we obtain

Ω±(H, H 0) ≡ Ω±(M

1 + Q20, M 0

1 + Q2

0)

= Ω±(M, M 0)

= Ω±(ζ −10 M ′ζ 0, M 0)

= ζ −10 Ω±(M ′, M 0)ζ 0

= ζ −10 Ω±(

(M ′)2c4 + P0c2,

M 20 c4 + P0c2)ζ 0

= ζ −10 Ω±(H ′, H 0)ζ 0

Therefore

S ′ = Ω+(H ′, H 0)Ω−(H ′, H 0)

= ζ 0Ω+(H, H 0)ζ −10 ζ 0Ω

−(H, H 0)ζ −10= ζ 0Ω

+(H, H 0)Ω−(H, H 0)ζ −10= ζ 0Sζ −10

but S commutes with free generators (6.89) and thus commutes with ζ 0,which implies that S ′ = S , and transformation Θ conserves the S -matrix.

In addition to the scattering equivalence of the instant and point formsproved above, Sokolov and Shatnii [6, 7] established the equivalence of threemajor forms of dynamics - the instant, point, and front forms. Then, it seemsreasonable to assume that the same S -operator can be obtained in any formof dynamics.

The scattering equivalence of the S -operator is of great help in practicalcalculations. If we are interested only in scattering properties and in ener-gies of bound states,34 then we can choose the most convenient scattering-equivalent Hamiltonian and the most convenient form of dynamics. However,as we mentioned already, the scattering equivalence of Hamiltonians andforms of dynamics does not mean their complete physical equivalence. Wewill see in subsection 10.2.6 that the instant form of dynamics should be pre-ferred in those cases when desired physical properties cannot be described

34In high energy physics virtually all experimental information about interactions of fundamental particles comes from these data.

Page 250: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 250/687

220 CHAPTER 6. INTERACTION

by the S -operator. The remaining freedom to choose among instant-form

scattering-equivalent Hamiltonians will be used in chapter 9 to solve theproblem of ultraviolet infinities in QED.

Page 251: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 251/687

Chapter 7

THE FOCK SPACE

This subject has been thoroughly worked out and is now under-stood. A thesis on this topic, even a correct one, will not get you a job.

R.F. Streater

The theory of interacting particles was formulated in chapter 6 in the Hilbertspace with a fixed particle content. This theory was incomplete, because itcould not describe many physical processes which can change particle typesand numbers. Familiar examples of such processes include the emission andabsorption of light (photons) in electrodynamics, decays, neutrino oscilla-tions, etc. The persistence of particle creation and destruction processes athigh energies follows from the famous Einstein’s formula E = mc2. This for-mula, in particular, implies that if a system of particles has sufficient energyE of their relative motion, then this energy can be converted to the mass mof newly created particles. Inversely, the mass can be converted to the kinetic

energy, e.g., in decays. Generally, there is no limit on how many particlescan be created in collisions, so the Hilbert space of any realistic quantummechanical system should include states with arbitrary number of particles(from zero to infinity) of all types. Such a Hilbert space is called the Fock space .

221

Page 252: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 252/687

222 CHAPTER 7. THE FOCK SPACE

For simplicity, in this chapter, and in the rest of this book we will consider

a world in which there are only five particle species: electrons e−, positronse+, protons p+, antiprotons p−, and photons γ .1 We will limit our discussionto electromagnetic interactions between these particles, so we are going toneglect the internal structure of the (anti)proton and completely disregardstrong, weak and gravitational interactions. In other words, we will builda description of these particles and their interactions in terms of quantum electrodynamics (QED).

7.1 Annihilation and creation operators

In this section we are going to build the Fock space H of QED and introducecreation and annihilation operators which provide a very convenient notationfor working with arbitrary operators in H.

7.1.1 Sectors with fixed numbers of particles

The number of particles of a given type is readily measured in experiment, sowe can introduce 5 new observables in our theory: the numbers of electrons(N el), positrons (N po), protons (N pr), antiprotons (N an), and photons (N ph).According to general rules of quantum mechanics, these observables must

be represented by five Hermitian operators in the Hilbert (Fock) space H.Apparently, the allowed values (the spectrum) for the number of particles of each type are non-negative integers (0,1,2,...). We assume that these observ-ables can be measured simultaneously, therefore the corresponding operatorscommute with each other, and have common spectrum. So, the Fock spaceH separates into a direct sum of corresponding orthogonal eigensubspaces orsectors H(i ,j,k,l ,m) with i electrons, j positrons, k protons, l antiprotons,and m photons

H=

⊕∞ijklm=0

H(i ,j,k,l ,m) (7.1)

where

1We will consider other types of particles in our discussions of decays in sections 7.5and 10.5.

Page 253: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 253/687

7.1. ANNIHILATION AND CREATION OPERATORS 223

N elH(i ,j,k,l ,m) = iH(i ,j,k,l ,m)

N poH(i ,j,k,l ,m) = jH(i ,j,k,l ,m)

N prH(i ,j,k,l ,m) = kH(i ,j,k,l ,m)

N anH(i ,j,k,l ,m) = lH(i ,j,k,l ,m)

N phH(i ,j,k,l ,m) = mH(i ,j,k,l ,m)

The one-dimensional subspace with no particles H(0, 0, 0, 0, 0) is calledthe vacuum subspace . The vacuum vector |0 is then defined as a vector inthis subspace, up to an insignificant phase factor. The one-particle sectorsare built using prescriptions from chapter 5. The subspaces

H(1, 0, 0, 0, 0)

and H(0, 1, 0, 0, 0) correspond to one electron and one positron, respectively.They are subspaces of unitary irreducible representations of the Poincaregroup characterized by the mass m = 0.511 MeV/c2 and spin 1/2 (see Table5.1). The subspaces H(0, 0, 1, 0, 0) and H(0, 0, 0, 1, 0) correspond to one pro-ton and one antiproton, respectively. They have mass M = 938.3 MeV/c2

and spin 1/2. The subspace H(0, 0, 0, 0, 1) correspond to one photon. It ischaracterized by zero mass, and it is a direct sum of two irreducible subspaceswith helicities 1 and -1.

Sectors with two or more particles are constructed as (anti)symmetrizedtensor products of one-particle sectors.2 For example, if we denote Hel the

one-electron Hilbert space and H ph the one-photon Hilbert space, then sectorshaving only electrons and photons can be written as

H(0, 0, 0, 0, 0) = |0 (7.2)

H(1, 0, 0, 0, 0) = Hel (7.3)

H(0, 0, 0, 0, 1) = H ph (7.4)

H(1, 0, 0, 0, 1) = Hel ⊗ H ph (7.5)

H(2, 0, 0, 0, 0) = Hel ⊗asym Hel (7.6)

H(0, 0, 0, 0, 2) = H ph ⊗sym H ph (7.7)

H(1, 0, 0, 0, 2) = Hel ⊗ (H ph ⊗sym H ph) (7.8)H(2, 0, 0, 0, 1) = H ph ⊗ (Hel ⊗asym Hel) (7.9)

H(2, 0, 0, 0, 2) = (H ph ⊗sym H ph) ⊗ (Hel ⊗asym Hel) (7.10)

2see discussion in section 6.1

Page 254: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 254/687

224 CHAPTER 7. THE FOCK SPACE

. . .

In each sector of the Fock space we can define observables of individualparticles, e.g., position momentum, spin, etc., as described in subsection6.1.2. Then, in each sector we can select a basis of common eigenvectorsof a full set of mutually commuting one-particle observables. For futurediscussions it will be convenient to use the basis in which momenta andz -components of the spin σ of massive particles (or helicity τ of masslessparticles) are diagonal. For example, the basis vectors in the two-electronsector Hel ⊗asym Hel are denoted by

|p1σ1; p2σ2

7.1.2 Non-interacting representation of the Poincare

group

The above construction provides us with the Hilbert (Fock) space H wheremultiparticle states and observables of our theory reside and where a conve-nient orthonormal basis set is defined. To complete the formalism we need tobuild a realistic interacting representation of the Poincare group in H. Letus first fulfill an easier task and construct the non-interacting representationU 0g of the Poincare group in the Fock space

H.

From subsection 6.2.1, we already know how to build the non-interactingrepresentation of the Poincare group in each individual sector of H. Thiscan be done by making tensor products (with proper (anti)symmetrization)of single particle irreducible representations U elg , U phg , etc. Then the non-interacting representation of the Poincare group in the entire Fock space isconstructed as a direct sum of sector representations. In agreement with thesector decomposition of the Fock space (7.2) - (7.10) we can write

U 0g = 1 ⊕ U elg ⊕ U phg ⊕ (U elg ⊗ U phg ) ⊕ (U elg ⊗asym U elg ) . . . (7.11)

Generators of this representation will be denoted as (H 0, P0, J0, K0). Ineach sector these generators are simply sums of one-particle generators (seeeqs. (6.10) - (6.13)). As usual, operators H 0, P0, and J0 describe the total energy, linear momentum, and angular momentum of the non-interactingsystem, respectively.

Page 255: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 255/687

7.1. ANNIHILATION AND CREATION OPERATORS 225

Here we immediately face a serious problem. Consider, for example, the

free Hamiltonian represented as a direct sum of components in each sector

H 0 = H 0(0, 0, 0, 0, 0) ⊕ H 0(1, 0, 0, 0, 0) ⊕ H 0(0, 0, 0, 0, 1) ⊕ H 0(1, 0, 0, 0, 1) ⊕ . . .

It is tempting to use notation from section 6.2 and express the Hamiltonianin each sector using corresponding observables of individual particles p1, p2,etc. For example, in the sector H(1, 0, 0, 0, 0), the free Hamiltonian is

H 0(1, 0, 0, 0, 0) =

m2c4 + p2c2 (7.12)

while in the sector H(2, 0, 0, 0, 2) the Hamiltonian is3

H 0(2, 0, 0, 0, 2) = |p1|c + |p2|c +

m2c4 + p23c2 +

m2c4 + p24c

2 (7.13)

Clearly, this notation is very cumbersome because it does not provide aunique expression for the operator H 0 in the entire Fock space. Moreover,it is not clear at all how one can use this notation to express operatorschanging the number of particles, i.e., moving state vectors across sectorboundaries. We need to find a better and simpler way to write operators inthe Fock space. This task is accomplished by introduction of annihilationand creation operators in the rest of this section.

7.1.3 Creation and annihilation operators. Fermions

First, it is instructive to consider the case of the discrete spectrum of momen-tum. This can be achieved by using the standard trick of putting the systemin a box or applying periodic boundary conditions. Then eigenvalues of themomentum operator form a discrete 3D lattice pi, and the usual continuousmomentum spectrum can be obtained as a limit when the size of the boxtends to infinity.

Let us examine the case of electrons. We define the (linear) creation operator a†p,σ for the electron with momentum p and spin projection σ by its

action on basis vectors with n electrons

3Two photons are denoted by indices 1 and 2, and two electrons are denoted by indices3 and 4

Page 256: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 256/687

226 CHAPTER 7. THE FOCK SPACE

|p1, σ1; p2, σ2; . . . ; pn, σn (7.14)

We need to distinguish two cases. The first case is when the one-particlestate (p, σ) is among the states listed in (7.14), for example (p, σ) = (pi, σi).Since electrons are fermions, and two fermions cannot occupy the same statedue to the Pauli exclusion principle, this action leads to a zero result, i.e.

a†pi,σi

|p1, σ1; p2, σ2; . . . ; pn, σn = 0 (7.15)

The second case is when the created one-particle state (p, σ) is not among

the states listed in (7.14). Then the creation operator a†p,σ just adds oneelectron in the state (p, σ) to the beginning of the list of particles

a†p,σ|p1, σ1; p2, σ2; . . . ; pn, σn = |p, σ; p1, σ1; p2, σ2; . . . ; pn, σn (7.16)

Operator a†p,σ has transformed the state with n electrons to the state with

n + 1 electrons. Applying multiple creation operators to the vacuum state|0 we can construct all basis vectors in the Fock space. For example,

a†p1,σ1a†p2,σ2 |0 = |p1, σ1; p2, σ2is a basis vector in the 2-electron sector.

We define the electron annihilation operator ap,σ as operator adjoint tothe creation operator a†

p,σ. It can be proven [9] that the action of ap,σ onthe n-electron state (7.14) is the following. If the electron state with pa-rameters (p, σ) was already occupied, e.g. (p, σ) = (pi, σi) then this state is“annihilated” and the number of particles in the system is reduced by one

ap,σ|p1, σ1; . . . ; pi−1, σi−1; pi, σi; pi+1, σi+1; . . . ; pn, σn= (−1)P |p1, σ1; . . . ; pi−1, σi−1; pi+1, σi+1; . . . ; pn, σn (7.17)

where P is the number of permutations of particles required to bring theone-particle i to the first place in the list. If the state (p, σ) is not presentin the list, i.e., (p, σ) = (pi, σi) for each i, then

Page 257: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 257/687

7.1. ANNIHILATION AND CREATION OPERATORS 227

ap,σ|p1, σ1; p2, σ2; . . . ; pn, σn = 0 (7.18)

Annihilation operators always yield zero when acting on the vacuum state

ap,σ|0 = 0

The above formulas fully define the action of creation and annihilationoperators on basis vectors in purely electronic sectors. These rules are easilygeneralized to all states: they do not change if other particles are present, and

they can be extended to linear combinations of the basis vectors by linearity.Creation and annihilation operators for other fermions - positrons, protons,and antiprotons - are constructed similarly.

For brevity we will refer to creation and annihilation operators collec-tively as to particle operators . This will distinguish them from operatorsof momentum, position, energy, etc. of individual particles which will becalled particle observables . Let us emphasize that creation and annihilationoperators are not intended to directly describe any real physical process of changing the number of particles in the system, and they do not correspondto physical observables. They are just formal mathematical objects that sim-plify the notation for other operators having more direct physical meaning.

We will see how operators of observables are built from particle operatorslater in this book, e.g., in subsections 7.1.8 and 9.3.1.

7.1.4 Anticommutators of particle operators

Although definitions (7.15) - (7.18) clearly express the meaning of particleoperators, they are not very convenient in practical calculations. The anti-commutation relations between particle operators are much more useful. Letus consider the following anticommutator of fermion operators

ap′,σ′, a†p,σ ≡ a†

p,σap′,σ′ + ap′,σ′a†p,σ

in which (p, σ) = (p′, σ′). Acting on a state |p′′, σ′′ which is different fromboth |p, σ and |p′, σ′, we obtain

Page 258: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 258/687

228 CHAPTER 7. THE FOCK SPACE

(a†p,σap′,σ′ + ap′,σ′a†

p,σ)|p′′, σ′′ = ap′,σ′ |p, σ; p′′, σ′′= 0

Similarly, we obtain

(a†p,σap′,σ′ + ap′,σ′a

†p,σ)|p, σ = 0

and

(a†p,σap′,σ′ + ap′,σ′a†

p,σ)|p′, σ′ = a†p,σ|0 + ap′,σ′ |p, σ; p′, σ′

= |p, σ − |p, σ= 0

One can easily demonstrate that the result is still zero when acting on zero-,two-, three-, etc. particle states as well as on their linear combinations. So,we conclude that

ap′,σ′ , a†p,σ = 0, if (p, σ) = (p′, σ′)

Similarly, in the case (p, σ) = (p′, σ′) we obtain

a†p,σ, ap,σ = 1

Therefore for all values of p, p′, σ, and σ′ we have

a†p,σ, ap′,σ′ = δ p,p′δ σ,σ′ (7.19)

Using similar arguments one can show that

a†p,σ, a†

p′,σ′ = 0 (7.20)

ap,σ, ap′,σ′ = 0 (7.21)

Page 259: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 259/687

7.1. ANNIHILATION AND CREATION OPERATORS 229

7.1.5 Creation and annihilation operators. Photons

For photons, which are bosons, the properties of creation and annihilationoperators are slightly different from those described above. Two or morebosons may coexist in the same state. Therefore, we define the action of thephoton creation operator c†

p,τ on a many-photon state as

c†p,τ |p1, τ 1; p2, τ 2; . . . ; pn, τ n = |p, τ ; p1, τ 1; p2, τ 2; . . . ; pn, τ n (7.22)

independent of whether or not the state (p, τ ) already existed. The actionof the adjoint photon annihilation operator cp,τ is

cp,τ |p1, τ 1; p2, τ 2; . . . ; pn, τ n = 0 (7.23)

if the annihilated state (p, τ ) was not present, and

cpi,τ i|p1, τ 1; . . . ; pi−1, τ i−1; pi, τ i; pi+1, τ i+1; . . . ; pn, τ n= |p1, τ 1; . . . ; pi−1, τ i−1; pi+1, τ i+1; . . . ; pn, τ n (7.24)

otherwise.The above formulas are easily extended to states where, in addition to

photons, other particles are also present. The action of creation and an-

nihilation operators on linear combinations of basis vectors is obtained bylinearity.

Similar to subsection 7.1.4, we obtain the following commutation relationsfor photon creation and annihilation operators

[cp,τ , c†p′,τ ′] = δ p,p′δ τ,τ ′ (7.25)

[cp,τ , cp′,τ ′] = 0 (7.26)

[c†p,τ , c†

p′,τ ′] = 0 (7.27)

7.1.6 Particle number operators

With the help of particle creation and annihilation operators we can nowbuild explicit expressions for various operators in the Fock space. Consider,for example, the product of two photon operators

Page 260: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 260/687

230 CHAPTER 7. THE FOCK SPACE

N p,τ = c†p,τ cp,τ (7.28)

Acting on a state with two photons with quantum numbers (p, τ ) this oper-ator yields

N p,τ |p, τ ; p, τ = N p,τ c†p,τ c

†p,τ |0

= c†p,τ cp,τ c

†p,τ c

†p,τ |0

= c†p,τ c

†p,τ cp,τ c

†p,τ |0 + c†

p,τ c†p,τ |0

= c†p,τ c†p,τ c†p,τ cp,τ |0 + 2c†p,τ c†p,τ |0= 2|p, τ ; p, τ

while acting on the state |p, τ ; p′, τ ′ we obtain

N p,τ |p, τ ; p′, τ ′ = N p,τ c†p,τ c

†p′,τ ′|0

= c†p,τ cp,τ c

†p,τ c

†p′,τ ′|0

= c†p,τ c

†p,τ cp,τ c

†p′,τ ′|0 + c†

p,τ c†p′,τ ′|0

= c†p,τ c

†p,τ c

†p′τ ′cp,τ

|0

+ c†

p,τ c†p′,τ ′

|0

= |p, τ ; p′, τ ′

These examples should convince us that operator N p,τ works as a counter of the number of photons with quantum numbers (p, τ ).

7.1.7 Continuous spectrum of momentum

Properties of creation and annihilation operators presented in precedingsubsections were derived for the case of discrete spectrum of momentum.In reality the spectrum of momentum is continuous, and the above results

should be modified by taking the “large box” limit. We can guess that inthis limit eq. (7.19) transforms to

ap′,σ′ , a†p,σ = δ σ,σ′δ (p − p′) (7.29)

Page 261: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 261/687

7.1. ANNIHILATION AND CREATION OPERATORS 231

The following chain of formulas4

δ σ,σ′δ (p − p′) = p, σ|p′, σ′= 0|ap,σa†

p′,σ′|0= −0|a†

p′,σ′ap,σ|0 + δ σ,σ′δ (p − p′)

= δ σ,σ′δ (p − p′)

confirms that our choice (7.29) is consistent with normalization of momentumeigenvectors (5.15).

The same arguments now can be applied to positrons (operators bp,σ

and b†p,σ), protons (dp,σ and d†p,σ), antiprotons (f p,σ and f †p,σ), and photons(cp,τ and c†p,τ ). So, finally, we obtain the full set of anticommutation and

commutation relations pertinent to QED

ap,σ, a†p′,σ′ = bp,σ, b†

p′,σ′= dp,σ, d†

p′,σ′ = f p,σ, f †p′,σ′ = δ (p − p′)δ σσ′ (7.30)

ap,σ, ap′,σ′ = bp,σ, bp′,σ′ = dp,σ, dp′,σ′ = f p,σ, f p′,σ′= a†

p,σ, a†p′,σ′ = b†

p,σ, b†p′,σ′

=

d†p,σ, d†

p′,σ′

=

f †p,σ, f †p′,σ′

= 0 (7.31)

[cp,τ , c†p′,τ ′] = δ (p − p′)δ ττ ′ (7.32)

[c†p,τ , c†

p′,τ ′] = [cp,τ , cp′,τ ′] = 0 (7.33)

Commutators of operators related to different particles are always zero.In the continuous momentum limit, the analog of the particle counter

operator (7.28)

ρp,τ = c†p,τ cp,τ (7.34)

can be interpreted as the density of photons with helicity τ at momentump. By summing over photon polarizations and integrating density (7.34) wecan define an operator for the total number of photons in the system

4see eq. (5.15)

Page 262: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 262/687

232 CHAPTER 7. THE FOCK SPACE

N ph =τ

dpc†

p,τ cp,τ

We can also write down similar operator expressions for the numbers of otherparticles. For example

N el =σ

dpa†

p,σap,σ

is the electron number operator. Then operator

N = N el + N po + N pr + N an + N ph (7.35)

corresponds to the total number of all particles in the system.

7.1.8 Generators of the non-interacting representation

Now we can fully appreciate the benefit of introducing annihilation and cre-ation operators. The expression for the non-interacting Hamiltonian H 0 canbe simply obtained from the particle number operator (7.35) by multiplyingthe integrands (particle densities in the momentum space) by energies of freeparticles.

H 0 =

dpωp

σ=±1/2

[a†p,σap,σ + b†

p,σbp,σ ]

+

dpΩp

σ=±1/2

[d†p,σdp,σ + f †p,σf p,σ]

+ c

dp|p|

τ =±1

c†p,τ cp,τ (7.36)

where we denoted ωp =

m2c4 + p2c2 the energy of free electrons and

positrons and Ωp =

M 2c4 + p2c2 the energy of free protons and antipro-tons, and c|p| is the energy of free photons. So, this formula is no more com-plicated than just summing up free energies of all particles in the system. One

Page 263: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 263/687

7.1. ANNIHILATION AND CREATION OPERATORS 233

can easily verify that H 0 in (7.36) acts on states in the sector H(1, 0, 0, 0, 0)

just as eq. (7.12) and it acts on states in the sector H(2, 0, 0, 0, 2) just as eq.(7.13). So, we have obtained a single expression which works equally well inall sectors of the Fock space. Similar arguments demonstrate that operator

P0 =

dpp

σ=±1/2

[a†p,σap,σ + b†

p,σbp,σ]

+

dpp

σ=±1/2

[d†p,σdp,σ + f †p,σf p,σ]

+ dpp τ =±1c†p,τ cp,τ (7.37)

can be regarded as the total momentum operator in QED.Expressions for the generators J0 and K0 are more complicated as they

involve derivatives of particle operators. Let us illustrate their derivation onan example of a massive spinless particle. Consider the action of a spacerotation e− i

(J 0)zφ on the one-particle state |p5

e− i(J 0)zφ|p = | px cos φ + py sin φ, py cos φ − px sin φ, pz

This action can be represented as annihilation of the state |p = | px, py, pzfollowed by creation of the state | px cos φ + py sin φ, py cos φ − px sin φ, pz,i.e., if α†

p and αp are, respectively, creation and annihilation operators forthe particle, then

e− i(J 0)zφ| px, py, pz = α†

px cosφ+ py sin φ,py cosφ− px sinφ,pzα px,py,pz | px, py, pz= α†

Rz(φ)pαp| px, py, pz

Therefore, for arbitrary 1-particle state, the operator of finite rotation takesthe form

e− i(J 0)zφ =

dpα†

Rz(φ)pαp (7.38)

5 For simplicity, here we omit the spin index and its transformations.

Page 264: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 264/687

234 CHAPTER 7. THE FOCK SPACE

It is easy to show that the same form is valid everywhere on the Fock space.

An explicit expression for the operator (J 0)z can be obtained now by takinga derivative of (7.38) with respect to φ

(J 0)z = i limφ→0

d

dφe− i

(J 0)zφ

= i limφ→0

d

dpα†

Rz(φ)pαp

= i

dp( py

∂α†p

∂px− px

∂α†p

∂py)αp (7.39)

The action of a boost along the z -axis

e− i(K 0)zcθ|p =

ωp cosh θ + cpz sinh θ

ωp

| px, py, pz cosh θ + ωp cosh θ

can be represented as annihilation of the state |p = | px, py, pz followed by

creation of the state (ωp cosh θ+cpz sinh θ)1/2ω−1/2p | px, py, pz cosh θ+ωp cosh θ6

e−i(K 0)zcθ

|p = ωp cosh θ + cpz sinh θ

ωp α

† px,py,pz cosh θ+ωp cosh θα px,py,pz | px, py, pz

Therefore, for arbitrary state in the Fock space, the operator of a finite boosttakes the form

e− i(K 0)zcθ =

dp

ωΛpωp

α†Λpαp (7.40)

An explicit expression for the operator (K 0)z can be now obtained by takinga derivative of (7.40) with respect to θ

(K 0)z =i

climθ→0

d

dθe− i

(K 0)zcθ

6As seen from eq. (5.20), this state has proper normalization.

Page 265: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 265/687

7.1. ANNIHILATION AND CREATION OPERATORS 235

=i

c

limθ→0

d

dθ dp ωp cosh θ + cpz sinh θ

ωp

α† px,py,pz cosh θ+c−1ωp sinh θ

αp

= i

dp(

pz2ωp

α†pαp +

ωp

c2∂α†

p

∂pzαp) (7.41)

Similar derivations can be done for other components of J0 and K0.

7.1.9 Poincare transformations of particle operators

In chapter 5 we established that one-electron states transform as (5.20)with respect to Poincare transformations. Since, by construction, the non-interacting representation U 0

g

in the Fock space (generated by operators H 0,P0, J0, and K0 described above) coincides with these formulas in one-particlesectors, we can find transformations of creation-annihilation operators withrespect to the non-interacting representation U 0g . For electron operators weobtain

U 0(Λ; r, t)a†p,σU −10 (Λ; r, t)|0 = U 0(Λ; r, t)a†

p,σ|0= U 0(Λ; r, t)|p, σ=

ωΛpωp

e− ip·r+ i

ωpt

σ′D1/2σ′σ( φW (p, Λ))|Λp, σ′

Therefore7

U 0(Λ; r, t)a†p,σU −10 (Λ; r, t)

=

ωΛpωp

e− ip·r+ i

ωpt

σ′

D1/2σ′σ( φW (p, Λ))a†

Λp,σ′

=

ωΛpωp

e− ip·r+ i

ωpt

σ′(D1/2)∗

σσ′(− φW (p, Λ))a†Λp,σ′ (7.42)

Similarly, we obtain the transformation law for annihilation operators

7Here ∗ and † denote complex conjugation and Hermitian conjugation, respectively.We also use the property DT ( φ) = (D†( φ))∗ = (D−1( φ))∗ = D∗(− φ) which is valid forany unitary D.

Page 266: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 266/687

236 CHAPTER 7. THE FOCK SPACE

U 0(Λ; r, t)ap,σU −10 (Λ; r, t)

=

ωΛpωp

eip·r− i

ωpt

σ′

D1/2σσ′(

φW (p, Λ))aΛp,σ′ (7.43)

Transformation laws for photon operators are obtained from eq. (5.65)

U 0(Λ; r, t)c†p,τ U −10 (Λ; r, t) =

|Λp||p| e− i

p·r+ ic

|p|teiτφW (p,Λ)c†

Λp,τ (7.44)

U 0(Λ; r, t)cp,τ U −10 (Λ; r, t) = |Λp|

|p| eip·r− ic

|p|te−iτφW (p,Λ)cΛp,τ (7.45)

7.2 Interaction potentials

Our primary goal in the rest of this chapter and in the next chapter is tolearn how to calculate the S -operator in QED, which is the quantity mostreadily comparable with experiment.8 Equations (6.94) and (6.95) tell usthat in order to do that we need to know the non-interacting part H 0 and

the interacting part V of the full Hamiltonian

H = H 0 + V

The non-interacting Hamiltonian H 0 has been constructed in eq. (7.36). Theinteraction energy V (and the corresponding interaction boost Z) in QEDwill be explicitly written only in section 8.1. Until then we are going to studyrather general properties of interactions and S -operators in the Fock space.We will try to use some physical principles to narrow down the allowed form

of the operator V .8Although in section 6.5 we discussed the S -operator for systems with a fixed number

of particles, we never used the assumption that the particle content is the same before andafter the collision. So, our definition of the S -operator works equally well in systems withvariable number of particles considered here.

Page 267: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 267/687

7.2. INTERACTION POTENTIALS 237

7.2.1 Conservation laws

From experiment we know that interaction V between charged particles hasseveral important properties called conservation laws . An observable F iscalled conserved if it remains unchanged in the course of time evolution

F (t) ≡ eiHtF (0)e− i

Ht

= F (0)

It then follows that conserved observables commute with the Hamiltonian[F, H ] = [F, H 0 + V ] = 0, which imposes some restrictions on the interactionoperator V . For example, the conservation of the total momentum and thetotal angular momentum implies that

[V, P0] = 0 (7.46)

[V, J0] = 0 (7.47)

These commutators are automatically satisfied in the instant form of dy-namics (6.22). Our construction will be based entirely on this form. Wealso know that all interactions conserve the lepton number (the number of electrons minus the number of positrons, in our case). Therefore, H mustcommute with the lepton number operator

L = N el − N po =σ

dp(a†

p,σap,σ − b†p,σbp,σ) (7.48)

Since H 0 already commutes with L, we obtain

[V, L] = 0 (7.49)

Moreover, the interaction conserves the baryon number (=the number of protons minus the number of antiprotons in our case). So, V must alsocommute with the baryon number operator

B = N pr − N an

dp(d†

p,σdp,σ − f †p,σf p,σ) (7.50)

[V, B] = 0 (7.51)

Page 268: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 268/687

238 CHAPTER 7. THE FOCK SPACE

Taking into account that electrons have charge −e, protons have charge e,

and antiparticles have charges opposite to those of particles, we can introducethe operator of the electric charge

Q = e(B − L) (7.52)

= eσ

dp(b†

p,σbp,σ − a†p,σap,σ)

+eσ

dp(d†

p,σdp,σ − f †p,σf p,σ) (7.53)

and obtain the charge conservation law

[H, Q] = [V, Q]

= e[V, B − L]

= 0 (7.54)

from eqs. (7.49) and (7.51).It follows from formulas (6.99) and (6.101) that if the interaction operator

V satisfies conditions (7.46), (7.47), (7.49), (7.51), and (7.54) then scatteringoperators F , Σ, and S also commute with P0, J0, L, B, and Q, which meansthat corresponding observables are conserved in scattering events. Although,separate numbers of particles of individual species, i.e., electrons, or protonsmay not be conserved, the above conservation laws require that chargedparticles may be created or annihilated only together with their antiparticles,i.e., in pairs. Creation of pairs is suppressed in low energy reactions as suchprocesses require additional energy of 2melc

2 = 2×0.51MeV = 1.02MeV foran electron-positron pair and 2m prc2 = 1876.6MeV for an proton-antiprotonpair. Therefore such processes can be safely neglected in most applicationsusually considered in classical electrodynamics. However, since photons havezero mass, the energetic threshold for the photon emission is zero, and thereare no restrictions on creation and annihilation of photons. Photons are theirown antiparticles, so they can be created and destroyed in any quantities.

7.2.2 Normal ordering

In the next subsection we are going to express operators in the Fock spaceas polynomials in particle creation and annihilation operators. But first we

Page 269: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 269/687

7.2. INTERACTION POTENTIALS 239

need to overcome one notational problem related to the non-commutativity

of particle operators: two different polynomials may, actually, represent thesame operator. To have a unique polynomial representative for each operator,we will agree always to write products of operators in the normal order , i.e.,creation operators to the left from annihilation operators. Among creation(annihilation) operators we will enforce a certain order based on particlespecies: We will write particle operators in the order proton - antiproton -electron - positron - photon from left to right. With these rules and with(anti)commutation relations (7.30) - (7.33) we can always convert a productof particle operators to the normally ordered form. This is illustrated by thefollowing example

ap′,σ′cq′,τ ′a†p,σc†

q,τ = ap′,σ′a†p,σcq′,τ ′c

†q,τ

= (a†p,σap′,σ′ + δ (p − p′)δ σ,σ′)(−c†

q,τ cq′,τ ′ + δ (q − q′)δ τ,τ ′))

= −a†p,σc†

q,τ ap′,σ′cq′,τ ′ + a†p,σap′,σ′δ (q − q′)δ τ,τ ′

− c†q,τ cq′,τ ′δ (p − p′)δ σ,σ′ + δ (p − p′)δ σ,σ′δ (q − q′)δ τ,τ ′

where the right hand side is in the normal order.

7.2.3 The general form of the interaction operator

A well-known theorem (see [9] p. 175) states that in the Fock space anyoperator V satisfying conservation laws (7.46), (7.47), (7.49), (7.51), and(7.54)

[V, P0] = [V, J0] = [V, L] = [V, B] = [V, Q] = 0 (7.55)

can be written as a polynomial in particle creation and annihilation opera-tors9

V = ∞N =0

∞M =0

V NM (7.56)

9Although this form does not involve derivatives of particle operators, it still can be usedto represent operators like eq. (7.39) if derivatives are approximated by finite differences.

Page 270: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 270/687

240 CHAPTER 7. THE FOCK SPACE

V NM = η,η′ dq′1 . . . dq′

N dq1 . . . dqM DNM (q′1η

′1; . . . ; q′

N η′N ; q1η1; . . . ; qM ηM ) ×

δ (N i=1

q′i −

M j=1

q j)α†q′1,η

′1

. . . α†q′N ,η

′N

αq1,η1 . . . αqM ,ηM (7.57)

where the summation is carried over all spin/helicity indices η, η′ of cre-ation and annihilation operators, and integration is carried over all particlemomenta. Individual terms V NM in the expansion (7.56) of the interactionHamiltonian will be called potentials . Each potential is a normally orderedproduct of N creation operators α† and M annihilation operators α.10 Thepair of integers (N, M ) will be referred to as the index of the potential V NM .

A potential is called bosonic if it has an even number of fermion particleoperators N f + M f . Conservation laws (7.55) imply that all potentials inQED must be bosonic.

DNM is a numerical coefficient function which depends on momenta andspin projections (or helicities) of all created and annihilated particles. Inorder to satisfy [V, J0] = 0, the function DNM must be rotationally invariant.The translational invariance of the expression (7.57) is guaranteed by themomentum delta function

δ (

N i=1

q′i −M j=1

q j)

which expresses the conservation of momentum: the sum of momenta of annihilated particles is equal to the sum of momenta of created particles.11

The interaction Hamiltonian enters in the formulas (6.99) and (6.101) forthe S -operator in the t-dependent form

V (t) = e− iH 0tV e

iH 0t (7.58)

10

Here symbols α†

and α refer to generic creation and annihilation operators withoutspecifying the type of the particle.11Note that this property does not apply to the energy: the sum of free energies of an-

nihilated particles is not necessarily equal to the sum of free energies of created particles.This is because such sums of free energies are eigenvalues of the non-interacting Hamilto-nian H 0, which is not a conserved quantity in the presence of interaction [H 0, H ] = 0.

Page 271: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 271/687

7.2. INTERACTION POTENTIALS 241

Operators with t-dependence determined by the free Hamiltonian H 0 as in

eq. (7.58) and satisfying conservation laws (7.55) will be called regular . Suchoperators will play an important role in our calculations of the S -operatorbelow. In what follows, when we write a regular operator V without itst-argument, this means that either this operator is t-independent, i.e., itcommutes with H 0, or that we take its value at t = 0.

One final notational remark. If potential V NM has coefficient functionDNM , we introduce notation V NM ζ for the operator whose coefficient func-tion D′

NM is a product of DNM and a function ζ of the same arguments

D′NM (q′

1η′1; . . . ; q′

N η′N ; q1η1; . . . ; qM ηM )

= DNM (q′1η′1; . . . ; q′N η′N ; q1η1; . . . ; qM ηM )ζ (q′1η′1; . . . ; q′N η′N ; q1η1; . . . ; qM ηM )

Then, substituting (7.57) in (7.58) and using (7.42) - (7.45), the t-dependentform of any regular potential V NM (t) can be written as

V NM (t) = e− iH 0tV NM e

iH 0t

= V NM e− iE NM t

where

E NM (q′1, . . . , q′

N , q1, . . . , qM ) ≡N i=1

ωq′i−

M j=1

ωqj (7.59)

is the difference of energies of particles created and destroyed by V NM , whichis called the energy function of the term V NM . We can also generalize thisnotation for a general sum of potentials V NM

V (t) = e− iH 0tV e

iH 0t

= V e−i

E V t

which means that for each potential V NM (t) entering the sum V (t), the ar-gument of the t-exponent contains the corresponding energy function E NM .In this notation we can conveniently write, for example,

Page 272: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 272/687

242 CHAPTER 7. THE FOCK SPACE

ddt

V (t) = V (t) (− i

E V ),

and

V (t) ≡ i

∞−∞

V (t)dt

= 2πiV δ (E V ) (7.60)

Eq. (7.60) means that each term in V (t)

is non-zero only on the hypersurface

of solutions of the equation

E NM (q′1, . . . , q′

N , q1, . . . , qM ) = 0 (7.61)

(if such solutions exist). This hypersurface in the momentum space is calledthe energy shell of the potential V NM . We will also say that V (t) in eq. (7.60)

is zero outside the energy shell of V . Note that the scattering operator (6.98)S = 1+Σ(t) is different from 1 only on the energy shell, i.e., where the energy

conservation condition (7.61) is satisfied.

7.2.4 Five types of regular potentials

Here we would like to introduce a classification of regular potentials (7.57) bydividing them into five groups depending on their index (N, M ). We will callthese types of operators renorm , oscillation , decay , phys , and unphys . Therationale for introducing this classification and nomenclature will becomeclear in chapters 8 and 9 where we will examine renormalization and the“dressed particle” approach in quantum field theory.

Renorm potentials have either index (0,0) (such operator is simply anumerical constant) or index (1,1) in which case both created and annihi-lated particles are required to have the same mass. The most general form

of a renorm potential obeying conservation laws is the sum of a numericalconstant C and (1,1) terms corresponding to each particle type.12

12Here we write just the operator structure of R omitting all numerical factors, indices,integration and summation signs. Note also that terms like a†b or d†f are forbidden bythe charge conservation law (7.54).

Page 273: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 273/687

7.2. INTERACTION POTENTIALS 243

R = a†a + b†b + d†d + f †f + c†c + C (7.62)

Note that the free Hamiltonian (7.36) and the total momentum (7.37) areexamples of renorm operators, i.e., sums of renorm potentials. The class of renorm potentials is characterized by the property that the energy function(7.59) is identically zero. So, renorm potentials always have energy shellwhere they do not vanish. Renorm potentials commute with H 0, thereforeregular renorm operators do not depend on t.

Oscillation potentials have index (1, 1). In contrast to renorm po-tentials with index (1,1), oscillation potentials destroy and create different

particle species having different masses. For this reason, the energy function(7.59) of an oscillation potential never turns to zero, so there is no energyshell. In QED there can be no oscillation potentials, because they wouldviolate either lepton number or baryon number conservation law. However,there are particles in nature, such as kaons and neutrinos, for which oscil-lation interactions play a significant role. These interactions are responsiblefor mixing and time-dependent oscillations between different particle species[77].

Decay potentials satisfy two conditions:

1. they must have indices (1, N ) or (N, 1) with N ≥ 2;

2. they must have a non-empty energy shell;

There are no decay terms in the QED Hamiltonian and in the correspondingS -matrix: decays of electrons, protons, or photons would violate conservationlaws.13 Nevertheless, particle decays play an important role in other areas of high energy physics, and they will be considered in sections 7.5 and 10.5.

13Exceptions to this rule are given by operators describing the decay of a photon intoodd number of photons, e.g.,

c†k1,τ 1c†k2,τ 2

c†k3,τ 3ck1+k2+k3,τ 4

This potential obeys all conservation laws if momenta of involved photons are collinear and|k1|+ |k2|+ |k3|−|k1+k2 +k3| = 0. However, it was shown in [78] that such contributionsto the S -operator are zero on the energy shell, so photon decays are forbidden in QED.

Page 274: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 274/687

244 CHAPTER 7. THE FOCK SPACE

Phys potentials have at least two creation operators and at least two

destruction operators (index (N, M ) with N ≥ 2 and M ≥ 2). For physpotentials the energy shell always exists. For example, in the case of a physpotential d†

p+k,ρf †q−k,σap,τ bq,η the energy shell is determined by the solutionof equation Ωp+k + Ωq−k = ωp + ωq which is not empty.

Table 7.1: Types of potentials in the Fock space.Potential Index of potential Energy shell Examples

(N, M )Renorm (0, 0),(1, 1) yes a†

papOscillator (1, 1) no forbidden in QED

Unphys (0, N ≥ 1),(N ≥ 1, 0) no a†pb

†−p−kc

†k

Unphys (1, N ≥ 2),(N ≥ 2, 1) no a†pap−kck

Decay (1, N ≥ 2),(N ≥ 2, 1) yes forbidden in QED

Phys (N ≥ 2, M ≥ 2) yes a†p−kd†

q+kapdq

All regular operators not mentioned above belong to the class of Unphys potentials. They come in two subclasses. They can either

1. have index (0, N ), or (N, 0), where N ≥ 1. Obviously, there is noenergy shell in this case.

2. or have index (1, M ) or (M, 1), where M ≥ 2. This is the same condi-tion as for decay potentials, however, in contrast to decay potentials,for unphys potentials it is required that the energy shell does not exist.

An example of an unphys potential satisfying condition 2. is

a†p,ρap−k,σck,τ (7.63)

The energy shell equation is

ωp−k + c|k| = ωp

with only solution k = 0. However zero vector is excluded from the pho-ton momentum spectrum (see subsection 5.4.1), so the energy shell of thepotential (7.63) is empty.

Page 275: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 275/687

7.2. INTERACTION POTENTIALS 245

Properties of potentials discussed above are summarized in Table 7.1.

These five types of potentials exhaust all possibilities, therefore any regularoperator V must have a unique decomposition

V (t) = V ren + V unp(t) + V dec(t) + V ph(t) + V osc(t).

As mentioned above, in QED interaction, oscillation and decay contributionsare absent. So, everywhere in this book14 we will assume that all operatorsare sums of renorm, unphys, and phys potentials

V (t) = V ren + V unp(t) + V ph(t)

Now we need to learn how to perform various operations with these fiveclasses of potentials, i.e., the product, commutator, and t-integral, requiredfor calculations of scattering operators in (6.99) or (6.101).

7.2.5 Products and commutators of potentials

Lemma 7.1 The product of two (or any number of) regular operators is regular.

Proof. If operators A(t) and B(t) are regular, then

A(t) = e− iH 0tAe

iH 0t

B(t) = e− iH 0tBe

iH 0t

and their product C (t) = A(t)B(t) has t-dependence

C (t) = e− iH 0tAe

iH 0te− i

H 0tBe

iH 0t

= e− iH 0tABe

iH 0t

characteristic for regular operators. The conservation laws (7.55) are validfor the product AB if they are valid for A and B separately.

14except sections 7.5 and 10.5 where we will discuss decays

Page 276: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 276/687

246 CHAPTER 7. THE FOCK SPACE

Lemma 7.2 A Hermitian operator A is phys if and only if it yields zero

when acting on the vacuum |0 and one-particle states |1.

A|0 = 0 (7.64)

A|1 = 0 (7.65)

Proof. Phys operators have two annihilation operators on the right, soeqs. (7.64) and (7.65) are satisfied. Let us now prove the inverse statement.Renorm operators cannot satisfy (7.64) and (7.65) because they conserve thenumber of particles. Unphys operators (1, N ) can satisfy eqs. (7.64) and(7.65), e.g.,

α†1α2α3|0 = 0

α†1α2α3|1 = 0.

However, for Hermiticity, such operators should be always present in pairswith (N, 1) operators α†

2α†3α1. Then, there exists at least one one-particle

state |1 for which eq. (7.65) is not valid, e.g.,

α†3α

†2α1|1 = α†

3α†2|0

= 0

The same argument is valid for unphys operators having index (0 , N ). There-fore, the only remaining possibility for A is to be phys.

Lemma 7.3 Product and commutator of any two phys operators A and Bis phys.

Proof. By Lemma 7.2 if A and B are phys, then

A|0 = B|0 = A|1 = B|1 = 0.

Then the same conditions are true for the Hermitian combinations i(AB −BA) and AB + BA. Therefore, the commutator [A, B] and anticommutatorAB + BA are phys, and

Page 277: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 277/687

7.2. INTERACTION POTENTIALS 247

AB = 12

(AB + BA) + 12

[A, B]

is phys as well.

Lemma 7.4 If R is a renorm operator and [A, R] = 0, then operator [A, R]has the same type (i.e., renorm, phys, or unphys) as A.

Proof. The general form of a renorm operator is given in eq. (7.62). Let usconsider just one term in the sum over particle types15

R =

dpG(p)α†

pαp

Let us calculate the commutator [A, R] = AR − RA by moving the factorR in the term AR to the leftmost position. If the product α†

pαp (from R)changes places with a particle operator (from A) different from α† or α thennothing happens. If the product α†α changes places with a creation operatorα†q (from A) then, as discussed in subsection 7.2.2, a secondary term should

be added which, instead of α†q contains the commutator16

α†q(

dpf (p)α†

pαp) − (

dpf (p)α†

pαp)α†q

= ±

dpf (p)α†pα†

qαp −

dpf (p)α†pαpα†

q

=

dpf (p)α†

pαpα†q ±

dpf (p)α†

pωpδ (p − q) −

dpf (p)α†pαpα†

q

= ±f (q)ωqα†q

This commutator is proportional to α†q, so the secondary term has the same

operator structure as the primary term, and it is already in the normalorder, so no tertiary terms need to be created. If the product α†α changes

15Here α denotes any one of the five particle operators (a,b,d,f,c) present in (7.62).Again, the spin indices are omitted.

16The upper sign is for bosons, and the lower sign is for fermions.

Page 278: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 278/687

248 CHAPTER 7. THE FOCK SPACE

places with an annihilation operator αq then the commutator ∓f (q)ωqαq

is proportional to the annihilation operator. If there are many α† and αoperators in A having non-vanishing commutators with R, then each one of them results in one additional term whose type remains the same as in theoriginal operator A.

Lemma 7.5 A commutator [P, U ] of a Hermitian phys operator P and a Hermitian unphys operator U can be either phys or unphys, but not renorm.

Proof. Acting by [P, U ] on a one-particle state |1, we obtain

[P, U ]|1

= (P U −

UP )|1= P U |1

If U is Hermitian then the state U |1 has at least two particles (see proof of Lemma 7.2), and the same is true for the state P U |1. Therefore, [P, U ] cre-ates several particles when acting on a one-particle state, which is impossibleif [P, U ] were renorm.

Finally, there are no limitations on the type of commutator of two unphysoperators [U, U ′]. It can be a superposition of phys, unphys, and renormterms.

7.2.6 Adiabatic switching and t-integrals

Lemma 7.6 A t-derivative of a regular operator A(t) is regular and has zerorenorm part.

Proof. The derivative of a regular operator has t-dependence that is char-acteristic for regular operators:

d

dtA(t) =

d

dte− i

H 0tAe

iH 0t

= −i

e−iH 0t

[H 0, A]e

iH 0t

= − i

[H 0, A(t)] (7.66)

= − i

e− i

H 0t[H 0, A]e− i

H 0t (7.67)

Page 279: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 279/687

7.2. INTERACTION POTENTIALS 249

In addition, it is easy to check that the derivative obeys all conservation laws

(7.55). Therefore, it is regular.Suppose that ddtA(t) has a non-zero renorm part R. Then R is t-independent

and originates from a derivative of the term Rt + S in A(t), where S is t-independent. Since A(t) is regular, its renorm part must be t-independent,therefore R = 0.

In formulas for scattering operators (6.99) and (6.101) we meet t-integralsV (t) of regular operators V (t). A straightforward calculation of such integralsgives rather discouraging result

V (t) ≡i

V

t

−∞

e−iE V t

dt′ (7.68)

=i

V (− e− i

E V t

iE V + e− i

E V (−∞)

iE V ) (7.69)

What shall we do with the term on the right hand side containing −∞?Recall that the purpose of using the t-integral is in calculation of the

S -operator by perturbation theory described in subsection 6.5.1. The mean-ingless term on the right hand side of (7.68) can be made harmless if wetake into account an important fact that the S -operator does not act on allstates in the Hilbert space. It only acts on scattering states in which free

particles are far from each other in asymptotic limits t → ±∞. The timeevolution of these states coincides with the free evolution in the distant pastand distant future. Certainly, these assumptions cannot be applied to allstates in the Hilbert space. For example, the time evolution of bound statesof the interacting Hamiltonian H , does not resemble the free evolution atany time. It appears that if we limit the application of the S -operator andt-integrals (7.68) only to scattering states |Ψ(−∞) consisting of one-particlewave packets with good localization in both position and momentum spaces,then no ambiguity arises. In the distant past and in the distant future theparticles are far away from each other and the interaction operator is effec-tively zero,17 so, for any term in the scattering operator we can write

limt→±∞

e− iHtV e

iHt |Ψ = lim

t→±∞e− i

H 0tV e

iH 0t|Ψ′

17Of course, the interaction V must be cluster separable to ensure that.

Page 280: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 280/687

250 CHAPTER 7. THE FOCK SPACE

= limt→±∞

V (t)|Ψ′

= 0 (7.70)

(where |Ψ′ is generally different from |Ψ).One approach to the exact treatment of scattering is to explicitly consider

only wave packets described above. Then the cluster separability of V willensure the correct behavior of the wave packets, and the validity of eq. (7.70).However, such an approach is rather complicated, and we would like to stayaway from working with wave packets.

There is another way to achieve the same goal by using a trick called theadiabatic switching of interaction. The trick is to add the property (7.70)to the interaction operator “by hand”. To do that we multiply V (t) by a

numerical function of t which slowly grows from the value of zero at t = −∞to the value of one at t ≈ 0 (turning the interaction “on”) and then slowlydecreases back to zero at t = ∞ (turning the interaction “off”). For example,it is convenient to choose

V (t) = e− iH 0tV e

iH 0te−ǫ|t|,

If the parameter ǫ is small and positive, such a modification would not affectthe movement of quasiclassical wave packets and the S -matrix. At the endof calculations we will take the limit ǫ

→+0. Then, if V (t) is either phys or

unphys, the t-integral (7.68) takes the form

V (t) =i

limǫ→+0

V t

−∞e− i

E V t

e−ǫ|t′|dt′

=i

limǫ→+0

V 0

−∞

e− iE V t

eǫt′

dt′ +i

limǫ→+0

V t 0

e− iE V t

e−ǫt′dt′

=i

limǫ→+0

V

i

E V + i

ǫ

e− iE V t+ǫt

|0

−∞+

i

limǫ→+0

V

i

E V − i

ǫ

e− iE V te−ǫt

|t0

= V −1

E V + lim

ǫ→+0V −1

E V (e− i

E V te−ǫt − 1)

= V −e− iE V t

E V

Page 281: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 281/687

7.2. INTERACTION POTENTIALS 251

so the embarrassing expression ei∞ does not appear.

In fact, we are not going to use the parameter ǫ explicitly in calculations.Instead of (7.69) we will simply use the equivalent rule for calculating t-integrals

V (t) = V (t) −1

E V (7.71)

where the “ei∞” term is omitted. This formula explicitly demonstrates thatt-integrals of phys and unphys operators are regular. However, this does notapply to renorm operators. They are t-independent, therefore

V ren = limǫ→+0V ren ie−ǫt

ǫ

= limǫ→+0

V ren i

ǫ − V ren it

+ . . . (7.72)

V ren = ∞ (7.73)

Thus, renorm operators are different from others in the sense that their t-integrals (7.72) are infinite and non-regular, and their t-integrals (7.73) areinfinite. This fact does not limit the applicability of our theory, because,as we will see in subsection 8.3.2, a properly renormalized expression foroperators F in (6.101) and Σ in (6.99) should not contain renorm terms.

Since for any unphys operator V unp either the energy shell does not existor the coefficient function is zero on the energy shell, we conclude from eq.(7.60) that

V unp = 0 (7.74)

From eqs. (6.99) and (6.101) it is then clear that unphys terms in F and Σdo not make contributions to the S -operator.

The results obtained in this subsection and in the preceding subsectionare summarized in Table 7.2.

7.2.7 Two-particle potentials

Our next goal is to express n-particle potentials (n ≥ 2) studied in subsection6.3.5 using the formalism of annihilation and creation operators. These po-tentials conserve the number and types of particles, so they must have equal

Page 282: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 282/687

252 CHAPTER 7. THE FOCK SPACE

Table 7.2: Operations with regular operators in the Fock space. (Notation:

P=phys, U=unphys, R=renorm, NR=non-regular.)Type of operator

A [A, P ] [A, U ] [A, R] dAdt A A

P P P+U P P P PU P+U P+U+R U U U 0R P U R 0 NR ∞

numbers of creation and annihilation operators (N ≥ 2, M ≥ 2). Therefore,their type must be phys.

Consider now a two-particle subspace H(1, 0, 1, 0, 0) of the Fock space.This subspace describes states of the system consisting of one electron andone proton. A general phys operator leaving this subspace invariant musthave N = 2, M = 2 and, according to eq. (7.57), it can be written as18

V =

dpdedp′de′D22(p, e, p′, e′)δ (p + e − p′ − e′)d†

pa†edp′ae′

=

dpdedp′D22(p, e, p′, p + e − p′)d†

pa†edp′dp+e−p′

= dpdedkV (p, e, k)d†pa†edp−kae+k (7.75)

where we denoted k = p − p′ the “transferred momentum” and

V (p, e, k) = D22(p, e, p − k, e + k)

Acting by this operator on an arbitrary state |Ψ of the two-particle system

= dp′′de′′Ψ(p′′, e′′)d†

p′′a

†e′′

|0

(7.76)

we obtain

18In this subsection we use variables p and e to denote momenta of the proton andelectron, respectively, and omit spin indices for brevity.

Page 283: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 283/687

7.2. INTERACTION POTENTIALS 253

V |Ψ=

dpdedkV (p, e, k)d†

pa†edp−kae+k

dp′′de′′Ψ(p′′, e′′)d†

p′′a†e′′|0

=

dpdedkV (p, e, k)

dp′′de′′Ψ(p′′, e′′)δ (p − k − p′′)δ (e + k − e′′)d†

pa†e|0

=

dpde

dkV (p, e, k)Ψ(p − k, e + k)

d†pa†

e|0 (7.77)

Comparing this with (7.76) we see that the momentum-space wave functionΨ(p, e) has been transformed by the action of V to the new wave function

Ψ′(p, e) = V Ψ(p, e)

=

dkV (p, e, k)Ψ(p − k, e + k)

This is the most general linear transformation of a two-particle wavefunctionwhich conserves the total momentum.

For comparison with traditionally used inter-particle potentials, it is moreconvenient to have expression for V in the position space. We can write19

Ψ′(x, y)

= V Ψ(x, y)

=1

(2π )3

eipx+ i

eydpdeΨ′(p, e)

=1

(2π )3

eipx+ i

eydpde(

dkV (p, e, k)Ψ(p − k, e + k))

=1

(2π )3

ei(p+k)x+ i

(e−k)ydpde

dkV (p + k, e − k, k)Ψ(p, e)

= dke

ik(y

−x)

V (p + k, e − k, k)[(

1

(2π )3 dpdee

ipx+ i

ey

Ψ(p, e))]

(7.78)

19Here x and y are positions of the proton and the electron, respectively; and we use(5.34) to transform between momentum and position representations.

Page 284: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 284/687

254 CHAPTER 7. THE FOCK SPACE

where expression in square brackets is recognized as the original position-

space wave function Ψ(x, y) and the rest as an operator acting on this wavefunction. This operator acquires especially simple form if we assume thatV (p, e, k) does not depend on p and e.

V (p, e, k) = v(k)

then

V Ψ(x, y)

= dkeik(y−x)v(k)[( 1(2π )3

dpdeeipx+

i eyΨ(p, e))]

= v(x − y)1

(2π )3

dpdee

ipx+ i

eyΨ(p, e)

= v(x − y)Ψ(x, y) (7.79)

where

v(r) =

dke

ikrv(k)

is the Fourier transform of v(k). We see that interaction (7.75) acts asmultiplication by the function v(x) in the position space. So, it is a usualposition-dependent potential. Note that the requirement of the total mo-mentum conservation implies automatically that the potential depends onthe relative position (x − y).

Let us now consider the general case (7.78). Without loss of generalitywe can represent function V (p + k, e − k, k) as a series20

V (p + k, e

−k, k) = j χ j(p, e)v j(k)

Then we obtain

20For example, a series of this form can be obtained by writing a Taylor expansion withrespect to the variable k with χj being the coefficients depending on p and e.

Page 285: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 285/687

7.2. INTERACTION POTENTIALS 255

V Ψ(x, y)

= j

dke

ik(y−x)v j(k)[(

1

(2π )3

dpdeχ j(p, e)e

ipx+ i

eyΨ(p, e))]

= j

v j(y − x)χ j(p, e)[(1

(2π )3

dpdee

ipx+ i

eyΨ(p, e))]

= j

v j(y − x)χ j(p, e)Ψ(x, y) (7.80)

where p = −i d/dx and e = −i d/dy are differential operators, i.e., position-

space representations (5.32) of the momentum operators of the two particles.Expression (7.80) then demonstrates that interaction d†a†da can be alwaysrepresented as a general 2-particle potential depending on the distance be-tween particles and their momenta. We will use eq. (7.80) in our derivationof 2-particle RQD potentials in subsection 9.3.3.

7.2.8 Cluster separability in the Fock space

We know from subsection 6.3.5 that a cluster separable interaction potentialcan be constructed as a sum of smooth potentials (6.50). However, thisnotation is very inconvenient to use in the Fock space. Such sums have

rather different forms in different Fock sectors. For example, the Coulombinteraction has the form (6.48) in the 2-particle sector and the form (6.49)in the 3-particle sector. Fortunately, cluster separable interactions in theFock space can be written in the general form (7.56) - (7.57). The clusterseparability is ensured if coefficient functions DNM are smooth functions of particle momenta [9].

Let us verify this statement on a simple example. We are going to findout how the 2-particle potential (7.75) acts in the 3-particle (one proton andtwo electrons) sector of the Fock space H(2, 0, 1, 0, 0) where state vectorshave the form

|Ψ =

dpde1de2ψ(p, e1, e2)d†

pa†e1

a†e2

|0 (7.81)

Applying operator (7.75) to this state vector we obtain

Page 286: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 286/687

256 CHAPTER 7. THE FOCK SPACE

V |Ψ=

dp′de′dk

dpde1de2V (p′, e′, k)ψ(p, e1, e2)d†

p′a†e′dp′−kae′+kd†

pa†e1

a†e2

|0(7.82)

The product of particle operators acting on the vacuum state can be rewrittenas

d†p′a

†e′dp′−kae′+kd†

pa†e1

a†e2

|0= −d†

p′a†e′d†

pa†e1a†

e2dp′−kae′+k|0 + d†p′a†

e′ae′+ka†e1a†

e2δ (p′ − k − p)|0− d†

p′a†e′d

†pa†

e2dp′−kδ (e1 − e′ − k)|0 + d†

p′a†e′d

†pa†

e1dp′−kδ (e2 − e′ − k)|0

= d†p′a

†e′a

†e2

δ (p′ − k − p)δ (e1 − e′ − k)|0 − d†p′a

†e′a

†e1

δ (e2 − e′ − k)δ (p′ − k − p)|0

Inserting this result in (7.82) we obtain

V |Ψ= dkdpde1de2V (k + p, e1 − k, k)ψ(p, e1, e2)d†

k+pa†e1−ka†

e2|0

dkdpde1de2V (k + p, e2 − k, k)ψ(p, e1, e2)d†k+pa†

e2−ka†e1

|0

=

dkdpde1de2V (p, e1, k)ψ(p − k, e1 + k, e2)d†

pa†e1

a†e2

|0

+

dkdpde1de2V (p, e2, k)ψ(p − k, e1, e2 + k)d†

pa†e1

a†e2

|0

=

dpde1de2

dk[V (p, e1, k)ψ(p − k, e1 + k, e2)

+ V (p, e2, k)ψ(p − k, e1, e2 + k)]d†pa†

e1a†e2

|0 (7.83)

Comparing this with eq. (7.77) we see that, as expected from the conditionof cluster separability, the two-particle interaction in the three-particle sectorseparates in two terms. One term acts on the pair of variables (p, e1). Theother term acts on variables (p, e2).

Page 287: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 287/687

7.2. INTERACTION POTENTIALS 257

Removing the electron 2 to infinity is equivalent to multiplying the wave

function ψ(p, e1, e2) by exp(

i

e2a) where a → ∞. The action of V on such awave function (i.e., the term in parentheses in (7.83)) is

lima→∞

dkV (p, e1, k)ψ(p − k, e1 + k, e2)e

ie2a

+

dkV (p, e2, k)ψ(p − k, e1, e2 + k)e

i(e2+k)a

In the limit a → ∞ the exponent in the integrand of the second term is arapidly oscillating function of k. If the coefficient function V (p, e, k) is a

smooth function of k then the integral on k is zero due to the Riemann-Lebesgue lemma B.1. Therefore, only the interaction proton - electron(1)does not vanish

lima→∞

V eie2a|Ψ

= lima→∞

dkdpde1de2V (p, e1, k)ψ(p − k, e1 + k, e2)e

ie2ad†

pa†e1

a†e2

|0

which demonstrates that V is a cluster separable potential.For more complex potentials (including those which change the number

of particles) with smooth coefficient functions, the above arguments can berepeated. Then one can see that when some particles are removed to infinitysuch potentials automatically separate into sums of smooth terms, as requiredby cluster separability. Therefore,

Statement 7.7 (cluster separability) The cluster separability of the in-teraction ( 7.56 ) is guaranteed if the coefficient functions DNM of all potentials V NM are smooth functions of momenta.

The power of this statement is that when expressing interacting potentials

through particle operators in the momentum representation we have a verysimple criterion of cluster separability: the coefficient functions must besmooth, i.e., they should not contain singular factors, like delta functions.21

21This is the reason why such potentials were called smooth in subsection 6.3.5.

Page 288: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 288/687

258 CHAPTER 7. THE FOCK SPACE

This is the great advantage of writing interactions in terms of particle (cre-

ation and annihilation) operators instead of particle (position and momen-tum) observables as in section 6.3.22 Recall that in subsection 6.3.7 it was avery non-trivial matter to ensure the cluster separability for interaction po-tentials written in terms of particle observables even in a simplest 3-particlesystem.

7.3 A toy model theory

Before considering real QED interactions between charged particles and pho-tons in the next chapter, in this section we are going to perform a warm-up

exercise. We will introduce a simple yet quite realistic model theory withvariable number of particles in the Fock space. In this theory, the perturba-tion expansion of the S -operator can be evaluated with minimal efforts, inparticular, with the help of a convenient diagram technique . In section 8.3we will use this simplified model again in order to illustrate major ideas of the renormalization program.

7.3.1 Fock space and Hamiltonian

The toy model introduced here is a rough approximation to QED. This ap-proximation describes only electrons and photons and their interactions. (No

particle-antiparticle pair creation is allowed.) So, we will work in the part of the Fock space with no other particles. This part is a direct sum of electron-photon sectors like those described in formulas (7.2) - (7.10). We will alsoassume that interaction does not affect the electron spin and photon polar-ization degrees of freedom, so the corresponding labels will be omitted. Thenrelevant (anti)commutation relations of particle operators can be obtainedfrom (7.30) - (7.33)

ap, a†p′ = δ (p − p′) (7.84)

[cp, c†p

′ ] = δ (p−

p′) (7.85)

ap, ap′ = a†p, a†

p′ = 0 (7.86)

[cp, cp′ ] = [c†p, c†

p′] = 0 (7.87)

22see section 4 of ref. [9].

Page 289: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 289/687

7.3. A TOY MODEL THEORY 259

[a†p, c†

p′] = [a†p, cp′] = [ap, c†

p′] = [ap, cp′] = 0 (7.88)

The interacting Hamiltonian H = H 0 + V 1, as usual, is the sum of the freeHamiltonian

H 0 =

dpωpa†

pap + c

dk|k|c†

kck,

and interaction, which we choose in the following unphys form

V 1 =e

√ c

(2π )3/2

dpdk

|k| a†

pc†kap+k +

e √

c

(2π )3/2

dpdk

|k| a†

pap−kck (7.89)

The coupling constant e is assumed to be proportional to the absolutevalue of the electron charge. Here and in what follows the perturbation order of an operator (= the power of the coupling constant e) is shown by thesubscript. For example, the free Hamiltonian H 0 does not depend on e, so itis of zero perturbation order; V 1 is of the first perturbation order, etc.

The above theory satisfies conservation laws

[H, Q] = [H, P0] = [H, J0] = 0,

where P0 is the total momentum operator (7.37), J0 is the total angularmomentum operator (7.39), and Q is the total charge (7.53). The number of

electrons is conserved, but the number of photons can be changed. So, thistheory is capable of describing important processes of the photon emissionand absorption. However, it has two major drawbacks.

First, it is not Poincare invariant. This means that it is not possible toconstruct an interacting boost operator K such that the Poincare commuta-tion relations with H , P0, and J0 are satisfied. In this section we will toleratethis deficiency, but in chapter 8 we will show how the Poincare invariancecan be satisfied in a more comprehensive theory (QED) which includes bothparticles and antiparticles.

The second drawback is that due to the presence of singularities k−1/2

in the coefficient functions of (7.89), the interaction V 1 formally does notsatisfy our criterion of cluster separability in Statement 7.7.23 The singular-ities at k = 0 are related to so-called infrared divergences . Physically, these

23The same problem is also characteristic for interaction potentials in QED (M.5) and(M.8)

Page 290: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 290/687

260 CHAPTER 7. THE FOCK SPACE

divergences reflect the fact that the photon has zero mass, and an unlimited

number of soft photons can be created in scattering processes. The tech-niques for dealing with infrared divergences are well established (see, e.g.,[9, 79]) and they will not be discussed in this book. The easiest way toavoid such divergences in our calculations is to assign a fictitious small massto photons. Although, we are not going to indicate this explicitly, we willpretend that such a modification is done, so that the k = 0 singularities areremoved from the coefficient functions of interaction potentials, and thesefunctions become smooth in agreement with the Statement 7.7.24

Another important problem with interaction (7.89) is its unphys type.25

In section 8.3 we will demonstrate that this fact is responsible for the ne-cessity to introduce renormalization, which is equivalent to adding (infinite)

counterterms to the interaction (7.89). In this section we will discuss onlythe 2nd perturbation order for scattering operators, so the renormalizationwill not concern us here.

7.3.2 Drawing a diagram in the toy model

Our primary goal in this subsection is to introduce the diagram techniquewhich greatly facilitates perturbative calculations of scattering operators(6.99) and (6.101). Let us graphically represent each term in the interac-tion potential (7.89) as a vertex (see Fig. 7.1). Each particle operator in(7.89) is represented as an oriented line or arrow. The line corresponding toan annihilation operator enters the vertex, and the line corresponding to acreation operator leaves the vertex. Electron lines are shown by full arcs andphoton lines are shown by dashed arrows on the diagram. Each line is markedwith the momentum label of the corresponding particle operator. Free endsof the electron lines are attached to the vertical electron “order bar” on theleft hand side of the diagram. The order of these external lines (from bottomto top of the order bar) corresponds to the order of electron particle opera-tors in the potential (from right to left). An additional numerical factor isindicated in the upper left corner of the diagram.

The t-integral V 1(t) differs from V 1(t) only by the factor −E −1V 1 (see eq.

24 Even in the limit of zero photon mass the singularity at k = 0 is not as bad as itlooks. In the dressed particle representation of QED it becomes obvious that interactionpotentials between charges (e.g., the Coulomb potential in the 2nd perturbation order)are still separable in spite of having a long range (see subsection 9.3.3).

25Two potentials in V 1 satisfy condition 2. for unphys operators in subsection 7.2.4.

Page 291: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 291/687

7.3. A TOY MODEL THEORY 261

+1 +1

p+k

pp

kk kk

p−k

pp

(a) ((bb))

Figure 7.1: Diagram representation of the interaction operator V 1.

(7.71)), which is represented in the diagram by drawing a box that crossesall external lines. A line entering (leaving) the box contributes its energywith the positive (negative) sign to the energy function E V 1. The diagramrepresentation of the integral

V 1(t) =e

√ c

(2π )3/2

dpdk

|k|e− it

(−ωp−c|k|+ωp+k)

ωp + c|k| − ωp+ka†pc†

kap+k

+e √ c

(2π )3/2

dpdk |k|e− it (−ωp+c|k|+ωp−k)

ωp − c|k| − ωp−ka†pap−kck (7.90)

is shown in Fig. 7.2.The product of two potentials AB is represented by simply placing the

diagram B below the diagram A and attaching external electron lines of bothdiagrams to the same order bar. For example, the diagram for the productof the second term in (7.89) (Fig. 7.1(b)) and the first term in (7.90) (Fig.7.2(a))

V 1V 1 ∝ (a†pap−kck)(a†qc†k′aq+k′) + . . . (7.91)

is shown in Fig. 7.3(a).26 This product should be further converted to thenormal form, i.e., all incoming lines should be positioned below outgoing

26By convention, we will place free ends of photon external lines on the right hand side

Page 292: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 292/687

262 CHAPTER 7. THE FOCK SPACE

+1 +1

p+k

pp

kk kk

p−k

pp

(a) ((bb))

Figure 7.2: t-integral V 1(t).

lines. Due to anticommutation relations (7.84) and (7.86), each exchange of positions of electron particle operators (full lines on the diagram) changesthe total sign of the expression. On the diagram, the movement of particleoperators from right to left is represented by the movement of free ends of particle lines upward. Each permutation of annihilation and creation oper-ators (incoming and outgoing lines, respectively) of similar particles createsan additional expression and a new diagram in which the swapped lines are

joined together by an internal line . Using these rules we first move thephoton operators in (7.91) to the rightmost positions, move the operator a†q

to the leftmost position, and add another term due to the anticommutatorap−k, a†

q = δ (q − p + k).

V 1V 1 ∝ a†qa†

pap−kaq+k′ckc†k′ + δ (q − p + k)a†

paq+k′ckc†k′

= a†qa†

pap−kaq+k′ckc†k′ + a†

pap−k+k′ckc†k′ + . . . (7.92)

Expression (7.92) is represented by two diagrams 9.3(b) and 9.3(c). Inthe diagram 9.3(b) the electron line marked q has been moved to the top of

the electron order bar. In the diagram 9.3(c) the product δ (q − p + k) andof the diagram. The order of these free ends (from top to bottom of the diagram) willcorrespond to the order of photon particle operators in the expression (from left to right).For example, in Fig. 7.3(a) the incoming photon line is above the outgoing photon line,which corresponds to the order cc† of photon operators in (7.91).

Page 293: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 293/687

7.3. A TOY MODEL THEORY 263

+1

q+k’

kk

(a)pp

p−k

qqk’

+1

kk

p−k

qq

((bb))

k’

q+k’

pp

+1

kk

p−k+k’

((cc))

k’

pp

==

+1

kk

p−k

qq

((dd))

k’

q+k’

pp

+1

kk

qq

p−k

((ee))

pp

q+k

+1 ((ff)) +1

kk

pp

((gg))

p−k

pp

==

Figure 7.3: The normal product of operators in Fig. 7.1(b) and 7.2(a).

the integration by q are represented by joining or pairing the incoming elec-

tron line carrying momentum p − k with the outgoing electron line carryingmomentum q. This produces an internal electron line carrying momentump − k between two vertices.

In the expression (7.92), the electron operators are in the normal order,however, the photon operators are not. The next step is to change the orderof photon operators

V 1V 1 ∝ a†qa†

pap−kaq+k′c†k′ck + a†

qa†pap−kaq+k′δ (k′ − k)

+ a†pap−k+k′c

†k′ck + a†

pap−k+k′δ (k′ − k) + . . .

= a†qa†pap−kaq+k′c†k′ck + a†qa†pap−kaq+k+ a†

pap−k+k′c†k′ck + a†

pap + . . . (7.93)

The normal ordering of photon operators in 9.3(b) yields diagrams 9.3(d) and9.3(e) according to equation (7.85). Diagrams 9.3(f) and 9.3(g) are obtained

Page 294: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 294/687

264 CHAPTER 7. THE FOCK SPACE

from 9.3(c) in a similar way.

7.3.3 Reading a diagram in the toy model

Using diagrams, with some practice, one can perform calculations of scatter-ing operators (6.99) and (6.101) much easier than in the usual algebraic way.During these diagram manipulations we, actually, do not need to keep mo-mentum labels of lines. The algebraic expression of the result can be easilyrestored from an unlabeled diagram by following these steps:

(I) Assign a distinct momentum label to each external line, except one,whose momentum is obtained from the (momentum conservation) con-

dition that the sum of all incoming external momenta minus the sumof all outgoing external momenta is zero.

(II) Assign momentum labels to internal lines so that the momentum con-servation law is satisfied at each vertex: The sum of momenta of linesentering the vertex is equal to the sum of momenta of outgoing lines.If there are loops , one needs to introduce new independent loop mo-menta 27

(III) Read external lines from top to bottom of the diagram and write cor-responding particle operators from left to right. Do it first for electron

lines and then for photon lines.

(IV) For each box, write a factor (E i − E f )−1, where E f is the sum of

energies of particles going out of the box and E i is the sum of energiesof particles coming into the box.

(V) Write a factor e− iE Y t, where E Y is the energy function of the diagram

which is the sum of energies of all outgoing external lines minus thesum of energies of all incoming external lines.

(VI) For each vertex introduce a factor e√ c√

(2π)3|k| , where k is the momentum

of the photon line attached to the vertex.(VII) Integrate the obtained expression by all independent external momenta

and loop momenta.

27see diagram 7.3(g) in which k is the loop momentum.

Page 295: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 295/687

7.3. A TOY MODEL THEORY 265

7.3.4 Electron-electron scattering

Let us now try to extract some physical information from the above theory.We will calculate low order terms in the perturbation expansion (6.99) forthe Σ-operator

Σ1(t) = V 1(t) (7.94)

Σ2(t) = (V 1(t)V 1(t))unp + (V 1(t)V 1(t)) ph + (V 1(t)V 1(t))ren (7.95)

To obtain the corresponding contributions to the S -operator we need to taket-integrals

S = 1 + Σ1(t) + Σ2(t) + . . .

Note that the right hand side of (7.94) and the first term on the right handside of (7.95) are unphys, so, due to eq. (7.74), they do not contribute to theS -operator. For now, we also ignore the contribution of the renorm term in(7.95).28 Then we obtain in the 2nd perturbation order

S 2 = (V 1(t)V 1(t)) ph

+ . . . (7.96)

Operator V 1V 1 has several terms corresponding to different scattering pro-cesses. Some of them were calculated in subsection 7.3.2. For example, theterm of the type a†c†ac (see fig. 7.3(f)) annihilates an electron and a photonin the initial state and recreates them (with different momenta) in the finalstate. So, this term describes the electron-photon (Compton) scattering. Letus consider in more detail the electron-electron scattering term a†a†aa. Thecorresponding term in the integrand of (7.96) is described by the diagramin Fig. 7.3(e) and, according to the rules (I) - (VII) this diagram translatesinto expression

Fig.7.3(e) = 2e2c

(2π )3

dpdqdk

e− it (ωp−k+ωq+k−ωq−ωp)

|k|(c|k| + ωp−k − ωp)a†p−ka†

q+kapaq

28We will devote a full section 8.3 to the discussion of renorm terms in the Hamiltonianand scattering operators.

Page 296: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 296/687

266 CHAPTER 7. THE FOCK SPACE

The t-integral of this expression is

Fig.7.3(e) =

2πe2 3c

(2π )3

dpdqdk

δ (ωp−k + ωq+k − ωq − ωp)

|k|(c|k| + ωp−k − ωp)a†p−ka†

q+kapaq(7.97)

The delta function in (7.97) expresses the conservation of energy in the scat-tering process. We will also say that expression (7.97) is non-zero only onthe energy shell which is a solution of the equation

ωp−k + ωq+k = ωq + ωp

In the non-relativistic approximation ( p, q ≪ mc)

ωp =

p2c2 + m2c4

= mc2 +p2

2m+

p4

8m3c2+ . . . (7.98)

Then in the limit of small momentum transfer (k ≪ mc) the denominatorcan be approximated as

|k|(c|k| + ωp−k − ωp) ≈ |k|(c|k| + mc2 +(p − k)2

2m− mc2 − (p)2

2m)

= |k|(c|k| +−2p · k + k2

2m)

≈ c|k|2 (7.99)

Substituting this result in (7.96), we obtain the second order contribution tothe S -operator

S 2[a†a†aa] ≈ e2i

4π2

dpdqdk

δ (ωp−k + ωq+k − ωq − ωp)

|k|2 a†p−ka†

q+kaqap(7.100)

with the coefficient function

Page 297: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 297/687

7.4. DIAGRAMS IN GENERAL THEORY 267

D2(p, q, k) = e2i4π2 |k|2 (7.101)

In subsection 9.3.3 we will see that this coefficient function is characteristicfor scattering of two electrons interacting via usual Coulomb potential

e2

4π|r1 − r2| (7.102)

so, our toy model is quite realistic.

7.4 Diagrams in general theory

7.4.1 Properties of products and commutators

The diagrammatic approach developed for the toy model above can be easilyextended to interactions in the general form (7.56): Each potential V NM withN creation operators and M annihilation operators can be represented by avertex with N outgoing and M incoming lines. In calculations of scatteringoperators (6.99) and (6.101) we meet products of such potentials.29

Y = V (1)V (2) . . . V (V ) (7.103)

As discussed in subsection 7.2.2, we should bring this product to the normalorder. The normal ordering transforms (7.103) into a sum of terms y( j)

Y = j

y( j) (7.104)

each of which can be described by a diagram with

V vertices.

Each potential V (i) in the product (7.103) has N (i) creation operators,M (i) annihilation operators, and N (i) + M (i) integrals. Then each term y( j)

in the expansion (7.104) has

29V is the number of potentials in the product.

Page 298: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 298/687

268 CHAPTER 7. THE FOCK SPACE

N =

V i=1

(N (i) + M (i)) (7.105)

integrals and independent integration variables. This term also has V deltafunctions which express the conservation of the total momentum in eachof the factors V (i). Moreover, in the process of normal ordering, I pairs of external lines in the factors V (i) have to be joined together to make I internallines and to introduce new I delta-functions. So, the total number of deltafunctions in y( j) is

N δ = V + I (7.106)

and the number of external lines is

E = N − 2 I (7.107)

The terms y( j) in the normally ordered product (7.104) can be either discon-nected (see, for example, Fig. 7.3(d)) or connected . (in which case there is acontinuous sequence of internal lines connecting any two vertices).

Consider a product of two connected30 potentials V (1) and V (2)

V (1)V (2) =

jy( j) (7.108)

where the right hand side is written in the normally ordered form. Fromexample (7.91) it should be clear that in a general product (7.108) there isonly one disconnected term in the sum on the right hand side. Let us denotethis term y(0) ≡ (V (1)V (2))disc. This is the term in which the factors fromthe original product are simply rearranged and no pairings are introduced.31

All other terms y(1), y(2), . . . on the right hand side of (7.108) are connected,because they have at least one pairing which is represented on the diagramby an internal line connecting vertices V (1) and V (2).

Lemma 7.8 The disconnected part of a product of two connected bosonic

operators 32

does not depend on the order of the product 30As we will see below, original potentials V (i) are always connected in cluster-separable

interactions.31see, for example, the first terms on the right hand side of (7.93)32As discussed in subsection 7.2.3, all potentials considered in this book are bosonic.

Page 299: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 299/687

7.4. DIAGRAMS IN GENERAL THEORY 269

(V (1)V (2))disc = (V (2)V (1))disc (7.109)

Proof. Operators V (1)V (2) and V (2)V (1) differ only by the order of particleoperators. So, after all particle operators are brought to the normal orderin (V (1)V (2))disc and (V (2)V (1))disc, they may differ, at most, by a sign. Theorder of boson particle operators does not affect the sign, so it can be ig-nored. Let us now focus only on fermion particle operators in V (1) and V (2).For simplicity, we will assume that only electron and/or positron particleoperators are present in V (1) and V (2). The inclusion of the proton and an-

tiproton operators will not change anything in this proof, except its length.For the two factors V (i) (where i = 1, 2) let us denote N

(i)e the numbers of

electron creation operators, N (i) p the numbers of positron creation operators,

M (i)e the numbers of electron annihilation operators, and M

(i) p the numbers of

positron annihilation operators. Taking into account that V (i) are assumedto be normally ordered, we may formally write

V (1) ∝ [N (1)e ][N (1) p ][M (1)e ][M (1) p ]

V (2) ∝ [N (2)e ][N (2) p ][M (2)e ][M (2) p ]

where the bracket [N (1)e ] denotes the product of N (1)e electron creation op-

erators from the term V (1), the bracket [N (1) p ] denotes the product of N

(1) p

positron creation operators from the term V (1), etc. Then

V (1)V (2) ∝ [N (1)e ][N (1) p ][M (1)e ][M (1) p ][N (2)e ][N (2) p ][M (2)e ][M (2) p ] (7.110)

V (2)V (1) ∝ [N (2)e ][N (2) p ][M (2)e ][M (2) p ][N (1)e ][N (1) p ][M (1)e ][M (1) p ] (7.111)

Let us now bring the particle operators on the right hand side of (7.111) to the

same order as on the right hand side of (7.110). First we move N (1)e electron

creation operators to the leftmost position in the product. This involvesN (1)e M

(2)e permutations with electron annihilation operators from the factor

V (2) and N (1)e N

(2)e permutations with electron creation operators from the

factor V (2). Each of these permutations changes the sign of the disconnected

term, so the acquired factor is (−1)N (1)e (N

(2)e +M

(2)e ).

Page 300: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 300/687

270 CHAPTER 7. THE FOCK SPACE

Next we need to move the [N (1) p ] factor to the second position from the

left. The factor acquired after this move is (−1)N (1)p (N

(2)p +M

(2)p )

. Then we movethe factors [M

(1)e ] and [M

(1) p ] to the third and fourth places in the product,

respectively. Finally, the total factor acquired by the expression (V (2)V (1))discafter all its terms are rearranged in the same order as in ( V (1)V (2))disc is

f = (−1)K (1)e K

(2)e +K

(1)p K

(2)p (7.112)

where we denoted

K (i)e ≡

N (i)e

+ M (i)e

K (i) p ≡ N (i) p + M (i) p

the total (= creation + annihilation) numbers of electron and positron oper-ators, respectively, in the factor V (i). Then it is easy to prove that the powerof (-1) in (7.112) is even, so that f = 1. Indeed, consider the case when K

(1)e

is even and K (2)e is odd. Then the product K

(1)e K

(2)e is odd. From the bosonic

character of V (1) and V (2) it follows that K (1)e + K

(1) p and K

(2)e + K

(2) p are

even numbers. Therefore K (1) p is odd and K

(2) p is even, so that the product

K (1) p K

(2) p is odd and the total power of (-1) in (7.112) is even.

The same result is obtained for any other assumption about the even/odd

character of K (1)e and K

(2)e . This proves (7.109).

Theorem 7.9 A multiple commutator of connected bosonic potentials is con-nected.

Proof. Let us first consider a single commutator of connected potentialsV (1) and V (2).

V (1)V (2) − V (2)V (1) (7.113)

According to Lemma 7.8, the disconnected terms (V (1)V (2))disc and (V (2)V (1))discin the commutator (7.113) are canceled and all remaining terms are con-nected. This proves the theorem for a single commutator (7.113). Since this

Page 301: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 301/687

7.4. DIAGRAMS IN GENERAL THEORY 271

commutator is also bosonic, by repeating the above arguments, we conclude

that all multiple commutators of bosonic operators are connected.In a connected diagram the number of independent loops is

L = I − V + 1 (7.114)

This formula can be proven by the following arguments: If there are V ver-tices, they can be connected together without making loops by V −1 internallines. Each additional internal line will make one independent loop. There-fore, the total number of independent loops is I − (V − 1).

An example of a connected diagram is shown in Fig. 7.4. This diagramhas

V = 4 vertices,

E = 5 external lines,

I = 7 internal lines, and

L= 4

independent loops. There are nine integrals and nine independent integra-tion variables. Five momentum integrals correspond to the external lines(incoming p1, p2 and outgoing p3, p4, and p5) and are a part of the generalexpression for the potential (7.57). Four integrals are performed by loop mo-menta p6, p7, p8, and p9. These integrals can be absorbed in the definitionof the coefficient function

D3,2(p′3, p′

4, p′5; p1, p2) =

dp6dp7dp8dp9DA(p6, p7, p8, p1 + p2 − p6 − p7 − p8; p1, p2) ×DB(p9, p1 + p2 − p7 − p8 − p9; p6, p1 + p2 − p6 − p7 − p8) ×DC (p′

5, p1 + p2 − p5 − p8 − p9; p7, p1 + p2 − p7 − p8 − p9) ×DD(p′

3, p′4; p8, p9, p1 + p2 − p5 − p8 − p9)

1

E A(E C + E D)(7.115)

where E A, E C , and E D are energy functions of the corresponding vertices(“energy function”= the sum of energies of particles leaving the vertex minusthe sum of energies of particles entering the vertex).

7.4.2 Cluster separability of the S -operator

In agreement with Postulate 6.3 (cluster separability) and Statement 7.7(cluster separability of smooth potentials), we assume that coefficient func-tions of interaction potentials V (i) in the product (7.103) are smooth. Whatabout the product (7.103) itself? What are the conditions under which such

Page 302: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 302/687

272 CHAPTER 7. THE FOCK SPACE

pp11+p

22−p

55−p

88−p

99

pp11+p

22−p

77−p

88−p

99

pp11+p

22−p

66−p

77−p

88

pp22

pp11

pp66

pp99pp77

pp88

pp33

pp44

pp55

DD

CC

BB

A A

Figure 7.4: A diagram representing one term in the 4-order product of ahypothetical theory. Here we do not draw the order bars as we are notinterested in the relative order of creation and annihilation operators whichaffects only the total sign. However, we draw all outgoing lines on the topof the diagram, and all incoming lines at the bottom to indicate that thediagram is normally ordered. Note that all internal lines are oriented upwardsbecause all paired operators (i.e., those operators whose order should bechanged by the normal ordering procedure) in the product (7.103) always

occur in the order αα†.

Page 303: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 303/687

7.4. DIAGRAMS IN GENERAL THEORY 273

a product is a sum of smooth operators? This question is important, because

from physical considerations we want the S -operator to have the property of cluster separability, i.e., the scattering of two spatially separated groups of particles should proceed as if these groups were alone.

However, the question about the smoothness of a product of smooth po-tentials is not trivial because bringing the product (7.103) to the normalorder involves pairings, and pairings produce singular delta functions. Forthe term y( j) to be of the smooth form (7.57), we need to make sure thatthere are enough integrals to integrate out all these delta functions exceptone delta function required by the conservation of momentum, as in (7.57),and that the number of remaining integrals is equal to the number of externallines (7.107) plus the number of independent loops

L.

The following theorem establishes an important connection between thesmoothness of terms on the right hand side of (7.104) and the connectivityof diagrams on the right hand side of this expression.

Theorem 7.10 Each term y( j) in the expansion ( 7.104) of the product of smooth potentials is smooth if and only if it is represented by a connected diagram.

Proof. We will establish the smoothness of the term y( j) by proving that itcan be represented in the general form (7.57) in which the integrand contains

only one delta function required by the momentum conservation condition,and the coefficient function DNM is smooth. From eq. (7.105), the originalnumber of integrals in y( j) is N . Integrals corresponding to E external linesare parts of the general form (7.57), and integrals corresponding to L loopsare absorbed into the definition of the coefficient function of y( j). The numberof remaining integrals is then

N ′ = N − E − L= I + V − 1 (7.116)

Then we have just enough integrals to cancel all momentum delta functions(7.106) except one, which proves that the term y(i) is smooth.

Inversely, suppose that the term y( j) is represented by a disconnecteddiagram with V vertices and I internal lines. Then the number of indepen-dent loops L is greater than the value I −V + 1 characteristic for connected

Page 304: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 304/687

274 CHAPTER 7. THE FOCK SPACE

diagrams. Then the number of integrations N ′ in eq. (7.116) is less than

I + V − 1, and the number of delta functions remaining in the integrand N ′ − N δ is greater than 1. This means that the term y( j) is represented byexpression (7.57) whose coefficient function is singular, therefore the corre-sponding operator is not smooth. This contradicts our original assumptionand proves that the diagram y( j) is connected.

Theorem 7.10 establishes that smooth operators are represented by con-nected diagrams, and vice versa . In what follows, we will use the termssmooth and connected as synonyms, when applied to operators.

Putting together Theorems 7.9 and 7.10 we immediately obtain the fol-lowing important

Theorem 7.11 All terms in a normally ordered multiple commutator of smooth bosonic potentials are smooth.

This theorem allow us to apply the property of cluster separability to theS -operator. Let us write the S -operator in the form

S = eF (7.117)

where F is a series of multiple commutators (6.101) of smooth potentials in V .

According to Theorem 7.11, operators F and F are also smooth.33 Then,F is cluster separable and if all particles are divided into two spatially

separated groups 1 and 2, the argument of the exponent in (7.117) takes theform of a sum

F → F (1) + F (2) where F (1)

acts only on variables in the group 1 and F (2)

acts only on vari-

ables in the group 2. So, these two operators commute with each other.

Then, the S -operator separates into the product of two independent factors33The singularities present due to energy denominators resulting from t-integrals in

(6.101) are made harmless by employing the “adiabatic switching” trick from subsection7.2.6. This trick essentially results in adding small imaginary contributions to each de-nominator, which removes the singularity.

Page 305: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 305/687

7.4. DIAGRAMS IN GENERAL THEORY 275

S → e(F (1) +F (2) )

= eF (1) eF (2)

This relationship expresses the cluster separability of the S -operator and theS -matrix: The total scattering amplitude for spatially separated events isgiven by the product of individual amplitudes.

7.4.3 Divergence of loop integrals

In the preceding subsection we showed that terms described by connected di-agrams are smooth. However, such terms involve loop integrals, and generallythere is no guarantee that these integrals converge. This problem is evidentin our toy model: the loop integral by k in diagram 7.3(g) is divergent.34

(V 1(t)V 1(t))ren = − e2 2c

(2π )3

dpdk

a†pap

(ωp−k − ωp + ck)k(7.118)

Substituting this result to the right hand side of (7.95) we see that the S -operator in the second order Σ2(t)

is infinite, which makes it meaningless

and unacceptable.Actually, the appearance of infinities in products and commutators of po-tentials in formulas for the S -operator (6.99) and (6.101) is a commonplace inquantum field theories. So, we need to better understand this phenomenon.In this subsection, we will discuss the convergence of general loop integralsand we will formulate a sufficient condition under which loop integrals areconvergent. We will find these results useful in our discussion of the renor-malization of QED in section 8.3 and in the construction of a divergence-freetheory in section 9.2.

34 For future reference we may note that 3D integral of a function f (k)

dkf (k)

diverges at large k if the integrand f (k) tends to zero as k−3 or slower. The integrand in(7.118) satisfies this criterion: its asymptotic behavior is ∝ k−2.

Page 306: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 306/687

276 CHAPTER 7. THE FOCK SPACE

Let us consider, for example, the diagram in Fig. 7.4. There are three

different reasons why loop integrals may diverge there.

(I) The coefficient functions DA, DB, . . . in (7.115) may contain singular-ities.

(II) There can be also singularities due to zeroes in energy denominatorsE A and E C + E D.

(III) The coefficient functions DA, DB, . . . may not decay fast enough forlarge values of loop momenta, so that the integrals may be divergentdue to the infinite integration range.

As discussed in subsection 7.3.1, the singularities in the coefficient functionsare avoided if we introduce a fictitious small photon mass. The energy de-nominators also may be assumed finite if we use the adiabatic switchingprescription from subsection 7.2.6. The divergence of loop integrals at largeintegration momentum or ultraviolet divergence (point (III) above) is a moreserious problem, which we are going to discuss here. We would like to showthat the divergence of loop integrals is closely related to the behavior of co-efficient functions of potentials far from the energy shell. In particular, wewould like to prove the following

Theorem 7.12 If coefficient functions of potentials decay sufficiently rapidly (e.g., exponentially) when arguments move away from the energy shell, then all loop integrals converge.

Idea of the proof. Eq. (7.115) is an integral in a 12-dimensional space of 4 loop momenta p6, p7, p8, and p9. Let us denote this space Σ. Consider forexample the dependence of the integrand in (7.115) on the loop momentump9 as p9 → ∞ and all other momenta fixed. Note that we have chosen theintegration variables in Fig. 7.4 in such a way that each loop momentum

is present only in the internal lines forming the corresponding loop, e.g.,momentum p9 is confined to the loop BDCB, and the energy function E A of the vertex A does not depend on p9. Such a selection of integration variablescan be done for any arbitrary diagram. Taking into account that at largevalues of momentum ωp ≈ c|p|, we obtain in the limit p9 → ∞

Page 307: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 307/687

7.4. DIAGRAMS IN GENERAL THEORY 277

E A → const,

E B = ωp1+p2−p7−p8−p9 + ωp9 − ωp6 − ωp1+p2−p6−p7−p8≈ 2c|p9| → ∞,

E C = ωp1+p2−p5−p8−p9 + ωp5 − ωp7 − ωp1+p2−p7−p8−p9→ const,

E D = ωp3 + ωp4 − ωp8 − ωp9 − ωp1+p2−p5−p8−p9≈ −2c|p9| → ∞.

So, in this limit, according to the condition of the theorem, the coefficientfunctions at vertices B and D tend to zero rapidly, e.g., exponentially. Inorder to prove the convergence of the loop integrals, we need to make surethat the same rapid decay is characteristic for all directions in the space Σ.

The above analysis is applicable to all loop variables: Any loop has abottom vertex (vertex B in our example), a top vertex (vertex D in ourexample), and possibly a number of intermediate vertices (vertex C in ourexample). As the loop momentum goes to infinity, the energy functions of the top and bottom vertices tend to infinity, i.e., move away from the energyshell, which ensures a fast decay of the corresponding coefficient function.

Now we can take an arbitrary direction to infinity in the space Σ. Alongthis direction, there is at least one loop momentum which goes to infinity.Then there is at least one energy function (E A, E B, E C , or E D) which growslinearly, while others stay constant (in the worst case). Therefore, accordingto the condition of the theorem, the integrand decreases rapidly (e.g., expo-nentially) along this direction. These arguments are valid for all directions inthe space Σ. Therefore the integrand rapidly tends to zero in all directions,and integral (7.115) converges.

In chapter 8 we will see that in realistic theories, like QED, the asymp-totic decay of the coefficient functions of potentials at large momenta is notfast enough, so Theorem 7.12 is not applicable, and loop integrals usuallydiverge. A detailed discussion of divergences in quantum field theory andtheir elimination will be presented in sections 8.3 and 9.2.

Page 308: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 308/687

278 CHAPTER 7. THE FOCK SPACE

7.5 Particle decays

The Fock space formulation of quantum theory presented in section 7.2 hasone important advantage in comparison with the simple few-particle ap-proach from chapter 6. The Fock space gives us an opportunity to describenot just interparticle interactions, but also the processes of creation and ab-sorption of particles. The simplest example of such processes is the decay of an unstable particle considered in this section. Here we will pursue two goals.The first goal is to present a preliminary material for our discussion of decaysof moving particles in section 10.5. The second goal is to derive a beautifulresult, due to Breit and Wigner, which tells that the time dependence of particle decays is (almost) always exponential.

7.5.1 Quantum mechanics of particle decays

The decay of unstable particles is described mathematically by the non-decay probability which has the following definition. Suppose that we have a pieceof radioactive material with N unstable nuclei prepared simultaneously attime t = 0 and denote N u(t) the number of nuclei that remain undecayed attime t > 0. So, at each time point the piece of radioactive material can becharacterized by the ratio N u(t)/N .

In this paper, in the spirit of quantum mechanics, we will treat N unstableparticles as an ensemble of identically prepared systems and consider the

ratio N u(t)/N as a property of a single particle (nucleus) – the probability of finding this particle in the undecayed state. Then the non-decay probabilityω(t) (also called the non-decay law in this book) is defined as a large N limit

ω(t) = limN →∞

N u(t)/N (7.119)

Let us now turn to the description of an isolated unstable system fromthe point of view of quantum theory. We will consider a model theory withparticles a, b, and c, so that particle a is massive and unstable, while itsdecay products b and c are stable and their masses satisfy the inequality

ma > mb + mc (7.120)

which makes the decay a → b + c energetically possible. In order to simplifycalculations and avoid being distracted by issues that are not relevant to the

Page 309: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 309/687

7.5. PARTICLE DECAYS 279

problem at hand we assume that particle a is spinless and has only one decay

mode. For our discussion, the nature of this particle is not important. Forexample, this could be a muon or a radioactive nucleus or an atom in anexcited state.

Observations performed on the unstable system may result in only twooutcomes. One can find either a non-decayed particle a or its decay productsb + c. Thus it is appropriate to describe the states of the unstable particle in

just two sectors of the Fock space

H = Ha ⊕ Hbc (7.121)

where Ha is the subspace of states of the unstable particle a, and Hbc ≡Hb ⊗ Hc is the orthogonal subspace of the decay products. In principle, afull description of systems involving these three types of particles must beformulated in the Fock space where particle numbers N a, N b, and N c areallowed to take any values from zero to infinity. However, for most unstableparticles the interaction between the decay products b and c in the final statecan be ignored, and considering the subspace (7.121) of the full Fock spaceis a reasonable approximation.

The simplest interaction responsible for the decay can be written as35

V = dpdr(g(p, r)a†p+rbpcr + g∗(p, r)b†

pc†rap+r) (7.122)

As expected, the interaction operator (7.122) leaves invariant the sector H =Ha ⊕ Hbc of the total Fock space.

We can now introduce a Hermitian operator T that corresponds to theobservable “particle a exists”. The operator T can be fully defined by itseigensubspaces and eigenvalues. When a measurement performed on theunstable system finds it in a state corresponding to the particle a, then thevalue of T is 1. When the decay products b + c are observed, the value of T is 0. Apparently, T is the projection operator on the subspace

Ha. For

35Note that in order to have a Hermitian Hamiltonian we need to include in the inter-action both the term b†c†a responsible for the decay and the term a†bc responsible for theinverse process b + c → a. Due to the relation (7.120), these two terms have non-emptyenergy shells, so, according to our classification in subsection 7.2.4, they belong to the“decay” type.

Page 310: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 310/687

280 CHAPTER 7. THE FOCK SPACE

each normalized state vector |Ψ ∈ H the probability of finding the unstable

particle a is given by the expectation value of this projection

ω = Ψ|T |Ψ (7.123)

Alternatively, one can say that ω is a square of the norm of the projectionT |Ψ

ω = Ψ|T T |Ψ= T |Ψ2 (7.124)

where we used property T 2 = T from Theorem G.1. Any vector

|Ψ ∈ Ha

describes a state in which the unstable particle a is found with 100% certainty.

ω(0) = Ψ|T |Ψ= 1 (7.125)

We will assume that the unstable system was prepared in the state |Ψ attime t = 0. Then the time evolution of this state is given by

|Ψ(t) = eiHt |Ψ (7.126)

and the non-decay law is

ω(t) = Ψ|e− iHtT e

iHt |Ψ (7.127)

From this equation it is clear that the Hamiltonian H describing the unstablesystem should not commute with the projection operator T 36

[H, T ] = 0 (7.128)

Page 311: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 311/687

7.5. PARTICLE DECAYS 281

HHaa

HHbc

t=0

t>0ω(tt))

11

Figure 7.5: Time evolution of the state vector of an unstable system.

Now we can construct a schematic visual representation of the decayprocess in the Hilbert space. In fig. 7.5 we showed the full Hilbert spaceHa ⊕ Hbc as a sum of two orthogonal subspaces Ha and Hbc. We assumed

that the initial normalized state vector |Ψ at time t = 0 lied entirely inthe subspace Ha. So that the non-decay probability ω(0) is equal to oneas in eq. (7.125). From eq. (7.128) we know that the subspace Ha is notinvariant with respect to time translations. Therefore, at time t > 0 thevector |Ψ(t) ≡ e

iHt |Ψ develops a component lying in the subspace of decay

products Hbc. The decay is described as a gradual time evolution of the statevector from the subspace of unstable particle Ha to the subspace of decayproducts Hbc. Then the non-decay probability ω(t) decreases with time.

Before calculating the non-decay law (7.127) we will need to do somepreparatory work first. In subsections 7.5.2 and 7.5.4 we are going to con-struct two useful bases. One is the basis

|p

of eigenvectors of the momentum

operator P0 in Ha. Another is the basis |p, m of common eigenvectors of P0 and the interacting mass M in H.

36Otherwise, the subspace Ha of states of the particle a would be invariant with respectto time translations and the particle a would be stable.

Page 312: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 312/687

282 CHAPTER 7. THE FOCK SPACE

7.5.2 Non-interacting representation of the Poincare

groupLet us first consider a simple case when the interaction responsible for thedecay is “turned off”. This means that dynamics of the system is determinedby the non-interacting representation of the Poincare group U 0g in H. Thisrepresentation is constructed in accordance with the structure of the Hilbertspace (7.121) as37

U 0g ≡ U ag ⊕ U bg ⊗ U cg (7.129)

where U ag , U bg , and U cg are unitary irreducible representations of the Poincare

group corresponding to particles a, b, and c, respectively. The generators of the representation (7.129) are denoted by P0, J0, H 0, and K0. According to(7.120), the operator of non-interacting mass

M 0 = +1

c2

H 20 − P 20 c2

has a continuous spectrum in the interval [mb + mc, ∞) and a discrete pointma embedded in this interval.

From definition (7.129) it is clear that the subspaces Ha and Hbc areinvariant with respect to U 0g . Moreover, the projection operator T commutes

with non-interacting generators

[T, P0] = [T, J0] = [T, K0] = [T, H 0] = 0 (7.130)

This implies that particle a is stable with respect to all inertial transforma-tions, as expected.

Exactly as we did in subsection 5.1.2, we can use the non-interacting rep-resentation U 0g to build a basis |p of eigenvectors of the momentum operatorP0 in the subspace Ha. Then any state |Ψ ∈ Ha can be represented by alinear combination of these basis vectors

|Ψ =

dpψ(p)|p (7.131)

37compare with eq. (7.11)

Page 313: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 313/687

7.5. PARTICLE DECAYS 283

and the projection operator T can be written as

T =

dp|pp| (7.132)

7.5.3 Normalized eigenvectors of momentum

Basis vectors |p are convenient for writing arbitrary states |Ψ ∈ Ha aslinear combinations (7.131). However vectors |p themselves are not goodrepresentatives of quantum states, because they are not normalized. Forexample, the momentum space “wave function” of the basis vector |q is adelta function

ψ(p) = p|q= δ (p − q) (7.133)

and the corresponding “probability” of finding the particle is infinite

dp|ψ(p)|2 =

dp|δ (p − q)|2

= ∞Therefore, states

|q

cannot be used in formula (7.123) to calculate the non-

decay probability. However, we would like to have a method to calculatethe non-decay law for states with definite (or almost definite) momentump0. In order to do that we should use state vectors38 that have normalizedmomentum-space wave functions sharply localized near p0. In order to satisfythe normalization condition

dp|ψ(p)|2 = 1

wave functions of |p0) may be formally represented as a square root of theDirac’s delta function39

38which we denote by |p0) to distinguish them from |p039Another way to achieve the same goal would be to keep the delta-function represen-

tation (7.133) of definite-momentum states, but use (formally vanishing) normalizationfactors, like N = (

dp|ψ(p)|2)−1/2. Perhaps such manipulations with infinitely large and

infinitely small numbers can be justified within non-standard analysis [80].

Page 314: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 314/687

284 CHAPTER 7. THE FOCK SPACE

ψ(p) = δ (p − p0) (7.134)

According to eq. (5.26), the exponent of the Newton-Wigner position

operator ei(R0)zb acts as a translation operator in the momentum space. In

particular, we can apply this operator to the wave function (7.134) and obtain

ei(R0)zmac sinh θ

δ (p) =

δ (p − p0)

where p0 = (0, 0, mac sinh θ). On the other hand, applying the boost trans-formation (5.22) to (7.134) we obtain

e− ic(K 0)zθ

δ (p) =

ΩL−1p

Ωp

δ (L−1p)

=

ΩL−1p

Ωp

1

|J |δ (p)

=

δ (p − p0)

where

Ωp =

m2ac4 + p2c2 (7.135)

and |J | =ΩL−1pΩp

is the Jacobian of transformation p → L−1p. This suggests

that momentum eigenvectors |p have a useful representation

|p = eiR0·p|0 (7.136)

7.5.4 Interacting representation of the Poincare group

In order to study dynamics of the unstable system we need to define aninteracting unitary representation of the Poincare group in the Hilbert spaceH. This representation will allow us to relate results of measurements indifferent reference frames. In this section we will take the point of view of

Page 315: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 315/687

7.5. PARTICLE DECAYS 285

the observer at rest. We will discuss particle decay from the point of view of

a moving observer in section 10.5.Let us now “turn on” the interaction responsible for the decay and discussthe interacting representation U g of the Poincare group in H with generatorsP, J, K, and H . As mentioned before, we prefer to work in the Dirac’sinstant form of dynamics. Then the generators of space translations androtations are interaction-free,

P = P0

J = J0

while generators of time translations (the Hamiltonian H ) and boosts containinteraction-dependent terms40

H = H 0 + V

K = K0 + Z

We will further assume that the interacting representation U g belongs to theBakamjian-Thomas form of dynamics in which the interacting operator of mass M commutes with the Newton-Wigner position operator (6.58)41

M ≡ c−2

H 2 − P20c2 (7.137)

[R0, M ] = 0 (7.138)

Our next goal is to define the basis of common eigenvectors of commutingoperators P0 and M in H.42 These eigenvectors must satisfy conditions

40see eq. (7.122)41The possibilities for the interaction to be not in the Bakamjian-Thomas instant form

are discussed in subsection 10.5.7.42In addition to these two operators, whose eigenvalues are used for labeling eigenvectors

|p, m, there are other independent operators in the mutually commuting set containing P0and M . These are, for example, the operators of the square of the total angular momentumJ20 and the projection of the total angular momentum on the z-axis (J 0)z. Thereforea unique characterization of any basis vector requires specification of all correspondingquantum numbers as |p, m , j2, jz, . . .. However these quantum numbers are not relevantfor our discussion and we omit them.

Page 316: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 316/687

286 CHAPTER 7. THE FOCK SPACE

P0|p, m = p|p, m (7.139)

M |p, m = m|p, m (7.140)

They are also eigenvectors of the interacting Hamiltonian H =

M 2c4 + P20c2

H |p, m = ωp|p, m

where ωp ≡

m2c4 + c2p2.43 In the zero-momentum eigensubspace of the

momentum operator P0 we can introduce a basis

|0, m

of eigenvectors of

the interacting mass M

P0|0, m = 0

M |0, m = m|0, m

Then the basis |p, m in the entire Hilbert space H can be built by formula44

|p, m =1

√ ωp

e− icK θ|0, m

where p = mc θθ−1 sinh θ. These eigenvectors are normalized to delta func-tions

q, m|p, m′ = δ (q − p)δ (m − m′) (7.141)

The actions of inertial transformations on these states are found by the samemethod as in section 5.1. In particular, for boosts along the x-axis and timetranslations we obtain45

43Note the difference between ωp that depends on the eigenvalue m of the interactingmass operator and Ωp in eq. (7.135) that depends on the fixed value of mass ma of theparticle a.

44compare with (5.4) and (5.19)45compare with eq. (5.20)

Page 317: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 317/687

7.5. PARTICLE DECAYS 287

e− icK xθ|p, m =

ωΛpωp

|Λp, m (7.142)

eiHt |p, m = e

iωpt|p, m (7.143)

where

Λp = ( px cosh θ +ωp

csinh θ, py, pz) (7.144)

Next we notice that due to eqs. (4.24) and (7.138) vectors eiR0·p|0, m also

satisfy eigenvector equations (7.139) - (7.140), so they must be proportionalto the basis vectors |p, m

|p, m = γ (p, m)eiR0·p|0, m (7.145)

where γ (p, m) is an unimodular factor. Unlike in (7.136), we cannot concludethat γ (p, m) = 1. However, if the interaction is not pathological we canassume that the factor γ (p, m) is smooth, i.e., without rapid oscillations.46

Obviously, vector |0 from the basis (7.136) can be expressed as a linearcombination of zero-momentum basis vectors |0, m, so we can write47

|0 =

∞ mb+mc

dmµ(m)|0, m (7.146)

where

µ(m) = 0, m|0 (7.147)

is yet unknown function which depends on the choice of the interactionHamiltonian V . The physical meaning of µ(m) is the amplitude of finding

46

This property will be used in derivation of eq. (10.112).47We will assume that interaction responsible for the decay does not change the spectrumof mass. In particular, we will neglect the possibility of existence of bound states of particles b and c, i.e., discrete eigenvalues of M below mb + mc. Then the spectrum of M (similar to the spectrum of M 0) is continuous in the interval [mb + mc, ∞), and integrationin (7.146) should be performed from mb + mc to infinity.

Page 318: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 318/687

288 CHAPTER 7. THE FOCK SPACE

the interacting mass m in the unstable state |0. We now use eqs. (7.136)

and (7.146) to expand vectors |p in the basis |p, m

|p = eiR0p|0 (7.148)

= eiR0p

∞ mb+mc

dmµ(m)|0, m

=

∞ mb+mc

dmµ(m)γ (p, m)|p, m (7.149)

Then any state vector from the subspace Ha can be written as

|Ψ =

dpψ(p)|p (7.150)

=

dp

∞ mb+mc

dmµ(m)γ (p, m)ψ(p)|p, m (7.151)

From (7.141) we also obtain a useful formula

q|p, m =

∞ mb+mc

dm′µ∗(m′)γ ∗(q, m′)q, m′|p, m

= γ ∗(p, m)µ∗(m)δ (q − p) (7.152)

7.5.5 The non-decay law

Let us find the time evolution of the state vector (7.150) prepared withinsubspace

Ha at time t = 0. The time dependence is obtained by applying

eqs (7.126), (7.143), and (7.149)

|Ψ(t) =

dpψ(p)e

iHt |p

Page 319: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 319/687

7.5. PARTICLE DECAYS 289

= dpψ(p)

mb+mc

dmµ(m)γ (p, m)eiHt

|p, m

=

dpψ(p)

∞ mb+mc

dmµ(m)γ (p, m)eiωpt|p, m

The inner product of this vector with |q is found by using (7.152)

q|Ψ(t)

= dpψ(p)

mb+mc

dmµ(m)γ (p, m)eiωpt

q

|p, m

=

dpψ(p)

∞ mb+mc

dm|µ(m)|2γ (p, m)γ ∗(p, m)eiωptδ (q − p)

= ψ(q)

∞ mb+mc

dm|µ(m)|2e iωqt

The non-decay law is then found by substituting (7.132) in eq. (7.127) andusing (7.153)

ω(t) =

dqΨ(t)|qq|Ψ(t)

=

dq|q|Ψ(t) |2

=

dq|ψ(q)|2

mb+mc

dm|µ(m)|2e iωqt

2

(7.153)

This formula is valid for the decay law of any state |Ψ ∈ Ha. In the particularcase of the normalized state

|0) whose wave function ψ(q) is well-localized

in the momentum space near zero momentum, we can set approximately

ψ(q) ≈

δ (q)

|ψ(q)|2 ≈ δ (q)

Page 320: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 320/687

290 CHAPTER 7. THE FOCK SPACE

and48

ω|0)(t) ≈

∞ mb+mc

dm|µ(m)|2e imc2t

2

(7.154)

This result demonstrates that the decay law is fully determined by the func-tion |µ(m)|2 which is referred to as the mass distribution of the unstableparticle. In the next two subsections we will consider an exactly solvabledecay model for which the mass distribution and the non-decay law can beexplicitly calculated.

7.5.6 Solution of the eigenvalue problem

In this section we are discussing decay of a particle at rest. Therefore, itis sufficient to consider the subspace H0 ⊆ H of states having zero totalmomentum. The subspace H0 can be further decomposed into the directsum

H0 = Ha0 ⊕ H(bc)0

where

Ha0 = H0 ∩ Ha

H(bc)0 = H0 ∩ (Hb ⊗ Hc)

Ha0 is, of course, the one-dimensional subspace spanning the vector |0 of the particle a, and the subspace H(bc)0 of decay products can be furnishedwith the basis set |π of eigenvectors of the relative momentum of particlesb and c.49 Each state in H0 can be written as a decomposition in the abovebasis

|Ψ = µ|0 +

dπζ (π)|π48compare, for example, with eq. (3.8) in [81]49see subsection 6.3.2

Page 321: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 321/687

7.5. PARTICLE DECAYS 291

The coefficients of this decomposition can be represented as a column vector

|Ψ =

µ

ζ (π)

(7.155)

that is composed of a complex number µ and a complex function ζ (π). Thevector |Ψ should be normalized, hence its wave function should satisfy thenormalization condition

|µ|2 +

dπ|ζ (π)|2 = 1 (7.156)

The probability of finding the unstable particle in the state |Ψ is

ω = |µ|2

and in the initial state

|0 =

10

the unstable particle is found with 100% probability.We can now find representations of various operators in the basis |0, |π.

The non-interacting mass is diagonal

M 0 =

ma 00 ηπ

where

ηπ =

1

c2 ( m2

bc4

+ c2

π2

+ m2cc4

+ c2

π2

) (7.157)

is the mass of the two-particle (b + c) system expressed as a function of therelative momentum (see, for example, eq. (3.3) in [66]). The matrix form of the interaction operator (7.122) is

Page 322: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 322/687

292 CHAPTER 7. THE FOCK SPACE

V = 0 dqg(q) . . .

g∗(π) 0

Then the action of the full Hamiltonian H = H 0 + V on vectors (7.155) is

H

µ

ζ (π)

=

mac2µ +

dqg(q)ζ (q)

g∗(π)µ + ηπc2ζ (π)

The next step is to find eigenvalues (mc2) and eigenvectors

|0, m ≡ µ(m)ζ m(π) (7.158)

of the Hamiltonian H .50 This task is equivalent to the solution of the follow-ing system of linear equations:

mac2µ(m) +

dqg(q)ζ m(q) = mc2µ(m) (7.159)

g∗(π)µ(m) + η πc2ζ m(π) = mc2ζ m(π) (7.160)

From Eq. (7.160) we obtain

ζ m(π) =g∗(π)µ(m)

mc2 − ηπc2(7.161)

Substituting this result to eq. (7.159) we obtain equation which determinesthe spectrum of eigenvalues m

m − ma =1

c2

dq

|g(q)|2m − ηq

(7.162)

To comply with the law of conservation of the angular momentum, the func-tion g(q) should depend only on the absolute value q ≡ |q| of its argument.Therefore, we can rewrite eq. (7.162) in the form

50Note that the function µ(m) in (7.158) is the same as in (7.147). So, in order tocalculate the non-decay law (7.154), all we need to know is |µ(m)|2.

Page 323: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 323/687

7.5. PARTICLE DECAYS 293

m − ma = F (m) (7.163)

where

F (m) =

∞ 0

dq G(q )

m − ηq(7.164)

and

G(q ) =4πq 2

c2|g(q )|2

From the normalization condition (7.156)

|µ(m)|2 +

dq|ζ m(q)|2 = 1

and eq. (7.161) we obtain

|µ(m)|2

1 +

dq

G(q )

(m − ηq)2

= 1

and

|µ(m)|2 =1

1 − F ′(m)(7.165)

where F ′(m) is the derivative of F (m). So, in order to calculate the decaylaw (7.153) we just need to know the derivative of F (m) at points m of thespectrum of the interacting mass operator. The following subsection willdetail such a calculation.

Page 324: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 324/687

294 CHAPTER 7. THE FOCK SPACE

mmaa

mm

F(mm))

mmbb+m

cc

MM00

Figure 7.6: The graphical solution of eq. (7.163). The dashed line indicatesthe continuous part of the spectrum of the non-interacting mass operatorM 0. The thick full line shows function F (m) at m < mb + mc.

7.5.7 The Breit-Wigner formula

The function ηq in (7.157) has minimum value η0 = mb + mc at q = 0 andgrows to infinity with increasing q . Then the solution of eq. (7.163) for thevalues of m ∈ [−∞, mb + mc] is rather straightforward (see Fig. 7.6). In thisregion the denominator in the integrand of (7.164) does not vanish, and F (m)is a well-defined continuous function which tends to zero at m = −∞ anddeclines monotonically as m grows. A graphical solution of eq. (7.163) can beobtained in the interval [−∞, mb + mc] as an intersection of the line m − ma

and the function F (m) (point M 0 in Fig. 7.6). The corresponding value of m is an eigenvalue of the interacting mass operator, and the correspondingeigenstate is a superposition of the unstable particle a and its decay productsb + c.

Finding the spectrum of the interacting mass in the region [mb + mc, ∞]is more tricky due to a singularity in the integrand of (7.164). Let us firstdiscuss our approach qualitatively, using graphical representation in Fig. 7.7.We will do this by first assuming that the momentum spectrum of the prob-

Page 325: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 325/687

7.5. PARTICLE DECAYS 295

mm11

mm22 mm

33

mm44

mmaa

mm55

mm66

MM00

MM11

MM22

MM33 MM

44 MM55

mm

F(mm))

Figure 7.7: Spectra of the free (opened circles) and interacting (full circles)Hamiltonians.

lem is discrete51 and then making a gradual transition to the continuousspectrum (e.g., increasing the size of the box to infinity). In the discreteapproximation, eq. (7.163) takes the form

m − ma =1

c2

i

|g(qi)|2m − mi

= F (m) (7.166)

where mi = ηqi are eigenvalues of the non-interacting mass of the 2-particlesystem b + c, and m1 = mb + mc. The function on the right hand side of eq. (7.166) is a superposition of functions |g(qi)|2(m − mi)

−1 for all valuesof i = 1, 2, 3, . . .. These functions have singularities at points mi, which areeigenvalues of the non-interacting Hamiltonian. Positions of these singular-ities are shown as open circles and dashed vertical lines in Fig. 7.7. Theoverall shape of the function F (m) in this approximation is shown by thethick full line in Fig. 7.7. According to eq. (7.166), the spectrum of the

51This can be achieved, e.g., by placing the system in a box or applying periodic bound-ary conditions.

Page 326: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 326/687

296 CHAPTER 7. THE FOCK SPACE

interacting mass operator can be found at points where the line m − ma

intersects with the function F (m). These points M i are shown by full cir-cles in Fig. 7.7. So, the derivatives required in eq. (7.165) are graphicallyrepresented as slopes of the function F (m) at points M 1, M 2, M 3, . . .. Thedifficulty is that in the limit of continuous spectrum the distances betweenpoints mi tend to zero, function F (m) wildly oscillates, and its derivativetends to infinity everywhere.

To overcome this difficulty we will use a clever idea first suggested in ref.[82]. Let us change the integration variable

z = ηq

q = η−1(z )

in the integral (7.164). Then denoting

K (z ) =dη−1(z )

dz G(η−1(z ))

we obtain

F (m) =

∞ mb+mc

dz K (z )m − z

=

m−∆ mb+mc

dz K (z )

m − z +

m+∆ m−∆

dz K (z )

m − z +

∞ m+∆

dz K (z )

m − z (7.167)

where ∆ is a small number such that function K (z ) may be considered con-stant (K (z ) = K (m)) in the interval [m− ∆, m +∆]. When ∆ → 0, the firstand third terms on the right hand side of (7.167) give the principal valueintegral (denoted by P )

m−∆ mb+mc

dz K (z )

m − z +

∞ m+∆

dz K (z )

m − z −→ P

∞ mb+mc

dz K (z )

m − z (7.168)

Page 327: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 327/687

7.5. PARTICLE DECAYS 297

Let us now look more closely at the second integral on the right hand side

of (7.167). The interval [m − ∆, m + ∆] can be divided into 2N small equalsegments

mi = m0 + i∆

N

where m0 = m, integer i runs from −N to N , and the integral can beapproximated as a partial sum

m+∆ m−∆

dz K (z )m − z

≈ K (m0)

m+∆ m−∆

dz 1m − z

≈ K (m0)

N i=−N

∆/N

m − m0 − i∆N

(7.169)

Next we assume that N → ∞ and index i runs from −∞ to ∞. Then theright hand side of eq. (7.169) defines an analytical function with poles atpoints

mi = m0 + i∆

N (7.170)

and with residues K (m0)∆/N . As any analytical function is uniquely deter-mined by the positions of its poles and the values of its residues, we concludethat integral (7.169) has the following representation

m+∆

m−∆

dz K (z )

m

−z

= K (m0)π cot(πN

∆(m − m0)) (7.171)

Indeed, the cot function on the right hand side of eq. (7.171) also has polesat points (7.170). Near these points the right hand side of (7.171) can beapproximated as

Page 328: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 328/687

298 CHAPTER 7. THE FOCK SPACE

K (m0)π cot(πN ∆

(m − m0)) ≈ πK (m0)πN ∆ (m − m0)

=K (m0)∆/N

(m − m0)

so it has exactly the required residues. Now we can put eqs. (7.168) and(7.171) together and write

F (m) = P (m) + πK (m)cot(πN m

∆)

where we denoted for brevity

P (m) ≡ P

∞ 0

dq G(q )

m − ηq

Then, using

cot(ax)′ = −a(1 + cot2(ax))

and neglecting derivatives of smooth functions (the principal value integraland K (m)), we obtain

F ′(m) = −π2K (m)N

∆(1 + cot2(πN ∆−1m)) (7.172)

We need values of F ′(m) at the discrete set of solutions of the equation

F (m) = m − ma

At these points we can write

m−

ma

=P

(m)−

πK (m)cot(πN ∆−1m)

cot(πN ∆−1m) = −m − ma − P (m)

πK (m)

cot2(πN ∆−1m) =(m − ma − P (m))2

π2K 2(m)

Page 329: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 329/687

7.5. PARTICLE DECAYS 299

Substituting this to (7.172) and (7.165) we obtain the desired result

F ′(m) = −π2K (m)N

∆(1 +

(m − ma − P (m))2

π2K 2(m))

|µ(m)|2 =1

1 + π2K (m)N ∆−1(1 + (ma+P (m)−m)2)π2K 2(m) )

(7.173)

=1

π2K (m)N ∆−1(1 + (ma+P (m)−m)2)π2K 2(m) )

=K (m)∆/N

π2K 2(m) + (ma +

P (m)

−m)2

(7.174)

where we neglected the unity in the denominator of (7.173) as comparedto the large factor N ∆−1. Formula (7.174) gives the probability for find-ing particle a at each point of the discrete spectrum M 1, M 2, M 3, . . .. Thisprobability tends to zero as the density of points N ∆−1 tends to infinity.However, when approaching the continuous spectrum in the limit N → ∞we do not need the probability for each spectrum point. We, actually, needthe probability density which can be obtained by multiplying the right handside of eq. (7.174) by the number of points per unit interval N ∆−1. Finally,the mass distribution for the unstable particle takes the famous Breit-Wigner form

|µ(m)|2 =Γ(m)/(2π)

Γ(m)2/4 + (ma + P (m) − m)2(7.175)

where we denoted Γ(m) ≡ 2πK 2(m). This resonance mass distributiondescribes an unstable particle with the expectation value of mass52 mA =ma + P (mA) and the width of ∆m ≈ Γ(mA) (see Fig. 7.8).

For unstable systems whose decays are slow enough to be observed inexperiment, the resonance shown in Fig. 7.8 is very narrow, so that insteadof functions Γ(m) and

P (m) we can use their values (constants) at m = mA:

Γ ≡ Γ(mA) and P ≡ P (mA). Moreover, we will assume that P ≪ ma, sothat the instability of the particle does not have a large effect on its mass. Wealso neglect the contribution from the isolated point M 0 of the mass spectrum

52the center of the resonance

Page 330: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 330/687

300 CHAPTER 7. THE FOCK SPACE

mmbb+m

ccmm

aa

mm

||µµ(m)|22

Γ Γ

Figure 7.8: Mass distribution of a typical unstable particle.

discussed in the beginning of this subsection. In most cases this contributionis very small. Then, substituting result (7.175) in equation (7.154), we obtainthe non-decay law for a particle at rest

ω(t) ≈ 1

4π2

∞mb+mc

dmΓe

imc2t

Γ2/4 + (ma − m)2

2

(7.176)

For most unstable systems

Γ ≪ ma − (mb + mc) (7.177)

so we can introduce further approximation by setting the lower integrationlimit in (7.176) to −∞. Then the non-decay law obtains the familiar expo-nential form

ω(t) ≈ e−Γc2t

= exp(− t

τ 0) (7.178)

where

τ 0 =

Γc2(7.179)

Page 331: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 331/687

7.5. PARTICLE DECAYS 301

is the lifetime of the unstable particle. For a particle prepared initially in

the undecayed state ω(0) = 1, the nondecay probability decreases from 1 toe−1 during its lifetime.The importance of formulas (7.175) and (7.178) is that they were derived

from very general assumptions. Actually, the only significant approximationis the weakness of the interaction responsible for the decay, i.e., the narrowwidth Γ of the resonance (7.177). This condition is satisfied for all knowndecays.53 Therefore, the exponential decay law is expected to be universallyvalid. This prediction is confirmed by experiment: so far no deviations fromthe exponential decay law (7.178) were observed.

53Approximation (7.177) may be not accurate for particles (or resonances ) decaying dueto strong nuclear forces. However, their lifetime is very short τ 0 ≈ 10−23s, so the timedependence of their decays cannot be observed experimentally.

Page 332: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 332/687

302 CHAPTER 7. THE FOCK SPACE

Page 333: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 333/687

Chapter 8

QUANTUMELECTRODYNAMICS

If it turned out that some physical system could not be described by a quantum field theory, it would be a sensation; if it turned out that the system did not obey the rules of quantum mechanics and relativity, it would be a cataclysm.

Steven Weinberg

From our previous discussion we know that in order to describe the dy-namics of an arbitrary system with a varying number of particles one has tobuild a relativistic cluster separable interacting Hamiltonian H = H 0 + V and boost operators K = K0 + Z in the Fock space. Statement 7.7 (clusterseparability of smooth interactions) tells us that the cluster separability isachieved simply by having smooth coefficients in the expansions of V andZ in terms of normal products of particle creation and annihilation opera-tors. Ensuring the Poincare invariance is a more difficult task. The Poincareinvariance means that interaction terms V and Z are consistent with each

other, so that commutation relations (6.22) - (6.26) are satisfied. The stan-dard way for satisfying these conditions is given by the quantum field the-ory (QFT). A particular version of QFT for describing interactions betweenelectrically charged particles and photons is called quantum electrodynamics (QED). This theory is the topic of our discussion in this chapter.

303

Page 334: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 334/687

304 CHAPTER 8. QUANTUM ELECTRODYNAMICS

For the time being, we will try to avoid any discussion of the physical

interpretation of classical and quantum fields. These issues will be consid-ered in chapter 11. In this chapter and in the next chapter we will adopta “minimalistic” approach in which quantum fields appear just as formallinear combinations of particle creation and annihilation operators. This ap-proach, in which the only role of quantum fields is to simplify the constructionof Poincare invariant interactions, was inspired by a non-traditional way of looking at quantum fields presented in Weinberg’s book [9].

The auxiliary role of the field-based formalism in our approach is un-derscored by the fact that constructions of quantum fields for fermions (elec-trons, protons, and their antiparticles) and photons are moved to AppendicesK and L, respectively. In section 8.1 we will write down interaction terms V

and Z and prove that the resulting theory is Poincare invariant. In section 8.2we will discuss how the S -matrix elements can be calculated in lowest ordersof the perturbation theory. In section 8.3, we will find out that the Hamilto-nian of QED actually leads to infinite results in higher order contributions tothe S -matrix. We will describe an important procedure of renormalizationwhich allows one to obtain physically sensible results for the S -matrix in allorders.

8.1 Interaction in QEDOur goal in this section is to build the interacting representation U (Λ, a) of the Poincare group in the Fock space (7.1). In this book we do not pretendto derive QED interactions from first principles. We simply borrow from thetraditional approach the form of four interacting generators of the Poincaregroup H and K as functions of quantum fields for electrons/positrons ψα(x),protons/antiprotons Ψα(x), and photons aµ(x) (see Appendices K and L).1

1At this point we do not offer any physical interpretation to quantum fields. For us

they are just abstract multicomponent functions from the Minkowski space-time M tooperators in the Fock space. Also, we are not identifying coordinates x and t in M withpositions and times of events measured in real life. The space-time M will be understoodas an abstract 4-dimensional space with pseudo-Euclidean metric. In chapter 11 we willdiscuss in more detail the physical meaning of quantum fields and the relationship betweentheir arguments and observable positions and times.

Page 335: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 335/687

8.1. INTERACTION IN QED 305

8.1.1 Construction of simple quantum field theories

In simple QFT theories the construction of relativistic interaction proceedsin three steps [9, 83]Step 1. For each particle type2 participating in the theory we construct aquantum field which is a multicomponent operator-valued function φi(x, t)defined on the Minkowski space-time M3 and satisfying the following condi-tions.

(I) Operator φi(x, t) contains only terms linear in creation or annihilationoperators of the particle and its antiparticle.

(II) Quantum fields are supposed to have simple transformation laws

U −10 (Λ, a)φi(x)U 0(Λ, a) = j

Dij(Λ−1)φ j(Λ(x + a)) (8.1)

with respect to the non-interacting representation of the Poincare groupin the Fock space,4 where Λ is a boost/rotation, a is a space-timetranslation, and Dij is a finite-dimensional (non-unitary) representationof the Lorentz group.

(III) Quantum fields turn to zero at x-infinity, i.e.

lim|x|→∞

φi(x, t) = 0 (8.2)

(IV) Fermion fields (i.e., fields for particles with half-integer spin) φi(x) andφ j(x′) are required to anticommute if (x − x′) is a space-like 4-vector,5

or equivalently

φi(x, t), φ j(y, t) = 0 if x = y (8.3)

Quantum fields for electrons-positrons ψα(x) and protons-antiprotons

Ψα(x) are constructed and analyzed in Appendix K.2Particle and its antiparticle are assumed to belong to the same particle type.3see Appendix I.14see subsection 7.1.85A 4-vector (x, t) is space-like if x2 > c2t2.

Page 336: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 336/687

306 CHAPTER 8. QUANTUM ELECTRODYNAMICS

(V) Boson fields (i.e., fields for particles with integer spin or helicity) at

points x and x′ are required to commute if (x − x′) is a space-like4-vector, or equivalently

[φi(x, t), φ j(y, t)] = 0 if x = y (8.4)

Quantum field for photons aµ(x) is constructed and discussed in Ap-pendix L.

Step 2. Having at our disposal quantum fields φi(x), ψ j(x), χk(x), . . . for allparticles we can build the potential energy density

V (x, t) =n

V n(x, t)

where each term has the form of a product of fields at the same (x, t) point

V n(x, t) =i,j,k,...

Gnijk...φi(x, t)ψ j(x, t)χk(x, t) . . . (8.5)

and coefficients Gnijk... are such that V (x)

(I) is a bosonic Hermitian operator function on the space-time M;

(II) transforms as a scalar with respect to the non-interacting representa-tion of the Poincare group:

U −10 (Λ, a)V (x)U 0(Λ, a) = V (Λ(x + a))

From properties (8.3) - (8.4) and the bosonic character of V (x) it is easyto prove that V (x) commutes with itself at space-like separations, e.g.,

[V (x, t), V (y, t)] = 0 if x = y (8.6)

Step 3. The interaction terms in the Hamiltonian and boost operator areobtained by integrating the potential energy density on x and setting t = 0

Page 337: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 337/687

8.1. INTERACTION IN QED 307

H = H 0 + V = H 0 +

dxV (x, 0) (8.7)

K = K0 + Z = K0 +1

c2

dxxV (x, 0) (8.8)

With these definitions, the commutation relations of the Poincare Liealgebra in the instant form of interacting dynamics (6.22) - (6.26) are notdifficult to prove (see [9, 83, 84] and subsection 8.1.4).

One can appreciate the simplicity and power of the quantum field formal-ism by comparing it with the Bakamjian-Thomas “direct interaction” theorydiscussed in section 6.3. There we found it rather difficult to make interac-

tions cluster separable, and to provide a unified treatment of systems withdifferent numbers of particles. In QFT we immediately obtain interactionoperators in the entire Fock space, and the cluster separability is satisfiedalmost automatically.6

Unfortunately, formulas (8.7) and (8.8) work only for simplest QFT mod-els. More interesting cases, such as QED, require some modifications in thisscheme. In particular, the presence of the additional term Ωµ(x, Λ) in thetransformation law of the photon field (L.17) does not allow us to define theboost interaction in QFT by simple formula (8.8). The construction of QEDinteractions and the full proof of their Poincare invariance will be discussedin the rest of this section.

8.1.2 Current density

In QED an important role is played by the operator of current density whichis defined as a sum of the electron/positron jµep(x) and proton/antiproton

jµ pa(x) current densities

jµ(x) = jµep(x) + jµ pa(x)

≡ −eψ(x)γ µψ(x) + eΨ(x)γ µΨ(x) (8.9)

where e is absolute value of the electron charge, gamma matrices γ µ aredefined in eqs (I.12) - (I.13), and quantum fields ψ(x), ψ(x), Ψ(x), Ψ(x) are

6coefficient functions of potentials in (8.7) are usually smooth in the momentum rep-resentation

Page 338: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 338/687

308 CHAPTER 8. QUANTUM ELECTRODYNAMICS

defined in Appendix K.1. Let us consider the electron/positron part jµep(x) of

the current density and derive three important properties of this operator.

7

First, with the help of (I.24), (I.26), and (K.25) we can find that the currentoperator (8.9) transforms as a 4-vector function on the Minkowski space-time.

U −10 (Λ, 0) jµep(x)U 0(Λ, 0) = −eU −10 (Λ, 0)ψ†(x)γ 0γ µψ(x)U 0(Λ, 0)

= −eU −10 (Λ, 0)ψ†(x)U 0(Λ, 0)γ 0γ µU −10 (Λ, 0)ψ(x)U 0(Λ, 0)

= −eψ†(Λx)D†(Λ−1)γ 0γ µD(Λ−1)ψ(Λx)

= −eψ†(Λx)D(Λ−1)γ 0D(Λ−1)D(Λ)γ µD(Λ−1)ψ(Λx)

= −eψ†(Λx)γ 0D(Λ)γ µD(Λ−1)ψ(Λx)

= −e3

ν =0

ψ†(Λx)γ 0Λ−1µν γ ν ψ(Λx)

=3

ν =0

Λ−1µν j

ν ep(Λx) (8.10)

From this we obtain a useful commutator

[(K 0)z, j0(x)]

= −i

c limθ→0d

dθ e

i(K 0)zcθ

j0(x)e−i(K 0)zcθ

= −i

climθ→0

d

dθ[ j0(x, y, z cosh θ − ct sinh θ, t cosh θ − z

csinh θ) cosh θ

+ jx(x, y, z cosh θ − ct sinh θ, t cosh θ − z

csinh θ) sinh θ]

= i (z

c2d

dt+ t

d

dz ) j0(x) − i

cjz(x) (8.11)

Translations act by shifting the argument of the current

U

−10 (0, a) j

µ

ep(x)U 0(0, a) = j

µ

ep(x + a) (8.12)Second, the current density satisfies the continuity equation which can beproven by using Dirac equations (K.30), (K.31) and property (I.14)

7Properties of the proton/antiproton part are similar.

Page 339: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 339/687

8.1. INTERACTION IN QED 309

∂ ∂t

j0(x) = −e ∂ ∂t

(ψ(x)γ 0ψ(x))

= −e(∂

∂tψ(x))γ 0ψ(x) + ψ(x)(γ 0

∂tψ(x))

= e(c∂

∂ xψ†(x)γ † +

i

mc2ψ†(x))γ 0ψ(x) + ψ(x)(cγ

∂ xψ(x) − i

mc2ψ(x))

= ec∂

∂ xψ(x)γψ(x) + ecψ(x)γ

∂ xψ(x)

= ec∂

∂ x(ψ(x)γψ(x))

= −c

∂ x j(x) (8.13)

Third, from eq. (K.29) it follows that current components commute at space-like separations

[ jµ(x, t), jν (y, t)] = 0, if x = y

8.1.3 Interaction operators in QED

The total Hamiltonian of QED is

H = H 0 + V (8.14)

where the non-interacting Hamiltonian H 0 is that from eq. (7.36), and in-teraction is composed of two terms8

V (t) = V 1(t) + V 2(t) (8.15)

The first order interaction is a pseudoscalar product of the fermion currentoperator and the photon quantum field9

8Here and in what follows we denote the power of the coupling constant e (the pertur-bation order of an operator) by a subscript, i.e., H 0 is zero order, V 1 is first order, V 2 issecond order, etc.

9 The last equality follows from definition of the pseudoscalar product (I.2) and eq.(L.6).

Page 340: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 340/687

310 CHAPTER 8. QUANTUM ELECTRODYNAMICS

V 1(t) = dx j(x, t) · a(x, t)

=

dxj(x, t)a(x, t) (8.16)

The second order interaction is

V 2(t) =1

2

dxdy j0(x, t)G(x − y) j0(y, t) (8.17)

where we denoted

G(x − y) =1

4π|x − y|The interaction in the boost operator

K(t) = K0(t) + Z(t)

is defined as

Z(t) = 1c2 dxxj(x, t)a(x, t) + 12c2

dxdy j0(x, t)xG(x − y) j0(y, t)

+1

c

dx j0(x, t)C(x, t) (8.18)

where components of the operator function C(x, t) are given by eq. (L.22).The Hamiltonian H and the boost operator K here are those usually

written in the Coulomb gauge version of QED [9, 83]. In this chapter we willfind that after renormalization this interaction can describe the S -matrix inexcellent agreement with experiment.10 However, we will see in subsection9.1.2 that this interaction cannot be used to describe the time evolution of

wave functions and observables. In chapter 9 we will find different expres-sions for H (and K) which, in addition to scattering, can describe the timeevolution as well.

10This Hamiltonian does not take into account the proton’s internal structure, thus itmakes an inaccurate prediction of the proton’s magnetic moment and related interactions.

Page 341: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 341/687

8.1. INTERACTION IN QED 311

8.1.4 Poincare invariance of QED

In this section we are going to prove the Poincare invariance of the theoryconstructed above, i.e., the validity of commutators (6.22) - (6.26). Theproof presented here is taken from Weinberg’s works [9, 83] and, especially,Appendix B in [84].

Interaction operator V (t) clearly commutes with operators of the totalmomentum and total angular momentum, so eq. (6.22) is easily verified. Zis a 3-vector by construction, so eq. (6.24) is valid as well. Let us now provethe commutator (6.23)11

[(P 0)i, Z

j(t)] =

i

c2V (t)δ

ij

Consider the case i = j = x and denote

V (x, t) ≡ j(x, t)a(x, t) +1

2

dy j0(x, t)G(x − y) j0(y, t)

so that

V (t) = dxV (x, t)

and

Z(t) =1

c2

dxxV (x, t) +

1

c

dx j0(x, t)C(x, t) (8.19)

Then, using eqs. (8.12) and (7.44) - (7.45) we obtain

[(P 0)x, Z x(t)]

= −i lima→0 d

dae i (P 0)xaZ x(t)e− i (P 0)xa

11In calculations it is convenient to write conditions (6.22) - (6.26) in a t-dependentform, i.e., to multiply these equations by exp(− i

H 0t) from the left and exp( i

H 0t) from

the right, as in (6.94). At the end of calculations we can set t = 0.

Page 342: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 342/687

312 CHAPTER 8. QUANTUM ELECTRODYNAMICS

= −i

c2lima

→0

d

da dxei(P 0)xa(xV (x, t) + cj0(x, t)C x(x, t))e− i

(P 0)xa

= −i

c2lima→0

d

da

dx(xV (x + a , y, z, t) + cj0(x + a , y, z, t)C x(x + a , y, z, t))

= −i

c2lima→0

d

da

dx((x − a)V (x, y, z, t) + cj0(x, y, z, t)C x(x, y, z, t))

= i 1

c2

dxV (x, y, z, t)

= i 1

c2V (t) (8.20)

which is exactly eq. (6.23).

The proof of eq. (6.26) is more challenging. Let us consider the case i = z and attempt to prove

[(K 0)z, V 1(t)] + [(K 0)z, V 2(t)] − i d

dtZ z(t) − [V (t), Z z(t)] = 0 (8.21)

where we took into account that [Z z(t), H 0] = −i ddt

Z z(t). We will calculateall four terms on the left hand side of (8.21) separately. Consider the firstterm and use eqs (8.11), (L.17), and (I.4)

[(K 0)z, V 1(t)]

= −i

climθ→0

d

dθei(K 0)zcθV 1(t)e− i

(K 0)zcθ

= −i

climθ→0

d

dθei(K 0)zcθ

dx j(x) · a(x)e− i

(K 0)zcθ

= −i

climθ→0

d

dx(Λ−1 j(Λx) · Λ−1a(Λx) + Λ−1 j(Λx) · Ω(x, Λ))

= −i

climθ→0

d

dx( j(Λx) · a(Λx) + Λ−1 j(Λx) · Ω(x, Λ))

= −i

climθ

→0 dx

d

dθ j(Λx) · a(x) + j(x) · d

dθa(Λx) + (

d

dθΛ−1) j(x) · Ω(x, 1)

+d

dθ j(Λx) · Ω(x, 1) + j(x) · d

dθΩ(x, Λ)

(8.22)

where Ω(x, Λ) is given by eq. (L.18) and Λ is matrix (1.54). Here we can usethe following results

Page 343: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 343/687

8.1. INTERACTION IN QED 313

limθ→0 d

dθ j(Λx) = lim

θ→0 ddθ

j(x,y,z cosh θ − ct sinh θ, t cosh θ − z c

sinh θ)

= limθ→0

∂j

∂z (z sinh θ − ct cosh θ) +

∂j

∂t(t sinh θ − z

ccosh θ)

= −ct∂j

∂z − z

c

∂j

∂t(8.23)

limθ→0

d

dθa(Λx) = −ct

∂a

∂z − z

c

∂a

∂t(8.24)

Further, using (L.21) and the continuity equation (8.13), we obtain that thelast term on the right hand side of eq. (8.22) is12

− i

climθ→0

dx j(x) · d

dθΩ(x, Λ)

=i

c

µν

dx jµ(x, t)gµν ∂ ν C z(x, t)

=i

c

dx j0(x, t)

∂tC z(x, t) + i

dxj(x, t)

∂ xC z(x, t)

=i

c

dx j0(x, t)

∂tC z(x, t) − i

dx

∂ x j(x, t)C z(x, t)

=

i

c dx j0(x, t)

∂t C z(x, t) +

i

c dx

∂t j0(x, t)C z(x, t)

=i

c

∂t

dx j0(x, t)C z(x, t) (8.25)

12 due to the property (8.2) all functions f and g of quantum fields vanish at infinity,therefore we can take integrals by parts (ξ ≡ (t ,x,y,z))

∞ −∞

dx(d

dxf (ξ ))g(ξ ) =

∞ −∞

dxd

dx(f (ξ )g(ξ )) −

∞ −∞

dxf (ξ )d

dxg(ξ )

= f (x =

∞)g(x =

∞)

−f (x =

−∞)g(x =

−∞)

−∞

dxf (ξ )d

dx

g(ξ )

= −∞

−∞

dxf (ξ )d

dxg(ξ )

Page 344: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 344/687

314 CHAPTER 8. QUANTUM ELECTRODYNAMICS

Substituting results (8.23), (8.24), (L.21), and (8.25) to eq. (8.22), we obtain

[(K 0)z, V 1(t)]

= −i

c

dx(−z

c

∂j

∂t· a(x) − j(x) · z

c

∂a

∂t− ∂

∂t( j0(x)C z(x)))

= −i ∂

∂t(

dx(− z

c2( j(x) · a(x)) − 1

c j0(x)C z(x)) (8.26)

For the second term on the left hand side of (8.21) we denote x = (t, x) andx′ = (t, x′) and use eq. (8.11)

[(K 0)z, V 2(t)]

=1

2

dxdx′[(K 0)z, j0(x)]G(x − x′) j0(x′) +

1

2

dxdx′ j0(x)G(x − x′)[(K 0)z, j0(x′)]

=

dxdx′[(K 0)z, j0(x)]G(x − x′) j0(x′)

= i

dxdx′(

z

c2∂j0(x)

∂t− 1

c jz(x))G(x − x′) j0(x′)

=i

2c2

dxdx′ ∂j0(x)

∂t(z − z ′)G(x − x′) j0(x′) +

i

2c2

dxdx′ ∂j0(x)

∂tz G(x − x′) j0(x′)

+

i

2c2 dxdx′∂j0(x)

∂t z ′G(x − x′) j0(x′) −i

c dxdx′ jz(x)G(x − x′) j0(x′)

= − i

2c

dxdx′ ∂ j(x)

∂ x(z − z ′)G(x − x′) j0(x′) +

i

2c2

dxdx′ ∂j0(x)

∂tz G(x − x′) j0(x′)

+i

2

dxdx′ j0(x)z G(x − x′)

∂j0(x′)c2∂t

− i

c

dxdx′ jz(x)G(x − x′) j0(x′)

=i

2c

dxdx′ j(x)

∂ ((z − z ′)G(x − x′))

∂ xj0(x′) +

i

2c2∂

∂t

dxdx′ j0(x)z G(x − x′) j0(x′)

− i

c

dxdx′ jz(x)G(x − x′) j0(x′) (8.27)

Using expression (8.18) for Z(t) we obtain for the third term on the left handside of eq. (8.21)

− i ∂

∂tZ z(t) = −i

c2∂

∂t

dxz j(x, t)a(x, t) +

i

c

∂t

dx j0(x, t)C z(x, t)

Page 345: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 345/687

8.1. INTERACTION IN QED 315

+i

2c2∂

∂t dxdy j0(x, t)z G(x − y) j0(y, t) (8.28)

In order to calculate the last term in (8.21), we notice that the only term inZ(t) which does not commute with V (t) is that containing C, therefore

− [V (t), Z z(t)] = −1

c

dxdx′ j(x) j0(x′)[a(x), C z(x′)] (8.29)

To calculate the commutator, we set t = 0 and use eq. (B.12)

[ai(x, 0), C

z(x′, 0)]

= i 3(2π )−3

dpdq

2

q 3 p

στ

[

ei(p, σ)cp,σeipx + e∗

i (p, σ)c†p,σe− i

px

,

ez(q, τ )cq,τ eiqx′ − ez(q, τ )c†

q,τ e− iqx′

]

= i 3(2π )−3

dpdq

2

q 3 p

στ

ei(p, σ)e∗z(q, τ )

[−δ (p − q)δ σ,τ eipx− i

qx′ − δ (p − q)δ σ,τ e

− ipx+ i

qx′ ]

= −i 3(2π )−3

dp

2 p2στ ei(p, σ)e∗

z(p, τ )δ σ,τ [eip(x−x′) + e− i

p(x−x′)]

= −i 3(2π )−3

dp

2 p2(δ iz − pi pz

p2)[e

ip(x−x′) + e− i

p(x−x′)]

= −i 3(2π )−3

dp

p2(δ iz − pi pz

p2)e

ip(x−x′)

= −i δ izG(x − x′) +i 3(−i )2

(2π )3∂ xi∂ z

dp

|p|4eip(x−x′)

= −i δ izG(x − x′) + i 5∂ xi∂ z|x − x′|

8π 4

=

−i δ iz

G(x

−x′) +

i

2

∂ xi((z

−z ′)

G(x

−x′)) (8.30)

Then

−[V (t), Z z(t)]

Page 346: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 346/687

316 CHAPTER 8. QUANTUM ELECTRODYNAMICS

=i

c i dxdx′ ji(x)δ izG(x − x′) j0(x′) − 1

2

∂xi[(z − z ′)G(x − x′)] j0(x′)

(8.31)

Now we can set t = 0, add four terms (8.26), (8.27), (8.28), and (8.31)together, and see that the first two terms in (8.28) cancel with the two termson the right hand side of (8.26); the third term in (8.28) cancels the secondterm on the right hand side of (8.27); and (8.31) exactly cancels the remainingfirst and third terms on the right hand side of (8.27). This proves eq. (8.21).

The proof of the last remaining commutation relation

[(K 0)i, Z j] + [Z i, (K 0) j] + [Z i, Z j] = 0 (8.32)

is left as an exercise for the reader. We now see that all commutators (6.22)- (6.26) are satisfied, so QED with potential energy (8.16) - (8.17)

V (t) =

dxj(x, t)a(x, t) +

dxdx′ j0(x, t)

1

8π|x − x′| j0(x′, t) (8.33)

and potential boost (8.18) is a Poincare invariant theory.Instead of writing interaction (8.33) in terms of quantum fields, for our

purposes it will be more convenient to have its expression through particle

creation and annihilation operators, as in chapter 7. For that, we just needto insert expansions (K.1) and (L.1) into equation (8.33). The resultingexpressions are rather long and cumbersome, so this derivation is done inAppendix M.

8.2 S -operator in QED

8.2.1 S -operator in the second order

In order to explore physical consequences of the interaction (8.33), let us

calculate the S -operator (6.100). The phase operator F in (6.101) can beexpanded in powers of the coupling constant

F (t) = F 1(t) + F 2(t) + . . .

Page 347: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 347/687

8.2. S -OPERATOR IN QED 317

where

F 1(t) = V 1(t)

F 2(t) = V 2(t) − 1

2[V 1(t), V 1(t)] (8.34)

. . .

Taking into account that operator V 1(t) is unphys, so that F 1(t) = V 1(t) = 0,

we can obtain the following perturbation expansion

S = e

F (t) = 1 + F (t) + 1

2! F (t) F (t) + . . .

= 1 + F 1(t) + F 2(t) + 1

2!F 1(t) F 1(t) + . . . (8.35)

= 1 + F 2(t) + . . . (8.36)

In this section we will calculate only 2nd order terms in S that are respon-sible for the proton-electron scattering. These are terms of the type d†a†da.Let us first evaluate expression −1

2[V 1, V 1]. The relevant terms in V 1(t) are13

V 1(t)

= − e

(2π )3/2

dkdpA

†α(p + k)Aβ (p)C αβ (k)e− i

(ωp+k−ωp−c|k|)t

− e

(2π )3/2

dkdpA

†α(p − k)Aβ (p)C †αβ (k)e− i

(ωp−k−ωp+c|k|)t

+e

(2π )3/2

dkdpD

†α(p + k)Dβ (p)C αβ (k)e− i

(Ωp+k−Ωp−c|k|)t

+e

(2π )3/2

dkdpD

†α(p − k)Dβ (p)C †αβ (k)e− i

(Ωp−k−Ωp+c|k|)t + . . .

According to (7.71), the corresponding terms in V 1(t) are

V 1(t)

13see eq. (M.5); operators A, C , D are defined in (K.17) - (K.24) and (L.7)

Page 348: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 348/687

318 CHAPTER 8. QUANTUM ELECTRODYNAMICS

=e

(2π

)3/2 dkdpA

†α(p + k)Aβ (p)C αβ (k)

e− i(ωp+k−ωp−c|k|)t

ωp+k − ωp − c|k|+

e

(2π )3/2

dkdpA

†α(p − k)Aβ (p)C †αβ (k)

e− i(ωp−k−ωp+c|k|)t

ωp−k − ωp + c|k|

− e

(2π )3/2

dkdpD

†α(p + k)Dβ (p)C αβ (k)

e− i(Ωp+k−Ωp−c|k|)t

Ωp+k − Ωp − c|k|

− e

(2π )3/2

dkdpD

†α(p − k)Dβ (p)C †αβ (k)

e− i(Ωp−k−Ωp+c|k|)t

Ωp−k − Ωp + c|k|+ . . . (8.37)

In order to obtain terms of the type D†A†DA in the expression [V 1, V 1],

we need to consider four commutators: the 1st term in V 1 commuting withthe 4th term in V 1, the 2nd term in V 1 commuting with the 3rd term in V 1,the 3rd term in V 1 commuting with the 2nd term in V 1, and the 4th term inV 1 commuting with the 1st term in V 1. Using commutator (L.8) we obtain

−1

2[V 1(t), V 1(t)]

= − e2

2(2π )3

dkdpdk′dp′A

†α(p + k)Aβ (p)D

†γ (p′ − k

′)Dδ(p′)

[C αβ (k), C †γδ(k′)]

e− i(ωp+k−ωp−c|k|)te− i

p′−k′−Ωp′+c|k′|)t

ωp+k − ωp − c|k|− e2

2(2π )3

dkdpdk′dp′A

†α(p − k)Aβ (p)D

†γ (p′ + k

′)Dδ(p′)

[C †αβ (k), C γδ(k′)]e− i

(ωp−k−ωp+c|k|)te− i

p′+k′−Ωp′−c|k′|)t

ωp−k − ωp + c|k|− e2

2(2π )3

dkdpdk′dp′D

†α(p + k)Dβ (p)A

†γ (p′ − k

′)Aδ(p′)

[C αβ (k), C †γδ(k′)]e− i

p′−k′−ωp′+c|k′|)te− i

(Ωp+k−Ωp−c|k|)t

Ωp+k

−Ωp

−c

|k

|− e2

2(2π )3

dkdpdk′dp′D

†α(p − k)Dβ (p)A

†γ (p′ + k

′)Aδ(p′)

[C †αβ (k), C γδ(k′)]e− i

p′+k′−ωp′−c|k′|)te− i

(Ωp−k−Ωp+c|k|)t

Ωp−k − Ωp + c|k| + . . .

Page 349: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 349/687

8.2. S -OPERATOR IN QED 319

=e2 2c

4(2π )3 dkdpdq

|k|

γ µαβ γ ν γδhµν (k)

( − D†γ (p − k)Dδ(p)A

†α(q + k)Aβ (q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

ωq+k − ωq − c|k|

+ D†γ (p + k)Dδ(p)A

†α(q − k)Aβ (q)

e− it(ωq−k−ωq+Ωp+k−Ωp)

ωq−k − ωq + c|k|

− D†α(p + k)Dβ (p)A

†γ (q − k)Aδ(q)

e− it(ωq−k−ωq+Ωp+k−Ωp)

Ωp+k − Ωp − c|k|

+ D†α(p − k)Dβ (p)A

†γ (q + k)Aδ(q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

Ωp−k − Ωp + c|k| + . . .)

= e2

2

c4(2π )3

dkdpdq|k| γ µαβ γ ν γδhµν (k)

( − D†α(p − k)Dβ (p)A

†γ (q + k)Aδ(q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

ωq+k − ωq − c|k|

+ D†α(p − k)Dβ (p)A

†γ (q + k)Aδ(q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

ωq+k − ωq + c|k|

− D†α(p − k)Dβ (p)A

†γ (q + k)Aδ(q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

Ωp−k − Ωp − c|k|

+ D

†α(p − k)Dβ (p)A

†γ (q + k)Aδ(q)

e− it(ωq+k−ωq+Ωp−k−Ωp)

Ωp−k − Ωp + c|k| + . . .)

= − e2 2c2

2(2π )3

dkdpdqγ µαβ γ ν γδ

hµν (k)

(q + k ÷ q )2

e− iE (p,q,k)tD

†α(p − k)A

†γ (q + k)Dβ (p)Aδ(q)

− e2 2c2

2(2π )3

dkdpdqγ µαβ γ ν γδ

hµν (k)

(P − K ÷ P )2

e− iE (p,q,k)tD

†α(p − k)A

†γ (q + k)Dβ (p)Aδ(q) (8.38)

where we denoted

( p ÷ q )2 ≡ (ωp − ωq)2 − c2|p − q|2(P ÷ Q)2 ≡ (Ωp − Ωq)2 − c2|p − q|2

and

Page 350: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 350/687

320 CHAPTER 8. QUANTUM ELECTRODYNAMICS

E (p, q, k) = ωq+k − ωq + Ωp−k − Ωp

is the energy function.Now consider the first term in (8.38), and use the form (L.11) of the

matrix hµν (k) with k0 defined as k0 = ωq+k − ωq. Using properties (M.3) -(M.4) we obtain

− e2 2c2

2(2π )3

dkdpdq

hµν (k)

(q + k ÷ q )2

e− iE (p,q,k)tD

†(p − k)γ µDβ (p)A

†(q + k)γ ν A(q)

= − e2 22(2π )3

dkdpdq e−

i

E (p,q,k)t

(q + k ÷ q )2

( D†(p − k)γ µD(p)c2gµν A

†(q + k)γ ν A(q)

+k0

k2D

†(p − k)γ µkµD(p)A

†(q + k)γ ν nν A(q)

+k0

k2D

†(p − k)γ µnµD(p)A

†(q + k)γ ν kν A(q)

− 1

k2D

†(p − k)γ µkµD(p)A

†(q + k)γ ν kν A(q)

+

c2k2

−(k0)2

k2 D

(p − k)γ µ

nµD(p)A

(q + k)γ ν

nν A(q))

= − e2 2

2(2π )3

dkdpdq

e− iE (p,q,k)t

(q + k ÷ q )2

( D†(p − k)γ µD(p)c2gµν A

†(q + k)γ ν A(q)

+k0

k2D

†(p − k)γ µkµD(p)A

†(q + k)γ ν nν A(q)

+c2k2 − (k0)2

k2D

†(p − k)γ µnµD(p)A

†(q + k)γ ν nν A(q))

=

e2 2

2(2π

)3 dkdpdqe− i

E (p,q,k)t

( D†(p − k)γ µD(p)

c2gµν (q + k ÷ q )2

A†(q + k)γ ν A(q)

+ωq+k − ωq

k2(q + k ÷ q )2D

†(p − k)γ µkµD(p)A

†(q + k)γ 0A(q)

Page 351: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 351/687

8.2. S -OPERATOR IN QED 321

− 1

k2D

†(p − k)γ 0D(p)A

†(q + k)γ 0A(q))

A similar calculation can be done for the second term in (8.38). Combiningthese two expressions with the term D†A†DA in V 2(t),14 we see that operatorF 2(t) in (8.34) can be written in the form

F 2(t)

= − 2e2

2(2π )3

dpdqdke− it

E (p,q,k)

( D†(p

−k)γ 0D(p)

2

|k|2

A†(q + k)γ 0A(q)

+ D†(p − k)γ µD(p)

c2gµν (q + k ÷ q )2

A†(q + k)γ ν A(q)

+ D†(p − k)γ µkµD(p)

ωq+k − ωq

k2(q + k ÷ q )2A

†(q + k)γ 0A(q)

− D†(p − k)γ 0D(p)

1

k2A

†(q + k)γ 0A(q)

+ D†(p − k)γ µD(p)

c2gµν (P − K ÷ P )2

A†(q + k)γ ν A(q)

+ D†(p

−k)γ 0D(p)

Ωp−k − Ωp

k2

(P − K ÷ P )2

A†(q + k)γ µkµA(q)

− D†(p − k)γ 0D(p)

1

k2A

†(q + k)γ 0A(q))

= − 2e2

2(2π )3

dpdqdke− it

E (p,q,k)

( D†(p − k)γ µD(p)

c2gµν (q + k ÷ q )2

A†(q + k)γ ν A(q)

+ D†(p − k)γ µkµD(p)

ωq+k − ωq

k2(q + k ÷ q )2A

†(q + k)γ 0A(q)

+ D†(p

−k)γ µD(p)

c2gµν

(P − K ÷ P )2A

†(q + k)γ ν A(q)

+ D†(p − k)γ 0D(p)

Ωp−k − Ωp

k2(P − K ÷ P )2A

†(q + k)γ µkµA(q)) (8.39)

14 the fourth term in eq. (M.8)

Page 352: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 352/687

322 CHAPTER 8. QUANTUM ELECTRODYNAMICS

Now we can insert eq. (8.39) in the formula for the S -operator (8.36). To

perform the integration by t from −∞ to ∞ we just need to substitute thet-exponent e− iE (p,q,k)t with the delta function 2πiδ (E (p, q, k)). This makes

the S -operator vanishing everywhere outside the energy shell E (p, q, k) = 0.So, we can use formulas

Ωp−k − Ωp = ωq − ωq+k

(P − K ÷ P )2 = (q + k ÷ q )2 (8.40)

Finally, we use properties (M.3) and (M.4) to obtain

S 2[d†a†da] = F 2(t)

= − e2i

(2π)2

dpdqdkδ (E (p, q, k))

D†(p − k)γ µD(p)

c2gµν (q + k ÷ q )2

A†(q + k)γ ν A(q) (8.41)

This formula is in a good agreement with experimental data and it exactlycoincides with the usual textbook result obtained in Feynman-Dyson pertur-bation theory. Our path to this formula used Magnus perturbation theory

[72, 73] and was much longer than the path usually presented in textbooks.Here we used representation (L.11) of the matrix hµν (k) to demonstrate thecancelation of terms proportional to |k|−2 coming from the 2nd order inter-action V 2(t) and from the commutator −1

2 [V 1(t), V 1(t)] in (8.34) [9]. Thiscancelation finally led us to the manifestly covariant result (8.41), althoughwe did not maintain the manifest covariance at each step, as is normally donein the Feynman-Dyson perturbation theory. Why did we choose this longpath? The answer is that Magnus perturbation theory will be found moreconvenient for derivation of the “dressed particle” Hamiltonian in chapter 9.

8.2.2 S -operator in the non-relativistic approximation

In the non-relativistic case, the momenta of electrons are much less thanmc, and momenta of protons are much less than Mc. Therefore, with goodaccuracy we can represent the S -operator (8.41) as a series in powers of 1/c

Page 353: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 353/687

8.2. S -OPERATOR IN QED 323

and leave only terms of order not higher than c−2. First, we can use (7.98)

to write

ωp + mc2 ≈

mc2 +

p2

2m+ mc2 =

2mc2 +

p2

2m

=√

2mc2

1 +

p2

4m2c2≈

√ 2mc2(1 +

p2

8m2c2)

ωp − mc2 ≈

mc2 +

p2

2m− mc2 =

p√ 2m

Near the energy shell we also obtain

1

(q + k ÷ q )2=

1

(ωq+k − ωq)2 − c2|k|2

≈ − 1

c2k2(8.42)

It is convenient to introduce notation

U µ(p, σ; p′, σ′) ≡ u(p, σ)γ µu(p′, σ′) (8.43)

W µ(p, σ; p′, σ′)≡

w(p, σ)γ µw(p′, σ′) (8.44)

To obtain the c−2 approximation of these expressions we use eqs. (K.13) -(K.16). To derive the component U 0 we use eq. (H.8)

U 0(p, σ; p′, σ′) (8.45)

= u(p, σ)γ 0u(p′, σ′)

= u†(p, σ)u(p′, σ′) (8.46)

= χ†σ( ωp + mc2, ωp − mc2(

p

p· σ))

ωp′ + mc2

ωp′

−mc2(p

p′

·σ) χσ′

1

2mc2

= χ†σ(

ωp + mc2

ωp′ + mc2 +

ωp − mc2

ωp′ − mc2(p · σ)(p′ · σ)

pp′ )χσ′1

2mc2

≈ χ†σ((1 +

p2

8m2c2)(1 +

( p′)2

8m2c2) +

pp′

4m2c2(p · σ)(p′ · σ)

pp′ )χσ′

Page 354: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 354/687

324 CHAPTER 8. QUANTUM ELECTRODYNAMICS

= χ†σ(1 +

p2 + ( p′)2 + 2p · p′ + 2iσ · [p × p′]8m2c2

)χσ′

= χ†σ(1 +

(p + p′)2 + 2iσ · [p × p′]8m2c2

)χσ′ (8.47)

Analogously, in the c−2 approximation

W 0(p, σ; p′, σ′) = w(p, σ)γ 0w(p′, σ′)

≈ χ†σ(1 +

(p + p′)2 + 2iσ · [p × p′]8M 2c2

)χσ′ (8.48)

Further, we calculate (using eqs. (H.6) and (H.7))

U(p, σ; p′, σ′)

= u(p, σ)γu(p′, σ′)

= χ†σ[

ωp + mc2,

ωp − mc2p · σ

p]γ 0

0 σ

−σ 0

ωp′ + mc2

ωp′ − mc2 p′·σ p′

χσ′

1

2mc2

= χ†σ[

ωp + mc2, −

ωp − mc2p · σ

p]

ωp′ − mc2 σ(p

′·σ) p′

−σ

ωp′ + mc2

χσ′

1

2mc2

= χ†σ( ωp + mc2 ωp′

−mc2

σ(p′ · σ)

p′(8.49)

+

ωp − mc2

ωp′ + mc2(p · σ)σ

p)χσ′

1

2mc2

≈ χ†σ(

√ 2mc2

p′√

2m

σ(p′ · σ)

p′ +√

2mc2p√ 2m

(p · σ)σ

p)χσ′

1

2mc2

= χ†σ((σ · p)σ + σ(σ · p′))χσ′

1

2mc

= χ†σ[p + i[σ × p] + p′ − i[σ × p′]]χσ′

1

2mc(8.50)

= χ†σ[p + p′ + i[σ

×(p

−p′)]]χσ′

1

2mc(8.51)

Analogously

W(p, σ; p′, σ′) ≈ χ†σ[p + p′ + i[σ × (p − p′)]]χσ′

1

2Mc(8.52)

Page 355: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 355/687

8.3. RENORMALIZATION 325

In the non-relativistic limit c → ∞, all formulas are further simplified

limc→∞

ωp = mc2

limc→∞

Ωp = Mc2

limc→∞

U 0(p, σ; p′, σ′) = χ†σχσ′ = δ σ,σ′

limc→∞

V 0(p, σ; p′, σ′) = δ σ,σ′

limc→∞

U(p, σ; p′, σ′) = 0

limc→∞

V(p, σ; p′, σ′) = 0

Using above results we can rewrite (8.41) in the non-relativistic approxima-tion

S 2[d†a†da]

=e2c2

(2π)2

σσ′ττ ′

dpdqdk

Mmc4δ (E (p, q, k)) Ωp−kΩpωq+kωq

W µ(p − k, σ; p, σ′)gµν

(q + k ÷ q )2U ν (q + k, τ ; q, τ ′)

d†p

−k,σdp,σ′a

†q+k,τ aq,τ ′

≈ − e2

(2π)2

σσ′ττ ′

dpdqdkδ (E (p, q, k))

δ σ,σ′δ τ,τ ′

k2d†p−k,σdp,σ′a

†q+k,τ aq,τ ′

= − e2i

(2π)2

στ

dpdqdk

δ (E (p, q, k))

k2d†p−k,σdp,σa†

q+k,τ aq,τ (8.53)

This result is consistent with our toy model (7.100). The difference in signis related to the fact that eq. (7.100) describes scattering of two electronshaving the same charge and, therefore, repelling each other, while eq. (8.53)refers to the attractive electron-proton interaction.

8.3 Renormalization

In spite of successful prediction of scattering amplitudes in the 2nd pertur-bation order, there is a very serious problem with the formalism of QED

Page 356: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 356/687

326 CHAPTER 8. QUANTUM ELECTRODYNAMICS

presented above. This is the problem of ultraviolet divergences. It appears

that higher order contributions to the S -operator are infinite. Proper discus-sion of these infinities requires rather elaborate calculations. To make ourpresentation in this section relatively simple, and, at the same time, retainingimportant features pertinent to QED, we will switch to discussion of our toymodel from section 7.3. This is justified, because, as we saw in subsection8.2.2, there are quite important similarities between two theories. We willreturn to general discussion of QED divergences in subsection 8.3.6.

8.3.1 Regularization

First, we need to deal with one rather technical problem. Diagrams in high

perturbation orders inevitably contain loops.15

If the coefficient functionsof interaction potentials do not decay sufficiently rapidly at large values of arguments (loop momenta) there is a danger that loop integrals may be di-vergent.16 We will see, that loop integrals do diverge in both QED and in ourmodel theory. One way to overcome this difficulty is by using regularization .The idea of regularization is to modify the theory in such a way that all loopintegrals are made finite. There are many different ways to achieve this goal.For example, we can simply introduce a momentum cut-off in all momen-tum integrals. Of course, the theory with such truncated integrals cannot beexact. Therefore, in a rigorous theory at the end of calculations the cutoff momentum should be set to infinity. This will make the integrals divergent

again, so nothing would be gained. The only reason to do regularization isto ensure that intermediate manipulations with truncated integrals do notinvolve infinities and, thus, are mathematically consistent.

Although the regularization is required for mathematical rigor, it sub-stantially lengthens calculations. In our studies we will not perform theregularization explicitly. Instead, we will formally treat all loop integrals “asif” they were finite. If a proper care is taken, this approach leads to the samefinal result as any regularized theory in the limit of infinite cutoff.

8.3.2 The mass renormalization condition

Even if all loop integrals were convergent this would not solve the problem of infinities in QFT: the S -operator might still be infinite. Let us now consider

15see subsection 7.4.116see subsection 7.4.3

Page 357: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 357/687

8.3. RENORMALIZATION 327

in more detail how these infinities appear and what we can do about them.

Generally, the scattering phase operator F in (6.100) may have unphys,phys, and renorm terms, so we can write the S -operator in a general form

S = eF

= exp(F unp + F ph + F ren )= exp( F ph + F ren ) (8.54)

where we noticed that unphys terms in F do not contribute to the S -operatordue to eq. (7.74). Let us now apply scattering operator (8.54) to an one-electron state

|p

= a†

p

|0

. It follows from Lemma 7.2 that phys operators

yield zero when acting on one-particle states. Renorm operators transformone-particle states to one-particle states. Therefore, we can write

S |p = exp( F ph + F ren )|p

= (1 + F ph + F ren + 1

2!( F ph + F ren )2 + . . .)|p

= (1 + F ph

+ F ren

+

1

2!( F ph

)2 +

1

2!F ph

F ren

+

1

2!F ren

F ph

+

1

2!(F ren

)2 + . . .)|p

= (1 + F ren +1

2!

(F ren )2 + . . .)

|p

= exp(F ren )|pA similar derivation can be performed for the vacuum vector

S |0 = exp(F ren )|0So, the scattering in these states is determined by the renorm part of F . Weknow from (7.73) that the t-integral F ren

is infinite, even if all terms in F ren

are finite. Therefore, if F ren = 0 then the action of the S -operator on 0- and1-particle states leads to infinite results. The S -operator relates to directly

measurable properties, so its divergence is rather disturbing. Thus in a finitetheory we must require that

F ren = 0 (8.55)

Page 358: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 358/687

328 CHAPTER 8. QUANTUM ELECTRODYNAMICS

which implies that operator F

must be purely phys.

F = F ph (8.56)

Equations (8.55) and (8.56) can be also understood from the physical mean-ing of the S -operator. The interacting time evolution from the distant pastto the distant future can be written as17

U (∞ ← −∞) = SU 0(∞ ← − ∞).

If this operator acts on the initial state without particles |0 in the past, wemay expect that in the future this state will transform again into the vacuumstate (particles cannot be permanently created out of nothing)

U (∞ ← − ∞)|0 = |0

Since the free time evolution operator U 0 leave the vacuum state invariant,the same must be true for the S -operator

S |0 = |0 (8.57)

Similarly, we can conclude that one-particle states also evolve from the dis-tant past to the distant future without change

Sa†p|0 = a†

p|0 (8.58)

Sc†k|0 = c†

k|0 (8.59)

Due to Lemma 7.2, equations (8.57) - (8.59) are equivalent to (8.55) - (8.56).We formulate these results as the following

Statement 8.1 (mass renormalization condition) There should be noscattering in the vacuum and one-particle states.

17see eq. (6.90)

Page 359: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 359/687

8.3. RENORMALIZATION 329

In other words, scattering is expected to occur only when there are at least

two particles which interact with each other. One particle has nothing tointeract with, and nothing can happen in the no-particle vacuum state. Letus see if this property is satisfied in our model theory from section 7.3. In thistheory, the 2nd order term Σ2(t) in eq. (7.95) has a renorm contribution18

Σren2 = (V 1(t)V 1(t))ren

This term is equal to the one shown in the diagram 7.3(g). As we saw in eq.(7.118) the loop integral is divergent there. Of course, as discussed earlier,this divergence can be fixed by regularization, but even finite regularized

diagrams of the type 7.3(g) are not acceptable - they violate Statement 8.1.Similar renorm terms appear also in higher perturbation orders. Therefore,we must conclude that this theory is not acceptable. It should be changedor renormalized to comply with Statement 8.1.

8.3.3 Counterterms

One possible way to eliminate renorm terms from Σ could be to formulatecertain subtraction rules for calculating scattering amplitudes (see, e.g., [85]),so that unwanted contributions to Σ are simply deleted from formula (6.99)

in each order of perturbation theory. This would be okay if our only goalwere to obtain matrix elements of the S -matrix and scattering cross sections.However, in the final theory we would like to keep the standard quantum-mechanical connection between the Hamiltonian and the S -operator given byeqs (6.98) and (6.99). Therefore, we will prefer another (equivalent) way todo the renormalization: We will modify the Hamiltonian by adding certainunphys U and renorm R counterterms to the interaction operator V .19 Inother words, we are saying that the original Hamiltonian with interaction(7.89)

18In the rest of this section, it will be convenient (the formulas are more compact) to

work with operator Σ(t) from eq. (6.99) rather than with F (t). It is not difficult to show,e.g., by using eqs. (6.102) and (6.103), that renormalization (the removal of renorm terms)of Σ(t) is equivalent to the removal of renorm terms from the operator F (t).

19It appears that in our model theory the renormalization is achieved by unphys andrenorm counterterms only. In more general cases, such as QED, phys counterterms shouldbe added as well.

Page 360: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 360/687

330 CHAPTER 8. QUANTUM ELECTRODYNAMICS

H = H 0 + V 1

is not good, and a modified Hamiltonian with renormalization counterterms20

H c = H 0 + V c

= H 0 + V 1 + U + R

better describes interactions between particles. To comply with Statement8.1 and eq. (8.55), we must choose the counterterms in such a way that the

scattering operator Σc

computed with the interaction V c

= V 1 + U + R

Σc(t) = V c(t) + V c(t)V c(t)V c(t) + V c(t)V c(t)V c(t)V c(t) + . . .(8.60)

does not contain renorm terms. From Table 7.2, it is clear that renorm termsin Σc may appear due to the presence of renorm and unphys terms in V c andin their products. So, in order to satisfy eq. (8.55), the counterterms U andR must maintain a certain balance, which would guarantee that all renormterms in Σc cancel out in all orders of perturbation theory. The procedureof finding such U and R is called mass renormalization . The reason for this

terminology is that the correct choice of U and R ensures the validity of eqs.(8.58) - (8.59), i.e., the absence of scattering in 1-particle states. We willsee that this is equivalent to saying that interaction V c(t) does not modifyparticle masses: the masses of particles are the same in both interacting andnon-interacting theories.

At this point the idea of fixing problems in the interaction V 1 by addingrenormalization counterterms looks rather ad hoc . How can we be certainthat interaction operator V c describes nature better that V 1? We will havemore discussions of the renormalization program and its physical meaning insubsection 8.3.6 and in chapter 9.

Let us now try to modify the Hamiltonian of our model theory fromsection 7.3 by adding Hermitian counterterms21 R2, U 3(t), R4, . . .

20The superscript ’c’ will denote operators that appear in a theory with renormalizationcounterterms added to the interaction.

21As usual, the subscript denotes the perturbation order, i.e. the power of the coupling

Page 361: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 361/687

8.3. RENORMALIZATION 331

H (t) → H c(t) = H 0 + V c(t)

V c(t) = H 0 + V 1(t) + R2 + U 3(t) + R4 + . . . , (8.61)

so that renorm terms are eliminated from the scattering operator Σc(t)

(Σc)ren = 0.

We obtain formulas for Σci(t) (i = 1, 2, 3, . . .) by inserting interaction (8.61)

in (8.60) and collecting terms of equal perturbation order

Σc1(t) = V 1(t) (8.62)

Σc2(t) = V 1(t)V 1(t) + R2 (8.63)

Σc3(t) = V 1(t)V 1(t)V 1(t) + R2V 1(t) + V 1(t)R2 + U 3(t) (8.64)

Σc4(t) = σ4(t) + U 3(t)V 1(t) + V 1(t)U 3(t) + R4 (8.65)

where we denoted

σ4(t) = V 1(t)V 1(t)V 1(t)V 1(t) + V 1(t)V 1(t)R2

+ V 1(t)R2V 1(t) + R2V 1(t)V 1(t),

Now we go order-by-order and choose counterterms R2, U 3(t), R4, . . . so thatall renorm terms on the right hand sides of eqs. (8.62) - (8.65) cancel out.The first-order term (8.62) is unphys (see eq. (7.89)), so there is no need forfirst order renormalization counterterms. To ensure that Σc

2(t) does not havea renorm part we choose a second-order counterterm22

constant e. We will see later that in our model theory unphys counterterms appear onlyin odd orders and renorm counterterms appear only in even orders. This is reflected in

the notation used for counterterms. We also take into account that renorm operators aret-independent.22See diagram 7.3(g). According to (7.118), this counterterm is divergent. As mentioned

in subsection 8.3.1, we could introduce a momentum cutoff to make this counterterm finite.But we will prefer to keep the formal expression (8.66) for R2 and other countertermsavoiding their numerical evaluation.

Page 362: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 362/687

Page 363: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 363/687

8.3. RENORMALIZATION 333

terms on the right hand side of (8.69). Formulas for these terms are rather

complicated, and we are not going to present them here. With the abovechoices, the operator Σc(t) does not contain renorm terms up to the 4thorder, as required by the mass renormalization condition (Statement 8.1).

In order to proceed with calculations of (8.70) we will need the following

Lemma 8.2 (“integration by parts”) For two regular non-renorm oper-ators A and B

AB

= − AB

Proof. We denote E (A) and E (B) energy functions of operators A and Band use eqs. (7.71) and (7.60)

AB = −2π AB δ (E (A) + E (B))E (B)−1

= 2π AB δ (E (A) + E (B))E (A)−1

= − AB

This result has the following implication for the diagram technique. If

we are interested in the behavior of a diagram on the energy shell, and thediagram has a box surrounding a subset of vertices, then we can obtain anequivalent diagram by (i) erasing this box, (ii) drawing another box sur-rounding the complement of the subset of vertices, and (iii) changing thetotal sign of the diagram.

Using Lemma 8.2 we can simplify expression (8.70) for σ4(t) σ4(t)

= V 1(t)V 1(t)(V 1(t)V 1(t)) p+u + V 1(t)R2V 1(t) + R2V 1(t)V 1(t) = (V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u + (V 1(t)V 1(t))ren(V 1(t)V 1(t)) p+u

Page 364: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 364/687

334 CHAPTER 8. QUANTUM ELECTRODYNAMICS

+ V 1(t)R2V 1(t)

+ R2(V 1(t)V 1(t)) p+u

+ R2(V 1(t)V 1(t))ren

= ((V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u) ph + (V 1(t)R2V 1(t)) ph = ((V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u) ph − (V 1(t)(V 1(t)V 1(t))renV 1(t)) ph (8.72)

8.3.4 Electron-electron scattering

The second order contribution to the electron-electron scattering in the renor-

malized theory (8.67) coincides with that calculated in subsection 7.3.4. So,renormalization does not affect physical results in the 2nd perturbation order.As we mentioned above, the first term on the right hand side of ( 8.68) hasodd number of particle operators, so it does not contribute to the electron-electron scattering. We will see shortly that U 3(t) does not contribute as

well. Let us now consider the 4th order term (8.69). First we use the dia-gram technique and calculate operator σ4(t) in (8.72). The product V 1(t)V 1(t)

which appears in the first term is shown in Fig. 8.1(a) - (d) (without normalordering). The normally ordered expression

(V 1(t)V 1(t)) p+u (8.73)

is shown in fig. 8.1(e) - (m). Diagrams representing

(V 1(t)V 1(t)) p+u (8.74)

differ from (V 1(t)V 1(t)) p+u by the absence of the big box around all vertices

and by moving the small box to another vertex.Here we are interested only in the electron-electron scattering described

by terms a†a†aa in the S -operator. Therefore, when calculating the product

(V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u

Page 365: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 365/687

8.3. RENORMALIZATION 335

VV11VV

11==

((dd))

(V11VV

11))p+u==

((ee))

((ii))

((ff))

((jj))

((mm))

((gg))

((kk))

((hh))

((ll))

((aa)) ((bb)) ((cc))

Figure 8.1: Calculation of (V 1(t)V 1(t)) p+u.

Page 366: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 366/687

336 CHAPTER 8. QUANTUM ELECTRODYNAMICS

we need to consider only three terms from (8.74) and three terms from (8.73)

as shown in Fig. 8.2.

23

The result of this calculation is shown in Fig. 8.3.Calculation of the second term in σ4(t) − (V 1(t)(V 1(t)V 1(t))renV 1(t)) ph

is shown in Fig. 8.4.By adding Fig. 8.3 and Fig. 8.4 together we can make some simplifica-

tions. First, consider terms (a) and (b) in Fig. 8.3 and the term (e) in Fig.8.4. These three diagrams have the same topology. The only difference is inenergy denominators. Let us denote the energy functions of the four vertices

in the diagram 8.4(e) by A, −A, C , and −C . Then the sum of the threediagrams is proportional to

− 1

AC (A + C )− 1

A2(A + C )+

1

A2C

=−A − C + A + C

A2C (A + C )= 0

Similarly, we can show that diagrams (c) and (d) in Fig. 8.3 and the diagram

(d) in Fig. 8.4 cancel each other. Next consider two diagrams (e) and (f) inFig. 8.3 which also have the same topology. The sum of these diagrams isproportional to

1

A(A + B)D+

1

B(A + B)D=

1

ABD

i.e., is equal to the diagram 8.5(b). Similarly, the sum of diagrams 8.3(g)and 8.3(h) is equal to the diagram 8.5(a). We can also simplify the sum of diagrams 8.3(m) and 8.3(n). Using notation for energy functions shown inFig. 8.3(m) we obtain that this sum is proportional to

1

A(A + B)D+

1

B(A + B)D=

1

ABD

Page 367: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 367/687

8.3. RENORMALIZATION 337

((aa))((bb))

((cc))

((dd))((ee))

((ff))

(V11VV

11))p+u(V

11VV

11))p+u==

++++ +...

++ ++ +...

Figure 8.2: Calculation of (V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u.

Page 368: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 368/687

338 CHAPTER 8. QUANTUM ELECTRODYNAMICS

((aa)) ((bb))

((gg))((ee))

((kk))

((ll))

((ii))((jj))

((mm))

((cc)) ((dd))

((ff)) ((hh))

((nn))

AA

BB

CC

DD

AA BB

CC DD

Figure 8.3: Terms in (V 1(t)V 1(t)) p+u(V 1(t)V 1(t)) p+u having structure a†a†aa.

Page 369: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 369/687

8.3. RENORMALIZATION 339

((dd))

+... == +... ==

((ee))

++

((aa)) ((bb)) ((cc))

AA

−A CC

−C

−− −− −−

−− −−

Figure 8.4: Calculation of −V 1(t)(V 1(t)V 1(t))renV 1(t).

Page 370: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 370/687

340 CHAPTER 8. QUANTUM ELECTRODYNAMICS

((aa)) ((bb))

((cc)) ((dd)) ((ee))

((ff)) ((gg))

pp

qq

q+k

p−k

kkhh hh

kk

pp

qq

q+k

p−k

BB

FFCC

EE

Figure 8.5: The part of the operator σc4(t) having operator structure a†a†aaon the energy shell.

Page 371: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 371/687

8.3. RENORMALIZATION 341

i.e., is equal to the diagram 8.5(g).

Finally, σ4(t) is represented by seven diagrams shown in Fig. 8.5. Letus first discuss the contribution of diagrams 8.5(a) - 8.5(b) to the coefficientfunction D for the electron-electron scattering on the energy shell. Usingdiagram rules from subsection 7.3.3, we obtain

D(a+b)4 (p, q, k) = − ie4c2

(2π)5 2(ωp−k − ωp + ck)k

dh

h(

1

BC +

1

EF ) (8.75)

where B = ωp−h − ωp + ch, C = ωp−k − ωp−h−k − ch, E = ωq−h − ωq + ch,and F = ωq+k − ωq+k−h − ch. This contribution is infinite, because at

large values of h the integrand behaves as h−3

.

24

Thus we conclude that themass renormalization procedure described in section 8.3.2 has not removedall divergences. To solve this problem we need to perform a second renor-malization step known as the charge renormalization procedure. This stepis explained in the next subsection.

8.3.5 Charge renormalization

First note that the divergence of terms D(a+b)4 is not the only problem. As dis-

cussed above, we could make D(a+b)4 finite by introducing regularization (e.g.,

momentum cutoff in the integral on h). However, even after regularizationthis term would have a singularity k−2 at k → 0. Such a singularity is respon-sible for low-energy electron-electron scattering controlled at large distancesby a long-range Coulomb potential.25 From classical physics and experimentwe also know that electron-electron interactions at large distances and lowenergies depend on e2 (in our language they are of the second perturbationorder) and they are accurately described by the 2nd order term (7.100). Weshould require the low energy limit of our theory to be consistent with thisbehavior. The 4th order term (8.75) adds undesired long-distance low-energyscattering of charged particles that depends on e4. The presence of this termsmeans that the electron charge in the interacting theory is different from the

charge e that was assumed before the introduction of interaction. To make23The terms (d), (e), and (f) on the diagram 8.2 are the same as terms (h), (l), and (m),

respectively, on the diagram 8.1.24see footnote on page 27525see subsection 7.3.4

Page 372: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 372/687

342 CHAPTER 8. QUANTUM ELECTRODYNAMICS

the situation even worse, this “charge correction” is infinite. So, we will

postulate that in orders higher that 2nd, coefficient functions with k = 0singularities like (8.75) should not be present at all, whether they are infiniteor finite.

Postulate 8.3 (charge renormalization condition) : Scattering of charged particles at large distances and low energies is described exactly by the 2nd order term S 2 in the S -operator. All higher order contributions to the low-energy scattering should vanish.

According to this Postulate, the 4th order contribution to scattering shouldbe non-singular at k

→0. Therefore, contributions like (8.75) should be elim-

inated. This elimination can be achieved by proper selection of countertermsin the Hamiltonian. The mass renormalization condition (Statement 8.1) didnot define all counterterms in an unambiguous way. So far we have deter-mined only the renorm part of U 3 in eq. (8.71). We still have a freedom of choosing the unphys part of U 3. If we choose an (infinite) unphys countertermU 3 as shown in Fig. 8.6(a) and 8.6(b), then expression

U 3(t)V 1(t) + V 1(t)U 3(t) (8.76)

in (8.69) (see Fig. 8.6(e) and 8.6(f)) exactly cancels unwanted infinite dia-grams 8.5(a) and 8.5(b) on the energy shell.

So, in our model theory the mass renormalization condition (Statement8.1) unambiguously determines the form of (infinite) renorm countertermsR2, R4,... On the other hand, the charge renormalization condition (Postulate8.3) is responsible for unphys counterterms U 3, U 5,...

Surviving diagrams 8.5(c) - 8.5(g) are the fourth-order radiative correc-tions to the electron-electron scattering. On the energy shell these correctionsare

D(c

−g)

4 (p, q, k)

= − ie4c2

(2π)5 2

dh

h|h + k|1

A(

1

BC +

1

DC +

1

EF +

1

DG+

1

EG) (8.77)

where relevant energy functions are denoted by

Page 373: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 373/687

8.3. RENORMALIZATION 343

((cc)) ((dd))

UU33VV11 +...

==

((ee)) ((ff))

+...

((aa)) ((bb))

UU33==

VV11UU33==++

Figure 8.6: (a) - (b): charge renormalization counterterms U 3(t); (c) - (f):calculation of a†a†aa terms in U 3(t)V 1(t) + V 1(t)U 3(t) on the energy shell.

Page 374: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 374/687

344 CHAPTER 8. QUANTUM ELECTRODYNAMICS

A = ωq−h − ωq + ch

B = ωp−k−h − ωp + c|h + k|C = ωq−h + ωp−k + c|h + k| − ωq − ωp

D = ωp+h + ωq−k − ωq − ωp

E = ωq+k − ωq + |h + k| + ch

F = ωq+k + ωp−h−k + ch − ωq − ωp

G = ωq+k + ωp+h + c|h + k| − ωq − ωp

The integrand in (8.77) is proportional to h−5 and the integral is convergent

at large values of the loop momentum h. Therefore the 4th order radiativecorrection to the scattering operator Σc

4(a†a†aa)(t) is finite on the energyshell. So, in the renormalized theory the electron-electron scattering is rep-resented by a finite S -operator which up to the 4th order is described by thecoefficient function

D = D2 + D4 + . . . (8.78)

where D2 is given by eq. (7.101) and D4 is given by eq. (8.77).

8.3.6 Renormalization in QED

The renormalization steps outlined in subsections 8.3.2 - 8.3.5 for the toymodel can be repeated, though with greater computational efforts, for variousscattering processes in QED. These steps are sufficient for elimination of renorm terms and cancelation of ultraviolet divergences in the S -operator inall perturbation orders. The renormalization theory in QED was developedby Tomonaga, Schwinger, and Feynman in the late 1940’s and had enormoussuccess. Just few low orders in the perturbation expansion of the S -operatorwith the Hamiltonian H c are sufficient to obtain scattering cross sections,

the electron’s magnetic moment, and energies of bound states (e.g., Lambshifts in atoms) in remarkable agreement with experiment.

Similar renormalization steps should be also repeated for interacting boostoperators. However, here we have a problem. How we can be sure that ad-dition of renormalization counterterms to H and K does not destroy the

Page 375: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 375/687

8.3. RENORMALIZATION 345

important property of Poincare invariance? Although, we do not have a pre-

scription for renormalization in boosts, there is an argument that stronglysuggests that what we have done is, in fact, Poincare invariant.26 To explainthis argument, let us look at the renormalization program from a different(more traditional) perspective.

The QED Hamiltonian given by eqs. (7.36) and (8.14) - (8.17) depends onthree parameters: the electron mass m, the proton mass M , and the electroncharge e:

H (m,M,e) = H 0(m, M ) + V 1(m,M,e) + V 2(m,M,e) (8.79)

A remarkable property of QED is that addition of renormalization countert-erms is equivalent to simply modifying the values of parameters m, M , ande.27 In other words, after adding unphys U , renorm R, and phys P coun-terterms in all orders of the perturbation theory, the QED Hamiltonian H c

acquires the same functional form as the original Hamiltonian H in (8.79)but with renormalized (infinite) masses m, M and renormalized (infinite)charge e28

H c = H (m,M,e) + U + R + P (8.80)

= H (m, M, e)

Then the Poincare invariance of the renormalized QED becomes obvious:Since renormalization amounts simply to modifying parameters in the Hamil-tonian (and corresponding parameters in boost operators), it cannot changePoincare algebra commutators in any way.

The physical interpretation of renormalization will be discussed in moredetail in section 9.1.

26Another way of proving the Poincare invariance of the renormalized theory is to per-form regularization by introducing Lorentz-invariant cutoff factors in coefficient functionsof interactions [86]. Then, the regularized theory becomes non-local, but still satisfies thePoincare commutation relations. In this case, introduction of counterterms preserves the

Poincare invariance, even with a finite cutoff. The renormalized theory in the limit of infinite cutoff remains Poincare invariant as well.27This equivalence within a non-covariant approach similar to ours was discussed in refs.

[87, 88, 89].28This also means that renormalization counterterms U , R, and P have the same oper-

ator structure as terms H 0, V 1, and V 2 in the original Hamiltonian (8.79).

Page 376: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 376/687

346 CHAPTER 8. QUANTUM ELECTRODYNAMICS

Page 377: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 377/687

Part II

A RELATIVISTICQUANTUM THEORY OF

PARTICLES

347

Page 378: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 378/687

Page 379: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 379/687

349

In the first part of this book we presented a fairly traditional view on

relativistic quantum field theory. This well-established approach had greatsuccesses in many important areas of high energy physics, in particular, in thedescription of scattering events. However, it also has a few troubling spots.First is the problem of ultraviolet divergences. The idea of self-interactingbare particles with infinite masses and charges seems completely unphysical.Moreover, QFT is not suitable for the description of time evolution of particleobservables and their wave functions. In this second part of the book, wesuggest that these problems can be solved by abandoning the idea of quantumfields as basic ingredients of nature and returning to the old (going back toNewton) concept of particles interacting via direct forces. This reformulationof QFT is achieved by applying the “dressed particle” approach first proposed

by Greenberg and Schweber [10].The existence of instantaneous action-at-a-distance forces implies the real

possibility of sending superluminal signals. Then we find ourselves in con-tradiction with special relativity, where faster-than-light signaling is strictlyforbidden (see Appendix J.2). This paradox forces us to take a second lookon the derivation of basic results of special relativity, such as Lorentz trans-formations for space and time coordinates of events. We find that previ-ous approaches missed one important ingredient. Specifically, they ignoredthe fact that in interacting systems generators of boost transformations areinteraction-dependent. The full recognition of this fact allows us to reconcilefaster-than-light interactions with the principle of causality in all referenceframes and to build a consistent relativistic theory of interacting quantumparticles in this second part of the book.

Page 380: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 380/687

350

Page 381: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 381/687

Chapter 9

THE DRESSED PARTICLEAPPROACH

The first principle is that you must not fool yourself - and you are the easiest person to fool.

Richard Feynman

In this chapter we will continue our discussion of quantum electrodynam-

ics - the theory of interacting charged particles (electrons, protons, etc.) andphotons. Great successes of this theory are well known. Here we are goingto focus on its less-known weak points. The most obvious problem of QEDis related to extremely weird properties of its fundamental ingredients - bareparticles. The masses and charges of bare electrons and protons are infiniteand the Hamiltonian H c of QED is formally infinite1 as well.2

We are going to demonstrate that the formalism of QED can be signifi-cantly improved by removing ultraviolet-divergent terms from the Hamilto-nian and abandoning the idea of non-observable virtual and bare particles. Inparticular, we will find a finite “dressed” or “physical” particle HamiltonianH d which, in addition to accurate scattering operators (as in the traditionalQED), also provides a good description of the time evolution and boundstates. We will call this approach the relativistic quantum dynamics (RQD)

1when the cutoff momentum is send to infinity, as it should2see subsection 8.3.6

351

Page 382: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 382/687

352 CHAPTER 9. THE DRESSED PARTICLE APPROACH

because, unlike traditional quantum field theory concerned with calculations

of time-independent S -matrix, RQD emphasizes the dynamical, i.e., time-dependent nature of interacting processes.In section 9.3 we will briefly discuss applications of RQD to the simplest

quantum mechanical problem - the structure of the hydrogen atom. We willderive the interaction between dressed electrons and protons in the lowest2nd perturbation order and demonstrate how well-known fine and hyperfineatomic level structures can be calculated. We will also discuss an extensionof this approach to higher perturbation orders.

9.1 Troubles with renormalized QED

9.1.1 Renormalization in QED revisited

Let us now take a closer look at the renormalized QED and recall the logicwhich led us from the original interaction Hamiltonian V in (M.5) and (M.8)to the interaction with counterterms V c (see eq. (8.80)).

One distinctive feature of the interaction V is that it contains unphysand renorm terms. In order to obtain the scattering operator F one needsto calculate multiple commutators of V (6.101). It is clear from Table 7.2that these commutators will give rise to renorm terms3 in each perturbationorder of F . However, according to eq. (8.55) and Statement 8.1 (the mass

renormalization condition), there should be no renorm terms in the operatorF of any sensible theory. This requirement can be satisfied only if there is acertain balance of unphys and renorm terms in V , so that all renorm termsin F cancel out. However, for most QFT interactions V there is no suchbalance.4 So, we have a contradiction.

The traditional renormalization approach5 suggests the following resolu-tion of this paradox: change the interaction operator from V to V c by adding(infinite) counterterms. If these counterterms are properly chosen, then onecan achieve cancelation of all renorm terms in F c (Statement 8.1 - no scat-tering in the vacuum and 1-particle states) and ensure accurate scattering

properties of charged particles in the low energy regime (Postulate 8.3). In-troduction of these renormalization counterterms is effectively equivalent to

3in commutators [U, U ] and [R, R]4This includes interactions in our toy model (7.89) and in QED (M.5) and (M.8).5see section 8.3

Page 383: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 383/687

9.1. TROUBLES WITH RENORMALIZED QED 353

making the masses and charges of free particles infinite.6 Somewhat mirac-

ulously, the S-matrix with thus modified Hamiltonian H

c

= H 0 + V

c

agreeswith experiment at all energies and for all scattering processes.The traditional interpretation of the renormalization approach is that

infinities in the Hamiltonian H c (8.80) have a real physical meaning. Thecommon view is that bare electrons and protons really have infinite massesm and M , and infinite charges ±e.7 The fact that such particles were neverobserved in nature is usually explained as follows: Bare particles are noteigenstates of the total Hamiltonian H c. The electrons and protons observedin experiments are complex linear combinations of multiparticle bare states.These linear combinations do have correct (finite) measurable masses m andM and charges

±e. This situation is often described as bare particles being

surrounded by “clouds” of virtual particles , thus forming physical or dressed particles . The virtual cloud modifies the mass of the bare particle byan infinite amount, so that the resulting mass is exactly the one measuredin experiments. The cloud also “shields” the (infinite) charge of the bareparticle, so that the effective charge becomes ±e.

Even if we accept this weird description of physical reality, it is clearthat the renormalization program did not solve the problem of ultravioletdivergences in quantum field theory. The divergences were removed from theS -operator, but they reappeared in the Hamiltonian H c in the form of infinitecounterterms, and this approach just shifted the problem of infinities fromone place to another. Inconsistencies of the renormalization approach con-cerned many prominent scientists, such as Dirac and Landau. For example,Rohrlich wrote

Thus, present quantum electrodynamics is one of the strangest achievements of the human mind. No theory has been confirmed by experiment to higher precision; and no theory has been plagued by greater mathematical difficulties which have withstood repeated attempts at their elimination. There can be no doubt that the

6see subsection 8.3.67It is also common to hypothesize that these bare parameters may be actually very large

rather than infinite. The idea is that “granularity” of space-time or other yet unknownPlanck-scale effect sets a natural momentum limit, which does not allow to send themomentum cutoff to infinity. This “effective field theory” approach assumes that QEDis just a low energy approximation to some unknown divergence-free truly fundamentaltheory operating at the Planck scale. Speculations of this kind are not needed for thedressed particle approach developed in the next section.

Page 384: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 384/687

354 CHAPTER 9. THE DRESSED PARTICLE APPROACH

present agreement with experiments is not fortuitous. Neverthe-

less, the renormalization procedure can only be regarded as a tem-porary crutch which holds up the present framework. It should be noted that, even if the renormalization constants were not infinite,the theory would still be unsatisfactory, as long as the unphysical concept of “bare particle” plays a dominant role. F. Rohrlich[90]

9.1.2 Time evolution in QED

In addition to peculiar infinite masses and charges of bare particles, tradi-tional QED predicts rather complex dynamics of the vacuum and one-particle

states. Let us forget for a moment that interaction terms in H c

are infinite,and apply the time evolution operator exp( i

H ct) to the vacuum (no-particle)

state. Expressing V c in terms of creation and annihilation operators of par-ticles (M.5) and (M.8) and omitting numerical factors we obtain8

|0(t) = eiH ct|0

= (1 +it

(H 0 + V c) + . . .)|0

= |0 + ta†b†c†|0 + td†f †c†|0 + . . .

=

|0

+ t

|abc

+ t

|df c

+ . . . (9.1)

We see that various multiparticle states (|abc, |df c, etc.) are created fromthe vacuum during time evolution. So, the vacuum is full of appearing anddisappearing virtual particles. The physical vacuum in QED is not just anempty state without particles. It is more like a boiling “soup” of particles,antiparticles, and photons.

Similar disturbing problems become evident if we consider the time evo-lution of one-electron states. One bare electron a†|0 is not an eigenstate of the full Hamiltonian H c. Therefore, the initial state of one electron quicklytransforms into a multi-particle state: bare electron + photons + electron-positron pairs + proton-antiproton pairs + ...

|a(t) = eiH ct|a

8Here we are concerned only with the presence of a†b†c† and d†f †c† interaction termsin (M.5). All other terms are omitted.

Page 385: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 385/687

9.1. TROUBLES WITH RENORMALIZED QED 355

= (1 +it

(H 0 + V c) + . . .)|a

= |a + ta†b†c†a†|0 + td†f †c†a†|0 + . . .

= |a + t|aabc + t|adfc + . . . (9.2)

Such a behavior of zero- and one-particle states has not been seen in experi-ments. Obviously, if a theory cannot get right the time evolutions of simplestzero-particle and one-particle states, there is no hope of predicting the timeevolution in more complex multiparticle states.

The reason for these unphysical time evolutions is the presence of unphys(e.g., a†b†c† + d†f †c†) and renorm (a†a + b†b + d†d + f †f ) terms in the inter-action V c of the renormalized QED. This problem is separate from the factthat coefficient functions of these terms are infinite. How is it possible thatsuch an unrealistic Hamiltonian leads to exceptionally accurate experimentalpredictions?

The important point is that unphys and renorm interaction terms in H c

are absolutely harmless when the time evolution in the infinite time range(from −∞ to ∞) is considered. As we saw in eq. (6.91), such time evolutionis represented exactly by the product of the non-interacting time evolutionoperator and the S -operator

U (∞ ← − ∞) = U 0(∞ ← −∞)S c

The factor U 0 in this product leaves invariant no-particle and one-particlestates. The factor S c has the same properties due to the cancelation of unphys and renorm terms in F c as discussed in the preceding subsection. So,in spite of ill-defined operators H c and exp( i

H ct), QED is perfectly capable

of describing scattering.

Luckily for QED, current experiments with elementary particles are notdesigned to measure time-dependent dynamics in the interaction region. In-teraction processes occur almost instantaneously in particle collisions. High

energy particle physics experiments are, basically, limited to measurements of scattering cross-sections and energies of bound states, i.e., properties encodedin the S -matrix. In this situation, the inability of the theory to describe timeevolution can be tolerated. But there is no doubt that time-dependent pro-cesses in high energy physics will be eventually accessible to more advanced

Page 386: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 386/687

356 CHAPTER 9. THE DRESSED PARTICLE APPROACH

experimental techniques.9 Clearly, without accurate description of time evo-

lution, we cannot claim a success in developing a comprehensive theory of subatomic phenomena.

9.1.3 Unphys and renorm operators in QED

In the preceding subsection we saw that the presence of unphys and renorminteraction operators in V c is responsible for unphysical time evolution. Itis not difficult to see that the presence of such terms is inevitable in anylocal quantum field theory where interaction Hamiltonian is constructed as apolynomial of quantum fields [92]. We saw in (K.1) and (L.1) that quantumfields of both massive and massless particles always have the form of a sum

(creation operator + annihilation operator)

ψ ∝ α† + α

Therefore, if we construct interaction as a product (or polynomial) of fields,we would necessarily have unphys and renorm terms there. For example,converting to the normal order a product of four fields

V ∝ ψ4

= (α† + α)(α† + α)(α† + α)(α† + α)

= α†α†α†α† + α†α†α†α + α†ααα + αααα + α†α† + αα (9.3)

+ α†α + C (9.4)

+ α†α†αα (9.5)

we obtain unphys terms (9.3) together with renorm terms (9.4) and onephys term (9.5). So, this operator cannot properly describe interacting timeevolution.

This analysis suggests that any quantum field theory is destined to sufferfrom renormalization difficulties. Do we have any alternative? The prevalentopinion is that “no, there is no alternative to quantum field theory”

9 Time dynamics of wave functions can be resolved in some experiments in atomicphysics, e.g., with Rydberg states of atoms [91]. Moreover, time evolution is clearlyobservable in everyday life. So, a consistent theory of subatomic phenomena should beable to describe such phenomena in its low-energy limit. Strictly speaking, the Hamiltonianof renormalized QED does not have a finite low-energy limit.

Page 387: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 387/687

9.2. DRESSING TRANSFORMATION 357

The bottomline is that quantum mechanics plus Lorentz invari-

ance plus cluster decomposition implies quantum field theory. S.Weinberg [1]

If this statement was true, then we would find ourselves in a very troublingsituation. Luckily, this statement is not true. It is possible to construct asatisfactory relativistic quantum theory where renormalization problems areabsent. This possibility is discussed in the next section.

9.2 Dressing transformation

The position taken in this book is that the presence of infinite countertermsin the Hamiltonian of QED H c is not acceptable and that the Tomonaga-Schwinger-Feynman renormalization program was just a first step in the pro-cess of elimination of infinities from quantum field theory. In this section weare going to propose how to make a second step in this direction: removeinfinite contributions from the Hamiltonian H c and solve the paradox of ultraviolet divergences in QED.

Our solution is based on the dressed particle approach which has a longhistory. Initial ideas about “persistent interactions” in QFT were expressedby van Hove [93, 94]. First clear formulation of the dressed particle concept

and its application to model quantum field theories are contained in a brilliantpaper by Greenberg and Schweber [10]. This formalism was further appliedto various quantum field models including the scalar-field model [95], the Leemodel [96, 97, 98, 99], and the Ruijgrok-Van Hove model [11, 100]. The wayto construct the dressed particle Hamiltonian as a perturbation series in ageneral QFT theory was suggested by Faddeev [101] (see also [102, 103, 104]).Shirokov with coworkers [12, 105, 106, 107] further developed these ideas and,in particular, demonstrated how the ultraviolet divergences can be removedfrom the Hamiltonian of QFT up to the 4th perturbation order (see also[108]).

9.2.1 Physical particles

The simplest way to avoid renormalization problems is to demand that in-teraction Hamiltonian V does not contain unphys and renorm terms. In this

Page 388: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 388/687

358 CHAPTER 9. THE DRESSED PARTICLE APPROACH

case, there are only phys terms in V ,10 so that each term has at least two

creation and at least two annihilation operators

V = α†α†αα + α†α†ααα + α†α†α†αα + . . . (9.6)

According to Table 7.2 in subsection 7.2.5, commutators of phys terms canbe only phys. Therefore, when the F -operator is calculated (6.101) only physterms can appear there. So, F has a form similar to (9.6), and both V andF yield zero when acting on zero-particle and one-particle states

V

|0

= V α†

|0

= 0

F |0 = F α†|0 = 0

Therefore, the mass renormalization condition11 is automatically satisfied,i.e., the S -operator is equal to unity in the zero-particle and one-particlesectors of the Fock space.

S |0 = exp(F (t) )|0= (1 + F ph(t)

+

1

2!F ph(t)

F ph(t)

+ . . .)|0

= |0Sα†|0 = (1 + F ph(t) + 1

2!F ph(t) F ph(t) + . . .)α†|0

= α†|0Moreover, the time evolution of the vacuum and one-particle states is notdifferent from the free time evolution

|0(t) = eiHt |0

= (1 +it

(H 0 + V ph) + . . .)

|0

= (1 +

it

H 0 + . . .)|0

10 Recall that decay and oscillation operators are not present in QED.11eq. (8.55) and Statement 8.1

Page 389: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 389/687

9.2. DRESSING TRANSFORMATION 359

= eiH 0t|0

|α(t) = ei

Ht

α†|0= (1 +

it

(H 0 + V ph) + . . .)α†|0

= (1 +it

H 0 + . . .)α†|0

= eiH 0tα†|0

as it should be. Physically this means that in addition to forbidding scat-tering in zero-particle and one-particle states (Statement 8.1) we forbid any(self-)interaction in these states.

Postulate 9.1 (stability of vacuum and one-particle states) There is no (self-)interaction in the vacuum and one-particle states, i.e., the time evo-lution of these states is not affected by interaction and is governed by the non-interacting Hamiltonian H 0. Mathematically, this means that the interaction Hamiltonian V is phys.

Summarizing discussions from various parts of this book, we can put togethera list of restrictions that should be satisfied by any realistic interaction

(A) Poincare invariance (Statement 3.2);

(B) instant form of dynamics (Postulate 10.2);

(C) cluster separability (Postulate 6.3);

(D) no self-interactions (Postulate 9.1);

(E) finiteness of coefficient functions of interaction potentials;

(F) coefficient functions should rapidly tend to zero at large values of mo-menta 12

As we saw in subsection 9.1.3, requirement (D) practically excludes allfield-theoretical Hamiltonians. The question is whether there are non-trivialinteractions that have all the properties (A) - (F)? And the answer is “yes”.

12According to Theorem 7.12, this guarantees convergence of all loop integrals involvingvertices V , and, therefore, the finiteness of the S -operator calculated with interaction V .

Page 390: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 390/687

360 CHAPTER 9. THE DRESSED PARTICLE APPROACH

One set of examples of allowed interacting theories is provided by “direct

interaction” models.

13

Two-particle models of this kind were first constructedby Bakamjian and Thomas [4]. Sokolov [6, 7], Coester and Polyzou [8] showedhow this approach can be extended to cover multi-particle systems. Thereare recent attempts [109] to extend this formalism to include description of systems with variable number of particles. In spite of these achievements, the“direct interaction” approach is currently applicable only to model systems.One of the reasons is that conditions for satisfying the cluster separabilityare very cumbersome. This mathematical complexity is evident even in thesimplest 3-particle case discussed in subsection 6.3.7.

In the “direct interaction” approach, interactions are expressed as func-tions of (relative) particle observables, e.g., relative distances and momenta.

However, it appears more convenient to write interactions as polynomials inparticle creation and annihilation operators (7.57). We saw in Statement 7.7that in this case the cluster separability condition (C) is trivially satisfiedif the coefficient functions have smooth dependence on particle momenta.The no-self-interaction condition (D) simply means that interaction is phys.The instant form condition (B) means that generators of space translationsP = P0 and rotations J = J0 are interaction-free, and that interaction V commutes with P0 and J0. The most difficult part is to ensure the relativisticinvariance (condition (A)), i.e., commutation relations of the Poincare group.One way to solve this problem is to fix the operator structure of interactionterms and then find the momentum dependence of coefficient functions bysolving a set of differential equations resulting from Poincare commutators(6.26) [71, 110, 111, 112, 113, 114, 115]. Kita demonstrated that there isan infinite number of solutions for these equations, i.e., an infinite numberof interacting models which satisfy conditions (A) - (F). Apparently, thereshould be additional physical principles that would single out a unique theoryof interacting particles that agrees with experimental observations. Unfortu-nately, these additional principles are not known at this moment.

9.2.2 The main idea of the dressed particle approach

Kita-Kazes approach [71, 110, 111, 112, 113, 114, 115] is difficult to apply torealistic particle interactions, so, currently, it cannot compete with QFT. Itmight be more promising to abandon the idea to build relativistic interactions

13Some of them were discussed in section 6.3.

Page 391: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 391/687

9.2. DRESSING TRANSFORMATION 361

from scratch and, instead, try to modify QFT to make it consistent with the

requirement (D). One argument in favor of such an approach is that the S -matrix of the usual renormalized QED agrees with experiment very well. So,we may add the following requirement to the above list:

(G) the S -operator of our theory is exactly the same as the S -operator of the renormalized QED.

Let us denote the desired “phys” interaction operator by V d. Then, con-dition (G) means that the Hamiltonian H d = H 0+V d is scattering equivalentto the QED Hamiltonian H c = H 0 + V c. Recalling our discussion in section6.5.3 we see that this can be achieved if H d and H c are related by a unitary

transformation

H d = H 0 + V d

= eiΦH ce−iΦ (9.7)

= eiΦ(H 0 + V c)e−iΦ

= (H 0 + V c) + i[Φ, (H 0 + V c)] − 1

2![Φ, [Φ, (H 0 + V c)]] + . . . (9.8)

where Hermitian Φ satisfies condition (6.104). The new Hamiltonian H d willbe called the dressed particle Hamiltonian, and transformation eiΦ will be

called the dressing transformation .

9.2.3 The unitary dressing transformation

Now our goal is to find a unitary transformation eiΦ which ensures that thedressed particle Hamiltonian H d satisfies all properties (A) - (G). In thisstudy we will need the following useful results

Theorem 9.2 (transformations preserving the S -operator) A unitary transformation of the Hamiltonian

H ′ = eiΦHe−iΦ

preserves the S -operator if the Hermitian operator Φ has the form ( 7.56 ) -( 7.57 ) where all terms ΦNM have smooth coefficient functions.

Page 392: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 392/687

362 CHAPTER 9. THE DRESSED PARTICLE APPROACH

Proof. Let us write the left hand side of the scattering-equivalence condition

(6.104) for each term ΦNM in Φ

limt→±∞

e− iH 0tΦNM e

iH 0t

= limt→±∞

η,η′

dq′1 . . . dq′

N dq1 . . . dqM DNM (q′1η

′1; . . . ; q′

N η′N ; q1η1; . . . ; qM ηM ) ×

δ (

N i=1

q′i −

M j=1

q j)e− iEtα†

q′1,η′1

. . . α†q′N ,η

′N

αq1,η1 . . . αqM ,ηM

where E is the energy function of this term. The limits t→ ±∞

are zeroby Riemann-Lebesgue lemma B.1, because the coefficient function DNM issmooth, while factor e− i

Et oscillates rapidly in the momentum space. There-

fore, according to (6.105), Hamiltonians H and H ′ are scattering-equivalent.

Lemma 9.3 Potential B14 is smooth if B is either unphys with arbitrary smooth coefficient function or phys with a smooth coefficient function which is identically zero on the energy shell.

Proof. The only possible source of singularity in B is the energy denominatorE −1B (see eq. (7.71)) which is singular on the energy shell. However, foroperators satisfying conditions of this Lemma, either the energy shell doesnot exist, or the coefficient function vanishes there. So, the product of thecoefficient function with E −1B is regular on the energy shell.

Next we introduce regularization, e.g., a cutoff at large integration momentaΛ15 which ensures that all interactions V ci in the Hamiltonian of QED arefinite,16 and that all loop integrals involved in calculations of products and

14For definition of the underlined symbol see (6.96).15see subsection 8.3.116

We will assume that final results do not depend on how exactly the cutoffs are intro-duced and lifted, so we will not perform the regularization explicitly. It will be sufficientto simply assume that all loop integrals and counterterms are made finite. We can do that,because we are not going to evaluate these integrals explicitly. As discussed in subsections7.3.1 and 7.4.3, we will also avoid singularities at |k| = 0 and related infrared divergencesby assigning a small mass to photons.

Page 393: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 393/687

9.2. DRESSING TRANSFORMATION 363

commutators of interactions V ci are convergent. In this section we are going

to prove that in this regulated theory the operator Φ can be chosen so thatconditions (A) - (G) are satisfied in all orders. Of course, in order to getaccurate results, in the end of calculations the momentum cutoff should besend to infinity. Only those quantities may have physical meaning, whichremain finite in this limit. The limit of removed regularization cutoff Λ → ∞will be considered in subsection 9.2.7.

Multiplying (9.8) by e− iH 0t from the left and e

iH 0t from the right we

enforce the t-dependence (7.58) characteristic for regular operators.17 Wewill assume that all relevant operators can be written as expansions in powersof the coupling constant, and all series converge

H c = H 0 + V c1 + V c2 + . . . (9.9)

H d = H 0 + V d1 + V d2 + . . . (9.10)

Φ = Φ1 + Φ2 + . . . (9.11)

Using these expansions and collecting together terms of equal order weobtain an infinite set of equations.

V d1 (t) = V c1 (t) + i[Φ1(t), H 0]

V d2 (t) = V c2 (t) + i[Φ2(t), H 0] + i[Φ1(t), V c1 (t)] − 12!

[Φ1(t), [Φ1(t), H 0]]

V d3 (t) = V c3 (t) + i[Φ3(t), H 0] + i[Φ2(t), V c1 (t)] + i[Φ1(t), V c2 (t)]

− 1

2![Φ2(t), [Φ1(t), H 0]] − 1

2![Φ1(t), [Φ2(t), H 0]] − 1

2![Φ1(t), [Φ1(t), V c1 (t)]]

− i

3![Φ1(t), [Φ1(t), [Φ1(t), H 0]]]

. . .

Using property (7.67) these equations can be transformed to

17 The t-dependence is introduced here merely to assist in calculations. At the end of calculations we should set t = 0 to obtain the Hamiltonian (9.10) in the final form which,of course, does not depend on time. So, parameter t has no relationship to the physicaltime . It would be less confusing if we chose another letter to denote this parameter. Butwe simply follow usual conventions here.

Page 394: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 394/687

364 CHAPTER 9. THE DRESSED PARTICLE APPROACH

V d1 (t) = V c1 (t) + dΦ1(t)

dt(9.12)

V d2 (t) = V c2 (t) + dΦ2(t)

dt+ i[Φ1(t), V c1 (t)] +

i

2![Φ1(t),

dΦ1(t)

dt] (9.13)

V d3 (t) = V c3 (t) + dΦ3(t)

dt+ i[Φ2(t), V c1 (t)] + i[Φ1(t), V c2 (t)]

+i

2![Φ2(t),

dΦ1(t)

dt] +

i

2![Φ1(t),

dΦ2(t)

dt] − 1

2![Φ1(t), [Φ1(t), V c1 (t)]]

3![Φ1(t), [Φ1(t),

dΦ1(t)

dt]] (9.14)

. . .

Now we need to solve these equations order-by-order. This means that weneed to choose appropriate operators Φi(t), so that interaction terms V di onleft hand sides satisfy above conditions (B) - (G).18 Let us start with the firstperturbation order.

9.2.4 Dressing in the first perturbation order

In QED, V c1 (t) = V 1(t) is unphys,19 therefore in eq. (9.12) we can choose

Φ ph1 (t) = Φren1 = 0 and use eqs. (6.96) and (7.71) to write

Φunp1 (t) = −1

t−∞

V 1(t′)dt′

≡ iV 1(t) (9.15)

Then V d1 (t) = 0, so that conditions (B) - (F) are trivially satisfied in thisorder. The coefficient function of V 1(t) is non-singular. By Lemma 9.3 thisimplies that Φunp

1 (t) in eq. (9.15) is smooth. By Theorem 9.2, the presence

of this term in the dressing transformation eiΦ does not affect the S -operatorin agreement with our condition (G).

18We will discuss condition (A) separately in subsection 9.2.8.19see eq. (M.5)

Page 395: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 395/687

9.2. DRESSING TRANSFORMATION 365

9.2.5 Dressing in the second perturbation order

Now we can substitute the operator Φ1(t) found above into eq. (9.13) andobtain

V d2 (t) = V c2 (t) + dΦ2(t)

dt− [V c1 (t), V c1 (t)] +

1

2![V c1 (t), V c1 (t)]

= V c2 (t) + dΦ2(t)

dt− 1

2[V c1 (t), V c1 (t)] (9.16)

It is convenient to write separately unphys, phys, and renorm parts of thisequation and take into account that dΦren

2 /dt = 0

(V d2 )unp(t) = (V c2 )unp(t) + dΦunp

2 (t)

dt− 1

2[V c1 (t), V c1 (t)]unp (9.17)

(V d2 ) ph(t) = (V c2 ) ph(t) + dΦ ph2 (t)

dt− 1

2[V c1 (t), V c1 (t)] ph (9.18)

(V d2 )ren = (V c2 )ren − 1

2[V c1 (t), V c1 (t)]ren (9.19)

From the condition (D) it follows that (V d2 )unp(t) must vanish, therefore weshould choose in (9.17)

Φunp2 (t) = i(V c2 )unp(t) − i

2[V 1(t), V 1(t)]unp (9.20)

Operators V 1(t) and (V c2 )unp are smooth. Therefore, by Lemma 9.3, the oper-ator V 1(t) is also smooth, and by Lemma 7.11 the commutator [V 1(t), V 1(t)]unp

is smooth as well. Using Lemma 9.3 again, we see that operator Φunp2 (t) is

smooth, and by Theorem 9.2 its presence in the transformation eiΦ does notaffect the S -operator.

Let us now turn to eq. (9.18) for the phys part of the dressed particleinteraction V d2 . First, note that unlike Φunp

2 (t), we cannot choose Φ ph2 (t) so as

to cancel completely the phys part of interaction. Such a cancelation wouldrequire us to define

Φ ph2 (t) = i(V c2 ) ph(t) − i

2[V c1 (t), V c1 (t)] ph

Page 396: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 396/687

366 CHAPTER 9. THE DRESSED PARTICLE APPROACH

However, the right hand side has t-integrals of phys operators which have

non-empty energy shells. Therefore, these integrals have singularities on theenergy shell (see eq. (7.71)), but we are not allowed to use a transformationwith singular operator Φ, because this would change the S -operator andviolate condition (G).

Note also that we cannot simply choose Φ ph2 (t) = 0, because in this case,

the dressed particle interaction would acquire the form

(V d2 ) ph(t) = (V c2 ) ph(t) − 1

2[V c1 (t), V c1 (t)] ph

and there is absolutely no guarantee that the coefficient function of (V d2 ) ph(t)

rapidly tends to zero at large values of particle momenta (condition (F)).20In order to have this guarantee, we are going to choose Φ ph2 (t) such that

dΦph2 (t)

dt cancels out two other terms on the right hand side of (9.18) whenmomenta are far from the energy shell. In addition, we will require thatΦ ph2 (t) is non-singular. Both conditions can be satisfied by choosing21

Φ ph2 (t) = i(V c2 ) ph(t) − i

2[V c1 (t), V c1 (t)] ph (1 − ζ 2) (9.21)

where ζ 2 is a real function of particle momenta, such that

(I) ζ 2 is equal to 1 on the energy shell;

(II) ζ 2 depends on rotationally invariant combinations of momenta (to makesure that V d2 commutes with P0 and J0);

(III) ζ 2 is smooth;

(IV) ζ 2 rapidly tends to zero when the arguments move away from the energyshell.22

20This has the danger that loop integrals involving (V d2 ) ph(t) would not converge thus

implying a divergent S -operator.21Notice that this part of our dressing transformation closely resembles the “similarityrenormalization” procedure suggested by Glazek and Wilson [13, 14]. Recall also thedefinition of the symbol from subsection 7.2.3

22For example, we can choose ζ 2 = e−αE 2 where α is a positive constant and E is theenergy function of the operator in curly brackets in (9.21).

Page 397: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 397/687

9.2. DRESSING TRANSFORMATION 367

With the choice (9.21) we obtain

(V d2 ) ph(t) = (V c2 ) ph(t) − 1

2[V c1 (t), V c1 (t)] ph ζ 2 (9.22)

so that (V d2 ) ph(t) rapidly tends to zero when momenta of particles move awayfrom the energy shell in agreement with condition (F).

Now we can set

Φren2 = 0 (9.23)

and prove that condition (D) is satisfied automatically. We already proved(V d2 )unp(t) = 0, so we are left to demonstrate (V d2 )ren = 0 First note that withthe above definitions (9.15), (9.20), (9.21), (9.23) the operator Φ1(t) + Φ2(t)is smooth. Therefore, according to Theorem 9.2, the S -operator obtainedwith the transformed interaction V d2 (t) agrees with the S -operator S c up tothe second perturbation order (condition (G)). This would be impossible if V d2 (t) contained a non-zero renorm term.23 Therefore we must conclude that(V d2 )ren = 0 and that two terms on the right hand side of (9.19) cancel eachother.24

9.2.6 Dressing in an arbitrary order

For any perturbation order i, the selection of Φi(t) and the proofs of (B)- (G) are similar to those described above for the 2nd order. The definingequation for V di (t) can be written in a general form25

V di (t) = V ci (t) + dΦi(t)

dt+ Ξi(t) (9.24)

where Ξi(t) is a sum of multiple commutators involving V c j (t) from lowerorders (1 ≤ j < i) and their t-integrals. This equation is solved by

23

this would imply that operators F

d

2 (t) and F

c

2 (t) also have non-zero renorm terms indisagreement with eq. (8.55).24The cancelation of two terms on the right hand side of (9.19) can be seen explicitly in

our model theory from section 7.3. In that case (V c2 )ren is the renormalization counterterm

(8.66) R2 = (V 1(t)V 1(t))ren = 12 [V 1(t), V 1(t)]ren.

25compare with (9.16)

Page 398: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 398/687

368 CHAPTER 9. THE DRESSED PARTICLE APPROACH

Φreni = 0 (9.25)

Φunpi (t) = iΞunp

i (t) + iV unpi (t), (9.26)

Φ phi (t) = i(Ξ phi (t) + V phi (t)) (1 − ζ i) (9.27)

where function ζ i has properties (I) - (IV). Similar to the 2nd order discussedabove, one then proves that Φi(t) is smooth, so that condition (G) is satisfiedin the i-th order.

Solving equations (9.24) order-by-order we obtain the dressed particleHamiltonian

H d

(t) = eiΦ(t)

H c

(t)e−iΦ(t)

(9.28)= H 0 + V d2 (t) + V d3 (t) + V d4 (t) + . . . (9.29)

which satisfies properties (B) - (G) as promised.

9.2.7 The infinite momentum cutoff limit

It seems rather obvious that conditions (B) - (D) and (F) remain valid in-dependent on the momentum cutoff Λ. Therefore, they also remain valid inthe limit of infinite cutoff. Let us now demonstrate that condition (E) alsoremains valid in this limit. To do that, we use (6.100) and (6.101), and write

log S c = F c2 (t) + F c3 (t) + F c4 (t) + . . .

log S d = V d2 (t) + V d3 (t) + V d4 (t) −1

2[V d2 (t), V d2 (t)] . . .

Since S c = S d by the condition (G), we obtain the following set of relationsbetween V di (t) and F ci (t) on the energy shell

V d2 (t)

= F c2 (t)

(9.30)

V

d

3 (t) = F

c

3 (t) (9.31)

V d4 (t) = F c4 (t) +1

2[V d2 (t), V d2 (t)] (9.32)

V di (t) = F ci (t) + Qi(t) , i > 4 (9.33)

Page 399: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 399/687

9.2. DRESSING TRANSFORMATION 369

where Qi(t) denotes a sum of multiple commutators of V d j (t) from lower

orders (2 ≤ j ≤ i−2) with t-integrations. These relations are independent onthe cutoff Λ, so they remain valid when Λ → ∞. In this limit operators F ci (t)are finite and assumed to be known on the energy shell from the standardrenormalized QED theory. This immediately implies that V d2 (t) and V d3 (t)are finite on the energy shell, and, due to the condition (F), they are finitefor all momenta.

The equation for the 4th order potential (9.32) is different from (9.30)and (9.31) by the presence of an additional term

1

2

[V d2 (t), V d2 (t)] (9.34)

on its right hand side. How can we be sure that this expression is finite onthe energy shell? According to Theorem 7.12, the fast decay of the potentialV d2 off the energy shell (condition (F)) ensures the convergence of all loopintegrals which can appear in calculations of this commutator. Therefore,commutator (9.34) is finite on the energy shell, and V d4 in (9.32) is finite onthe energy shell and everywhere else. These arguments can be repeated inall higher orders, which proves that the dressed particle Hamiltonian H d isfree of ultraviolet divergences.

9.2.8 Poincare invariance of the dressed particle ap-proach

The next question is whether the theory with the transformed HamiltonianH d is Poincare invariant (condition (A))? In other words, whether thereexists such a boost operator Kd that the set of generators P0, J0, Kd, H dsatisfies Poincare commutators? With the unitary dressing operator exp(iΦ)constructed above this problem has a simple solution. If we define Kd =eiΦKce−iΦ, then we can obtain a full set of dressed generators via unitarytransformation of the old generators26

P0, J0, Kd, H d = eiΦP0, J0, Kc, H ce−iΦ

26Note that operator exp(iΦ) commutes with P0 and J0 by construction.

Page 400: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 400/687

370 CHAPTER 9. THE DRESSED PARTICLE APPROACH

The dressing transformation eiΦ is unitary and, therefore, preserves commu-

tators. Since old operators obey the Poincare commutators, the same is truefor the new generators. This proves that the transformed theory is Poincareinvariant and belongs to the instant form of dynamics [116].

9.2.9 General properties of physical particle interac-tions

One may notice that even after conditions (I) - (IV) are satisfied for functionsζ i, there is a great deal of ambiguity in choosing their behavior outside theenergy shell. Therefore, the dressing transformation eiΦ is not unique, andthere is an infinite set of dressed particle Hamiltonians that satisfy our re-quirements (A) - (G). Which dressed Hamiltonian should we choose? Beforetrying to answer this question, we can notice that all of Hamiltonians satis-fying conditions (A) - (G) have some important common properties, whichwill be described in this subsection.

Note that electromagnetic interactions are rather weak. In most situationthe (expectation value) of the interaction potential energy is much less thanthe sum of particle energies (mc2). To describe such situations it is sufficientto know the coefficient functions of the interaction only near the energy shellwhere we can approximately set ζ i ≈ 1 for each perturbation order i.27 Thisobservation immediately allows us to obtain a good approximation for the

second-order interaction from eq. (9.22) by setting ζ 2 ≈ 1

V d2 (t) ≈ (V c2 ) ph(t) − 1

2[V c1 (t), V c1 (t)] ph (9.35)

Some examples of potentials present in V d2 are shown in Table 9.1. We canclassify them into two groups: elastic potentials and inelastic potentials.Elastic potentials do not change the particle content of the system. Theyhave equal number of annihilation and creation operators, as if they createand then annihilate the same numbers and types of particles. As shown

in subsection 7.2.7, elastic potentials correspond to particle interactions fa-miliar from ordinary quantum mechanics and classical physics.28 Inelastic

27This follows from condition (I) on the functions ζ i (see page 366).28In section 9.3, we will discuss in detail potentials with the operator structure d†a†da.

They describe pairwise interactions between electrons and protons.

Page 401: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 401/687

9.2. DRESSING TRANSFORMATION 371

potentials change the number and/or types of particles. Among 2nd order

inelastic potentials in RQD there are potentials for pair creation , annihila-tion , and pair conversion .Similarly to the 2nd order discussed above, the third-order interaction V d3

can be explicitly obtained near the energy shell by setting ζ 3 ≈ 1. All 3rdorder potentials are inelastic. Two of them are shown in Table 9.1: The termd†a†c†da (bremsstrahlung ) describes creation of a photon in a proton-electroncollision. In the language of classical electrodynamics, this can be interpretedas radiation due to acceleration of charged particles and is referred to as theradiation reaction force. The Hermitian-conjugated term d†a†dac describesabsorption of a photon by a colliding pair of charged particles.

Table 9.1: Examples of interaction potentials in RQD. Bold numbers in thethird column indicate perturbation orders in which explicit interaction oper-ators can be unambiguously obtained near the energy shell as discussed insubsection 9.2.9.Operator Physical meaning Perturbation

OrdersElastic potentials

a†a†aa e− − e− potential 2, 4, 6, . . .d†a†da e− − p+ potential 2, 4, 6, . . .a†c†ac e− − γ potential 2, 4, 6, . . .

a†a†a†aaa e− − e− − e− potential 4, 6, . . .Inelastic potentialsa†b†cc e− − e+ pair creation 2, 4, 6, . . .c†c†ab e− − e+ annihilation 2, 4, 6, . . .d†f †ab conversion of e− − e+ pair to p− − p+ pair 2, 4, 6, . . .d†a†c†da e− − p+ bremsstrahlung 3, 5, . . .d†a†dac photon absorption in e− − p+ collision 3, 5, . . .a†a†a†b†aa pair creation in e− − e− collision 4, 6, . . .

The situation is less certain for the 4th and higher order dressed particle

interactions. Near the energy shell we can again set ζ 4 ≈ 1 and write (fromeq. (9.32))

V d4 (t) ≈ (F c4 ) ph(t) +1

2[V d2 (t), V d2 (t)] (9.36)

Page 402: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 402/687

372 CHAPTER 9. THE DRESSED PARTICLE APPROACH

The operator V d4 obtained by this formula is a sum of various interaction

potentials (some of them are shown in Table 9.1)

V d4 = d†a†da + a†a†a†b†aa + . . . (9.37)

The contribution (F c4 ) ph in eq. (9.36) is well-defined near the energy shell,because we assume exact knowledge of the S -operator of renormalized QEDin all perturbation orders. However, this is not true for the contribution12 [V d2 , V d2 ]. This commutator depends on the behavior of V d2 everywhere inthe momentum space. So, it depends on our choice of ζ 2 outside the energyshell. The function ζ 2 satisfies conditions (I) - (IV), but still there is a great

freedom which is reflected in the uncertainty of V d4 even on the energy shell.Therefore, we have two possibilities depending on the operator structure of the 4th order potential we are interested in.

First, there are potentials contained only in the term (F c4 ) ph and notpresent in the commutator 1

2[V d2 , V d2 ]. For example, this commutator does

not contain the contribution a†a†a†b†aa responsible for the creation of anelectron-positron pair in two-electron collisions. For such potentials, their4th order expression near the energy shell can be explicitly obtained fromformula (9.36).

Second, there are potentials V d4 whose contributions come from both two

terms on the right hand side of eq. (9.36). For such potentials, the secondterm on the right hand side of (9.36) remains uncertain. One example is theelectron-proton interaction d†a†da.

To summarize, we see that in interaction operators of higher perturba-tion orders there are more and more terms with increasing complexity. Incontrast to the QED Hamiltonian H c, there seems to be no way to write H d

in a closed form. However, to the credit of RQD, all these high order termsdirectly reflect real interactions and processes observable in nature. Unfortu-nately, the above construction of the dressed particle Hamiltonian does notallow us to obtain full information about V d: The off-shell behavior of poten-tials is fairly arbitrary, and the on-shell behavior29 can be determined only for

lowest order terms. However, this uncertainty is perfectly understandable:It simply reflects the one-to-many correspondence between the S -operatorand Hamiltonians. It means that there is a class of finite phys interactions

29which is the most relevant for the comparison with experiments

Page 403: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 403/687

9.2. DRESSING TRANSFORMATION 373

V d all of which can be used for S -matrix calculations without encounter-

ing divergent integrals. Then which member of the class V

d

is the uniquecorrect interaction Hamiltonian V d? As we are not aware of any theoreticalcondition allowing to determine the off-energy-shell behavior of ζ i, this ques-tion should be deferred to the experiment. There seems to be no other waybut to fit functions ζ i to experimental measurements. Such experiments arebound to be rather challenging because they should go beyond usual infor-mation contained in the S -operator (scattering cross-sections, energies andlifetimes of bound states, etc.) and should be capable of measuring radiativecorrections to the wave functions and time evolution of observables in theregion of interaction. Current experiments do not have sufficient resolutionto meet this challenge.

9.2.10 Comparison with other dressed particle approaches

In this subsection, we would like to discuss another point of view on thedressing transformation. This point of view is philosophically different butmathematically equivalent to ours. It is exemplified by the works of Shirokovand coauthors [105, 106, 107]. In contrast to our approach in which thedressing transformation eiΦ was applied to the field-theoretical HamiltonianH c of QED while (bare) particle creation and annihilation operators werenot affected, they kept the H c intact, but applied the (inverse) dressingtransformation e

−iΦ to creation and annihilation operators of particles

α†d = e−iΦα†eiΦ

αd = e−iΦαeiΦ

and to the vacuum state

|0d = e−iΦ|0

Physically, this means that instead of bare particles (created and annihi-lated by α† and α, respectively) the theory is formulated in terms of fullydressed particles, i.e., particles together with their virtual clouds. Withinthis approach the Hamiltonian H c must be expressed as a function of thenew particle operators H c = F (α†

d, αd). Apparently, the same function F

Page 404: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 404/687

374 CHAPTER 9. THE DRESSED PARTICLE APPROACH

expresses the Hamiltonian H d of our approach through original (bare) par-

ticle operators: H

d

= F (α†, α). Indeed, from eq. (9.7) we can write

H c = e−iΦH deiΦ

= e−iΦF (α†, α)eiΦ

= F (e−iΦα†e−iΦ, e−iΦαeiΦ)

= F (α†d, αd)

So, mathematically, these two approaches are equivalent. Let us demonstratethis equivalence on a simple example. Suppose we want to calculate a tra-

jectory30 of the electron in a 2-particle system (electron + proton). In ourapproach the initial state of the system has two particles

|Ψ = a†d†|0and the expectation value of the electron’s position is given by formula

r(t) = Ψ|R(t)|Ψ=

0

|dae

iH dtRe− i

H dta†d†

|0

= 0|da(eiΦe iH cte−iΦ)R(eiΦe− iH cte−iΦ)a†d†|0 (9.38)

where R is the position operator for the electron. However, we can rewritethis expression in the following form characteristic for the Shirokov’s ap-proach

r(t) = 0|eiΦ(e−iΦdaeiΦ)eiH ct(e−iΦReiΦ)e− i

H ct(e−iΦa†d†eiΦ)e−iΦ|0

= d0|ddadeiH ctRde− i

H cta†

dd†d|0d

= d0|ddadRd(t)a†dd†

d|0d (9.39)

where the time evolution is generated by the original Hamiltonian H c, while“dressed” definitions are used for the vacuum state, particle operators, andthe position observable

30the time dependence of the expectation value of the position operator

Page 405: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 405/687

9.3. THE COULOMB POTENTIAL AND BEYOND 375

|0d = e−iΦ|0ad, dd, a†

d, d†d = e−iΦa,d,a†, d†eiΦ

Rd = e−iΦReiΦ

In spite of different formalisms, physical predictions of both theories, e.g.,expectation values of observables (9.38) and (9.39), are exactly the same.

An interesting and somewhat related approach to particle interactions inQFT was recently developed by Weber and co-authors [117, 118, 119, 120].

9.3 The Coulomb potential and beyondIn the preceding section we obtained formulas for the dressed particle Hamil-tonian H d in a rather abstract form.31 In this section we would like todemonstrate how this Hamiltonian can be cast into the form suitable forcalculations, i.e., expressed through creation and annihilation operators of electrons, protons, photons, etc. It would be very difficult to explore allkinds of RQD interactions (e.g., those in Table 9.1) in one book. So, here wewill focus on pair interactions between electrons and protons in the lowest(second) order of the perturbation theory. In the c−2 approximation we willobtain what is commonly known as the Breit potential . The major part of this

interaction is the usual Coulomb potential. In addition, there are relativisticcorrections responsible for magnetic, contact, spin-orbit, spin-spin and otherinteractions which are routinely used in relativistic calculations of atomic andmolecular systems. This derivation demonstrates how formulas familiar fromordinary quantum mechanics and classical electrodynamics follow naturallyfrom RQD. Applications of the Hamiltonian H d to the hydrogen atom arebriefly discussed in subsection 9.3.4.

9.3.1 Electron-proton interaction in the 2nd order

The second-order dressed interaction operator near the energy shell is (9.35)

V d2 (t) ≈ (V c2 ) ph(t) − 1

2[V c1 (t), V c1 (t)] ph

31see, e.g., eqs. (9.35) and (9.36).

Page 406: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 406/687

376 CHAPTER 9. THE DRESSED PARTICLE APPROACH

We may notice that the term V c1 (t) is the same as in the original (non-

renormalized) QED Hamiltonian (8.16)

V c1 (t) = V 1(t)

because renormalization counterterms appear only in the second and higherorders. Phys counterterms appear only in the 4th and higher orders, so(V c2 ) ph(t) = V ph2 (t). Thus, in our calculations we can use original expressionsfor interactions (M.5) and (M.8) and write

V d2 (t) ≈ V ph2 (t) − 1

2[V 1(t), V 1(t)] ph (9.40)

We can set t = 0, use notation and results (8.38), (8.43) - (8.44), (L.9) (M.8)and obtain the following expression for the electron-proton interaction of thetype d†a†da

V d2 [d†a†da] = (V d2 )A + (V d2 )B + (V d2 )C (9.41)

where

(V d2 )A =e2 2c2

(2π )3

dkdedpMmc4

ωeωe+kΩpΩp−k

1

k2ǫǫ′ππ′

W 0(p − k, π; p, π′)U 0(e + k, ǫ; e, ǫ′) ×

d†p−k,πa†

e+k,ǫdp,π′ae,ǫ′ (9.42)

(V d2 )B = − e2 2c2

(2π )3

dkdedpMmc4 ωeωe+kΩpΩp−k

1

(e + k ÷ e)2ǫǫ′ππ′

W(p − k, π; p, π′) · U(e + k, ǫ; e, ǫ′) ×

d†p−k,πa†

e+k,ǫdp,π′ae,ǫ′ (9.43)

(V

d

2 )C =

e2 2c2

(2π )3 dkdedpMmc4 ωeωe+kΩpΩp−k

1

(e + k ÷ e)2|k|2ǫǫ′ππ′

(k · W(p − k, π; p, π′)(k · U(e + k, ǫ; e, ǫ′)) ×

d†p−k,πa†

e+k,ǫdp,π′ae,ǫ′ (9.44)

Page 407: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 407/687

9.3. THE COULOMB POTENTIAL AND BEYOND 377

In these formulas we integrate over the electron (e), proton (p), and trans-

ferred (k) momenta, and sum over spin projections of the two particles ǫ, ǫ′and π, π′.

9.3.2 Electron-proton potential in the momentum space

Operator (9.41) has non-trivial action in all sectors of the Fock space whichcontain at least one electron and one proton. However, for simplicity, we willlimit our attention to the “1 proton + 1 electron” subspace H pe in the Fockspace. If Ψ(p, π; e, ǫ) is the wave function of a two-particle state, then theinteraction Hamiltonian (9.41) will transform it to32

Ψ′(p, π, e, ǫ) = V d2 [d†a†da]Ψ(p, π; e, ǫ)

=π′ǫ′

dkv2(p, e, k; π,ǫ,π′, ǫ′)Ψ(p − k, π′; e + k, ǫ′)(9.45)

where v2 is the coefficient function of the interaction operator V d2 [d†a†da].We are going to write our formulas with the accuracy of c−2. So, we can use(8.42), (8.47) - (8.52), expand the denominators in (9.42) - (9.44)

Mmc4 Ωp−kωe+kΩpωe

≈ 1 1 + (p−k)2

2M 2c2

1 1 + p2

2M 2c2

1 1 + (e+k)2

2m2c2

1 1 + e2

2m2c2

≈ 1 − (p − k)2

4M 2c2− p2

4M 2c2− (e + k)2

4m2c2− e2

4m2c2

= 1 − p2

2M 2c2+

pk

2M 2c2− k2

4M 2c2− e2

2m2c2− ek

2m2c2− k2

4m2c2

and obtain

v2 = (v2)A + (v2)B + (v2)C (9.46)

where32See discussion in subsection 7.2.7.

Page 408: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 408/687

378 CHAPTER 9. THE DRESSED PARTICLE APPROACH

(v2)A(p, e, k; π,ǫ,π′, ǫ′)

= − e2 2

(2π )3|k|2χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′

(1 − p2

2M 2c2+

pk

2M 2c2− k2

4M 2c2− e2

2m2c2− ek

2m2c2− k2

4m2c2)

(1 +(2p − k)2 − 2iσ pr · [k × p]

8M 2c2)(1 +

(2e + k)2 + 2iσel · [k × e]

8m2c2)

= − e2 2

(2π )3|k|2χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′

(1 −p2

2M 2c2 +pk

2M 2c2 −k2

4M 2c2 −e2

2m2c2 −ek

2m2c2 −k2

4m2c2

+p2

2M 2c2− pk

2M 2c2+

k2

8M 2c2+

e2

2m2c2+

ek

2m2c2+

k2

8m2c2− i

σ pr[k × p]

4M 2c2+ i

σel[k × e

4m2c2

= − e2 2

(2π )3|k|2χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′ (1 − k2

8M 2c2− k2

8m2c2− i

σ pr[k × p]

4M 2c2+ i

σel[k × e]

4m2c2)

= − e2 2

(2π )3χ(el)∗ǫ χ( pr)∗π χ

(el)ǫ′ χ

( pr)π′ (

1

|k|2 − 1

8M 2c2− 1

8m2c2− i

σ pr[k × p]

4M 2c2|k|2 + iσel[k × e]

4m2c2|k|2 )

(v2)B(p, e, k; π,ǫ,π′, ǫ′)

=e2 2

(2π )3|k|2χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′

1

4Mmc2(2p − k − i[σ pr × k])(2e + k + i[σel × k])

=e2 2

(2π )3|k|2χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′

1

4Mmc2(4pe − 2ke + 2pk − k2

− 2i[σ pr × k] · e + 2ip · [σel × k] + [σ pr × k] · [σel × k])

=e2 2

(2π )3χ(el)∗ǫ χ( pr)∗π χ

(el)ǫ′ χ

( pr)π′ (

pe

Mmc2|k|2 − ke

2Mmc2|k|2 +pk

2Mmc2|k|2

− 1

4Mmc2− i[σ pr × k] · e

2Mmc2|k|2 +ip · [σel × k]

2Mmc2|k|2 +[σ pr × k] · [σel × k])

4Mmc2|k|2 )

(v2)C (p, e, k; π,ǫ,π′, ǫ′)

= − e2 2

(2π )3|k|4χ(el)∗ǫ χ( pr)∗π χ(el)ǫ′ χ

( pr)π′

1

4Mmc2(2pk − k2)(2ek + k2)

Page 409: Relativistic Quantum

8/22/2019 Relativistic Quantum

http://slidepdf.com/reader/full/relativistic-quantum 409/687

9.3. THE COULOMB POTENTIAL AND BEYOND 379

= − e2 2

(2π )3χ(el)∗ǫ χ( pr)∗π χ

(el)ǫ′ χ

( pr)π′ (

(pk)(ek)

Mmc2

|k|4

− ek

2Mmc2

|k|2

+pk

2Mmc2

|k|2

− 1

4Mmc2)

Further, we use33

[σ pr × k] · [σel × k] = [[σel × k] × σ pr] · k

= (k(σel · σ pr) − σ pr(σel · k)) · k

= |k|2(σel · σ pr) − (σ pr · k)(σel · k)

to write (9.46) in the form (compare with §83 in ref. [85])

v2(p, e, k)

=e2 2

(2π )3

− 1

|k|2 +1

8M 2c2+

1

8m2c2+

pe

Mmc2|k|2 − (pk)(ek)

Mmc2|k|4

+ iσ pr[k × p]

4M 2c2|k|2 − iσel[k × e]

4m2c2|k|2 − iσ pr[k × e]

2Mmc2|k|2