Ch1 Lorentz Group & Lorentz Invariant

Embed Size (px)

Citation preview

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    1/32

    Chapter 1

    Lorentz Group and LorentzInvariance

    In studying Lorentz-invariant wave equations, it is essential that we put our under-standing of the Lorentz group on firm ground. We first define the Lorentz transfor-mation as any transformation that keeps the 4-vector inner product invariant, andproceed to classify such transformations according to the determinant of the transfor-mation matrix and the sign of the time component. We then introduce the generatorsof the Lorentz group by which any Lorentz transformation continuously connected tothe identity can be written in an exponential form. The generators of the Lorentzgroup will later play a critical role in finding the transformation property of the Diracspinors.

    1.1 Lorentz Boost

    Throughout this book, we will use a unit system in which the speed of light c is unity.This may be accomplished for example by taking the unit of time to be one secondand that of length to be 2.99792458 1010 cm (this number is exact1), or taking theunit of length to be 1 cm and that of time to be (2 .99792458 1010)1 second. Howit is accomplished is irrelevant at this point.

    Suppose an inertial frame K (space-time coordinates labeled by t,x,y,z) is mov-ing with velocity in another inertial frame K (space-time coordinates labeled byt, x, y, z) as shown in Figure 1.1. The 3-component velocity of the origin of K

    1One cm is defined (1983) such that the speed of light in vacuum is 2.99792458 1010 cm persecond, where one second is defined (1967) to be 9192631770 times the frequency of the hyper-finesplitting of the Cs133 ground state.

    1

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    2/32

    2 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    K' K

    y'

    x'

    y

    x

    K'K

    (E',P') (E,P)

    K' Kframe frame

    Figure 1.1: The origin of frame K is moving with velocity = (, 0, 0) in frame K,

    and the origin of frame K is moving with velocity in frame K. The axes x andx are parallel in both frames, and similarly for y and z axes. A particle has energymomentum (E, P) in frame K and (E, P) in frame K.

    measured in the frame K, K, is taken to be in the +x direction; namely,

    K (velocity of K in K) = (, 0, 0)

    def . (1.1)Assume that, in the frame K, the axes x,y,z are parallel to the axes x, y, z. Then,the velocity of the origin of K in K, K , is

    K = K = (, 0, 0) (velocity ofK in K). (1.2)Note that K (K) is measured with respect to the axes of K

    (K).

    If a particle (or any system) has energy and momentum (E,P) in the frame K,

    then the energy and momentum (E, P) of the same particle viewed in the frame K

    are given by

    E =E+ Px

    1 2 , P

    y = Py ,

    Px =E + Px

    1 2 , P

    z = Pz, .

    (1.3)

    This can be written in a matrix form asE

    Px

    =

    E

    Px

    ,

    PyPz

    =

    PyPz

    (1.4)

    with

    11 2 , =

    1 2 . (1.5)

    Note that and are related by

    2 2 = 1 . (1.6)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    3/32

    1.1. LORENTZ BOOST 3

    K' K

    y'

    x'

    y

    x

    K'K

    (E',P') (E,P)

    K'frame Kframe

    Figure 1.2: Starting from the configuration of Figure 1.1, the same rotation is appliedto the axes in each frame. The resulting transformation represents a general Lorentzboost.

    Now start from Figure 1.1 and apply the same rotation to the axes of K and Kwithin each frame (Figure 1.2). Suppose the rotation is represented by a 3 3 matrixR. Then, the velocity of K in K, K, and and the velocity of K in K

    , K, arerotated by the same matrix R,

    K RK , K RK , (1.7)and thus we still have

    K = Kdef , (1.8)

    where we have also redefined the vector which is well-defined in both K and K

    frames in terms of K

    and K

    , respectively. The transformation in this case can

    be obtained by noting that, in (1.4), the component of momentum transverse to

    does not change and that Px, P

    x are the components of P, P along in each frame.

    Namely, the tranformation can be written as

    E

    P

    =

    E

    P

    , P = P , (1.9)

    where and denote components parallel and perpecdicular to , repectively. Note

    that P and P are 3-component quantities and the relation P = P holds compo-

    nent by component because we have applied the same roation R in each frame.The axes of K viewed in the frame K are no longer perpendicular to each other

    since they are contracted in the direction of K. Thus, the axes of K in generalare not parallel to the corresponding axes of K at any time. However, since thesame rotation is applied in each frame, and since components transverse to are thesame in both frames, the corresponding axes of K and K are exactly parallel whenprojected onto a plane perpendicular to in either frames. The transformation (1.9)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    4/32

    4 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    is thus correct for the specific relative oprientation of two frames as defined here,and such transformation is called a Lorentz boost, which is a special case of Lorentztransformation defined later in this chapter for which the relative orientation of the

    two frames is arbitrary.

    1.2 4-vectors and the metric tensor g

    The quantity E2 P2 is invariant under the Lorentz boost (1.9); namely, it has thesame numerical value in K and K:

    E2 P2 = E2 (P2 + P2 )= (E + P)

    2 (E + P)

    2 + P2

    = (2

    2)

    1

    E2 + (2

    2) 1

    P2

    P2

    = E2 P2 ,

    (1.10)

    which is the invariant mass squared m2 of the system. This invariance applies to anynumber of particles or any object as long as E and P refer to the same object.

    The relative minus sign between E2 and P2 above can be treated elegantly asfollows. Define a 4-vector P ( = 0, 1, 2, 3) by

    P = (P0, P1, P3, P4)def (E, Px, Py, Pz) = (E, P) (1.11)

    called an energy-momentum 4-vector where the index is called the Lorentz index(or the space-time index). The = 0 component of a 4-vector is often called timecomponent, and the = 1, 2, 3 components space components.

    Define the inner product (or dot product) A B of two 4-vectors A = (A0, A)and B = (B0, B) by

    A B def A0B0 A B = A0B0 A1B1 A2B2 A3B3 . (1.12)Then, P2 P P is nothing but m2:

    P2 = P02 P2 = E2 P2 = m2 (1.13)

    which is invariant under Lorentz boost. This inner product P P is similar to thenorm x2 of an ordinary 3-dimensional vector x, which is invariant under rotation,except for the minus signs for the space components in the definition of the innerproduct. In order to handle these minus signs conveniently, we define subscriptedcomponents of a 4-vector by

    A0 = A0, Ai = Ai (i = 1, 2, 3) . (1.14)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    5/32

    1.2. 4-VECTORS AND THE METRIC TENSOR G 5

    Then the inner product (1.12) can be written as

    A

    B = A0B

    0 + A1B1 + A2B

    2 + A3B3 def

    AB

    = AB , (1.15)

    where we have used the convention that when a pair of the same index appears in thesame term, then summation over all possible values of the index ( = 0, 1, 2, 3 in thiscase) is implied. In general, we will use Roman letters for space indices (take values1,2,3) and greek letters for space-time (Lorentz) indices (take values 0,1,2,3). Thus,

    xiyi =3

    i=1

    xiyi (= x y), (A + B)C =3

    =0

    (A + B)C , (1.16)

    but no sum over or in

    AB

    + CD

    (, not in the same term). (1.17)

    When a pair of Lorentz indices is summed over, usually one index is subscript andthe other is superscript. Such indices are said to be contracted. Also, it is importantthat there is only one pair of a given index per term. We do not consider impliedsummations such as ABC to be well-defined. [(A

    + B)C is well-defined sinceit is equal to AC + B

    C.]Now, define the metric tensor g by

    g00 = g11 = g22 = g33 = 1, g = 0 ( = ) (1.18)

    which is symmetric: g = g . (1.19)

    The corresponding matrix G is defined as

    {g} def

    0 1 2 3 =

    0

    1

    2

    3

    1 0 0 0

    0 1 0 00 0 1 00 0 0 1

    ||

    def G (1.20)

    When we form a matrix out of a quantity with two indices, by definition we take thefirst index to increase downward, and the second to increase to the right.

    As defined in (1.14) for a 4-vector, switching an index between superscript andsubscript results in a sign change when the index is 1,2, or 3, while the sign isunchanged when the index is zero. We adopt the same rule for the indices of g. In

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    6/32

    6 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    fact, from now on, we enforce the same rule for all space-time indices (unless otherwisestated, such as for the Kronecker delta below). Then we have

    g = g, g = g = (1.21)

    where is the Kroneckers delta ( = 1 if = , 0 otherwise) which we define tohave only subscripts. Then, g can be used together with contraction to lower orraise indices:

    A = gA, A = gA (1.22)

    which are equivalent to the rule (1.14).The inner product of 4-vectors A and B (1.12) can also be written in matrix form

    as

    A B = AgB =(A

    0

    A1

    A2

    A3

    )1 0 0 00 1 0 00 0 1 00 0 0 1

    B

    0

    B1

    B2

    B3

    = ATGB . (1.23)

    When we use 4-vectors in matrix form, they are understood to be column vectorswith superscripts, while their transpose are row vectors:

    Adef

    A0

    A1

    A2

    A3

    =

    A0

    AxAyAz

    , AT = (A0, A1, A2, A3) (in matrix form). (1.24)

    1.3 Lorentz group

    The Lorentz boost (1.4) can be written in matrix form as

    P = P (1.25)

    with

    P =

    E

    P

    x

    PyPz

    , = 0 0

    0 00 0 1 0

    0 0 0 1

    , P = E

    PxPyPz

    . (1.26)

    In terms of components, this can be written as

    P = P , (1.27)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    7/32

    1.3. LORENTZ GROUP 7

    where we have defined the components of the matrix by taking the first index to besuperscript and the second to be subscript (still the first index increases downwardand the second index increases to the right):

    def {} =

    0 1 2 3 =

    0

    1

    2

    3

    0 0

    0 0

    0 0 1 0

    0 0 0 1

    ||

    (1.28)

    For example, 01 = and thus 01 =

    , etc. The superscript and subscript in

    (1.27) were chosen such that the index is contracted and that the index on bothsides of the equality has consistent position, namely, both are superscript.

    We have seen that P2 = E2 P2 is invariant under the Lorentz boost given by(1.4) or (1.9). We will now find the necessary and sufficient condition for a 4 4matrix to leave the inner product of two 4-vectors invariant. Suppose A and B

    transform by the same matrix :

    A = A, B = B

    . (1.29)

    Then the inner products A B and A B can be written using (1.22) as

    A B = Ag A

    A

    BB

    = (g

    )AB

    A B = AgA

    B = gAB .

    (1.30)

    In order for A B = A B to hold for any A and B, the coefficients of AB shouldbe the same term by term (To see this, set A = 1 for = and 0 for all else, andB = 1 for = and 0 for all else.):

    g

    = g . (1.31)

    On the other hand, if satisfies this condition, the same derivation above can betraced backward to show that the inner product A B defined by (1.12) is invariant.Thus, (1.31) is the necessary and sufficient condition.

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    8/32

    8 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    What does the condition (1.31) tell us about the nature of the matrix ? Using(1.22), we have g

    = , then the condition becomes

    = graise on both sides = g (= ) . (1.32)

    Comparing this with the definition of the inverse transformation 1:

    (1)

    = , (1.33)

    we see that the inverse matrix of is obtained by

    (1) = , (1.34)

    which means that one simply has to change the sign of the components for whichonly one of the indices is zero (namely, 0i and i0) and then transpose it:

    =

    00 01

    02

    03

    10 11

    12

    13

    20 21

    22

    23

    30 31

    32

    33

    , 1 =

    00 10 20 3001 11 21 3102 12 22 3203 13 23 33

    . (1.35)

    Thus, the set of matrices that keep the inner product of 4-vectors invariant is made ofmatrices that become their own inverse when the signs of components with one timeindex are flipped and then transposed. As we will see below, such set of matrices

    forms a group, called the Lorentz group, and any such transformation [namely, onethat keeps the 4-vector inner product invariant, or equivalently that satisfies thecondition (1.31)] is defined as a Lorentz transformation.

    To show that such set of matrices forms a group, it is convenient to write thecondition (1.31) in matrix form. Noting that when written in terms of components,we can change the ordering in any way we want, the condition can be written as

    g

    = g, or TG = G . (1.36)

    A set forms a group when for any two elements of the set x1 and x2, a productx1x2 can be defined such that

    1. (Closure) The product x1x2 also belongs to the set.

    2. (Associativity) For any elements x1, x2 and x3, (x1x2)x3 = x1(x2x3).

    3. (Identity) There exists an element I in the set that satisfies Ix = xI = x forany element x.

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    9/32

    1.3. LORENTZ GROUP 9

    4. (Inverse) For any element x, there exists an element x1 in the set that satisfiesx1x = xx1 = I.

    In our case at hand, the set is all 4 4 matrices that satisfy T

    G = G, andwe take the ordinary matrix multiplication as the product which defines the group.The proof is straightforward:

    1. Suppose 1 and 2 belong to the set (i.e. T1 G1 = G and

    T2 G2 = G). Then,

    (12)TG(12) =

    T2

    T1 G1

    G

    2 = T2 G2 = G. (1.37)

    Thus, the product 12 also belongs to the set.

    2. The matrix multiplication is of course associative: (12)3 = 1(23).

    3. The identity matrix I (I = ) belongs to the set (ITGI = G), and satisfies

    I = I = for any element.

    4. We have already seen that if a 4 4 matrix satisfies TG = G, then itsinverse exists as given by (1.34). It is instructive, however, to prove it moreformally. Taking the determinant of TG = G,

    detT det

    det G 1

    det = det G 1

    (det )2 = 1 , (1.38)

    where we have used the property of determinant

    det(MN) = det Mdet N (1.39)

    with M and N being square matrices of same rank. Thus, det = 0 andtherefore its inverse 1 exists. Also, it belongs to the set: multiplying TG =G by (1)T from the left and by 1 from the right,

    (1)TT (1

    I)T

    G 1 I

    = (1)TG1 (1)TG1 = G . (1.40)

    This completes the proof that s that satisfy (1.36) form a group.

    Since the inverse of a Lorentz transformation is also a Lorentz transformation, itshould satisfy the condition (1.31)

    g = g(1)

    (1)

    = g

    g = g , (1.41)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    10/32

    10 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    where we have used the inversion rule (1.34). The formulas (1.31), (1.41), and theirvariations are then summarized as follows: on the left hand side of the form g = g,an index of g (call it ) is contracted with an index of a and the other index of

    g (call it ) with an index of the other . As long as and are both first orboth second indices on the s, and as long as the rest of the indices are the same(including superscript/subscript) on both sides of the equality, any possible way ofindexing gives a correct formula. Similarly, on the left hand side of the form = g,an index of a and an index of the other are contracted. As long as the contractedindices are both first or both second indices on s, and as long as the rest of theindices are the same on both sides of the equality, any possible way of indexing givesa correct formula.

    A natural question at this point is whether the Lorentz group defined in thisway is any larger than the set of Lorentz boosts defined by (1.9). The answer is

    yes. Clearly, any rotation in the 3-dimensional space keepsA

    B invariant while itdoes not change the time components A0 and B0. Thus, it keeps the 4-vector inner

    product A B = A0B0 A B invariant, and as a result it belongs to the Lorentzgroup by definition. On the other hand, the only way the boost (1.9) does not changethe time component is to set = 0 in which case the transformation is the identitytransformation. Thus, any finite rotation in the 3-dimensional space is not a boost.

    Furthermore, the time reversal T and the space inversion P defined by

    Tdef {T} def

    11

    1

    1

    , Pdef {P} def

    1

    1

    1

    1

    (1.42)

    satisfyTTGT = G, PTGP = G , (1.43)

    and thus belong to the Lorentz group. Even though the matrix P has the samenumerical form as G, it should be noted that P is a Lorentz transformation but Gis not (it is a metric). The difference is also reflected in the fact that the matrixP is defined by the first index being superscript and the second subscript (becauseit is a Lorentz transformation), while the matrix G is defined by both indices beingsubscript (or both superscript).

    As we will see later, boosts and rotations can be formed by consecutive infinitesi-mal transformations starting from identity I (they are continuously connected to I),while T and P cannot (they are disconnected from I, or said to be discrete trans-formations). Any product of boosts, rotation, T, and P belongs to the Lorentz group,and it turns out that they saturate the Lorentz group. Thus, we write symbolically

    Lorentz group = boost + rotation + T + P . (1.44)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    11/32

    1.4. CLASSIFICATION OF LORENTZ TRANSFORMATIONS 11

    Later, we will see that any Lorentz transformation continuously connected to I is aboost, a rotation, or a combination thereof.

    If the origins of the inertial frames K and K touch at t = t = 0 and x = x = 0,

    the coordinate x = (t, x) of any event transforms in the same way as P:

    x = x . (1.45)

    This can be extended to include space-time translation between the two frames:

    x = x + a , (1.46)

    where a is a constant 4-vector. The transformation of energy-momentum is notaffected by the space-time translation, and is still given by P = P. Such transfor-mations that include space-time translation also form a group and called the inho-mogeneous Lorentz group or the Poincare group. The group formed by the trans-formations with a = 0 is sometimes called the homogeneous Lorentz group. Unlessotherwise stated, we will deal with the homogeneous Lorentz group; namely withoutspace-time translation.

    1.4 Classification of Lorentz transformations

    Up to this point, we have not specified that Lorentz transformations are real (namely,all the elements are real). In fact, Lorentz transformations as defined by (1.31) ingeneral can be complex and the complex Lorentz transformations plays an impor-tant role in a formal proof of an important symmetry theorem called CP T theoremwhich states that the laws of physics are invariant under the combination of particle-antiparticle exchange (C), mirror inversion (P), and time reversal (T) under certainnatural assumptions. In this book, however, we will assume that Lorentz transfor-mations are real.

    As seen in (1.38), all Lorentz transformation satisfy (det )2 = 1, or equivalently,det = +1 or 1. We define proper and improper Lorentz transformations as

    det = +1 : proper

    det = 1 : improper . (1.47)

    Since det(12) = det1 det2, the product of two proper transformations or twoimproper transformations is proper, while the product of a proper transformation anda improper transformation is improper.

    Next, look at the (, ) = (0, 0) component of the defining condition g

    =g:

    g0

    0 = g00 = 1 , (00)2

    3i=1

    (i0)2 = 1 (1.48)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    12/32

    12 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    001

    orthochronous001

    non-orthochronous

    det=+1proper (po) T P(po)

    det=1

    improper P(po) T(po)

    Table 1.1: Classification of the Lorentz group. (po) is any proper and orthochronousLorentz transformation.

    or

    (

    0

    0)

    2

    = 1 +

    3

    i=1(

    i

    0)

    2

    1 , (1.49)which means 00 1 or 00 1, and this defines the orthochronous and non-orthochronous Lorentz transformations:

    00 1 : orthochronous00 1 : non-orthochronous . (1.50)

    It is easy to show that the product of two orthochronous transformations or two non-orthochronous transformations is orthochronous, and the product of an orthochronoustransformation and a non-orthochronous transformation is non-orthochronous.

    From the definitions (1.42) and I = , we have

    det I = det(T P) = +1, det T = det P = 1,I00 = P

    00 = +1, T

    00 = (T P)

    00 = 1

    (1.51)

    Thus, the identity I is proper and orthochronous, P is improper and orthochronous,T is improper and non-orthochronous, and T P is proper and non-orthochronous. Ac-cordingly, we can multiply any proper and orthochronous transformations by each ofthese to form four sets of transformations of given properness and orthochronousnessas shown in Table 1.1. Any Lorentz transformation is proper or improper (i.e. det =1) and orthochronous or non-orthochronous (i.e. |00|2 1). Since any transforma-tion that is not proper and orthochronous can be made proper and orthochronous by

    multiplying T, P or T P, the four forms of transformations in Table 1.1 saturate the

    Lorentz group. For example, if is improper and orthochronous, then Pdef (po)

    is proper and orthochronous, and can be written as = P P = P(po).It is straightforward to show that the set of proper transformations and the set

    of orthochronous transformations separately form a group, and that proper and or-thochronous transformations by themselves form a group. Also, the set of proper

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    13/32

    1.4. CLASSIFICATION OF LORENTZ TRANSFORMATIONS 13

    and orthochronous transformations and the set of improper and non-orthochronoustransformations together form a group.

    Exercise 1.1 Classification of Lorentz transformations.(a) Suppose = AB where , A, and B are Lorentz transformations. Prove that is orthochronous if A and B are both orthochronous or both non-orthochronous,and that is non-orthochronous if one of A and B is orthochronous and the other isnon-orthochronous.[hint: Note that we can write 00 = A

    00B

    00 + a b with a (A01, A02, A03) and

    b (B10, B20, B30). Then use |a b| |a||b|. Also, one can derive a2 = A002 1 andb2 = B00

    2 1. ](b) Show that the following sets of Lorentz transformations each form a group:

    1. proper transformations

    2. orthochronous transformations

    3. proper and orthochronous transformations

    4. proper and orthochronous transformations plus improper and non-orthochronoustransformations

    As mentioned earlier (and as will be shown later) boosts and rotations are con-tinuously connected to the identity. Are they then proper and orthochronous? Toshow that this is the case, it suffices to prove that an infinitesimal transformation

    can change det and 00 only infinitesimally, since then multiplying an infinitesimaltransformation cannot jump across the gap between det = +1 and det = 1 or

    the gap between 00 1 and 00 1.An infinitesimal transformation is a transformation that is very close to the iden-

    tity I and any such transformation can be written as

    = I + dH (1.52)

    where d is a small number and H is a 44 matrix of order unity meaning the maximumof the absolute values of its elements is about 1. To be specific, we could define it suchthat max, |H| = 1 and d 0, which uniquely defines the decomposition above.We want to show that for any Lorentz transformation , multiplying I+ dH changesthe determinant or the (0, 0) component only infinitesimally; namely, the differencesvanish as we take d to zero.

    The determinant of a n n matrix A is defined by

    det Adef

    permutations

    si1,i2,...inAi11Ai22 . . . Ainn (1.53)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    14/32

    14 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    where the sum is taken over (i1, i2, . . . in) which is any permutation of (1, 2, . . . n),and si1,i2,...in is 1(1) if (i1, i2, . . . in) is an even(odd) permutation. When applied to4

    4 Lorentz transformations, this can be written as

    det Adef A0A1A2A3 , (1.54)

    where the implicit sum is over , , , = 0, 1, 2, 3 and is the totally anti-symmetric 4-th rank tensor defined by

    def

    =

    +1

    1

    if () is an

    even

    odd

    permutation of (0, 1, 2, 3)

    = 0 if any of are equal

    (1.55)

    The standard superscript/subscript rule applies to the indices of; namely, 0123 =

    0123 = 1, etc. Then, it is easy to show thatdet(I + dH) = 1 + d TrH+ (higher orders in d) , (1.56)

    where the trace of a matrix A is defined as the sum of the diagonal elements:

    TrAdef

    3=0

    A . (1.57)

    Exercise 1.2 Determinant and trace.Determinant of a n n matrix is defined by

    det Adef si1i2,...inAi11Ai22 . . . Ainn

    where sum overi1, i2 . . . in is implied(each taking values 1 throughn ) ands(i1, i2 . . . in)is the totally asymmetric n-th rank tensor:

    si1i2...in

    +1(1) if (i1, i2 . . . in) is an even (odd) permutation of (1, 2, . . . n).0 if any of i1, i2 . . . in are equal.

    Show that to first order of a small number d, the determinant of a matrix that isinfinitesimally close to the identity matrix I is given by

    det(I+ dH) = 1 + dTrH+ (higher orders in d) ,

    where H is a certain matrix whose size is of order 1, and the trace (Tr) of a matrixis defined by

    TrH n

    i=1

    Hii .

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    15/32

    1.5. TENSORS 15

    Since all diagonal elements ofH are of order unity or smaller, (1.56) tells us thatdet 1 as we take d 0. In fact, the infinitesimal transformation is a Lorentztransformation, so we know that det =

    1. Thus, we see that the determinant of an

    infinitesimal transformation is strictly +1. It then follows from det() = det detthat multiplying an infinitesimal transformation to any transformation does notchange the determinant of the transformation.

    The (0, 0) component of is

    ()00 = [(I + dH)]00 = [ + dH]

    00 =

    00 + d (H)

    00 . (1.58)

    Since (H)00 is a finite number for a finite , the change in the (0, 0) component tendsto zero as we take d 0. Thus, no matter how many infinitesimal transformationsare multiplied to , the (0, 0) component cannot jump across the gap between +1and 1.

    Thus, continuously connected Lorentz transformations have the same propernessand orthochronousness. Therefore, boosts and rotations, which are continuously con-nected to the identity, are proper and orthochronous.

    Do Lorentz boosts form a group?A natural question is whether Lorentz boosts form a group by themselves. The answeris no, and this is because two consecutive boosts in different directions turn out tobe a boost plus a rotation as we will see when we study the generators of the Lorentzgroup. Thus, boosts and rotations have to be combined to form a group. On theother hand, rotations form a group by themselves.

    1.5 Tensors

    Suppose A and B are 4-vectors. Each is a set of 4 numbers that transform undera Lorentz transformation as

    A = A, B = B

    . (1.59)

    Then, the set of 16 numbers AB (, = 0, 1, 2, 3) transforms as

    AB =

    AB . (1.60)

    Anything that has 2 Lorentz indices, which is a set of 16 numbers, and transforms as( ) =

    ( )

    (1.61)

    is called a second rank tensor (or simply a tensor). It may be real, complex, or evena set of operators. Similarly, a quantity that has 3 indices and transforms as

    ( ) =

    ( ) (1.62)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    16/32

    16 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    is called a third-rank tensor, and so on. A 4-vector (or simply a vector) is a first-rank tensor. A Lorentz-invariant quantity, sometimes called a scalar, has no Lorentzindex, and thus it is a zero-th rank tensor:

    ( ) = ( ) (scalar) . (1.63)

    Contracted indices do not count in deciding the rank of a tensor. For example,

    AB : (scalar), AT : (vector), FG : (tensor), etc. (1.64)

    The metrix g has two Lorentz indices and thus can be considered a second-ranktensor (thus, the metric tensor), then it should transform as

    g =

    g = g (1.65)

    where the second equality is due to (1.31). Namely, the metric tensor is invariantunder Lorentz transformations.

    In order for some equation to be Lorentz-invariant, the Lorentz indices have to bethe same on both sides of the equality, including the superscript/subscript distinction.By Lorentz-invariant, we mean that if an equation holds in one frame, then it holdsin any other frame after all the quantities that appear in the equation are evaluatedin the new frame. In the literature, such equations are sometimes called Lorentzcovariant: both sides of the equality change values but the form stays the same. Forexample, if an equation A = B (which is actually a set of 16 equations) holds ina frame, then it also holds in any other frame:

    A =

    A =

    B

    = B . (1.66)

    Thus, equations such asm2 = PP ,

    P = A + B ,

    F = AB(1.67)

    are all Lorentz-invariant, assuming of course that the quantities transform in thewell-defined ways as described above.

    1.6 Fields (classical)

    A field is a quantity that is a function of space-time point x = (t, x) (or event).A scalar quantity that is a function of space time is called a scalar field, a vectorquantity that is a function of space time is called a vector field, etc. The rank ofa field and the Lorentz transformation properties (scalar, vector, tensor, etc.) are

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    17/32

    1.6. FIELDS (CLASSICAL) 17

    defined in the same way as before, provided that the quantities are evaluated at thesame event point before and after a Lorentz transformation; namely,

    Scalar field : (x) = (x)Vector field : A(x) = A

    Tensor field : T(x) =

    T(x)

    (1.68)

    where x and x are related byx = x

    . (1.69)

    For example, a vector field associates a set of 4 numbers A(x) to an event pointx, say when an ant sneezes. In another frame, there are a set of 4 numbers A(x)associated with the same event x, namely, when the ant sneezes in that frame, andthey are related to the 4 numbers A(x) in the original frame by the matrix . The

    functional shape of a primed field is in general different from that of the correspondingunprimed field. Namely, if one plots (x) as a function of x and (x) as a functionof x, they will look different.

    When a quantity is a function ofx, we naturally encounter space-time derivativesof such quantity. Then a question arises as to how they transform under a Lorentztransformation. Take a scalar field f(x), and form a set of 4 numbers (fields) bytaking space-time derivatives:

    f

    x(x) =

    f

    x0(x),

    f

    x1(x),

    f

    x2(x),

    f

    x3(x)

    = f

    t (x),

    f

    x (x),

    f

    y (x),

    f

    z (x) .(1.70)

    Then pick two space-time points x1 and x2 which are close in space and in time. Theargument below is based on the observation that the difference between the valuesof the scalar field at the two event points is the same in any frame. Since f(x) isa scalar field, the values at a given event is the same before and after a Lorentztransformation:

    f(x1) = f(x1), f(x2) = f(x2) , (1.71)

    orf(x1) f(x2) = f(x1) f(x2) . (1.72)

    Since x1 and x2 are close, this can be written as

    dxf

    x(x1) = dx

    f

    x(x1) , (1.73)

    where summation over is implied, and

    dxdef x1 x2 , dx def x1 x2 . (1.74)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    18/32

    18 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    which tells us that the quantity dx(f/x) is Lorentz-invariant. Since dx = x1x2is a superscripted 4-vector, it follows that f/x should transform as a subscripted4-vector (which transforms as A =

    A):

    f

    x(x) =

    f

    x(x) . (1.75)

    In fact, together with dx = dx, we have

    dxf

    x(x) =

    dx

    f

    x(x)

    =

    g

    by (1.32)

    dxf

    x(x)

    = dxf

    x(x) ,

    (1.76)

    showing that it is indeed Lorentz-invariant.Thus, the index in the differential operator /x acts as a subscript even

    though it is a superscript on x. To make this point clear, /x is often written usinga subscript as

    def

    x=

    t,

    x,

    y,

    z

    =

    t,

    . (1.77)

    Once is defined, the standard subscript/superscript rule applies; namely, =

    /x, etc. Symbolically, the operator

    then transforms as a superscripted 4-vector:

    = , (1.78)

    with /x.

    Example: Consider the 4-component charge current density j(x) = ((x),j(x)).We can see that this is indeed a Lorentz 4-vector as follows: Suppose the charge iscarried by some medium, such as gas of ions, then pick a space-time point x and let0 be the charge density in the rest frame of the medium and be the velocity of themedium at that point. Then the charge density in the frame in question is larger

    than 0 by the factor = 1/1 2 due to Lorentz contraction = 0 . (1.79)

    Since j = , j can be written as

    j = (,j) = (0, 0) = 0(, ) = 0 , (1.80)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    19/32

    1.7. GENERATORS OF THE LORENTZ GROUP 19

    where we have defined the 4-velocity by

    def

    (, ) . (1.81)

    On the other hand, the 4-momentum of a particle with mass m can be written as

    P = (m,m) = m (1.82)

    which means that the 4-velocity is a Lorentz 4-vector, and therefore so is j. Whenthe charge is carried by more than one different media, unique rest frame of the mediawhere 0 is defined does not exist. The total j, however, is the sum of j for eachmedium. Since j for each medium is a 4-vector, the sum is also a 4-vector.

    Then j = 0 is a Lorentz-invariant equation; namely, if it is true in one frame,

    then it is true in any frame. Using (1.77), we can write j = 0 as

    j = 0j

    0 + 1j1 + 2j

    2 + 3j3

    =

    t +

    xjx +

    yjy +

    zjz

    =

    t +

    note the sign!

    j = 0 ,(1.83)

    which is nothing but the charge conservation equation. Thus, we see that if charge isconserved in one frame it is conserved in any frame.

    1.7 Generators of the Lorentz group

    In this section, we will focus on the proper and orthochronous Lorentz group. Otherelements of the Lorentz group can be obtained by multiplying T, P, and T P to theelements of this group. The goal is to show that any element that is continuouslyconnected to the identity can be written as2

    = eiKi+iLi , (i = 1, 2, 3) (1.84)

    where i and i are real numbers and Ki and Li are 44 matrices. Such group whoseelements can be parametrized by a set of continuous real numbers (in our case they

    2In the literature, it is often defined as exp i(iKi + iLi), which would make the operatorshermitian if the transformation were unitary (e.g. representations of the Lorentz group in theHilbert space). The Lorentz transformation matrices in space-time are in general not unitary, andfor now, we will define without the i so that the expressions become simpler.

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    20/32

    20 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    are i and i) is called a Lie group. The operators Ki and Li are called the generatorsof the Lie group.

    Any element of the proper and orthochronous Lorentz group is continuously con-

    nected to the identity. Actually we have not proven this, but we will at least showthat all boosts, rotations and combinations thereof are continuously connected to theidentity (and vice versa).

    1.7.1 Infinitesimal transformations

    Lets start by looking at a Lorentz transformation which is infinitesimally close to theidentity:

    = g

    +

    (1.85)

    where is a set of small (real) numbers. Inserting this to the defining condition(1.31) or equivalently = g (1.32), we get

    g =

    = (g + )(g

    +

    )

    = gg

    + g

    + g

    +

    = g + + +

    .

    (1.86)

    Keeping terms to the first order in , we then obtain

    = . (1.87)

    Namely, is anti-symmetric (which is true when the indices are both subscript orboth superscript; in fact, is not anti-symmetric under ), and thus it has 6independent parameters:

    {} =

    0 01 02 0301 0 12 1302 12 0 2303 13 23 0

    (1.88)

    This can be conveniently parametrized using 6 anti-symmetric matrices as

    {} = 01{(M01)} + 02{(M02)} + 03{(M03)}+ 23{(M23)} + 13{(M13)} + 12{(M12)} (1.89)=

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    21/32

    1.7. GENERATORS OF THE LORENTZ GROUP 21

    with

    {(M01)} = 0 1 0 0

    1 0 0 00 0 0 0

    0 0 0 0

    , {(M23)} = 0 0 0 0

    0 0 0 00 0 0 1

    0 0 1 0

    ,

    {(M02)} =

    0 0 1 0

    0 0 0 0

    1 0 0 00 0 0 0

    , {(M13)} =

    0 0 0 0

    0 0 0 1

    0 0 0 0

    0 1 0 0

    ,

    {(M03)

    }=

    0 0 0 1

    0 0 0 0

    0 0 0 01 0 0 0

    , {(M

    12)}

    =

    0 0 0 0

    0 0 1 0

    0 1 0 00 0 0 0

    (1.90)

    Note that for a given pair of and , {(M)} is a 4 4 matrix, while is a realnumber. The elements (M) can be written in a concise form as follows: first, wenote that in the upper right half of each matrix (i.e. for < ), the element with(, ) = (, ) is 1 and all else are zero, which can be written as g g

    . For the

    lower half, all we have to do is to flip and and add a minus sign. Combining thetwo halves, we get

    (M) = g

    g

    g g . (1.91)This is defined only for < so far. For > , we will use this same expression asthe definition; then, (M) is anti-symmetric with respect to ( ):

    (M) = (M) , (1.92)

    which also means (M) = 0 if = . Together with = , (1.89) becomes

    =

    (M) =

    1

    2(M

    ) , (1.93)

    where in the last expression, sum over all values of and is implied. The infinites-

    imal transformation (1.85) can then be written as

    = g

    +1

    2(M

    ) , (1.94)

    or in matrix form,

    = I +1

    2M

    . (1.95)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    22/32

    22 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    where the first indices of M, which is a 4 4 matrix for given and , is takento be superscript and the second subscript; namely, in the same way as Lorentztransformation. Namely, when no explicit indexes for elements are given, the 4

    4

    matrix M is defined asM

    def {(M)} . (1.96)It is convenient to divide the six matrices to two groups as

    Kidef M0i, Li def Mjk (i,j,k : cyclic) . (1.97)

    We always use subscripts for Ki and Li since only possible values are i = 1, 2, 3, andsimilarly to M, elements of the matrices Kis and Lis are defined by taking thefirst Lorentz index to be superscript and the second subscript:

    Kidef {(Ki)} , Li

    def {(Li)} . (1.98)

    Later, we will see that Ks generate boosts and Ls generate rotations. Explicitly,they can be obtained by raising the index in (1.90) (note also the the minus signin L2 = M13):

    K1 =

    0 1 0 0

    1 0 0 0

    0 0 0 0

    0 0 0 0

    , K2 =

    0 0 1 0

    0 0 0 0

    1 0 0 0

    0 0 0 0

    , K3 =

    0 0 0 1

    0 0 0 0

    0 0 0 0

    1 0 0 0

    (1.99)

    L1 =

    0 0 0 0

    0 0 0 0

    0 0 0 10 0 1 0

    , L2 =

    0 0 0 0

    0 0 0 1

    0 0 0 0

    0 1 0 0

    , L3 =

    0 0 0 0

    0 0 1 00 1 0 0

    0 0 0 0

    (1.100)

    By inspection, we see that the elements of Ks and Ls can be written as

    (Ki)j

    k = 0 , (Ki)0

    = (Ki)0 = g

    i ,

    (Li)j

    k =

    ijk , (Li)0

    = (Li)0 = 0 ,

    (i,j,k = 1, 2, 3; = 0, 1, 2, 3) (1.101)

    where ijk is a totally anti-symmetric quantity defined for i,j,k = 1, 2, 3:

    ijkdef

    =

    +1

    1

    if (i,j,k) is an

    even

    odd

    permutation of (1, 2, 3),

    = 0 if any of i,j,k are equal.

    (1.102)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    23/32

    1.7. GENERATORS OF THE LORENTZ GROUP 23

    An explicit calculation shows that Ks and Ls satisfy the following commutationrelations:

    [Ki, Kj ] = ijk Lk[Li , Lj ] = ijk Lk (1.103)[Li , Kj ] = ijk Kk ,

    where sum over k = 1, 2, 3 is implied, and the commutator of two operators A, B isdefined as

    [A, B]def AB BA . (1.104)

    Note that the relation [Ki, Kj ] = ijk Lk can also be written as [Ki, Kj] = Lk(i,j,k: cyclic), etc.

    Exercise 1.3 Verify the commutation relations (1.103). You may numerically verifythem, or you may try proving generally by using the general formula for the elementsof the matrixes.

    Exercise 1.4 Boost in a general direction.Start from the formula for boost (1.9) where P is the component of P parallel to ,

    and P is the component perpendicular to ; namely,

    P = P n, and P = P Pnwith n = / (and similarly for P ). Note that is well-defined in the primed framealso by the particular relative orientation of the two frames chosen.(a) Show that the corresponding Lorentz transformation matrix is given by

    =

    x y zx 1 +

    2x xy xz

    y xy 1 + 2y yz

    z xz yz 1 + 2z

    , with 12 .(b) Show that when is small, the Lorentz transformation matrix for a boost is givento the first order in by

    = 1 + iKi . (summed over i = 1, 2, 3)

    (c) In the explicit expression of given above, one notes that the top row [0 ( =0, 1, 2, 3)] and the left-most column [0 ( = 0, 1, 2, 3)] are nothing but the velocity

    4-vector = (, ). Lets see how it works for general Lorentz transformations

    (proper and orthochronous). Suppose the relative orientation of the two frames Kand K is not given by K = K, where K is the velocity of the origin of Kmeasured in K, and K is the velocity of the origin of K

    measured in K. Let bethe corresponding Lorentz transformation. Express 0 and

    0 in terms of

    K and

    K. (hint: Place a mass m at the origin ofK and view it from K, and place a mass

    at the origin of K and view it from K.)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    24/32

    24 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    1.7.2 Finite transformations

    Now we will show that any finite (namely, not infinitesimal) rotation can be written as

    eiLi

    , and any finite boost can be written as eiKi

    , where i and i (i = 1, 2, 3) are somefinite real numbers. First, however, let us review some mathematical background:

    Matrix exponentiationsThe exponential of a m m matrix A is also a m m matrix defined by

    eAdef lim

    n

    I +

    A

    n

    n, (1.105)

    which can be expanded on the right hand side as

    eA = limn

    nk=0

    n(n 1) . . . (n k + 1)k !

    Ak

    nk. (1.106)

    Since the sum is a rapidly converging series, one can sum only the terms with k nfor which n(n 1) . . . (n k + 1) nk. It then leads to

    eA =

    k=0

    Ak

    k !, (1.107)

    which can also be regarded as a definition of eA

    .Using the definition (1.105) or (1.107), we see that

    eA

    = eA

    , (1.108)

    where the hermitian conjugate of a matrix A is defined by (A)ij Aji . The deter-minant of eA can be written using (1.105) as

    det eA = limn

    det

    I+

    A

    n

    n

    = limn

    1 + TrAn

    + . . . + cknk

    + . . .n ,(1.109)

    where we have used (1.56). This does not depend on ck (k > 1) since the derivativewith respect to ck vanishes as can be readily verified. Thus, ck (k > 1) can be set tozero and we have

    det eA = eTrA . (1.110)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    25/32

    1.7. GENERATORS OF THE LORENTZ GROUP 25

    The derivative ofexA (x is a number, while A is a constant matrix) with respect to xcan be obtained using (1.107),

    ddxexA =

    k=1

    (k xk1

    )Ak

    k ! = A

    k=1

    xk1

    Ak1

    (k 1) ! ; (1.111)

    thus,d

    dxexA = AexA . (1.112)

    There is an important theorem that expresses a product of two exponentials interms of single exponential, called the Campbell-Baker-Hausdorff (CBH) theorem(presented here without proof):

    eAeB = eA+B+1

    2[A,B]+ , (1.113)

    where denotes the higher-order commutators of A and B such as [A, [A, B]],[A, [[A, B], B]] etc. with known coefficients. Note that the innermost commutator isalways [A, B] since otherwise it is zero ([A, A] = [B, B] = 0), and thus if [A, B] is acommuting quantity (a c-number), then is zero. Applying (1.113) to B = A,we get

    eAeA = eAA = I , (1.114)

    or(eA)1 = eA . (1.115)

    RotationAn infinitesimal rotation around the z-axis by [Figure 1.3(a)] can be written as

    x

    y

    =

    x yy + x

    =

    1 1

    xy

    = (I + Lz)

    xy

    , (1.116)

    with

    Lz =

    x yx 0 1y 1 0

    . (1.117)

    Then, a rotation by a finite angle is constructed as n consecutive rotations by /neach and taking the limit n . Using (1.116), it can be written as

    x

    y

    = lim

    n

    I+

    nLz

    n xy

    = eLz

    xy

    , (1.118)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    26/32

    26 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    where we have used the definition (1.105).From the explicit expression of Lz (1.117), we have L

    2z = I, L3z = Lz, L4z = I,

    etc. In general,

    L4nz = I, L4n+1z = Lz, L

    4n+2z = I, L4n+3z = Lz, (1.119)

    where n is an integer. Using the second definition of eA (1.107), the rotation matrixeLz can then be written in terms of the trigonometirc functions as

    eLz = I + Lz +2

    2 !L2zI

    +3

    3 !L3z

    Lz+ . . . (1.120)

    =

    1

    2

    2 !+ . . .

    cos

    I +

    3

    3 !+ . . .

    sin

    Lz (1.121)

    =

    cos sin sin cos

    , (1.122)

    which is probably a more familiar form of a rotation around the z-axis by an angle .Similarly, rotations around x and y axes are generated by Lx and Ly as obtained

    by cyclic permutations of (x,y,z) in the derivation above. Switching to numericalindices [(Lx, Ly, Lz) (L1, L2, L3)],

    L1 = 2 3

    2 0 13 1 0 , L2 = 3 1

    3 0 11 1 0 , L3 = 1 2

    1 0 12 1 0 . (1.123)Are these identical to the definition (1.100) which was given in 4 4 matrix form,or equivalently (1.101)? Since Li is the change of coordinates by the rotation, the

    x

    n

    n

    xx

    (x,y)

    (x',y')

    x

    y

    x

    y

    (a) (b)

    Figure 1.3: Infinitesimal rotation around the z-axis by an angle (a), and around a

    general direction by an angle /n (b).

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    27/32

    1.7. GENERATORS OF THE LORENTZ GROUP 27

    elements of a 4 4 matrix corresponding to unchanged coordinates should be zero.We then see that the Ls given above are indeed identical to (1.100).

    A general rotation is then given by

    eiLi = eL , (1.124)

    where

    def (1, 2, 3), L def (L1, L2, L3) . (1.125)As we will see below, this is a rotation around the direction by an angle ||. Tosee this, first we write eiLi using the definition (1.105):

    eiLi = limn

    I+

    iLin

    n, (1.126)

    which shows that it is a series of small rotations each given by I + iLi/n. Theaction of such an infinitesimal transformation [Figure 1.3(b)] on x is (writing thespace components only)

    xj =

    I +

    iLin

    j

    kxk

    = gj kxk +

    1

    ni (Li)

    jk

    ijk by (1.101)xk

    = xj

    1

    nijk ixk

    = xj +1

    n( x)j (1.127)

    where we have used the definition of the three-dimensional cross product

    (a b)i = ijk ajbk . (1.128)

    Thus, I+iLi/n is nothing but a small rotation around by an angle /n (Figure 1.3).Then n such rotations applied successively will result in a rotation by an angle around the same axis .

    BoostsA boost in x direction by a velocity is given by (1.26):

    =

    t xt x

    ,

    =

    11 2 , =

    . (1.129)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    28/32

    28 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    When is small (= ), 1 and to the first order in ; then, the infinitesimalboost can be written as

    = 1 1 = I+ Kx , (1.130)with

    Kx =

    t xt 0 1x 1 0

    . (1.131)

    Suppose we apply n such boosts consecutively, where we take n to infinity while nis fixed to a certain value :

    n = . (1.132)

    Then the resulting transformation is

    = limn

    I+

    nKx

    n= eKx , (1.133)

    where we have used the definition (1.105). Is the velocity of this boost? The answeris no, even though it is a function of the velocity. Lets expand the exponential aboveby the second definition (1.105) and use K2x = I:

    = eKx (1.134)

    = I+ Kx +2

    2 !K2x

    I+

    3

    3 !K3x

    Kx+ . . .

    =

    1 +2

    2 !+ . . .

    cosh

    I+

    +2

    2 !+ . . .

    sinh

    Kx

    =

    cosh sinh sinh cosh

    . (1.135)

    Comparing with (1.130), we see that this is a boost of a velocity given by

    = cosh , = sinh (1.136)

    or =

    = tanh . (1.137)

    Note that the relation 2 2 = 1 (1.6) is automatically satisfied since cosh 2 sinh 2 = 1.

    Thus, n consecutive boosts by a velocity /n each did not result in a boost of avelocity ; rather, it was a boost of a velocity = tanh . This breakdown of the

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    29/32

    1.7. GENERATORS OF THE LORENTZ GROUP 29

    simple addition rule of velocity is well known: the relativistic rule of velocity additionstates that two consecutive boosts, by 1 and by 2, do not result in a boost of1+2,but in a boost of a velocity 0 given by

    0 =1 + 2

    1 + 12. (1.138)

    Due to the identity tanh(1 + 2) = (tanh 1 +tanh 1)/(1+tanh 1 tanh 2), however,it becomes additive when velocities are transformed by i = tanh i (i = 0, 1, 2);namely, 0 = 1 + 2 holds.

    The matrix Kx( K1) given in (1.131) is identical to the 44 form given in (1.99)when all other elements that correspond to unchanged coordinates are set to zero.The boosts along y and z directions are obtained by simply replacing x with y or zin the derivation above. Thus, we see that K2 and K3 given in (1.99) indeed generate

    boosts in y and z directions, respectively.A boost in a general direction would then be given by

    = eiKi , (1.139)

    where (1, 2, 3) are the parameters of the boost. In order to see what kind oftransformation this represents, lets write it as a series of infinitesimal transformationsusing (1.105):

    eiKi = limn

    I +

    in

    Ki

    n. (1.140)

    From the explicit forms ofKi (1.99), we can write the infinitesimal transformation as

    I +in

    Ki = I +1

    n

    0 1 2 3123

    0

    . (1.141)

    On the other hand, a boost in a general direction by a small velocity is given by(1.9) with 1, and ||:

    EP = 1 1 EP , P = P , (1.142)or

    E = E+ P E = E+ PP = P + EP = P

    P = P + E (1.143)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    30/32

    30 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    where we have used P() = P() +

    P() ( /). This can be written in 4 4 matrix

    form as

    E

    PxPyPz

    = I+ 0 1 2 3

    123

    0

    E

    PxPyPz

    . (1.144)

    Comparing this with (1.141), we can identify that I+ iKi/n as a boost in direction

    by a velocity /n ( ||). Then n consecutive such boosts will result in a boostin the same direction. Since the rule of addition of velocity (1.138) is valid in anydirection as long as the boosts are in the same direction, the n boosts by velocity/n each will result in a single boost of velocity = tanh as before. Thus, eiKi

    represents a boost in direction by a velocity = tanh .

    Boost + rotationFirst, we show that a rotation followed by a rotation is a rotation, but a boost followedby a boost is not in general a boost. Consider a rotation eiLi followed by anotherrotation eiLi where and are arbitrary vectors. Using the CBH theorem (1.113),we can write the product of the two transformations as

    eiLiejLj = eiLi+jLj+1

    2[iLi,jLj ] + ..., (1.145)

    where . . . represents terms with higher-order commutators such as [iLi, [jLj, kLk]]etc. Now we can use the commutation relations (1.103) to remove all commutatorsin the exponent on the right hand side. The result will be a linear combination ofLs

    with well-defined coefficients (call them i) since the coefficients in . . . in the CBHtheorem are known. Here, there will be no Ks appearing in the linear combinationbecause of the commutation relation [Li, Lj] = ijk Lk. Thus, the product is writtenas

    eiLiejLj = eiLi ( : a function of , ), (1.146)

    which is just another rotation.Next, consider a boost eiKi followed by another boost e

    i

    Ki :

    ei

    KiejKj = ei

    Ki+jKj+1

    2[i

    Ki,jKj ] + ... . (1.147)

    Again, the brackets can be removed by the commutation relations (1.103) reducingthe exponent to a linear combination of Ks and Ls. This time, there will be Lsappearing through the relation [Ki, Kj] = ijk Lk which are in general not cancelledamong different terms. Thus, a boost followed by a boost is not in general anotherboost; rather, it is a combination of boost and rotation:

    ei

    KiejKj = eiKi+iLi (, : functions of , ). (1.148)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    31/32

    1.7. GENERATORS OF THE LORENTZ GROUP 31

    It is easy to show, however, that if two boosts are in the same direction, then theproduct is also a boost.

    Any combinations of boosts and rotations can then be written as

    = eiKi+iLi = e1

    2aM

    , (1.149)

    where we have defined the anti-symmetric tensor a by

    a0idef i, aij def k (i,j,k : cyclic), a = a , (1.150)

    and the factor 1/2 arises since terms with > as well as < are included inthe sum. The expression of an infinitesimal transformation (1.95) is nothing but thisexpression in the limit of small a. Since we now know that any product of suchtransformations can also be written as (1.149) by the CBH theorem, we see that the

    set of Lorentz transformations connected to the identity is saturated by boosts androtations.

    We have seen that the generators Ks and Ls and their commutation relations(called the Lie algebra) play critical roles in understanding the Lorentz group. Infact, generators and their commutation relations completely determine the structureof the Lie group, as described briefly below.

    Structure constantsWhen the commutators of generators of a Lie group are expressed as linear com-binations of the generators themselves, the coefficients of the linear expressions arecalled the structure constants of the Lie group. For example, the coeffients ijk in(1.103) are the structure constants of the Lorentz group. We will now show that thestructure constants completely define the structure of a Lie group. To see this, wehave to define what we mean by same structure. Two sets F( f) and G( g) aresaid to have the same structure if there is a mapping between Fand G such that iff1, f2 Fand g1, g2 G are mapped to each other:

    f1 g1, f2 g2 (1.151)then, the products f1f2 and g1g1 are also mapped to each other by the same mapping:

    f1f2 g1g2 ; (1.152)namely, the mapping preserves the product rule.

    Suppose the sets Fand G are Lie groups with the same number of generators Fiand Gi and that they have the same set of structure constants cijk

    [Fi, Fj] = cijk Fk, [Gi, Gj] = cijk Gk . (1.153)

  • 8/3/2019 Ch1 Lorentz Group & Lorentz Invariant

    32/32

    32 CHAPTER 1. LORENTZ GROUP AND LORENTZ INVARIANCE

    Any element ofFand G can be expressed in exponential form using the correspondinggenerators and a set of real parameters. We define the mapping between Fand G bythe same set of the real parameters:

    f = eiFi g = eiGi . (1.154)

    If f1 g1 and f2 g2, then they can be written as

    f1 = eiFi g1 = eiGi (1.155)

    f2 = eiFi g2 = eiGi , (1.156)

    where i and i are certain sets of real parameters. Then the question is whetherthe products f1f2 and g1g2 are mapped to each other by the same mapping. Theproducts f1f2 and g1g2 can be written using the CBH theorem as

    f1f2 = eiFiejFj = eiFi+jFj+

    1

    2[iFi,jFj ] + ... = eiFi , (1.157)

    g1g2 = eiGiejGj = eiGi+jGj+

    1

    2[iGi,jGj ] + ... = eiGi, (1.158)

    The numbers i and i are obtained by removing the commutators using the commu-tation relations (1.153), and thus completely determined by i, i and cijk ; namely,i = i, and thus f1f2 and g1g2 are mapped to each other by (1.154). Thus, if two Liegroups have the same set of structure constants, then they have the same structure.