Tensor Theory - University of St Andrewsstar-hz4/gr/Tensor.doc · Web viewGiven a tensor , we may form another tensor defined by (34) Note that . (35) The tensor may therefore be

Tensor Theory

Introduction and definitions

In n-dimensional space Vn (called a "manifold" in mathematics), points are specified by assigning values to a set of n continuous real variables x

1,x2.....xn called the coordinates.

In many cases these will run from -∞ to +∞, but the range of some or all of these can be finite.

Examples: In Euclidean space in three dimensions, we can use cartesian coordinates x, y and z, each of which runs from -∞ to +∞. For a two dimensional Euclidean plane, Cartesians may again be employed, or we can use plane polar coordinates r, whose ranges are 0 to ∞ and 0 to 2 respectively.

Coordinate transformations. The coordinates of points in the manifold may be assigned in a number of different ways. If we select two different sets of coordinates, x

1,x2.....xn and

x 1, x 2, ..... x n, there will obviously be a connection between them of the form

x r fr(x1,x2....xn) r = 1, 2........n. (1)

where the f's are assumed here to be well behaved functions. Another way of expressing the same relationship is

x r x r(x1,x2....xn) r = 1, 2........n. (2)

where x r(x1,x2....xn) denotes the n functions fr(x1,x2....xn), r = 1, 2......n.

Recall that if a variable z is a function of two variables x and y, i.e. z = f (x, y), then the connection between the differentials dx, dy and dz is

dzf

xdx f

ydy. (3)

Extending this to several variables therefore, for each one of the new coordinates we have

1

2

d xr

∂ xr

∂xss1

n∑ dxs

3

. r=1, 2........n. (4)

The transformation of the differentials of the coordinates is therefore linear and homogeneous, which is not necessarily the case for the transformation of the coordinates themselves.

Range and Summation Conventions. Equations such as (4) may be simplified by the use of two conventions:

Range Convention: When a suffix is unrepeated in a term, it is understood to take all values in the range 1, 2, 3.....n.

Summation Convention: When a suffix is repeated in a term, summation with respect to that suffix is understood, the range of summation being 1, 2, 3.....n.

With these two conventions applying, equation (4) may be written as

4

5

d xr

∂ xr

∂xsdxs

6

. (5)Note that a repeated suffix is a "dummy" suffix, and can be replaced by any convenient alternative. For example, equation (5) could have been written as

7

8

d xr

∂ xr

∂xmdxm

9

. (6)where the summation with respect to s has been replaced by the summation with respect to m.

10

Contravariant vectors and tensors. Consider two neighbouring points P and Q in the manifold whose coordinates are xr and xr + dxr respectively. The vector

11

12

is then described by the quantities dxr which are the components of the vector in this coordinate system. In the dashed coordinates, the vector

13

r P Q

14

is described by the components

15

d xr16

which are related to dxr by equation (5), the differential coefficients being evaluated at P.

17

The infinitesimal displacement represented by dxr or

18

d xr19

is an example of a contravariant vector.

Defn. A set of n quantities T r associated with a point P are said to be the components of a contravariant vector if they transform, on change of coordinates, according to the equation

20

21

Tr

∂ xr

∂xsTs

22

. (7)

where the partial derivatives are evaluated at the point P. (Note that there is no requirement that the components of a contravariant tensor should be infinitesimal.)

Defn. A set of n 2 quantities T rs associated with a point P are said to be the components of a contravariant tensor of the second order if they transform, on change of coordinates, according to the equation

23

24

Trs

∂ xr

∂xm∂ xs

∂xnTmn

25

. (8)

Obviously the definition can be extended to tensors of higher order. A contravariant vector is the same as a contravariant tensor of first order.

Defn. A contravariant tensor of zero order transforms, on change of coordinates, according to the equation

26

27

TT28

, (9)

i.e. it is an invariant whose value is independent of the coordinate system used.

Covariant vectors and tensors. Let be an invariant function of the coordinates, i.e. its value may depend on position P in the manifold but is independent of the coordinate system used. Then the partial derivatives of transform according to

29

30

∂φ

∂ xr∂φ

∂xs∂xs

∂ xr31

(10)

Here the transformation is similar to equation (7) except that the partial derivative involving the two sets of coordinates is the other way up. The partial derivatives of an invariant function provide an example of the components of a covariant vector.

32

Defn. A set of n quantities

33

associated with a point P are said to be the components of a covariant vector if they transform, on change of coordinates, according to the equation

34

35

Tr

∂xs

∂ xrTs

36

. (11)

By convention, suffices indicating contravariant character are placed as superscripts, and those indicating covariant character as subscripts. Hence the reason for writing the coordinates as xr. (Note however that it is only the differentials of the coordinates, not the

coordinates themselves, that always have tensor character. The latter may be tensors, but this is not always the case.)

Extending the definition as before, a covariant tensor of the second order is defined by the transformation

37

38

Trs

∂xm

∂ xr∂xn

∂ xsTmn

39

(12)

and similarly for higher orders.

40

Mixed tensors. These are tensors with at least one covariant suffix and one contravariant suffix. An example is the third order tensor

41

Tstr

42

which transforms according to

43

44

Tstr

∂ xr

∂xm∂xn

∂ xs∂x

∂ xtTnm

45

(13)

Another example is the Kronecker delta defined by

46

47

dsr 1, rs48

49

0, r≠s50

(14)

51

It is a tensor of the type indicated because (a) in an expression such as

52

Bpq..mn..dm

t53

, which involves summation with respect to m, there is only one non-zero contribution from

54

the Kronecker delta, that for which m = t, and so

55

Bpq..mn..dm

t B..tn..

56

; (b) the coordinates in any coordinate system are necessarily independent of each other, so

57

that

58

∂xr

∂xsds

r

59

and

60

∂ xr

∂ xs ds

r61

; so these two properties taken together imply that

62

63

dsr

∂ xr

∂xm∂xn

∂ xsdnm64

. (15)

65

Notes. 1. The importance of tensors is that if a tensor equation is true in one set of coordinates it is also true in any other coordinates. e.g. if

66

Tmn0

67

(which, since m and n are unrepeated, implies that the equation is true for all m and

68

n, not just for some particular choice of these suffices), then

69

Trs0

70

also, from the transformation law. This illustrates the fact that any tensor equation is covariant, which means that it has the same form in all coordinate systems.

2. A tensor may be defined at a single point P within the manifold, or along a curve, or throughout a subspace, or throughout the manifold itself. In the latter cases we speak of a tensor field.

Tensor algebra

71

Addition of tensors. Two tensors of the same type may be added together to give another tensor of the same type, e.g. if

72

Astr

73

and

74

Bstr

75

are tensors of the type indicated, then we can define

76

77

Cstr Ast

r Bstr

78

. (16)

79

It is easy to show that the quantities

80

Cstr

81

form the components of a tensor.

82

Symmetric and antisymmetric tensors.

83

Ars

84

is a symmetric contravariant tensor if

85

ArsAsr

86

and antisymmetric if

87

Ars−Asr88

. Similarly for covariant tensors. Symmetry properties are conserved under transformation of

89

coordinates, e.g. if

90

ArsAsr91

, then

92

93

Amn

∂ xm

∂xr∂ xn

∂xsArs

∂ xm

∂xr∂ xn

∂xsAsr Anm

94

. (17)

95

Note however that for a mixed tensor, a relation such as

96

Ars As

r 97

does not transform to give the equivalent relation in the dashed coordinates. The concept of symmetry (with respect to a pair of suffices which are eithe

Any covariant or contravariant tensor of second order may be expressed as the sum of a symmetric tensor and an antisymmetric tensor, e.g.

98

99

Ars

12(ArsAsr)

12(Ars−Asr)

100

. (18)

Multiplication of tensors. In the addition of tensors we are restricted to tensors of a single type, with the same suffices (though they need not occur in the same order). In the multiplication of tensors there is no such restriction. The only condition is that we never multiply two components with the same suffix at the same level in each. (This would imply summation with respect to the repeated suffix, but the resulting object would not have tensor character - see later.)

101

To multiply two tensors e.g.

102

Ars

103

and

104

Bnm

105

we simply write

106

107

Crsnm ArsBn

m108

. (19)

109

It follows immediately from their transformation properties that the quantities

110

Crsnm

111

form a tensor of the type indicated. This tensor, in which the symbols for the suffices are all

112

different, is called the outer product of

113

Ars

114

and

115

Bnm

116

.

117

Contraction of tensors. Given a tensor

118

Tnpm

119

, then

120

121

Tnm

∂ xm

∂xr∂xs

∂ xn∂xt

∂ xTstr122

. (20)

Hence replacing n by m (and therefore implying summation with respect to m)

123

124

Tmm

∂ xm

∂xr∂xs

∂ xm∂xt

∂ xTstr125

126

∂xs

∂xr∂xt

∂ xTstr127

128

dr

s ∂xt

∂ xTstr129

130

∂xt

∂ xTsts131

(21)

132

so we see that

133

Tmpm

134

behaves like a tensor

135

Ap136

. The upshot is that contraction of a tensor (i.e. writing the same letter as a subscript and a superscript) reduces the

137

Note that contraction can only be applied successfully to suffices at different levels. We may of course construct, starting with a tensor

138

Aqrs

p

139

say, a new set of quantities

140

Aqrr

p141

; but these do not have tensor character (as one can easily check) so are of little interest.

142

Having constructed the outer product

143

Crsnm ArsBn

m 144

in the example above, we can form the corresponding inner products

145

Cmsnm AmsBn

m 146

and

147

Crmnm ArmBn

m148

. Each of these forms a covariant tensor of second order.

Tests for tensor character. The direct way of testing whether a set of quantities form the components of a tensor is to see whether they obey the appropriate tensor transformation law when the coordinates are changed. There is also an indirect method however, two examples of which will now be given:

149

Theorem 1. Let

150

X r

151

be the components of an arbitrary contravariant vector. Let

152

Ar153

be another set of quantities. If

154

ArXr

155

is an invariant, then

156

Ar157

form the components of a covariant vector.

158

Proof: Since

159

X r

160

is a tensor, it obeys the tensor transformation law. Invariance of

161

ArXr

162

means that

163

164

ArX

r As Xs As∂ xs

∂xrX r165

(22)

166

and so

167

(Ar− As

∂ xs

∂xr)X r0

168

. (23)

169

Hence, since

170

X r

171

is an arbitrary tensor,

172

173

Ar

∂ xs

∂xrAs

174

. QED (24)

As an extension of this theorem, it is easy to show that any set of functions of the coordinates, whose inner product with an arbitrary covariant or contravariant vector is a

175

tensor, are themselves the components of a tensor. For example, if

176

ArsXs

177

is a tensor

178

Br179

, then

180

Ars

181

is a second order contravariant tensor.

182

Theorem 2. If

183

arsXrXs

184

is invariant,

185

X r186

being an arbitrary contravariant vector and

187

ars

188

being symmetric in all coordinate systems, then

189

ars

190

are the components of a covariant tensor of second order.

191

Proof: From our assumption about the invariance of

192

arsXrXs

193

,

194

195

amnXmXn ars Xr Xs196

197

198

ars

∂ xr

∂xm∂ xs

∂xnX mX n199

(25)

200

Hence

201

bmnX

mXn ≡(amn − ars∂ xr

∂xm∂ xs

∂xn)XmX n 0

202

. (26)

203

Since

204

Xm

205

is arbitrary and the total coefficient of

206

XmXn

207

is

208

bmnbnm209

, we deduce that

210

bmnbnm 0211

, i.e.

212

213

amn anm ars

∂ xr

∂xm∂ xs

∂xn ars

∂ xr

∂xn∂ xs

∂xm214

215

( ars asr)

∂ xr

∂xm∂ xs

∂xn216

(27)

217

on interchanging the summation variables r and s in the second term. But

218

amnanm

219

in all coordinate systems, hence

220

221

amn ars

∂ xr

∂xm∂ xs

∂xn222

. QED (28)

The metric tensor

223

The Euclidean space. Consider first the familiar Euclidean space in three dimensions, i.e. a space in which one can define Cartesian coordinates x, y and z so that the distance

224

dl

225

between two neighbouring points

226

x,y,z

227

and

228

x dx, y dy, z dz

229

is given by

230

231

dl 2(dx)2 (dy)2 (dz)2232

. (29)

233

If we choose any other coordinates

234

x1,x2,x3

235

to identify points in this space, the original coordinates will be functions of these new coordinates, and their differentials will be linear combinati

236

237

dl 2amn dxm dxn238

(30)

239

where the

240

amn

241

will be functions of

242

xm243

. (For example in spherical polar coordinates

244

x1r, x2 , x3 φ 245

we have

246

a111, a22 r2 , a33 r

2sin2 247

and all other a's are zero.)

248

We now show that

249

amn

250

is a covariant tensor of second order. The proof goes as follows:

251

(a)

252

amn

253

may be taken to be symmetric since each

254

apq

255

occurs only in the combination

256

apqa

257

on the RHS of (30).

258

(b)

259

dl 2amn dxm dxn 260

is invariant, since the distance between two points does not depend on the coordinates used to evaluate it.

261

(c) By keeping one point fixed and letting the second point vary in the neighbourhood of the first,

262

dxr

263

may be considered an arbitrary contravariant tensor.

264

Hence, using the theorem above,

265

amn

266

is a covariant tensor of second order. It is called the metric tensor for the Euclidean 3-space. A similar tensor obviously exists in the case of a two dimensional Euclidean space.

Riemannian space. A manifold is said to be Riemannian if there exists within it a covariant tensor of the second order which is symmetric. This tensor is called the metric tensor and

267

normally denoted by

268

gmn269

. Its significance is that it can be used to define the analogue of "distance" between points, and the lengths of vecto

270

Defn. The interval ds between the neighbouring points

271

xr

272

and

273

xr dxr274

is given by

275

276

ds2gmn dxm dxn277

. (31)

278

This is of course invariant. In the familiar Euclidean space where

279

gmn

280

is just the

281

amn

282

above,

283

ds2dl2 ≥0284

, being zero only when the two points coincide. In other cases however, e.g. in spacetime in

285

relativity theory,

286

ds2

287

may take on negative values, so that

288

ds

289

itself is not necessarily real. If ds = 0 for

290

dxr

291

not all zero, the displacement

292

dxr

293

is called a null displacement. Note that there is no requirement that ds should necessarily have the physical dimensions of length.

294

The conjugate metric tensor. From the covariant metric tensor

295

gmn

296

we can construct a contravariant tensor

297

gmn

298

defined by

299

300

gmngnpd

m301

. (32)

302

To show that

303

gmn

304

is a tensor, we note that, for any contravariant vector

305

Vp306

,

307

gmngnpV

pdmV V m

308

. This means that the inner product of

309

gmn

310

with the arbitrary covariant vector

311

gnpVp

312

is a tensor,

313

Vm314

, and so we deduce that

315

gmn

316

is indeed a tensor of the type indicated. It is said to be conjugate to

317

gmn318

. It is easily shown that when the metric tensor is diagonal, i.e. when

319

gmn0, m ≠n320

, the conjugate tensor is also diagonal, with each diagonal element satisfying

321

gnn1/ gnn322

.

323

The following theorem can be proved, but will just be quoted here: if g is the determinant of the matrix

324

gmn

325

(i.e. choosing to write the components of the tensor

326

gmn

327

in the form of a matrix array), then

328

329

gmn ∂

∂xrgmn

∂∂xr

lng330

. (33)

331

Raising and lowering suffices. Given a tensor

332

T rsm

333

, we may form another tensor

334

Tmrs

335

defined by

336

337

TnrsgnmT rsm

338

(34)

339

Note that

340

gmnTnrsg

mn gntT rst dt

m T rst T rs

m341

. (35)

342

The tensor

343

Tnrs

344

may therefore be regarded as possessing a special relationship with the original tensor

345

T rsm

346

in that either of them may be found from the other by the operation of forming the inner product of the f irst with the metric tensor or its conjugate. For this reason, the same symbol is used (T in this instance), and we describe the above processes by saying that in (34) we have "lowered the suffix m", and that in (35) we have "raised the suffix n". The process of raising or lowering suffices can be extended to cover all the indices of a tensor. For example we

347

can raise one or both of the suffices in the tensor

348

Tmn

349

, generating the corresponding tensors

350

T nm

351

,

352

Tnm

353

and

354

Tmn

355

. Notice the distinction between the two forms of the mixed tensor, effected by leaving appropriate gaps in the set of indices. When the tensor is symmetric however this distinction

356

disappears and we simply write either of these as

357

Tnm

358

.

Cartesian tensors

359

Flat space. A space or manifold is said to be flat if it is possible to find a coordinate system for which the metric tensor

360

gmn

361

is diagonal, with all diagonal elements equal to ± 1, otherwise the space is said to be curved.

The familiar Euclidean space in two or three dimensions is obviously flat, the diagonal elements then being all equal to + 1. We normally assume that the ordinary three dimensional space which we inhabit is flat, likewise in the special theory of relativity that the 4-dimensional "spacetime" is flat. In the general theory of relativity however this assumption must be abandoned, and we have to deal with the consequences of spacetime being curved.

It should not be assumed however that curved spaces never arise in elementary physics or mathematics. Take for instance the surface of a sphere, where it is natural to identify position

362

on the surface by spatial coordinates

363

( , φ)

364

; these are the second and third members of the set of three spherical polar coordinates

365

(r, , φ)366

, the first one having been set equal to a constant, viz. the radius of the sphere. The expression for the line element on the surface of a sphere is

367

368

dl 2a2(d 2 sin2 dφ2)369

(36)

370

where a is the radius of the sphere. No coordinate transformation can be found from

371

( , φ)

372

to new coordinates

373

(x1, x2) 374

such that the line element can be re-expressed in the form

375

376

dl 2(dx1)2 ±(dx2)2377

(37)

and so the space is by definition curved. Of course in this case the result is in accordance with our everyday notions regarding curvature. Geometry in a curved space is intrinsically different from that for flat spaces, e.g. parallel lines do eventually meet, and the sum of the angles in a triangle is not 180o.

Homogeneous coordinates. These are coordinates for which the metric tensor is diagonal with all diagonal elements taking the values +1. The metric expression is then

378

379

ds2(dx1)2 (dx2)2 (dx3)2 .... ..380

(38)

Clearly such coordinates can exist only if the space in question is flat. If this condition is satisfied, it must always be possible to find a set of homogeneous coordinates, since any minus signs in an expression for the metric can be transformed away by re-defining coordinates (albeit with imaginary values) with appropriate factors of i inserted.

Cartesian coordinates in the Euclidean plane or the Euclidean 3- space are obviously homogeneous.

381

Orthogonal transformations. These are linear transformations between two sets of homogeneous coordinates,

382

xm

383

and

384

xm

385

of the form

386

387

xm An

m xn Am388

(39)

389

where the coefficients

390

Anm

391

and

392

Am

393

are constants. Since the set

394

xm

395

are homogeneous,

396

397

ds2d xm d xm398

. (40)

399

But, from (39),

400

d xm Anm dxn401

(41)

402

and so

403

ds2Anm dxnA

m dx404

. (42)

405

But the coordinates

406

xm

407

are also homogeneous, and so the RHS of (42) is required to be equal to

408

dxpdxp409

. Hence

410

411

AnmAp

mdxndx412

(43)which requires

413

414

Anm Ap

m 1415

, n = p = 0, otherwise (44)

Cartesian tensors. If we are dealing with a flat space, homogeneous coordinates are an obvious preferred choice since they facilitate geometrical calculations. Any change of coordinates will normally involve orthogonal transformation equations satisfying equation (39). It is convenient therefore to define Cartesian tensors as quantities which transform according to the usual tensor transformation equations when the coordinates undergo an orthogonal transformation, i.e. as we pass from one set of homogeneous coordinates to another.

Note carefully that orthogonal transformation equations are a subset of all possible transformation equations. Therefore "Cartesian tensors" will not in general obey the tensor laws when subjected to an arbitrary coordinate transformation. On the other hand any (unrestricted) tensor automatically satisfies the definition of being a Cartesian tensor, since the conditions for the latter are a subset of the conditions for the former. We therefore have the seemingly paradoxical statement that "all tensors are Cartesian tensors, but not all Cartesian tensors are tensors".

Consider now the inverse transformation equations for an orthogonal transformation. Starting from (39) in the slightly modified form

416

417

xm A

m x Am418

, (45)

419

we have

420

Anm xm An

mAm x An

mAm421

(46)

422

423

xn An

m Am424

(47)using (44). So the inverse equations are

425

426

xnAn

m xm An427

(48)

428

where

429

An −AnmAm430

. (49)

The whole point of this analysis is now revealed: from equations (39) and (48) we see that

431

432

∂ xm

∂xnAn

m433

,

434

∂xn

∂ xmAn

m435

. (50)

The two differential coefficients involved in these equations are therefore equal; but we see, looking back at equations (7) and (11), that it was the presumed difference between them which was the whole basis of the distinction between covariant and contravariant tensors. Therefore if we restrict ourselves to Cartesian tensors, the distinction between covariant and contravariant tensors disappears, and there is no reason to continue to differentiate between indices used as superscripts and those used as subscripts. For convenience, subscripts are almost invariably the preferred choice in practice.

For example, in solid state physics we may require to calculate the electrical conductivity of a metallic crystal. In an isotropic medium such as a polycrystalline material the conductivity

436

equation

437

ji sEi

438

relates the components of the current density j to the components of the electric field E, with

439

the conductivity

440

s

441

taken to be constant. But in a single crystal the general relationship would be expressed as

442

ji siφEφ

443

where

444

siφ

445

is the conductivity tensor and the usual summation convention applies. In most textbooks on such topics the underlying assumption that the crystal or other system under consideration is e bedded in a flat space is taken for granted, an

N C McGill

446

Documents

Tensor Theory - University of St Andrewsstar-hz4/gr/Tensor.doc · Web viewGiven a tensor , we may form another tensor defined by (34) Note that . (35) The tensor may therefore be