ME 185 P. M. NAGHDI’S NOTES ON CONTINUUM MECHANICS · university of california at berkeley department of mechanical engineering me 185 p. m. naghdi’s notes on continuum mechanics

UNIVERSITY OF CALIFORNIA AT BERKELEY

DEPARTMENT OF MECHANICAL ENGINEERING

ME 185

P. M. NAGHDI’S NOTES ON CONTINUUM MECHANICS

1994

UNIVERSITY OF CALIFORNIADepartment of Mechanical Engineering

ME185

A List of General Referencesiiiiiiiiiiiiiiiiiiiiiiii

1. Cartesian Tensorsiiiiiiiiiiiiiii, by Sir Harold Jeffreys, Cambridge UniversityPress, 1952 [lst ed., 1931, 7th impression 1969].

2. Schaum’s Outline Series - Theory and Problems of ContinuumMechanics, by G. E. Mase, McGraw-Hill Book Co., 1970.

3. Chapters 1-7 of Mechanicsiiiiiiiii, by E. A. Fox, Harper & Row, 1967.

4. Continuum Mechanicsiiiiiiiiiiiiiiiiiii, by Y. C. Fung, Prentice-Hall Inc., 1969.

5. Continuum Mechanicsiiiiiiiiiiiiiiiiiii, by P. G. Hodge, Jr., McGraw-Hill, 1970.

6. Mechanics of Continuaiiiiiiiiiiiiiiiiiii, by A. C. Eringen, John Wiley & Sons,Inc., 1967.

7. Continuum Mechanicsiiiiiiiiiiiiiiiiiii, by P. Chadwick, G. Allen & UnwinLtd., London, 1976.

8. Continuum Mechanicsiiiiiiiiiiiiiiiiiii, by A.J.M. Spencer, Longman GroupLtd., London, 1980.

9. An Introduction to the Theory of Elasticityiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii, by R. J. Atkin andN. Fox, Longman Inc., New York, 1980.

Table of Contents

I. Kinematics .......................................................................................................................... 11. Bodies, configurations, motions, mass, and mass density. ..................................... 12. Deformation gradient. Measure of strain. Rotation and stretch. ............................ 133. Further developments of kinematical results. ......................................................... 204. Further developments of kinematical results. ......................................................... 205. Velocity gradient. Rate of deformation. Vorticity. ................................................. 216. Superposed rigid body motions. ............................................................................. 257. Infinitesimal deformation and infinitesimal strain measures. ................................. 358. The transport theorem. ............................................................................................ 41

II. Conservation Laws and Some Related Results ................................................................. 441. Conservation of mass. .............................................................................................. 45

Appendix to Sec. 1 of Part II. .................................................................................. 472. Forces and couples. Euler’s laws. .......................................................................... 493. Further consideration of the stress vector. Existence of the stress tensor andits

relationship to the stress vector. ............................................................................. 514. Derivation of spatial (or Eulerian) form of the equations of motion. ..................... 585. Derivation of the equations of motion in referential (or Lagrangian form). .......... 616. Invariance under superposed rigid body motions. .................................................. 647. The principle of balance of energy. ........................................................................ 71

III. Examples of Constitutive Equations and Applications. Inviscid and Viscous Fluidand Nonlinear and Linear Elasticity. .......................................................................... 741. Inviscid fluid. .......................................................................................................... 752. Viscous fluid. .......................................................................................................... 793. Elastic solids: Nonlinear constitutive equations. .................................................... 834. Elastic solids: Linear constitutive equations. ......................................................... 865. Some steady flows of viscous fluids. ...................................................................... 916. Some unsteady flow problems. ............................................................................... 967. Further constitutive results in linear elasticity and illustrative examples. .............. 988. Saint-Venant torsion of a circular cylinder. ............................................................ 102Appendix to Part III (Secs. 5-8). .................................................................................. 104

Supplements to the Main Text ................................................................................................ 110Appendix L: Some results from linear algebra ...................................................................... 150

ME185

Part I: Kinematics

1. Bodies, configurations, motions, mass, and mass density.

A body is a set of particles. Sometimes in the literature a body B is referred to as a mani-

fold of particles. Particles of a body B will be designated as X (see Fig. 1.1) and may be

identified by any convenient system of labels, such as for example a set of colors. In mechanics

the body is assumed to be smooth and, by assumption, can be put into correspondence with a

domain of Euclidean space. Thus, by assumption, a particle X can be put into a one-to-one

correspondence with the triples of real numbers X1,X2,X3 in a region of Euclidean 3-space E3.

The mapping from the body manifold onto the domain of E3 is assumed to be one-to-one, inver-

tible, and differentiable as many times as desired; and for most purposes two or three times

suffice.

Each part (or subset) S of the body B at each instant of time is assumed to be endowed with

a positive measure M(S), i.e., a real number > 0, called the mass of the part S and the whole body

is assumed to be endowed with a nonnegative measure M(B) called the mass of the body. We

return to a further consideration of mass later in this section.

Bodies are seen only in their configurations, i.e., the regions of E3 they occupy at each

instant of time t (−∞ < t < +∞). These configurations should not be confused with the bodies

themselves.

Consider a configuration of the body B at time t in which configuration B occupies a region

R in E3 bounded by a closed surface ∂R (Fig. 1.2). Let x be the position vector of the place

occupied by a typical particle X at time t. Since the body can be mapped smoothly onto a

domain of E3, we write

x = χh

(X,t) . (1.1)

In (1.1), X refers to the particle, t is the time, x [the value of the function χh

] is the place occupied

by the particle X at time t and the mapping function χh

is assumed to be differentiable as many

times as desired both with respect to X and t. Also, for each t, (1.1) is assumed to be invertible

so that

- 2 -

X = χh

−1 (x,t) , (1.2)

where the symbol χ−1 designates the inverse mapping. The mapping (1.1) is called a motion of

the body B. The description of motion in (1.1) is similar to that in particle mechanics and is

traditionally referred to as the material description.

During a motion of the body B, a particle X (by occupying successive points in E3) moves

through space and describes a path C ; the equation of this path is parametrically represented by

(1.1). The rate of change of place with time as X traverses C is called the velocity and is tangent

to the curve C. Similarly, the second rate of change of place with time as X traverses C is the

acceleration. Thus, in the material description of the motion, the velocity v and acceleration a

are defined by

v = x.

=∂t

∂χh

(X,t)hhhhhhh , a = x..

=∂t2

∂2χh

(X,t)hhhhhhhh . (1.3)

In the above formulae, the particle velocity v is the partial derivative of the function χh

with

respect to t holding X fixed; and, similarly, the particle acceleration a is the second partial

derivative of χh

with respect to t holding X fixed. Also, the superposed dot, which designates the

material time differentiation, can be utilized in conjunction with any function f and signifies dif-

ferentiation with respect to t holding X fixed.

Reference configuration. Often it is convenient to select one particular configuration and

refer everything concerning the body and its motion to this configuration. The reference

configuration need not necessarily be a configuration that is actually occupied by the body in any

of its motions. In particular, the reference configuration need not be the initial configuration of

the body.

Let κR be a reference configuration of B in which X is the position vector of the place occu-

pied by the particle X (Fig. 1.3). Then, the mapping from X to the place X in the configuration

κR may be written as

- 3 -

X = κR (X) (1.4)

which specifies the position vector X occupied by the particle X in the reference configuration

κR. The mapping (1.4) is assumed to be differentiable as many times as desired and invertible.

The inverse mapping is

X = κR

−1(X) . (1.5)

Then, the mapping (1.1) at time t from the place x to the particle X may be expressed in terms of

(1.5), i.e.,

x = χh

[κR

−1(X),t] = χ

hκR

(X,t) . (1.6)

The second of (1.6), which involves the function χh

κR(with a subscript κR, emphasizes that the

motion described in this manner represents a sequence of mappings of the reference

configuration κR. In the future, however, for simplicity’s sake we omit the subscript κR and

write (1.6)2 and its inverse as

x = χ(X,t) , X = χ−1(x,t) (1.7)

with the understanding that X in the argument (1.6) is the position vector of X in the reference

configuration κR. Returning to (1.6)2, we observe that for each κR a different function χh

κR

results; and the choice of reference configuration, similarly to the choice of coordinates, is arbi-

trary and is introduced for convenience.

A necessary and sufficient condition for the invertibility of the mappings of (1.7)1,2 is that

the determinant J of the transformation from X to x be nonzero, i.e.,

J = det(∂X

∂χhhhh) = det(∂XA

∂χihhhhh) ≠ 0 . (1.8)

- 4 -

Description of the motion. There are several methods of describing the motion of a body.

Here, we describe three but note that, because of our smoothness assumptions, they are all

equivalent.

It was noted earlier that the description utilized in (1.1) is the material description. In such

a description one deals with abstract particles X which together with time t are the independent

variables.

A description such as (1.7) in which the position X of X at some time, e.g., t = 0, is used as

a label for the particle X is called the referential description. In the referential description,

which is also known as Lagrangian, the independent variables are X and t. For some purposes,

it is convenient to display and to utilize (1.7) in terms of its rectangular Cartesian coordinates.

Thus, let EK be constant orthonormal basis vectors associated with the reference configuration

and similarly denote by ek constant orthonormal basis vectors associated with the present

configuration at time t. Then, the positions X and x referred to EK and ek, respectively, are

X = XK EK , x = xk ek (1.9)

where XK are the rectangular Cartesian coordinates of the position X and similarly xk are the

rectangular Cartesian coordinates of the position x. Referred to the basis ek, the component

form of (1.7)1 can be displayed as

xk = χk(XK,t) , (1.10)

where without loss in generality (since EK are constant orthonormal basis) we have also replaced

the argument X by its components XK. In the future we often display the various formulae both

in their "coordinate-free" forms, as well as their component forms.

The particle velocity and acceleration in a material description of motion are given, respec-

tively, by (1.3)1,2. The corresponding expressions for velocity and acceleration in the referential

description (1.7) will have the same forms as (1.3)1,2 but with the argument X replaced by X:

- 5 -

v = x.

=∂t

∂χhhh(X,t) , a = x..

=∂t2

∂2 χhhhh(X,t) . (1.11)

So far we have introduced two descriptions of motion. The third is called the spatial

description in which attention is centered in the present configuration, i.e., the region of space

occupied by the body at the present time t. In the spatial description, which is also known as

Eulerian, the independent variables are the place x and the time t. To elaborate further, consider

an entity f defined by

f = f(X,t) . (1.12)

Since (1.1) is invertible in the form (1.2), the function f(X,t) can be expressed as a different

function of x,t, i.e.,

f(X,t) = f(χh

−1(x,t),t) = f(x,t) . (1.13)

Moreover, the function f is unique. It is clear that with the use of the transformation of the form

(1.13), the velocity and acceleration in (1.3)1,2 can be expressed in terms of different functions of

x,t:

v = v(X,t) = v(x,t) = vk ek , a = a(X,t) = a(x,t) = ak ek . (1.14)

In the above spatial description, the spatial form f(x,t) on the right-hand side of (1.13) was

obtained from a representation (1.12) in which f is a function of particle X and t. Analogous

results hold if we start with f = f(X,t) and then use (1.7)2 to obtain

f = f(X,t) = f(x,t) . (1.15)

- 6 -

Material derivative. Consider again a function such as f(X,t) in (1.12). The partial

derivative of this function with respect to t holding the variable X fixed, denoted by f., is called

the material derivative of f. But f, with the use of (1.13)2, can also be expressed as a different

function of x,t. Thus, using the chain rule for differentiation, we have

f.

=∂t∂fhhh c

c X

=∂x∂fhhh .

∂t

∂χh

hhh +∂t∂fhhh , (1.16)

where on the left-hand side of the above we have temporarily emphasized that the variable X is

held fixed when calculating the partial derivative of f with respect to t. With the use of the first

of (1.3)1, we may rewrite (1.16)2 as

f.

=∂t

∂f(X,t)hhhhhhh cc X

=∂t∂fhhh +

∂x∂fhhh . v =

∂t∂fhhh + vk ∂xk

∂fhhhh (1.17)

which is the material derivative of f(x,t). The results of the type (1.15) and (1.17) are applicable

to any scalar-valued, vector-valued or tensor-valued field. In particular, consider the referential

form of the velocity vector v which by (1.11)1 and transformation of the type (1.15) is

v = v(X,t) = v(x,t) . (1.18)

Then, the acceleration or the material derivative of v where these are regarded as functions of x,t

are

a = v.

=∂t∂vhhh +

∂x∂vhhh v = a(x,t) . (1.19)

Material curve, material surface and material volume. Let a body B with material

points (or particles) X in a fixed reference configuration κR occupy a region R R in E3 bounded

by a closed surface ∂R R. Any subset SR of B in E3 will be designated by PR (⊆ R R) bounded

by a closed boundary surface ∂PR. Because of the nature of the smoothness assumption

- 7 -

imposed on the mapping (1.7)1 from the reference configuration κR to the current configuration κ

of B at time t, all curves, surfaces and volumes in κR are carried by the motion χ into curves, sur-

faces and volumes in the configuration κ. Thus, the region R R with the closed boundary surface

∂R R is mapped into a region R with corresponding closed boundary surface ∂R in κ. More-

over, the parts PR (⊆ R R) bounded by ∂PR is mapped into a part P (⊆ R) bounded by a closed

surface ∂P. The part P is occupied by the same material points (or particles) as those which

occupied PR and such a region or volume is called a material volume.

A surface in the reference configuration κR is defined by equations of the form

F(X) = 0 or X = X(U1,U2) , (1.20)

where U1,U2 are parametric variables on the surface. With the use of (1.7)2, the surface (1.20)

can be mapped into a corresponding surface

f(x,t) = F(χ−1(x,t)) = 0 or x(U1,U2,t) = χ[X(U1,U2),t] (1.21)

in the configuration κ at time t. The surface (1.21) is called a material surface, since its material

points (or particles) are the same as those of the surface (1.20).

A curve in the reference configuration may be regarded as the intersection of two surfaces

of the form

F(X) = 0 , G(X) = 0 , (1.22)

and is defined by

X = X(U) , (1.23)

where U is a parameter and can be identified with the arc length. The intersection of the surfaces

(1.22) can be mapped into a curve in the configuration κ which is the intersection of the surfaces

f(x,t) = F(χ−1(x,t)) = 0 , g(x,t) = G(χ−1(x,t)) = 0 (1.24)

- 8 -

or

x(U,t) = χ[X(U),t] . (1.25)

The curve (1.25) is called a material curve, since its material points (particles) are the same as

those of the curve (1.23).

Lagrange’s criterion for a material surface. A surface f(x,t) = 0 is said to be material if

and only if the material derivative of f vanishes, i.e.,

f.

=∂t∂fhhh +

∂x∂fhhh . v = 0 . (1.26)

First we write f(x,t) = F(X,t) with the use of (1.7)1 and suppose that f(x,t) is material.

Then, F(X,t) must be independent of t and hence F.

= 0 and this implies that f.

= 0.

Next, suppose that f.

= 0. Then, it follows that F.

= 0 and this implies that F must be

independent of t and a function of X only so that F(X) = 0 and after using (1.7)2 we have the

result that F(χ−1(x,t)) = f(x,t) = 0 is material.

In a similar manner, we may establish that the intersection of surfaces f(x,t) = 0 and

g(x,t) = 0 is a material curve if and only if

f.

=∂t∂fhhh +

∂x∂fhhh . v = 0 , g

.=

∂t∂ghhh +

∂x∂ghhh . v = 0 . (1.27)

Mass density. Before closing this section, consider again a configuration κ of the body B

at time t in which B occupies a region of E3 R bounded by a closed surface ∂R and recall that a

part (or a subset) of B will be denoted by S. The part S in a neighborhood of the place x in κ

occupies a region of space P (⊆ R) bounded by a closed surface ∂P. We introduce explicitly the

mass M(S) of the part and the mass M(B) of the whole body at time t. These measures are non-

negative and may be functions of t. Assuming that the measure M(S) is absolutely continuous,

- 9 -

then the limit

ρ =v→0lim

v

M(St)hhhhh (1.28)

exists, where v = v(P) is the volume of the region of space P. The scalar field ρ = ρ(x,t) is

called the mass density of the body in the configuration at time t. The mass density is nonnega-

tive and is a function of both x and t. The mass of the part S of the body and the mass of the

whole body, both at time t, can be expressed in terms of the mass density ρ:

M(St) = ∫P

ρ dv , M(Bt) = ∫R

ρ dv , (1.29)

where the subscript t attached to S and B emphasizes that the left-hand sides of (1.29)1,2 refer to

the masses of S and B in the configuration κ at time t and dv is an element of volume in this

configuration, the regions of integration being over P and R in E3. Similarly, we define the mass

density ρR

in a reference configuration and write

M(SR) = ∫P

R

ρR dVR , M(BR) = ∫R

R

ρR dVR , (1.30)

where dVR denotes an element of volume in the reference configuration and PR,R R (PR ⊆ R R)

are the regions of space occupied by the part and the whole body in the reference configuration.

If the reference configuration is identified with the initial configuration at time to, then instead of

(1.30) we write

M(So) = ∫Po

ρo dV , M(Bo) = ∫R o

ρo dV , (1.31)

where ρo and dV are, respectively, the mass density and the volume element in the initial

configuration while Po and R o (Po ⊆ R o) are the regions of space occupied, respectively, by the

part and the whole body in the initial configuation.

It should be emphasized that, at this stage in our development, the mass of a part of the

body and that of the whole body depend on the particular configuration occupied by the body.

- 10 -

2. Deformation gradient. Measures of strain. Rotation and stretch.

Henceforth we identify the particle (material point) X by its position vector X in a fixed

reference configuration, which (for convenience) we take to be coincident with the initial

configuration κo. The deformation gradient (also called displacement gradient) F relative to the

reference position is defined by

F = F(X,t) =∂X

∂χ(X,t)hhhhhhh , J = det F ≠ 0 . (2.1)

The components of F are designated by FiA or sometimes written as xi,A with a comma designat-

ing partial differentiation. They are given by

FiA =∂XA

∂χi(XB,t)hhhhhhhh (2.2)

and may be regarded as a 3×3 square matrix.

At a fixed time t, line elements dx are related to line elements dX in κo by

dx = F dX or dxi = xi,A dXA . (2.3)

The deformation gradient F in (2.3) describes the local deformation of a particle (material point)

whose position vector is X in κo. In other words, (2.3) is a linear transformation of a small

neighborhood of X from the reference configuration κo into the current configuration κ at time t.

Let the magnitudes of dX and dx be denoted by dS and ds, respectively; and let M and m

be the unit vectors in the directions of dX and dx, respectively. Then,

dX = MdS , M . M = 1 ,

(2.4)

dx = m ds , m . m = 1 ,

or in component forms

- 11 -

dXA = MA dS , MA MA = 1 ,

(2.5)

dxi = mi ds , mi mi = 1 .

Measures of strain.

In general, the material line element dX undergoes both stretch and rotation. The ratio ds/dS ,

denoted by λ, is called the stretch of the line element, i.e.,

λ =dSdshhh . (2.6)

It then follows from (2.3)-(2.5) that

m ds = dx = F dX = F M dS ,

(2.7)

or mi ds = dxi = xi,A dXA = xi,A MA dS .

Next, using (2.6), from (2.7) we have

λm = F M or λmi = xi,A MA . (2.8)

By taking the scalar product of each side of (2.8) with itself we obtain

λ2 = F M . F M = M . C M , (2.9)

where in writing (2.9)3 we have also defined the tensor C by

C = FT F . (2.10)

The component forms of the equations (2.9)-(2.10) are:

λ2 = CAB MA MB , (2.9a)

and

- 12 -

CAB = xi,A xi,B . (2.10a)

The symmetric tensor C (with components CAB) is called the right Cauchy-Green tensor.

Clearly, for a material line element which coincides with dX in the reference configuration

κo, the value of the stretch λ in the direction M can be calculated from (2.9) once C is known.

Alternatively, λ can be calculated in terms of the deformed line element dx in the current

configuration κ. To see this, since F is invertible, we rewrite (2.3) as

F−1 dx = dX or XA,i dxi = dXA , (2.11)

where the inverse function F−1 using (1.7)2 is defined by

F−1 =∂x

∂χ−1hhhhh or XA,i =

∂xi

∂χA

−1

hhhhh . (2.12)

Then, from (2.4), (2.5), (2.11) and (2.12) we have

M dS = dX = F−1 dx = F−1 m ds ,

(2.13)

or MA dS = dXA = XA,i dxi = XA,i mi ds ,

which with the use of (2.6) can be written in the form

λ−1 M = F−1 m or λ−1 MA = XA,i mi . (2.14)

The result (2.14) can also be obtained immediately by inverting (2.8). By taking the scalar pro-

duct of each side of (2.14) with itself we obtain

λ−2 = F−1 m . F−1 m = m . (F−1)T F−1 m = m . B−1 m , (2.15)

where in writing (2.15)3 we have also defined the tensor B by

- 13 -

B = F FT . (2.16)

The component forms of equations (2.15)-(2.16) are:

λ−2 = cij mi mi , (2.15a)

cij = bij

−1= XA,i XA,j , bij = xi,A xj,A , (2.16a)

where the notation cij designates the components of the inverse of the tensor B. The symmetric

tensor B whose components are given in (2.16a) is called the left Cauchy-Green tensor. The ten-

sors C and B represent, respectively, the referential (or Lagrangian) and the spatial (or Eulerian)

descriptions of strain. When the motion is rigid, both tensors become identity tensors, i.e.,

C = B = I, where I is the unit tensor.

For some purposes, it is convenient to employ relative measures of strain such that these

measures vanish when the motion is rigid. Recalling (2.9) and (2.15) we may write

λ2 − 1 = M . (C − I)M = (CAB − δAB)MA MB

(2.17)

1 − λ−2 = m . (I − B−1) m = (δij − bij

−1) mi mj

or equivalently,

λ2 − 1 = 2M . E M = 2EAB MA MB ,

(2.18)

1 − λ−2 = 2m . Ehh

m = 2eij mi mj ,

where

E =21hh(C − I) or EAB =

21hh(CAB − δAB) , (2.19)

Ehh

=21hh( I − B−1) or eij =

21hh(δij − cij) . (2.20)

- 14 -

The relative measure E defined by (2.19) is known as the relative Lagrangian strain, and the

corresponding relative spatial measure of strain Ehh

is defined by (2.20).

Rotation and stretch.

The deformation gradient F in (2.3) describes the local deformation of a material line element at

X from the reference configuration κo to the current configuration κ; it involves, in general, both

rotation and stretch. Now, since by (1.8) the deformation gradient F is nonsingular, using the

polar decomposition theorem it may be expressed in the polar forms

F = R U = V R or xi,A = RiB UBA = Vij RjA , (2.21)

where U and V are positive definite symmetric tensors called, respectively, the right and left

stretch tensors and R is the proper orthogonal tensor satisfying

R RT = RT R = I det R = 1 , (2.22)

or in component form

RiA RiB = δAB or RiA RjA = δij . (2.22a)

The effect of (2.21) is to replace the linear transformation (2.3) by two linear transformations:

Either by

dX′ = U dX and dx = R dX′ or dXB′ = UBA dXA and dxi = RiB dXB

′ , (2.23)

or by

dx′ = R dX and dx = V dx′ or dxj′ = RjA dXA and dxi = Vij dxj

′ . (2.24)

Associated with each of the stretch tensors U, V are three positive principal values (eigen-

values) and three orthogonal principal directions (eigenvectors). These will be discussed later.

- 15 -

Here, it should be emphasized that, in general, F, R, U, V are all functions of X and t and they

can vary from one material point to another during motion. In the remainder of this section, we

provide physical interpretations for the decompositions (2.23) and (2.24).

(i) Stretch followed by pure rotation. In this case, let the deformation of dX into dX′

involve only pure stretch and the deformation from dX′ into dx be one of pure rotation. By

(2.21)1 and (2.23), as well as (2.9)-(2.10), we have

dx . dx = R dX′ . R dX′ = dX′ . dX′ , (2.25a)

or

dxi dxi = RiA RiB dXA′ dXB

′ = δAB dXA′ dXB

′ = dXA′ dXA

′ , (2.25b)

and

C = FT F = (R U)T (R U) = U2 , (2.26)

or

CAB = xi,A xi,B = RiK UKA RiL ULB = δKL UKA ULB = UKA UKB , (2.27a)

λ2 = UKA UKB MA MB = CAB MA MB . (2.27b)

According to (2.25), the length of the line element dX′ is the same as the length of dx so that all

the stretching is represented by U in (2.26). However, line elements are generally also rotated

by the action of U. The part of the deformation described by R is a pure rotation.

(ii) Pure rotation followed by stretch. In this case, let the deformation of dX into dx′ be

one of pure rotation and the deformation from dx′ to dx involve only stretch and rotation. By

(2.21) and (2.24), as well as (2.15)-(2.16), we have

dx′ . dx′ = R dX . R dX = RT R dX . dX = dX . dX , (2.28)

- 16 -

or

dxi′ dxi

′ = RiA RiB dXA dXB = δAB dXA dXB = dXA dXA (2.29)

and

B = F FT = (V R) (V R)T = V2 , (2.30a)

λ2

1hhh = m . B−1 m , (2.30b)

or

bij = xi,A xj,A = Vir RrA Vjs RsA = Vir Vjr (2.31a)

λ−2 = (B−1)ij mi mj . (2.31b)

According to (2.28), the length of the line element dx′ is the same as the length of dX so that the

deformation described by R is one of pure rotation.

Keeping in mind the decomposition (2.21), it is evident from (2.26) and (2.30a) that

U = (FT F)1⁄2 and V = (F FT)

1⁄2 involve the square root operation which often is difficult to exe-

cute and this is the main reason for the choice of U2 = C and V2 = B as the strain tensors.

Moreover, since U2 is symmetric positive definite, it possesses a unique symmetric positive

definite square root; a parallel remark applies to the square root of V2. In this regard the expres-

sions for the square root of the stretch λ and its inverse, in the forms (2.9) and (2.15), are espe-

cially noteworthy.

- 17 -

Secs. 3 & 4. Further developments of kinematical results.

- 18 -

5. Velocity gradient. Rate of deformation. Vorticity.

Consider the spatial (Eulerian) form of the particle velocity in the form given by (1.14),

v = v(x,t) and define the spatial velocity gradient tensor by

L = grad v(x,t) (5.1)

or

∂x∂vhhh = vi,j ei ×b ej , vi,j =

∂xj

∂vihhhh , (5.2)

where vi,j are the components of L referred to the orthonormal basis ei and in recording (5.2) we

have used the same symbol for the function v and its value. The tensor L can be uniquely

decomposed into its symmetric part D and its skew-symmetric part W (see Appendix L, Sect. 8)

as follows:

L = D + W , (5.3)

where

D =21hh(L + LT) = dij ei ×b ej , W =

21hh(L − LT) = wij ei ×b ej . (5.4)

The corresponding component forms of (5.3) and (5.4) are:

vi,j = dij + wij , (5.3a)

where

dij =21hh(vi,j + vj,i) = dji , wij =

21hh(vi,j − vj,i) = − wji . (5.4a)

The tensor D with components dij is called the rate of deformation tensor and W with com-

ponents wij is called the vorticity or the spin tensor.

There is a one-to-one correspondence between a skew tensor and an axial vector (see

Appendix L, Sect. 12). The axial vector w which corresponds to the skew vorticity tensor W in

(5.3) or (5.3a) is defined by

W z = w × z or w =21hh curl v (5.5)

- 19 -

for every vector z. Referred to the basis ei, w reads as

w = wi ei (5.6)

and the corresponding component forms of (5.5)1,2 are:

wij = − εijk wk = εjik wk , wi =21hh εijk wkj =

21hh εijk vk,j , (5.5a)

where εijk stand for the components of the permutation symbol. Motion in which w = 0 is

called irrotational motion. It is usual in some books to identify curl v (instead of21hh curl v in

(5.5)2) as vorticity and denote it by w = curl v.

The foregoing results have been obtained starting from (5.1), where v = v(x,t). Alterna-

tively, we may begin by recalling the referential (Lagrangian) form of the particle velocity in the

form given by (1.14)1, where v = v(X,t). Then, the material derivative of the deformation gra-

dient (2.1) is

F.

= L F , (5.7)


xi,A

hhh.= x

.i,A = vi,A = vi,k xk,A . (5.7a)

Additional useful results may be obtained from (5.7) or (5.7a). Thus, the material derivative of

the Jacobian of transformation (1.8) is

J.

= det Fhhhh.

= J tr (L F F−1) = J tr L

= J tr D = J vk,k = J dkk (5.8)

or

J.

= J XA,i vi,k xk,A = J δik vi,k = J vk,k . (5.8a)

A motion which is volume-preserving, i.e., a motion corresponding to which the volume occu-

pied by any material region remains unchanged, is called isochoric. For an isochoric motion,

- 20 -

J = 1 , J.

= 0 => tr D = dkk = div v = vk,k = 0 . (5.9)

Next, consider the material derivative of the tensor C defined by (2.10):

C.

= FTFhhhh.

= F.

T F + FT F.

= FT( LT + L)F = 2FT D F , (5.10)


C.

AB = xi,A xi,B

hhhhhh.= xi,A

hhh.xi,B + xi,A xi,B

hhh.

= vi,k xk,A xi,B + vi,k xk,B xi,A

= (vi,k + vk,i)xk,A xi,B = 2dik xk,A xi,B (5.10a)

In order to discuss a physical interpretation of the tensors D and W (or equivalently the

axial vector w), we take the material derivative of (2.8) with respect to t and make use of (5.7) to

obtain

λ mhhhh.

= λ.

m + λ m.

= F Mhhhh.

= L F M = λ L m , (5.11)


λ mi

hhhh.= λ

.mi + λ m

.i = xi,A MA

hhhhhhh.= vi,k xk,A MA = λ vi,k mk . (5.11a)

The scalar product of (5.11) with the unit vector m, after also using the results m . m = 1,

m . m.

= 0, yields

λλ.hh = l n λ

hhhh.= L m . m = (D + W) . m ×b m = D . m ×b m , (5.12)

or

λλ.hh = l n λ

hhhh.= vi,k mi mk = (dik + wik)mi mk = dik mi mk . (5.12a)

- 21 -

It follows from (5.12) that the rate of deformation D determines the material time derivative of

the logarithmic stretch for a material line element having the direction m in the current

cofiguration κ. Further, using (5.12), the result (5.11)2 may be rewritten as

m.

= L m −λλ.hh m = W m + [D − (

λλ.hh) I]m , (5.13)

or from (5.11a)

m.

i = vi,j mj − (λλ.hh)mi = wij mj + [dij − (

λλ.hh)δij ] mj . (5.13a)

Now, let m be a principal direction of the eigenvectors associated with the rate of deformation

tensor D. Then,

D m = γ m or dij mj = γ mi (5.14)

where the associated scalar eigenvalue γ is

γ = D m . m = D . m ×b m = dij mi mj =λλ.hh (by (5.12a)) (5.15)

where we have also used (5.12). It follows from (5.14) and (5.15) that

D m =λλ.hh m or dij mj =

λλ.hh mi . (5.16)

Moreover, from combination of (5.13) and (5.16) we deduce that

m.

= W m = w × m or m.

i = wij mj = εikj wk mj . (5.17)

Thus, the material time derivative of a unit vector m (which represents the direction of a

material line in κ) in a principal direction of D is determined by (5.17). In other words, the axial

vector w (or the vorticity) is the angular velocity of the line element which is parallel to a princi-

pal direction.

(For further results, see the Supplement to Part I, Section 5.)

- 22 -

6. Superposed rigid body motions.

A rigid motion is one that preserves relative distance. In this section, we examine the effect

of such a motion on the various kinematic measures introduced in preceding sections. More

specifically, we consider here motions which differ from a prescribed motion, such as (1.7)1,

only by superposed rigid body motions of the whole body, i.e., motions which in addition to a

prescribed motion involve purely rigid motions of the body. For later reference, we recall here

the motion defined in Eq. (1.7)1, namely

x = χ (X, t) .

Consider a material point (or particle) of the body, which in the present configuration at

time t occupies the place x as specified by (1.7)1. Suppose that under a superposed rigid body

motion, the particle which is at x at time t moves to a place x+ at time t+ = t + a, a being a con-

stant. In what follows for all quantities associated with the superposed motion we use the same

symbols as those associated with the motion (1.7)1 to which we also attach a plus "+" sign.

Thus, we introduce a vector function χ+ and write

x+ = χ+ (X, t+) = χ+ (X, t) . (6.1)

It is clear that the difference between the two functions χ+ and χ+ is due to the presence of the

constant a in the argument of the former.

Similarly, consider another material point of the body which in the present configuration at

time t occupies the place y specified by

y = χ (Y, t) . (6.2)

Suppose that under the same superposed rigid body motion, the particle which is at y at time t

moves to a place y+ at time t+. Then, corresponding to (6.1), we write

y+ = χ+ (Y, t+) = χ+ (Y, t) (6.3)

Recalling the inverse relationship X = χ−1(x, t) and the analogous result for Y = χ−1(y, t), the

function χ+ on the right-hand sides (6.1) and (6.3) may be expressed as different functions of x, t

- 23 -

and y, t, respectively, i.e.,

x+ = χ+(X, t) = χh

+(x, t) , y+ = χ+(Y, t) = χh

+(y, t) . (6.4)

During the superposed rigid body motions of the whole body, the magnitude of the relative dis-

placement χh

+(x, t) - χh

+(y, t) must remain unaltered for all pairs of material points X, Y in the

body and for all t in some finite time interval [t1, t2]. Hence,

[χh

+(x, t) − χh

+(y, t)] . [χh

+(x, t) − χh

+(y, t)] = (x − y) . (x − y) (6.5)

for all x, y in the region R occupied by the body at time t.

Since x, y are independent, we may differentiate (6.5) successively with respect to x and y

to obtain the following differential equation for χh

+:

RJQ ∂x

∂χh

+(x, t)hhhhhhhh

HJP

TRJQ ∂y

∂χh

+(y, t)hhhhhhhh

HJP

= I , (6.6)

where x, y are any points in R and t is any time in the interval [t1, t2]. Since the tensor ∂χh

+(y,

t)/∂y is invertible, (6.6) can be written in the alternative form

RJQ ∂x

∂χh

+(x, t)hhhhhhhhHJP

T

=RJQ ∂y

∂χh

+(y, t)hhhhhhhh

HJP

−1

= QT(t) (6.7)

for all x, y ⊆ R and t in [t1, t2]. Thus, each side of the equation (6.7) is a tensor function of t, say

QT(t), and we may set

∂x

∂χh

+(x, t)hhhhhhhh = Q(t) for x ∈ R . (6.8)

Therefore we also have∂y

∂χh

+(y, t)hhhhhhhh = Q(t). >From (6.6) or (6.7), we conclude that

QTQ = I , det Q = ± 1 , (6.9)

and hence Q is an orthogonal tensor. The motion under consideration must include the particu-

lar case χh

+(x, t) = x and for this particular case Q = I and det Q = +1. Since the motions are con-

tinuous, we must alwaysiiiiii have

- 24 -

det Q = 1 (6.10)

and therefore Q(t) is a proper orthogonal tensor.

The differential equation (6.8) may be integrated to yield

χh

+(x, t) = a(t) + Q(t)x , (6.11)

where a(t) is a vector function of time. The last result is a general solution of (6.5) for χh

+(x, t).

For later convenience, we put

a(t) = c+(t+) − Q(t)c(t) , (6.12)

where c+, c are vector functions of t+ and t, respectively. With the use of (6.12), the (6.11) can

be expressed in the form

x+ = c+ + Q(x − c) , (6.13)

where

QT Q = Q QT = I , det Q = 1 . (6.14)

The transformation (6.13) is a rigid transformation since it is distance preserving, i.e.,

| x+ − y+ | 2 = (x+ − y+) . (x+ − y+) = Q(x − y) . Q(x − y)

= (x − y) . [QTQ(x − y)]

= (x − y) . I(x − y)

= (x − y) . (x − y) = | x − y | 2 . (6.15)

The transformation (6.13) also preserves the angle between any two nonzero vectors x − y and

x − z since

cosθ+ =| x+z+ |

x+ − y+hhhhhhh

=| x − y |

Q(x − y)hhhhhhhh . Q(x

=| x − y | | x − z |

(x − y) . [QTQ(x − z)]hhhhhhhhhhhhhhhhhhh

=| x − y |(x − y)hhhhhhh .

| x − z |(x − z)hhhhhhh = cosθ , (6.16)

- 25 -

where z is any place in the configuration occupied by the body at time t which moves to z+ at

time t+ as a consequence of the rigid body motion. Since (6.13) is both length and angle preserv-

ing, it follows that the element of length, element of area and element of volume all remain unal-

tered under superposed rigid body motions.

The component forms of the various results between (6.5) and (6.10) proceed as follows.

First, the component form of (6.5) is

[χh

i+(xj, t) − χ

hi+(yj, t)][χ

hi+(xj, t) − χ

hi+(yj, t)] (6.5a)

To record the results (6.6) in component form, we differentiate (6.5a) successively with respect

to xm and yn and obtain

∂xmj

∂χh

i+(xj t)hhhhhhhh , t)] = δim(xi − yi)

and

∂xm

∂χh

i+(xj, t)hhhhhhhhh

∂yn

∂χh

i(yj, t)hhhhhhhh (6.6a)

Next, multiply (6.6a) by∂χh

k+(yj, t)

∂ynhhhhhhhhh and use the chain rule for differentiation, i.e.,

∂yn

∂χh

i+(yj, t)hhhhhhhhh

∂χh

k+(yj, t)

∂ynhhhhhhhhh

to deduce

∂xm

∂χh

k+(xj, t)hhhhhhhhh =

∂χh

k+(yj, t)

∂ymhhhhhhhhh . (6.7a)

Since the left-hand side of (6.7a) is independent of yj and the right-hand side is independent of

xj, each side must be a function of time only and we have

∂xm

∂χh

k+(xj, t)hhhhhhhhh = Qkm(t) . (6.8a)

- 26 -

Integration of the differential equation (6.8a) yields

xk+ = χ

hk+(xj, t) = ak(t) (6.11a)

Since xj is any point, we must also conclude from (6.11a) that

yk+ = χ

hk(yj, t) = ak(t) + Qkm(t)ym

and hence

∂yn

∂χh

k+(yj, t)hhhhhhhhh = Qkn(t) .

Substitution of the last result and (6.8a) into (6.6a) results in the component for of

Qkm(t)Qkn(t) = δmn . (6.9a)

Also, for the convenience of the reader, we also record here the component forms of most of the

equations given above. Thus, for example, the component form of (6.1) is

xi+ = χi

+(XA, t+) = χi+(XA,t) . (6.1a)

Similarly, we identify the component form of (6.2), (6.3), etc., as

yi = χi(YA, t) , (6.2a)

yi+ = χi

+(YA, t+) = χi+(YA, t) , (6.3a)

xi+ = χi

+(XA, t) = χh

i+(xj, t) , yi

+ = χi+(YA, t) = χ

hi+(yj, t) , (6.4a)

[χh

i+(xj, t) − χ

hi+(yj, t)][χ

hi+(xj, t) − χ

hi+(yj, t)] = (xi − yi)(xi − yi) , (6.5a)

∂xm

∂χh

i+(xj, t)hhhhhhhhh

∂yn

∂χh

i+(yj, t)hhhhhhhhh = δmn , (6.6a)

∂xk

∂χh

i+(xj, t)hhhhhhhhh = Qik(t) , (6.8a)

Qim(t)Qin(t) = δmn , (6.9a)

- 27 -

χh

i+(xj, t) = ai(t) + Qik(t)xk , (6.11a)

ai(t) = ci+(t+) − Qik(t)ck(t) , (6.12a)

xi+ = ci

+ + Qij(xj − cj) , (6.13a)

Qim Qin = δmn = Qmi Qni , (6.14a)

(xi+ − yi

+)(xi+ − yi

+)k)

= δjk(xj − yj)(xk − yk) = (xj − yj)(xj − yj) , (6.15a)

| x+ − y+ | | x+ − z+ | cosθ+i+)

= Qim(xm − ym)Qin(xn − zn)

= δmn(xm − ym)(xn − zn)

= (xm − ym)(xm − zm)

= | x − y | | x − z | cosθ . (6.17a)

Recall

x+ = χ+(X, t+) = χ+(X, t) = χh

+(x, t) ,

or

x+ = χh

+(x, t+) = a(t) + Q(t)x . (6.11)

Then,

v+ = x. + =

dt+dx+hhhh =

∂t+∂χ+(X, t+)hhhhhhhhh

=∂t

∂χ+(X, t)hhhhhhhh

dt+dthhh

=∂t

∂χ+(X, t)hhhhhhhh (6.18)

Therefore

v+ = x. + =

dt+dhhh(a(t) + Q(t)x)

- 28 -

=dtdhhh(a(t) + Q(t)x)

dt+dthhh

= a.(t) + Q

.(t)x + Q(t)x

.

= a.(t) + Q

.(t)x + Q(t)v . (6.19)

Let

Ω (t) = Q.

(t)QT(t) . (6.20)

Then

Ω Q = Q.

QT Q = Q.

. (6.21)

But, from (6.9) QT Q = I, so that

Q.

T(t)Q(t) + QT(t)Q.

(t) = O ,

[Ω (t)Q(t)]TQ(t) + QT(t)Ω(t)Q(t) = O ,

QT(t)ΩT(t)Q(t) + QT(t)Ω(t)Q(t) = O ,

ΩT(t) + Ω(t) = O

or

ΩT(t) = −Ω(t) . (6.22)

Since Ω (t) is skew-symmetric, it posseses an axial vector

ωi = −21hh εijk Ωjk , (6.23)

or

Ωjk = −εjki ωi .

Alternatively, since Ω is skew-symmetric, there exists a vector-valued function ω such that for

any vector V,

- 29 -

Ω V = ω × V . (6.24)

Returing to (6.19), we may write

v+ = a.

+ Ω Q x + Q v . (6.25)

- 30 -

So far we have been discussing superposediiiiiiiii rigid body motions. In order to relate our results to

the more familiar equations of rigid body dynamics, let us now consider a rigid motion defined

by

x = X , (6.26)

x+ = a(t) + Q(t)x= a(t) + Q(t)X . (6.27)

The equation (6.27)1 may be solved for x in the form

x = QT(t)(x+ − a(t)) . (6.28)

Then

v+ = a.(t) + Q

.(t)x

= a.(t) + Ω Q(t)[QT(t)(x+

= a.(t) + Ω (t)[x+ − a(t)]

= a.(t) + ωωωω(t) × [x+ − a(t)] . (6.29)

Thus ωωωω(t) is recognized to be the angular velocity of the body.

The component form of the foregoing equations are listed below:

xi+ = ai(t) + Qij(t)xj , (6.11a)

vi+ = x

.i+ = a

.i(t) + Q

.ij(t)xj + Qij(t)x

.j , (6.19a)

Ωik(t) = Q.

im(t)Qkm(t) , (6.20a)

ΩikQkl = Q.

imQkmQkl = Q.

il , (6.21a)

Recalling

Q.

il Qjl + Qil Q.

jl = 0 ,

- 31 -

we obtain

Ωij(t) = −Ωji(t) , (6.22a)

ΩimVm = −εimp ωp Vm = εipm ωp Vm , (6.24a)

vi+ = a

.i + Ωim Qmn xn + Qim vm , (6.25a)

xi+ = ai(t) + Qim xm , (6.27a)

Qij xi+ = Qij ai + Qij Qim xm

Qij(xi+ − ai) = δjm xm = xj , (6.28a)

vi+ = a

.i + Ωij(xj

+ − aj)

= a.i + εijk ωj(xk

+ − ak) . (6.29a)

We return now to the more general considerations of superposed rigid body motions.

>From (6.25) and (6.11), it follows that

v+ = a.

+ Ω Q x + Q v

= a.

+ Ω Q[QT(x+ − a)] + Q v

= a.

+ Ω (x+ − a) + Q v , (6.30)


vi+ = a

.i + Ωij[xj

+ − aj] + Qijvj . (6.30a)

Then,

- 32 -

∂xm+

∂vi+

hhhh = Ωij ∂xm+

∂xj+

hhhh + Qij ∂xn

∂vjhhhh∂xm

+

∂xnhhhh

= Ωij δjm + Qij ∂xn

∂vjhhhh∂xm

+∂hhhh[Qln(xl

+ − al(t))]

= Ωim + Qij ∂xn

∂vjhhhhQln ∂xm+

∂xl+

hhhh

= Ωim + Qij ∂xn

∂vjhhhhQln δlm

= Ωim + Qij ∂xn

∂vjhhhhQmn

= Ωim + Qij Qmn ∂xn

∂vjhhhh , (6.31a)

or in direct notation

∂x+∂v+hhhh = Ω + Q

∂x∂vhhhQT . (6.31)

It follows from (6.31) and the results of section 5 that

D+ = Q D QT ,

W+ = Q W QT + Ω , (6.32)

or, in indicial notation,

dij+ = Qim Qjn dmn ,

wij+ = Qim Qjn wmn + Ωij . (6.32a)

In particular, suppose that the body in its superposed motion is at time t passing through the

configuration κ with angular velocity Ω, so that

Q = I , Q.

= Ω Q = Ω . (6.33)

- 33 -

Then (6.32)

D+ = D , W+ = W + Ω . (6.34)

- 34 -

7. Infinitesimal deformation and infinitesimal strain measures.

We recall from section 1 that a motion of the body is defined by

x = χ(X, t) or xi = χi(XA, t) . (7.1)

The deformation function χ in (7.1) may be written in terms of the relative displacement u (see

section 4), i.e.,

χ = X + u(X, t) or χi = δiA XA + ui(XA, t) . (7.2)

We further recall the relative strain measures

E =21hh(C − I) or EAB =

21hh(CAB − δAB) , (7.3)

and

e =21hh(I − c) or eij =

21hh(δij − cij) , (7.4)

where

C = FT F or CAB = FiA FiB (7.5)

and

c = (F−1)TF−1 = (F FT)−1 = B−1 or cij = FAi−1

FAj−1

. (7.6)

Also, the relative displacement gradient is defined by

H = F − I or ui,A = FiA − δiA . (7.7)

In order to obtain infinitesimal kinematical results from those of the finite theory discussed

in sections 1-5, we introduce a measure of smallnessiiiiiiii by a nonnegative function

ε = ε(t) =K

maxX ∈ R o

sup || u,K || (K = 1, 2, 3) , (7.8)

where sup stands for the supremum (or the least upper bound) of a nonempty bounded set of real

numbers. If f(u,K) is any scalar-, vector-, or tensor-valued function of u,K = u,1, u,2, u,3

defined in a neighborhood of u,K = 0 (K = 1, 2, 3) and satisfying the condition that there exists a

- 35 -

nonnegative real constant C such that || f(u,K) || < Cεn as ε → 0, then we write f = 0(εn) as ε → 0.

Thus, in particular, the components of u,K referred to either ei or EA are of 0(ε) as ε → 0, i.e.,

ui,A = 0(ε) and uB,A = 0(ε) as ε → 0 . (7.9)

The components EAB of the relative strain measure E in (7.3) can also be expressed in

terms of relative displacement gradients (7.9)2 in the form

EAB =21hh(uA,B + uB,A + uM,AuM,B) . (7.10)

Clearly, if terms of 0(ε2) as ε → 0 can be neglected, we can write (7.10) approximately as

EAB =21hh(uA,B + uB,A) = 0(ε) as ε → 0 . (7.11)

Next, consider the expression

FiA(δAj −∂XB

∂uAhhhhhδBj) (7.12)

and substitute for FiA = δiA +∂XA

∂uihhhhh from (7.7)2. Thus,

Expression (7.12) = (δiA +∂XA

∂uihhhhh)(δAj −∂XB

∂uAhhhhhδBj)

= δij +∂XA

∂uihhhhhδAj −∂XB

∂uihhhhhδBj + 0(ε2)

= δij + 0(ε2)

= δij , (7.13)

where in writing (7.13)4, terms of 0(ε2) as ε → 0 have been neglected. Hence, to the order ε2 as

ε → 0, we can identify the coefficient of FiA in (7.12) as the inverse of FiA, i.e.,

FAi−1 =

∂xiBi

∂XAhhhhh . (7.14)

- 36 -

With the use of (7.14) and the chain rule of differentiation, it can be readily verified that

∂xi

∂uAhhhh =∂XB

∂uAhhhhh∂xi

∂XBhhhhh∂XC

∂uBhhhhhδCi)

=∂XB

∂uAhhhhhδBi + 0(ε2) as ε → 0 (7.15)

and similarly

∂xj

∂uihhhh =∂XA

∂uihhhhh∂xj

∂XAhhhhh∂XB

∂uAhhhhhδBj)

=∂XA

∂uihhhhhδAj + 0(ε2) as ε → 0 . (7.16)

It follows from (7.15) and (7.16) that to 0(ε2), it is immaterial whether the partial derivatives of

the displacement field u is taken with respect to xi or XA so that∂x1

∂u1hhhh =∂X1

∂u1hhhh to 0(ε2) as ε → 0.

Hence in dealing with infinitesimal kinematics, it is not necessary to distinguish between

Eulerian and Lagrangian form of kinematical quantities.

>From (7.6)2 and (7.14), we have

cij = FAi−1

FAj−1 = (δAi −

∂XB

∂uAhhhhhδBi)(δAj −∂XC

∂uAhhhhhδCj)

= δij − (∂XB

∂uAhhhhhδBiδAj +∂XC

∂uAhhhhhδCj δAi) + 0(ε2) as ε → 0 . (7.17)

Hence, if terms of 0(ε2) as ε → 0 can be neglected, then

eij =21hh(δij − cij)

=21hh(

∂XB

∂uAhhhhh +∂XA

∂uBhhhhh)δiAδjBε → 0 , (7.18)

so that any differences between the two strain measures disappear upon linearization.

- 37 -

In view of the remark made following (7.15), to the order of ε2 we may display the com-

ponents of the relative displacement gradients either as uA,B or ui,j. For example, if we make the

latter choice, then we may write

ui,j = eij + ωij = 0(ε) ,

eij = 0(ε) , ωij = 0(ε) as ε → 0 , (7.19)

where eij and ωij are defined by

eij =21hh(ui,j + uj,i) , ωij =

21hh(ui,j − uj,i) . (7.20)

We observe here that ui,i = eii = 0(ε) is called the cubical dilatation and that ωii = 0.

We consider now the linearized version of the Cauchy-Green measure CAB, the stretch UAB

and the rotation RiA. The Cauchy-Green measure CAB defined by (7.5)2 is related to EAB by

CAB = δAB + 2EAB. But, when the approximation (7.11) is adopted, then CAB can be written as

CAB = UAC UCB = δAB + 2EAB

= δAB + 0(ε) as ε → 0 (7.21)

and from an examination of

(δAC + EAC)(δCB + ECB) = δAB + 2EAB + 0(ε2) as ε → 0 (7.22)

to the order ε2 we can identify UAB as

UAB = δAB + EAB = δAB + 0(ε) as ε → 0 . (7.23)

Also, by considering an expression of the form

UAB(δBCneglected,then.EQI(7.24)UAB−1 = δAB − EAB = δAB + 0(ε) as ε → 0 .

We now turn to the rotation tensor RiA. Recalling the polar decomposition theorem, the

components of the rotation tensor R can be written as

RiA = FiB UBA−1

. (7.25)

- 38 -

It then follows from (7.7), (7.24) and (7.11) that

RiA = (δiB + ui,B)[δBA −21hh(uA,B + uB,A)]

= δiA +21hh[ui,A − uA,BδiB]

= δiA +21hh(

∂XA

∂uihhhhh −∂xi

∂uAhhhh)

= δiA +21hh(

∂xA

∂uihhhh −∂xi

∂uAhhhh) , (7.26)

where terms of 0(ε2) as ε → 0 have been neglected and where use has been made of the result

(7.15). To the order of approximation considered, we may also write (7.26) as

R = I + Ω or Rij = δij + ωij , (7.27)

where

Ω = −ΩT =21hh(H − HT)j,i) , (7.28)

which is consistent with (7.20)2.

In the remainder of this section, we employ the linearization procedure discussed above and

obtain the linearized version of such expressions as

J = det F = det(xi,A) (7.29)

and its inverse. For this purpose, we first recall that J can be represented as

J =61hh εijk εLMN FiL FjM FkN . (7.30)

Substituting the components of the displacement gradient in terms of ui,A given by (7.7)2 in

(7.30) we get

- 39 -

J =61hhεijk εLMN(δiL + ui,L)(δjM + uj,M)(δk,N + ukN)

=61hhεijk εLMN[δiL δjM δkN + 3δiL δjM uk,N + 0(ε2)]

= 1 + δkN uk,N = 1 + uk,k = 1 + ekk as ε → 0 , (7.31)

where in obtaining (7.31)3 use has been made of the identities61hh = εijk εijk and εijk εijm = 2δkm

and where terms of 0(ε2) as ε → 0 have been neglected. The inverse of (7.31) is given by

J−1 = [1 + eii + . . . ]−1

= 1 − eii + (eii)2 + . . .

= 1 − eii as ε → 0 (7.32)

and again in writing (7.32) terms of 0(ε2) as ε → 0 have been neglected.

(For a discussion of the interpretation of the infinitesimal strain measures, see Supplement

to Part I, Section 7.)

- 40 -

8. The transport theorem.

Let S be an arbitrary part (or subset) of the body B and suppose that S occupies a region Po,

with a closed boundary surface ∂Po, in a fixed reference configuration. Similarly let P, with

closed boundary surface ∂P, be the region occupied by S in the configuration at time t.

Let φ be any scalar-valued or tensor-valued field with the following representations:

φ = φ(x, t) = φ(χ(X, t), t) (8.1)

and consider the volume integral

I =P∫ φ(x, t)dv =

Po

∫ φ(X, t)J dV , (8.2)

where we have used dv = J dV, J = det F > 0. Often we shall encounter an expression of the type

(8.2) and we need to calculate its time derivative dI/dt. Thus, we write

dtdIhhh =

dtdhhh

Po

∫ φ(X, t)J dV

=Po

∫ dtdhhh(φ(X, t)J)dV

=Po

∫ [∂t∂φhhh(X, t)J + φ(X, t)J

.]dV

=Po

∫ [∂t∂φhhh(X, t)J + Jvi,iφ(X, t)]dV

=Po

∫ [∂t

∂φ(X, t)hhhhhhh + φ(X, t)vi,i]JdV

=Po

∫ [φ.

+ φ(X, t)div v]JdV

=P∫[φ

.+ φ(x, t)div (8.3)

- 41 -

where in the fourth of (8.3) we have also used J.

= J vi,i and where

φ.

=∂t

∂φ(X, t)hhhhhhh =∂t

∂φ(x, t)hhhhhhh +∂xi

∂φ(x, t)hhhhhhhvi . (8.4)

It then follows that the time derivative of (8.2)1 is given by

dtdhhh

P∫ φ(x, t)dv =

P∫ [φ

.+ φ(x, t)div v]dv . (8.5)

>From the result (8.5) follows the various expressions given below:

dtdhhh

P∫ φ(x, t)dv =

P∫ [

∂t∂φ(x, t)hhhhhhh +

∂xi

∂φ(x, t)hhhhhhhvi

=P∫ [

∂t∂φ(x, t)hhhhhhh +

∂xi

∂(φ(x, t)vi)hhhhhhhhhh]dv

=P∫ ∂t

∂φ(x, t)hhhhhhhdv +∂P∫ φ(x, t)vinida

=P∫ ∂t

∂φ(x, t)hhhhhhhdv +∂P∫ φ(x, t)v . n da , (8.6)

where the divergence theorem has been used. Next, apply (8.6)4 to a fixediiii spatial region Ph

in the

configuration at time t and write

dtdhhh

Ph∫ φ(x, t)dv =

Ph∫ ∂t

∂φ(x, t)hhhhhhhdv +∂Ph∫ φ(x, t)v . n da

=∂t∂hhh

Ph∫ φ(x, t)dv +

∂Ph∫ φ(x,t)v . n da . (8.7)

The form (8.7)2 is another statement of the transport theorem. It is used when it is convenient to

focus attention on a fixed region of space Ph

at time t and consider the motion of the body over

this region. This form (8.7)2 can also be deduced directly from (8.6)4 without the introduction of

the fixed spatial region Ph

by the following line of argument: Since in the calculation of the first

- 42 -

integral in (8.6)4 the variable x (and hence the spatial region P) is fixed, the operator ∂/∂t com-

mutes with the volume integral in (8.6)4 over a region P and the form (8.7)2 with Ph

replaced by P

follows from (8.6)4. The commuting of ∂/∂t with the volume integral in (8.6)4 may be contrasted

with a similar calculation involving the volume integral in (8.3)2. In the latter integral, since X

(and hence the material region Po, is fixed, the operator d/dt commutes with the volume integral

in (8.3)2 over the region Po.

- 41 -

Part II: Conservation Laws and Some Related Results

- 42 -

1. Conservation of mass.

We recall from section 1 of Part I that the mass of any part P of the body, i.e., M is a con-

tinuous function of its volume and that there exists a scalar mass density ρ such that

M(St) =P∫ ρ dv , P ⊆ R , (1)

where ρ = ρ(x, t) depends on the particular configuration occupying the region of space R and dv

is the element of volume in the present configuration. A statement of the conservation of mass

for any material part of the body in the present configuration is as follows:

dtdhhh

P∫ ρ dv = 0 . (2)

By the transport theorem, (2) can be written asP∫ [ρ

.+ ρvk,k]dv = 0. The last result must hold for

alliii arbitrary parts P; hence, by the argument outlined previously, assuming that ρ is continuously

differentiable we have

ρ.

+ ρvk,k = 0 , (3)

where a superposed dot denotes material derivative, i.e., ρ.

=∂t∂ρhhh + vk ∂xk

∂ρhhhh . The result (3) can

also be expressed as

∂t∂ρhhh + (ρvk),k = 0 or

∂t∂ρhhh + div(ρv) = 0 . (4)

Equation (3) or (4) represents the local equation for conservation of mass. It is also referred to

as spatialiiiiii form of the "continuity equation."

Another form of the principle of conservation of mass may be stated as:

P∫ ρ dv =

Po

∫ ρo dV , (5)

where Po is the material part in the initial reference configuration, dV is the element of volume in

the reference configuration and ρo denotes the mass density in the reference configuration.

Using dv = J dV, J = det F = det xi,A, from (5) we get

- 43 -

ρ = J−1 ρo or ρ = (det F)−1 ρo , ρo = Jρ , (6)

which is the materialiiiiiii form of continuity equation.

- 44 -

Appendix to Sec. 1 of Part II

Theorem: If φ(x, t) is continuous in R and

P∫ φ dv = 0 (1)

for every part P ⊆ R, then the necessary and sufficient condition for the validity of (1) is that

φ = 0 in R . (2)

Proof: Let us first recall the definition of continuity.

Definition: A function φ(x, t) is said to be continuous in a region R if for every x ∈ R and every

ε > 0 there exists a δ > 0 such that | φ(x, t) − φ(ox, t) | < ε whenever | x − ox | < δ.

Sufficiency: If φ = 0 in R, then (1) is trivially satisfied.

Necessity: (Proof by contradiction). Suppose that there is a point ox ∈ P (⊆ R) at which

φo = φ(ox, t)>0. Then, by the continuity of φ, there exists a δ > 0 such that

| φ(x, t) − φo | <2

φohhh for all | x − ox | < δ . (3)

Now let Pδ be the region | x − ox | < δ and Vδ be the volume of this region so that

Vδ =Pδ

∫ dv > 0 . (4)

>From (3) and the fact that φo > 0 we have

φ >2

φohhh in Pδ . (5)

Now let P be a part which contains Pδ, then

Pδ

∫ φ dv >Pδ

∫ 1⁄2 φo dv = 1⁄2 φo Vδ > 0 . (6)

Since the result (6) contradicts (1), it follows that there can exist no point ox such that φ(ox, t) >

- 45 -

0. Similarly, we realize that if φo = φ(ox, t) < 0, then by the continuity of φ there exists a δ > 0

such that

| φ(x, t) − φo | < −2

φohhh for all | x − ox | < δ . (7)

Thus we can conclude that

φ(x, t) <2

φohhh for all x ∈ Pδ . (8)

Hence

Pδ

∫ φ dv <Pδ

∫ 1⁄2 φo dv = 1⁄2 φo Vδ < 0 . (9)

Since (9) contradicts (1), again it follows that no point ox exists for which φ(ox, t) < 0. Combin-

ing this result with that above we conclude that (2) is also a necessary condition for the validity

of (1).

- 46 -

2. Forces and couples. Euler’s laws.

We admit two types of forces, namely the body force per unit mass b and the surface force

(or contact force) per unit area) as follows:

b = b(X,t) = body force per unit mass,

(1)

t = t(X,t ; n) = contact force per unit area.

It is important to note that the surface force t depends on the orientation of the surface area (with

outward unit normal n) upon which it acts. Similar to the definitions (1) above, we could also

admit body couple per unit mass c and surface couple per unit area m, but these are ruled out in

the construction of classical continuum mechanics.

We also note here the following:

momentum per unit mass = v = x.

(2)

moment of momentum per unit mass = x × v = x × x.

We consider now the basic balance laws (also called conservation laws) for momentum and

moment of momentum (also called angular momentum) in the context of purely mechanical

theory. These balance laws, which are known as Euler’s laws, may be stated (in words) as fol-

lows:

IKLfor any part of the body

Rate of change of momentum MNO

IKLon the part

All external forces acting MNO

,

(3)

IKLfor any part of the body

Rate of change of moment of momentum MNO

IKLacting on the part

Moment of all external forces MNO

.

The above laws can be translated into mathematical forms both with respect to the current

- 47 -

configuration (Eulerian form) or with respect to the reference configuration (Lagrangian form).

In this section we limit the discussion to the Eulerian (or spatial) form of the balance laws (3).

Thus, remembering the definitions (2)1,2, the momentum for any part P of the body occupying a

region P with boundary surface ∂P and the moment of momentum for any part P are:

P

∫vdm ,

P

∫x × vdm , (4)

where dm = ρdv is the element of mass. Also, the total force acting on P and the moment of total

force acting on P are:

P

∫bdm +

∂P

∫ tda ,

P

∫(x × b)dm +

∂P

∫ (x × t)da . (5)

Keeping the expressions (4) and (5) in mind, the spatial (or Eulerian) form of the balance

laws (3) are:

dtdhhh

P

∫vdm =

P

∫bdm +

∂P

∫ tda , (6)

dtdhhh

P

∫x × vdm =

P

∫x × bdm +

∂P

∫ x × tda . (7)

In the next few sections of Part II we exploit the implications of (6) - (7) and also derive their

local forms.

- 48 -

3. Further consideration of the stress vector. Existence of the stress tensor and its rela-tionship to the stress vector.

Consider an arbitrary part of the material region of the body B which occupies a part P in

the present configuration at time t. Let P be divided into two regions P1, P2 separated by a sur-

face σ (see Fig. 3.1). Further, let ∂P1, ∂P2 refer to the boundaries of P1, P2, respectively; and let

∂P′, ∂P′′ be the portions of the boundaries of P1, P2 such that

∂P′ = ∂P1 ∩ ∂P ,

∂P′′ = ∂P2 ∩ ∂P . (1)

Thus, a summary of the above description is as follows:

P = P1 ∪ P2 , ∂P = ∂P′ ∪ ∂P′′ ,

∂P1 = ∂P′ ∪ σ , ∂P2 = ∂P′′ ∪ σ . (2)

Now recall the linear momentum principle, i.e.,

dtdhhh

P

∫ ρ x.

dv =P

∫ ρ b dv +∂P

∫ t(n)da (3)

or with the use of dm = ρ dv in the form:

dtdhhh

P

∫ x.

dm =P

∫ b dm +∂P

∫ t(n) da (4)

which holds for an arbitrary material region P ⊆ R. Application of (4) separately to the parts P1,

P2 and again to P1 ∪ P2 = P yields

dtdhhh

P1

∫ x.

dm −P1

∫ b dm −∂P1

∫ t(n)da = 0 , (5)

dtdhhh

P2

∫ x.

dm −P2

∫ b dm −∂P2

∫ t(n)da = 0 (6)

and

dtdhhh

P1∪P2

∫ x.

dm −P1∪P2

∫ b dm −∂P′∪∂P′′

∫ t(n)da = 0 . (7)

- 49 -

The stress vector t(n) in (5) acting over the boundary ∂P1 results from contact forces exerted

byii the material on one side of the boundary (exterior to P1) onii the material of the other side.

Similar remarks hold for the stress vector in (6) and in (7). We emphasize that the stress vector

in (5) over ∂P′ ∪ σ represents the contact force exerted on P1 across the surface, etc. The

appropriate normals associated with t(n) over the surface σ are equal and opposite in sign. To

elaborate, let n be the outward unit normal at a point on σ when σ is a portion of ∂P1. Then, the

outward unit normal at the same point on σ when σ is a portion of ∂P2 is −n.

>From combination of (5) and (6) and after subtraction from (7), we obtain the following

equation:

σ∫ [t(n) + t(−n)]da = 0 (8)

over the arbitrary surface σ. Assuming that the stress vector is a continuous function of position

and n, it follows that

t(n) = −t(−n) or t(x , t ; n) = − t(x , t ; −n) . (9)

According to the result (9), the stress vectors acting on opposite sides of the same surface at

a given point are equal in magnitude and opposite in direction. The result (9) is known as

Cauchy’s lemma.

Again we suppose that a body B is mapped into the present configuration κ, at time t, which

occupies the region R. Consider some interior particle Xo of B having position vector ox in R.

Construct at ox a tetrahedron T, lying entirely within R, and in such a way that the side i is per-

pendicular to the ei-direction (see Fig. 3.2) and inclined plane - with outward unit normal on -

falls in the octant where x1 , x2, x3 are all positive. We refer to the side i of T by Si and to the

inclined plane by S, respectively. Let h denote the height of the tetrahedron, i.e., the perpendicu-

lar distance from ox to the inclined face S, and let S denote the area of this face. Then, the areas

of the three orthogonal faces Si are

Si = S on . ei = S oni (10)

- 50 -

and the volume of the tetrahedron is

Vtet =31hh h S . (11)

Recalling that dm = ρdv, by virtue of the transport theorem and the conservation of mass

(see Sec. 1 of part II), the principle of linear momentum (4) reduces to

P

∫ x..

dm =P

∫ b dm +∂P

∫ t(n)da . (12)

Next, apply (12) to the material region T and obtain

T

∫ x..

dm =T

∫ b dm +∂T

∫ t(n)da . (13)

Observing that the surface integral in (13) represents contributions from all four boundary planes

of T, we have

∂T

∫ t(n)da =i=1Σ3

Si

∫ t(−ei)da +S

∫ t(on)da . (14)

Now according to Cauchy’s lemma in (9)

t(−ei) = − t(ei) (15)

and then with the use of (14) and (15), the linear momentum equation (13) can be rewritten in

the form

T

∫ (x..

− b)dm =S

∫ t(on)da −

i=1Σ3

Si

∫ t(ei)da . (16)

Now recall that dm = ρdv and that ρ is already assumed to be bounded (see Sec. 1 of Part I).

Further, we assume that the fields x..

and b are bounded. Then, since (by a theorem of analysis)

| ∫ f dv | ≤ ∫ | f | dv

where | f | denotes the absolute value of f, we obtain the following estimate for the integral on

the left-hand side of (16):

- 51 -

|

T

∫ ρ(x..

− b)dv | ≤T

∫ | ρ(x..

− b) | dv

=T

∫ K dv

= K*

T

∫ dv = K*

3Shhhh

where use has been made of the mean value theorem for integrals, we have set

K(x ,t) = | ρ(x..

− b) | and K* stands for some specific interior value of K in T. Hence, we may

conclude that there exists a fixed, positive real number K* such that

cT

∫ ρ(x..

− b)dv c ≤ K*

3Shhhh . (17)

Next, assume that the stress vector field is continuous in both x and n. Then, by the mean value

theorem for integrals, the two surface integrals in (16) yield:

i=1Σ3

∫Si

t(ei)da = ti*Si = ti

* S oni (18)

and

S

∫ t(on)da = t(on)* S , (19)

where in writing (18)2 use has been made of (10) and where ti* and t* stand for some specific

interior values (at the point ox) of the stress vectors on the respective faces Si and on the plane S

of T. Each stress vector ti* acts on that side of coordinate plane i which is associated with the

outward unit normal ei. It follows from (16), (17), (18) and (19) that (since K* is fixed)

31hh K*Sh ≥ |

T

∫ ρ(x..

(on)da −

i=1Σ3

∫Si

t(ei)da |

= | t(on)* S − ti

* S oni |

= S | t(on)* − ti

*oni | .

- 52 -

Therefore,

| t(on)* − ti

*oni | ≤

31hh K*h . (20)

Consider now a sequence of tetrahedra T1, T2, ..., of diminishing heights h1 > h2 > ..., each

member of which is similar to T with three mutually orthogonal faces having outward unit nor-

mals −ei and an inclined plane with outward unit normal on. Next, apply (20) to each member of

the sequence and, in the limit as h→ 0, obtain

| t(on)* − ti

*oni | ≤ 0 , (21)

where the stress vectors in (21) are evaluated at the point ox which is the common vertex of the

family of tetrahedra. It follows from (21) that

t(on)* = ti

*oni . (22)

Since (22) must hold at any point ox and corresponding to any direction on, without ambiguity,

we suppress henceforth the designation star, replace ox by x, on by n and write

t(n) = tini . (23)

We now define tki by

tki = ti. ek or ti = tki ek . (24)

Let tk denote the components of the stress vector t(n), i.e., tk = t(n). ek. Then, from (23) and (24)

we have

tk = t . ek = tini. ek = tkini , (25)

where in (25)1 we have written for simplicity t in place of t(n). Thus, if t = t(x , t ; n) is the stress

vector at the point x acting on a surface whose outward unit normal is n, then it is clear that

tki = tki(x , t) are defined at x and are independent of n. The relation (25) establishes the

existenceiiiiiiii of the set of quantities tki. It remains to show that tki are Cartesian components of a

second order tensor. To this end, consider the transformation of two sets of Cartesian coordi-

- 53 -

nates and recall

xk′ = ajkxj , ek′ = ajkej , nk′ = ajknj , etc. , (26)

where

aik = ei. ek′ . (27)

From

t = tkek = tk′ek′ = tkiniek = tki′ni′ek′ (28)

and with the use of ni′ = arinr , ek′ = ask es where ari are components of an orthogonal tensor, we

obtain (tsr − ariasktki′)nres . But es are linearly independent. Therefore,

(tsr − ariasktki′)nr = 0 , (29)

which holds for all n and the quantity in parentheses is independent of n. Hence,

tsr = askaritki′ , (30)

which establishes the tensor character of tki. The second order tensor whose Cartesian com-

ponents are defined by (24) is called the Cauchy stress tensor.

With reference to a rectangular Cartesian system of spatial coordinates, consider a paral-

lelepiped shown in Fig. 3.3 and recall the formula

ti = tkiek (31)

In (31), ti is the stress vector acting on the face whose outward unit normal is ei. For example,

on the front face of the parallelepiped in Fig. 3.3, the outward unit normal coincides with e1; and

thus, from (31) with i = 1, the stress vector on this face is given by

t1 = tk1ek = t11e1 + t21e2 + t31e3 .

Keeping the above in mind, an examination of the subscripts of the stress tensor tki in (31) easily

reveals that (i) the second index "i" refers to the stress vector ti on the face whose outward unit

normal is ei, while (ii) the first index "k" refers to the component of ti in the coordinate

- 54 -

directions. Utilizing this convention, which stems from the definition of (24), the tractions (i.e.,

the forces per unit area) are sketched in Fig. 3.3.

- 55 -

4. Derivation of spatial (or Eulerian) form of the equations of motion.

We derive in this section the spatial (or Eulerian) form of the equations of motion from the

integral balance laws (6) and (7) of section 2. We begin with the balance of linear momentum

(6) in section 2 of Part II, which for the present purpose can be written as (recall that dm = ρdv):

dtdhhh

P

∫ρvdv =

P

∫ρbdv +

∂P

∫ t(n)da . (1)

By the transport theorem discussed in Part I, the left-hand side of (1) is given by

LHS of (1) =

P

∫[(ρv)hhhh.

+ ρvdivv] dv

=

P

∫[ρ.v + ρv

.+ ρvdivv] dv

=

P

∫ρv.

+ v[ρ.

+ ρdivv] dv (2)

=

P

∫ρadv ,

where in arriving at the results, (2)4, we have also used the local conservation of mass given by

(3) of section 1. Next, we consider the surface integral in (1), substitute the relation (23) of sec-

tion 3, namely t = tini, and use the divergence theorem obtain

∂P

∫ tda =

∂P

∫ tinida =

P

∫ti,i dv , (3)

where a comma following a subscript indicates partial differentiation.

After substitution of the results (2) and (3) into (1), the resulting equation can be put in the

form

P

∫[ti,i + ρb − ρa] dv = 0 , (4)

- 56 -

which must hold for all arbitrary material volumes. Then, since the integrand is a continuous

function, by the usual procedure we arrive at the local form

ti,i + ρb = ρa . (5)

Next, we turn to the balance of moment of momentum (7) in section 2 of Part II, which can

be rewritten as

dtdhhh

P

∫(x × ρv) dv =

P

∫(x × ρb) dv +

∂P

∫ x × t(n) da (6)

Again by the transport theorem, the left-hand side of (6) can be shown to yield

LHS of (6) =

P

∫x × ρa dv . (7)

Similarly, the surface integral in (6) after substitution from (23) of section 3 and the use of the

divergence theorem gives

∂P

∫ x × t(n) da =

∂P

∫ (x × tini) da =

P

∫(x × ti),i dv

(8)

=

P

∫[(x,i × ti) + x × ti,i] dv

Introduction of (7) and (8) into (6) results in

P

∫[(x,i × ti) + x × (ti,i + ρb − ρa)] dv = 0 ,

or

P

∫x,i × ti dv = 0 , (9)

- 57 -

where in obtaining the last result we have also used (5). Since (9) must hold for all arbitrary

material volumes P and since the integrand is continuous, by the usual argument we conclude

that

x,i × ti = 0 ,

or equivalently

ei × ti = 0 . (10)

The two results (5) and (10) represent the consequences of the balance of linear momentum

and moment of momentum. Although the foregoing derivation in vectorial form leading to

equations of motion (5) and the restriction (10) is simple and attractive, it conceals some of the

features of the stress tensor introduced earlier in section 3. Thus, we now recall the relation (24)

for ti = tkiek and substitute this into (5) and (10) to obtain the alternative forms of the equations

of motion and the restriction on ti. With the use of the identities

ti,i = (tkiek),i = tki,iek ,

(11)

ei × ti = ei × tkiek = εikjtkiej ,

the component forms of (6) and (10) result in

tki,i + ρbk = ρak , (12)

tki = tik . (13)

Thus, it is easily seen that the vector equation (5) is equivalent to the three scalar equations of

motion (12). Similarly, the restriction (10) on the three vectors ti implies the symmetry of the

stress tensor tki. This last observation reveals the fact that the three scalar equations of motion

involve only six components of the stress instead of nine.

- 58 -

5. Derivation of the equations of motion in referential (or Lagrangian) form.

In previous developments, the stress vector t (and therefore the stress tensor tij) acting on P

are measured per unit area of surfaces in the present configuration at time t. In view of the

transformation x = χ (X , t) all surfaces in R can be mapped into corresponding surfaces in Ro

occupied by the body in the reference configuration. For some purposes, it is more convenient to

measure the stress vector and the stress tensor acting on P per unit area of surfaces in Po.

Corresponding to an arbitrary part P with boundary ∂P in the present configuration, we have a

part Po with boundary ∂Po in the reference configuration (see Fig. 1.1). We denote the outward

unit normal to ∂Po by N, where

N = NAeA . (1)

We denote the stress vector acting on ∂P, but measured per unit area of the surface ∂Po in the

reference configuration by p.

In terms of quantities measured in the reference configuration, the momentum principles

(i.e., linear and moment of momentum) are:

Po

∫ ρo a dV =Po

∫ ρo b dV +∂Po

∫ p dA , (2)

and

Po

∫ (x × ρo a)dV =Po

∫ (x × ρo b)dV +∂Po

∫ x × p dA (3)

where p = p(X , t ; N) depends on position, time and the unit normal N to ∂Po. By a procedure

similar to that used previously (Part II: Sec. 3), we can prove that

p = NApA , pA = piAei , (4)

p = piA NA ei , (5)

p = P N . (5a)

The vectors pA (A = 1,2,3) represent stress vectors acting across surfaces at a point P in the

- 59 -

present configuration κ which were originally coordinate planes XA = const. through the

corresponding point Po in the reference configuration and measured per unit area of these planes

(Fig. 5.1). Also, the components piA (i.e., of the stress tensor P) represent surface forces acting

in the present configuration, but measured per unit area of an XA-plane in the reference

configuration and resolved parallel to the ei-directions. Introducing (5) into (2), by a procedure

similar to that used previously (Part II, Sec. 4), we obtain

piA,A + ρobi = ρoai or pA,A + ρob = ρoa , (6)

Then, from (3), (5), and after using (6), we also obtain

piAxj,A = pjAxi,A . (7)

Introducing the notation sAB through

piB = xi,AsAB , (8)

it follows from (7) that

sAB = sBA . (9)

The last two are called, respectively, the symmetric and the nonsymmetric Piola-Kirchhoff

stress.

Also, it can be verified that

JT = P FT = F S FT or Jtik = xi,ApkA = xk,ApiA = xi,Axk,BsAB , (10)

which relates the Cauchy stress T to the stress tensors P and S. To verify the truth of (10), we

may start with

df = t da = p dA (11)

which represents an element of contact force acting on the current configuration of the body B in

terms of both the spatial and referential description of the stress vector. Using the derived rela-

tion between the area elements da and dA, namely n da = J F−T N dA, we have

- 60 -

df = t da = T n da

(12)

= p dA = P N dA .

Hence (J T F−T − P)N dA = 0 for all N and we may conclude that J T F−T − P = 0 or

J T = P FT . (13)

Recalling (8) or P = F S, we are led to the results (10)1.

The coordinate-free forms of the various results between (6)-(10) may be displayed as fol-

lows:

Div P + ρo b = ρo v.

, (6a)

P FT = F PT , (7a)

P = F S , (8a)

S = ST . (9a)

- 61 -

6. Invariance under superposed rigid body motions.

It was established previously that a particle X of a body, which in the reference

configuration is at X and at time t in κ occupies the place x = χ(X, t), under superposed rigid

motions of the whole body at a different time t+ = t + a occupies in κ+ the place x+ = χh

+(X, t+)

specified by

x+ = a + Q x or xi+ = ai + Qijxj . (1)

In (1), a is a vector-valued function of t, Q is a proper orthogonal tensor-valued function of t and

a is a constant. The vector a can be interpreted as a rigid body translation and Q as a rotation

tensor.

We have already studied how various kinematical quantities transform under superposed

rigid body motions and shall presently extend these considerations to the dynamical quantities

which appear in the equations of motion, namely

tij,j + ρbi = ρv.i , tij = tji . (2)

However, we first need to establish some further kinematical results:

(a) Further kinematical results:

Recall that an element of area dA of a material surface having outward unit normal vector

N in the reference configuration of a body is deformed at time t into the element of area da

whose outward unit normal vector is n. Then,

dak = J∂xk

∂XKhhhhhdAK , (3)

where

dak = da nk , dAK = dA NK . (4)

Under the motion χh

+ the element of area dA is deformed into da+ and N is deformed into n+ so

that

dak+ = J+

∂xk+

∂XKhhhhhdAK , (5)

- 62 -

and

dak+ = da+nk

+ . (6)

It follows from (1)2 that

∂xk

∂xi+

hhhh = Qij ∂xk

∂xjhhhh = Qijδjk = Qik (7)

and consequently

FiA+ =

∂XA

∂xi+

hhhhh j (8)

i.e.,

F+ = Q F . (9)

Therefore,

J+ = det F+ = det(Q F)

= J .

= +1 det F

= (det Q)(det F)

(10)

Also, by the law of conservation of mass,

ρo = ρJ = ρ+J+ , (11)

which together with (10)5 implies

ρ+ = ρ . (12)

Returning again to (1)2, we see that

Qik xi+ = Qik ai + QikQijxj

= Qikai + xk ,

= Qikai + δkjxj

(13)

- 63 -

xk = Qik(xi+ − ai) or x = QT(x+ − a) . (14)

Therefore

∂xi+

∂xkhhhh = Qik , (15)

and consequently

∂xk+

∂XKhhhhh =∂xj

∂XKhhhhh∂xk

+

∂xjhhhh =∂xj

∂XKhhhhhQkj (16)

which, in direct notation, reads

(F+)−1 = F−1QT (17)

and could also have been deduced by taking the inverse of both sides of (9).

Substituting the results (10)5 and (16)2 into (5) yields

dak+ = J

∂xj

∂XKhhhhhQkjdAK

= Qkjdaj , (18)

where (3) has been used in completing the last step. The combination of (18)2 with (4)1 and (6)

gives

da+nk+ = Qkjda nj . (19)

Now square both sides of (19) and remember that n and n+ are unit vectors and Q is a proper

orthogonal tensor so that

(da+)2nk+ nk

+ = (da+)2

= Qkj nj Qkl nl (da)2

= δjl nj nl (da)2

= nj nj (da)2 = (da)2 . (20)

- 64 -

Since area is always a positive number, it follws from (20)5 that

da+ = da (21)

and hence from (19) that

nk+ = Qkj nj or n+ = Q n . (22)

It is worthwhile noticing that we have been able to deduce the behavior of F, F−1, J, ρ, da

and n under superposed rigid body motions without making any additional physical assumptions

in (a).

(b) The stress vector and the stress tensor.

It is clear from the developments of section 2 that not all kinematical quantities transform

according to formulae of the type

u+ = Q(t)u , U+ = Q(t)U QT(t) (23)

under superposed rigid body motions, where u and U in (23) stand for a vector and a second

order tensor field, respectively. We investigate now the relationships between† t and t+ and

between T and T+. For convenience, we recall the formulae for ti and tij, i.e.,

t = t(X, t; n) , ti = tijnj , (24)

t+ = t+(X, t+; n+) , ti+ (25)

The mechanical fields which enter the linear and moment of momentum principles are the

stress vector and the body force per unit mass. We first consider the former, which in the motion

χ(X, t) is a vector field defined by (24)1. Consider now a second motion which differs from the

first only by superposed rigid body motions specified in the form (1). The second motion

imparts a change in the orientation of the body, so that the outward unit normal vector n to the

same material surface (in the configuration κ) becomes the unit normal vector n+ (in the

configuration κ+). The stress vector in the second motion takes the form indicated by (25)1 andhhhhhhhhhhhhhhh

† Our development follows Naghdi (1972, pp. 484-486). See also Green and Naghdi (1979).

- 65 -

we have already seen that the outward unit normal vector n+ transforms according to (22). The

geometrical properties of the transformation (1) were discussed in section 2. Keeping these in

mind and recalling that t is linear in n, it is reasonable to expect for the stress vector t+(X, t; n+)

(i) to have the same magnitude as t(X, t; n) and (ii) to have the same relative orientation to n+ as

t has relative to n. These remarks lead us to introduce the following assumption:

t+ = Q t , ti+ = Qijtj . (26)

We observe that (26) implies that

| t+ | 2 = t+ . t+ = ti+ ti

+ = Qij Qik tj tk = δjk tj tk = tj tj = | t | 2 (27)

and, recalling (22), we also have

t+ . n+ = ti+ ni

+ = Qij Qik tj nk = δjk tj nk = tj nj = t . n . (28)

Now

t+ . n+ = | t+ | | n+ | cosθ+ = | t+ | cosθ+ (29)

and

t . n = | t | | n | cosθ = | t | cosθ ,

which together with (27)7 and (28)5 lead to

cosθ+ = cosθ . (30)

The results (27) and (30) verify the motivating remarks made prior to (26) to the effect that (i)

the magnitude of t is the same as the magnitude of t+ and (ii) the magnitude of the angle between

t and the outward unit normal n is the same as the magnitude of the angle between t+ and n+.

We now turn attention to the stress tensor. >From the results (24)2, (25)2 and (22) we have

ti+ = tij

+ nj+ = tij

+ Qjk nk . (31)

But, by assumption (26), we also have

ti+ = Qij tj = Qij tjk nk . (32)

- 66 -

Combination of (31) and (32) yields

(tij+ Qjk − Qij tjk)nk = 0 . (33)

The last result must hold for all nk and the coefficient of nk is independent of nk. Therefore

tij+ Qjk − Qij tjk = 0 ,

tij+ Qjk Qmk − Qmk Qij tjk = 0 ,

tim+ = Qij Qmk tjk ,

and hence

tij+ = Qim Qjn tmn (34)

or, in direct notation,

T+ = Q T QT . (35)

A scalar, vector or tensor quantity which under superposed rigid body motions transforms as

(12), (22) or (35), respectively, is called objectiveiiiiiiii. Of course, not all physical quantities are

objective. For example

W+ = Q W QT + Ω

v+ = a.

+ Q.

x + Q x.

,

(36)

are clearly not objective.

(c) Body forces and accelerations.

The equations of motion for the two motions χ and χh+ of the body are, respectively, given

by

∂xj

∂tijhhhh = ρ(v.i − bi) ,

(37)

∂xj+

∂tij+

hhhh = ρ+(v.

i+ − bi

+) .

- 67 -

But

∂xj+

∂tij+

hhhh =∂xk

∂hhhh (Qim Qjn tmn)∂xj

+

∂xkhhhh

= Qim Qjn ∂xk

∂tmnhhhhh Qjk

= Qim δkn ∂xk

∂tmnhhhhh

= Qim ∂xn

∂tmnhhhhh , (38)

where (15) has been employed in (38)2. It follows from (38)4 and (37) that

ρ+(v.

i+ − bi

+) = Qim ρ(v.

m − bm) , (39)

which with the aid of (12) becomes

v.

i+ − bi

+ = Qim (v.

m − bm) (40)

or

v. + − b+ = Q(v

.− b) .

References

1. Naghdi, P.M. 1972 The theory of shells and plates. In S. Fl..ugge’s Handbuch der Physik,Vol. VIa/2 (edited by C. Truesdell), Springer-Verlag, 425-640.

2. Green, A.E. & Naghdi, P.M. 1979 A note on invariance under superposed rigid bodymotions. J. Elasticity 9, 1-8.

- 68 -

7. The principle of balance of energy.

We begin by assuming the existence of a scalar potential function per unit mass ε = ε(x,t),

called the specific internal energy. The internal energy for each part P in the present

configuration is defined by the volume integral

P

∫ ρ ε dv . (1)

We introduce a scalar field r = r(x,t) per unit mass per unit time, called the specific heat supply

(or heat absorption), as well as the heat flux across a surface ∂P (with the outward unit normal n)

by the scalar h = h(x,t;n) per unit area per unit time. The integrals

H =P

∫ ρ r dv −∂P

∫ h da , (2)

where ∂P is the boundary of P, defines the heat per unit time entering the part P in the present

configuration. The first term on the right-hand side of (2) represents the heat transmitted into P

by radiation and the second term the heat entering P by conduction. We also recall that the

kinetic energy for each part P in the present configuration is defined as

P

∫ 21hh ρ v . v dv . (3)

We record now the rate of work by the body and contact forces, i.e., R = Rb + Rc, where

Rb =P

∫ ρ b . v dv , Rc =∂P

∫ t . v da . (4)

In (4), the scalar b . v which occurs in the volume integral represents the rate of work per unit

mass by the body forces. Similarly, t . v is a scalar representing rate of work per unit area by the

contact force t. Each of the integrals in (4) is a rateiiii ofii workiiiii term and thus has the dimension of

"work per unit time".

With the foregoing background, the law of balance of energy may be stated as follows: The

rate of increase of internal energy plus kinetic energy is equal to the rate of work by body force

and contact force plus energies due to heat per unit time entering the body. Thus, we write

- 69 -

dtdhhh

P

∫ ρ[ε +21hh v . v] dv =

P

∫ ρr dv +P

∫ ρb . v dv

+∂P

∫ t . v da −∂P

∫ h da . (5)

We observe that the negative sign in front of the last integral in (5), as well as in (2), is in accord

with the convention that heat h = h(x,t;n) is assumed to flow into the surface in the direction

opposite to that of the outward unit normal to ∂P.

By application of (5) to an elementary tetrahedron, it can be shown that h(n) = −h(−n) or

h(x,t;n) = −h(x,t;−n), together with

h = qini = q . n , q = qiei . (6)

The last surface integral in (5) can then be transformed into a volume integral as follows (using

the divergence theorem):

∂P

∫ h da =∂P

∫ qini da =P

∫ qi,i dv . (7)

Similarly, using t = tkiniek and the divergence theorem

∂P

∫ t . v da =∂P

∫ (ti. v),i dv

=P

∫ [tki,ivk + tkivk,i] dv . (8)

By the transport theorem, the left-hand side of (5) can be expressed

dtdhhh

P

∫ ρ[ε +21hh v . v] dv =

P

∫ ρ(ε.

+ v . v.) + (ε +

21hh v . v)(ρ

.+ ρvm,m) dv

=P

∫ ρ(ε.

+ v . v.) , (9)

where in obtaining the last result conservation of mass has been used.

- 70 -

Now, substitute (7), (8) and (9) into (5) and rearrange the terms to obtain

0 =P

∫ ρ r − qi,i − ρ ε.

+ tkivk,i + vk[tki,i + ρ(bk − v.k)] dv ,

and after using also the equations of motion we get

P

∫ ρ r − qi,i − ρ ε.

+ tkivk,i dv = 0 , (10)

which holds for all arbitrary parts P. Hence, it follows that the local energy equation is

ρ r − qi,i − ρ ε.

+ P = 0 , (11)

where the stress power P is defined by

P = tkivk,i = tkidki , (12)

and where we have made use of the fact that vk,i = dki + wki and tki is symmetric.

- 71 -

Part III: Examples of Constitutive Equations and Applications

Inviscid and viscous fluid and nonlinear and linear elasticity

- 72 -

Examples of Constitutive Equations

Generally materials are classified as solids or fluids. However, we also encounter materials

which do not fall into either of these categories and which possess properties common to both

solids and fluids. A strict definition of these classifications is not attempted here, but we mention

here some of the main characteristic features of solids and contrast these with corresponding

features in the case of fluids. One of the main characteristic features of a solid is that it has a

reference state (or a reference configuration); and, in general, the deformed state (or

configuration) is not too far from the reference state. Moreover, regardless of the amount of

deformation, to some extent a solid remembers its reference state. A fluid, on the other hand,

does not possess a reference state (or configuration) and does not necessarily remain near such a

state in any motion.

1. Inviscid fluid

In hydrodynamics, an inviscid or an ideal fluid (including an ideal gas) is conceived of as a

medium possessing the following properties: (i) it cannot sustain shearing motion and (ii) the

stress vector (or the force intensity) acting on any surface in the fluid is always directed along

the normal to the surface, i.e., t is parallel to n. However, it is desirable to begin from a more

general viewpoint. Thus, we assume that an inviscid fluid is characterized by a constitutive

equation of the form

T = T(ρ) or tij = tij(ρ) . (1.1)

In (1.1), the response function T is a single-valued function of its argument and ρ is the mass

density of the fluid. Morever, it is instructive to consider an even more general assumption and

suppose the constitutive response function to depend also on the velocity vector†v. This is

because at this stage it is not clear why the explicit dependence on v (or even x) should be ruled

hhhhhhhhhhhhhhh† We rule out inclusion of the velocity gradient which gives rise to a shearing motion. The latter is not compati-

ble with the properties of an ideal fluid stipulated in the opening paragraph of section 1.

- 73 -

out. With this background, we begin by examining the more general constitutive assumption:

T = T(ρ, v) or tij = tij(ρ, vk) . (1.2)

It is convenient to recall here (from Parts I & II) that under a superposed rigid body motion

the position x moves to x+ in accordance with

x+(τ) = a(τ) + Q(τ)x , (τ ≤ t)

(1.3)

v+(τ) = a′(τ) + Q(τ)v(τ) + Q′(τ)x ,

where Q′ = Ω Q, Ω is the rigid body angular velocity, the temporary notation Q′(τ) is defined by

Q′(τ) =dτdhhhQ(τ) with τ being real. Consider now a special case of (1.3) for which at time τ = t,

a′(t) = a.(t) is a constant vector and Ω is a constant Ωo, such that at time t

a.(t) = c , Q

.(t) = Ωo Q(t) ,

where c is a constant vector and Ωo is a constant rigid body angular velocity tensor. We note for

later reference that when the above special motion represents a rigid body translational velocity,

we obtain

Q(t) = I , Q.

(t) = 0 ,

(1.4)

v+ = c + v .

Also, when the special motion involves only a constant rigid body angular velocity, we have

c = 0 , Q(t) = I , Q.

(t) = Ωo ,

(1.5)

v+ = v + Ωox , L+ = L + Ωo

We now examine (1.2)1 under a constant rigid body translational velocity of the form (1.4).

Since the constitutive equation (1.2)1 must hold for all motions, including a s.r.b.m., and since

on physical grounds the response function T (but not its arguments) must remain unaltered under

such superposed rigid body motions, we have

- 74 -

T+ = T(ρ+, v+) = T(ρ+, v + c) . (1.6)

Now under the special motion (1.4), the stress tensor transforms as

T+ = T . (1.7)

Substituting (1.2)1 and (1.6)2 with (1.7) we get

T(ρ, v) = T(ρ, v + c) , (1.8)

which must hold for all constant values c. It follows that (1.8) can be satisfied only if the depen-

dence of T on v is suppressed and we are left with the original assumption (1.1). It should be

evident that the same type of conclusion can be reached if we had initially included in (1.2) also

dependence on the current position x.

Once more we examine the constitutive equation (1.1) under a general s.r.b.m., and recall

that under such motions

T+ = Q T QT .

Introducing the assumption (1.1) into the above invariance requirement, we obtain

T(ρ+) = Q T (ρ) QT , (1.9)

which must hold for all proper orthogonal tensors Q. Since the condition (1.9) is also unaltered

if Q is replaced by − Q, it follows that (1.9) must hold for all orthogonal Q and not just the

proper orthogonal ones. Hence, T(ρ) is an isotropic tensor and can only be a scalar multiple of

the identity tensor I (see the Supplement), i.e.,

T = T(ρ) = −p(ρ)I or tij = tij = −p(ρ)δij . (1.10)

In (1.10), p is a scalar function called the pressure and the minus sign is introduced by conven-

tion, in order to conform to the traditional form in which the stress tensor T = −pI for an inviscid

fluid is displayed in the fluid dynamics literature.

Substitution of the result (1.10) into the relation t = T n (see Part II) yields

t = −p(ρ) n or tk = −p(ρ)nk , (1.11)

- 75 -

which shows that the stress vector acting on a surface in an inviscid fluid is always directed

along the normal to the surface. In most books on fluid dynamics, (1.11)1 is taken as a defining

property of an inviscid fluid, as was also indicated in the opending paragraph of this section. If

(1.11) instead of (1.1) is taken as a starting point, then with the use of t = T n we have

T n = −p(ρ)n or (T + pI)n = 0 which must hold for all outward unit normal n, and we may con-

clude that T = −pI in agreement with the earlier result (1.10) for the stress tensor in an inviscid

fluid.

We are now in a position to obtain the differential equation of motion for the determination

of the velocity field v in an inviscid fluid. Thus, after substituting (1.10) into the equations of

motion (see Part II), we arrive at

−grad p + ρb = ρv.

or p,i + ρbi = ρv.i . (1.12)

For a specified body force b, the vector equation (1.12)1 [or (1.12)2], together with the equation

for mass conservation represent four scalar equations for the determination of the four unknown

(p, v). The simple structure of (1.12) at first sight may be somewhat deceptive: these equations

are difficult to solve by analytical procedures (or even numerical methods). This is because of

the nonlinearity due to the convective terms in v..

- 76 -

2. Viscous fluid.

Although our main objective here is the development of linear constitutive equations for a

(Newtonian) viscous fluid, it is enlightening to start with a more general constitutive assumption.

For a viscous fluid we assume that the stress tensor T depends on the present value of the mass

density ρ, the velocity vector v and the velocity gradient tensor L. Thus, we write

T = Thh

(ρ, v, L) or tij = th

ij(ρ, vk, vk,l) , (2.1)

where Thh

is a single-valued function of its arguments and satisfies any other continuity or dif-

ferentiability conditions that may be required in subsequent analysis. Alternatively, for conveni-

ence and without loss in generality, we recall that L = D + W and rewrite (2.1) as

T = T(ρ, v, D, W) or tij = tij(ρ, vk, dkl, wkl) . (2.2)

The constitutive assumption (2.2)1 [or (2.2)2] must hold for all admissible motions, includ-

ing such superposed rigid body motions as those specified by (1.4) and (1.5). First, by a pro-

cedure parallel to that discussed in the previous section (see the development between Eqs.

(1.6)-(1.8) of section 1) we may suppress the dependence of the response function in (2.2)1 on v

and arrive at

T = T(ρ, D, W) or tij = tij(ρ, dkl, wkl) . (2.3)

Next, from the invariance requirement for T, namely T+ = Q T QT, we obtain the restriction

T(ρ+, D+, W+) = Q T(ρ, D, W) QT , (2.4)

which must hold for all proper orthogonal tensors Q. Consider now the special superposed rigid

body motion specified by (1.5). Under this special motion (2.4) becomes

T(ρ, D , W + Ωo) = T(ρ, D, W) , (2.5)

so that the response function T cannot depend on W and the constitutive assumption (2.3) is

reduced to

T = T(ρ, D) or tij = tij(ρ, dkl) , (2.6)

- 77 -

where T in (2.6) is a different function from that in (2.3). By imposing on (2.6) the invariance

requirement we arrive at

T(ρ+, D+) = Q T(ρ, D)QT . (2.7)

The restriction (2.7) must hold for all proper orthogonal tensors Q. Since the condition (2.7) is

also unaltered if Q is replaced by − Q, it follows that (2.7) holds for all orthogonal Q and not

just the proper orthogonal ones.

A viscous fluid characterized by the nonlinear constitutive equation (2.6)1 is known as the

Reiner-Rivlin fluid. We do not pursue further discussions of the general forms (2.6)1, but in the

remainder of this section consider in some detail the (Newtonian) linear viscous fluid as a special

case in which the response function T is a linear function of D with coefficients which depend on

ρ.

In the discussion of the linear viscous fluid, it is convenient, in what follows, to carry out

the details of the development of the constitutive equations in terms of their tensor components.

Thus, for a linear viscous fluid, we specifiy tij on the right-hand side of (2.6)2 to be linear in dkl

in the form

tij(ρ, dkl) = aij + bijkl dkl , (2.8)

the coefficients aij and bijkl are functions of ρ and satisfy the symmetry conditions

aij = aji , bijkl = bjikl , bijkl = bijlk , (2.9)

where the condition (2.9)3 arises from the symmetry of dkl and is assumed without loss in gen-

erality.

The response function (2.8) must satisfy the invariance condition (2.7) which in component

form is

tij(ρ, dkl+) = Qim Qjn tmn (ρ, dkl) , (2.10)

where in recording the above we have also used ρ+ = ρ in the argument of the left-hand side of

(2.10). Introducing (2.8) into (2.10) we obtain the restriction

- 78 -

aij + bijkl Qkp Qlq dpq = Qim Qjn (amn + bmnpq dpq) (2.11)

which holds for all arbitrary dkl. Considering separately the cases when dkl = 0 and dkl ≠ 0, it

follows from (2.11) that

aij = Qim Qjn amn , (2.12)

and

bijkl = Qim Qjn Qkp Qlq bmnpq ,

where in obtaining (2.12)2 we have made use of the orthogonality property of Qij. Now accord-

ing to the theorems on isotropic tensors (see the Supplement) the coefficients aij and bijkl are,

respectively, isotropic tensors of order two and four and hence must have the forms

aij = −pδij (2.13)

bijkl = λδij δkl + µδik δjl + γδil δjk ,

where p, λ, µ, γ are functions of ρ only and the use of −p instead of p in (2.13)1 is for later con-

venience and in order to conform to known expressions in the fluid dynamics literature. If we

appeal to the symmetry condition (2.9)2, then the right-hand side of (2.13)2 must be symmetric

in the pair of indices (ij) and (2.13)2 reduces to

bijkl = λδij δkl + µ(δik δjl + δil δjk) . (2.14)

It should be noted that the expression (2.14) implies a further symmetry restriction so that bijkl

satisfies

bijkl = bklij , (2.15)

in addition to (2.9)2,3. Substitution of (2.13)1 and (2.14) into (2.8) finally yields

tij = −pδij + λδij dkk + 2µdij , (2.16)

as the constitutive equation for a (Newtonian) linear viscous fluid. The scalar p in (2.16) is the

pressure and λ, µ are called the viscosity coefficients.

- 79 -

The differential equations of motion for a (Newtonian) linear viscous fluid in terms of the

velocity field can be obtained by substituting (2.16) into Cauchy’s stress equations of motion and

leads to

−p,i + λdkk,i + 2µdij,j + ρbi = ρv.i .

After recalling the kinematical relations

dkk = vk,k , dij =21hh(vi,j + vj,i)

and making use of vj,ij = vj,ji, the above differential equations of motion become

−p,i + (λ + µ) vk,ki + µvi,kk + ρbi = ρv.i , (2.17a)

or

−∇p + (λ + µ)∇(∇ . v) + µ∇2v + ρb = ρv.

. (2.17b)

The system of differential equations (2.17) and the mass conservation represent four scalar equa-

tions for the determination of the four unknowns, (p, v). It should be noted in the absence of

viscous affects (λ = µ = 0), (2.17) reduces to those for an inviscid fluid (see Eqs. (1.12)).

- 80 -

3. Elastic solids: Nonlinear constitutive equations.

We consider here general constitutive equations for an elastic solid in the context of the

purely mechanical theory.

A material is defined by a constitutive assumption. Moreover, an elastic material in the

purely mechanical theory may be defined in terms of a constitutive equation for the stress. We

return to this later but first we need to establish some preliminaries.

We recall the expression for the rate of work by the body and contact forces as follows (see

Sec. 7 of Part II):

R = Rb + Rc , (3.1)

Rb =P

∫ ρb . v dv , Rc =∂P

∫ t . v da . (3.2)

Also, the local equations of motion (in terms of the symmetric stress tensor of Cauchy) are:

tki,i + ρbk = ρv.k , tki = tik . (3.3)

Consider now the expression for Rc, use the divergence theorem to transform the surface

integral into a volume integral and make use of the equations of motion (3.3)1 to obtain:

Rc =∂P

∫ t . v da =∂P

∫ tkvkda =∂P

∫ tkinivkda =P

∫(tkivk),idv

=P

∫tki,ivkdv +P

∫tkivk,idv

=P

∫ρv.kvkdv −

P

∫ρbkvkdv +P

∫tkivk,idv . (3.4)

Then, after using (3.2)1

Rc =P

∫ρv.kvkdv − Rb +

P

∫tkivk,idv . (3.5)

From this result and (3.1) follows that

R =dtdhhhK(St) +

P

∫tkivk,idv , (3.6)

- 81 -

where K(St) =P

∫ 21hhρv . v dv is the kinetic energy for the material region occupied by P in the

present configuration κ .

We define an elastic body as one whose response depends on deformation gradient or on an

appropriate measure of strain. We assume the existence of a strainiiiii energyiiiiii or a stored energy per

unit mass ψ such that

tkivk,i = ρ ψ.

, (3.7)

and we define the (total) strain energy for each part P in the present configuration by

U(S t) =P

∫ ρ ψ dv . (3.8)

Then, by (3.6), (3.7), (3.8) the conservation of mass and the transport theorem we have

R(St) =dtdhhh[K(St) + U(St)] . (3.9)

We have thus proved the following theoremiiiiiii: The rate of work by contact and body forces is

equal to the sum of the rate of kinetic energy and the rate of the strain energy.

We now return to (3.7) and for an elastic body assume that the strain energy density

ψ = ψ′(F) [or ψ = ψ′(xi,A)].

Alternatively, we may assume that ψ depends on a measure of strain such as E and write

ψ = ψhh

(E) or ψ = ψhh

(EAB) . (3.10)

Now the material derivative of ψ is

ψ.

=∂EAB

∂ψhh

hhhhhE.

ABxi,B , (3.11)

where in obtaining (3.11) we have used (from Part I) the expressions

2EAB = CAB − δAB , CAB = xi,Axi,B ,

E.

AB = dikxk,Axi,B , C.

AB = 2E.

AB .

- 82 -

Introducing (3.11) into (3.7), we obtain

tkidki = ρ∂EAB

∂ψhh

hhhhh

or

(tki − ρ∂EAB

∂ψhh

hhhhhxi,Axk,B)dki = 0 , (3.12)

which must hold for all dki. Hence, we conclude that

tki = ρxi,Axk,B ∂EAB

∂ψhh

hhhhh or T = ρF∂E∂ψhh

hhhFT . (3.13)

If instead of the constitutive assumption (3.10), we begin with the alternative assumption

ψ = ψ(C) or ψ = ψ(CAB) , (3.14)

then in a manner analogous to the development that led to (3.14) we deduce that

tki = 2ρ xi,Axk,B ∂CAB

∂ψhhhhhh or T = 2ρ F∂C∂ψhhhhFT . (3.15)

- 83 -

4. Elastic solids: Linear constitutive equations.

We recall that in the context of infinitesimal kinematics discussed previously (Part I, sec-

tion 7), a function f is said to be of 0(εn) as ε → 0 if there exists a nonnegative real constant C

such that || f || < Cεn as ε → 0. Moreover, for infinitesimal deformation gradients, the distinction

between Lagrangian and Eulerian descriptions disappears; and that, to within 0(ε2), the relative

displacement gradient can be written as

ui,j = eij + ωij , (4.1)

where

eij =21hh (ui,j + uj,i) = 0(ε) , ωij =

21hh (ui,j − uj,i) = 0(ε) ,

as ε → 0 . (4.2)

We further assume that all derivatives (with respect to both coordinates and time) of kinematical

quantities, e.g., ui, eij, ωij, are also of 0(ε) as ε → 0.

We now introduce additional assumptions in order to obtain a completeiiiiiiii linear theory.

Thus, we assume that the fixed reference configuration (which was identified with the initial

configuration) is "stress-free", i.e., in this configuration the stress vector t and the body force b

are zero. We also assume that the body force b (or its components bi) and the stress vector t (or

its components ti) -- when expressed in suitable non-dimensional form -- as well as the com-

ponents of the stress tensor tki, all are of 0(ε) as ε → 0.

Recalling that for infinitesimal motion (see Eq. (7.32) of Part 1, section 7)

J−1 = 1 − eii = 1 − 0(ε) as ε → 0 , (4.3)

from the referential form of the conservation of mass, namely ρJ = ρo, we conclude that

ρ = ρo − 0(ε) as ε → 0 . (4.4)

Keeping in mind that t, b, u..

are all of 0(ε) as ε → 0, with the help of (4.4) and after the neglect

- 84 -

of terms of 0(ε2) as ε → 0, the equations of motion in the linear theory reduce to

tki,i + ρobk = ρov.k , (4.5)

tki = tik , (4.6)

where vi = u.i is the particle velocity, a comma denotes partial differentiation and the v

.i on the

right-hand side of (4.5) stands for∂t

∂vihhhh . It should be observed that each term in (4.5) is of 0(ε) as

ε → 0.

The equations of motion of the linear theory can also be obtained from the results of the

alternative derivation (see Part II, section 5). Thus, in terms of the non-symmetric Piola-

Kirchhoff stress tensor piA, the equations of motion are

piA,A + ρobi = ρov.i , (4.7)

piAxj,A = pjAxi,A , (4.8)

and we also recall the relations

piB = xi,AsAB , sAB = sBA , (4.9)

Jtki = xk,ApiA = xk,Axi,BsAB . (4.10)

Since J = 1 + 0(ε), xi,A = δiA + ui,A = δiA + 0(ε), XA,i = δAi + 0(ε) as ε → 0, we have

piA = JtkiXA,k

= tkiδAk + 0(ε2) as ε → 0 . (4.11)

Similarly, since now piA = 0(ε) as ε → 0, (4.9)1 after using (4.11) yields

sAB = piBXA,i

= piBδAi + 0(ε2) as ε → 0 . (4.12)

It follows from (4.11) and (4.12) that if terms of 0(ε2) as ε → 0 are neglected, then piA, sAB, tij

- 85 -

are all equal and the distinction between the Cauchy stress and the Piola-Kirchhoff stresses

disappears. Hence, the linearized version of the equations of motion (4.7) can also be written as

sAB,B + ρobA = ρou..

A . (4.13)

We now turn our attention to the linearized version of the constitutive equation discussed in

the previous section. We recall that in the context of nonlinear elasticity, the constitutive equa-

tions for the stress tensor may be expressed as

tki = ρxi,Axk,B ∂EAB

∂ψhh

hhhhh . (4.14)

In the linear theory discussed above the components of the stress tensor tki = 0(ε) as ε → 0.

Since the right-hand side of (4.14) must also be of 0(ε) as ε → 0 in the linear theory, for an elas-

tic body which is initially "stress-free", after linearization of (4.14), it will suffice to assume that

ψhh

is quadratic in the infinitesimal strain eij. To justify this stipulation, we proceed to obtain the

linearized version of (4.14). Recall that in the infinitesimal theory,

J = 1 + 0(ε)

xi,A = δiA + 0(ε)

eij = EABδAiδBj + 0(ε2) .

Using these expressions, we may linearize equation (4.14) to obtain

tki = ρxi,Axk,B dEAB

dψhh

hhhhh = ρoIL1 − 0(ε) MO

ILδiA + 0(ε) MO

ILδkB + 0(ε) MO dEAB

dψhh

hhhhh

= ρo dEAB

dψhh

hhhhh δiAδkB + 0(ε2) = ρo ∂emn

∂ψhh

hhhhh∂EAB

∂emnhhhhh δiAδkB + 0(ε2)

= ρo ∂emn

∂ψhh

hhhhh∂EAB

∂(EPQδPmδQn)hhhhhhhhhhhh δiAδkB

= ρo ∂emn

∂ψhh

hhhhh∂EAB

∂EPQhhhhh δPmδQnδiAδkB . (4.15)

- 86 -

Since EAB is a symmetric second order tensor, we have

∂EAB

∂EPQhhhhh =21hh

IJL ∂EAB

∂EPQhhhhh +∂EBA

∂EPQhhhhhMJO

=21hh I

LδPAδQB + δPBδQAMO

- 107 -

Supplements to the Main Text

ME 185 Class Notes

- 108 -

Supplements to the Main text in Class Notes

Rather than writing expressions in terms of components of vectors and tensors, it is some-

times convenient to use a coordinate free notation. In coordinate free notation, a vector is

referred to by the symbol v, whereas in indicial notation this vector is written in terms of its

components vi with respect to a basis ei. The values of the components vi are dependent on the

choice of basis. Similarly, a tensor may be referred to in coordinate free notation as T or in indi-

cial notation as Tij with respect to a given basis. Notice that tensor multiplication is not commu-

tative; however, the components of a tensor in indicial notation are commutative. The following

relationships between indicial and coordinate free notation are sometimes useful:

Coordinate Free Notation Indicial Notation

________________________ _________________

v (a vector) vi

A (a second order tensor) Aij

Av Aijvj

ATv Ajivj

AB AijBjk

ATB AjiBjk

ABT AijBkj

Examples from kinematics:

C = FTF CAB = FiAFiB

B = FFT Bij = FiAFjA

U2 = C UADUDB = CAB

Eigenvalues, eigenvectors, and characteristic equations.

In general, a second order tensor T operating on a vector v results in another vector (say) w

which is not necessarily in the same direction and does not necessarily have the same magnitude

- 109 -

as v. This operation is expressed symbolically as

Tv = w or Tijvj = wi . (1)

However, if a nonzero vector v is such that when T operates on it the resulting vector w is paral-

lel to v, then v is called an eigenvector of the tensor T.

In this case, we can write

Tv = βv or Tijvj = βvi , (2)

where the scalar β is called an eigenvalue corresponding to the eigenvector v. From (2), we can

write

(T − βI)v = 0 or (Tij − βδij)vj = 0 , (3)

where I is the identity tensor. Equation (3) represents three equations which must be satisfied by

all three components of v in order that v be an eigenvector of the tensor T. According to a

theorem of linear algebra, a homogeneous system of three equations for three unknowns (such as

equation (3)), has a non-trivial solution if and only if the determinant of the matrix of

coefficients vanishes, i.e. if

det(T − βI) = 0 or det(Tij − βδij) = 0 . (4)

Equation (4) represents a cubic equation in β, which may be written as

φ(β) = − β3 + I1β2 − I2β + I3 = 0 . (5)

Equation (5) is called the characteristic equation of T and I1 , I2 , and I3 are called the principal

invariants of T. Expressions for the principal invariants can be obtained by solving the system

(4), giving

- 110 -

I1 = trT I1 = Tii

I2 =21hh(tr2T − trT2) or I2 =

21hh(TiiTjj − TijTji) (6)

I3 = detT I3 = det(Tij) =61hhεijkεlmnTlTjmTkn .

In general equation (5) has three roots which are not necessarily real and are not necessarily dis-

tinct. However, in the special case where T is symmetric, we can prove some additional

theorems.

Theorem 1: If T is symmetric, then all the roots of the characteristic equation (5) are real.

Proof: We will prove this theorem by contradiction. Suppose first that the roots of (5) are not

real. According to a theorem of algebra, the complex roots of a polynomial with real

coefficients must occur in conjugate pairs. Therefore, if there exists a complex root β(1) of

(5), then there must also exist a root β(2) of (5) which is a conjugate of β(1). Thus β(1) and

β(2) must be of the form

β(1) = µ + iγβ(2) = µ − iγ , (7)

where i2 = −1. The eigenvectors which correspond to the eigenvalues β(1) and β(2) are of

the form

v(1) = αα + iδδ vj(1) = αj + iδj

or (8)

v(2) = αα − iδδ vj(2) = αj − iδj .

Since equation (2) must hold for all eigenvectors and their corresponding eigenvalues, we

can write

- 111 -

Tv(1) = β(1)v(1) Tijvj(1) = β(1)vi

(1)

or (9)

Tv(2) = β(2)v(2) Tijvj(2) = β(2)vi

(2) .

Multiply (9)1 by vi(2) and (9)2 by vi

(1) to obtain (in indicial notation)

β(1)vi(1)vi

(2) = Tijvj(1)vi

(2)

β(2)vi(2)vi

(1) = Tijvj(2)vi

(1) = Tjivj(2)vi

(1) = Tijvj(1)vi

(2) . (10)

In obtaining (10)2, we have made use of the facts that Tij is symmetric and that i and j

are dummy variables. Subtracting (10)2 from (10)1 gives

(β(1) − β(2))vi(1)vi

(2) = 0 . (11)

Substituting (7) and (8) into (11), we can write

0 = 2γi(αj + iδj) (αj − iδj) = 2γi(αjαj + δj δj) . (12)

The term in parenthesis in (12) is always greater than zero since it is the sum of two

squared real numbers. Therefore, we must have γ = 0. Equation (7) then implies that

β(1) and β(2) are both real, which is a contradiction of our initial assumption. We can

therefore conclude that if T is symmetric, all the roots of its characteristic equation are

real.

Theorem 2: The eigenvectors which correspond to distinct eigenvalues of a symmetric tensor

are orthogonal.

Proof: Suppose that v(1) and v(2) are eigenvectors of a symmetric tensor T corresponding to dis-

tinct eigenvalues β(1) and β(2), i.e. β(1) ≠ β(2). As in the previous proof, we can show that

- 112 -

equation (11) must be satisfied. Since β(1) ≠ β(2), we must have

v(1) . v(2) = 0 or vi(1)vi

(2) = 0 . (13)

Equation (13) is exactly the definition of orthogonal vectors, thus the eigenvectors v(1)

and v(2) are orthogonal.

Theorem 3: Every second order symmetric tensor has three linearly independent principal

directions.

Proof: For the proof of this theorem, see Advanced Engineering Mathematicsiiiiiiiiiiiiiiiiiiiiiiiiiiiiii, Wylie, p. 541.

Remark: In the preceding development, we can, without loss of generality, replace the eigen-

vector v with a unit vector m, where

m = v/ | v | . (14)

The vector m is called the normalized eigenvectoriiiiiiiiiiiiiiiiiii.

Using theorems 1, 2 and 3 and the previous remark, we can state that any symmetric tensor T

has three real eigenvalues β(1) , β(2) , and β(3) and three orthonormal eigenvectors

m(1) , m(2) , and m(3). It can be shown, using a theorem of linear algebra, that since T has three

linearly independent eigenvectors, it can be transformed into a diagonalized form, denoted by

Λ, using the transformation

T′ij = Λij = amianjTmn ,

where

aij = ei. m(j) and xi = aijx′j . (15)

Here the primed quantities correspond to the transformed system. It can also be shown that the

- 113 -

diagonal members of Λ are simply the eigenvalues, so that

(Λij) =

RJJJQ0

0

β(1)

0

β(2)

0

β(3)

0

0HJJJP

. (16)

Quadratic forms and positive definiteness.

Associated with any symmetric second order tensor T is a scalar valued function of a vector

Q(v), where v is an arbitrary vector, defined by

Q(v) = (Tv) . v = Tij vjvi = v . (Tv) . (17)

We call Q(v) a quadratic formiiiiiiiiiiii. The tensor T is said to be positive definiteiiiiiiiiiiiii if for all

v ≠ 0 , Q(v) > 0 .

Theorem 4: A symmetric second order tensor T is positive definite if and only if all the eigen-

values of T are positive.

Proof: Again using a theorem from linear algebra, one can conclude that since the eigenvectors

m(1) , m(2) , and m(3) are orthogonal in a 3-dimensional space, then any arbitrary vector w

in that space may be represented as a linear combination of m(1) , m(2) , and m(3). Thus, we

can write

w = a1m(1) + a2m(2) + a3m(3) . (18)

The eigenvalues of T corresponding to m(1) , m(2) , and m(3) are β(1) , β(2) , and β(3), as

before, so equation (2) gives

Tm(i) = β(i)m(i) . (19)

- 114 -

Using (18) and (19), we can write

Tw = a1β(1)m(1) + a2β(2)m(2) + a3β(3)m(3) . (20)

Substituting (20) into (17) gives

Q(w) = Tijwjwi = a12β(1) + a

22β(2) + a

32β(3) . (21)

Necessity: If T is positive definite, then Q(w) > 0 for all w. Since a1 , a2 , and a3 are all real

numbers, the square of each of these terms must be positive (or zero). If w = a1m(1), then

Q(w) > 0 implies from (21) that β(1)al2 > 0, or that β(1) > 0. Since w is arbitrary, a similar

argument shows that β(2) and β(3) are both greater than zero. Thus, if T is positive definite,

then all the eigenvalues of T are positive.

Sufficiency: If all the eigenvalues of T are greater than zero, then by (21) it follows that

Q(w) > 0 for all w ≠ 0. Therefore, T is positive definite if all the eigenvalues of T are posi-

tive.

Theorem 5: If B is a positive definite symmetric tensor, then there exists a unique symmetric

positive definite square root T of B such that T2 = B.

Remarks on the calculation of the square root of B:

Let Bik = aipakqΛpq (see (15)), where Λ is a diagonal matrix as in equation (16). Define a

tensor Λ 21hh

pq as

(Λ 21hh

)pq =

RJJJQ0

0

√dddβ(1)

0

√dddβ(2)

0

√dddβ(3)

0

0HJJJP

,

so that

- 115 -

Λ 21hh

pqΛ 21hh

qs = Λps . (22)

Defining Tij = aipajqΛ 21hh

pq, we can show that

TijTjk = aipajq Λ 21hh

pqajmaknΛ 21hh

mn = aipajqajmaknΛ 21hh

pqΛ 21hh

mn

= aipaknδqm Λ 21hh

pqΛ 21hh

mn = aipaknΛpn = Bik

or

T2 = B . (23)

Theorem 6: Polar decomposition theorem (see class notes section I/3)

Any non-singular (invertible) tensor F may be uniquely expressed as

F = RU = VR , (24)

where R is orthogonal and U and V are symmetric positive definite tensors.

Proof: Existence

Define U = (FTF) 21hh

and R = FU−1. We first show that U is symmetric postive definite.

To show that U2 is positive definite, consider the quadratic form

Q(v) = v . U2v = v . FTFv = Fv . Fv .

Since F is invertible, then Fv ≠ 0 if v ≠ 0. Assuming that F and v are both real, it fol-

lows that Q(v) > 0 if v = 0. Thus, U2 is positive definite. Also, since FTF is sym-

metric, then U2 must be symmetric. From theorem 5, we know that U2 must have a

unique positive definite square root U, where U is also symmetric. To show that R is

orthogonal, note that

RTR = (U−1)TFTFU−1 = U−1FTFU−1 = U−1UUU−1 = I .

- 116 -

Using the definitions of U and R, we can write

RU = FU−1U = F .

- 117 -

Thus, we have shown the existence of the decomposition (24).

Uniqueness

To show uniqueness of the decomposition (24), assume first that two such decomposi-

tions exist, so that F = RU = R*U*. The star is used here to denote an alternative

decomposition. Notice that

FTF = UTRTRU = U2

and

FTF = U*TR*TR*U* = U*2 ,

thus U2 = U*2. By theorem 5, the square roots of U2 and U*2 are unique, so we must

have U = U*. Notice that

0 = F − F = RU − R*U* = (R − R*)U .

Since U is non-singular, U−1 exists. Thus,

0 = (R − R*)UU−1 = R − R* ,

and so R = R*. Thus, the decomposition F = RU is unique. Similarly, one can prove

the existence and uniqueness of the decomposition F = VRhh

, where we temporarily

allow Rhh

to be different from R. The relationship between V, U, R, and Rhh

may be

determined by observing that

F = RU = (RURT)R = VRhh

,

where RURT is symmetric and positive definite. Since the decomposition of F is

unique, we conclude that

V = RURT , Rhh

= R , and U = RTVR . (25)

- 118 -

Extremal properties of quadratic forms.

Consider the symmetric tensor T and the quadratic form

Q(v) = v . Tv = Tijvjvi . (26)

We wish to know the extreme values of Q(v) subject to the constraint that v be a unit vector

(i.e. v . v = vi vi = 1). Using the method of Lagrange multipliers, the conditions for the

extremum of (26) are

∂vk

∂hhhh Tijvivj − β(vivi − 1) = 0 , (27)

where β is the undetermined multiplier and v . v − 1 = 0 is the constraint equation. Performing

the differentiation, (27) becomes

Tijδikvj + Tijviδjk −2βviδik = 0

or

Tkjvj + Tikvi − 2βvk = 0 . (28)

Since T is symmetric, (28) becomes

Tkjvj − βvk = 0 . (29)

Therefore, Q(v) attains extreme values when v is a eigenvector of T. Using (2), it is clear that if

m is a unit eigenvector of T with corresponding eigenvalue β, then

Q(m) = m . (Tm) = β . (30)

In summary, the quadratic form obtains its extremum values when it is operating on an eigenvec-

tor. If the eigenvectors are normal, these extremum values are simply the corresponding eigen-

values.

- 119 -

Relationship between eigenvalues of U, C, V, B, E, and D and the principal stretch.

(see class notes sections I/4 and I/5)

The principal directions are the directions in which the stretch attains an extreme value.

The principal stretch λ is the extreme value of stretch in a given principal direction. As derived

in section I/4, the eigenvalues of U and V are equal to the principal values of stretch and the

eigenvalues of C and B are equal to λ2. It was also shown in this section that the eigenvectors of

U and C are coincident and that the eigenvectors of V and B coincide.

The relative strain tensor E is given by

E =21hh(C − I) . (31)

The principal values of strain, denoted by β, are equal to the eigenvalues of E. The principal

directions of strain, denoted by m, are equal to the eigenvectors of E. From (2), we have

Em = βm . (32)

Substituting (31) into (32) gives

21hh(C − I)m = βm ,

or

Cm = (2β + 1)m . (33)

Equation (33) implies that m is an eigenvector of C as well as of E and that 2β + 1 is an eigen-

value of C. As previously noted, the eigenvalue of C is λ2, so

2β + 1 = λ2 or β =21hh(λ2 − 1) . (34)

- 120 -

Thus, the eigenvectors of U , C , and E coincide and the principal value of strain is related to the

principal value of stretch by equation (34).

As was derived in section I/5, the eigenvalues and eigenvectors of the rate of deformation

tensor D are given by

Dm =λλ.hhm =

dtdhhh(ln λ) m , (35)

where m is an eigenvector of D, (m is also called the "principal direction of stretch"). It is clear

from the form of (35) that λ.

/ λ (called the logarithmic rate of stretching) is an eigenvalue of D.

Example calculation

Suppose that at some instant at a particular point in the body, the deformation gradient ten-

sor F is given by

[FiA] =RJJQ01

2

0

2

1

1

0

0 HJJP

. (36)

Notice that (36) yields a symmetric positive definite tensor. Since the polar decomposition of F

is unique, it follows that V = U = F and R = I . The principal invariants of F = U are

I1 = tr U = UAA = 5

I2 =21hh(tr2U − trU2) =

21hh(UAAUBB − UABUBA) = 7 (37)

I3 = det U = 3 .

The characteristic equation of F is then

−β3 + 5β2 − 7β + 3 = 0 . (38)

The roots of (38), i.e. the eigenvalues of F, are

- 121 -

β(1) = 3

β(2) = 1 (39)

β(3) = 1 .

To solve for the eigenvectors of F, substitute each of the eigenvalues into equation (2), or

Fv = βv . (40)

For β(1) = 3, equation (40) gives

(F − 3I)M(1) = 0 ,

or

− M1(1) + M

2(1) = 0 and −2M

3(1) = 0 .

Thus, the eigenvector M(1) is given by

[Mi(1)] = c1

RJJQ01

1 HJJP

,

where c1 is an arbitrary constant. Similarly, for β(2) = β(3) = 1, equation (40) gives

(F − I)M(α) = 0

where α = 2,3, or

M1(α) = −M

2(α) .

Solving for M(α) gives

[Mi(2)] = c2

RJJQ 0

1

−1 HJJP

and [Mi(3)] = c3

RJJQ10

0 HJJP

.

We may take unit eigenvectors

- 122 -

[Mi(1)]=

√dd2

1hhhRJJQ01

1 HJJP

, [Mi(2)] =

√dd2

1hhhRJJQ 0

1

−1 HJJP

, [Mi(3)] =

RJJQ10

0 HJJP

,

which form a right-handed triad. Note that every vector in the space spanned by M(2) and M(3)

is also an eigenvector with eigenvalue 1.

One can better understand the physical significance of these results by looking at the effect

of the deformation (36) upon a unit cube. The graph below gives the cross section of the cube

before and after the deformation. As expected from the preceding calculations, the principal

directions (i.e. the directions in which the stretch attains extremum values) are in the directions

of the eigenvectors M(1) , M(2) , and M(3). Line elements along M(1) are stretched to three

times the initial length. Thus, the stretch λ = 3 is equal to β(1), as expected. Line elements in the

directions M(2) and M(3) retain their initial length. Therefore, the stretch λ in these directions

equals unity, corresponding to β(2) = β(3) = 1. In this example, line elements along principal

directions undergo pure stretch as a result of the deformation; however, line elements in other

directions may undergo both stretch and rotation. This characteristic is true in general whenever

the deformation F is of the pure stretch type (i.e. whenever R = I).

- 123 -

Supplement to Part I, Section 5

Material line elements, area elements, volume elements and their material time derivatives.

Preliminary results

Claim:

det A =61hh εijkεlmnAliAmjAnk (1)

Proof: Consider an arbitrary tensor A with components Aij. Let am = Am1, bm = Am2, and

cm = Am3. Recall that

a . (b × c) = εlmnalbmcn = εlmnAl1Am2An3 = det

RJJQA13

A12

A11

A23

A22

A21

A33

A32

A31HJJP

= det AT ≡ A . (2)

Now define Tijk by

Tijk = εlmnAliAmjAnk (3)

where by (2), T123 = A. Since l, m, and n are dummy indices, we may switch them and use

the properties of εlmn to show that

Tijk = Tjki = Tkij = − Tikj = − Tkji = − Tjik . (4)

Recalling the properties of εijk, we can conclude from (4) that Tijk must be a multiple of εijk.

>From (2) and (4), we see that

T231 = T312 = T123 = A

T321 = T213 = T132 = − A

Tijk = 0 if i = j , j = k , or i = k .

Hence, it is clear that

Tijk = A εijk . (5)

- 124 -

Equate (3) and (5) and multiply the resulting equation by61hh εijk, noting that εijkεijk = 6, to

get

A = det AT =61hh εijkεlmnAliAmjAnk . (6)

Since det AT = det A for any tensor A, we have proved equation (1).

Claim:

∂A∂(det A)hhhhhhhh = (det A) A−T (7)

Proof: Assume that the components of A are independent, so that we can write

∂Ars

∂Aijhhhhh = δirδjs . (8)

Now differentiate (6) with respect to Ars to obtain

∂Ars

∂(det A)hhhhhhhh =∂Ars

∂Ahhhhh =61hh εijkεlmn δrlδsiAmjAnk + AliδrmδsjAnk + AliAmjδrnδsk

=61hhεsjkεrmnAmjAnk + εiskεlrnAliAnk + εijsεlmrAliAmj

=61hh3εsjkεrmnAmjAnk =

21hhεsjkεrmnAmjAnk . (9)

Suppose now that A is non-singular (det A ≠ 0), so that there exists a tensor B with com-

ponents Bij such that AliBir = δlr. If this is the case, then (9) may be rewritten using (3) and

(5) as

∂Ars

∂Ahhhhh =21hh εsjk(εlmnδrl)AmjAnk

=21hh εsjkεlmn(AliBir)AmjAnk

=21hh εsjk(εlmnAliAmjAnk)Bir

=21hh εsjkεijkABir =

21hh(2ABsr) = ABsr . (10)

- 125 -

Denoting B as A−T, equation (10) becomes

∂A∂(det A)hhhhhhhh = A A−T = (det A) A−T ,

which is identical to (7).

Material line elements

Consider a motion of a body B given by x = χ (X , t), where X is the position vector of a

certain particle in the reference configuration and x is the position vector of the same particle in

the current configuration at time t. Two neighboring material particles in the reference

configuration are located at X and X + dX. Consider a material line elementiiiiiiiiiiiiiiiii dX in the reference

configuration at the point X and having length dS and direction M, so that dX = dS M. Under

the motion χ(X , t), the material particles X and X + dX are taken to the positions x and x + dx.

The corresponding material line element dx in the current configuration having length ds and

direction m is related to dX by

dx = F dX , or dxi = FiA dXA = xi,AdXA , (11)

where FiA = xi,A are the components of the deformation gradient F. A diagram of this motion of

the material line segment is given below. Recall from Section I/4 of the class notes that the

square of the length of a material line element in the current configuration is given by

ds2 = xi,Axi,BdXAdXB . (12)

Recalling that xi,A

hhh.= vi,jxj,A, we can take the material derivative of (12) to get

ds2hhh.

= xi,A

hhh.xi,BdXAdXB + xi,Axi,B

hhh.dXAdXB

= vi,jxj,Axi,BdXAdXB + xi,Avi,jxj,BdXAdXB

= (vi,j + vj,i)xi,Axj,BdXAdXB

= 2dijxi,Axj,BdXAdXB . (13)

- 126 -

This last step uses the definition of the rate of deformation tensor D =21hh(L + LT). Equation (13)

may be rewritten using (11) as

ds2hhh.

= 2dijdxidxj . (14)

Material area elements

Let 1dX and 2dX represent two material line elements in the reference configuration located

at X. Suppose that the motion χ (X ,t) takes 1dX, 2dX into the elements 1dx, 2dx, respectively, at

x in the current configuration. Define the material area elementiiiiiiiiiiiiiiiiii dA with orientation N in the

reference configuration by

N dA = 1dX × 2dX (15)

so that

dAL = NLdA = N dA . eL = (1dX × 2dX) . eL = εLMN1dXM

2dXN . (16)

The corresponding material area element in the current configuration is da with orientation n

(see Figure 2) defined by

n da = 1dx × 2dx , (17)

so that

ni da = n da . ei = (1dx × 2dx) . ei (18)

>From (18) and (11) and noting that xm,DXD,i = δmi, we can write

ni da = εijkxj,Axk,B1dXA

2dXB = εmjkδmixj,Axk,B1dXA

2dXB

= εmjk(xm,DXD,i)xj,Axk,B1dXA

2dXB

= εmjkxm,Dxj,Axk,BXD,i1dXA

2dXB .

Using (3), (5), and (16), this expression becomes

ni da = εDAB(det F)XD,i1dXA

2dXB

- 127 -

or

ni da = J XD,i ND dA . (19)

Assuming that F is non-singular and using (7) and (19), we can write

ni dahhhh.

= J.

XD,i ND dA + J XD,i

hhhh.ND dA . (20)

Notice that

J.

=dtdJhhh =

∂FjA

∂Jhhhhh∂t

∂FjAhhhhh = J (F−1)Aj vj,kFkA = J vj,kδkj = J vj,j . (21)

Using (21), equation (20) becomes

ni dahhhh.

= (J XD,ivj,j − J XD,jvj,i)ND dA

or

ni dahhhh.

= J(vj,jXD,i − vj,iXD,j)ND dA . (22)

Material volume elements

Let 1dX, 2dX, and 3dX be three material line elements forming a right handed system

located at X in the reference configuration. Suppose that the motion χ (X , t) takes 1dX, 2dX,

and 3dX into three line elements 1dx, 2dx, and 3dx, respectively, at x in the current configuration

(see Figure 3). Define the material volume elementiiiiiiiiiiiiiiiiiiii dV in the reference configuration by

dV = 1dX . (2dX × 3dX) = εABC1dXA

2dXB3dXC . (23)

The corresponding material volume element in the current configuration is then

dv = 1dx . (2dx × 3dx) = εijk1dxi

2dxj3dxk . (24)

Using equations (3), (5), (11), and (23), we can write (24) as

dv = εijkxi,Axj,Bxk,C1dXA

2dXB3dXC = εABC(det F) 1dXA

2dXB3dXC

- 128 -

= J(εABC1dXA

2dXB3dXC)

- 129 -

or

dv = J dV . (25)

Taking the material derivative of (25) gives

dvhh.

= J.

dV = J vj,jdV = J(div v)dV , (26)

where equation (21) is used for the calculation of J..

A special case of the above analysis occurs for isochoric motionsiiiiiiiiiiiiiii (i.e. motions for which

dv = dV for all volumes occupied by any material region of the body). It is clear from this

definition that dvhh.

= 0 for isochoric motions. We see from equations (25) and (26) that the neces-

sary and sufficient condition for isochoric motion may be expressed either as

J = 1 or vi,i = div v = 0 . (27)

Summary

A summary of the basic equations derived in this appendix are given in the following:

Material line, area, and volume elements:

ds2 = xi,Axi,BdXAdXB

ni da = J XD,i ND dA

dv = J dV

Material derivatives of line, area, and volume elements:

ds2hhh.

= 2 dijxi,Axj,BdXAdXB

dai

hhh.= J(vj,jXD,i − vj,iXD,j)dAD

dvhh.

= J vj,jdV .

- 130 -

Supplement to Part I, Section 7

Interpretation of infinitesimal strain measures.

Recall the following expressions:

ds2 = dxidxi = xi,Axi,BdXAdXB

dS2 = dXAdXA = XA,iXA,jdxidxj

EAB =21hh(CAB − δAB) =

21hh(UA,B + UB,A + UM,AUM,B)

eij =21hh(δij − cij) =

21hh(ui,j + uj,i − um,ium,j) . (1)

Also recall that the stretch λ and the extension E are defined by

λ =dSdshhh = √ddddddddCABMAMB = √dddddddddddd1 + 2 EABMAMB (2a)

= 1/√dddddcijmimj = 1/√ddddddddd1 − 2eijmimj

and

E =dS

ds − dShhhhhhh = λ − 1 , (2b)

where

MAMA = mimi = 1 and dXA = MAdS , dxi = mids . (3)

For an infinitesimal deformation (see class notes Part I/7), we may approximate EAB by neglect-

ing term of O(ε2) as ε → 0 as

EAB =21hh(UA,B + UB,A) = O(ε) as ε → 0 . (4)

The distinction between Lagrangian and Eulerian strain disappears for an infinitesimal deforma-

tion. Thus,

- 131 -

eij = EABδiAδjB = O(ε) as ε → 0 . (5)

Hence, for an infinitesimal deformation the extension and the stretch given in (2) may be written

as

E = λ − 1 = EABMAMB + O(ε2) = eijmimj + O(ε2) (6)

and

λ = √dddddddddddd1 + 2EABMAMB = 1 + EABMAMB + O(ε2) (7a)

or, alternatively,

λ = 1/√ddddddddd1 − 2eijmimj = 1 + eijmimj + O(ε2) . (7b)

In the following discussion, we will use e11, e22, ..., to denote the components of the infinitesimal

strain tensor, since from (5) no distinction needs to be made between Lagrangian and Eulerian

strain as ε → 0. Consider a line element which lies along the X1 axis, i.e.

dX1 = dS , dX2 = dX3 = 0 , and MA = (1, 0, 0) ,

so that from (6)

E =dS

ds − dShhhhhhh = e11 . (8)

That is, e11 represents the extension (the change in length divided by the original length) of a

line element lying along the X1 axis. We note that to within O(ε2), the direction mi in the current

configuration coincides with the direction MA in the reference configuration. Similar results

hold for e22 and e33. In summary: the diagonal elements of the infinitesimal strain tensor

represent the extension of line elements in the corresponding coordinate directions.

Consider two orthogonal elements which initially lie along the X1 and X2 axes, so that

dX1 = dS , dX2 = dX3 = 0 , MA = (1, 0, 0) (9)

dXhh

2 = dSh

, dXhh

1 = dXhh

3 = 0 , Mhh

A = (0, 1, 0) .

- 132 -

Let θ denote the angle between the corresponding deformed line elements dx and dxh, whose

lengths are ds and dsh. >From (8), neglecting terms of O(ε2) as ε → 0, we have

ds = (1 + e11)dS and dsh

= (1 + e22)dSh

. (10)

Now observe that

cos θ =| dx | | dx

h|

dx . dxh

hhhhhhhhhh =ds ds

hdxi dx

hihhhhhh = xi,Axi,B

ds dsh

dXAdXhh

Bhhhhhhhh

= CABds ds

hdXAdX

hhBhhhhhhhh = CAB

(1 + e11)(1 + e22)

MAMhh

Bhhhhhhhhhhhhhhh

= C12/(1 + e11)(1 + e22) = 2e12/(1 + e11 + e22) + O(ε2)

= 2e12 + O(ε2) , (11)

where we have used (1)3, (5), (9), and (10) in deriving (11).

Let φ be the amount θ differs from a right angle (i.e. φ = −θ + π/2), so that

cos θ = sin φ = φ + O(φ3) . (12)

>From (11) and (12), we see, after neglecting terms of O(ε2), that

φ = 2e12 or e12 =2φhh . (13)

The angle φ is known as the sheariiiii angleiiiii. Similar results hold for e13 and e23. Hence: the off-

diagonal components of the infinitesimal strain tensor are seen to represent half of the change in

angle between two line elements initially along the corresponding coordinate axes.

Recall that dv=JdV, where J = det(xi,A). In the class notes Section I/7, it was shown that

neglecting terms of O(ε2) as ε → 0 gives

J = 1 + ekk . (14)

- 133 -

Thus, dv = JdV = dV + ekkdV, or

dVdv − dVhhhhhhhh = ekk . (15)

The invariant ekk is known as the dilationiiiiiii. Equation (15) implies that the trace of the

infinitesimal strain tensor measures the local change in volume per unit volume.

- 134 -

Supplement to Part III, Section 2

Isotropic tensors.

Under a rotation of coordinate system, recall that the components of a vector change

according to the rule

xi = aimx′m or x′m = ajmxj , (1)

where xi are the components of a vector x with respect to one coordinate system and x′m are the

components of the same vector with respect to a different coordinate system. The coefficients

aim are components of an orthogonal tensor such that

aijaik = ajiaki = δjk and det(aij) = ± 1 . (2)

Also recall that the components Tij...k of a tensor T transform according to the rule

Tij...k = aipajq. . . akrT′pq...r , (3)

where aij’s obey equation (2).

Definition: A tensor T is called isotropiciiiiiiii if its components are invariant to rotation of the coor-

dinate system. Thus, if T is isotropic, then

Tij...k = T′ij...k ,

or from (3),

Tij...k = aipajq. . . akrTpq...r (4)

for all orthogonal aij.

Theorem 1: A scalar invariant is an isotropic tensor of order zero.

Proof: Let φ = φ(xi) be a scalar invariant, so that from (3) φ(xi) = φ(x′i) = φ′. Since φ = φ′, the

scalar invariant φ satisfies the definition of an isotropic tensor given by (4).

Theorem 2: The only isotropic tensor of order one is the zero vector.

- 135 -

Proof: If ti are the components of an isotropic vector t, then from (4)

ti = ajitj (5)

for all components aji of orthogonal tensors. Consider, for instance, the choice

RQaji

HP =

RJJQ 0

0

−1

0

−1

0

1

0

0 HJJP

.

With this choice for aji, (5) becomes (t1 , t2 , t3) = (−t1 , −t2 , t3), which implies that t1 = 0

and t2 = 0. Now consider the choice

RQaji

HP =

RJJQ00

1

0

−1

0

−1

0

0 HJJP

.

for which equation (5) becomes (t1 , t2 , t3) = (t1 , −t2 , −t3), and so we must have t3 = 0

also. Thus, if t is an isotropic tensor of order one, then t = 0.

Theorem 3: Every isotropic tensor of order two is a scalar multiple of δij.

Proof: Let Tij be the components of an isotropic tensor T, so that from (4)

Tij = aipajqTpq (6)

for all orthogonal tensor components aij. Consider now the following choices for aij and the

resulting forms of equation (6):

RQaij

HP =

RJJQ 0

−1

0

1

0

0

0

0

−1 HJJP

so that

RJJJQT31

T21

T11

T32

T22

T12

T33

T23

T13HJJJP

=

RJJJQ−T23

T13

T33

−T21

T11

T31

T22

−T12

−T32HJJJP

,

and

RQaij

HP =

RJJQ01

0

−1

0

0

0

0

−1 HJJP

so that

RJJJQT31

T21

T11

T32

T22

T12

T33

T23

T13HJJJP

=

RJJJQ T23

−T13

T33

−T21

T11

−T31

T22

−T12

T32HJJJP

. (7)

- 136 -

In order for (7)1 and (7)2 to both be satisfied, we must have

T11 = T22 = T33 and T12 = T21 = T13 = T31 = T23 = T32 = 0 ,

or

Tij = λδij , (8)

where

λ = T11 = T22 = T33 .

Theorem 4: Every isotropic tensor of order three is a scalar multiple of the permutation tensor

εijk.

Proof: Let Tijk be an isotropic third order tensor, so that from (4) we can write

Tijk = aipajqakrTpqr (9)

for all orthogonal aij. We can represent Tijk as

RQTijk

HP =

RJJJQT311

T211

T111

T312

T212

T112

T313

T213|

T113|

T321

T221

T121

T322

T222

T122

T323

T223|

T123|

T331

T231

T131

T332

T232

T132

T333

T233

T133HJJJP

.

Consider the following choices for aij and the resulting forms of (9):

RQaij

HP=

RJJQ 0

−1

0

1

0

0

0

0

−1 HJJP

then RQTijk

HP=

RJJJQ T233

−T133

−T333

T231

−T131

−T331

−T232

T132|

T332|

T213

−T113

−T313

T211

−T111

−T311

−T212

T112|

T312|

−T223

T123

T323

−T221

T121

T321

T222

−T122

−T322HJJJP

RQaij

HP=

RJJQ01

0

1

0

0

0

0

1 HJJP

then RQTijk

HP=

RJJJQ T233

T133

T333

T231

T131

T331

T232

T132|

T332|

T213

T113

T313

T211

T111

T311

T212

T112|

T312|

T223

T123

T323

T221

T121

T321

T222

T122

T322HJJJP

RQaij

HP=

RJJQ−1

0

0

0

1

0

0

0

1 HJJP

then RQTijk

HP=

RJJJQ−T133

T233

T333

−T132

T232

T332

T131

−T231|

−T331|

−T123

T223

T323

−T122

T222

T322

T121

−T221|

−T321|

T113

−T213

−T313

T112

−T212

−T312

−T111

T211

T311HJJJP

.

(10)

- 137 -

>From equation (10), we must have

T333 = T331 = T332 = T111 = T112

= T122 = T133 = T211 = T233 = T311 = T322 = T121 = T131 = T212 == T232 = T313 = T323 = 0

and

T123 = T231 = T312 = −T321 = −T132 = −T213 ≡ λ ,

thus we can write

Tijk = λεijk . (11)

Sufficiency may also be shown by substituting equation (11) into equation (3), giving

aipajqakrTpqr = aipajqakrλεpqr = λ(det apq)εijk

which is equivalent to equation (9).

Theorem 5: The most general fourth order isotropic tensor has the form

Tijkl = λδijδkl + µδikδjl + γδilδjk .

Proof: Let Tijkl be the components of an isotropic tensor, so that from (4)

Tijkl = aipajqakralsTpqrs (12)

for all orthogonal aij. We proceed as in the previous proofs, choosing aij to be

RQaij

HP =

RJJQ 0

0

−1

0

−1

0

1

0

0 HJJP

,

RJJQ00

1

0

−1

0

−1

0

0 HJJP

,

RJJQ01

0

1

0

0

0

0

1 HJJP

,

RJJQ10

0

0

1

0

0

0

−1 HJJP

.

Using the above choices for aij in equation (12), we can show that the non-zero values of

Tijkl are related as follows:

T1111 = T2222 = T3333

T1122 = T2211 = T1133 = T3311 = T3322 = T2233 = λT1212 = T2121 = T1313 = T3131 = T2323 = T3232 = µT1221 = T2112 = T1331 = T3113 = T2332 = T3223 = γ . (13)

- 138 -

Now consider the choice

RQaij

HP =

RJJQ 0

−1/√dd21/√dd2

0

1/√dd21/√dd2

1

0

0 HJJP

.

If we substitute this value for aij into (12) and set i = j = k = l = 1, we find that

T1111 =41hh(T1111 + T2211 + T2121 + T1221 + T2112 + T1212 + T1122 + T2222) ,

or using (13),

T1111 = λ + µ + γ . (14)

>From the restriction (13) and (14), we see that Tijkl can be represented as

Tijkl = λδijδkl + µδikδjl + γδilδjk . (15)

Sufficiency may be shown by substituting the expression for Tijkl given by equation (15)

into equation (4), yielding

aipajqakralsTpqrs = aipajqakrals(λδpqδrs + µδprδqs + γδpsδqr)

= λaipajpakralr + µaipajqakpalq + γaipajqakqalp

= λδijδkl + µδikδjl + γδilδjk = Tijkl

which is identical to equation (12). If an isotropic fourth order tensor is restricted to have

the symmetry

Tijkl = Tjikl (16)

(i.e. symmetry in the first two indices), then from equation (13) we see that γ = µ. Equation

(15) may thus be written for this special case as

Tijkl = λδijδkl + µ(δikδjl + δilδjk) . (17)

Equation (17) is the most general form for an isotropic fourth order tensor symmetric in i

and j. Notice that the representation given in equation (17) also possesses the additional

symmetries given by

- 139 -

Tijkl = Tijlk = Tklij . (18)

- 140 -

Supplement to Part III, Section 4

Material symmetry of elastic solids (general considerations).

For a homogeneous elastic solid, the constitutive equation of interest expresses the strain

energy per unit mass as a function of some measure of strain. Thus, let the strain energy per unit

mass ψ be given by

ψ = ψ(EAB) . (1)

We examine the behavior of (1) under a change of coordinates in the reference configuration

given by

XP = APQX′Q ,

where

APQ = EP. E′Q and APM AQM = AMP AMQ = δPQ . (2)

Under the transformation (2), EAB becomes

EPQ = APM AQN E′MN , (3)

where E′MN are the components of the strain tensor referred to the base vectors e′A. Since ψ is a

scalar invariant, we know that

ψ′ = ψ . (4)

With the use of (1) and (3), the strain energy function after the transformation (2) can be written

as

ψ′ = ψ(APM AQN E′MN) , (5a)

where as before

ψ = ψ(EPQ) . (5b)

Substituting (5) into (4), we conclude that

ψ(EPQ) = ψ(APM AQN E′MN) (6)

- 141 -

where ψhh

is in general a different function than ψ. It is clear from equation (6) that the response

of an elastic solid depends, in general, on the coordinates used to describe the reference

configuration. The symmetry of a homogeneous elastic solid is characterized by the set of

orthogonal coordinate transformations which leave the strain energy function form-invariant;

that is, the set of APQ such that ψhh

is the same as ψ, or such that

ψ(EPQ) = ψ(E′PQ) . (7)

If equation (7) holds for all possible orthogonal coordinate transformations (i.e. for all orthogo-

nal tensors APQ), then the material is said to be isotropiciiiiiiii.

Linear elastic solids

It was shown in the class notes that, in the linear theory, the strain energy function for a

homogeneous elastic solid may be expressed in the form

ρoψ =21hh CABMNEABEMN , (8)

where CABMN is a fourth order tensor of material constants which has, in general, 81 indepen-

dent components. Since EAB is a symmetric tensor, the coefficients CABMN in (8) will have the

obvious symmetries

CABMN = CBAMN = CABNM . (9)

In addition, recalling that the strain energy is a scalar invariant we may write (after changing the

dummies AB into MN and MN into AB)

ρoψ =21hh CABMNEABEMN =

21hh CMNABEMNEAB . (10)

It then follows that CABMN must by symmetric in the pair of indices (AB) and (MN), i.e.,

CABMN = CMNAB . (11)

In summary, the fourth order tensor in (8) must satisfy the symmetries

CABMN = CBAMN = CABNM = CMNAB . (12)

- 142 -

The symmetry of CABMN shown in (12) allows us to reduce the number of independent com-

ponents of CABMN from 81 to 21. The remaining independent components are

C1111

C2222

C1122

C3333

C2233

C1133

C2323

C3323

C2223

C1123

C1313

C2313

C3313

C2213

C1113

C1212 .

C1312

C2312

C3312

C2212

C1112

(13)

Linear elastic solids with symmetry

As mentioned previously, the symmetry of an elastic solid is characterized by the set of

orthogonal tensors APQ such that (7) holds. Substituting (8) into (7), we find

ρoψ =21hh CABMNEABEMN =

21hh CABMNE′ABE′MN . (14)

Using (3), (14) can be written as

CABMNAAPABQAMRANSE′PQE′RS = CABMNE′ABE′MN . (15)

Switching dummy indices, (15) becomes

(CABMN − CPQRSAPAAQBARMASN)E′ABE′MN = 0 . (16)

Since (16) must hold for all E′AB and since the quantity in parenthesis in (16) is independent of

E′AB, we may conclude that

CABMN = CPQRSAPAAQBARMASN (17)

for all orthogonal APQ which lie in the set which characterizes the symmetry of the material.

Isotropic linear elastic solids

If the material is isotropic, then equation (17) must hold for all orthogonal APQ. Hence,

CABMN in an isotropic tensor which from (12) is symmetric in the first two indices. >From the

analysis given in the Appendix to Section 2 of Part III, we can represent CABMN as

- 143 -

CABMN = λδABδMN + µ(δAMδBN + δANδBM) . (18)

Eulerian strain

In the linear theory, there is no distinction between Lagrangian and Eulerian strain as is

clear from the expression

EAB = δAiδBjeij + O(ε2) .

Substituting this into (8), we may write

ρoψ =21hh (CABMNδAiδBjδMkδNl)eijekl + O(ε2) . (19)

Now define

Cijkl = CABMNδAiδBjδMkδNl , (20)

so that (19) becomes

ρoψ =21hh Cijkl eij ekl (21)

If the material is isotropic, then from (18)

Cijkl = λδijδkl + µ(δikδjl + δil δjk) . (22)

We will now consider materials with various symmetries and analyze the effect of the different

symmetries on CABMN.

Case I(a) - Symmetry with respect to the X1−X2 plane

Consider a homogeneous elastic solid which in the reference configuration has material

symmetry only with respect to the X1−X2 plane. For a material with such a symmetry, the

mechanical behavior in the X3 direction is the same as that in the −X3 direction (dentoed by X′3

in Figure 1). The coordinate transformation characterizing this particular type of symmetry is

represented schematically in Figure 1 and can be expressed algebraically as

X1 = X′1 , X2 = X′2 , X3

- 144 -

for which APQ is given by

- 145 -

RQAPQ

HP =

RJJQ00

1

0

1

0

−1

0

0 HJJP

. (23)

Thus, a coordinate transformation which reverses the direction of the X3 axis does not alter the

strain energy function. We will show that the number of material constants needed to describe

the behavior of a material with the above symmetry is reduced from 21 to 13.

For the symmetry under consideration, equation (17) must hold for the value of APQ given

in (23), but necessarily for other choices of APQ. Substituting (23) into (17) gives the relation-

ships

C1212 = C1212

C2313 = C2313

C3323 = −C3323

C2223 = −C2223

C1113 = −C1113

C1111 = C1111

C2312 = −C2312

C3313 = −C3313

C2213 = −C2213

C1112 = C1112

C1122 = C1122

C1313 = C1313

C3312 = C3312

C2212 = C2212

C2222 = C2222

C1133 = C1133

C1312 = −C1312

C2323 = C2323

C3333 = C3333

C2233 = C2233

C1123 = −C1123

which implies that

0 = C1123 = C1113 = C2223 = C2213 = C3323 = C3313 = C1312 = C2312 .

The remaining material constants from the set (13) are

C1111

C2222

C1122

C3333

C2233

C1133

C2323

0

0

0

C1313

C2313

0

0

0

C1212 .

0

0

C3312

C2212

C1112

(24)

The material properties of an elastic solid with material symmetry with respect to the X1 − X2

plane are characterized by the 13 constants in (24).

- 146 -

Case I(b) - Symmetry with respect to the X2−X3 plane

Consider now a solid that has material symmetry with respect to the X2−X3 plane. For such

a material, the mechanical behavior in the X1 direction is the same as the corresponding

behavior in the −X1 direction (denoted by X1 in Figure 2). The coordinate transformation for

which (17) holds, represented schematically in Figure 2, is given by

X1 = −X′1 , X2 = X′2 , X3 = X′3 . (25)

Since the material has symmetry with respect to the X2−X3 plane, a coordinate transformation

which reverses the direction of the X1 axis (such as that in (25)) will not alter the strain energy

function.

Case II - Orthotropy

A material which is symmetric with respect to two orthogonal planes is called orthotropic.iiiiiiiiii

Consider a homogenous elastic solid which has the material symmetry properties discussed in

I(a) as well as the symmetry properties discussed in I(b) (i.e. a material which is symmetric with

respect to both the X1−X2 plane and the X2−X3 plane). We will show that the number of

material constants needed to describe the mechanical behavior of a material with the above sym-

metries is 9. For the coordinate transformation given in equation (25), APQ is given by

RQAPQ

HP =

RJJQ 0

0

−1

0

1

0

1

0

0 HJJP

.

Substituting this choice for APQ into (17), we see that the 13 constants in (24) are related by

C1111 = C1111 ,

C2222 = C2222 ,

C1122 = C1122 ,

C1313 = C1313 ,

C2323 = C2323 ,

C3333 = C3333 ,

C2233 = C2233 ,

C1133 = C1133 ,

C1212 = C1212 ,

C2313 = −C2313

C3312 = −C3312

C2212 = −C2212

C1112 = −C1112

- 147 -

so that

0 = C1112 = C2212 = C3312 = C2313 .

The remaining independent constants are

C1111

C2222

C1122

C3333

C2233

C1133

C2323

0

0

0

C1313

0

0

0

0

C1212 .

0

0

0

0

0

(26)

The material properties of a linear elastic solid with material symmetry with respect to the

X1−X2 plane and with respect to the X2−X3 plane are thus characterized by the 9 constants given

in (26). Notice that by comparing (17) and (26), one can see that a material with the two sym-

metries described above is also symmetric with respect to a third plane which is orthogonal with

respect to both the original planes, i.e. the X1−X3 plane.

Case III - Transverse isotropy

A homogeneous orthotropic solid that also has material symmetry in every direction lying

in a certain plane (say, the X1−X2 plane) is said to be transversely isotropic.iiiiiiiiiiiiiiiiii For such a material,

a coordinate transformation which alters the X1 and X2 axes by an arbitrary rotation about the X3

axis will not alter the strain energy function. This property implies that equation (17) must hold

for a transformation of the type (also represented schematically in Figure 3)

X′1 = X1 cos α + X2 sin α

X′2 = −X1 sin α + X2 cos α (27)

X′3 = X3 ,

where the angle α is arbitrary. We will show that the number of material constants needed to

describe the mechanical behavior of a material with the above symmetry is 5. The components

- 148 -

of the orthogonal tensor APQ corresponding to (27) are

RQAPQ

HP =

RJQ 0sin αcos α

0cos α

− sin α

100 HJP

. (28)

Substituting (28) and (26), since the material is also orthotropic, into (17) gives the relationships

C1111 = C2222 , C1133 = C2233 , C1313 = C2323

C1212 =21hh(C1111 − C1122) .

The remaining independent compenents are

C1111

C1111

C1122

C3333

C1133

C1133

C1313

0

0

0

C1313

0

0

0

0

21hh(C1111 − C1122) .

0

0

0

0

0

(29)

The material properties of a linear elastic solid which is transversely isotropic are thus character-

ized by the five constants given in (29).

Case IV - Isotropic material

A homogeneous elastic solid which has material symmetry in the reference configuration

with respect to all directions is said to be isotropic.iiiiiiii Thus, equation (17) will remain valid for any

proper orthogonal choice of APQ. We will now derive equation (18) using the symmetries which

characterize an isotropic material. We will thus show that only two material constants are

needed to describe the mechanical behavior of a linear elastic isotropic solid.

An isotropic material can be considered as a material transversely isotropic in three mutu-

ally orthogonal planes (such as the X1−X2, X1−X3, and X2−X3 planes). Thus, a coordinate

transformation which alters the X2 and X3 axes by an arbitrary rotation about the X1 axis will not

- 149 -

alter the strain energy function (see Figure 4a). The same is true with respect to an arbitrary

rotation of the X1 and X3 axes about the X2 axes (see Figure 4b).

Thus, equation (17) must remain valid under the transformations

X′1 = X1

X′2 = X2 cos φ + X3 sin φ (30)

X′3 = −X2 sin φ + X3 cos φ

and

X′1 = X1 cos β − X3 sin βX′2 = X2 (31)

X′3 = X3 cos β + X1 sin β ,

where φ and β are arbitrary angles, in addition to the transformation (27). For the coordinate

transformation in (30), APQ is given by

RQAPQ

HP =

RJJQ00

1

sin φcos φ

0

cos φ−sin φ

0 HJJP

(32)

and for the coordinate transformation in (31), APQ is given by

RQAPQ

HP =

RJJQ−sin β

0

cos β

0

1

0

cos β0

sin β HJJP

. (33)

>From (17), (29), (32), and (33), we can show the relationships

C1122 = C1133 , C1111 = C3333

C1212 = C1313 =21hh(C1111 − C1122) .

The remaining independent constants are given by

C1111

C1111

C1122

C1111

C1122

C1122

µ

0

0

0

µ0

0

0

0

µ0

0

0

0

0

(34)

- 150 -

where µ =21hh(C1111 − C1122). Letting λ = C1122, equation (34) may be expressed as

λ + 2µλ + 2µ

λ

λ + 2µλλ

µ0

0

0

µ0

0

0

0

µ .

0

0

0

0

0

(35)

An equivalent way of writing (35) is given simply

CABMN = λδABδMN + µ(δAMδBN + δANδBM) . (36)

Notice that (36) is the same as equation (18) obtained previously.

- 151 -

Appendix L: Some Results from Linear Algebra

The object of this appendix is to introduce the concept of a tensor in a form which is espe-

cially useful in continuum mechanics. Thus, after a rapid review and summary of basic results

from linear algebra, a tensor is regarded as a linear transformation. Many aspects of the algebra

and the calculus of tensors are subsequently discussed in coordinate-free form and frequently are

represented with reference to rectangular Cartesian basis vectors.

1. Sets.

We use the symbol X to represent a set, class, collection or family which contains elements

or objects x1,x2,.... We write this as

X = x1,x2,.... (1.1)

The element x is a member of X, or belongs to X, or is in X, and this is denoted by

x ∈ X. (1.2)

Let X and Y be two sets. The set X is a subset of Y if every x in X is also in Y. This rela-

tion is designated by

X ⊆ Y. (1.3)

Also Y is said to include or contain X, and we write Y ⊇ X. The empty set is contained in every

set and is designated by ∅. If

X ⊆ Y and Y ⊆ X, (1.4)

then the set Y and X are said to coincide or be equal and we write

Y = X. (1.5)

- 152 -

If X is a subset of Y but without X being equal to Y, i.e., X ⊂ Y, then X is a proper subset

of Y and we write1

X ⊂ Y , Y ⊃ X. (1.6)

It should be clear that X ⊂ Y means that (i) if x ∈ X, then x ∈ Y and that (ii) there exists a y ∈

Y such that y ∈ X.

The union of two sets X and Y comprises all the elements of the two sets and is denoted by

X ∪ Y. (1.7)

The intersection of two sets consists only of those elements which belong to both X and Y and is

denoted by

X ∩ Y. (1.8)

The Cartesian product of two sets X and Y is the set

X × Y = (x,y) : x ∈ X, y ∈ Y (1.9)

of ordered pairs. For example, if X = x1,x2,x3 and Y = y1,y2, then X × Y =

(x1,y1),(x1,y2),(x2,y1),(x2,y2),(x3,y1),(x3,y2) .

For completeness, we introduce now the following definition of a function: A function f

from the set X to the set Y is a rule which assigns to every element of X a unique element of Y.

It is often displayed as

f : X → Y , x |→ y . (1.10)

hhhhhhhhhhhhhhh1 Our use of the symbols ⊆ and ⊂ is analogous to the use of the symbols ≤ and < to denote weak and strict ine-

qualities.

- 153 -

The first of (1.10) indicates that the set X is transformed into Y, while the second of (1.10) indi-

cates that the element x goes to the element y. The function f, or the rule defined by (1.10)1, is

sometimes called an operation or a transformation or a mapping. This definition of a function

rules out multivalued functions.

- 154 -

2. Vector spaces.

Consider a set V of arbitrary elements u,v,w,...,etc., and admit the following two operations

labeled as (1) and (2):

(1) Let there exist an operation, denoted by "+", which to every pair u,v assigns a unique

element u + v in V and which has the properties:

(A1) u + v = v + u (commutative property).

(A2) u + (v + w) = (u + v) + w (associative property).

(A3) There exists a zero vector o such that

v + o = v .

(A4) For every vector v there is a corresponding negative vector (−v)

such that

v + (−v) = o.

It is customary to write v + (-w) = v - w.

(2) Let there exist another operation, indicated by placing a juxtaposition of an element of

V and a real number α , which to every element v assigns a unique element αv in V and which

has the following properties:

(S1) 1v = v.

(S2) α(βv) = (αβ)v (associative property).

(S3) (α+β)v = αv + βv (distributive property for scalar addition).

(S4) α(v+w) = αv + αw (distributive property for vector addition).

- 155 -

We then say that V is a vector space over the field of real numbers and that u,v,w,... are

vectors in V. The first operation defined under (1) above is callled vector addition with elements

u + v called the sum of u and v. The second operation defined under (2) is called multiplication

of vectors by a real number.

All the usual algebraic results can be derived from the above axioms. For example, a vec-

tor equation of the form αu = o has the solution α = 0 or u = o.

Vector sub-spaces. A sub-space of a vector space V is any nonempty subset U of V which

is such that if u and v belong to U and α is any real number, then the vectors u + v and αv also

belong to U.

- 156 -

3. n-Dimensional vector spaces.

Let v1,v2,..,vp be any p vectors in a vector space V, p being some integer. The sumi=1Σp

αi vi

is called a linear combination of the p vectors vi with coefficients αi as real numbers. These vec-

tors are said to form a linearly independent set of order p if the only coefficients that satisfy the

equation

α1v1 + α2v2 + . . . + αpvp = o (3.1)

are α1 = α2 = α3 = . . . = αp = 0. A linearly dependent set is one which is not linearly

independent.

Note that every subset of vectors containing the zero vector is linearly dependent.

α1v1 + α2v2 + . . . + 1 o + αpvp = 0.

Consider the set of all systems of linearly independent vectors in the vector space V. There

are two possibilities: Either (1) there exist linearly independent systems of arbitrarily large order

p; then the vector space is said to be an infinite-dimensional space, or (2) the order of the linearly

independent sytstem is bounded. In the second case, there exists an integer n such that the order

p ≤ n and there exist linearly independent systems of order n but not of n + 1. The vector space

V is said to be a finite-dimensional vector space. The number n is termed the dimension of the

vector space and we shall use V n to designate finite n-dimensional vector space; and henceforth

in this Appendix we shall be concerned with finite dimensional vector spaces only. Let

g1,g2,..,gn be any such system of order n in a finite dimensional vector space V n ; it will be

called a basis of V n . Thus, we have the following definition:

A basis in a vector space V n is any linearly independent set of n vectors.

Let v be any vector in V n. The set of n + 1 vectors v,g1,g2,..,gn is necessarily linearly

dependent, so there exists n + 1 numbers λ,α1,α2,..,αn not all zero such that

- 157 -

λv + α1g1 + α2g2 + . . . + αngn = o. (3.2)

If v = o, then αigi = o, (no sum on i), and hence αi = 0. For v ≠ o, the number λ must

be different from zero, for otherwise the system gi (i = 1,2,..,n) will not be linearly independent.

Equation (3.2) can, therefore, be solved for v, and there exist numbers v1,v2,...,vn such that

v = v1g1 + v2g2 + . . . + vngn . (3.3)

Thus, the vector v is expressible as a linear combination of the gi . This combination is unique;

for otherwise, there would exist another, and their difference would constitute a linear combina-

tion of the gi equal to the null vector o and with coefficients not all zero. The numbers

(v1,v2,..,vn) are called the components of v with respect to the basis g1,g2,..,gn . It is con-

venient to replace (3.3) by the shorter notation

v = vkgk (3.4)

in which the summation convention is used. Whenever an index appears twice in the same term,

a summation is implied over all terms by letting that index assume all its possible values, unless

the contrary is stated. Normally the convention applies to an index which appears once as a sub-

script and once as a superscript; but, as will become apparent later, in a special case it will

suffice to adopt the convention for repeated subscripts. It is convenient to recall here a theorem

the proof of which can be easily found in a book of linear algebra2. A statement of the theorem

is as follows:

For a set of vectors to constitute a basis in V n it is necessary and sufficient that every vector

in V n can be expressed in one, and only one, way as a linear combination of the vectors of that

set.

hhhhhhhhhhhhhhh2 See a standard book on linear algebra.

- 158 -

4. Euclidean vector spaces.

Consider a vector space of dimension n defined over the field of real numbers. Suppose

there is an operation, denoted by " . ", which to every pair of vectors u and v assigns a real

number, denoted by u . v, and which has the following properties:

(I1) u . v = v . u .

(I2) u . (v+w) = u . v + u . w .

(I3) (αu) . v = u . (αv) = α(u . v) .

(I4) u . u ≥ 0 and u . u = 0 => u = o .

The above operation (known as a rule of composition) is called the dot product or the scalar pro-

duct or the inner product of two vectors. A vector space V n obeying the rules of composition

(I1) to (I4) is a real inner product space and is called a Euclidean vector space3 E n .

The magnitude or the norm of the vector v is defined by

c c v c c = (v.v)1⁄2 . (4.1)

If c c v c c = 1, the vector is said to be normalized. By (I4) c c v c c 2 is always positive if v ≠

o and it vanishes when v = o , so that its square root is real.

The norms of two vectors and the magnitude of their scalar product satisfy the Schwarz ine-

quality

hhhhhhhhhhhhhhh3 This terminology stems from the fact that once a scalar product is admitted, all the standard theorems of

Euclid’s geometry can be established by an appeal to the properties of an inner product space.

- 159 -

c u . v c ≤ c c u c c c c v c c . (4.2)

To prove this, consider the vector αu + v where α is an arbitrary real number. Then,

c c (αu + v) c c 2 = α2 c c u c c 2 + 2αu . v + c c v c c 2 .

The left-hand side of the last expression is positive or zero for all real α and this implies

(u . v)2 ≤ c c u c c 2 c c v c c 2 ,

which is equivalent to (4.2). In view of (4.2), we define the angle between two nonzero vectors

u and v by

cos θ =c c u c c c c v c c

u . vhhhhhhhhh , (4.3)

where 0 ≤ θ ≤ π.

Vectors u and v are said to be orthogonal if

u . v = 0 . (4.4)

A set of vectors in E n is said to be orthonormal if all the vectors in the set are normalized

and mutually orthogonal. Let ei be a system of orthonormal basis in E n . It follows that ei must

satisfy the conditions

ei. ek = δik , (4.5)

where δik (i,k = 1,2,...,n) is the Kronecker delta defined by

δik =IKL1

0

if

if

i

i

=

≠

k

k

.

,(4.6)

Every orthonormal set is necessarily linearly independent. An infinity of bases exist which are

orthonormal; and in this appendix a typical orthonormal basis will be denoted by ei . Any vector

- 160 -

v in E n, when referred to the basis ei, can be expressed in the form

v =i=1Σn

viei . (4.7)

Then, by taking the scalar product of both sides of (4.7) with ek , we have

vk = v . ek . (4.8)

The numbers vk are the components of v with respect to the orthonormal basis ek and the square

of the magnitude of v is

c c v c c 2 = v . v =i=1Σn

vivi . (4.9)

- 161 -

5. Points.

We denote points by boldface letters x, y, z, etc. Given any two points x and y , we associ-

ate with them a point difference y − x which is a vector v in E n and which we denote by

v = y − x (5.1)

satisfying the following rules:

(P1) (y − x) + (x − z) = (y − z) .

(P2) Given any point x and any vector v, there is a unique point y such

that y − x = v .

The point y determined in rule (P2) is denoted by

y = x + v = v + x . (5.2)

In view of (5.1) and (5.2), it is meaningful to speak of the difference of two points which is

a vector, and of the sum of a point and a vector , which is a point. In particular,

x − x = o (5.3)

holds for every point x .

The distance between two points x,y is

c x − y c = (x − y) . (x − y)1/2 . (5.4)

- 162 -

The set of all points associated with a Euclidean vector space E n is called Euclidean Point

Space.4

hhhhhhhhhhhhhhh4 Euclidean vector and point spaces are abstract concepts which at this point in our discussion are unrelated to or-

dinary vectors and points of elementary geometry. Geometrical representations of these abstract concepts will bediscussed in Section 6.

- 163 -

6. Geometric space.

Consider the space of elementary geometry. This space is composed of elements P,Q,R,

called points. Choose O to be a reference point or origin. The directed line segments

OP→

, OQ→

,... may be associated with points x,y,... of Euclidean point space (Section 5) and the

directed line segments PQ→

, QR→

may be associated with vectors u,v,... in Euclidean vector space

(Section 4). Thus we set

x = OP→

, y = OQ→

, z = OR→

,... (6.1)

and

u = PQ→

, v = QR→

,... o = PP→

. (6.2)

The properties (A1)−(A4) of abstract vectors and the properties (P1),(P2) of points have their

immediate geometrical representation in our geometric space. Thus, the class of all line seg-

ments with the same direction and length as PQ→

, called a geometrical vector or free vector,

represent vectors, and directed segments such as OP→

, called position vectors, represent points.

For example, we have the properties

PQ→ = −QP

→, PQ

→ + QR→ = PR

→.

Next, the properties (S1)−(S4) have their geometric representation in our space of elemen-

tary geometry. Multiplication of a vector by a scalar corresponds to multiplication of a line ele-

ment PQ→

by a scalar α and is represented by a line element of length c α c c PQ c , where c PQ c is

the length of PQ→

in the same or opposite direction as PQ→

according to whether α is positive or

negative.

The rules (I1)−(I4) of Section 4 have their representation in the geometrical space. Recal-

ling also (4.3), these rules are represented by the scalar product of two geometrical vectors as

PQ→ . RS

→ = c PQ c c RS c cos θ , (6.3)

- 164 -

where θ is the angle between PQ→

and RS→

. Thus, geometrical vectors give a geometrical

representation of a Euclidean point space with its associated Euclidean vector space. Geometri-

cal space has three dimensions.

Given a geometrical space, we select an origin O and a (constant) basis fi of the associ-

ated vector space. These are said to constitute a coordinate system consisting of origin and a

basis (O,fi) for our geometrical space. Any point x referred to the basis fi can be expressed in

terms of its components xh

i by the formula

x =i=1Σ3

xhifi , (6.4)

where now summation is over i = 1,2,3. The components xhi are called rectilinear coordinates of

x. If (fi) is identified with a constant orthonormal basis ei, then referred to ei the point x can be

represented as

x =i=1Σ3

xiei (6.5)

and xi are called Cartesian rectilinear coordinates of x.

We may select a different basis gi (say) at each point x in our geometric space so that

gi = gi(x). This situation arises when we consider curvilinear coordinates in geometric space.

- 165 -

7. Indicial notation.

- 166 -

8. Linear transformation. Tensors.

A linear transformation or tensor T is an operation which assigns to each vector v in V n

another vector Tv in V m such that5

T(v+w) = Tv + Tw , T(αv) = αTv (8.1)

for all v,w in V n. Thus, we have T: V n → V m .

We denote the set of all possible tensors which transform vectors in V n linearly into vec-

tors in V m by L(V n,V m) and we define the sums of two tensors and the scalar multiples of ten-

sors by

(T+S)v = Tv + Sv , (αT)v = α(Tv) . (8.2)

The transformations T + S and αT defined in (8.2) obey the rules (8.1), i.e., they are tensors.

Also the sums and scalar multiples defined in L(V n,V m) obey the rules (A1)−(A4) and (S1)−(S4).

Hence the set of all tensors constitutes a vector space. The zero element in L(V n,V m), i.e., the

element which is to be substituted for o in rules (A3) and (A4) is the transformation which

assigns the zero vector to every vector. We call this transformation the zero tensor and denote it

by O. In other words, the zero tensor O is such that

Ov = o (8.3)

for all vectors v.

The identity tensor can only be defined for transformations from V n to V n. Thus, when the

dimension m = n, we define the identity or the unit tensor I by

Iv = v (8.4)

hhhhhhhhhhhhhhh5 This special definition is particularly useful in the present Appendix. A more general definition of a tensor is

given in Appendix C.

- 167 -

for all vectors v in V n.

We associate with Euclidean vector spaces E n and E m a vector space of dimensions nm

denoted by E n ×b E m called the tensor product space of E n and E m, and defined in the follow-

ing way: If a ∈ E n and b ∈ E m, then a ×b b is an element of E n ×b E m such that

(a ×b b)v = a(b . v) (8.5)

for all vectors v ∈ E m. It is seen that a ×b b, which sometimes is denoted as ab, obeys the rules

(8.1) and hence is a tensor, i.e., it assigns to each vector v ∈ E m the vector a(b . v) ∈ E n.

Sums and scalar multiples of such tensors can then be calculated by the rules laid down for ten-

sors in (8.2). Not all tensors generated in this way can be expressed as an element of space of

the form a ×b b. The zero tensor in E n ×b E n is defined by the tensor o ×b o.

Consider now the tensor product space E n ×b E n of dimension n2. Let ei be an orthonormal

basis in Euclidean vector space E n. Then,

ei ×b ek (8.6)

is a basis in E n ×b E n. To prove this, suppose αik are a set of n2 numbers such that

αikei ×b ek = O .

Then,

(αikei ×b ek)er = Oer = o .

With the help of (8.5) this last equation becomes

αikeiδkr = αirei = o .

Since ei is a basis in E n it follows that

- 168 -

αik = 0 .

Hence ei ×b ek are linearly independent and form a basis in E n ×b E n. When u and v are vectors

in E n so that

u = uiei , v = viei ,

then

uv = u ×b v = uivkei ×b ek . (8.7)

Let T be a tensor in L(E n,E n) and let tik denote the components of the vector Tek with respect

to the basis ei in E n so that

Tek = tikei , tik = Tek. ei . (8.8)

Associated with each tensor T in L(E n,E n) we have the ordered array of n2 scalars tik which

combine by the rules (A1)−(A4) and (S1)−(S4) for vector spaces. Hence, L(E n,E n) is a vector

space of dimension n2. But, ei ×b ek is a linear transformation in the space L(E n,E n) and hence

also forms a basis in L(E n,E n) which is identical to E n ×b E n. Moreover, with the use of (8.8),

for any vector v ∈ E n we have

(T−tikei ×b ek)v = Tekvk − tikeivk = o .

Hence, we conclude that

T = tikei ×b ek . (8.9)

The n2 numbers tik in (8.9) are called the components (or Cartesian components) of T with

respect to the basis ei ×b ek .

Similarly, we write

I = δikei ×b ek (8.10)

- 169 -

so that

Iv = v (8.11)

for all vectors v in E n. The tensor I with components δik is the unit tensor in E n ×b E n or

L(E n,E n).

The system of numbers tik in (8.8) is usually regarded as an array of numbers, a matrix,

having n rows and n columns and denoted by [tik] or tik.

Let T be a tensor in E n ×b E n. The transpose of a tensor T is a tensor TT defined by

w . Tv = v . TTw (8.12)

for all vectors v and w in E n. The transpose TT satisfies (8.1) and is a tensor, and the opera-

tion which associates TT with T is called transposition. From (8.12) it may be shown that

(TT)T = T , (T+S)T = TT + ST , (αT)T = αTT , (8.13)

so that transposition is a linear operation. Further, it follows from (8.9) and (8.12) that

[tik] = ei. Tek = ek

. TTei = [tik]T ,

where [tik]T are components of the tensor TT.

A tensor T is said to be symmetric if

TT = T . (8.14)

Also, by (8.9) and (8.14) we have

tki = tik . (8.15)

Also, from (8.12) and (8.14) it follows that if T is symmetric, then

w . Tv = v . Tw (8.16)

- 170 -

for every v,w in E n and conversely.

A tensor T is said to be skew or anti-symmetric (skew- symmetric) if

TT = −T (8.17)

or

tki = −tik . (8.18)

Every tensor T in E n ×b E n can be written as

T = TS + TA , (8.19)

where

TS =21hh (T+TT) − (TS)T , TA =

21hh (T−TT) = −(TA)T . (8.20)

- 171 -

9. Multiplication of tensors.

Let T and S be two tensors in L(V n,V n). The product TS of two tensors T and S is the

composition of T and S , i.e., TS defined by the requirement that

(TS)v = T(Sv) (9.1)

hold for all vectors v in V n. It is seen that TS satisfies (8.1) and is a tensor in L(V n,V n). The

following rules may be established

(M1) (TS)R = T(SR) ,

(M2) T(R+S) = TR + TS ,

(M3) (R+S)T = RT+ST ,

(M4) α(TS) = (αT)S = T(αS) ,

(M5) IT = TI = T ,

where T,S,R are tensors in L(V n,V n).

For tensors T,S in L(E n,E n) it follows from (6.9) and (8.1) that

(TS)v = T(smkvkem) = timsmkvkei

for all vectors v ∈ E n so that

TS = timsmkei ×b ek (9.2)

and timsmk are the components of TS with respect to the basis ei ×b ej. If tik and sik are

matrix representations of the tensors T and S, respectively, then

tiksik = timsmk . (9.3)

- 172 -

Note that

(TS)T = STTT . (9.4)

The commutative law is not, in general, satisfied, i.e., TS is not the same as ST.

The truth of the following formulae can be readily verified:

(a ×b b)T = b ×b a ,

T(a ×b b) = Ta ×b b ,

(9.5)

a ×b (TTb) = (a ×b b)T ,

(a ×b b)(c ×b d) = (b . c)a ×b d .

The tensor T is invertible if, for every choice of w, the equation w = Tv can be solved

uniquely for the vector v. If T is invertible, then v is unique and we write v = T−1w. The

transformation T−1 obeys the rules (8.1) and is a tensor called the inverse of T. If the inverse

T−1 exists, then

(TT−1−I)w = o , (T−1T−I)v = o

for every choice of w and every choice of v so that

TT−1 = T−1T = I , (T−1)−1 = T . (9.6)

If S and T are invertible tensors in L(V n,V n) and if w = (TS)v, it follows that

Sv = T−1w , v = S−1T−1w .

Hence, the product TS is invertible and

(TS)−1 = S−1T−1 . (9.7)

- 173 -

If the only solution of the equation Tv = o is v = o, then T is nonsingular; otherwise it

is singular. If T is invertible, then T is nonsingular. It can be shown that a tensor T is invertible

or nonsingular if and only if det T ≠ 0.

A tensor Q is orthogonal if it is invertible and if the inverse coincides with its transpose,

i.e.,

Q−1 = QT or QQT = QTQ = I . (9.8)

A tensor Q is orthogonal if and only if it preserves inner products in the sense that

(Qu) . (Qv) = u . v (9.9)

for every vector u,v in E n.

The tensor T is positive semidefinite if it is symmetric and if v . Tv ≥ o for all vectors v in

E n. If v . Tv > o when v ≠ o, then T is positive definite.

An operation tr which assigns to each tensor T in E n ×b E n a number tr T is a trace if

tr(S+T) = tr S + tr T , tr(αT) = α tr T , (9.10)

tr a ×b b = a . b , (9.11)

for all vectors a,b in E n and all scalars α. The tr operation is a linear function and the follow-

ing consequences are clear:

tr T = tikei. ek = tii . (9.12)

Also,

tr TT = tr T , tr(TS) = tr(ST) ,

(9.13)

tr I = n , tr TS = tr T , tr TA = 0 ,

- 174 -

where n in (9.12)3 is the dimension of the space.

If one defines the inner product of two tensors T,S to be the number

T . S = tr(TST) , (9.14)

then the rules (I1)−(I5) are satisfied. Hence with tr(STT) as the inner product, the space of all

tensors is an inner product space. We define the magnitude of T as

| T | = √ddddddtr(TTT) . (9.15)

- 175 -

10. Change of basis.

Let ei and eh

i be two arbitrary orthonormal bases in E n. Each vector in one basis may be

expressed in terms of those of the other basis by a nonsingular transformation. Thus

eh

i = Aei = akiek ,

(10.1)

ei = Ahh

eh

i = ah

kieh

k , Ahh

= A−1 ,

where

eh

k. Aei = δik = ei

. Ahh

eh

k ,

(10.2)

ah

ki = ei. eh

k = aik , Ahh

= AT .

Hence, A−1 = AT and A is orthogonal.

If v is an arbitary vector and

v = viei = vh

ieh

i , (10.3)

then the components of v in the two bases ei and eh

i are related by the equations

vh

i = akivk , vi = aikvh

k . (10.4)

Also

v . v = vivi = vh

ivh

i , u . v = uivi = uh

ivh

i , (10.5)

and the scalar product has the same form in terms of the components ui,vi as it has in terms of

the components uh

i,vh

i and is called a scalar invariant.

Again, if T is an arbitrary tensor in E n ×b E n and

T = tikei ×b ek = th

ikeh

i ×b eh

k , (10.6)

- 176 -

then the components of T in the two bases ei and eh

i are related by

th

ik = ariasktrs , tik = airaksth

rs . (10.7)

If T and S are two tensors in E n ×b E n and

S = sikei ×b ek = sh

ikeh

i ×b eh

k ,

then

T .S = tirsir = th

irsh

ir ,

(10.8)

T . T = tirtir = th

irth

ir

are scalar invariant forms.

- 177 -

11. Point, vector and tensor functions.

The distance between two points x,y is defined in (6.4). The distance vanishes if and only

if x = y. This idea of distance can be used to define limits, convergence, continuity, etc. For

example, we say that

n→∞lim xn = x if

n→∞lim | xn−x | = O . (11.1)

Suppose z(t) is a function of a real variable t called a point function. The derivative z.(t),

if it exists, on an open subset of the real numbers is defined by

z.(t) =

dtdzhhh =

∆t→∞lim

∆tz(t+∆t) − z(t)hhhhhhhhhhhh . (11.2)

Such a derivative is a vector-valued function.

In a similar way we may define limits, derivatives, etc., of vector- and tensor-valued func-

tions v,T of t using the appropriate definitions (1.2) and (9.15) for distance. The derivative of a

vector-valued function of t is a vector function and the derivative of a tensor-valued function is a

tensor function. The usual rules of differential calculus are easily extended so as to apply to

point, vector or tensor functions. For example, if T = T(t) and S = S(t) are tensors, then

TShhh.

= T.S + TS

., (11.3)

where, of course, the correct order must be maintained for the tensors.

If Q = Q(t) is an orthogonal tensor function, then from (9.8) and (11.3),

Q.

QT + QQThhh.

= O . (11.4)

Since Q.

T = QThhh

it follows that

Ω = Q.

QT = − Q Q.

T (11.5)

- 178 -

is a skew tensor.

If φ(x) is a scalar function of x defined on some open region U of Euclidean point space,

then φ is differentiable on U if there is a vector field w(x) on U such that

x→oxlim

| x−ox |

| φ(x) − φ(ox) − w(ox) . (x − ox) |hhhhhhhhhhhhhhhhhhhhhhhhhhh = 0 (11.6)

for everyox in U. If this is so, w is unique. We call it the gradient of φ and write

w = grad φ(x) =∂x∂φhhh . (11.7)

Alternatively, the gradient of φ(x) may be defined by means of

β→0lim

βφ(x+βu) − φ(x)hhhhhhhhhhhhh =

dβdhhh φ(x + βu) c

c β=0

=∂x∂φhhh . u = w . u (11.8)

for all vectors u ∈ E n. It may be shown that this definition of gradient is equivalent to that

given in (11.6).

Sometimes we use the symbol ∇ to denote the gradient operator:

∇ =∂x∂hhh = er ∂xr

∂hhhh (11.9)

if x = xrer and er is an orthonormal basis. From (11.6) it follows that

w = grad φ(x) =∂x∂φhhh = ∇φ = ei ∂xi

∂φhhhh . (11.10)

If v(x) is a vector function of points x in some open region U of Euclidean space, then

v(x) is said to be a vector field. The gradient of a vector field v(x) is a tensor field T(x)

defined by

- 179 -

x→oxlim

| x−ox |

| v(x) − v(ox) − T(ox)(x−ox) |hhhhhhhhhhhhhhhhhhhhhhhhhh = o (11.11)

for every ox in U. We write this as

T = grad v(x) =∂x∂vhhh . (11.12)

If v = viei, then it follows from (11.12) that

T =∂xk

∂vihhhhei ×b ek (11.13)

so that∂xk

∂vihhhh are the components of the grad v with respect to the orthonormal basis ei. It can

be shown that

grad v(x)Tc = gradc.v(x) (11.14)

for every constant vector c. Equation (11.14) could be taken as the defining equation for grad

v(x).

The divergence of the vector field v(x) is a scalar field defined by

div v(x) = trgrad v(x) = ∇ . v = ei.

∂xi

∂vhhhh =∂xi

∂vihhhh . (11.15)

The divergence of a tensor field T(x) is the vector field div T(x) for which

c . div T(x) = divTT(x)c (11.16)

for every constant vector c. When

T = tijei ×b ej , (11.17)

- 180 -

it follows that

div T(x) =∂xj

∂tijhhhh ei . (11.18)

If P is a bounded region whose boundary ∂P is sufficiently well-behaved, then by applica-

tion of the divergence theorem (Green’s theorem) to a vector v and a tensor T we have

∫P

div v(x)dV(x) = ∫∂P

v(x) . n(x)dA(x) , (11.19)

∫P

div T(x)dV(x) = ∫∂P

T(x)n(x)dA(x) , (11.20)

where n is the outward unit normal to ∂P, the integrations are over the volume P and the boun-

dary surface ∂P and dV and dA represent elements of volume and area, respectively. In com-

ponent form, (11.19) and (11.20) read:

∫P

vi,i dv = ∫∂P

vi ni da , (11.21)

∫P

tij,j dv = ∫∂P

tij nj da . (11.22)

- 181 -

12. Vector product. Axial vectors. The curl operator.

Consider a Euclidean vector space E3 of three dimensions. If u,v,w are any vectors in E3 ,

the vector product of u and v is a vector in E3 denoted by u × v and defined by the properties:

(V1) w.(u×v) = u.(v×w) = v.(w×u) and is a scalar denoted by

[uvw] and called the scalar triple product ,

(V2) u × v = −v × u ,

(V3) | u×v | = | u | | v | sin θ ,

where θ is defined in (4.3). From (V1) and (V2), it follows that

u × u = o , u.(u×v) = 0 , v.(u×v) = 0 (12.1)

so that u × v is orthogonal to both u and v. It can be shown that three nonzero vectors u,v,w

are linearly independent if and only if their scalar triple product [u,v,w] = u.(v×w) = (u×v).w.

The following properties may be deduced from the definitions:

(αu) × v = α(u×v) = u × (αv) ,

u × (v+w) = u × v + u × w ,

(u+v) × w = u × w + v × w , (12.2)

u × (v×w) = v(u.w) − w(u.v) = (v ×b w − w ×b v)u ,

[u + x,v,w] = [uvw] + [xvw] .

The above definition of vector product does not restrict ei to be either right-handed or left-

handed basis.

- 182 -

Let e1,e2,e3 be an orthonormal system. Then, in view of (V3) and (12.1),

e1 × e2 = ± e3 . (12.3)

A right-handed orthonormal sytem is one for which

e1 × e2 = e3 (12.4)

and a left-handed orthonormal system corresponds to the choice of the - sign in (12.3). For a

right-handed orthonormal system it follows that

[e1e2e3] = 1 , ei × ej = eijkek , eijk = [eiejek] , (12.5)

where the components eijk are known as the permutation symbol or the components of the alter-

nating tensor. Also, for any two vectors

u = uiei , v = vjej ,

(12.6)

u × v = eijkuivjek .

Let TA be a skew tensor in E3 so that

TA = tikei ×b ek =21hh tik[ei ×b ek − ek ×b ei] . (12.7)

Hence, if u is an arbitrary vector in E3

TAu =21hh tik[ei(u.ek) − ek(u.ei)] =

21hh u × [tikei × ek] , (12.8)

if we use (12.2)4. Hence

TAu = tA × u , (12.9)

- 183 -

where the vector

tA =21hh tkiei × ek . (12.10)

Let ei be a right-handed orthonormal basis. Then, referred to ei, TA reads

TA = tijei ×b ej (12.11)

and the axial vector tA with components tkA assumes the form

tA =21hh tjieijkek = tk

A ek . (12.12)

Alternatively

tji = eijktkA . (12.13)

If

TA =∂x∂vhhh − (

∂x∂vhhh)T , (12.14)

the corresponding axial vector tA is called the curl of v and written as

tA = curl v . (12.15)

In a right-handed orthonormal system

curl v = eijk ∂xi

∂vjhhhh ek . (12.16)

- 184 -

References for Appendix L

Gel’fand, I. M. 1961 Lectures on linear algebra (translated by A. Shenitzer from the 2nd Rus-sian edition). Dover, New York.

Halmos, P. R. 1974 Finite dimensional vector spaces. Springer-Verlag, New York.

Hoffman, K. & Kunze, R. 1971 Linear algebra (2nd edition). Prentice-Hall, Inc., EnglewoodCliffs, New Jersey.

Stecher, M. 1988 Linear algebra. Harper & Row, New York.

Documents

ME 185 P. M. NAGHDI’S NOTES ON CONTINUUM MECHANICS · university of california at berkeley department of mechanical engineering me 185 p. m. naghdi’s notes on continuum mechanics