MATH143 Calculuspower/MATH143notes-1.pdf · Chain, product and quotient rules. Second-order and higher derivatives. Stationary points and their classiﬁcation. Parametric, implicit

MATH143 Calculus

Lecturer: Dr Alexander Belton ([email protected]) Office: Fylde B12

Syllabus

Differentiation

Introduction to differentiation using graphical methods. Differentiation from firstprinciples (up to third-order polynomials only). Basic differentiation using formulae.Chain, product and quotient rules. Second-order and higher derivatives. Stationarypoints and their classification. Parametric, implicit and logarithmic differentiation.

Integration

The definite integral (as signed area). Indefinite integration (as the process inverseto differentiation). Integration of trigonometric functions using product and double-angle formulae. Integration of rational functions by completing the square and bypartial fractions. Integration by parts and by substitution. Applications: arc lengthof a plane curve; area and centroid of a plane region; surface area, volume and centreof mass of a solid of revolution.

Numerical methods

The trapezium rule. Simpson’s rule (the derivation is not examinable). The Newton-Raphson method.

Further topics

Taylor and Maclaurin series; estimation of integrals. L’Hopital’s rule.

Recommended text books

Either Engineering Mathematics by A. Croft, R. Davison and M. Hargreaves, thirdedition, 2001, Prentice Hall, ISBN 0 130 26858 5, £39·99.Or Modern Engineering Mathematics by G. James, fourth edition, 2008, PearsonEducation Ltd, ISBN 978-0-13-239144-3, £37·99.

Lectures and class exercises

The notes for this course are complete and, together with the workshop questionsand assessed exercises, provide all the information you need – however, you shouldannotate them with your own notes taken during lectures. Examples are given inthe notes but further practice will be gained by working through the class exercises,given on separate sheets, during lecture time. You are expected to attend lecturesso that you can participate in and learn from the solution of these additionalexercises.

1

Learning outcomes

At the end of this module, students should be able to:

(a) use the notation of differentiation and integration correctly;

(b) differentiate simple functions, including powers, the exponential function,trigonometric functions and their inverses;

(c) differentiate sums, products and quotients of functions;

(d) differentiate composite functions;

(e) differentiate implicitly and use this method to differentiate functions ofthe form f(x) = g(x)h(x);

(f) differentiate functions related through a parameter;

(g) integrate basic functions and evaluate these integrals;

(h) integrate simple trigonometric functions using product and double-angleformulae;

(i) integrate rational functions using partial fractions or by completing thesquare;

(j) integrate functions of the formaf ′(x)

f(x)and af ′(x)f(x);

(k) integrate functions using the ‘by parts’ technique;

(l) integrate functions ‘by substitution’, using a change of variable;

(m) recognise the integration technique required for a given function;

(n) find lengths, areas, volumes, centres of mass and similar quantities byusing integration techniques;

(o) find Taylor series and Maclaurin series for functions;

(p) estimate the values of integrals using the trapezium and Simpson’s rulesand using Taylor series;

(q) find numerical solutions of equations using the Newton-Raphson method;

(r) use L’Hopital’s rule to determine limits.

2

Assessment

20% weekly coursework

40% end-of-module progress test

40% end-of-year examination

You will receive the weekly coursework assignment from your tutor at the Fridayworkshop. These workshops run from 11am until 1pm, with the first hour beingcompulsory for all students.

Please note that MATH143 coursework must be put in the first-yearcoursework pigeonhole no later than 12 NOON WEDNESDAY.

This will be then be given to your tutor. Work received later than this time will notnormally be accepted. Under no circumstances can coursework be accepted after themodel answers are given out at the following workshop. As stated in the first-yearhandbook, if a student has submitted less than 80% of the coursework required, heor she will be deemed not to be in good standing with his or her tutor.

Progress test

The progress test will take place on Thursday in Week 15 in normal lecture time.If a different room is to be used, you will be informed during lectures in Week 15 andthe information will be put on the LUVLE webpage. It is the student’s responsibilityto arrive promptly for this test. If you are ill, a medical certificate must be submittedto the Teaching Office (Engineering A38). If written confirmation of the reasonfor your absence is not received, you will be given a mark of zero.

Examination

The examination will take place towards the end of the Summer Term. You will beinformed by Student Registry, in writing, of the examination times and places. It isthe student’s responsibility to attend these examinations on time unless preventedby ill health.

3

1 Differentiation 51.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51.2 Limits of functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51.3 Rules for the manipulation of limits . . . . . . . . . . . . . . . . . . . 51.4 Continuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71.5 Rules for manipulating continuous functions . . . . . . . . . . . . . . 81.6 Basic ideas of differentiation . . . . . . . . . . . . . . . . . . . . . . . 81.7 Differentiation from first principles . . . . . . . . . . . . . . . . . . . 101.8 Rules for differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . 131.9 Second and higher derivatives . . . . . . . . . . . . . . . . . . . . . . 171.10 Stationary points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181.11 Parametric differentiation . . . . . . . . . . . . . . . . . . . . . . . . 191.12 Implicit differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . 211.13 Logarithmic differentiation . . . . . . . . . . . . . . . . . . . . . . . . 22

2 Integration 242.1 Definite integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242.2 Indefinite integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262.3 The fundamental theorem of calculus . . . . . . . . . . . . . . . . . . 272.4 Elementary integration . . . . . . . . . . . . . . . . . . . . . . . . . . 292.5 Trigonometric identities . . . . . . . . . . . . . . . . . . . . . . . . . 302.6 Partial fractions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 312.7 Completing the square . . . . . . . . . . . . . . . . . . . . . . . . . . 322.8 Piecewise-standard functions . . . . . . . . . . . . . . . . . . . . . . . 332.9 Further methods of integration . . . . . . . . . . . . . . . . . . . . . 332.10 Integration by substitution . . . . . . . . . . . . . . . . . . . . . . . . 362.11 Applications of integration . . . . . . . . . . . . . . . . . . . . . . . . 40

3 Numerical methods 483.1 The trapezium rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . 483.2 Simpson’s rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 513.3 The Newton-Raphson method . . . . . . . . . . . . . . . . . . . . . . 55

4 Further topics 584.1 Taylor’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 584.2 L’Hopital’s rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 624.3 The modified Newton-Raphson method . . . . . . . . . . . . . . . . . 64

A The derivation of Simpson’s rule 66

4

1 Differentiation

1.1 Introduction

Calculus is used in many situations in order to analyse varying quantities. It involvestwo basic operations, differentiation and integration, which are opposite sides of thesame coin. The Greeks developed the rudimentary ideas of integration, Archimedesusing his ‘method of exhaustion’ to obtain an exact formula for the area of a circle.Differentiation was probably first used by Fermat in the 17th century, to determinethe maxima and minima of certain functions. The connection between the twotechniques was developed by Newton and Leibniz, working independently of eachother, as a way of dealing with change and motion; it is the notation developed byLeibniz that is used today. Calculus has many applications throughout science andengineering and provides an essential language: for example, the laws of physics areformulated used calculus.

1.2 Limits of functions

Limits have already appeared in MATH142 for sequences. As these ideas will beused to develop some of the basic formulas for differentiation, we will revisit thistopic briefly.

Definition

A function f(x) is said to approach (or tend to) a limit l as x approaches the value aif we can make the value of f(x) as close as we please to l by taking x sufficientlyclose to a. This is written as

limx→a

f(x) = l or f(x) → l as x→ a.

Note that we do not require that f(a) = l or even that f(a) exists.

1.3 Rules for the manipulation of limits

If f(x) → l and g(x) → m as x→ a then

(i) f(x) + g(x) → l +m as x→ a,

(ii) f(x)g(x) → lm as x→ a

and (iii) if m 6= 0 thenf(x)

g(x)→ l

mas x→ a.

5

Examples

1. Evaluate limx→0

3x2 + 5x3 − 2x4

2x2 + x3.

The rules for limits show that the denominator 2x2 + x3 → 0 as x → 0, sorule (iii) cannot be applied directly to this quotient. However, if we divide thenumerator and denominator by the lowest power of x which appears (here, x2)then, as x→ 0, the rules for limits apply and

limx→0

3x2 + 5x3 − 2x4

2x2 + x3= lim

x→0

3 + 5x− 2x2

2 + x=

3 + 0− 0

2 + 0=

3

2.

2. Evaluate limx→∞

3x2 + 4x− 2

5x2 − x+ 6.

Here, neither the numerator nor denominator converge as x → ∞; they bothgrow without limit. However, if we divide the numerator and denominator bythe highest power of x which appears then, as x → ∞, the individual termswill each tend to a limit. In this case, dividing by x2 gives

limx→∞

3x2 + 4x− 2

5x2 − x+ 6= lim

x→∞

3 + 4x− 2

x2

5− 1x+ 6

x2

=3 + 0− 0

5− 0 + 0=

3

5.

3. Evaluate limx→0

sin x

x. (As always, x is in radians.)

The series expansion for sin x is

sin x = x− x3

3!+x5

5!− · · · ,

sosin x

x= 1− x2

3!+x4

5!− · · · → 1

as x → 0. Strictly speaking, this working isn’t covered by the rules for limits,because there are infinitely many terms in the series. However, this argumentcan be made valid; you should remember this limit as a fact.

6

1.4 Continuity

Definition

The function f(x), defined at and around the point a, is said to be continuous at aif f(x) → f(a) as x→ a.

(Informally, the function f(x) is continuous at a if we can draw its graph around awithout lifting the pencil from the paper.)

A function is continuous if it is continuous at every point where it is defined.

Continuous functions include polynomials, such as x3 − 3x + 1, the exponentialfunction and the sine and cosine functions.

The function

f(x) =

{

1/x if x 6= 0,

0 if x = 0

is discontinuous (i.e., not continuous) at x = 0. (See Figure 1.)

–4

–3

–2

–1

0

1

2

3

4

–4 –3 –2 –1 1 2 3 4

x

f(x)

Figure 1: The graph of the function f(x)

7

1.5 Rules for manipulating continuous functions

If f(x) and g(x) are continuous at a then so are

(i) kf(x), where k is a constant,

(ii) the sum f(x) + g(x)

(iii) the product f(x)g(x) and

(iv) the quotient f(x)/g(x), as long as g(a) 6= 0.

If g(x) is continuous at a and f(x) is continuous at g(a) then

(v) the composite function f(g(x)) is continuous at a.

1.6 Basic ideas of differentiation

Differentiation is a means of measuring a ‘rate of change’. This can be the rate ofchange of y with respect to x, i.e., the ‘slope’ of the graph y = f(x), or the rate ofchange of a variable with respect to time, temperature et cetera.

0

1

2

3

4

5

s

1 2 3 4t

Figure 2: s = ut

One way to think about the meaning of differentiation is to consider an object movingin a straight line, with constant velocity ums−1. The distance s metres travelled bythe object in t seconds is given by the formula

s = ut.

The graph of distance s against time t is a straight line (see Figure 2) and u is thegradient of this line.

8

If the velocity varies with time (as in Figure 3) then the average velocity over thedistance travelled between time t0 and time t1 is

s(t1)− s(t0)

t1 − t0,

which is the gradient of the straight line between(

t0, s(t0))

and(

t1, s(t1))

.

0

1

2

3

4

5

6

s

1 2 3 4 5t

(

t0, s(t0))

(

t1, s(t1))

Figure 3: s = f(t)

If we put t1 = t0 + δt and s(t1) = s(t0) + δs then the gradient equals

s(t1)− s(t0)

t1 − t0=δs

δt.

If we now let t1 → t0 (equivalently, δt→ 0) then this quotient tends to the gradientof the tangent line to the graph at the point

(

t0, s(t0))

.

This process of passing from the function s(t) to the gradient of the tangent is calleddifferentiation and it measures the rate of change of s(t) with respect to t. Thegradient function is called the derivative of the function s(t).

9

Definition

The derivative of a function f(x) at the point x, written f ′(x) ordf

dx, is

limδx→0

f(x+ δx)− f(x)

δx= lim

δx→0

δf

δx.

(Informally, the derivative of a function f(x) at the point x is the gradient of thetangent to the curve y = f(x) at the point

(

x, f(x))

.)

This limit will not exist at x if

(a) f(x) is not continuous at x or

(b) the graph of f(x) has a ‘corner’ at x.

(Informally, a function is differentiable at x if its graph is continuous and ‘smooth’at and near to x.) We are not going to dwell on this possibility.

1.7 Differentiation from first principles

Examples

1. If f(x) = c, where c is a constant, then f(x+ δx) = c and

f ′(x) = limδx→0

f(x+ δx)− f(x)

δx= lim

δx→0

c− c

δx= lim

δx→00 = 0.

As the derivative is the gradient of the tangent to the curve, this result shouldseem fairly obvious: the graph is, in this case, a horizontal line. Alternatively,it should be clear that the rate of change of a constant is zero.

2. If f(x) = bx, where b is a constant, then

f(x+ δx) = b(x+ δx) = bx+ b δx

and


f(x+ δx)− f(x)

δx

= limδx→0

bx+ b δx− bx

δx= lim

δx→0

b δx

δx= lim

δx→0b = b.

Again, this makes sense: the equation of a straight line through the origin isy = mx, with m the gradient of that line. The derivative of a function is thegradient of the tangent to the graph of that function, so for a straight line thederivative is a constant, the line’s gradient.

10

3. If f(x) = ax2, where a is a constant, then

f(x+ δx) = a(x+ δx)2 = ax2 + 2ax δx+ a(δx)2

so


f(x+ δx)− f(x)

δx

= limδx→0

ax2 + 2ax δx+ a(δx)2 − ax2

δx

= limδx→0

2ax δx+ a(δx)2

δx= lim

δx→0(2ax+ a δx)

= 2ax.

4. Combining the working from Examples 1, 2 and 3, we can see that

d

dx(ax2 + bx+ c) = 2ax+ b.

In general, if y = xn (where n can be any number, integer or otherwise) then,using the binomial theorem, it may be shown that

dy

dx= nxn−1.

5. Find, from first principles, the derivative of f(x) = ex.

From the definition,


e(x+δx) − ex

δx= lim

δx→0

ex(eδx − 1)

δx.

The series expansion of eδx is

eδx = 1 + δx+(δx)2

2!+

(δx)3

3!+

(δx)4

4!+ · · ·

and therefore

f ′(x) = ex limδx→0

(1 + δx+ (δx)2

2!+ (δx)3

3!+ · · · − 1)

δx

= ex limδx→0

(

1 +δx

2!+

(δx)2

3!+ · · ·

)

= ex.

11

(As for Example 3 in Section 1.3, this last step isn’t properly justified by therules for limits above, but this argument can be made rigorous and gives theright answer.)

6. Find the derivatives of the functions f(x) = sin x and g(x) = cosx from firstprinciples.

Using the trigonometric identity

sinA− sinB = 2 cosA+B

2sin

A− B

2

with A = x+ δx and B = x, we have


sin(x+ δx)− sin x

δx= lim

δx→0

cos(x+ 12δx) sin(1

2δx)

12δx

.

By Example 3 in Section 1.3 (and the fact that cosx is continuous) it followsthat


cos(x+1

2δx) lim

δx→0

sin(12δx)

12δx

= (cosx)× 1 = cos x.

To differentiate the function g(x) = cosx from first principles, we use thetrigonometric identity

cosA− cosB = −2 sinA+B

2sin

A− B

2

with A = x+ δx and B = x, which implies that

g′(x) = limδx→0

cos(x+ δx)− cosx

δx

= limδx→0

− sin(x+ 12δx) sin(1

2δx)

12δx

= − limδx→0

sin(x+1

2δx) lim

δx→0

sin(12δx)

12δx

= − sin x.

Note

There are three ways of thinking of the derivative: (i) using its definition, as a limit;(ii) graphically, as the gradient of the tangent to the graph or (iii) kinematically, asthe rate of change of one variable with respect to another. Each of these should bekept in mind; choosing the most appropriate one may help when solving a problem.

12

1.8 Rules for differentiation

Scalar-multiplication rule

If y = f(x) and k is a constant then

d

dx(ky) = k

dy

dx= kf ′(x).

Sum rule

If y = f(x) and z = g(x) then

d

dx(y + z) =

dy

dx+

dz

dx= f ′(x) + g′(x).

Composite (‘chain’) rule

If y = f(z) and z = g(x) then y = f(g(x)) and

dy

dx=

dy

dz

dz

dx= f ′(z)g′(x).

Examples

1. If y = x6 + 3x4 − x2 + 1 then

dy

dx= 6x5 + 12x3 − 2x.

The first two rules (for scalar multiples and sums) are summed up by sayingthat differentiation is linear. Linearity, together with Example 4 in Section 1.7,means that we can differentiate any polynomial.

2. If y = e√x then y = ez, where z = x1/2. The chain rule states that

dy

dx=

dy

dz

dz

dx,

sody

dx=

d

dzez

d

dxx1/2 = ez × 1

2x−1/2 =

e√x

2√x.

3. If f(x) = sinh x = 12(ex − e−x) then, by linearity and the chain rule,

f ′(x) =ex − (−e−x)

2=ex + e−x

2= cosh x.

13

4. If f(x) = cosh x = 12(ex + e−x) then

f ′(x) =ex + (−e−x)

2=ex − e−x

2= sinh x.

So far we have been dealing with functions involving x and y. Whatever lettersthe variables are labelled with, the rules remain the same.

5. Differentiate the following functions with respect to the appropriate variable.

(i) 15t− 10t2 (ii) sin(

5θ +π

3

)

(iii) ecoshω

(i)d

dt(15t− 10t2) = 15− 20t.

(ii) If s = sin(

5θ + π3

)

and z = 5θ + π3then s = sin z and

ds

dθ=

ds

dz

dz

dθ= (cos z)× 5 = 5 cos

(

5θ +π

3

)

.

(iii) Let p = ecoshω and q = coshω, so that p = eq and

dp

dω=

dp

dq

dq

dω= eq × sinhω = ecoshω sinhω.

6. Evaluate the derivative of ln cos θ when θ = π/4.

Let y = ln cos θ and z = cos θ, so that y = ln z and

d

dθ(ln cos θ) =

dy

dθ=

dy

dz

dz

dθ=

1

z(− sin θ) = − sin θ

cos θ= − tan θ.

(We will show in Example 1 on p.16 thatd

dzln z =

1

z.) Hence the derivative

of ln cos θ at θ = π/4 is − tan(π/4) = −1.

7. Consider the function f(x) = 3x2 − 7x+ 1. Find

(i) the derivative of f(x),

(ii) the gradient of f(x) at x = 2 and

(iii) the equation of the tangent to y = f(x) at the point where x = 2.

14

(i) The derivative f ′(x) = 6x− 7.

(ii) The gradient of f(x) at x = 2 is therefore f ′(2) = 6(2)− 7 = 5.

(iii) The tangent is a straight line, so is described by the equation y = mx+ c.

The constant m is the gradient at x = 2, which is 5.

The tangent passes through the point(

2, f(2))

and

f(2) = 3(2)2 − 7(2) + 1 = −1.

Hence y = −1 when x = 2, so −1 = 5(2) + c and c = −11.

Thus the equation of the tangent line at x = 2 is y = 5x− 11.

Product rule


d

dx(yz) =

dy

dxz + y

dz

dx= f ′(x)g(x) + f(x)g′(x).

Quotient rule


d

dx

(y

z

)

=dydxz − y dz

dx

z2=f ′(x)g(x)− f(x)g′(x)

g(x)2.

Examples

1. If y = x3 sin x then

dy

dx=

d

dx(x3) sin x+ x3

d

dx(sin x) = 3x2 sin x+ x3 cos x.

2. If s = et√t then

ds

dt=

d

dt(et)t1/2 + et

d

dt(t1/2) = ett1/2 + et

(

1

2t−1/2

)

= et(√

t+1

2√t

)

.

3. If y = ln x sin x then

dy

dx=

d

dx(ln x) sin x+ ln x

d

dx(sin x) =

1

xsin x+ lnx cos x

=sin x

x+ lnx cos x.

15

4. If y =x+ 1

x2 + 3then

dy

dx=

(

ddx(x+ 1)

)

(x2 + 3)− (x+ 1) ddx(x2 + 3)

(x2 + 3)2

=(1)(x2 + 3)− (x+ 1)(2x)

(x2 + 3)2

=x2 + 3− 2x2 − 2x

(x2 + 3)2

=3− 2x− x2

(x2 + 3)2.

5. If T = tan θ =sin θ

cos θthen

dT

dθ=

(

ddθ

sin θ)

cos θ − sin θ ddθ

cos θ

cos2 θ=

cos2 θ + sin2 θ

cos2 θ=

1

cos2 θ= sec2 θ.

Inverse-function rule

If y = f−1(x) then x = f(

f−1(x))

= f(y). From the chain rule we have

1 =dx

dx=

dx

dy

dy

dx, so

dy

dx= 1

/

dx

dy=

1

f ′(y).

Examples

1. If y = ln x then x = ey anddx

dy= ey, so

dy

dx=

1

ey=

1

elnx=

1

x.

(This is one of the standard derivatives listed on p.6 of the Data Book.)

2. If y = sin−1 x then x = sin y anddx

dy= cos y, so

dy

dx=

1

cos y=

1√

1− sin2 y=

1√1− x2

.

(Since cos2 x+ sin2 x = 1, we know that cos y = ±√

1− sin2 y, but which signdo we take? As sin−1 x is increasing – draw a sketch to convince yourself ofthis – it has positive gradient, so we take the positive square root.)

16

3. If y = cos−1 x then x = cos y anddx

dy= − sin y, so

dy

dx=

1

− sin y=

−1√

1− cos2 y=

−1√1− x2

.

(Again, the positive root is chosen to give the right sign: the function cos−1 xis decreasing.)

4. If y = tanh−1 x then x = tanh y anddx

dy= sech2 y, so (using the fact that

sech2 y = 1− tanh2 y)

dy

dx=

1

sech2 y=

1

1− tanh2 y=

1

1− x2.

1.9 Second and higher derivatives

Definition

The second derivative of y = f(x), denoted byd2y

dx2or f ′′(x), is defined to be the

derivative of the first derivativedy

dx= f ′(x). In other words,

d2y

dx2=

d

dx

(

dy

dx

)

.

We can extend this to third, fourth and higher derivatives. In general, these are

written asdny

dxnor f (n)(x), where n is often written in Roman numerals in the latter

case: we have

f(x), f ′(x), f ′′(x), f ′′′(x), f (iv)(x), f (v)(x) and so on.

Examples

1. If f(x) = x3 − sin x then

f ′(x) = 3x2 − cos x,

f ′′(x) = 6x+ sin x,

f ′′′(x) = 6 + cosx,

f (iv)(x) = − sin x,

f (v)(x) = − cosx

et cetera.

17

2. A projectile is thrown with an initial vertical component of velocity 30ms−1.If the vertical height above the ground, s metres, is given by

s = 1·7 + 30t− 4·9t2,

find the time at which the projectile is at a maximum height above the groundand hence determine this maximum height. Calculate the acceleration of theprojectile.

The maximum height is achieved when the velocity v =ds

dtis zero, that is,

when the rate of change of distance with respect to time is zero. Thus

0 =ds

dt= 30− 9·8t

and the maximum height is achieved at time

t =30

9·8 = 3·06 (2 d.p.).

At this time,

s = 1·7 + 30(3·06)− 4·9(3·06)2 = 47.6 (1 d.p.),

so the maximum height achieved by the projectile is 47·6 metres (to one decimalplace).

Acceleration is the rate of change of velocity with respect to time, so this is

a =dv

dt=

d2s

dt2= −9·8.

Note that the acceleration is the same for any time t here. This is accelerationdue to the force of gravity acting toward the earth. Note also that, from theshape of the graph of s, it must have a maximum value and not a minimum.

1.10 Stationary points

If the derivative f ′(x) > 0 then the gradient of the function f(x) is positive, so thefunction is increasing; if f ′(x) < 0 then f has negative gradient, so the function isdecreasing.

Points where the derivative of a function is zero are called stationary points : theyare where the rate of change of the function is zero.

18

The second derivative can give us information on stationary points. Let x0 be astationary point for the function f(x), so that f ′(x0) = 0, and suppose f ′′(x0) > 0.

As f ′′(x0) is the gradient of f ′(x) at x0, the derivative f ′(x) has positive gradient,so is increasing, near x0. Hence f

′(x) must be negative to the left of x0 and positiveto the right of x0 (since f ′(x0) = 0). So the function f(x) is decreasing to the left ofx0 and increasing to the right of x0: the value f(x0) is a local minimum.

Similarly, if f ′′(x0) < 0 then the derivative f ′(x) is decreasing near x0, so the valuef(x0) is a local maximum.

In Example 2 on p.18,d2s

dt2= −9·8 < 0, so the height found there is a maximum (as

we have already seen).

If f ′′(x0) = 0 then the second derivative tells us nothing and we need to look directlyat the behaviour of f ′(x) for x near the stationary point.

Example

Find and classify the stationary points of the function y = x3.

Sincedy

dx= 3x2 = 0 ⇐⇒ x = 0,

the function has one stationary point, at x = 0. The second derivative

d2y

dx2= 6x,

which is zero at x = 0, so this tells us nothing about the nature of the stationary

point. Sincedy

dx> 0 if x 6= 0, the function y is increasing and x = 0 is neither a local

maximum nor a local minimum. (This should also be clear to you from a sketch ofthe function.)

1.11 Parametric differentiation

A curve may sometimes be expressed parametrically as x = g(t), y = h(t) ratherthan in the form y = f(x). For example,

(x, y) = (t2, 2t) (t > 0)

gives the curve y2 = 4x: a parabola.

19

In these circumstances we can use the chain rule to find the gradientdy

dx: note that

dy

dx=

dy

dt

dt

dx=

dy

dt

/

dx

dt.

The second derivative needs more care; in general,

d2y

dx26= d2y

dt2

/

d2x

dt2.

Instead, using the chain rule again, we have that

d2y

dx2=

d

dx

(dy

dx

)

=

(

d

dt

(dy

dx

)

)

dt

dx.

Examples

1. For the curve given parametrically by x = t2−1, y = t3−t, we see that dxdt

= 2t

anddy

dt= 3t2 − 1, so

dy

dx=

3t2 − 1

2t=

3

2t− 1

2t.

The second derivative is

d2y

dx2=

(

d

dt

(3

2t− 1

2t−1)

)

1

2t=(3

2+

1

2t−2) 1

2t=

3

4t+

1

4t3.

2. Consider the curve given parametrically by x = cos3 t, y = sin3 t; in this case,

dx

dt= −3 cos2 t sin t and

dy

dt= 3 sin2 t cos t.

Hencedy

dx=

3 sin2 t cos t

−3 cos2 t sin t= − sin t

cos t= − tan t

and

d2y

dx2=( d

dt(− tan t)

)( 1

−3 cos2 t sin t

)

=− sec2 t

−3 cos2 t sin t=

1

3 cos4 t sin t.

(Remember that sec t = 1/ cos t.)

20

1.12 Implicit differentiation

Sometimes y is not defined explicitly as a function of x or in terms of a parameter,but is merely related to x by means of an equation. For example, the circle withradius 2 and centre (0, 0) is given by the familiar formula

x2 + y2 = 4.

In this situation we treat y as an unknown function of x, differentiate both sides of

the equation with respect to x and then solve to find the derivativedy

dx.

To help with this we have the chain rule, which tells us that

d

dx

(

g(y))

=d

dy

(

g(y))dy

dx= g′(y)

dy

dx.

Examples

1. If x2 + y2 = 4 thend

dx(x2) +

d

dx(y2) =

d

dx(4) = 0,

so 2x+ 2ydy

dx= 0 and

dy

dx= −x

y.

2. Consider the ellipse given by the equation x2+ xy+ y2 = 3. Find the equationof the tangent to the ellipse at the point (1, 1).

The tangent at (1, 1) is a straight line with general equation y = mx + c; the

gradient m equals the value ofdy

dxwhen x = y = 1. Differentiating implicitly,

we haved

dx

(

x2 + xy + y2) =d

dx(3) = 0

and, noting that xy is a product, this implies that

2x+

(

y + xdy

dx

)

+ 2ydy

dx= 0, so (x+ 2y)

dy

dx= −2x− y.

Hencedy

dx= −2x+ y

x+ 2yand m = −2 + 1

1 + 2= −1.

To find c, note that the tangent passes through the point (1, 1), so 1 = −1 + cand c = 2. The equation of the tangent is therefore y = 2− x.

21

3. For the ellipse of Example 2, findd2y

dx2.

Differentiating both sides of the equation

2x+ xdy

dx+ y + 2y

dy

dx= 0

implicitly with respect to x, noting that xdy

dxand 2y

dy

dxare products, we have

2 +

(

dy

dx+ x

d2y

dx2

)

+dy

dx+

(

2

(

dy

dx

)2

+ 2yd2y

dx2

)

= 0.

Hence

(x+ 2y)d2y

dx2= −2 − 2

dy

dx− 2

(

dy

dx

)2

andd2y

dx2= −2

(

1 + dydx

+ (dydx)2

x+ 2y

)

= −2

(

1− 2x+yx+2y

+ (2x+yx+2y

)2

x+ 2y

)

.

Multiplying the numerator and denominator of this last quantity by (x+2y)2,we see that

d2y

dx2= −2

(

(x+ 2y)2 − (2x+ y)(x+ 2y) + (2x+ y)2

(x+ 2y)3

)

= −2

(

x2 + 4xy + 4y2 − 2x2 − 4xy − xy − 2y2 + 4x2 + 4xy + y2

(x+ 2y)3

)

= −2

(

3(x2 + xy + y2)

(x+ 2y)3

)

.

Finally, the original equation tells us that x2 + xy + y2 = 3, so

d2y

dx2=

−18

(x+ 2y)3.

1.13 Logarithmic differentiation

A neat trick, known as logarithmic differentiation, can be used to find the derivativeof expressions of the form y = f(x)g(x). The idea is to take the natural logarithmsof both sides and then differentiate with respect to x.

22

Note first that if y = ln(

h(x))

then, by the chain rule,

dy

dx=h′(x)

h(x). (⋆)

Example

If y = (cosh x)x thenln y = ln

(

(cosh x)x)

= x ln cosh x.

Differentiating both sides with respect to x (using the chain rule and noting thatx ln cosh x is a product) gives that

1

y

dy

dx= ln cosh x+ x

d

dx(ln cosh x) = ln cosh x+ x

(

sinh x

cosh x

)

,

sody

dx= y(ln cosh x+ x tanh x) = (cosh x)x(ln cosh x+ x tanh x).

This technique can also be used to simplify expressions which contain products,quotients and composite functions.

Example

Finddy

dxif y =

4√sin 2x

(x+ 1)2(3x− 1).

First, take logarithms of both sides:

ln y = ln 4 +1

2ln sin 2x− 2 ln(x+ 1)− ln(3x− 1).

Differentiating with respect to x, using the observation (⋆) above and noting thatln 4 is a constant, we see that

1

y

dy

dx=

cos 2x

sin 2x− 2

x+ 1− 3

3x− 1.

Rearranging and replacing y by its definition as a function of x, we see that

dy

dx=

4√sin 2x

(x+ 1)2(3x− 1)

(

cot 2x− 2

x+ 1− 3

3x− 1

)

.

23

2 Integration

2.1 Definite integrals

Definition

Let f(x) be a function on the interval [a, b], where a < b; in other words, f(x) isdefined for all values of x from x = a to x = b inclusive. Informally, we define thedefinite integral of f(x) from a to b, denoted by

∫ b

a

f(x) dx,

to be the area under the graph of y = f(x) between x = a and x = b. More precisely,we mean the area between y = f(x) and the x-axis, with the area below the x-axiscounting as negative. We also define

∫ a

b

f(x) dx = −∫ b

a

f(x) dx

and∫ a

a

f(x) dx = 0.

We call the function being integrated, f(x), the integrand of the integral, dx thedifferential of x and the points a and b the limits of integration. Think of theintegral sign

∫

(a stretched ‘S’, for ‘sum’) and the differential dx as brackets aroundthe integrand – they must always appear together. The x in dx tells us the name ofthe variable we are integrating over.

Examples

1. Find

∫ b

a

c dx, where c is constant.

The graph of y = c is a horizontal straight line, and the (positive) rectangulararea between this line and the x-axis is either (b− a)c (if c > 0) or (b− a)(−c)(if c < 0). As area below the x-axis counts as negative, we see that, in bothcases,

∫ b

a

c dx = (b− a)c.

24

B

A

–4

–2

0

2

4

6

1 2 3 4 5

y

x

Figure 4: y = 2x− 4

2. Calculate

∫ 5

0

(2x− 4) dx.

In Figure 4, area A = 12× 2 × 4 = 4 and area B = 1

2× 3 × 6 = 9. Since A is

below the x-axis, its area counts as negative, so

∫ 5

0

(2x− 4) dx = 9− 4 = 5.

For more complicated functions, the area is found (and the integral is defined) bytaking the limit of the approximations we obtain by dividing the area into rectangularstrips, each of width δx, and then letting δx → 0. (See Figure 5.)

0

1

2

3

4

5

6

7

1 2 3 4x

y

Figure 5: y = 12x(x− 1)(x− 31

10) + 2

25

Ifa = x0 < a+ δx = x1 < a+ 2δx = x2 < · · · < xn = b

and the height of each rectangle is 12

(

f(xi−1) + f(xi))

for i = 1, 2, 3, · · · , n (given nsuch rectangles) then the approximate area is equal to

n∑

i=1

1

2

(

f(xi−1) + f(xi))

δx.

As δx→ 0,n∑

i=1

1

2

(

f(xi−1) + f(xi))

δx→∫ b

a

f(x) dx.

This concept (the integral as the limit of certain sums) will come in useful later forsome of the applications of integration. As with differentiation, however, we willgenerally not work from first principles.

2.2 Indefinite integrals

Definition

The function F (x) is an indefinite integral for the continuous function f(x) if F (x)is differentiable and F ′(x) = f(x), in which case we write

∫

f(x) dx = F (x) + c,

where c is the constant of integration.

Here there are no limits of integration. A function, the indefinite integral F (x),is the result of indefinite integration, rather than a number (the area) for definiteintegration.

Example

Sinced

dxtanx = sec2 x, the function tan x is an indefinite integral for sec2 x and

∫

sec2 x dx = tan x+ c.

The Fundamental Theorem of Calculus gives the link between indefinite and definiteintegrals. It is what allows us to calculate definite integrals simply by recognisingderivatives, and without having to find the areas of lots of rectangles.

26

2.3 The fundamental theorem of calculus

Theorem (FTC)

If F (x) is an indefinite integral for the continuous function f(x), so that F ′(x) = f(x),then the definite integral

∫ b

a

f(x) dx =

[

F (x)

]b

a

= F (b)− F (a).

(The square-bracket notation is just shorthand for the final quantity.)

Put simply, this means that to integrate, we need only recognise the integrand as aderivative.

The constant of integration

The Fundamental Theorem of Calculus means that integration can be regarded as

the inverse process to differentiation. We know thatd

dx(x2) = 2x, so x2 is one choice

for the indefinite integral∫

2x dx.

However, differentiating x2 + 3 or x2 − 7 also results in 2x, since differentiatinga constant gives zero, so these three functions are all equally good choices for anindefinite integral of 2x. All we can say is

∫

2x dx = x2 + c;

a constant of integration must be included with the indefinite integral.

If F (x) and G(x) are both indefinite integrals for f(x) then

d

dx

(

F (x)−G(x))

=d

dxF (x)− d

dxG(x) = f(x)− f(x) = 0,

so F (x)−G(x) is an indefinite integral for the zero function. By the FTC,

0 =

∫ b

a

0 dx =(

G(b)− F (b))

−(

G(a)− F (a))

and rearranging this shows that G(b)−G(a) = F (b)− F (a). Hence any two choicesof indefinite integral for f(x) differ at most by a constant, and all choices give the

same value for the definite integral

∫ b

a

f(x) dx.

27

Examples

1. Find the indefinite integrals of (a) xn (when n 6= −1), (b)1

xand (c) sin x.

(a) We haved

dx(xn+1) = (n+ 1)xn, so

d

dx

(

1

n+ 1xn+1

)

= xn.

Hence

∫

xn dx =1

n+ 1xn+1 + c.

(b) For x > 0 we know thatd

dx(ln x) =

1

x. On the other hand, for x < 0 we

haved

dx

(

ln(−x))

= − 1

−x =1

x.

Hence

∫

1

xdx = ln x+ c1 if x > 0 and

∫

1

xdx = ln(−x) + c2 if x < 0. Thus

∫

1

xdx = ln |x|+ c.

(c) We know that

d

dx(− cosx) = − d

dxcos x = −(− sin x) = sin x

and therefore

∫

sin x dx = − cos x+ c.

2. Calculate

∫ 4

2

x2 dx.

From 1(a),

∫

x2 dx =1

2 + 1x2+1 + c =

1

3x3 + c. Therefore, by the FTC,

∫ 4

2

x2 dx =

[

1

3x3 + c

]4

2

=

(

1

343 + c

)

−(

1

323 + c

)

=64− 8

3=

56

3.

Note that the constant of integration always cancels out, so we can ignore cwhen finding a definite integral.

28

2.4 Elementary integration

Rules of integration

(a) Scalar-multiplication rule

If k is a constant then

∫

kf(x) dx = k

∫

f(x) dx.

(b) Sum rule∫

(

f(x) + g(x))

dx =

∫

f(x) dx+

∫

g(x) dx.

Hence integration, like differentiation, is linear.

(c) Linear-composite rule

If a and b are constants and F ′(x) = f(x) then∫

f(ax+ b) dx =1

aF (ax+ b) + c.

This is a special case of integration by substitution, which we will meet later.To see why it is true, we use the chain rule: let y = ax+ b and note that

d

dx

(1

aF (ax+ b)

)

=1

a

dF

dy

dy

dx=

1

aF ′(y)a = f(ax+ b),

as required.

Examples

1. Find

∫

(6x3 + 15x2 + 2 sin x) dx.

We have that∫

(6x3 + 15x2 + 2 sinx) dx

= 6

∫

x3 dx+ 15

∫

x2 dx+ 2

∫

sin x dx by (a) and (b)

= 61

4x4 + 15

1

3x3 + 2

∫

sin x dx by Example 1 on p.28

=3

2x4 + 5x3 − 2 cosx+ c by the FTC.

Note that we only need one constant of integration.

29

2. Find

∫

(1 + 7x)3/2 dx.

Since

∫

y3/2 dy =2

5y5/2 + c, the linear-composite rule (c) shows that

∫

(1 + 7x)3/2 dx =1

7× 2

5(1 + 7x)5/2 + c =

2

35(1 + 7x)5/2 + c.

3. Find

∫

(2 sin 3θ − 3 cos 5θ) dθ.

The FTC implies that

∫

sinψ dψ = − cosψ + c1 and

∫

cosψ dψ = − sinψ + c2.

Hence∫

(2 sin 3θ − 3 cos 5θ) dθ = 2

∫

sin 3θ dθ − 3

∫

cos 5θ dθ by (a) and (b)

= 2× 1

3(− cos 3θ)− 3× 1

5(sin 5θ) + c by (c)

= −2

3cos 3θ − 3

5sin 5θ + c.

Integration is inherently more difficult than differentiation. Whereas methodicallyapplying various rules allows us to differentiate most reasonable functions, the sameis not true for integration. Indeed, some fairly elementary functions such as

√sin x

have no elementary function as their integral, and numerical methods are required.Much of the practice of integration involves using certain techniques, some of whichare given below.

2.5 Trigonometric identities

Examples

1. Since sinα sin β = 12

(

cos(α− β)− cos(α + β))

,

∫

sin 4x sin 3x dx =1

2

∫

(cosx− cos 7x) dx =1

2sin x− 1

14sin 7x+ c.

30

2. Since cos 2x = 2 cos2 x− 1, it follows that∫ π/2

0

cos2 x dx =

∫ π/2

0

1

2(1 + cos 2x) dx

=1

2

[

x+1

2sin 2x

]π/2

0

=

(

π

4+

1

4sin π

)

−(

0 +1

4sin 0

)

=π

4.

2.6 Partial fractions

This method is used to integrate rational functions: these are functions that can bewritten as the quotient of two polynomials.

1. Find

∫

5x− 4

x2 − 8x+ 12dx.

Note first that5x− 4

x2 − 8x+ 12=

5x− 4

(x− 6)(x− 2)=

13

2(x− 6)− 3

2(x− 2), so

∫

5x− 4

x2 − 8x+ 12dx =

13

2

∫

1

x− 6dx− 3

2

∫

1

x− 2dx

=13

2ln |x− 6| − 3

2ln |x− 2|+ c.

2. Find

∫

4x+ 3

(x− 3)2dx.

Since4x+ 3

(x− 3)2=

15

(x− 3)2+

4

x− 3,

it follows that∫

4x+ 3

(x− 3)2dx = 15

∫

1

(x− 3)2dx+ 4

∫

1

x− 3dx =

−15

x− 3+ 4 ln |x− 3|+ c.

3. Find

∫

1

1− x2dx.

As1

1− x2=

1

2(1− x)+

1

2(1 + x), we see that

∫

1

1− x2dx =

1

2

∫

1

1− xdx+

1

2

∫

1

1 + xdx

= −1

2ln |1− x|+ 1

2ln |1 + x|+ c =

1

2ln

∣

∣

∣

∣

1 + x

1− x

∣

∣

∣

∣

+ c.

31

2.7 Completing the square

Any integral of the form

∫

1

ax2 + bx+ cdx or

∫

1√ax2 + bx+ c

dx,

where a 6= 0, can, by completing the square, be transformed into one of the formsgiven in the table of standard integrals on p.6 of the Data Book.

Examples

1. Find

∫

1

x2 + 4x+ 7dx.

Completing the square gives

x2 + 4x+ 7 = (x+ 2)2 + 3 = 3

(

1 +(x+ 2√

3

)2)

,

so∫

1

x2 + 4x+ 7dx =

1

3

∫

1

1 +(

(x+ 2)/√3)2 dx

=1

3×

√3 tan−1

(

x+ 2√3

)

+ c =1√3tan−1

(

x+ 2√3

)

+ c.

2. Find

∫

1√

x(1 + x)dx.

Completing the square gives

x(1 + x) = x+ x2 =(

x+1

2

)2 − 1

4=

1

4

(

(2x+ 1)2 − 1)

.

Hence∫

1√

x(1 + x)dx =

√4

∫

1√

(2x+ 1)2 − 1dx

= 2× 1

2cosh−1(2x+ 1) + c = cosh−1(2x+ 1) + c.

32

2.8 Piecewise-standard functions

The observation that∫ b

a

f(x) dx =

∫ c

a

f(x) dx+

∫ b

c

f(x) dx

whenever a < c < b allows us to integrate piecewise-standard functions, i.e., functionswhich are made up by ‘gluing together’ standard functions on subintervals.

Example

Evaluate

∫ 2

−1

|x3| dx.

Noting that |x3| = x3 if x > 0 and |x3| = −x3 if x 6 0, it follows that

∫ 2

−1

|x3| dx =

∫ 0

−1

(−x3) dx+∫ 2

0

x3 dx

=1

4

([

−x4]0

−1+[

x4]2

0

)

=1

4(0− (−1) + 24 − 0) =

17

4.

This idea will be useful when studying Fourier Series in MATH145.

2.9 Further methods of integration

Integration by parts

There is no simple ‘product rule’ for integration. However, the product rule fordifferentiation,

d

dx(uv) = u

dv

dx+ v

du

dx

can be rearranged to give

udv

dx=

d

dx(uv)− v

du

dx.

Integrating with respect to x and using the FTC, we see that∫

udv

dxdx = uv −

∫

vdu

dxdx.

If the function to be integrated can be expressed as udv

dxfor suitable functions u

and v then we can apply this formula, called ‘integration by parts’. The trick is tochoose a function u which gets simpler when differentiated.

33

Examples

1. Find

∫

xex dx.

Put u = x and v = ex, so thatdv

dx= ex and

du

dx= 1. Then

∫

xex dx =

∫

udv

dxdx

= uv −∫

vdu

dxdx = xex −

∫

ex dx = xex − ex + c.

By choosing u = x, the integrand vdu

dx= ex is a function we know how to

integrate. Sometimes getting to this stage requires more than one applicationof the formula.

2. Find

∫

x2 sin x dx.

Here we take u = x2 and v = − cos x. (If we choose u = sin x then the integrand

vdu

dx=

1

3x3 cosx is more complicated.) Then

du

dx= 2x and

∫

x2 sin x dx = −x2 cosx+ 2

∫

x cosx dx.

Using integration by parts again, on this new integral (taking u = x andv = sin x) we see that

∫

x2 sin x dx = −x2 cos x+ 2x sin x− 2

∫

sin x dx

= −x2 cos x+ 2x sin x+ 2 cosx+ c.

3. Find

∫

x3 ln x dx.

Put u = ln x and v =1

4x4, so that

du

dx=

1

xand

dv

dx= x3. Then

∫

x3 ln x dx =1

4x4 lnx− 1

4

∫

x41

xdx

=1

4x4 lnx− 1

4

∫

x3 dx =1

4x4 ln x− 1

16x4 + c.

34

In the examples above, the integrand was made simpler until it could be recognisedas the derivative of something. When dealing with a product involving functionssuch as eax, sin x and cosx, the integrand may not simplify. However, we may beable to integrate by parts twice, return to something very like the original and hencedetermine our integral. An example will explain this idea.

Example

Evaluate I =

∫ π/4

0

e5x sin 2x dx.

If u = e5x and v = −12cos 2x then

du

dx= 5e5x and

dv

dx= sin 2x, so integration by

parts shows that

I =

∫ π/4

0

e5x sin 2x dx =

[

−1

2e5x cos 2x

]π/4

0

+5

2

∫ π/4

0

e5x cos 2x dx

=

(

−1

2e5π/4 cos

π

2+

1

2e0 cos 0

)

+5

2

∫ π/4

0

e5x cos 2x dx

=1

2+

5

2

∫ π/4

0

e5x cos 2x dx.

Now put u = e5x and v = 12sin 2x, so

du

dx= 5e5x,

dv

dx= cos 2x and integrating by

parts once more gives that

I =1

2+

5

2

(

[

1

2e5x sin 2x

]π/4

0

− 5

2

∫ π/4

0

e5x sin 2x dx

)

=1

2+

5

2

(

1

2e5π/4 sin

π

2− 1

2e0 sin 0− 5

2I

)

=1

2+

5

4e5π/4 − 25

4I.

Hence29

4I =

1

2+

5

4e5π/4

and therefore

I =4

29

(

1

2+

5

4e5π/4

)

= 8·8197 (4 d.p.).

35

2.10 Integration by substitution

This technique is a consequence of the chain rule for differentiation. Let F (x) be anindefinite integral for f(x) and let y = g(x). By the chain rule,

d

dxF(

g(x))

= F ′(

g(x))

g′(x) = f(y)dy

dx.

The FTC implies that∫

f(y) dy = F (y) + c = F(

g(x))

+ c =

∫

f(y)dy

dxdx

and we have the following change-of-variable formula:∫

f(y) dy =

∫

f(y)dy

dxdx.

Some examples will show how this works in practice; recall thatdx

dy= 1

/ dy

dx.

Examples

1. To find I1 =

∫

x2√5 + x3 dx, put t = 5 + x3, so that

dt

dx= 3x2 and

I1 =

∫

x2√tdx

dtdt =

1

3

∫ √t dt =

2

9t3/2 + c =

2

9(5 + x3)3/2 + c.

2. To find I2 =

∫

x+ 2

x2 + 4x+ 7dx, put t = x2 + 4x+ 7, so that

dt

dx= 2x+ 4 and

I2 =

∫

x+ 2

t

dx

dtdt =

1

2

∫

1

tdt =

1

2ln |t|+ c =

1

2ln |x2 + 4x+ 7|+ c.

3. To find I3 =

∫

tan3 x dx, put t = tan x, so thatdt

dx= sec2 x = 1 + t2 and

I3 =

∫

t3dx

dtdt =

∫

t3

1 + t2dt

=

∫

(

t− t

1 + t2

)

dt

=1

2t2 − 1

2ln(1 + t2) + c =

1

2tan2 x+ ln | cosx| + c.

36

Often the integrand is not of a form which permits a straightforward use of thetechnique. A substitution is made which we hope will lead to a simplification.

Example

To find

∫

1

x+√2− x

dx, we try to get rid of the square root by putting t =√2− x.

Then t2 = 2− x, so x = 2− t2 anddx

dt= −2t. Hence

∫

dx

x+√2− x

=

∫

1

2− t2 + t

dx

dtdt

=

∫

2t

(t− 2)(t+ 1)dt

=

∫

4

3(t− 2)+

2

3(t+ 1)dt

=4

3ln |t− 2|+ 2

3ln |t+ 1|+ c

=4

3ln |

√2− x− 2|+ 2

3ln |

√2− x+ 1|+ c.

When applying substitution to definite integrals we do not need to substitute backto get the answer in terms of the original variable. Instead we modify the limits ofintegration.

Examples

1. Show that I1 =

∫ 7

2

1

(x+ 1)√x+ 2

dx = ln(3/2).

Put t =√x+ 2 and note that

dt

dx=

1

2√x+ 2

=1

2t. As x increases from 2

to 7, t increases from 2 to 3, so

I1 =

∫ 3

2

1

(t2 − 1)t

dx

dtdt = 2

∫ 3

2

1

t2 − 1dt =

∫ 3

2

( 1

t− 1− 1

t+ 1

)

dt

=[

ln(t− 1)− ln(t+ 1)]3

2

= ln 2− ln 4− ln 1 + ln 3

= ln(3/2)

= 0·4055 (4 d.p.).

37

2. Evaluate I2 =

∫ 3/2

0

x2√9− x2

dx.

Put x = 3 sin θ, so thatdx

dθ= 3 cos θ and

√9− x2 =

√

9− 9 sin2 θ = 3√

1− sin2 θ = ±3 cos θ.

To get the range of integration from x = 0 to x = 3/2, we take θ = 0 toθ = sin−1(1/2) = π/6. Then cos θ is positive on this interval, so

I2 =

∫ π/6

0

9 sin2 θ

3 cos θ

dx

dθdθ =

∫ π/6

0

3 sin2 θ

cos θ3 cos θ dθ = 9

∫ π/6

0

sin2 θ dθ.

Now,

∫ π/6

0

sin2 θ dθ =1

2

∫ π/6

0

(1− cos 2θ) dθ

=1

2

[

θ − 1

2sin 2θ

]π/6

0

=1

2

(

π

6− 1

2sin

π

3+

1

2sin 0

)

=1

2

(

π

6− 1

2

√3

2

)

and therefore I2 =3π

4− 9

√3

8= 0·4076 (4 d.p.).

3. Evaluate I3 =

∫ π/4

0

1

cos2 x+ 9 sin2 xdx.

Put t = tanx, so thatdt

dx= sec2 x and as x goes from 0 to π/4, the variable t

goes from 0 to 1. Then

I3 =

∫ 1

0

1

cos2 x+ 9 sin2 x

dx

dtdt

=

∫ 1

0

cos2 x

cos2 x+ 9 sin2 xdt =

∫ 1

0

1

1 + 9t2dt =

[1

3tan−1(3t)

]1

0=

1

3tan−1 3,

which equals 0·4163 (4 d.p.). (The penultimate equality follows from the linear-composite rule.)

38

Inverse-function rule

Integration by parts and by substitution combine to give the following formula.

If y = f−1(x), so that x = f(y), then∫

f−1(x) dx = xy −∫

f(y) dy.

(To see where this comes from, note that∫

y dx =

∫

1× y dx = xy −∫

xdy

dxdx = xy −

∫

x dy.

The trick of writing the integrand g(x) as 1× g(x) and then integrating by parts canbe useful in other situations.)

Examples

1. If y = ln x then x = ey and∫

ln x dx = xy −∫

ey dy = xy − ey + c = x ln x− x+ c.

2. If y = sin−1 x then x = sin y and∫

sin−1 x dx = xy −∫

sin y dy

= x sin−1 x+ cos y + c

= x sin−1 x+

√

1− sin2 y + c

= x sin−1 x+√1− x2 + c.

(We take the positive square root because cos y > 0 when −π/2 6 y 6 π/2,which is the range of values taken by sin−1.)

3. If y = sinh−1 x then x = sinh y and∫

sinh−1 x dx = xy −∫

sinh y dy

= x sinh−1 x− cosh y + c

= x sinh−1 x−√

1 + sinh2 y + c = x sinh−1 x−√1 + x2 + c.

(Since cosh y > 0 for all y, we take the positive square root.)

39

2.11 Applications of integration

Arc length of a curve

The length along the curve y = f(x) between x = a and x = b is equal to

s =

∫ b

a

√

1 +

(

dy

dx

)2

dx ≈∑

√

1 +

(

δy

δx

)2

δx =∑

√

(δx)2 + (δy)2.

Example

Find the length of the parabola y = x2 between x = 0 and x = 1.

From the formula given above,

s =

∫ 1

0

√

1 + (2x)2 dx =

∫ 1

0

√1 + 4x2 dx.

Putting x = 12sinh u, we have

dx

du= 1

2cosh u and

√1 + 4x2 =

√

1 + sinh2 u = cosh u.

When x = 0, u = sinh−1 0 = 0 and when x = 1, u = sinh−1 2, so

s =

∫ sinh−1 2

0

cosh udx

dudu

=1

2

∫ sinh−1 2

0

cosh2 u du

=1

4

∫ sinh−1 2

0

(1 + cosh 2u) du

=1

4

[

u+sinh 2u

2

]sinh−1 2

0

(⋆)

=1

4

[

u+ sinh u cosh u]sinh−1 2

0

=1

4

[

u+ sinh u√

1 + sinh2 u]sinh−1 2

0

=1

4

(

sinh−1 2 + 2√1 + 22 − 0− 0

)

=

√5

2+

1

4sinh−1 2 = 1·4789 (4 d.p.).

(You could use your calculator to find the answer once you reach (⋆).)

40

Area of a plane region

Suppose f(x) and g(x) are continuous functions and f(x) > g(x) as x ranges overthe interval [a, b], i.e., for all x such that a 6 x 6 b. The area of the plane regionbetween the graphs of f(x) and g(x) and the lines x = a and x = b is equal to

A =

∫ b

a

(

f(x)− g(x))

dx =

∫ x=b

x=a

∫ y=f(x)

y=g(x)

dy dx ≈∑∑

δx δy.

(Draw a picture to convince yourself of this.)

Example

Find the area enclosed by the curves y = sec2 x and y = sin x between x = 0 andx = π/4.

0

0.5

1

1.5

2

0.1 0.2 0.3 0.4 0.5 0.6 0.7

x

y

y = g(x)

y = f(x)

Figure 6: f(x) = sec2 x, g(x) = sin x

Note that sec2 x > 1 > sin x for 0 6 x 6 π/4, as required, so the area

A =

∫ π/4

0

(sec2 x− sin x) dx =[

tan x+ cosx]π/4

0

= tanπ

4+ cos

π

4− tan 0− cos 0

= 1 +1√2− 0− 1

=1√2= 0·7071 (4 d.p.).

41

Centroid of a plane area

As above, let f(x) and g(x) be continuous functions such that f(x) > g(x) on [a, b]and consider the same plane region, that bounded by the curves y = f(x), y = g(x),x = a and x = b, which has area A.

The centroid of this region has coordinates (x, y) given by

x =1

A

∫ b

a

x(

f(x)− g(x))

dx =1

A

∫ x=b

x=a

∫ y=f(x)

y=g(x)

x dy dx ≈ 1

A

∑∑

x δx δy

and

y =1

2A

∫ b

a

(f(x)2 − g(x)2) dx =1

A

∫ x=b

x=a

∫ y=f(x)

y=g(x)

y dy dx ≈ 1

A

∑∑

y δx δy.

In the special case where the lower curve is the x-axis (i.e., g(x) = 0) we have

A =

∫ b

a

f(x) dx, x =1

A

∫ b

a

xf(x) dx and y =1

2A

∫ b

a

f(x)2 dx.

If the region is made out of a thin material of uniform density then the centroid isits centre of mass. The centre of mass r of a finite number of particles is equal totheir average position, weighted by their masses: r =

∑

imiri/∑

imi; analogously,the coordinates of the centroid are given by the integrals of position times density.

Example

Find the coordinates of the centroid of the region enclosed by the parabola y = x2−xand the line y = x. (See Figure 7.)

–1

0

1

2

3

4

5

6

–1 1 2 3x

y

y = g(x)

y = f(x)

Figure 7: f(x) = x, g(x) = x2 − x

42

The two curves intersect when

x2 − x = x ⇐⇒ x2 − 2x = 0 ⇐⇒ x(x− 2) = 0 ⇐⇒ x = 0 or 2.

In between x = 0 and x = 2 we have x > x2 − x, so we take

f(x) = x and g(x) = x2 − x.

Then

A =

∫ 2

0

(2x− x2) dx =

[

x2 − x3

3

]2

0

= 4− 8

3− 0 + 0 =

4

3,

so

x =1

A

∫ 2

0

x(2x− x2) dx =3

4

[

2x3

3− x4

4

]2

0

=3

4

(

16

3− 4− 0 + 0

)

= 1

and

y =1

2A

∫ 2

0

(x2 − (x2 − x)2) dx =3

8

∫ 2

0

(x2 − x4 + 2x3 − x2) dx

=3

8

[

−1

5x5 +

1

2x4]2

0

=3

8

(

−32

5+ 8− 0 + 0

)

=3

5.

The coordinates of the centroid are therefore (1, 3/5).

Area of a surface of revolution

A surface of revolution is obtained by rotating the curve y = f(x) (from x = a tox = b) through 2π radians about the x-axis, where f(x) > 0 for a 6 x 6 b. Itssurface area S is given by the following formula:

S = 2π

∫ b

a

y

√

1 +

(

dy

dx

)2

dx

=

∫ θ=2π

θ=0

∫ x=b

x=a

y

√

1 +

(

dy

dx

)2

dx dθ ≈∑∑

√

(δx)2 + (δy)2 y δθ.

If θ is the angle through which the curve is rotated then a small patch of the surfacewill have width y δθ and length δs =

√

(δx)2 + (δy)2, so area δs y δθ.

43

Examples

1. Find the surface area of the part of the sphere x2 + y2 + z2 = r2 which isenclosed by the planes x = a and x = b, where −r 6 a < b 6 r.

The sphere may be obtained by rotating y =√r2 − x2 about the x-axis. On

this curve we have y2 = r2−x2, and differentiating with respect to x gives that

2ydy

dx= −2x ⇐⇒ dy

dx= −x

y.

Hence

S = 2π

∫ b

a

y

√

1 +x2

y2dx = 2π

∫ b

a

√

y2 + x2 dx = 2π

∫ b

a

r dx = 2πr(b− a).

2. Find the surface area of the surface of revolution obtained by rotating aboutthe x-axis the curve y = ex from x = 0 to x = ln 3.

–10

–5

0

5

10–10

–5

5

100.5

1

1.5

2

2.5

x

y

z

Figure 8: The surface of revolution obtained by rotating y = ex

From the formula,

S = 2π

∫ ln 3

0

y

√

1 +

(

dy

dx

)2

dx = 2π

∫ ln 3

0

y√

1 + y2 dx

= 2π

∫ 3

1

y√

1 + y2dx

dydy = 2π

∫ 3

1

√

1 + y2 dy.

44

Using the substitution u = sinh−1 y and noting thatdy

du= cosh u, we have

S = 2π

∫ sinh−1 3

sinh−1 1

√

1 + sinh2 udy

dudu = 2π

∫ sinh−1 3

sinh−1 1

cosh2 u du

= π

∫ sinh−1 3

sinh−1 1

(1 + cosh 2u) du

= π

[

u+1

2sinh 2u

]sinh−1 3

sinh−1 1

= 28·3048 (4 d.p.).

Volume of a solid of revolution

A solid of revolution is obtained by rotating about the x-axis the area beneath thecurve y = f(x), where x ranges over the interval [a, b] and f(x) > 0 for all such x.Its volume V is given by the formula

V = π

∫ b

a

f(x)2 dx =

∫ 2π

θ=0

∫ x=b

x=a

∫ f(x)

y=0

y dy dx dθ ≈∑∑∑

δx δy y δθ.

If θ is the angle through which the curve is rotated, a small piece of V will havelength δx, height δy and width y δθ.

Examples

1. Find the volume of the solid of revolution obtained by rotating the area belowthe parabola y = 2− x2 and above the x-axis.

–2

–1

0

1

2

–2 –1 1 2x

y

Figure 9: The parabola y = 2− x2

45

The parabola y = 2 − x2 crosses the x-axis at 2 − x2 = 0, so when x = ±√2,

and lies above the x-axis in between. Hence

V = π

∫

√2

−√2

(2− x2)2 dx = π

∫

√2

−√2

(4− 4x2 + x4) dx

= π

[

4x− 4

3x3 +

x5

5

]

√2

−√2

= π

(

4√2− 4

32√2 +

4√2

5+ 4

√2− 4

32√2 +

4√2

5

)

=64√2π

15= 18·9563 (4 d.p.).

2. A sphere of radius r is obtained by rotating the curve y =√r2 − x2 about the

x-axis, where −r 6 x 6 r. Show that its volume V = 4πr3/3.

From the formula above,

V = π

∫ r

−r

(r2 − x2) dx = π

[

r2x− x3

3

]r

−r

= π(

r3 − r3

3+ r3 − r3

3

)

=4πr3

3,

as required.

Centre of mass of a solid of revolution

A solid of revolution is obtained as above, by rotating about the x-axis the areabeneath the curve y = f(x), with x ranging from a to b and f(x) > 0. Its centre ofmass lies on the x-axis (i.e., has y and z coordinates y = z = 0) and has x coordinate

x =π

V

∫ b

a

xf(x)2 dx =1

V

∫ 2π

θ=0

∫ b

x=a

x

∫ f(x)

y=0

y dy dx dθ ≈ 1

V

∑∑∑

x δx δy y δθ.

As for the centroid, the coordinates of the centre of mass are given by integratingposition times density. As the solid is symmetrical about the x-z and x-y planes, theintegrals for y and z will both be zero.

46

Example

The area enclosed by the curve y = (3x + 1)1/4, the x-axis, the y-axis and the linex = 5 is rotated through 2π radians about the x-axis. Calculate the x coordinate ofthe centre of mass of the resulting solid of revolution.

–2

–1

0

1

2–2

–1

1

21

2

3

4

5

x

y

z

Figure 10: The solid of revolution obtained from y = (3x+ 1)1/4

From the formula above,

V = π

∫ 5

0

(

(3x+ 1)1/4)2

dx = π

∫ 5

0

(3x+ 1)1/2 dx

= π

[

2

9(3x+ 1)3/2

]5

0

=2π

9(163/2 − 13/2) = 14π.

Hence

x =π

V

∫ 5

0

xf(x)2 dx =1

14

∫ 5

0

x(3x+ 1)1/2 dx.

Using integration by parts with u = x and v =2

9(3x+ 1)3/2, we have

x =1

14

(

[

2

9x(3x+ 1)3/2

]5

0

− 2

9

∫ 5

0

(3x+ 1)3/2 dx

)

=1

63

(

320−[

2

15(3x+ 1)5/2

]5

0

)

=320

63− 2

63

1023

15=

102

35= 2·9143 (4 d.p.).

47

3 Numerical methods

3.1 The trapezium rule

In practical situations, functions often cannot be integrated analytically (in termsof familiar functions, as above). We have to resort to numerical methods to obtainanswers with the desired degree of accuracy.

0

0.5

1

1.5

2

2.5

3

3.5

1 2 3 4

x

y

Figure 11: The trapezium rule

To find an approximate value for

∫ b

a

f(x) dx, first subdivide the interval from a to binto n equal subintervals:

a = x0 < x1 < x2 < · · · < xn−1 < xn = b,

with xi = xi−1 + h for i = 1, . . . , n and h = (b − a)/n. We approximate the areabeneath the graph between xi−1 and xi by the area Ai of a trapezium:

Ai =12

(

f(xi−1) + f(xi))

h.

48

Adding these, we see that

∫ b

a

f(x) dx ≈n∑

i=1

Ai

=h

2

(

(

f(x0) + f(x1))

+(

f(x1) + f(x2))

+ · · ·+(

f(xn−1) + f(xn))

)

=h

2

(

f(x0) + 2f(x1) + 2f(x2) + · · ·+ 2f(xn−1) + f(xn))

= T (h).

This is the trapezium-rule approximation to

∫ b

a

f(x) dx using n strips of width h.

Example

Use the trapezium rule with 4 strips to find an approximation for I =

∫ 5

1

e−√x dx.

Work to 3 s.f.

As h = (5 − 1)/4 = 1, the strips have end points 1 < 2 < 3 < 4 < 5. Lettingf(x) = e−

√x, we see that

I ≈ T (1) =1

2

(

f(1) + 2f(2) + 2f(3) + 2f(4) + f(5))

= 0·793 (3 s.f.).

This is reasonably close to the actual value of I, which is 0·780 to 3 s.f.

Successive bisection

Often we do not know how many strips to use at first, so we repeat the process,increasing the number of strips until two successive answers agree to the requiredaccuracy. We can save a great deal of work if we successively halve the width (i.e.,double the number of strips by splitting each strip in half) since if n is even then

T (h) =h

2

(

f(x0) + 2f(x1) + 2f(x2) + · · ·+ 2f(xn−1) + f(xn))

=h

2

(

2f(x1) + 2f(x3) + · · ·+ 2f(xn−1))

+h

2

(

f(x0) + 2f(x2) + 2f(x4) + · · ·+ 2f(xn−2) + f(xn))

= h(

f(x1) + f(x3) + · · ·+ f(xn−1))

+1

2T (2h).

To understand this formula, let us see it in action.

49

Example

Use the trapezium rule to estimate I =

∫ 20

4

1

x2 − 1dx accurate to 1 d.p.

If f(x) =1

x2 − 1then, recording our working accurate to 6 decimal places (for later),

1 strip with end points 4 < 20 gives

T (16) =h

2

(

f(x0) + f(x1))

=16

2

(

f(4) + f(20))

= 8(0·066667 + 0·002506) = 0·553383.

2 strips with end points 4 < 12 < 20 give

T (8) = 8f(12) +1

2T (16) = 8(0·006993) + 1

20·553383 = 0·332636.

(To get this figure, we add h = 8 times the new ordinate, f(12), to half the previousanswer, T (16).)

4 strips with end points 4 < 8 < 12 < 16 < 20 give

T (4) = 4(

f(8) + f(16))

+1

2T (8)

= 4(

0·015873 + 0·003922) + 1

20·332636 = 0·245496.

(Again, we add h = 4 times the sum of the new ordinates, f(8) and f(16), to halfthe previous answer, T (8).)

8 strips with end points 4 < 6 < 8 < 10 < 12 < 14 < 16 < 18 < 20 give

T (2) = 2(

f(6) + f(10) + f(14) + f(18))

+1

2T (4)

= 2(

0·028571 + 0·010101 + 0·005128 + 0·003096)

+1

20·245496

= 0·216541.(Once more, we add h = 2 times the sum of the new ordinates to half the previousanswer.) As T (4) and T (2) agree sufficiently, I = 0·2 to 1 d.p.

In this case, we can find I exactly. Since x2 − 1 = (x − 1)(x + 1), it is an exerciseusing partial fractions to show that

∫ 20

4

1

x2 − 1dx =

1

2

[

lnx− 1

x+ 1

]20

4

=1

2ln

95

63= 0·205371 (6 d.p.).

50

The error

If we tabulate the estimates T (h) and the errors

ǫT (h) = T (h)−∫ 20

4

dx

x2 − 1= T (h)− 1

2ln

95

63

then, extending the working above to give two further estimates, we obtain Table 1.

h T (h) ǫT (h)16 0·553383 0·3480128 0·332636 0·1272654 0·245496 0·0401252 0·216541 0·0111701 0·208271 0·0029000·5 0·206104 0·000733

Table 1: Error in the trapezium-rule approximation to

∫ 20

4

1

x2 − 1dx

The error decreases approximately by a factor of 4 as the width is halved, and thisbehaviour is typical for the trapezium rule. The error is roughly a constant times h2

for small values of h: this is written as

T (h) =

∫ b

a

f(x) dx+O(h2),

where the notation O(h2) represents a function which is ‘about the same order ofmagnitude’ as h2 for small h. It is possible to do better by approximating the curvemore closely; this brings us to Simpson’s rule.

3.2 Simpson’s rule

The trapezium rule approximates the area under a curve by replacing that curvewith a number of straight-line segments, which are curves of the form y = mx + c.Simpson’s rule improves on this by using quadratic curves, i.e., curves of the formy = ax2 + bx + c. A quadratic curve is determined by three points: for Simpson’srule, the first curve is chosen to go through

(

x0, f(x0))

,(

x1, f(x1))

and(

x2, f(x2))

,the second through

(

x2, f(x2))

,(

x3, f(x3))

and(

x4, f(x4))

, and so on. In order forthis to work, the area must be divided into an even number of strips.

The derivation of Simpson’s rule is not required for this course. For the interestedreader, we include an explanation in Appendix A.

51

0

0.5

1

1.5

2

2.5

3

3.5

1 2 3 4

Figure 12: Simpson’s rule

Simpson’s rule states that

∫ b

a

f(x) dx ≈ h

3

(

f(x0) + 4f(x1) + 2f(x2) + 4f(x3) + 2f(x4)

+ · · ·+ 2f(xn−2) + 4f(xn−1) + f(xn))

,

where n is an even number and h is the width of each strip. A shorter form is oftenused:

∫ b

a

f(x) dx ≈ S(h) =h

3(F + L+ 4O + 2E),

where F and L are the first and last ordinates, O is the sum of the odd-positionordinates, (f(x1), f(x3) et cetera) and E is the sum of the even-position ordinates(f(x2), f(x4), et cetera). For comparison, the trapezium rule has the form

∫ b

a

f(x) dx ≈ T (h) =h

2(F + L+ 2O + 2E).

52

Example

Use Simpson’s rule with 2, 4 and 8 strips to estimate

∫ 20

4

1

x2 − 1dx. Work to 6 d.p.

2 strips with h = (20− 4)/2 = 8 and f(x) =1

x2 − 1give

S(8) =8

3

(

f(4)+f(20)+4f(12))

=8

3

(

0·066667+0·002506+4(0·006993))

= 0·259053,

4 strips with h = (20− 4)/4 = 4 give

S(4) =4

3

(

f(4) + f(20) + 4(

f(8) + f(16))

+ 2f(12))

=4

3

(

0·066667 + 0·002506 + 4(0·015873 + 0·003922) + 2(0·006993))

= 0·216450

and 8 strips with h = (20− 4)/8 = 2 give

S(2)

=2

3

(

f(4) + f(20) + 4(

f(6) + f(10) + f(14) + f(18))

+ 2(

f(8) + f(12) + f(16))

)

=2

3

(

0·066667 + 0·002506 + 4(0·028571 + 0·010101 + 0·005128 + 0·003096)+ 2(0·015873 + 0·006993 + 0·003922)

)

= 0·206890.

Hence I ≈ 0·2 to 1 d.p. Adding these (and two further calculations) to our previous

table yields Table 2, where the error ǫS(h) = S(h)−∫ 20

4

dx

x2 − 1.

h T (h) ǫT (h) S(h) ǫS(h)16 0·553383 0·3480128 0·332636 0·127265 0·259053 0·0536824 0·245496 0·040125 0·216450 0·0110792 0·216541 0·011170 0·206890 0·0015191 0·208271 0·002900 0·205514 0·0001430·5 0·206104 0·000733 0·205382 0·000010

Table 2: Approximation errors for the trapezium and Simpson’s rules

53

As the strip widths are halved, the errors are (approximately) divided by 16, i.e., theerror term is O(h4). This is how the error behaves for Simpson’s rule:

∫ b

a

f(x) dx = S(h) +O(h4).

It becomes accurate much more rapidly than is the case with the trapezium rule.

Example

Find I =

∫ 4

2

1√1 + x3

dx using Simpson’s rule with 4 and 8 strips. Work to 5 d.p.

and state your answer with the appropriate degree of accuracy.

x f(x) 4 strips 8 strips2·00 0·33333 F F2·25 0·28409 − O2·50 0·24526 O E2·75 0·21419 − O3·00 0·18898 E E3·25 0·16824 − O3·50 0·15097 O E3·75 0·13642 − O4·00 0·12403 L L

Table 3: Values of f(x) = (1 + x3)−1/2 to 5 s.f.

4 strips with h = (4− 2)/4 = 0·5 give

S(0·5) = 0·53

(

f(2) + f(4) + 4(

f(2·5) + f(3·5))

+ 2f(3))

=0·53

(

0·33333 + 0·12403 + 4(0·24526 + 0·15097) + 2(0·18898))

= 0·40337

to 5 d.p., whereas 8 strips with h = (4− 2)/8 = 0·25 give

S(0·25) = 0·253

(

f(2) + f(4) + 4(

f(2·25) + f(2·75) + f(3·25) + f(3·75))

+ 2(

f(2·5) + f(3) + f(3·5)))

=0·253

(

0·33333 + 0·12403 + 4(0·28409 + 0·21419 + 0·16824 + 0·13642)+ 2(0·24526 + 0·18898 + 0·15097)

)

= 0·40330 to 5 d.p.

The two answers agree to 3 significant figures, so I = 0·403 (3 s.f.).

54

3.3 The Newton-Raphson method

This is an iterative method of finding a root of the equation f(x) = 0. ‘Iterative’means that we start with an initial approximation x0, then use x0 to find our nextapproximation x1, which is hopefully more accurate. We then use x1 to find x2 andso on.

Consider the graph of y = f(x) shown in Figure 3. The x value at the point A wherethe graph crosses the x-axis gives the solution to the equation f(x) = 0

–1

0

1

2

3

4

–1 1 2 3 4

x

y

A

P

Q R

Figure 13: The Newton-Raphson method

If P =(

x0, f(x0))

is a point on the curve y = f(x) near to A, then x = x0 is anapproximate value for the root of f(x) = 0. The error in the approximation is givenby the distance AR, where R = (x0, 0).

Now if PQ is the tangent to the curve at P , crossing the x-axis at Q = (x1, 0), thenx = x1 is the next approximation to the root required.

55

The gradient of the tangent is f ′(x0) =PR

QRand f(x0) = PR, so

QR =PR

f ′(x0)=f(x0)

f ′(x0)and x1 = x0 −QR = x0 −

f(x0)

f ′(x0).

In general,

xn+1 = xn −f(xn)

f ′(xn);

this is the Newton-Raphson method.

As with the trapezium rule and Simpson’s rule, we proceed with the iteration untiltwo consecutive values agree to our desired level of accuracy.

Examples

1. Find, accurate to 4 decimal places, the root of f(x) = 2x3 − 7x+ 2 close to 0,starting with x0 = 0.

As f ′(x) = 6x2−7, recording the values to 5 decimal places gives the following.

x1 = x0 −f(x0)

f ′(x0)= 0− 2

−7= 0·28571

x2 = x1 −f(x1)

f ′(x1)= 0·28571− 0·04665

−6·51020 = 0·29288

x3 = x2 −f(x2)

f ′(x2)= 0·29288− 0·00009

−6·48533 = 0·29289

The last two values agree to 4 decimal places., so the root required is x = 0·2929to 4 d.p. (If one more iteration is completed, x3 and x4 agree to 9 d.p.)

Note that the value of f ′(x) is very large compared to the value of f(x) for eachiteration. Even for the first iteration, f ′(x) is more than 3 times the value off(x). This makes the ‘correction term’ f(xn)/f

′(xn) become small very quickly,giving rapid convergence to the root.

If the value of f ′(x) is not large in comparison with f(x), convergence to theroot can be much slower or, in certain circumstances, the successive valuescan diverge. There is a technique for dealing with this problem which will bedescribed in Section 4.3, after we have introduced Taylor series.

56

2. Find the root of sinh x = x+ 1 to 4 s.f.

The equation has been given in the form sinh x = x+1. The Newton-Raphsonmethod requires the form f(x) = 0, so rearranging we have

f(x) = sinh x− x− 1 and f ′(x) = cosh x− 1.

If we choose the starting point x0 = 0 then f(x0) = −1 and f ′(x0) = 0, so theNewton-Raphson formula makes no sense. When this happens, we start againwith a better choice of x0. From the sketch, x0 = 2 is reasonable.

–6

–4

–2

0

2

4

6

–3 –2 –1 1 2 3

x

y

y = x+ 1

y = sinh x

Figure 14: The graphs of y = sinh x and y = x+ 1

With x0 = 2 and working to 5 decimal places, we obtain the following.

x1 = x0 −f(x0)

f ′(x0)= 2− 0·62686

2·76220 = 1·77306

x2 = x1 −f(x1)

f ′(x1)= 1·77306− 0·08646

2·02933 = 1·73045

x3 = x2 −f(x2)

f ′(x2)= 1·73045− 0·00254

1·91020 = 1·72913

x4 = x3 −f(x3)

f ′(x3)= 1·72913− 0·00002

1·90659 = 1·72912

The last two values agree to at least 4 s.f., so the root is x = 1·729 to 4 s.f.

57

4 Further topics

4.1 Taylor’s theorem

If the function f(x) is well behaved near a then, when h is small, the quantity f(a+h)can be expressed as a power series, by writing

f(a+ h) = A0 + A1h+ A2h2 + · · · =

∞∑

n=0

Anhn,

where the coefficients A0, A1 et cetera are determined by the function f(x).

Taking h = 0 on both sides shows that A0 = f(a).

To find the other coefficients we differentiate f(a+h) with respect to h and set h = 0:

f ′(a+ h) = A1 + 2A2h + 3A3h2 + · · · ,

so putting h = 0 gives f ′(a) = A1. Continuing to differentiate, we have

f ′′(a + h) = 2A2 + (3× 2)A3h + (4× 3)A4h2 + · · · ,

so putting h = 0 gives f ′′(a) = 2A2 and therefore A2 =f ′′(a)

2.

In general, we have

An =f (n)(a)

n!,

where f (n) is the nth derivative of f(x) and n! = n(n−1)(n−2) · · · (2)(1) is n factorial.

The series

f(a+ x) = f(a) + f ′(a)x+f ′′(a)

2!x2 +

f ′′′(a)

3!+ · · · =

∞∑

n=0

f (n)(a)

n!xn

is called the Taylor-series expansion of f(x) about x = a.

The corresponding series with a = 0 is

f(x) = f(0) + f ′(0)x+f ′′(0)

2!x2 +

f ′′′(0)

3!x3 + · · · =

∞∑

n=0

f (n)(0)

n!xn,

and this is the Maclaurin-series expansion of f(x).

These series expansions are usually valid wherever the series converges, but notalways – we will not encounter any such difficulty in this course.

58

Examples

1. Find the Maclaurin series for f(x) = ex.

Since f ′(x) = ex = f(x), it follows that f (n)(x) = ex for n = 1, 2, 3, . . .. Hencef (n)(0) = e0 = 1 for all n and the required series is

ex = 1 + x+x2

2!+x3

3!+ · · · .

2. Find the Maclaurin series for f(x) = sin x.

Here f ′(x) = cosx, f ′′(x) = − sin x, f ′′′(x) = − cosx, f (iv)(x) = sin x et cetera.When x = 0, sin x = 0 and cos x = 1, so

sin x = 0 + x+ 0− x3

3!+ 0 +

x5

5!+ · · · = x− x3

3!+x5

5!− · · · .

(The particularly alert reader will realise the previous two answers are basedon circular reasoning, since we used the Maclaurin series of ex and sin x to findthese derivatives in the first place. Rest assured that it is possible to avoid thiscircularity, and there is nothing to worry about.)

These two series are valid for any choice of x, but this is not true for allfunctions.

3. If f(x) =1

1− xthen, by the linear-composite rule,

f ′(x) =1

(1− x)2, f ′′(x) =

2

(1− x)3

and, in general, f (n)(x) =n!

(1− x)n+1. The Maclaurin series is therefore

1

1− x= 1 + x+ x2 + x3 + · · ·

and we have recovered the familiar geometric series. This expansion is validfor |x| < 1 but not for |x| > 1.

In the examples above, finding f (n)(0) did not pose much of a problem. Thismay not always be so straightforward.

59

4. If f(x) =1

1 + x2then f ′(x) = − 2x

(1 + x2)2.

To find the higher derivatives in this case becomes rather complicated veryquickly – try it and see. Fortunately, there is an easier way: replace x by −x2in the series for 1/(1− x), to get that

1

1 + x2= 1− x2 + x4 − x6 + · · · ,

which is valid whenever | − x2| < 1 or, equivalently, |x| < 1.

5. Find the Maclaurin series for cos(

x/(1 − x2))

, up to and including the termin x4.

Again, to calculate this series by finding derivatives would be a bit complicated.Instead, recall that

cos y = 1− y2

2+y4

24+ · · · ,

where the dots represent terms of degree strictly greater than 4, so lettingy = x/(1− x2) = x+ x3 + · · · gives

cos(

x/(1− x2))

= 1− 1

2(x+ x3 + · · · )2 + 1

24(x+ x3 + · · · )4 + · · ·

= 1− 1

2(x2 + 2x4 + · · · ) + 1

24(x4 + · · · ) + · · ·

= 1− 1

2x2 − 23

24x4 + · · · .

Estimating integrals

The Taylor-series expansion gives us another way of finding approximate values fordefinite integrals. If f(x) has the Taylor series

f(x) = A0 + A1(x− a) + A2(x− a)2 + · · ·about a then f(x) is approximately equal to the polynomial

p(x) = A0 + A1(x− a) + · · ·+ An(x− a)n

for all x close to a. Hence, if c and d are close to a,∫ d

c

f(x) dx ≈∫ d

c

(

A0 + A1(x− a) + · · ·+ An(x− a)n)

dx

=

[

A0(x− a) +A1(x− a)2

2+ · · ·+ An(x− a)n+1

n + 1

]d

c

.

Taking more terms of the Taylor series will give greater accuracy.

60

Examples

1. Find a numerical approximation to

∫ 0·5

0

e−x2/2 dx.

Replacing x by −x2/2 in the Maclaurin series for ex, we see that

e−x2/2 = 1− x2

2+x4

8− x6

48+ · · · .

Hence∫ 0·5

0

e−x2/2 dx ≈∫ 0·5

0

(

1− x2

2+x4

8− x6

48

)

dx

=

[

x− x3

6+x5

40− x7

336

]0·5

0

= 0·479925 (6 d.p.).

(The true value is 0·479925219 to 9 decimal places.)

2. Find an approximate value for

∫ 4

3

1

1 + x3dx.

A little care must be taken here; the expansion of the integrand must be valid.Although

1

1 + x3= 1− x3 + x6 − x9 + · · · ,

this holds only when |x| < 1. Instead, we re-write the integrand: note that

∫ 4

3

1

1 + x3dx =

∫ 4

3

x−3

x−3 + 1dx =

∫ 4

3

x−3(

1− x−3 + x−6 + · · ·)

dx

≈∫ 4

3

(

x−3 − x−6 + x−9)

dx

=

[

−x−2

2+x−5

5− x−8

8

]4

3

= 0·02369 (5 d.p.).

(The correct value is 0·023694490 to 9 decimal places.)

61

4.2 L’Hopital’s rule

Let f(x) and g(x) be continuous functions. If g(a) 6= 0 then, by the rules for limits,

limx→a

f(x)

g(x)=f(a)

g(a).

If g(a) = 0 but f(a) 6= 0 then∣

∣

∣

∣

f(x)

g(x)

∣

∣

∣

∣

→ ∞ as x→ a,

since |f(x)| → |f(a)| but |1/g(x)| = 1/|g(x)| is getting larger and larger as x → a.This leaves the case where f(a) = g(a) = 0. In this situation, different possibilitiesarise. For example, as x→ 0,

sin x

x→ 1,

sin2 x

x→ 0 and

∣

∣

∣

sin x

x2

∣

∣

∣→ ∞.

In general, L’Hopital’s rule lets us find such a limit.

Put x = a+ h, so that x→ a as h→ 0. Using Taylor series,

f(x)

g(x)=f(a+ h)

g(a+ h)=f(a) + f ′(a)h+ f ′′(a)

2!h2 + · · ·

g(a) + g′(a)h+ g′′(a)2!h2 + · · ·

.

If f(a) = g(a) = 0 then dividing the numerator and denominator by h shows that

f(x)

g(x)=f ′(a) + f ′′(a)

2!h+ · · ·

g′(a) + g′′(a)2!h+ · · ·

→ f ′(a)

g′(a)

as h→ 0, as long as g′(a) 6= 0. Hence

limx→a

f(x)

g(x)=f ′(a)

g′(a)if f(a) = g(a) = 0 and g′(a) 6= 0.

What if g(a) = 0 and g′(a) = 0? The working above shows that this situation iscontrolled by the second derivatives: the limit

limx→a

f(x)

g(x)=f ′′(a)

g′′(a)if f(a) = g(a) = 0, f ′(a) = g′(a) = 0 and g′′(a) 6= 0.

Similarly, if f ′′(a) = g′′(a) = 0 but g′′′(a) 6= 0 then the method can be repeated, andso on. Summarising, we have the following general rule.

62

L’Hopital’s rule

If f(x) and g(x) are functions such that

f(a) = g(a) = · · · = f (n−1)(a) = g(n−1)(a) = 0

but g(n)(a) 6= 0 then

limx→a

f(x)

g(x)= lim

x→a

f ′(x)

g′(x)= · · · = lim

x→a

f (n−1)(x)

g(n−1)(x)=f (n)(a)

g(n)(a).

Examples

1. Find limx→0

sin x

x.

If f(x) = sin x and g(x) = x then f(0) = sin 0 = 0 and g(0) = 0. However,f ′(x) = cos x and g′(x) = 1, so f ′(0) = cos 1 and g′(0) = 1. Hence L’Hopital’srule gives that

limx→0

sin x

x=f ′(0)

g′(0)=

1

1= 1.

(L’Hopital’s rule is not really needed here: note that

limx→0

sin x

x= lim

x→0

f(x)− f(0)

x− 0= f ′(0) = 1,

by the definition of f ′(0).)

2. Find limx→0

3x− sin x

x.

If f(x) = 3x − sin x and g(x) = x then f(0) = 0 and g(0) = 0, but f ′(x) =3− cos x and g′(x) = 1. Hence L’Hopital’s rule applies and

limx→0

3x− sin x

x=f ′(0)

g′(0)=

3− 1

1= 2.

(Again, L’Hopital’s rule isn’t really needed here: by the rules for limits,

limx→0

3x− sin x

x= lim

x→0

3x

x− lim

x→0

sin x

x= 3− 1 = 2,

as expected.)

63

3. Find limx→0

√1 + x− 1− 1

2x

x2.

Let f(x) =√1 + x− 1− 1

2x and g(x) = x2. Then f(0) = g(0) = 0, but

f ′(x) =1

2(1 + x)−1/2 − 1

2and g′(x) = 2x,

so f ′(0) = g′(0) = 0 as well.

Differentiating again gives f ′′(x) = −1

4(1 + x)−3/2 and g′′(x) = 2, so

limx→0

f(x)

g(x)=f ′′(0)

g′′(0)=

−14

2= −1

8.

(We could also find this limit by using the binomial theorem to write down thefirst few terms of the Maclaurin series for (1 + x)1/2.)

4. Find limx→π/2

cosx

cos 3x.

Let f(x) = cosx and g(x) = cos 3x; note that f(0) = g(0) = 0. However,f ′(x) = − sin x and g′(x) = −3 sin 3x, so

limx→π/2

cosx

cos 3x=f ′(0)

g′(0)=

−1

+3= −1

3.

(Here, L’Hopital’s rule really is the best way of finding the desired limit.)

4.3 The modified Newton-Raphson method

The following material is worth noting for future reference and is included here forinterest only. It is another application of Taylor-series expansion.

Consider the equation f(x) = 0. If x = x0 is an approximate root and x = x0 + h isan exact root then, using a Taylor series, we have

0 = f(x0 + h) = f(x0) + hf ′(x0) +h2

2!f ′′(x0) + · · · .

Considering the first two terms gives 0 ≈ f(x0) + hf ′(x0), so h ≈ − f(x0)

f ′(x0). Hence

x1 = x0 −f(x0)

f ′(x0)

should be a more accurate approximation to x0 + h than x0 is; this is the standardNewton-Raphson method.

64

If f ′(x0), the slope of the tangent to the curve y = f(x) at x = x0, is small thenthe approximate value of h may be large and the second approximation x1 may befurther away from the exact root than x0. In other words, if f ′(x0) is small then theNewton-Raphson method as it stands may fail to converge. (See Figure 15.)

–2

–1

0

1

2

–1 1 2 3x

y

y = f(x)x0

Figure 15: A function with a shallow gradient near a root

For a more accurate first step in this case, we consider the first three terms in theTaylor series, so that

0 ≈ f(x0) + hf ′(x0) +h2

2!f ′′(x0),

which is a quadratic equation in h. However, as f ′(x0) is very small (or we wouldnot be doing this working) we may neglect the h term and

0 ≈ 2f(x0) + h2f ′′(x0) ⇐⇒ h ≈ ±√

−2f(x0)

f ′′(x0).

Hence we should take

x1 = x0 ±√

−2f(x0)

f ′′(x0)

in this case, where either sign may be chosen. After this step we revert to thestandard formula,

xn+1 = xn −f(xn)

f ′(xn).

65

A The derivation of Simpson’s rule

0

1

2

3

4

5

6

1 2 3 4 5 6 7 8x

y

Figure 16: Estimation of the area under a graph

Consider the diagram above. If the region between the x-axis, the curve y = f(x)and the lines x = a and x = b is divided into an even number of strips, each ofwidth h, then Simpson’s rule can be used to estimate this area. (In Figure 16 above,the end point a = 1, b = 8 and the width h = 1.) The area of two consecutive stripsis approximated by fitting a quadratic curve, i.e., a curve of the form

y = q(x) = rx2 + sx+ t,

through the three points on the original curve at the end points of those strips: thepoints with ordinates y− = f(xi−h), y0 = f(xi) and y+ = f(xi+h). (In comparison,the trapezium rule fits a straight line between the end points of each strip.) Notethat

y− = r(xi − h)2 + s(xi − h) + t, (i)

y0 = rx2i + sxi + t (ii)

and y+ = r(xi + h)2 + s(xi + h) + t, (iii)

so taking (i)− 2(ii) + (iii) shows that

y− − 2y0 + y+ = r(xi − h)2 + s(xi − h) + t

− 2rx2i − 2sxi − 2t+ r(xi + h)2 + s(xi + h) + t = 2rh2.

66

If Ai is the area under the fitted quadratic between x = xi − h and x = xi + h then

Ai =

∫ xi+h

xi−h

q(x) dx

=

[

rx3

3+sx2

2+ tx

]xi+h

xi−h

=r(xi + h)3

3+s(xi + h)2

2+ t(xi + h)− r(xi − h)3

3− s(xi − h)2

2− t(xi − h)

=6rx2ih + 2rh3

3+

4sxih

2+ 2th

=2rh3

3+ 2(rx2i + sxi + t)h

=h

3

(

y− − 2y0 + y+)

+ 2y0h

=h

3

(

y− + 4y0 + y+)

.

The area of the first pair of strips is, therefore,

A1 =h

3

(

f(x0) + 4f(x1) + f(x2))

;

the area under the quadratic curve fitted to the second pair of strips is A3, theestimated area of the third pair of strips is A5, and so on, where

A3 =h

3

(

f(x2) + 4f(x3) + f(x4))

, A5 =h

3

(

f(x4) + 4f(x5) + f(x6))

et cetera. Hence the estimated total area of the first three pairs of strips is

A1 + A3 + A5 =h

3

(

f(x0) + 4f(x1) + 2f(x2) + 4f(x3) + 2f(x4) + 4f(x5) + f(x6))

.

For a general number of pairs of strips, we have

∫ b

a

f(x) dx ≈ h

3

(

F + L+ 4O + 2E)

,

where F and L are the values of the first and last ordinates f(x0) and f(xn), O isthe sum of all the odd-placed ordinates (f(x1), f(x3) et cetera) and E is the sum ofall the even-placed ordinates (f(x2), f(x4) et cetera).

This is Simpson’s rule.

67

Documents

MATH143 Calculuspower/MATH143notes-1.pdf · Chain, product and quotient rules. Second-order and higher derivatives. Stationary points and their classiﬁcation. Parametric, implicit