Upload
others
View
16
Download
0
Embed Size (px)
Citation preview
Stochastic Calculus and Applications to
Mathematical Finance
by
GREG WHITE
Mihai Stoiciu, Advisor
A thesis submitted in partial fulfillment
of the requirements for the
Degree of Bachelor of Arts with Honors
in Mathematics
WILLIAMS COLLEGE
Williamstown, Massachusetts
May 16, 2012
Abstract
In this paper, we review fundamental probability theory, the theory of stochastic
processes, and Ito calculus. We also study an application of Ito calculus in math-
ematical finance: the Black-Scholes option pricing model for the European call
option. We study the development of the model and the assumptions necessary to
arrive at the Black-Scholes no arbitrage rational price for a European call option.
We supplement the simple Black-Scholes model by relaxing the assumption that
trading can be performed continuously in time, and studying the deviation the
Black-Scholes replicating portfolio exhibits from the self-financing characteristic of
the continuous-time portfolio. We term this deviation the cumulative correction
of the portfolio and explain in detail its construction. We study the cumulative
correction of Black-Scholes portfolios by performing a numerical analysis of the
cumulative correction for outcomes of the stock price stochastic process. While
finding a closed form probability distribution representing the cumulative correction
proves difficult and we do not pursue that route in this paper, the numerical
analysis indicates that the second central moment of the distribution of cumulative
corrections decreases as the number of discrete time steps at which the portfolio is
rebalanced increases. Additionally, we analyze the cumulative correction required
to replicate the European call option for the historical stock price data series of
certain actual stocks, finding examples of a stock that would have required a positive
cumulative correction and a stock that would have required a negative cumulative
correction.
Greg White
Williams College
Williamstown, Massachusetts 01267
1
Acknowledgements
I cannot possibly do justice to the role that Professor Mihai Stoiciu’s advising has
played in shaping this thesis and my study of the topics involved. His insights,
explanations, enthusiasm and patience have made working with him an incredibly
rewarding experience, and without him, this thesis would not have been possible.
Professor Thomas Garrity’s guidance as the second reader of this thesis has also
been invaluable. I am only beginning to understand the wisdom of Professor Gar-
rity’s favorite mathematical principle, that “functions describe the world” (although
my work during this thesis did further that understanding), but his teaching style
and the courses I have taken with Professor Garrity have fundamentally shaped my
experience studying at Williams, for which I am grateful.
The Professors of Williams College have provided an incredible learning environ-
ment during my time here, and because of that, I have gained so much more than
an understanding of academic topics from my time working with them.
I would also like to thank Zhaoning Wang, Williams ’11, whose thesis served as
an excellent learning example of creating a project in LATEX.
Finally, I would like to thank my friends and family, whose generosity, support,
and friendship are vital parts of my life. To say that this thesis would not have
been possible without them is an incredible understatement of the impact they have
on me: without them I would not be who I am today.
2
CONTENTS
Contents
1 Background 6
1.1 Probability spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 The Borel σ-field . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Random variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.1 Discrete random variables . . . . . . . . . . . . . . . . . . . 11
1.3.2 Continuous random variables . . . . . . . . . . . . . . . . . 12
1.3.3 Properties of expected value of random variables . . . . . . . 13
2 Convergence of Random Variables 16
2.1 Definitions of modes of convergence . . . . . . . . . . . . . . . . . . 16
3 Characteristic Functions and Applications 17
3.1 Characteristic functions . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2 Limit theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
4 The Multivariate Normal Distribution 20
4.1 Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.2 Change of variables . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
5 Stochastic Processes 24
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.2 Wiener process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
6 Ito Calculus 25
6.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
6.2 Ito integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
6.2.1 Motivation for Ito integral . . . . . . . . . . . . . . . . . . . 25
6.2.2 Characteristics of integrator Ws . . . . . . . . . . . . . . . . 26
6.2.3 Conditions for integrand ψs . . . . . . . . . . . . . . . . . . 27
6.2.4 Defining
∫ ∞0
ψsdWs for random step functions ψs . . . . . . 28
6.2.5 Sequences of predictable step functions approaching ψ . . . 32
6.2.6 Existence and uniqueness of limit Ito integral I(ψ) . . . . . 35
6.2.7 Example:
∫ t
0
WsdWs . . . . . . . . . . . . . . . . . . . . . . 36
6.3 The Ito Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3
CONTENTS
6.3.1 Key idea of the Ito Lemma: (dWt)2 m.s.−−→ dt as dt→ 0 . . . . 39
6.3.2 The Ito Lemma . . . . . . . . . . . . . . . . . . . . . . . . . 39
6.3.3 Ito processes . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
6.3.4 Geometric Brownian motion . . . . . . . . . . . . . . . . . . 42
7 Application: Black-Scholes Option Pricing Model 42
7.1 Black-Scholes option pricing model . . . . . . . . . . . . . . . . . . 42
7.1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
7.1.2 Set-up of the model . . . . . . . . . . . . . . . . . . . . . . . 43
7.1.3 European call option . . . . . . . . . . . . . . . . . . . . . . 45
7.1.4 Assumptions of the model . . . . . . . . . . . . . . . . . . . 51
7.2 Approximating the Black-Scholes portfolio at discrete time steps . . 54
7.2.1 Introduction and motivation . . . . . . . . . . . . . . . . . . 54
7.2.2 Deviation of a discrete-time portfolio from the self-financing
property: cumulative correction . . . . . . . . . . . . . . . . 54
7.2.3 Cumulative correction of sample paths of St . . . . . . . . . 56
7.2.4 Numerical analysis of cumulative correction over several sam-
ple paths . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
7.2.5 Cumulative correction of portfolios based on historical stock
price data . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
7.2.6 Summary of study of cumulative correction . . . . . . . . . . 68
8 References 69
A Appendix: Mathematica Models 70
A.1 European call option for GBM sample path . . . . . . . . . . . . . 70
A.2 Numerical analysis of cumulative correction . . . . . . . . . . . . . 72
A.3 European call option for historical data . . . . . . . . . . . . . . . . 74
4
LIST OF TABLES
List of Tables
1 Important discrete random variables . . . . . . . . . . . . . . . . . 15
2 Important continuous random variables . . . . . . . . . . . . . . . . 15
5
1 BACKGROUND
1 Background
This section is intended to provide the foundation for later work in the paper. We
define probability spaces and random variables and outline important properties
that we will use to investigate topics later in the paper.
1.1 Probability spaces
Definition 1.1.1. The sample space Ω of an experiment is the set of all possible
outcomes.
Definition 1.1.2. A σ-field F on a set S is a collection of subsets of S that satisfies
the following properties:
1. ∅ ∈ F ,
2. A1, A2, . . . ∈ F ⇒⋃∞i=1Ai ∈ F ,
3. A ∈ F ⇒ Ac ∈ F .
For the remainder of this section, F denotes a σ-field on Ω.
Definition 1.1.3. An event A is a subset of Ω with A ∈ F . We denote the
complement of a subset A of Ω as Ac.
Definition 1.1.4. A probability measure P on (Ω,F) is a function
P : F → [0, 1],
such that:
1. P(∅) = 0 and P(Ω) = 1,
2. If A1, A2, . . . satisfy Ai ∩ Aj = ∅ ∀ i 6= j, then
P
(∞⋃i=1
Ai
)=∞∑i=1
P(Ai).
Definition 1.1.5. A probability space is a triple (Ω,F ,P), where Ω is a sample
space, F is a σ-field on Ω, and P is a probability measure on (Ω,F).
6
1 BACKGROUND
Definition 1.1.6. An event A is a null event if P(A) = 0.
Definition 1.1.7. A probability space (Ω,F ,P) is complete if all subsets of null
events are elements of F .
Definition 1.1.8. A family of events Ai : i ∈ I is independent if
P
(⋂i∈J
Ai
)=∏i∈J
P(Ai),
for all finite subsets J of I.
Definition 1.1.9. If P(B) > 0 then the conditional probability that A occurs
given that B occurs is:
P(A|B) =P(A ∩B)
P(B).
Definition 1.1.10. A family B1, B2, . . . , Bn of events is a partition of Ω if:
Bi ∩Bj = ∅ when i 6= j andn⋃i=1
Bi = Ω.
Lemma 1.1.11. If B1, B2, . . . , Bn is a partition of Ω such that P(Bi) > 0 ∀ i, then
P(A) =n∑i=1
P(A|Bi)P(Bi).
Proof. A =⋃ni=1(A ∩Bi). This is a disjoint countable union, so
P(A) =n∑i=1
P(A ∩Bi)
=n∑i=1
P(A|Bi)P(Bi) by Definition 1.1.9.
1.2 The Borel σ-field
Lemma 1.2.1. If Fi : i ∈ I is a family of σ-fields of subsets of Ω, then G =⋂i∈I Fi is a σ-field as well.
7
1 BACKGROUND
Proof. We see that G satisfies the following:
1. Since ∅ ∈ Fi ∀ i ∈ I, ∅ ∈ G,
2. Suppose A1, A2, . . . ∈ G. Then ∀ i ∈ I,⋃∞i=1Ai ∈ Fi. Therefore
⋃∞i=1Ai ∈ G.
3. Suppose A ∈ G. Then A,Ac ∈ Fi ∀ i ∈ I. Therefore Ac ∈ G.
Definition 1.2.2. Let J be the collection of all open intervals of R. Let Bi : i ∈ Ibe the collection of σ-fields that contain J . Then the Borel σ-field B is defined
as B =⋂∞i=1 Bi. It is the unique smallest σ-field that contains J . Elements of B
are called Borel sets on R.
Note that B is a σ-field by Lemma 1.2.1. When we study functions of
random variables, it will be important to understand which functions of random
variables are themselves random variables. We will see later that functions that are
B-measurable will satisfy this criteria.
Definition 1.2.3. A function g : R→ R is Borel-measurable (B-measurable)
if we have, for all B ∈ B, g−1(B) ∈ B.
1.3 Random variables
Definition 1.3.1. A random variable X is a function X : Ω→ R such that for
every x ∈ R, ω ∈ Ω : X(ω) ≤ x ∈ F (the function is F-measurable).
Theorem 1.3.2. A function X : Ω→ R has the property that for all x ∈ R, ω ∈Ω : X(ω) ≤ x ∈ F (X is F-measurable) if and only if for all B ∈ B, ω ∈ Ω :
X(ω) ∈ B ∈ F .
Proof. (⇒) Suppose X is F -measurable. Let C = C ⊆ R : X−1(C) ∈ F . We will
show that C is a σ-field of R. We know that ∅ ∈ C because X−1(∅) = ∅ ∈ F .
Suppose C1, C2, . . . ∈ C. Then
X−1
(∞⋃n=1
Cn
)=∞⋃n=1
X−1(Cn) ∈ F .
8
1 BACKGROUND
Therefore,⋃∞n=1 Cn ∈ C. Now let A ∈ C. We see that X−1(Ac) = [X−1(A)]
c, so
Ac ∈ C. Thus C is a σ-field on R. Note that for all x ∈ R, we have (−∞, x] ∈ C.
We know that B is the smallest σ-field on R that contains all open intervals of R by
construction. We now need to show that this is equivalent to the smallest σ-field
that contains (−∞, x] ∀ x ∈ R. We need to show that a σ-field on R contains all
open intervals on R if and only if it contains (−∞, x] ∀ x ∈ R. Let D be a σ-field
on R.
(⇒) Suppose D contains all open intervals I ⊆ R.
Then for any x ∈ R, let Di = (−∞, x+ 1i). Since D is a σ-field we have
(−∞, x] =∞⋂i=1
Di =
(∞⋃i=1
Dci
)c
∈ D.
(⇐) Suppose D contains (−∞, x] ∀ x ∈ R.
We need to show that for all I ⊆ R, I ∈ D. We have the following 3 cases:
Case 1: I = (−∞, b) for some b ∈ R.
Let Ai = (−∞, b− 1i]. Then each Ai ∈ D, so
(−∞, b) =∞⋃i=1
Ai ∈ D.
Case 2: I = (a,∞) for some a ∈ R.
We know (−∞, a] ∈ D, so
(a,∞) = (−∞, a]c ∈ D.
Case 3: I = (a, b) for some a, b ∈ R with a < b.
We know (−∞, a] ∈ D and [b,∞) = (−∞, b)c ∈ D by Case 1. Then
(a, b) =(
(−∞, a]⋃
[b,−∞))c∈ D.
Then B is the smallest σ-field that contains (−∞, x] ∀ x ∈ R. Therefore
B ⊆ C, so for all B ∈ B, we have that X−1(B) ∈ F . Then
ω ∈ Ω : X(ω) ∈ B ∈ F .
9
1 BACKGROUND
(⇐) Suppose we have that ∀ B ∈ B, ω ∈ Ω : X(ω) ∈ B ∈ F . We know that ∀ x ∈R, (x,∞) ∈ B, so (−∞, x] = (x,∞)c ∈ B. Then ∀ x ∈ R, ω ∈ Ω : X(ω) ≤ x ∈F .
Definition 1.3.3. Every random variable X has a distribution function F :
R→ [0, 1] defined by F (x) = P(X ≤ x).
Given a random variable X, we can associate events A(x) ⊆ Ω with our
random variable by letting A(x) = ω ∈ Ω : X(ω) ≤ x. We will henceforth denote
A(x) as X ≤ x.
Definition 1.3.4. Random variables X and Y are independent if X ≤ x and
Y ≤ y are independent events.
We will often concern ourselves with functions of random variables. To do
so, we will define the conditions for which a function of a random variable is itself
a random variable.
Theorem 1.3.5. If X is a random variable and we have a function g : R → R,
then g(X) : Ω → R, defined by g(X)(ω) = g(X(ω)) is also a random variable if g
is B-measurable.
Proof. g(X) clearly maps Ω → R, so we only need to show that g(X) is F -
measurable. Suppose g is B-measurable. Then ∀ B ∈ B, g−1(B) ∈ B. By Theo-
rem 1.3.2, sinceX is F -measurable, we also have that ∀ B ∈ B, ω ∈ Ω : X(ω) ∈ B ∈F . Then we have ∀ B ∈ B:
ω ∈ Ω : g(X)(ω) ∈ B = ω ∈ Ω : X(ω) ∈ g−1(B) ∈ F .
Therefore, by Theorem 1.3.2, g(X) : Ω→ R is F -measurable.
Theorem 1.3.6. If random variables X and Y are independent, and g, h : R →R are B-measurable functions, then g(X) and h(Y ) are also independent random
variables.
10
1 BACKGROUND
Proof. We know that g(X) and h(Y ) are random variables by Theorem 1.3.5. Then:
P(g(X) ≤ x, h(Y ) ≤ y) = P(g(X) ∈ (−∞, x], h(Y ) ∈ (−∞, y])
= P(X ∈ g−1((−∞, x]), Y ∈ h−1((−∞, y]))
= P(X ∈ g−1((−∞, x]))P(Y ∈ h−1((−∞, y]))
= P(g(X) ∈ (−∞, x])P(h(Y ) ∈ (−∞, y])
= P(g(X) ≤ x)P(h(Y ) ≤ y).
We will also need to be able to understand multivariate random variables,
so we need to define the joint distribution function of a vector of random variables.
Definition 1.3.7. Suppose we have a vector of random variables X = (X1, X2, . . . , Xn)
on (Ω,F ,P). Then the joint distribution function of X is the function FX :
Rn → [0, 1] given by:
F (x) = P (X1 ≤ x1, X2 ≤ x2, . . . , Xn ≤ xn) .
1.3.1 Discrete random variables
Definition 1.3.8. A discrete random variable is a random variable that only
takes values in some countable subset x1, x2, . . . of R.
Definition 1.3.9. The (probability) mass function of a discrete random vari-
able f : R→ [0, 1] is defined by f(x) = P(X = x).
Definition 1.3.10. The expected value of a discrete random variable X with
mass function f is defined as:
E(X) =∑
x:f(x)>0
xf(x),
as long as this sum converges absolutely.
We now introduce indicator functions, a specific type of Bernoulli random
variable. Indicator functions will be useful in many ways, including allowing us to
understand properties of the expectation operator that will make it easier to work
with.
11
1 BACKGROUND
Definition 1.3.11. Let A be an event. Let the indicator function of A, IA :
Ω→ R be defined as follows:
IA(ω) =
1 ω ∈ A0 ω ∈ Ac
.
Then IA is a Bernoulli random variable with probability mass function:
f(x) =
P(A) x = 1
1− P(A) x = 0.
Definition 1.3.12. Suppose we have a vector of random variables X = (X1, X2, . . . , Xn)
on (Ω,F ,P). X is jointly discrete if it only takes values in some countable subset
of Rn. X has the joint mass function f : Rn → [0, 1] defined by:
f(x1, x2, . . . , xn) = P(X1 = x1, X2 = x2, . . . , Xn = xn).
1.3.2 Continuous random variables
Definition 1.3.13. A random variable X is a continuous random variable if
its distribution function F (x) can be expressed as:
F (x) =
∫ x
−∞f(u)du,
for some integrable f : R → [0,∞). The function f is called the (probability)
density function of X.
Definition 1.3.14. The expected value of a continuous random variable X with
probability density function f is:
E(X) =
∫ ∞−∞
xf(x)dx,
if this integral exists.
Definition 1.3.15. Suppose we have a vector of random variables X = (X1, X2, . . . , Xn)
on (Ω,F ,P). X is jointly continuous if its joint distribution function can be
12
1 BACKGROUND
expressed as
FX(x) =
∫ xn
un=−∞· · ·∫ x2
u2=−∞
∫ x1
u1=−∞f(u1, u2, . . . , un)du1du2 . . . dun,
for some integrable function f : Rn → [0,∞). This function f is called the joint
density function of X.
1.3.3 Properties of expected value of random variables
In this section, we state without proof some characteristics of the expectation
operator E that make it easier to work with.
Theorem 1.3.16. For random variables X and Y , if a, b ∈ R, then:
E(aX + bY ) = aE(X) + bE(Y ).
Theorem 1.3.17. If random variables X and Y are independent, then
E(XY ) = E(X)E(Y ).
When we need to calculate the expectation of a function of a random variable
g(X), rather than attempting to find the probability mass or probability density
function fg(X) of g(X), if we know the probability mass or probability density
function fX of X, we can calculate the expectation of g(X) using the following
lemma.
Lemma 1.3.18. If a discrete random variable X has probability mass function f
and we have a B-measurable function g : R→ R, then
E(g(X)) =∑x
g(x)f(x),
whenever this sum absolutely converges.
If a continuous random variable X has probability density function f and we have
a B-measurable function g : R→ R, then
E(g(X)) =
∫ ∞−∞
g(x)f(x),
13
1 BACKGROUND
whenever this integral exists.
We often describe characteristics of probability distributions in terms of the
moments and central moments of the distribution.
Definition 1.3.19. For k ∈ N, the kth momentmk ofX is defined bymk = E(Xk)
and the kth central moment σk is defined by σk = E((X −m1)k).
The most commonly referenced moments are the mean m1 and the variance
σ2 of a random variable.
14
1B
AC
KG
RO
UN
D
Table 1: Important discrete random variablesName Parameter(s) Distribution function Mass function m1 σ2
Bernoulli 0 ≤ p ≤ 1 F (x) =
0 x < 01− p 0 ≤ x < 11 x ≥ 1
f(x) =
1− p x = 0p x = 10 x /∈ 0, 1
p p(1− p)
Binomialn ∈ N
0 ≤ p ≤ 1F (x) =
∑k∈0,1,...,n,k≤x
f(k) f(x) =
(n
x
)px(1− p)n−x x ∈ 0, 1, . . . , n
0 x /∈ 0, 1, . . . , nnp np(1− p)
Poisson λ > 0 F (x) =∑
k∈0,1,2,...,k≤x
f(k) f(x) =
λx
x!e−λ x ∈ 0, 1, 2, . . .
0 x /∈ 0, 1, 2, . . .λ λ
Geometric 0 < p < 1 F (x) = 1− (1− p)bmaxx,0c f(x) =
p(1− p)x−1 x ∈ N0 x /∈ N
1
p
1− pp2
Table 2: Important continuous random variablesName Parameter(s) Distribution function Density function m1 σ2
Uniform [a, b] ∈ R F (x) =
0 x ≤ ax− ab− a
a < x ≤ b
1 x > b
f(x) =
1
b− ax ∈ [a, b]
0 x /∈ [a, b]
b+ a
2
(b− a)2
12
Exponential λ > 0 F (x) =
0 x < 01− e−λx x ≥ 0
f(x) =
0 x < 0λe−λx x ≥ 0
1
λ
1
λ2
Normalµ ∈ Rσ2 > 0
F (x) = 12
(1 + erf
(x−µ√
2σ2
))f(x) =
1√2πσ2
e−(x−µ)2
2σ2 µ σ2
Gamma λ, t > 0 F (x) =γ(t, λx)
Γ(t)f(x) =
λt
Γ(t)xt−1e−λx x ≥ 0
t
λ
t
λ2
15
2 CONVERGENCE OF RANDOM VARIABLES
2 Convergence of Random Variables
In order to understand stochastic calculus and its applications, we will need to
be able to make statements regarding the convergence of a sequence of random
variables to a limit random variable.
2.1 Definitions of modes of convergence
There are multiple types of convergence a sequence of random variables can exhibit,
and we define them here.
Definition 2.1.1. Let X,X1, X2, . . . be random variables on (Ω,F ,P). Then:
1. Xn → X almost surely, denoted Xna.s.−−→ X, if:
A = ω ∈ Ω : Xn(ω)→ X(ω) as n→∞ ∈ F with P(A) = 1.
2. Xn → X in rth mean, denoted Xnr−→ X, if:
E (|Xrn|) <∞ ∀ n and E (|Xn −X|r)→ 0 as n→∞.
For r = 2, we say that Xn → X in mean square, denoted Xnm.s.−−→ X.
3. Xn → X in probability, denoted XnP−→ X, if:
P (|Xn −X| > ε)→ 0 as n→∞ for all ε > 0.
4. Xn → X in distribution, denoted XnD−→ X, if:
P(Xn ≤ x)→ P(X ≤ x) as n→∞
for all x at which FX(x) = P(X ≤ x) is continuous.
16
3 CHARACTERISTIC FUNCTIONS AND APPLICATIONS
3 Characteristic Functions and Applications
3.1 Characteristic functions
Characteristic functions provide a method of studying functions of random variables
and moments of random variables, and will also provide a method of proving the
law of large numbers and the central limit theorem.
Definition 3.1.1. The characteristic function φX of a random variable X is the
function φX : R→ C defined by:
φX(t) = E(eitX).
Characteristic functions allow independent random variables and linear trans-
formations of random variables to be handled easily, as the next two theorems
demonstrate.
Theorem 3.1.2. If X and Y are independent random variables, then
φX+Y (t) = φX(t)φY (t).
Proof. We have:
φX+Y (t) = E(eit(X+Y ))
= E(eitXeitY )
= E (cos(tX) cos(tY )− sin(tX) sin(tY ) + [cos(tX) sin(tY ) + sin(tX) cos(tY )]i)
= E(cos(tX))E(cos(tY ))− E(sin(tX))E(sin(tY )) +
iE(cos(tX))E(sin(tY )) + iE(sin(tX))E(cos(tY ))
= [E(cos(tX)) + iE(sin(tX))][E(cos(tY )) + iE(sin(tY ))]
= E(cos(tX) + i sin(tX))E(cos(tY ) + i sin(tY ))
= φX(t)φY (t).
Theorem 3.1.3. If a, b ∈ R, and Y = aX + b is a random variable, then
φY (t) = eitbφX(at).
17
3 CHARACTERISTIC FUNCTIONS AND APPLICATIONS
Proof. We have:
φY (t) = E(eit(aX+b))
= E(eitbei(at)X)
= eitbφX(at).
The characteristic function of a random variable reveals a great deal of
information about the moments of the random variable. The following theorem is
quite similar to Taylor’s theorem for a function of a complex variable, and the proof
is omitted here.
Theorem 3.1.4. 1. If φ(k)(0) exists, then
E|Xk| <∞ if k is even,
E|Xk−1| <∞ if k is odd.
2. If E|Xk| <∞, then
φ(t) =k∑j=0
E(Xj)
j!(it)j + o(tk).
We will also state a theorem relating the convergence of distribution func-
tions and characteristic functions of random variables without proof here.
Theorem 3.1.5. Suppose that F1, F2, . . . is a sequence of distribution functions
with corresponding characteristic functions φ1, φ2, . . .. Then:
1. If Fn → F for some distribution function F with characteristic function φ,
then φn(t)→ φ(t) ∀ t.
2. If φ(t) = limn→∞
φn(t) exists and is continuous at t = 0, then φ is the charac-
teristic function of some distribution function F , and Fn → F .
3.2 Limit theorems
With the theory of characteristic functions and convergence of random variables,
we can state and prove two limit theorems regarding sequences of random variables,
the law of large numbers, and the central limit theorem.
18
3 CHARACTERISTIC FUNCTIONS AND APPLICATIONS
Theorem 3.2.1. Let X1, X2, . . . be a sequence of independent, identically dis-
tributed random variables with finite means µ. The partial sums Sn =∑n
k=1Xk
satisfy:Snn
D−→ µ as n→∞.
Proof. Let φ be the characteristic function of each of the Xk, and let φn be the
characteristic function of Snn
.
By Theorem 3.1.2, φ∑Xk = φ(t)n. Then by Theorem 3.1.3, φn =φ( t
n)n
.
By Theorem 3.1.4, φ(t) = 1 + iµt+ o(t). Therefore,
φn(t) =
1 +
itµ
n+ o
(t
n
)n→ eitµ as n→∞.
Note that eitµ is the characteristic function of the constant µ, so by Theorem 3.1.5,Snn
D−→ µ as n→∞.
The law of large numbers asserts that Sn is approximately equal to nµ
for large n. The central limit theorem provides information about the difference
between Sn and nµ.
Theorem 3.2.2. Let X1, X2, . . . be a sequence of independent identically distributed
random variables with finite mean µ and finite non-zero variance σ2, and let Sn =∑nk=1. Then
Sn − nµ√nσ2
D−→ N(0, 1) as n→∞.
Proof. Let Yk = Xk−µσ
, and let φY be the characteristic function of the Yk.
By Theorem 3.1.4, we have:
φY (t) =2∑j=0
E(Y j)
j!(it)j + o(t2)
= 1 + itE(Y )− t2E(Y 2)
2+ o(t2)
= 1 + it(µσ− µ
σ
)− t2
2E(X2 − 2µX + µ2
σ2
)+ o(t2)
= 1− t2
2σ2
(E(X2)− E(X)2
)+ o(t2)
= 1− t2
2+ o(t2).
19
4 THE MULTIVARIATE NORMAL DISTRIBUTION
By Theorem 3.1.2 and Theorem 3.1.3, the characteristic function ψn(t) of
Sn − nµ√nσ2
=1√n
n∑k=1
Yk
satisfies:
ψn(t) =
φY
(t√n
)n=
1− t2
2n+ o
(t2
n
)n→ e−
t2
2 as n→∞.
Because e−t2
2 is the characteristic function of the N(0, 1) distribution, by Theo-
rem 3.1.5, we have that
Sn − nµ√nσ2
D−→ N(0, 1) as n→∞.
4 The Multivariate Normal Distribution
A multivariate distribution that will be of particular interest for the remainder of
this paper is the multivariate normal distribution. This section includes multiple
definitions of the multivariate normal distribution, and some results about the
distribution that make it easier to calculate associated probabilities. Much of this
section is based on [5].
4.1 Definitions
There are multiple definitions of the multivariate normal distribution that will
be useful to us. Before we can define the distribution, however, we must outline
notational preliminaries. We begin by defining covariance and positive definite
symmetric matrices.
Definition 4.1.1. If X and Y are random variables, the covariance of X and Y
20
4 THE MULTIVARIATE NORMAL DISTRIBUTION
is
cov (X, Y ) = E [(X − E(X)) (Y − E(Y ))] .
Definition 4.1.2. Let A be a symmetric real nonsingular matrix. There exists an
orthogonal matrix B such that A = BΛB′, where Λ is the diagonal matrix with
the eigenvalues λ1, λ2, . . . , λn of A on the diagonal. Let
Q(x) = yΛy′ =∑i
λiy2i , where y = xB.
A is a positive definite matrix if Q(x) > 0 for all x having some non-zero
coordinate. We denote this A > 0.
Let
X =
X1
X2
...
Xn
with n ≥ 2
be an n-dimensional random variable. Let µi and σ2i denote the expected value and
variance, respectively, of Xi, and let σij = cov (Xi, Xj). The mean vector µ and
covariance matrix Σ of X are then:
µ =
µ1
µ2
...
µn
,Σ =
σ2
1 σ12 · · · σ1n
σ12 σ22 · · · σ2n
......
. . ....
σ1n σ2n · · · σ2n
.
The first definition of the multivariate normal distribution is based on the joint
probability density function of X.
Definition 4.1.3. An n-dimensional random variable X with mean vector µ and
covariance matrix Σ has the multivariate normal distribution, denotedNn (µ,Σ),
if Σ is a positive definite matrix and the density function of X is given by:
f(x) =1√
(2π)n|Σ|e−
(x−µ)′Σ−1(x−µ)2 , where x ∈ Rn.
It will also be helpful to define the multivariate normal distribution in terms
of a transformation from the standard multivariate normal distribution Nr (0, Ir).
21
4 THE MULTIVARIATE NORMAL DISTRIBUTION
Definition 4.1.4. An n-dimensional random variable X with mean vector µ and
covariance matrix Σ has the multivariate normal distribution, denotedNn (µ,Σ),
if there exists an n× r matrix C with rank r ≤ n such that:
1. CC’ = Σ, and
2. X and CZr + µ, where Zr ∼ Nr (0, Ir), are identically distributed.
4.2 Change of variables
Because it is often easier to calculate probabilities from multivariate normal distri-
butions when the variables are independent, we would like to be able to apply
transformations to vectors of random variables having the multivariate normal
distribution and to understand the resulting distribution. The theory behind these
transformations actually provides a more general framework than simply applying
a transformation that yields independent normally distributed variables, which we
now describe.
We begin by considering the characteristic functions of n-dimensional ran-
dom variables.
Theorem 4.2.1. Let X be an n-dimensional random variable with characteristic
function ψX(t). Let C be any given m × n real matrix, and let b be any m× 1 real
vector. Then the characteristic function of Y = CX + b is given by:
ψY(t) = eit′bψX(C′t).
Proof.
ψY(t) = E(eit′Y)
= E(eit′(CX+b)
)= eit
′bE(ei(C
′t)X).
We would like to find define the characteristic function of the multivariate
normal variable. We know that the characteristic function of the univariate normal
22
4 THE MULTIVARIATE NORMAL DISTRIBUTION
variable is e−t2
2 . Then the characteristic function of Zr is given by:
ψZr (t) = e−t′t2 for t ∈ Rr.
Then by Theorem 4.2.1, Definition 4.1.4, and the uniqueness theorem for
characteristic functions, we have the characteristic function of the multivariate
normal variable.
Theorem 4.2.2. X ∼ Nn (µ,Σ) if and only if its characteristic function is given
by:
ψX (t) = eit′µ−t′Σt
2 for t ∈ Rn.
Theorem 4.2.3. Let C be any given m × n real matrix, and let b be any m × 1
real vector. If X ∼ Nn (µ,Σ) and Y = CX + b, then Y ∼ Nm(Cµ+ b,CΣC′).
Proof. 1. Case 1: m = n.
Follows from Theorem 4.2.2 and Theorem 4.2.1.
2. Case 2: m < n.
Consider the following transformation:
Y∗ =
(Y1
Y2
)=
(C
B
)X +
(b
0n−m
),
where B is any (n−m)× n matrix. Then:(Y1
Y2
)∼ Nn
((Cµ+ b
Bµ
),
(CΣC′ CΣB′
BΣC′ BΣB′
)).
Therefore Y = Y1 = CX+b ∼ Nm (Cµ,CΣC′) .
3. Case 3: m > n.
If r ≤ n is the rank of Σ, by Definition 4.1.4, we know there exists an n× rmatrix D such that X and DZr + µ are identically distributed. Therefore
by Definition 4.1.4 CX + b and CDZr + (Cµ+ b) both have the singular
N (Cµ+ b,CΣC′) distribution.
23
5 STOCHASTIC PROCESSES
5 Stochastic Processes
5.1 Introduction
In order to study stochastic calculus, we must first understand stochastic processes.
In this section, we define stochastic processes in general and give definitions of some
of the most important processes.
Definition 5.1.1. A stochastic process X is a family Xt : t ∈ T of random
variables which map the sample space Ω into the state space S ⊆ R.
We can observe stochastic processes in two manners: by studying fixed
realizations of the process, or by studying the distributional properties of the
process.
Definition 5.1.2. The realization (or sample path) of X at ω for a fixed
ω ∈ Ω is the collection Xt(ω) : t ∈ T of members of S.
Definition 5.1.3. Let t = (t1, t2, . . . , tn) be a vector with each ti ∈ T . Then
the vector (Xt1 , Xt2 , . . . , Xtn) has the joint distribution function Ft : Rn → [0, 1]
defined by Ft(x) = P(Xt1 ≤ x1, Xt2 ≤ x2, . . . , Xtn ≤ xn). Letting t range over all
finite-length vectors of members of T , the collection Ft is called the collection of
finite-dimensional distributions (fdds) of X.
A specific class of stochastic processes, Gaussian processes, will be of par-
ticular interest to us.
Definition 5.1.4. A real-valued, meaning that S = R, continuous-time, meaning
that T = [0,∞), stochastic process X is called a Gaussian process if each
finite-dimensional vector (Xt1 , Xt2 , . . . , Xtn) has the multivariate normal distribu-
tion N (µ(t),Σ(t)) for mean vector µ(t) and covariance matrix Σ(t) which may
depend on t.
5.2 Wiener process
The Wiener process is a stochastic process that will be important for our study of
stochastic calculus and the applications of stochastic calculus.
Definition 5.2.1. A Wiener process W = Wt : t ≥ 0 is a real-valued Gaussian
process such that:
24
6 ITO CALCULUS
1. For any n, Xj = Wtj − Wsj where 1 ≤ j ≤ n are independent variables
whenever the intervals (sj, tj] are disjoint (W has independent increments).
2. Ws+t −Ws ∼ N (0, σ2t) ∀ s, t ≥ 0, where σ2 is a positive constant.
3. The sample paths of W are continuous.
A Wiener process W is called standard if W0 = 0 and σ2 = 1.
6 Ito Calculus
6.1 Introduction
In this section, we will define the integral∫∞
0ψsdWs, where ψs is a stochastic
process. We also explore one of the fundamental ideas of Ito calculus: the Ito
Lemma. We conclude the section with a brief look at solutions to stochastic
differential equations.
6.2 Ito integral
This approach closely follows Section 13.8 of [2] and Chapter 2 of [4]. As we
explore in the next section, the Wiener process cannot be integrated path-wise
with respect to itself, so we define the stochastic integral∫∞
0ψsdWs as the mean
square limit of Riemann-Stieltjes sums. We first define the integral for a simpler
set of stochastic processes, random step functions, as a Riemann-Stieltjes sum. We
then show that, for stochastic processes ψs satisfying certain conditions, the process
can be expressed as the limit of a sequence of random step functions. We define the
Ito integral to be the mean square limit of the sequence of integrals of the random
step functions.
6.2.1 Motivation for Ito integral
The standard tool in mathematics for integrating one function with respect to
another function is the Riemann-Stieltjes integral.
Definition 6.2.1. Let f, g : [0, 1]→ R. Consider a partition τn of [0, 1]:
τn : 0 = t0 < t1 < . . . < tn−1 < tn = 1,
25
6 ITO CALCULUS
and an intermediate partition σn of τn:
σn : ti−1 ≤ yi ≤ ti for i = 1, . . . , n.
The Riemann-Stieltjes sum corresponding to τn and σn is defined by:
Sn = Sn(τn, σn) =n∑i=1
f (yi) [g(ti)− g(ti−1)] .
The Riemann-Stieltjes integral of f with respect to g is defined by the limit
S = limn→∞
Sn,
if this limit exists as mesh(τn) → 0 and if the limit is independent of τn and σn.
The Riemann-Stieltjes integral is denoted:∫ 1
0
f(t)dg(t).
Because the Wiener process Ws is a function of s ∈ [0,∞) and ω ∈ Ω, we
could use the Riemann-Stieltjes integral to integrate sample paths of stochastic
processes ψs with respect to Ws so long as the sample paths of ψs satisfy certain
criteria. However, the set of processes ψs for which the Riemann-Stieltjes integral
exists is small enough (and does not include Ws) that we will need to define the
desired integral as the mean square limit of Riemann-Stieltjes sums.
6.2.2 Characteristics of integrator Ws
Let Ws be a standard Wiener process on the complete probability space (Ω,F ,P).
Note that Ws has the following characteristics:
1. E (|Ws|2) = s <∞ for all s.
2. E (|Ws+h −Ws|2) = h→ 0 as h ↓ 0 for all s.
3. Whenever u ≤ v ≤ s ≤ t,
E ([Wv −Wu][Wt −Ws]) = E (Wv −Wu)E (Wt −Ws) = 0,
26
6 ITO CALCULUS
by Theorem 1.3.17 because W has independent increments.
The first characteristic is important because we will be working with mean-square
convergence. The second is helpful for technical reasons. The third is helpful when
establishing the existence of the limits necessary to define the integral.
6.2.3 Conditions for integrand ψs
We now outline the conditions that must be satisfied by the integrand in order to
define the Ito integral. We begin by stating the definition of a measurable function.
Definition 6.2.2. Let (Ω,F ,P) be a probability space. A function X : Ω → R is
F-measurable if ω ∈ Ω : X(ω) ≤ x ∈ F for all x ∈ R.
Note that random variables are F -measurable functions mapping Ω to R.
Define Ft to be the smallest sub-σ-field of F which contains the null events and with
respect to which Ws, for all 0 ≤ s ≤ t, are measurable. The collection Ft : t ≥ 0is called the filtration, and denoted F . Let B denote the Borel σ-field of subsets of
[0,∞).
Definition 6.2.3. A stochastic process ψ is measurable if the function ψt(ω) of
t and ω ∈ Ω is measurable with respect to the product σ-field B ⊗ F .
Definition 6.2.4. A measurable stochastic process ψ is adapted to the the
filtration F if ψt is Ft-measurable for all t.
Before we can continue, we need to be able to understand the integral∫ baµtdt,
where µt is a real-valued stochastic process. µt is a function of t ∈ [0,∞) and ω ∈ Ω.
The integral∫ baµtdt is a function from Ω → R, and we can interpret it as follows.
For any fixed ω ∈ Ω, µt becomes a deterministic function mapping [0,∞) to R.
The integral∫ baµtdt evaluated at ω, then, is the value of the integral of the sample
path at ω. This integral∫ baµtdt is itself a random variable.
Let
A =
ψt : ψ is adapted to the filtration F and E
(∫ ∞0
|ψt|2dt)<∞
.
We are able to define the Ito integral for stochastic processes ψ ∈ A. For
ψ ∈ A, define
||ψ|| =
√E(∫ ∞
0
|ψt|2dt).
27
6 ITO CALCULUS
The reason we need to impose these conditions on the integrand ψ is that we need
the integrand and the integrator Ws to match in terms of measurability.
6.2.4 Defining
∫ ∞0
ψsdWs for random step functions ψs
Having established the conditions on the process ψs, we can proceed to define the
Ito integral for a predictable step function.
Definition 6.2.5. Let 0 = a0 < a1 < . . . < an = t and let C0, C1, . . . , Cn−1 be
random variables such that:
1. For each i, E (C2i ) <∞.
2. Each Ci is Faj -measurable.
Then we define a predictable step function φ by setting:
φt =n−1∑i=0
CiI(ai,ai+1](t) =
0 if t ≤ 0 or t > an
Ci if ai < t ≤ ai+1
,
where
I(ai,ai+1](t) =
0 if t ≤ ai or t > ai+1
1 if ai < t ≤ ai+1
.
We can now define the stochastic integral of a predictable step function φ
with respect to W .
Definition 6.2.6. Let φ be a predictable step function. Then the stochastic
integral I(φ) of φ with respect to W is defined by:
I(φ) =n−1∑i=0
Ci(Wai+1−Wai).
Before considering sequences of predictable step functions, we will outline
properties of the stochastic integral of a predictable step function.
Theorem 6.2.7. Let α, β ∈ R and let φ1, φ2 be predictable step functions. Then
I(αφ1 + βφ2) = αI(φ1) + βI(φ2).
28
6 ITO CALCULUS
Proof. We have that
(αφ1 + βφ2)t =
0 if t ≤ 0 or t > an
αC1i + βC2
i if ai < t ≤ ai+1
.
Then
I(αφ1 + βφ2) =n−1∑i=0
(αC1i + βC2
i )(Wai+1−Wai)
= αn−1∑i=0
C1i (Wai+1
−Wai) + βn−1∑i=0
C2i (Wai+1
−Wai)
= αI(φ1) + βI(φ2).
To understand another important property of the stochastic integral of a
predictable step function, we must first define the L2 norm.
Definition 6.2.8. Let U be a random variable. Then the L2 norm of U , denoted
||U ||2 is defined by:
||U ||2 =√
E (U2).
We state without proof here that the L2 norm satisfies the triangle inequality,
that is, for random variables U and V :
||U + V ||2 ≤ ||U ||2 + ||V ||2.
We must also understand a more general theory of conditional expectation of
random variables. Rather than simply conditioning random variables on families
of random variables, we can condition random variables on sub-σ-fields of F .
Definition 6.2.9. Let (Ω,F ,P) be a probability space, let Y be a random vari-
able with finite second moment, and let G be a sub-σ-field of F . Let H be the
collection of G-measurable random variables with finite second moment. Then the
conditional expectation E (Y |G) is itself a G-measurable random variable such
that
E ([Y − E(Y |G)]Z) = 0 for all Z ∈ H.
29
6 ITO CALCULUS
Now, ∅,Ω ∈ G, so IΩ ∈ H, yielding:
E([Y − E(Y |G)]IΩ) = 0
E(Y )− E(E(Y |G)) = 0 because IΩ = 1 ∀ ω ∈ Ω
E(Y ) = E(E(Y |G)).
Theorem 6.2.10. If φ is a predictable step function, ||I(φ)||2 = ||φ||.
Proof. We have:
E(|I(φ)|2
)= E
(n−1∑j=0
Cj(Waj+1−Waj)
n−1∑k=0
Ck(Wak+1−Wak)
)
= E
(n−1∑j=0
C2j (Waj+1
−Waj)2 + 2
∑0≤j<k≤n−1
CkCj(Wak+1−Wak)(Waj+1
−Waj)
).
To calculate E(C2j (Waj+1
−Waj)2), we will first show that, when considering only
events in Faj , Cj and Waj+1−Waj are independent.
It suffices to show that, for any I, J ⊂ R,
P((Waj+1−Waj) ∈ I and Cj ∈ J) = P((Waj+1
−Waj) ∈ I)P(Cj ∈ J).
Denote
(Waj+1−Waj)
−1(I) = ω ∈ Ω : (Waj+1−Waj)(ω) ∈ I and C−1
j (J) = ω ∈ Ω : Cj(ω) ∈ J.
Then we need to show that
P[(Waj+1−Waj)
−1(I)∩C−1j (J)] = P[(Waj+1
−Waj)−1(I)]P[C−1
j (J)]. We know that
Waj+1−Waj and Ws are independent for any s ≤ aj. Then we have that:
P[(Waj+1−Waj)
−1(I) ∩W−1s (J)] = P[(Waj+1
−Waj)−1(I)]P[W−1
s (J)]
P[(Waj+1−Waj)
−1(I) ∩K] = P[(Waj+1−Waj)
−1(I)]P[K] for any K = W−1s (J).
Because Faj is defined to be the smallest sub-σ-field of F with respect to which Ws
is measurable for all s ≤ aj, K and (Waj+1−Waj)
−1(I) are independent events for
any K ∈ Faj . We also know that Cj is Faj -measurable, so C−1j (J) ∈ Faj , which
gives us the desired independence. We condition on Faj because, if we allow for
30
6 ITO CALCULUS
all of the events in F , we would not be guaranteed the independence of Cj and
Waj+1−Waj .
Then, because Waj+1−Waj ∼ N (0, aj+1 − aj),
E(C2j (Waj+1
−Waj)2) = E
(E(C2
j (Waj+1−Waj)
2|Faj))
= E(C2j )(aj+1 − aj).
Now, to calculate E[CjCk(Waj+1−Waj)(Wak+1
−Wak)] when j < k, we first
show that, when considering only events in Fak , CjCk(Waj+1−Waj) and Wak+1
−Wak
are independent.
Again, it suffices to show that, for any I, J ⊂ R,
P([CjCk(Waj+1
−Waj)] ∈ I and (Wak+1−Wak) ∈ J
)= P(CjCk(Waj+1
−Waj) ∈ I)P((Wak+1−Wak) ∈ J).
Denote (CjCk(Waj+1−Waj))
−1(I) = ω ∈ Ω : (CjCk(Waj+1−Waj))(ω) ∈ I and
(Wak+1−Wak)
−1(J) = ω ∈ Ω : (Wak+1−Wak)(ω) ∈ J. Then we need to show that
P[(CjCk(Waj+1−Waj))
−1(I)∩(Wak+1−Wak)
−1(J)] = P[(CjCk(Waj+1−Waj))
−1(I)]P[(Wak+1−
Wak)−1(J)]. We know that Wak+1
−Wak and Ws are independent for any s ≤ ak.
Then we have that:
P[(Wak+1−Wak)
−1(J) ∩W−1s (I)] = P[(Wak+1
−Wak)−1(J)]P[W−1
s (I)]
P[(Wak+1−Wak)
−1(J) ∩K] = P[(Wak+1−Wak)
−1(J)]P[K] for any K = W−1s (I)
Because Fak is defined to be the smallest sub-σ-field of F with respect to which Ws
is measurable for all s ≤ ak, K and (Wak+1−Wak)
−1(J) are independent events for
any K ∈ Fak . We also know that Cj, Ck, and (Waj+1−Waj) are all Fak-measurable,
so [CjCk(Waj+1−Waj)]
−1(I) ∈ Fak , which gives us the desired independence.
Then, because E(Wak+1−Wak) = 0, we have:
E(|I(φ)|2) =∑j
E(C2j )(aj+1 − aj) = E
(∫ ∞0
|φ(t)|2dt)
= ||φ||2.
31
6 ITO CALCULUS
6.2.5 Sequences of predictable step functions approaching ψ
Having defined the stochastic integral for a predictable step function φ, we now
consider the limit of a sequence of predictable step functions. Specifically, we will
show that we can find a sequence of predictable step functions converging to any
stochastic process ψ satisfying the requirements outlined above.
Theorem 6.2.11. Let ψ ∈ A have continuous sample paths. Then there exists
a sequence φ = φ(n) of predictable step functions such that ||φ(n) − ψ|| → 0 as
n→ 0.
Proof. Define the predictable step function φ(n) by:
φ(n)(t) =
n
∫ jn
j−1n
ψsds for jn< t ≤ j+1
n, 1 ≤ j < n2
0 otherwise
.
We have, for j ≥ 1:
∫ j+1n
jn
|φ(n)t |2dt =
1
n|φ(n)t |2
= n
∣∣∣∣∣∫ j
n
j−1n
ψsds
∣∣∣∣∣2
.
We state without proof here one of the variations of the Cauchy-Schwarz inequality:∣∣∣∣∫ f(x)g(x)dx
∣∣∣∣2 ≤ ∫ |f(x)|2dx∫|g(x)|2dx.
Then
∫ j+1n
jn
|φ(n)t |2dt = n
∣∣∣∣∣∫ j
n
j−1n
1ψsds
∣∣∣∣∣2
≤ n
∫ jn
j−1n
ds
∫ jn
j−1n
|ψs|2ds by Cauchy-Schwarz inequality
=
∫ jn
j−1n
|ψs|2 ds.
32
6 ITO CALCULUS
Therefore, we have, for T ≥ 0:∫ ∞T
|φ(n)s |2ds ≤
∫ ∞T− 2
n
|ψs|2ds.
We also know that:∫ ∞0
|φ(n)s − ψs|2ds =
∫ T
0
|φ(n)s − ψs|2ds+
∫ ∞T
|φ(n)s − ψs|2ds.
To complete the proof, we will to show that
limn→∞
E(∫ ∞
0
∣∣φ(n)s − ψs
∣∣2 ds) = 0.
We will first show that
∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds a.s.−−→ 0.We will then show that
∫ ∞0
∣∣φ(n)s − ψs
∣∣2 dsis uniformly dominated by an integrable function on [0,∞), and use an application
of the dominated convergence theorem to prove the desired result.
Note that∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds =
∫ T
0
∣∣φ(n)s − ψs
∣∣2 ds+
∫ ∞T
∣∣φ(n)s − ψs
∣∣2 ds.On the interval ( j
n, j+1
n], the mean value of ψs is given by
n
∫ j+1n
jn
ψsds = φ(n)t for t ∈
(j + 1
n,j + 2
n
].
By the mean value theorem, φ(n)t = ψr for all t ∈
(jn, j+1
n
]and some r ∈(
jn, j+1
n
]. Since the sample paths of ψ are continuous on the compact interval [0, T ],
we know that the sample paths of ψ are uniformly continuous on [0, T ]. Then for
all ε > 0, there exists δ > 0 such that:
|x− y| < δ ⇒ |ψx − ψy| < ε.
Then given any ε > 0, let δ be such that whenever |x − y| < δ, |ψx − ψy| <√ε
T.
33
6 ITO CALCULUS
Let n0 satisfy n0 >T
δ. Then for any n ≥ n0, on the jth interval,
∫ j+1n
jn
∣∣φ(n)s − ψs
∣∣2 ds < ε
nT.
On [0, T ], there are nT intervals of length 1n, so,
∫ T
0
∣∣φ(n)s − ψs
∣∣2 ds < ε⇒∫ T
0
∣∣φ(n)s − ψs
∣∣2 ds a.s.−−→ 0
. Now, since for x, y ∈ R, |x+ y|2 ≤ 2 (|x|2 + |y|2), we have:∫ ∞T
∣∣φ(n)s − ψs
∣∣2 ds ≤ 2
(∫ ∞T
∣∣φ(n)s
∣∣2 ds+
∫ ∞T
|ψs|2 ds)
≤ 2
(∫ ∞T− 2
n
|ψs|2 ds+
∫ ∞T
|ψs|2 ds
)
= 4
∫ ∞T− 2
n
|ψs|2 ds−∫ T
T− 2n
|ψs|2 ds
≤ 4
∫ ∞T− 2
n
|ψs|2 ds.
Now let n→∞ and then T →∞ in:∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds =
∫ T
0
∣∣φ(n)s − ψs
∣∣2 ds+
∫ ∞T
∣∣φ(n)s − ψs
∣∣2 ds.Because ψ ∈ A, ∫ ∞
0
|ψs|2 ds <∞ almost surely.
Therefore, ∫ ∞T
∣∣φ(n)s − ψs
∣∣2 ds ≤ 4
∫ ∞T− 2
n
|ψs|2 dsa.s.−−→ 0 as T →∞.
We have shown that
∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds a.s.−−→ 0, and we must now show that∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds is dominated on [0,∞).
34
6 ITO CALCULUS
We know that
0 ≤∫ ∞
0
∣∣φ(n)s − ψs
∣∣2 ds ≤ 4
∫ ∞0
|ψs|2 ds.
Then we can apply the dominated convergence theorem as follows:
limn→∞
E(∫ ∞
0
∣∣φ(n)s − ψs
∣∣2 ds) = E[
limn→∞
∫ ∞0
∣∣φ(n)s − ψs
∣∣2 ds] = 0.
Therefore,
||φ(n) − ψ|| → 0 as n→∞.
6.2.6 Existence and uniqueness of limit Ito integral I(ψ)
We are now able to define the Ito integral∫∞
0ψsdWs. We will first show that, if
φ is a sequence of predictable step functions converging in A to ψ, that I(φ(n))
converges in L2 to a limit random variable. We then show that this limit random
variable is almost surely unique. This limit random variable is defined as the Ito
integral of the process ψ.
Theorem 6.2.12. Let ψ ∈ A and φ = φ(n) be a sequence of predictable step
functions converging in A to ψ (in the sense of Theorem 6.2.11). Then the sequence
I(φ(n)) converges mean square to a limit random variable, which we denote I(φ).
Proof. We first show that I(φ(n) is Cauchy in L2. φ(n) − φ(m) is a predictable step
function, so we have the following:
||I(φ(n))− I(φ(m))||2 = ||I(φ(n) − φ(m))||2= ||φ(n) − φ(m)|| by Theorem 6.2.10
≤ ||φ(n) − ψ||+ ||φ(m) − ψ|| by the triangle inequality
→ 0 as m,n→∞.
Because L2 is complete, and the L2 norm corresponds to mean-square convergence,
the sequence I(φ(n)) converges in mean square to a limit random variable.
The following theorem asserts the almost sure uniqueness of the limit random
variable I(φ).
35
6 ITO CALCULUS
Theorem 6.2.13. Let φ and ρ converge in A to ψ. Then P[I(φ) = I(ρ)] = 1.
We denote this almost surely unique random variable I(ψ).
Proof. By the triangle inequality,
||I(φ)− I(ρ)||2 ≤ ||I(φ)− I(φ(n))||2 + ||I(φ(n))− I(ρ(n))||2 + ||I(ρ(n))− (ρ)||2.
As n→∞, ||I(φ)− I(φ(n))||2, ||I(ρ(n))− I(ρ)||2 →∞ by Theorem 6.2.12. We also
have that:
||I(φ(n))− I(ρ(n))||2 = ||I(φ(n) − ρ(n))||2 by Theorem 6.2.7
= ||φ(n) − ρ(n)|| by Theorem 6.2.10
≤ ||φ(n) − ψ||+ ||ρ(n) − ψ|| by the triangle inequality
→ 0 as n→∞.
Then ||I(φ)− I(ρ)||2 = 0, so P(I(φ) = I(ρ)) = 1.
Definition 6.2.14. The Ito integral of the stochastic process ψ is defined to be
I(ψ). We denote the integral
∫ ∞0
ψsdWs.
∫ t
0
ψsdWs is defined by
∫ ∞0
I(0,t](s)ψsdWs,
where
I(0,t](s) =
1 if s ∈ (0, t]
0 otherwise.
6.2.7 Example:
∫ t
0
WsdWs
We wish to find the random variable
∫ t
0
WsdWs according to the definition of the
Ito integral outlined in the previous section. For a given n, let δ = tn. We consider
the sequence of predictable step functions given by:
φ(n)s =
Wjδ for jδ < s ≤ (j + 1)δ, 0 ≤ j ≤ n− 1
0 otherwise.
36
6 ITO CALCULUS
Then I(φ(n)s ) =
n−1∑j=0
Wjδ(W(j+1)δ −Wjδ). We have:
2I(φ(n)s ) = 2
n−1∑j=0
WjδW(j+1)δ −W 2jδ
=n−1∑j=0
W 2(j+1)δ −W 2
jδ −W 2jδ −W 2
(j+1)δ + 2W(j+1)δWjδ
=n−1∑j=0
(W 2
(j+1)δ −W 2jδ
)−
n−1∑j=0
(W(j+1)δ −Wjδ
)2
= W 2t −W 2
0 −n−1∑j=0
(W(j+1)δ −Wjδ
)2.
37
6 ITO CALCULUS
Note thatn−1∑j=0
(W(j+1)δ −Wjδ
)2 m.s.−−→ t as n→∞:
E
[n−1∑j=0
(W(j+1)δ −Wjδ
)2 − t
]2 = E
(n−1∑j=0
[(W(j+1)δ −Wjδ
)2 − t
n
])2
Let Aj =(W(j+1)δ −Wjδ
)2 − t
n= E
((A0 + A1 + . . . An−1)2
)= E
(n−1∑j=0
A2j + 2
∑0≤s<t≤n−1
AmAk
)
= E
(n−1∑j=0
A2j
)+ 2
∑0≤s<t≤n−1
E(AmAk)
= E
(n−1∑j=0
A2j
)+ 2
∑0≤s<t≤n−1
E(Am)E(Ak)
= E
(n−1∑j=0
A2j
)
=n−1∑j=0
E
([(W(j+1)δ −Wjδ
)2 − t
n
]2)
=n−1∑j=0
E((W(j+1)δ −Wjδ
)4)
− 2t
nE((W(j+1)δ −Wjδ
)2)
+t2
n2
= n
(3t2
n2− 2t2
n2+t2
n2
)=
2t2
n→ 0 as n→∞.
Therefore, I(φ(n))→ 12
(W 2t − t) in mean square, yielding:∫ t
0
WsdWs =1
2
(W 2t − t
).
38
6 ITO CALCULUS
6.3 The Ito Lemma
Now that we have defined the Ito integral, we can develop a tool that will allow us
to relatively easily define Ito integrals for a set of stochastic processes ψs. This tool
is the Ito Lemma, and we will use a similar argument in its construction to that
used to justify the classical chain rule of differentiation. The argument makes use of
Taylor expansions, and for this reason, the Ito Lemma is considered the stochastic
analog of the deterministic chain rule.
6.3.1 Key idea of the Ito Lemma: (dWt)2 m.s.−−→ dt as dt→ 0
Before using Taylor expansions to construct the Ito Lemma, it is important to
understand the most important concept of the Ito Lemma: that the increment of
the Wiener process squared, (dWt)2, over dt converges to dt in mean square as dt
approaches 0.
Theorem 6.3.1. Let dWt = Wt+dt −Wt. Then
(dWt)2 m.s.−−→ dt as dt→ 0.
Proof. We have, for all t ≥ 0:
E ([(dWt)2 − dt]2) = E ([(Wt+dt −Wt)
2 − dt]2)
= E ((Wt+dt −Wt)4 − 2dt(Wt+dt −Wt)
2 + dt2)
= E ((Wt+dt −Wt)4)− 2dtE (Wt+dt −Wt)
2) + dt2
= 3dt2 − 2dt2 + dt2
= 2dt2
→ 0 as dt→ 0.
This key idea will allow us to interpret the Taylor expansions of functions
of Wiener processes by determining which arguments in the expansion will have
non-negligible contributions to the Taylor expansion.
6.3.2 The Ito Lemma
Let f : R→ R be an infinitely differentiable function. If we consider a sample path
of a Wiener process Wt(ω), we have the following Taylor expansion, again writing
39
6 ITO CALCULUS
dWt = Wt+dt −Wt:
f(Wt + dWt)− f(Wt) = f ′(Wt)dWt +1
2f ′′(Wt)(dWt)
2 + . . . .
Then, because (dWt)2 m.s.−−→ dt as dt → 0, third and higher order terms have
negligible contribution to the Taylor expansion, so we have:
f(Wt)− f(Ws) =
∫ t
s
f ′(Wx)dWx +1
2
∫s
tf ′′(Wx)dx,
where∫ tsf ′(Wx)dWx is an Ito integral, and
∫ tsf ′′(Wx)dx is a Riemann integral, and
equality is in the mean square sense.
If we let f : R2 → R have infinite partial derivatives, the Taylor expansion yields:
f(t+ dt,Wt+dt)− f(t,Wt) = f1(t,Wt)dt+ f2(t,Wt)dWt
+12
[f11(t,Wt)(dt)2 + 2f12(t,Wt)dtdWt + f22(t,Wt)(dWt)
2]
. . . .
Because (dWt)2 m.s.−−→ dt as dt→ 0, we have the following:
1. The contribution of third and higher order terms to the Taylor expansion are
negligible.
2. The contribution of the dtdWt term is negligible.
Therefore, we have:
f(t,Wt)− f(s,Ws) =
∫ t
s
[f1(x,Wx) +
1
2f22(x,Wx)
]dx+
∫ t
s
f2(x,Wx)dWx,
where again∫ tsf2(x,Wx)dWx is an Ito integral, and
∫ ts
[f1(x,Wx) + 1
2f22(x,Wx)
]dx
is a Riemann integral, and equality is in the mean square sense.
6.3.3 Ito processes
There are a subset of stochastic processes, called Ito processes, which can be
represented as the solution to a stochastic differential equation (SDE).
40
6 ITO CALCULUS
Definition 6.3.2. A stochastic process Xt is an Ito process if it is a solution to
the stochastic differential equation:
Xt = X0 +
∫ t
0
µ(s,Xs)ds+
∫ t
0
σ(s,Xs)dWs,
where µ and σ are stochastic processes,∫ t
0µ(s,Xs)ds is a Riemann integral, and∫ t
0σ(s,Xs)dWs is an Ito integral. We abbreviate this SDE to:
dX = µ(t,X)dt+ σ(t,X)dW.
We state without proof the following facts:
1. So long as weak conditions on µ,σ, and X0 are satisfied, the SDE dX =
µ(t,X)dt + σ(t,X)dW has a unique solution which has continuous sample
paths.
2. If X is an Ito process, the processes µ and σ are uniquely determined:
µ1(t,X)dt+σ1(t,X)dW = dX = µ2(t,X)dt+σ2(t,X)dW ⇒ µ1 = µ2 and σ1 = σ2.
We will now extend the Ito Lemma to stochastic processes that are functions of Ito
processes. The justification that follows is similar to that in the earlier Ito Lemma
statements: a justification for the formula rather than a rigorous proof. Suppose
Xt is an Ito process with
dXt = µ(t,Xt)dt+ σ(t,Xt)dWt.
Let f : R2 → R have infinite partial derivatives. Then a Taylor expansion for
f(t+ dt,Xt+dt)− f(t,Xt) yields:
f(t+ dt,Xt+dt)− f(t,Xt) = f1(t,Xt)dt+ f2(t,Xt)dXt
+12
[f11(t,Xt)(dt)2 + 2f12(t,Xt)dtdXt + f22(t,Xt)(dXt)
2]
. . .
= f1(t,Xt)dt+ µf2(t,Xt)dt+ σf2(t,Xt)dWt
+12
[σ2f22(t,Xt)(dWt)2]
. . .
41
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Then we have the following extension of the Ito Lemma:
f(t,Xt)− f(s,Xs) =
∫ t
s
[f1(y,Xy) + µf2(y,Xy) +
σ2
2f22(y,Xy)
]dy
+
∫ t
s
σf2(y,Xy)dWy.
6.3.4 Geometric Brownian motion
Let µ, σ ∈ R with σ > 0. Consider the SDE:
dXt = Xt(µdt+ σdWt),
where Wt is a standard Wiener process. The unique solution to this SDE is given
by:
Xt = f(t,Wt) = X0e
(µ−σ
2
2
)t+σWt ,
and Xt is called a geometric Brownian motion (note that what we have called
a Wiener process is also sometimes called a Brownian motion). The geometric
Brownian motion of this form is clearly an Ito process with
µ(s,Xs) = µXs and σ(s,Xs) = σXs,
where µ(s,Xs) and σ(s,Xs) are stochastic processes determining the Ito process,
and µ and σ are constant parameters of the geometric Brownian motion.
7 Application: Black-Scholes Option Pricing Model
To this point, we have covered the mathematical background necessary to under-
stand an application of stochastic calculus in finance: the Black-Scholes option
pricing model. In this section, we explain and explore the model.
7.1 Black-Scholes option pricing model
The derivation of the Black-Scholes option pricing model and the solution for the
European call option closely follow the explanations given in Chapter 4 of [4] and
Section 13.10 of [2].
42
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
7.1.1 Introduction
Subject to a nontrivial set of assumptions, the Black-Scholes option pricing model
can be used to find the no-arbitrage rational price for an option, a financial instru-
ment that derives its value from the value of an underlying asset and which we will
explain in detail later. There are a couple of key motivating ideas that are crucial
to the model:
1. The price of the asset underlying the option (henceforth stock) can be modeled
using a geometric Brownian motion stochastic process.
2. There do not exist arbitrage opportunities, or opportunities to profit without
risk.
3. There exists a unique self-financing portfolio of the stock and a risk-free
bond that replicates the payout of the option. This portfolio is continuously
rebalanced between the stock and the bond in such a manner that changes to
the value of the portfolio are caused entirely by changes in the prices of the
stock and bond rather than a cash flow into or out of the portfolio. Because
it replicates the payout of the option and because of the assumption that
arbitrage opportunities do not exist, the value of the portfolio at time t = 0
must match the no-arbitrage rational price of the option at time t = 0.
The first two of these ideas are important assumptions of the Black-Scholes model,
while the third depends upon the type of option, and is the most significant step
in the derivation of the model.
7.1.2 Set-up of the model
Suppose the price per unit of the stock that underlies the option is given by a
stochastic process S = St : t ≥ 0. Further, assume that S satisfies the geometric
Brownian motion SDE, dSt = St(µdt + σdWt). This is a vital assumption for the
model, and one that we will explore in more detail in the next subsection. We know
that the unique solution to the geometric Brownian motion SDE is
St = f(t,Wt) = e
(µ−σ
2
2
)t+σWt ,
43
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
where we have normalized so that S0 = 1.
It is assumed that financial market participants can buy any quantity, including
a negative quantity, of the stock without transaction costs. Buying a negative
quantity of the stock is equivalent to selling the stock short: borrowing the stock
and selling it.
In addition to the stock, financial market participants can buy any quantity of
a non-risky asset, or bond, which has a constant continuously compounded interest
rate of r. If we denote the price of one unit of the bond at time t by βt, β satisfies
the integral equation
dβt = rβtdt, which yields βt = ert after normalizing so that β0 = 1.
Buying a negative quantity of a bond is equivalent to borrowing money at an
interest rate of r.
We call a pair (at, bt) of stochastic processes which are both adapted to the filtration
of σ-fields generated by Wt a portfolio, where at and bt represent the number of
units of stock and bond in the portfolio, respectively, at time t. The value of the
portfolio is given by:
Vt(a, b) = atSt + btβt.
The portfolio is called self-financing if changes in the value of the portfolio arise
only from changes in the price of the stock and the bond, and not from the input of
money from external sources into the portfolio or taking money out of the portfolio
for external use. The self-financing condition is:
dVt(a, b) = atdSt + btdβt.
In order to price an option, we make use of the no arbitrage assumption: that there
does not exist an opportunity for a risk-free profit. We need to find a self-financing
portfolio that replicates the payout of the option in question, and the portfolio will
be specific to the type of option. Then the value of the option at time t is equal to
the value, Vt(a, b) of the self-financing portfolio (a, b) that replicates the option.
44
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
7.1.3 European call option
In this section, we find the no-arbitrage rational price for a European call option
using the Black-Scholes model. The European call option is a contract that enables
the holder to purchase one unit of stock at a fixed strike price, K, at a given maturity
time, T . The holder of the European call option has the ability to purchase the
share only at time T , and can decide not to purchase the share. To find the value
of the option at time T , consider the following:
1. If ST > K, the holder of the option can purchase the share for K and sell it
immediately for ST , yielding a profit of ST −K.
2. If ST < K, the holder of the option will not purchase the share for K, because
he/she has can purchase the same share for ST < K. Thus the value of the
option is 0.
Then, the value of the option at time T is
maxST −K, 0 = (ST −K, 0)+.
Because the European call option can only be exercised at time T , a portfolio
(a, b) with value function Vt(a, b) is said to replicate the option if VT (a, b) = (ST −K, 0)+ almost surely. Assume that there exists a self-financing portfolio (a, b) that
replicates the European call option. We will later revisit this assumption. Then
because financial market participants can purchase either the option or the portfolio
at any time t < T , and because the portfolio replicates the option, the cost of the
portfolio, Vt(a, b) and the cost of the option must be the same at time t.
To find the value function of (a, b), assume that there exists a smooth deterministic
function u : R2 → R such that
Vt = atSt + btβt = u(T − t, St) for t ∈ [0, T ].
Because (a, b) replicates the option, we have the following terminal condition:
u(0, ST ) = VT (a, b) = (ST −K, 0)+.
We will derive the value function as follows: on the one hand, we will apply the Ito
Lemma to obtain an Ito process expression for Vt − V0, using the fact that St is an
45
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Ito process, and on the other hand, we will use the fact that (a, b) is self-financing
to obtain another Ito process expression for Vt − V0. We will combine these two
expressions to yield a partial differential equation that the value function satisfies,
and to which the solution is well understood. We will conclude the derivation by
proving that (a, b) is indeed self-financing and replicates the European call option.
Let f(t, x) = u(T − t, x). Then we have:
f1(t, x) = −u1(T − t, x), f2(t, x) = u2(T − t, x), f22(t, x) = u22(T − t, x) .
Note that St satisfies the geometric Brownian motion SDE dSt = St(µdt + σdWt),
so St is an Ito process with µ(t, St) = µSt and σ(t, St) = σSt. Then an application
of the Ito Lemma yields:
Vt − V0 = f(t, St)− f(0, S0)
=
∫ t
0
(f1(s, Ss) + µSsf2(s, Ss) +
1
2σ2S2
sf22(s, Ss)
)ds
+
∫ t
0
σSsf2(s, Ss)dWs
=
∫ t
0
(−u1(T − s, Ss) + µSsu2(T − s, Ss) +
1
2σ2S2
su22(T − s, Ss))ds
+
∫ t
0
σSsu2(T − s, Ss)dWs.
Then the value process is an Ito process with
µ(t, St) = −u1(T − s, Ss) + µSsu2(T − s, Ss) +1
2σ2S2
su22(T − s, Ss),
and
σ(t, St) = σSsu2(T − s, Ss).
46
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
We now find another Ito process representation of Vt beginning with the self-
financing condition:
dVt(a, b) = atdSt + btdβt
Vt − V0 =
∫ t
0
asdSs +
∫ t
0
bsdβs
=
∫ t
0
asdSs +
∫ t
0
Vs − asSsβs
rβsds, because bs =Vs − asSs
βsand dβs = rβsds
=
∫ t
0
asdSs +
∫ t
0
r (Vs − asSs) ds
=
∫ t
0
µasSsds+
∫ t
0
σasSsdWs +
∫ t
0
r (Vs − asSs) ds
=
∫ t
0
((µ− r)asSs + ru(T − s, Ss)) ds+
∫ t
0
σasSsdWs.
Therefore, we have found a representation for Vt as an Ito process with
µ(t, St) = (µ− r)asSs + ru(T − s, Ss),
and
σ(t, St) = σasSs.
Because we have two representations for the Ito process Vt, the coefficient processes
of the Riemann and Ito integrals must coincide:
σatSt = σStu2(T − t, St)at = u2(T − t, St),
and
(µ− r)atSt + ru(T − t, St) = −u1(T − t, St) + µStu2(T − t, St) + 12σ2S2
t u22(T − t, St)= (µ− r)u2(T − t, St)St + ru(T − t, St).
Then we have the following partial differential equation:
ru(t, x) =1
2σ2x2u22(t, x) + rxu2(t, x)− u1(t, x).
47
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
When combined with the terminal condition, this partial differential equation ac-
tually has the well-studied explicit solution given by:
u(t, x) = xΦ(g(t, x))−Ke−rtΦ(h(t, x)),
where
g(t, x) =ln(xK
)+(r + 1
2σ2)t
σ√t
,
h(t, x) = g(t, x)− σ√t,
and
Φ(x) =1√2π
∫ x
−∞e−y22 dy, for x ∈ R.
Therefore, at any time t < T , the Black-Scholes rational price of the option is given
by:
Vt = u(T − t, St) = StΦ(g(T − t, St))−Ke−r(T−t)Φ(h(T − t, St)).
Now, this function is the value function of the portfolio given by
at = Φ(g(T − t, St)),
and
bt = −Ke−rTΦ(h(T − t, St)),
because
bt =Vt − atSt
βtand βt = ert.
To conclude the derivation, we must now show that this portfolio is indeed self-
financing and that it replicates the European call option.
Let α, γ : R2 → R be smooth functions. Let (α, γ) be the portfolio that holds
α(t, St) units of stock and γ(t, St) units of bond at time t. Then the value function
is given by:
Vt(α, γ) = v(t, St) = Stα(t, St) + γ(t, St)ert.
Theorem 7.1.1. Suppose v(t, x) is twice continuously differentiable. (α, γ) is self-
financing if and only if the following hold:
xαx + ertγx = 0,
48
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
1
2σ2x2αx + xαt + ertγt = 0.
Proof. Applying the Ito Lemma to v(t, St) = Stα(t, St) + γ(t, St)ert yields:
dv(t, St) = vx(t, St)dSt +
[vt(t, St) +
1
2σ2S2
t vxx(t, St)
]dt.
Now, (α, γ) is self-financing if and only if:
dv(t, St) = α(t, St)dSt + rγ(t, St)ertdt.
Then we have two representations of vt as an Ito process, so coefficient processes
coincide, and (α, γ) is self-financing if and only if:
α = vx
= α + xαx + ertγx
0 = xαx + ertγx,
andrγert = xαt + γte
rt + rγert + 12σ2x2(xαxx + 2αx + ertγxx)
0 = xαt + γtert + 1
2σ2x2(xαxx + 2αx + ertγxx).
.
Differentiate 0 = xαx + ertγx with respect to x:
0 = xαxx + αx + ertγxx,
so
0 = xαt + ertγt +1
2σ2x2αx.
Theorem 7.1.2. Let v(t, x) be twice continuously differentiable. v(t, St) is the
value function of a self-financing portfolio if and only if
1
2σ2x2vxx + rxvx + vt − rv = 0.
Proof. Suppose1
2σ2x2vxx + rxvx + vt − rv = 0.
49
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Let
α = vx and γ = e−rt(v − xvx).
Then the portfolio (α, γ) has value function v(t, St), and
xαx + ertγx = xvxx + vx − vx − xvxx= 0,
and
12σ2x2αx + xαt + ertγt = 1
2σ2x2vxx + xvxt + (vt − xvxt)− r(v − xvx)
= 0 by assumption.
Therefore (α, γ) is self-financing, and v(t, St) is the value function of a self-financing
portfolio.
Now assume v(t, St) is the value function of a self-financing portfolio. Then, by
Theorem 7.1.1,
v(t, x) = xα(t, x) + ertγ(t, x)
for some (α, γ) such that:
xαx + ertγx = 0,
1
2σ2x2αx + xαt + ertγt = 0.
Thenvx = xαx + α + ertγx
= α,
and
γ = e−rt(v − xvx),
so0 = 1
2σ2x2αx + xαt + ertγt
= 12σ2x2vxx + xvxt + (vt − xvxt)− r(v − xvx)
= 12σ2x2vxx + rxvx + vt − rv.
Now, by Theorem 7.1.1 and Theorem 7.1.2, the portfolio given by
at = Φ(g(T − t, St))
50
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
and
bt = −Ke−rTΦ(h(T − t, St))
is self-financing.
We know by construction that
Vt(a, b) = StΦ(g(T − t, St))−Ke−r(T−t)Φ(h(T − t, St)).
Taking the limit as t ↑ T , we see that
g(T − t, St), h(T − t, St)→
−∞ if ST < K,
∞ if ST > K.
Therefore, if ST 6= K, we have VT (a, b) = (ST − K)+. Since P(ST = K) = 0,
VT (a, b) = (ST −K)+ almost surely, and (a, b) replicates the European call option.
7.1.4 Assumptions of the model
While we have explicitly mentioned some of the assumptions of the model in earlier
sections, we will restate the assumptions here, including some justification for the
validity of as well as potential problems with the assumptions.
Stock price is approximated by geometric Brownian motion St = e(µ−12σ2)t+σWt
This assumption is vital for multiple reasons. First, it includes the weaker as-
sumption that St is an Ito process, which is the fact that allows us to utilize the
Ito integral and Ito lemma to find solutions to the stochastic differential equations
that are derived from the model. Second, the specification of St as a geometric
Brownian motion assumes a constant mean rate of return µ and volatility σ from
time t = 0 to t = T , and determines the nature of the randomness that is modeled
into the stock price (namely that the Wiener process is a good approximation of
the randomness in a stock price over time).
St takes values in the positive reals
While this assumption is a consequence of modeling the stock price process as a
geometric Brownian motion, it is provably false in reality, because prices paid for
shares of stock cannot be irrational numbers. Therefore, in reality, stock prices
51
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
must be elements of the rational numbers, a countable subset of R. If we assume
that share prices are bounded above by some large finite bound, the set of values
that a share price can take is actually finite. In either case, the proof that the Black-
Scholes portfolio replicates the European call option relies upon the fact that, for a
continuous random variable X, P (X = x) = 0 ∀ x ∈ R. Of course, the strike price
K of the European call option will also be a rational number, so the assertion that
P (ST = K) = 0 is no longer trivially true.
No arbitrage
The assumption that there do not exist risk-free opportunities to make a profit is
vital to equate the value of the self-financing portfolio that replicates the European
call option, which is found using mathematical techniques described above, with the
value of the option itself. Without this assumption (or assumptions that provide
other ways to necessitate the equality of the value of the portfolio and the value
of the option), the mathematical theory would allow us to prove the existence and
calculate the value of the self-financing portfolio that replicates the option, but
would not allow us to make a conclusion about the rational price of the option
based on the portfolio.
Existence of risk-free bond with constant interest rate
This assumption, combined with the no arbitrage assumption, states that all finan-
cial market participants can borrow or lend money at the same risk-free constant
interest rate r, and is clearly not true in reality.
Ability to purchase and hold any positive or negative quantity of stock
and bond
While at ≥ 0 ∀ t ∈ [0, T ] and bt ≤ 0 ∀ t ∈ [0, T ] for the self-financing portfolio
described above, this assumption and the resulting self-financing portfolio does
imply that all financial market participants can borrow any amount of money at
the same risk-free constant interest rate r.
52
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Ability to make trades in continuous time
The construction of the self-financing portfolio vitally relies upon the assumption
that financial market participants can buy and sell units of stock and bond in contin-
uous time in order to instantaneously re-balance the portfolio in response to changes
in the stock price. Clearly, this is impossible in reality. In Section 7.2, we explore
how relaxing this assumption and only allowing financial market participants to re-
balance a portfolio that replicates the European call option at discrete time steps
impacts the amount the portfolio deviates from the self-financing property that
holds for instantaneously rebalanced portfolios.
No transaction costs
While the assumption that financial market participants need not pay transaction
costs is made to simplify calculations, it is problematic because even small transac-
tion costs could have a large impact on the value of a portfolio that is rebalanced
continuously in time, as the Black-Scholes model assumes, or even on the value of a
portfolio that is rebalanced at several discrete steps of time per day in an attempt to
minimize the amount the discrete time portfolio deviates from being self-financing
(the deviation of portfolios that are re-balanced at discrete time steps from being
self-financing is studied at length in Section 7.2).
No dividends
The assumption that the stock does not pay dividends is an assumption made to
simplify calculations.
Value function Vt of portfolio is a smooth deterministic function of t and
St
This assumption is required to be able to apply the Ito lemma to the value process.
The assumption that the function is smooth is potentially problematic. Because
sample paths of the Wiener process are not differentiable, the possibility exists
that the value process of the self-financing portfolio is not differentiable, or that it
is not continuous. Other methods of deriving the Black-Scholes model, such as the
change of measure technique, rely upon stronger backgrounds of probability theory
53
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
and measure theory, and allow this assumption to be relaxed, however. The change
of measure derivation of the Black-Scholes model is described in Section 13.10 of [2].
7.2 Approximating the Black-Scholes portfolio at discrete
time steps
7.2.1 Introduction and motivation
One of the assumptions of the Black-Scholes model that is essential to prove
that the portfolio that replicates the European call option is also self-financing
is that financial market participants can make transactions in continuous time,
instantaneously reacting to changes in the price of the stock and bond. In reality,
however, financial transactions are conducted at discrete time steps. In this section,
we study the effect that relaxing this assumption has on the self-financing property
of the replicating portfolio. The motivation to further study the effect of rebalancing
the portfolio at discrete time steps on the self-financing property of the portfolio
stems in part from [3].
We build a model that calculates a sample path of the stock price process St at
discrete time steps, conduct a numerical analysis over several sample paths of St to
study the relationship between the frequency of the time steps and the deviation
of the portfolio from the self-financing property, and build a model to study the
deviation of portfolios based on historical stock price data from the self-financing
property. Of course, before we can properly study the deviation of a portfolio
from the self-financing property, we must define the way that we will quantify that
deviation.
7.2.2 Deviation of a discrete-time portfolio from the self-financing prop-
erty: cumulative correction
Suppose we want to use the replicating portfolio from the Black-Scholes model
to find the value of a European call option with strike price K and maturity T
over time, and that we assume that financial market transactions occur at discrete
time steps (this assumption is the only deviation from the set-up of the Black-
Scholes model described in Section 7.1). Suppose we break the interval [0, T ] into
n ∈ N intervals each of length Tn
and that financial market participants perform
54
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
transactions only at times t ∈ jTn for j ∈ 0, 1, . . . n. At these times t, financial
market participants solve the Black-Scholes portfolio equations:
at = Φ(g(T − t, St)), and bt = −Ke−rTΦ(h(T − t, St)),
and adjust their holdings of stock and bond accordingly. Because the stock price St
changes between tj = jTn
and tj+1 = (j+1)Tn
without the financial market participant
instantaneously re-balancing the replicating portfolio, it is possible that the value of
the portfolio that the participant must purchase at time tj+1, Stj+1atj+1
+ βtj+1btj+1
is greater than or less than the value of the actual portfolio the participant holds
before he/she performs rebalancing transactions, Stj+1atj +βtj+1
btj . We can think of
the rebalancing transaction as the financial market participant selling the portfolio
that he/she held at time tj,(atj , btj
)at the prices at time tj+1 and purchasing
the desired portfolio at time tj+1,(atj+1
, btj+1
)at the same prices. The difference
between the money gained by selling(atj , btj
)and the money necessary to purchase(
atj+1, btj+1
)is what we term the correction over the time interval (tj, tj+1], and is
the way we construct our measure of the deviation of a portfolio from the self-
financing property.
Definition 7.2.1. The correction Cj of a portfolio over the time interval (tj, tj+1]
is given by:
Cj = Stj+1atj+1
+βtj+1btj+1−(Stj+1
atj + βtj+1btj)
= Stj+1
(atj+1
− atj)+βtj+1
(btj+1
− btj).
Cj > 0 indicates that the financial market participant is required to input additional
money from an external source into the portfolio in order perform the required
transaction at time tj+1, and Cj < 0 indicates that the financial market participant
is required to remove money from the portfolio in order perform the required
transaction.
The cumulative correction Ctk of a portfolio up to time tk is given by:
k−1∑j=0
Cj.
Note that Ctn is the cumulative correction of a portfolio up to the maturity T of
55
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
the option. We denote Ctn as C.
The models described in the following sections study the cumulative correc-
tion of different Black-Scholes portfolios in detail. In each of the models, we use
time units such that a time interval of length 1 represents a day.
7.2.3 Cumulative correction of sample paths of St
The first model we have built to study the cumulative correction of Black-Scholes
portfolios simulates one sample path of the stock price process St, and tracks the
cumulative correction of the portfolio over time. The Mathematica code for the
model can be found in Appendix A.1. The model allows for the specification of
K,T, µ, σ, r, and n. It then calculates Stj , atj , btj , for j ∈ 0, 1, . . . , n, and Ctj for
j ∈ 1, 2, . . . , n.
The following is a sample output from the model with
S0 = 1, β0 = 1, K = 1.1, T = 250, n = 250, and annualized µ = .1, σ = .1, r = .05.
Figure 1: Stock price and value of replicating portfolio for given sample path ω0 ofSt
56
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Figure 2: Composition of replicating portfolio over time for same sample path ω0
of St
Figure 3: Cumulative correction over time for same sample path ω0 of St
57
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
The sample path of St that generated this output is the stock price charted
in Figure 1. The composition, value, and cumulative correction of the replicating
Black-Scholes portfolio are charted in Figures 2, 1, 3, respectively. We can see that,
for this sample path of St, ST < K, aT = bT = 0, and the cumulative correction C is
positive, meaning that in order to replicate the European call option, the financial
market participant would have been required to use money from an external source
to construct the portfolio. From this output, we can make the following observation
regarding the cumulative correction and the construction of a replicating portfolio
in discrete time steps: relatively large increases in stock price over a time interval
require relatively large increases in the number of stock units held in the portfolio,
and are more likely to necessitate a positive correction over the interval. This
phenomenon is sensitive to the time remaining to maturity, the stock price relative
to the strike price, and σ.
This model allows us to study the cumulative correction for given sample paths
of St, and provides the fundamental model to perform a numerical analysis of the
cumulative correction over several sample paths, which we describe in the following
section.
7.2.4 Numerical analysis of cumulative correction over several sample
paths
Using the model described in Section 7.2.3 to calculate the cumulative correction
up to time T , C, for several sample paths, we have created a second model in order
to better understand the behavior of C for different values of nT
. The Mathematica
code for this model can be found in A.2. The ratio nT
measures the number of
times the portfolio is re-balanced per day, and intuitively, we expect that as nT
increases, C should be more likely to be close to 0. In order for our discrete
time step model to approximate the continuous time Black-Scholes model for large
n, we would like to be able to show that C is more likely to be close to 0 as
n → ∞. Our model allows for the specification K,T, µ, σ, r, the sequence of n to
be considered, and the number of iterations to be performed for each number n of
time steps. The following output was generated by our second model performing
50 iterations for each n ∈ 30, 300, 600, 900, 1200, 1500, 1800, 2100, 2400, 2700, 3000
58
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
with the parameters:
S0 = 1, β0 = 1, K = 1.02, T = 30, and annualized µ = .07, σ = .1, r = .05.
Figure 4: Cumulative correction over several sample paths and values of Tn
We do see that C shows a clear trend to be more likely to be close to 0 as nT
increases, however the rate at which C becomes more likely to be close to 0 is lower
for larger nT
and this simulation is far from a rigorous proof of the intuition. We
would also like to be able to study the behavior of the cumulative correction for
very large values of nT
, but computing limits necessitate a trade-off between the
number of iterations that can be performed at each nT
and values of nT
that can be
analyzed.
7.2.5 Cumulative correction of portfolios based on historical stock price
data
In order to understand the behavior of the cumulative correction when actually
attempting to create a Black-Scholes replicating portfolio in reality, we have built
a third model that creates a replicating portfolio and documents the cumulative
correction of that portfolio for a given series of historical stock price data. The
59
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Mathematica code for the model can be found in A.3. We use daily close prices of
stocks, and fix n = T , allowing financial market participants to re-balance portfolios
once per day. We create the Black-Scholes portfolio over the course of one year
(T = 250, the approximate number of trading days in a year), with the following
parameters for all examples:
S0 = 1, β0 = 1, T = 250, K = 1.1, and annualized r = .05.
The creation of a Black-Scholes portfolio based on historical price data rather than
based on a geometric Brownian motion St, however, raises a problem because the
Black-Scholes portfolio depends on the parameter σ, the volatility, of the price
process. In order to create the Black-Scholes formula at time t, we need to assume
some σ of the price process for times s > t. We take two different approaches to
calculating an assumed volatility, both involving historical volatility of the stock
in question. The first approach, which we call the constant volatility approach,
calculates the historical volatility of the stock over the six-month (125 day) time
interval immediately prior to t = 0, and assumes that σ takes that constant value
over the remaining time t ∈ [0, T ]. The second approach, which we call the rolling
volatility approach, calculates the rolling historical volatility of the stock for the
six months (125 days) prior to each time t, and uses that to calculate the Black-
Scholes portfolio at t. Calculation of historical volatility (v) follows the convention
in the financial literature which is described in [1] and is performed according to
the following method:
Given daily stock prices st, st+1, . . . st+n, let ri = log(st+i)log(st+i−1)
for i = 1, 2, . . . , n. Let
r = 1n
∑ni=1 ri. Then the daily volatility vdaily is given by
vdaily =
√√√√ 1
n− 1
n∑i=1
(ri − r)2,
and we annualize the volatility by multiplying by√
250,
vannual = σ =√
250vdaily.
60
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
We have performed this analysis for approximately 20 stocks, and the majority
have a positive cumulative correction over the course of a year, but we would
like to highlight some interesting examples that illustrate the spectrum of possible
cumulative corrections. Historical stock price data for these stocks are data series
of daily close prices as reported by Yahoo! Finance.
The first example that we will highlight is Apple, AAPL. The output of the third
model for AAPL is as follows:
Figure 5: AAPL: annualized constant and rolling historical volatility over time
61
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Figure 6: AAPL: stock price and value of replicating portfolios over time
Figure 7: AAPL: composition of replicating portfolios over time
62
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
For the series of historical AAPL data up through April 23, 2012, the model
charts the volatilities in Figure 5, the stock price and value of the replicating
portfolios in Figure 6, the composition of the replicating portfolios in Figure 7,
and the cumulative correction of the portfolios in Figure 8. We observe that both
the constant volatility and the rolling volatility approach to create the Black-Scholes
portfolio require a positive cumulative correction.
Figure 8: AAPL: cumulative correction of replicating portfolios over time
63
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
The second example that we will highlight is Dell, DELL. The output of the
third model for DELL is as follows:
Figure 9: DELL: annualized constant and rolling historical volatility over time
Figure 10: DELL: stock price and value of replicating portfolios over time
64
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Figure 11: DELL: composition of replicating portfolios over time
Figure 12: DELL: cumulative correction of replicating portfolios over time
Similarly to the case with AAPL, for the series of historical DELL data
up through May 1, 2012, the model charts the volatilities in Figure 9, the stock
price and value of the replicating portfolios in Figure 10, the composition of the
replicating portfolios in Figure 11, and the cumulative correction of the portfolios
in Figure 12. We observe that the constant volatility replicating portfolio requires a
positive cumulative correction, and the rolling volatility portfolio requires a negative
cumulative correction.
65
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
The third and final example that we will highlight is Ford, F. The output of
the third model for F is as follows:
Figure 13: F: annualized constant and rolling historical volatility over time
Figure 14: F: stock price and value of replicating portfolios over time
66
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Figure 15: F: composition of replicating portfolios over time
Figure 16: F: cumulative correction of replicating portfolios over time
Similarly to the case with AAPL and DELL, for the series of historical F
data up through May 1, 2012, the model charts the volatilities in Figure 13, the
stock price and value of the replicating portfolios in Figure 14, the composition
of the replicating portfolios in Figure 15, and the cumulative correction of the
portfolios in Figure 16. We observe that both the constant volatility and the
rolling volatility approach to create the Black-Scholes portfolio require a negative
cumulative correction. These three examples (AAPL, DELL, and F) indicate the
range of possible portfolios and cumulative correction that our third model (and
the attempt to replicate the European call option on actual stocks using the Black-
67
7 APPLICATION: BLACK-SCHOLES OPTION PRICING MODEL
Scholes portfolios using historical volatilities) can output.
7.2.6 Summary of study of cumulative correction
In Section 7.2, we have proposed and defined a quantity, the cumulative correction,
to study the amount the discrete time step Black-Scholes portfolio deviates from the
self-financing condition of the continuously rebalanced portfolio. We have described
the results of models that we have built to study the behavior of the cumulative
correction:
1. Studying the cumulative correction for fixed sample paths of the stock price
stochastic process allows us to make qualitative statements regarding the
behavior of the correction over a time interval as it relates to the change in
the stock price over that interval.
2. Numerical analysis of several sample paths at different portfolio rebalancing
frequencies shows that as the frequency of rebalancing the Black-Scholes
replicating portfolio is increased, the cumulative correction is more likely to
be close to 0.
3. While the majority of historical stock return series that we have studied have
caused the discrete time step Black-Scholes replicating portfolio to have a
positive cumulative correction, we have provided examples of a stock that
requires a positive cumulative correction for both approaches used to calculate
volatility (AAPL), a stock that requires a positive cumulative correction for
one approach used to calculate volatility and a negative cumulative correc-
tion for the other approach (DELL), and a stock that requires a negative
cumulative correction for both volatility approaches (F).
68
8 References
[1] Stephen Figlewski. “Forecasting Volatility Using Historical Data”, New
York University Stern School of Business Finance Department, Working
Paper Series, 1994. http://w4.stern.nyu.edu/finance/docs/WP/1994/pdf/
wpa94032.pdf.
[2] G. Grimmett, D. Stirzaker. Probability and Random Processes, Third Edition,
Oxford University Press, 2001.
[3] Andrzej Kozlowski. “Hedging the Black-Scholes Call Option”, from the
Wolfram Demonstrations Project. http://demonstrations.wolfram.com/
HedgingTheBlackScholesCallOption/.
[4] Thomas Mikosch. Elementary Stochastic Calculus with Finance in View,
Advanced Series on Statistical Science & Applied Probability, Volume 6, World
Scientific Publishing Co. Pte. Ltd., 2004.
[5] Y.L. Tong. The Multivariate Normal Distribution, Springer Series in Statistics,
Springer-Verlag New York, 1990.
69
H*
Greg WhiteEuropean Call Replication for Sample Path of Price Process
This model replicates the European call option for a sample path of the geometricBrownian motion stock price process. The calculation of the replicatingportfolio according to the Black-Scholes equation occurs at discrete time steps,
and the correction of the portfolio is documented*L
<< PlotLegends`
thesisPath = "UsersgregwhiteDesktopThesisGreg ThesisMathematica";
H*Parameters- constants and assumptions*LnumIter = 250.0;annualR = 0.05;strike = 1.1;maturity = 250.0;annualMu = 0.1;annualSigma = 0.1;tradingDaysPerYear = 250.0;
H*Calculate sample path of Wiener process*Ldelta = maturity numIter;wiener = RandomVariate@NormalDistribution@0, Sqrt@deltaDD, numIterD;
H*Annualize parameters*Lmu = annualMu tradingDaysPerYear;sigma = annualSigma Sqrt@tradingDaysPerYearD;r = annualR tradingDaysPerYear;
H*Initiate stock price and time data series*LstockPrice = [email protected];times = [email protected];
H*Functions for the Balck-Scholes formula*Lg@t_, x_D :=
HLog@x strikeD + Hr + 0.5 * Hsigma^2.0LL * Hmaturity - tLL Hsigma * Sqrt@maturity - tDL;h@t_, x_D := g@t, xD - sigma * Sqrt@maturity - tD;
H*Initiate replicating portfolio*LstockUnits = List@CDF@NormalDistribution@D, g@times@@1DD, stockPrice@@1DDDDD;bondUnits = List@-strike * Exp@-r * maturityD *
CDF@NormalDistribution@D, h@times@@1DD, stockPrice@@1DDDDD;
H*Initiate replicating correction flow*LcorrectionIncrements = List@0D;correctionSum = List@0D;
H*Initiate the format for the the stock price to be plotted*LpricePlot = List@8times@@1DD, stockPrice@@1DD<D;strikePlot = List@8times@@1DD, strike<D;payoutPlot = List@8times@@1DD, stockPrice@@1DD - strike<D;
H*Initiate the format for the replicating portfolio to be plotted*LstockUnitPlot = List@8times@@1DD, stockUnits@@1DD<D;bondUnitPlot = List@8times@@1DD, bondUnits@@1DD<D;correctionPlot = List@8times@@1DD, correctionSum@@1DD<D;portfolioValuePlot = List@
8times@@1DD, stockUnits@@1DD * stockPrice@@1DD + bondUnits@@1DD * Exp@r * times@@1DDD<D;
H*At each discrete time step,calculate the stock price and solve the Black-Scholes equations,
*L
A APPENDIX: MATHEMATICA MODELS
A Appendix: Mathematica Models
A.1 European call option for GBM sample path
70
calculate the stock price and solve the Black-Scholes equations,creating a replicating portfolio and documenting the correction*LDo@
H*Increment the stockPrice according toa discrete approximation of a geometric Brownian motion*L
times = Append@times, times@@mDD + deltaD;stockPrice =
Append@stockPrice, Exp@HHHmu - 0.5 * Hsigma^2LL * times@@m + 1DDL + sigma * wiener@@mDDLDD;
H*Replicating portfolio construction- g and h yield infinity when t=maturity*LIf@m == numIter, If@stockPrice@@m + 1DD > strike, stockUnits = Append@stockUnits, 1D;
bondUnits = Append@bondUnits, -strike * Exp@-r * maturityDD,stockUnits = Append@stockUnits, 0D; bondUnits = Append@bondUnits, 0DD, stockUnits =
Append@stockUnits, CDF@NormalDistribution@D, g@times@@m + 1DD, stockPrice@@m + 1DDDDD;bondUnits = Append@bondUnits, -strike * Exp@-r * maturityD *
CDF@NormalDistribution@D, h@times@@m + 1DD, stockPrice@@m + 1DDDDDD;
H*Format the stock price to be plotted*LpricePlot = Append@pricePlot, 8times@@m + 1DD, stockPrice@@m + 1DD<D;strikePlot = Append@strikePlot, 8times@@m + 1DD, strike<D;payoutPlot = Append@payoutPlot, 8times@@m + 1DD, stockPrice@@m + 1DD - strike<D;
H*Format the replicating portfolio to be plotted*LstockUnitPlot = Append@stockUnitPlot, 8times@@m + 1DD, stockUnits@@m + 1DD<D;bondUnitPlot = Append@bondUnitPlot, 8times@@m + 1DD, bondUnits@@m + 1DD<D;portfolioValuePlot = Append@portfolioValuePlot, 8times@@m + 1DD,
stockUnits@@m + 1DD * stockPrice@@m + 1DD + bondUnits@@m + 1DD * Exp@r * times@@m + 1DDD<D;
H*Document correction*Lcorrection = stockPrice@@m + 1DD * HstockUnits@@m + 1DD - stockUnits@@mDDL +
Exp@r * times@@m + 1DDD * HbondUnits@@m + 1DD - bondUnits@@mDDL;correctionIncrements = Append@correctionIncrements, correctionD;correctionSum = Append@correctionSum, correctionSum@@mDD + correctionD;correctionPlot = Append@correctionPlot, 8times@@m + 1DD, correctionSum@@m + 1DD<D;
, 8m, numIter<D;
H*Export plots*LExport@FileNameJoin@8thesisPath, "GraphicsAndCode", "generalPriceValue.png"<D,
ListLinePlot@8pricePlot, strikePlot, portfolioValuePlot, payoutPlot<,AxesLabel ® 8"Time HdaysL", $<,PlotLabel ® "Stock Price and Replicating Portfolio Value over Time",PlotLegend ® 8Style@"Stock price", FontSize ® 16D, Style@"Strike price", FontSize ® 16D,
Style@"Replicating portfolio\nvalue", FontSize ® 16D,Style@"Stock price minus\nstrike price", FontSize ® 16D<,
LegendPosition ® 80.85, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", "generalCorrection.png"<D,ListLinePlot@correctionPlot, AxesLabel ® 8"Time HdaysL", $<,
PlotLabel ® "Cumulative Correction of Replicating Portfolio over Time", BaseStyle ®
8FontSize ® 18<, AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", "generalPortfolioComposition.png"<D,ListLinePlot@8stockUnitPlot, bondUnitPlot<, AxesLabel ® 8"Time HdaysL", "Units"<,
PlotLabel ® "Replicating Portfolio Composition over Time",PlotLegend ® 8Style@"Stock units", FontSize ® 16D, Style@"Bond units", FontSize ® 16D<,LegendPosition ® 81.1, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
2 BlackScholesEuropeanCallReplicationvF.nb
A APPENDIX: MATHEMATICA MODELS
71
H*
Greg WhiteNumerical Correction Analysis
This model analyzes the cumulative correction required toreplicate the European call option. The cumulative correction isplotted versus the number of discrete time steps per day for a numberof sample paths of the geometric Brownian motion stock price process.
*L
H*Parameters- constants and assumptions*LnumIter = 50;maturityInterval = 30.0;annualR = 0.05;strike = 1.02;startingMaturity = 30.0;annualMu = 0.07;annualSigma = 0.1;tradingDaysPerYear = 250.0;
H*Generate list of maturities to be analyzed*LlistOfMaturities = List@startingMaturityD;
H*Annualize parameters*Lmu = annualMu tradingDaysPerYear;sigma = annualSigma Sqrt@tradingDaysPerYearD;r = annualR tradingDaysPerYear;
H*Functions for the Black-Scholes formula*Lg@t_, x_, maturity_D :=
HLog@x strikeD + Hr + 0.5 * Hsigma^2.0LL * Hmaturity - tLL Hsigma * Sqrt@maturity - tDL;h@t_, x_, maturity_D := g@t, xD - sigma * Sqrt@maturity - tD;
H*For each maturity to be analyzed,perform iterations of portfolio creation and plot resulting cumulative correction*LDo@
H*Generate the list of numSteps to be analyzed*LstepsPerDayInterval = 10;interval = Floor@stepsPerDayInterval * maturityD;min = 1; max = 10; step = 1;listOfNumSteps = Prepend@Table@i * interval, 8i, min, max, step<D, maturityD;listOfNumStepsPerDay = Prepend@Table@i * stepsPerDayInterval, 8i, min, max, step<D, 1D;
H*Initialize plot of cumulative corrections*LcumulativeCorrectionPlot = List@D;
H*For every value of numSteps, do numIter iterations of portfolio creation*LDo@
Do@
H*Go through one iteration of the portfolio creation*L
delta = maturity numSteps;wiener = Accumulate@RandomVariate@NormalDistribution@0, Sqrt@deltaDD, numStepsDD;
stockPrice = [email protected];times = [email protected];
H*Initiate replicating portfolio*LstockUnits =
List@CDF@NormalDistribution@D, g@times@@1DD, stockPrice@@1DD, maturityDDD;bondUnits = List@-strike * Exp@-r * maturityD *
CDF@NormalDistribution@D, h@times@@1DD, stockPrice@@1DD, maturityDDD;
A APPENDIX: MATHEMATICA MODELS
A.2 Numerical analysis of cumulative correction
72
CDF@NormalDistribution@D, h@times@@1DD, stockPrice@@1DD, maturityDDD;
H*Initiate replicating correction flow*LcorrectionIncrements = List@0D;correctionSum = List@0D;
Do@H*Increment the stockPrice according
to a discrete approximation of a geometric Brownian motion*Ltimes = Append@times, times@@mDD + deltaD;stockPrice = Append@stockPrice,
Exp@HHHmu - 0.5 * Hsigma^2LL * times@@m + 1DDL + sigma * wiener@@mDDLDD;
H*Replicating portfolio construction- g and h yield infinity when t=maturity*LIf@m == numSteps, If@stockPrice@@m + 1DD > strike, stockUnits = Append@stockUnits, 1D;
bondUnits = Append@bondUnits, -strike * Exp@-r * maturityDD,stockUnits = Append@stockUnits, 0D; bondUnits = Append@bondUnits, 0DD,
stockUnits = Append@stockUnits, CDF@NormalDistribution@D,g@times@@m + 1DD, stockPrice@@m + 1DD, maturityDDD;
bondUnits = Append@bondUnits, -strike * Exp@-r * maturityD *
CDF@NormalDistribution@D, h@times@@m + 1DD, stockPrice@@m + 1DD, maturityDDDD;
H*Document correction*Lcorrection = stockPrice@@m + 1DD * HstockUnits@@m + 1DD - stockUnits@@mDDL +
Exp@r * times@@m + 1DDD * HbondUnits@@m + 1DD - bondUnits@@mDDL;correctionIncrements = Append@correctionIncrements, correctionD;correctionSum = Append@correctionSum, correctionSum@@mDD + correctionD;
, 8m, numSteps<D;
H*Add the iteration to the plot*LcumulativeCorrectionPlot =
Append@cumulativeCorrectionPlot, 8numSteps maturity, Last@correctionSumD<D;
, 8numSteps, listOfNumSteps<D;
, 8numIter<D;
H*Plot the cumulative correction versus the number of steps per day for maturity*L
thesisPath = "UsersgregwhiteDesktopThesisGreg ThesisMathematica";
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", "StatisticalCumulative.png"<D,ListPlot@cumulativeCorrectionPlot,
AxesLabel ® 8"Number\nof steps\nper day", "Cumulative correction H$L"<,PlotLabel ® StringJoin@"Cumulative Correction versus Number of Steps per Day over ",
ToString@Floor@maturityDD, " Days"D, BaseStyle ® 8FontSize ® 18<,PlotRange ® 88listOfNumStepsPerDay@@1DD - stepsPerDayInterval,
Last@listOfNumStepsPerDayD + stepsPerDayInterval 2.<, All<,Ticks ® 8listOfNumStepsPerDay@@2 ;;DD, Automatic<D, ImageSize ® 72 * 15D
, 8maturity, listOfMaturities<D;
2 NumericalCorrectionAnalysisvF.nb
A APPENDIX: MATHEMATICA MODELS
73
H*
Greg WhiteReplication of European Call Option using Historical Stock Prices
This model creates the European call option replicating portfolio for a seriesof historical stock price data Hwhich is imported from a csv fileL. Theportfolio composition and correction are documented.
*L
<< PlotLegends`
thesisPath = "UsersgregwhiteDesktopThesisGreg ThesisMathematica";
H*Parameters- constants and assumptions*LannualR = 0.05;strike = 1.1;tradingDaysPerYear = 250.0;r = annualR tradingDaysPerYear;
stockTicker = "DELL";
H*Import historical stock price data*LnumDays = 251;H*Import all stock price data to be used to
calculate historical volatility and stock prices*LhistoricalPrices =
Reverse@Transpose@Import@FileNameJoin@8thesisPath, "HistoricalStockData",StringJoin@stockTicker, "_Historical.csv"D<DDD@@5DD@@2 ;;DDD;
H*stockPrice is the most recent numDays' worth of price data*LstartStockPrice = Length@historicalPricesD - numDays + 1;stockPrice = historicalPrices@@startStockPrice ;;DD;stockPrice = stockPrice stockPrice@@1DD;maturity = Length@stockPriceD - 1;times = Range@0, maturityD;
H*Find historical volatility*LnumMonthsOfVolatility = 6.0;numDaysOfVolatility = Floor@HnumMonthsOfVolatility 12.0L * tradingDaysPerYearD;logarithmReturns = List@D;Do@
logarithmReturns =
Append@logarithmReturns, Log@historicalPrices@@iDD historicalPrices@@i - 1DDDD;, 8i, startStockPrice - numDaysOfVolatility, startStockPrice, 1<D;
constantHistoricalVolatility = StandardDeviation@logarithmReturnsD;
H*Functions for the Black-Scholes formula*Lg@t_, x_, sigma_D :=
HLog@x strikeD + Hr + 0.5 * Hsigma^2.0LL * Hmaturity - tLL Hsigma * Sqrt@maturity - tDL;h@t_, x_, sigma_D := g@t, x, sigmaD - sigma * Sqrt@maturity - tD;
H*Initiate replicating portfolios*LconstantVolStockUnits = List@CDF@NormalDistribution@D,
g@times@@1DD, stockPrice@@1DD, constantHistoricalVolatilityDDD;constantVolBondUnits = List@-strike * Exp@-r * maturityD * CDF@NormalDistribution@D,
h@times@@1DD, stockPrice@@1DD, constantHistoricalVolatilityDDD;
rollingVolStockUnits = List@CDF@NormalDistribution@D,g@times@@1DD, stockPrice@@1DD, constantHistoricalVolatilityDDD;
rollingVolBondUnits = List@-strike * Exp@-r * maturityD * CDF@NormalDistribution@D,h@times@@1DD, stockPrice@@1DD, constantHistoricalVolatilityDDD;
H*Initiate replicating correction flows*LconstantVolCorrectionIncrements = List@0D;constantVolCorrectionSum = List@0D;
A APPENDIX: MATHEMATICA MODELS
A.3 European call option for historical data
74
constantVolCorrectionSum = List@0D;
rollingVolCorrectionIncrements = List@0D;rollingVolCorrectionSum = List@0D;
H*Initiate the format for the the stock price to be plotted*LpricePlot = List@8times@@1DD, stockPrice@@1DD<D;strikePlot = List@8times@@1DD, strike<D;payoutPlot = List@8times@@1DD, stockPrice@@1DD - strike<D;
H*Initiate the format for the replicating portfolios to be plotted*LconstantVolStockUnitPlot = List@8times@@1DD, constantVolStockUnits@@1DD<D;constantVolBondUnitPlot = List@8times@@1DD, constantVolBondUnits@@1DD<D;constantVolCorrectionPlot = List@8times@@1DD, constantVolCorrectionSum@@1DD<D;constantVolPortfolioValuePlot =
List@8times@@1DD, constantVolStockUnits@@1DD * stockPrice@@1DD +
constantVolBondUnits@@1DD * Exp@r * times@@1DDD<D;
rollingVolStockUnitPlot = List@8times@@1DD, rollingVolStockUnits@@1DD<D;rollingVolBondUnitPlot = List@8times@@1DD, rollingVolBondUnits@@1DD<D;rollingVolCorrectionPlot = List@8times@@1DD, rollingVolCorrectionSum@@1DD<D;rollingVolPortfolioValuePlot =
List@8times@@1DD, rollingVolStockUnits@@1DD * stockPrice@@1DD +
rollingVolBondUnits@@1DD * Exp@r * times@@1DDD<D;
H*Initiate the format for the volatilities to be plotted*LconstantVolPlot =
List@8times@@1DD, Sqrt@tradingDaysPerYearD * constantHistoricalVolatility<D;rollingVolPlot = List@8times@@1DD,
Sqrt@tradingDaysPerYearD * constantHistoricalVolatility<D;
Do@H*Calculate new rollingVolatility*LlogarithmReturns =
Append@logarithmReturns, Log@historicalPrices@@startStockPrice + m - 1DD historicalPrices@@startStockPrice + m - 2DDDD;
rollingVolatility = StandardDeviation@logarithmReturns@@-numDaysOfVolatility - 1 ;;DDD;
H*Add the volatilities to the plot*LconstantVolPlot = Append@constantVolPlot,
8times@@mDD, Sqrt@tradingDaysPerYearD * constantHistoricalVolatility<D;rollingVolPlot = Append@rollingVolPlot,
8times@@mDD, Sqrt@tradingDaysPerYearD * rollingVolatility<D;
H*Replicating portfolios construction- g and h yield infinity when t=maturity*LIf@m Length@timesD,
If@stockPrice@@mDD > strike, constantVolStockUnits = Append@constantVolStockUnits, 1D;rollingVolStockUnits = Append@rollingVolStockUnits, 1D;constantVolBondUnits = Append@constantVolBondUnits, -strike * Exp@-r * maturityDD;rollingVolBondUnits = Append@rollingVolBondUnits, -strike * Exp@-r * maturityDD,constantVolStockUnits = Append@constantVolStockUnits, 0D;rollingVolStockUnits = Append@rollingVolStockUnits, 0D;constantVolBondUnits = Append@constantVolBondUnits, 0D;rollingVolBondUnits = Append@rollingVolBondUnits, 0DD,
constantVolStockUnits = Append@constantVolStockUnits, CDF@NormalDistribution@D,g@times@@mDD, stockPrice@@mDD, constantHistoricalVolatilityDDD;
rollingVolStockUnits = Append@rollingVolStockUnits, CDF@NormalDistribution@D,g@times@@mDD, stockPrice@@mDD, rollingVolatilityDDD; constantVolBondUnits =
Append@constantVolBondUnits, -strike * Exp@-r * maturityD * CDF@NormalDistribution@D,h@times@@mDD, stockPrice@@mDD, constantHistoricalVolatilityDDD;
rollingVolBondUnits = Append@rollingVolBondUnits, -strike * Exp@-r * maturityD *
CDF@NormalDistribution@D, h@times@@mDD, stockPrice@@mDD, rollingVolatilityDDDD;
H*Format the stock price to be plotted*LpricePlot = Append@pricePlot, 8times@@mDD, stockPrice@@mDD<D;
;
2 BlackScholesEuropeanCallHistoricalvF.nb
A APPENDIX: MATHEMATICA MODELS
75
pricePlot = Append@pricePlot, 8times@@mDD, stockPrice@@mDD<D;strikePlot = Append@strikePlot, 8times@@mDD, strike<D;payoutPlot = Append@payoutPlot, 8times@@mDD, stockPrice@@mDD - strike<D;
H*Format the replicating portfolios to be plotted*LconstantVolStockUnitPlot =
Append@constantVolStockUnitPlot, 8times@@mDD, constantVolStockUnits@@mDD<D;constantVolBondUnitPlot = Append@constantVolBondUnitPlot,
8times@@mDD, constantVolBondUnits@@mDD<D;constantVolPortfolioValuePlot = Append@constantVolPortfolioValuePlot,
8times@@mDD, constantVolStockUnits@@mDD * stockPrice@@mDD +
constantVolBondUnits@@mDD * Exp@r * times@@mDDD<D;
rollingVolStockUnitPlot =
Append@rollingVolStockUnitPlot, 8times@@mDD, rollingVolStockUnits@@mDD<D;rollingVolBondUnitPlot = Append@rollingVolBondUnitPlot,
8times@@mDD, rollingVolBondUnits@@mDD<D;rollingVolPortfolioValuePlot = Append@rollingVolPortfolioValuePlot,
8times@@mDD, rollingVolStockUnits@@mDD * stockPrice@@mDD +
rollingVolBondUnits@@mDD * Exp@r * times@@mDDD<D;
H*Document corrections*LconstantVolCorrection =
stockPrice@@mDD * HconstantVolStockUnits@@mDD - constantVolStockUnits@@m - 1DDL +
Exp@r * times@@mDDD * HconstantVolBondUnits@@mDD - constantVolBondUnits@@m - 1DDL;constantVolCorrectionIncrements = Append@constantVolCorrectionIncrements,
constantVolCorrectionD;constantVolCorrectionSum = Append@constantVolCorrectionSum,
constantVolCorrectionSum@@m - 1DD + constantVolCorrectionD;constantVolCorrectionPlot = Append@constantVolCorrectionPlot,
8times@@mDD, constantVolCorrectionSum@@mDD<D;
rollingVolCorrection =
stockPrice@@mDD * HrollingVolStockUnits@@mDD - rollingVolStockUnits@@m - 1DDL +
Exp@r * times@@mDDD * HrollingVolBondUnits@@mDD - rollingVolBondUnits@@m - 1DDL;rollingVolCorrectionIncrements = Append@rollingVolCorrectionIncrements,
rollingVolCorrectionD;rollingVolCorrectionSum = Append@rollingVolCorrectionSum,
rollingVolCorrectionSum@@m - 1DD + rollingVolCorrectionD;rollingVolCorrectionPlot = Append@rollingVolCorrectionPlot,
8times@@mDD, rollingVolCorrectionSum@@mDD<D;
, 8m, 2, Length@timesD<D;
H*Export plots*L
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", StringJoin@stockTicker, "PriceValue.png"D<D,
ListLinePlot@8pricePlot, strikePlot, constantVolPortfolioValuePlot,rollingVolPortfolioValuePlot, payoutPlot<, AxesLabel ® 8"Time HdaysL", $<, PlotLabel ®
StringJoin@stockTicker, " Price and Replicating Portfolio Value over Time"D,PlotLegend ® 8Style@StringJoin@stockTicker, " price"D, FontSize ® 16D,
Style@"Strike price", FontSize ® 16D, Style@"Constant volatility\nportfolio value",FontSize ® 16D, Style@"Rolling volatility\nportfolio value", FontSize ® 16D,
Style@StringJoin@stockTicker, " price\nminus strike\nprice"D, FontSize ® 16D<,LegendPosition ® 80.85, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", StringJoin@stockTicker, "Correction.png"D<D,
ListLinePlot@8constantVolCorrectionPlot, rollingVolCorrectionPlot<,AxesLabel ® 8"Time HdaysL", $<, PlotLabel ®
StringJoin@stockTicker, " Cumulative Correction of Replicating Portfolio over Time"D,
BlackScholesEuropeanCallHistoricalvF.nb 3
A APPENDIX: MATHEMATICA MODELS
76
StringJoin@stockTicker, " Cumulative Correction of Replicating Portfolio over Time"D,PlotLegend ® 8Style@"Constant volatility\ncorrection", FontSize ® 16D,
Style@"Rolling volatility\ncorrection", FontSize ® 16D<,LegendPosition ® 81.1, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
Export@FileNameJoin@8thesisPath, "GraphicsAndCode", StringJoin@stockTicker, "Volatilities.png"D<D,
ListLinePlot@8constantVolPlot, rollingVolPlot<,AxesLabel ® 8"Time HdaysL",<, PlotLabel ®
StringJoin@stockTicker, " Annualized Constant and Rolling Volatility over Time"D,PlotLegend ® 8Style@"Constant\nvolatility", FontSize ® 16D,
Style@"Rolling\nvolatility", FontSize ® 16D<,LegendPosition ® 81.1, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
Export@FileNameJoin@8thesisPath, "GraphicsAndCode",StringJoin@stockTicker, "PortfolioComposition.png"D<D, ListLinePlot@
8constantVolStockUnitPlot, constantVolBondUnitPlot, rollingVolStockUnitPlot,rollingVolBondUnitPlot<, AxesLabel ® 8"Time HdaysL", "Units"<,
PlotLabel ® StringJoin@stockTicker, " Replicating Portfolio Composition over Time"D,PlotLegend ® 8Style@"Constant volatility\nstock units", FontSize ® 16D,
Style@"Constant volatility\nbond units", FontSize ® 16D,Style@"Rolling volatility\nstock units", FontSize ® 16D,Style@"Rolling volatility\nbond units", FontSize ® 16D<,
LegendPosition ® 81.1, -0.4<, LegendShadow ® None, BaseStyle ® 8FontSize ® 18<,AspectRatio ® 7.5 10.0, PlotRange ® AllD, ImageSize ® 72 * 15D;
4 BlackScholesEuropeanCallHistoricalvF.nb
A APPENDIX: MATHEMATICA MODELS
77