View
249
Download
1
Tags:
Embed Size (px)
Citation preview
Mathematical Modelling of Biological Networks
Prof:Rui [email protected]
973702406Dept Ciencies Mediques Basiques,
1st Floor, Room 1.08Website of the
Course:http://web.udl.es/usuaris/pg193845/Courses/Bioinformatics_2007/ Course: http://10.100.14.36/Student_Server/
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalisms
• Creating and studying a mathematical model
Predicting protein networks using protein interaction data
Database of protein
interactions
Server/
Program
Your Sequence (A)
A
BC
D E
FContinue until you are satisfied
You do not need to go beyond this type of representation in Task 5!!!
Clear network representation is fundamental for clarity of analysis
A B
What does this mean?
Possibilities:
AB
Function
BA
Function
AB
Function
A B
Function
BA
Function
Having precise network representations is important
• We need to know exactly what is being represented, not just that A and B sort of interact in some way.
• This means that it is important to develop or use network representations that are accurate and in which a given element has a very specific meaning.
• Accurate computer representations and human readable representations are not necessarily the same.
Computer readable representations
• SBML (1999)
• CELLML (1999)
• BIOPAX (2002)
• Etc.There must be representations that are easier for humans to use.Let us take a look at one that chemist have been using for a century.
Defining network conventions
A B
C
Full arrow represents a flux of material between A and B
Dashed arrow represents modulation of a flux
+
Dashed arrow with a plus sign represents positive modulation of a flux
-
Dashed arrow with a minus sign represents negative modulation of a flux
A and B – Dependent Variables
(Change over time)
C – Independent variable
(constant value)
Defining network conventions
A B
C
Stoichiometric information needs to be included
Dashed arrow represents modulation of a flux
+
Dashed arrow with a plus sign represents positive modulation of a flux
Dashed arrow with a minus sign represents negative modulation of a flux
23 D+
Reversible Reaction
Defining network conventions
B
C
Stoichiometric information needs to be included
Dashed arrow represents modulation of a flux
+
Dashed arrow with a plus sign represents positive modulation of a flux
Dashed arrow with a minus sign represents negative modulation of a flux
2 A3 D
Renaming Conventions
C
Having too many names or names that are closely related may complicate interpretation and set up
of the model.
Therefore, using a structured nomenclature is important for book keeping
Let us call Xi to variable i
AB
DX3
X1X2
X4
Test Cases: Metabolic Pathway
1 – Metabolite 1 is produced from metabolite 0 by enzyme 1
2 – Metabolite 2 is produced from metabolite 1 by enzyme 2
3 – Metabolite 3 is produced from metabolite 2 by enzyme 3
4 – Metabolite 4 is produced from metabolite 3 by enzyme 4
5 – Metabolite 5 is produced from metabolite 3 by enzyme 5
6 – Metabolites 4 and 5 are consumed outside the system
7 – Metabolite 3 inhibits action of enzyme 1
8 – Metabolite 4 inhibits enzyme 4 and activates enzyme 5
9 – Metabolite 5 inhibits enzyme 5 and activates enzyme 4
Test Cases: Gene Circuit1 – mRNA is synthesized from nucleotides
2 – mRNA is degraded
3 – Protein is produced from amino acids
4 – Protein is degraded
5 – DNA is needed for mRNA synthesis and it transmits information for that synthesis
6 – mRNA is needed for protein synthesis it transmits information for that synthesis
7 – Protein is a transcription factor that negatively regulates expression of the mRNA
7 – Lactose binds the protein reversibly, with a stoichiometry of 1 and creates a form of the protein that does not bind DNA.
Test Cases: Signal transduction pathway
1 – 2 step phosphorylation cascade
2 – Receptor protein can be in one of two forms depending on a signal S (S activates R)
3 – Receptor in active form can phosphorylate a MAPKKK.
4 – MAPKKK can be phosphorylated in two different residues; both can be phosphorylated simultaneously
5 – MAPKK can be phosphorylated in two different residues; both can be phosphorylated simultaneously
6 – Residue 1 of MAPKK can only be phosphorylated if both residues of MAPKKK are phosphorylated
7 – Residue 2 of MAPKKK can be phosphorylated if one and only one of the residues of MAPKKK are phosphorylated.
8 – All phosphorylated residues can loose phosphate spontaneously
9 – Active R inactivates over time spontaneously
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalisms
• Creating and studying a mathematical model
In silico networks are limited as predictors of physiological behavior
What happens?
Probably a very sick mutant?
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalisms
• Creating and studying a mathematical model
Types of Model
• Finite State Models– Bolean Network Models
• Stoichiometric Models– Flux balance analysis models
• Deterministic Models– Homogeneous– Spatial Detail
• Stochastic Models– Homogeneous– Spatial Detail
Finite State models
• A Finite state model is composed of – a set of nodes that are connected by – a set of edges. – Each node can have a finite number of
states and the – rules for changing these states with time
are transmitted through the edges and based on the state of the neighbors.
Boolean Networks
• A Boolean network model is composed of – a set of nodes that are connected by – a set of edges. – each node can have TWO states – the rules for changing these states with
time are transmitted through the edges and based on the state of the neighbors.
Boolean Networks are usefull
• They can give you information about the connectivity of your metabolism or gene circuit
• What you organism can or can not do may also depend on the connectivity of the regulation
Simple Finite State Gene Circuit
A B C
A – Positively regulates itself and b
Negatively regulates C
B – Positively regulates itself and c
C – Positively regulates itself and b
Negatively regulates A
Regulation of the Circuit
A
B
C
A – Positively regulates itself and b
Negatively regulates C
B – Positively regulates itself and c
C – Positively regulates itself and b
Negatively regulates A
+ + _+ +
+
_
+
Threshold of expression in circuit
• A,B, C can have three levels of expression (0,1,2)
• Regulation of A or C occurs whenever a gene is in or above level 1
• Regulation of B occurs whenever a gene is in or above level 1
Logical rules for time change
Level of expression in the absence of all regulators
Level of expression in the presence of A, B or C
Level of expression of A,B, C in the presence of A and B,
A and C or B and C
Level of expression of A,B, C in the presence of A, B and C
Stoichiometric Models
• A Stoichiometric model is composed of – a stoichiometric matrix that informs on the
number of molecules that are transformed – a flux vector that describes the rates of
change in the system
A simple stoichiometric model
A model system comprising three metabolites (A, B and C) with three reactions (internal fluxes, vi
including one reversible reaction) and three exchange fluxes (bi).
Mass balance
• Stoichiometric matrix S
• Flux matrix v
• S · v = 0 in steady state. Mass balance equations
accounting for all reactions and
transport mechanisms are written for each species. These
equations are then rewritten in matrix
form. At steady state, this reduces
to S · V=0.
Deterministic Models
• A deterministic model is an extension of a stoichiometric model. It is formed of – a stoichiometric matrix that informs on the
number of molecules that are transformed – a flux vector that describes the rates of
change in the system. The functions are continuous: dX/dt= S · v
X3
X1
X2
X4
X5 X6
A model with a bifunctional sensor
3/ 1/
4 / 2 /
dX dt dX dt
dX dt dX dt
13 151
11 121
1/ 3 5
1 2
g g
h h
dX dt X X
X X
Bifunctional Sensor O(rdinary)D(iferential)E(quation)s
21 26 224 22
2322 / 1 6 4 2 3g h hg g XdX dt X X X X
Homogenous Deterministic Models
• Assume system is spatially homogeneous
• If not use P(artial)D(ifferential)E(quations)s
Partial Differential Equations - Diffusion
∂
accumulation of [S] due to
transport
dAnJSJ
Rate of change of [S]
local production
of [S]
dVSdt
d
dVfS
f
dVJ
]S[Dft
]S[
]S[DJ From the definition of flux
For one-dimensional space,
x
uD
xu,tf
t
u
X
XX2
2
VVgx
Ddt
dVC
E.g. Extending the Hodgkin-Huxley model to voltage spread:
Non-Homogenous Deterministic Models
• Numerical solution is effectively done by coupling many systems of ODEs
• Much heavier computationally
What do deterministic models assume?
• That the number of particle changes in a continuum
• Is this true? Can we have 1 and half molecules of A?– NO!!!!!
• Now what?!?!?!
What do deterministic models assume?
• Law of large numbers (statistics)– The larger the population, the better the mean
value is as a representation of the population – What if there are 10 TFs in a cell?
– Stochastic models, either homogeneous or non-homogeneous!!!
Stochastic Models
• Replace continuous assumptions by discrete events
• Use rate constants as measures of probability
• Assume that at any give sufficiently small time interval only one event occurs
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalisms
• Creating and studying a mathematical model
Goals of the model (I)
• Large scale modeling– Reconstructing the full network of the genome– Red Blood Cell Metabolism
• Modeling Specific Pathways/Circuits– Non-catalytic lipid peroxidation– MAPK Pathways
• Generating alternative hypothesys for the topology of the model.– ISC Reconstruction– Phosphate metabolism reconstruction
Goals of the model (II)
• Estimating parameter values– Estimating parameter values in the purine
metabolism
• Identifying Design Principles– Latter
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalism
• Creating and studying a mathematical model
Representing the time behavior of your system
/dA dt
/ ,dA
dA dt A f A Cdt
/
dAdA dt
dt
A B
C
+
/dA
dA dt Adt
What is the form of the function?
A B
C
+
A or C
Flux1 2k A k CLinear 1 2
1 2 3 4
k A k C
K K A K C K AC
Saturating
4 41 2
44 41 2 3 4
k A k C
K K A K C K AC
Sigmoid
What if the form of the function is unknown?
A B
C
+
int intintint int
2 2
2int int
int
, ,, ,
, ,
operating operatingoperatingpo popo operating operating
po po
operating operatingpo pooperating
po
df A C df A CdAf A C f A C A A C C
dt dA dC
d f A C d f A CA A C C
dAdC d C
2
int
int
2 2
2int
int
,...
operatingpooperating
po
operatingpooperating
po
C C
d f A CA A
d A
Taylor Theorem:
f(A,C) can be written as a polynomial function of A and C using the function’s
mathematical derivatives with respect to the variables (A,C)
Are all terms needed?
A B
C
+
int
intint
intint
, ,
,
,
operatingpo
operatingpooperating
po
operatingpooperating
po
dAf A C f A C
dt
df A CA A
dA
df A CC C
dC
f(A,C) can be approximated by considering only a few of its mathematical derivatives
with respect to the variables (A,C)
Linear approximation
A B
C
+
1 20, ,
dAf A C f A C k A k C
dt
Taylor Theorem:
f(A,C) is approximated with a linear function by its first order derivatives with respect to
the variables (A,C)
1 2k A k CLinear
Logarithmic space is non-linear
A B
C
+
1 2, g gdAf A C A C
dt
g<0 inhibits flux
g=0 no influence on flux
g>0 activates flux
Use Taylor theorem in Log space
Why log space?
• Intuitive parameters• Simple, yet non-linear• Convex representation in cartesian
space• Linearizes exponential space
–Many biological processes are close to exponential → Linearizes mathematics
Why is formalism important?
• Reproduction of observed behavior– For example, inverse space may be better for
some models.
• Tayloring of numerical methods to specific forms of mathematical equations
Inverse space is non-linear
A B
C
+
K<0 inhibits flux
K=0 no influence on flux
K>0 activates flux
Use Taylor theorem in inverse space
Organization of the talk
• Network representations
• From networks to physiological behavior
• Types of models
• Types of problems
• Mathematical formalism
• Creating and studying a mathematical model
A model of a biosynthetic pathway
10 13 111 1 0 3 1 1/ g g hdX dt X X X
11 222 1 1 2 2/ h hdX dt X X
X0 X1
_
+
X2 X3
X4
22 33 343 2 2 3 3 4/ h h hdX dt X X X
Constant
Protein using X3
What can you learn?
• Steady state response
• Long term or homeostatic systemic behavior of the network
• Transient response
• Transient or adaptive systemic behavior of the network
What else can you learn?
• Sensitivity of the system to perturbations in parameters or conditions in the medium
• Stability of the homeostatic behavior of the system
• Understand design principles in the network as a consequence of evolution
Steady state response analysis
10 13 111 1 0 3 1 1/ 0g g hdX dt X X X
11 222 1 1 2 2/ 0h hdX dt X X
22 33 343 2 2 3 3 4/ 0h h hdX dt X X X
How is homeostasis of the flux affected by changes in X0?
0 3 0 10 33 13( , ) ( , ) /L V X L X X g h g
Log[X0]
Log[V]
Low g10
Medium g10
Large g10
Increases in X0 always increase the homeostatic values of the flux through the pathway
How is flux affected by changes in demand for X3?
4 13 34 13 33( , ) / 0L V X g h g h
Log[X4]
Log[V]Large g13
Medium g13
Low g13
How is homeostasis affected by changes in demand for X3?
3 4 4 13 34 13 33( , ) ( , ) / / 0L X X L V X g h g h
Log[X4]
Log[X3]
Low g13
Medium g13
Large g13
What to look for in the analysis?
• Steady state response
•Long term or homeostatic systemic behavior of the network
• Transient response
•Transient of adaptive systemic behavior of the network
Transient response analysis
10 13 111 1 0 3 1 1/ g g hdX dt X X X
11 222 1 1 2 2/ h hdX dt X X
22 33 343 2 2 3 3 4/ h h hdX dt X X X
Solve numerically
Specific adaptive response10 13 11
1 1 0 3 1 1/ g g hdX dt X X X 11 22
2 1 1 2 2/ h hdX dt X X 22 33 34
3 2 2 3 3 4/ h h hdX dt X X X
Get parameter valuesGet concentration
valuesSubstitution
Solve numerically
Time
[X3]
Change in X4
General adaptive response10 13 11
1 1 0 3 1 1/ g g hdX dt X X X 11 22
2 1 1 2 2/ h hdX dt X X 22 33 34
3 2 2 3 3 4/ h h hdX dt X X X Normalize
Solve numerically with comprehensive scan of parameter values
Time
[X3]
Increase in X4
Low g13
Increasing g13
Threshold g13
High g13
Unstable system, uncapable of homeostasis if feedback is strong!!
Sensitivity analysis
• Sensitivity of the system to changes in environment–Increase in demand for product causes increase in flux through pathway
–Increase in strength of feedback increases response of flux to demand
–Increase in strength of feedback decreases homeostasis margin of the system
Stability analysis
• Stability of the homeostatic behavior
–Increase in strength of feedback decreases homeostasis margin of the system
How to do it
• Download programs/algorithms and do it– PLAS, GEPASI, COPASI SBML suites,
MatLab, Mathematica, etc.
• Use an on-line server to build the model and do the simulation– V-Cell, Basis
Design principles
•Why are there alternative designs of the same pathway?
•Dual modes of gene control
•Why is a given pathway design prefered over others?
•Overall feedback in biosynthetic pathways
Why regulation by overall feedback?
X0 X1
_
+
X2 X3
X4
X0 X1
_
+
X2 X3
X4
__
Overall feedback
Cascade feedback
Overall feedback improves functionality of the system
TimeSpurious stimulation
[C]Overall
Cascade
Proper stimulus
Overall
Cascade
[C]
StimulusOverall
Cascade
Alves & Savageau, 2000, Biophys. J.
Demand theory of gene control
Wall et al, 2004, Nature Genetics Reviews
• High demand for gene expression→ Positive Regulation
• Low demand for gene expression → Negative mode of regulation