233
/183 Natalio Krasnogor ASAP - Interdisciplinary Optimisation Laboratory School of Computer Science Centre for Integrative Systems Biology School of Biosciences Centre for Healthcare Associated Infections Institute of Infection, Immunity & Inflammation University of Nottingham Synthetic Biology 1 Copyright is held by the author/owner(s). GECCO’10, July 7–11, 2010, Portland, Oregon, USA. ACM 978-1-4503-0073-5/10/07.

Computational Synthetic Biology

Embed Size (px)

DESCRIPTION

These slides were used for a tutorial I gave at GECCO 2010. These are similar, yet not identical, to the other tutorials. The keynote file is too large for slideshare but if anybody needs the original I would be happy to provide a url from where to download it.

Citation preview

Page 1: Computational Synthetic Biology

/183

Natalio KrasnogorASAP - Interdisciplinary Optimisation LaboratorySchool of Computer Science

Centre for Integrative Systems BiologySchool of Biosciences

Centre for Healthcare Associated InfectionsInstitute of Infection, Immunity & Inflammation

University of Nottingham

Synthetic Biology

1

Copyright is held by the author/owner(s).GECCO’10, July 7–11, 2010, Portland, Oregon, USA.ACM 978-1-4503-0073-5/10/07.

Page 2: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•A Note on Ethical, Social and Legal Issues

•Conclusions2

Page 3: Computational Synthetic Biology

L. Cronin, N. Krasnogor, B. G. Davis, C. Alexander, N. Robertson, J.H.G. Steinke, S.L.M. Schroeder, A.N. Khlobystov, G. Cooper, P. Gardner, P. Siepmann, and B. Whitaker. The imitation game - a computational chemical approach to recognizing life. Nature Biotechnology, 24:1203-1206, 2006.

/183

The Far Future

3

Chells & The Cellularity Scale

Page 4: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•A Note on Ethical, Social and Legal Issues

•Conclusions4

Page 5: Computational Synthetic Biology

/183

Essential Systems Biology•The following slides are based on U. Alon’s papers & excellent introductory text book:

“An Introduction to Systems Biology: Design Principles of Biological Circuits”

•Also, you may want to check his group’s webpage for up-to-date papers/software:

http://www.weizmann.ac.il/mcb/UriAlon/

5

Page 6: Computational Synthetic Biology

/183

The Cell as an Information Processing Device

LeDuc et al. Towards an in vivo biologically inspired nanofactory. Nature (2007)

6

Page 7: Computational Synthetic Biology

/183

•The Cell senses the environment and its own internal states•Makes Plans, Takes Decisions and Act•Evolution is the master programmer

7

The Cell as an Intelligent (Evolved) Machine

Cell

Internal States

Environmental Inputs

Actions

Amir Mitchell, et al., Adaptive prediction of environmental changes by microorganisms. Nature June 2009.

Page 8: Computational Synthetic Biology

/183

•The Cell senses the environment and its own internal states•Makes Plans, Takes Decisions and Act•Evolution is the master programmer

7

The Cell as an Intelligent (Evolved) Machine

Cell

Internal States

Environmental Inputs

Actions

Amir Mitchell, et al., Adaptive prediction of environmental changes by microorganisms. Nature June 2009.

Wikimedia Commons

Page 9: Computational Synthetic Biology

/183

Transcription Networks

8

Gene1 Gene2 Gene3 GenekGenome

Transcription Factors

Signal2 Signal5Signal1 Signal3 Signal4 Signaln...Environment

Page 10: Computational Synthetic Biology

/1839

Page 11: Computational Synthetic Biology

/183

The Basic Unit: A Gene’s Transcription Regulation Mechanics

10

Gene YDNA

Promoter

Gene Y

mRNAtranscription

translation

Protein Y

RNA Polymerase

Y

Gene YDNA

PromoterY

X binding site

X

Gene Y

mRNA+ transcription

+ translationProtein Y

Activator X

Si

Bound activator

YXSi

Bound activator

no transcription

Bound activator

Unbound repressor X

mRNAtranscription

translationProtein Y

Page 12: Computational Synthetic Biology

/183

Network Motifs: Evolution’s Preferred Circuits•Biological networks are complex and vast•To understand their functionality in a scalable way one must choose the correct abstraction

•Moreover, these patterns are organised in non-trivial/non-random hierarchies

•Each network motif carries out a specific information-processing function

11

“Patterns that occur in the real network significantly more often than in randomized networks are called network motifs” Shai S. Shen-Orr et al., Network motifs in the transcriptional regulation

network of Escherichia coli. Nature Genetics 31, 64 - 68 (2002)

Radu Dobrin et al., Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network. BMC Bioinformatics. 2004; 5: 10.

Page 13: Computational Synthetic Biology

x(t) = xste!!"t

dx

dt= ! ! " " x

t 12

=log 2!

xst =!

"

/18312

Y positively regulates X

Negative autoregulation

Positive autoregulation

Negative autoregulation

Simple regulation

Positive autoregulation

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

Y

Page 14: Computational Synthetic Biology

/18313

Shai S. Shen-Orr et al., Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31, 64 - 68 (2002)

A general transcription factor regulating a second TF, called specific TF, such that both regulate effector operon Z.

In a coherent FFL, the direct effect of the general transcription factor (X) has the same sign (+/-) than the indirect net effect through Y in the effector operon.

If the arrow from X to Z has different sign than the internal ones then the loop is an incoherent FFL

Page 15: Computational Synthetic Biology

/18314

most commonin E. Coli & S. Cerevisiae

The C1-FFL is a ‘sign-sensitive delay’ element and a persistence detector.

The I1-FFL is a pulse generator and response accelerator

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

Page 16: Computational Synthetic Biology

/18315

The C1-FFL is a ‘sign-sensitive delay’ element and a persistence detector.

If the integration function is “OR” (rather than “AND”), C1-FFL has now delay after stimulation by Sx but, instead, manifests the delay when the stimulation stops.

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

Page 17: Computational Synthetic Biology

/18316

The I1-FFL is a pulse generator and response accelerator

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

Page 18: Computational Synthetic Biology

/18317

Shai S. Shen-Orr et al., Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31, 64 - 68 (2002)

SIM is defined by one TF controlling a set of operons, with the same signs and no additional control.

TFs in SIMs are mostly negative autoregulatory (70% in E. coli)

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

As the activity of the master regulator X changes in time, it crosses the different activation threshold of the genes in the SIM at different times, this prioritizing the activation of the operons

Page 19: Computational Synthetic Biology

/18318

U. Alon. Network motifs: theory and experimental approaches. Nature Reviews Genetics (2007) vol. 8 (6) pp. 450-461

Page 20: Computational Synthetic Biology

/18319

Shai S. Shen-Orr et al., Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31, 64 - 68 (2002)

DORs are layers of dense sets of TFs affecting multiple operons.

To understand the specific function of these “gate-arrays” one needs to know the input functions (AND/OR) for each output gene. This data is not currently available in most cases.

Page 21: Computational Synthetic Biology

/18320

Shai S. Shen-Orr et al., Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31, 64 - 68 (2002)

•The correct abstract ions facilitates understanding in complex systems.

•Provide a route to engineering & programming cells.

Page 22: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•Conclusions

21

Page 23: Computational Synthetic Biology

/183

What is Synthetic Biology

Synthetic Biology is

A) the design and construction of new biological parts, devices, and systems, and

B) the re-design of existing, natural biological systems for useful purposes.

22

http://syntheticbiology.org/

Page 24: Computational Synthetic Biology

/183

What is Synthetic Biology

Synthetic Biology is

A) the design and construction of new biological parts, devices, and systems, and

B) the re-design of existing, natural biological systems for useful purposes.

22

http://syntheticbiology.org/

C) Through rigorous mathematical, computational & engineering routes

Page 25: Computational Synthetic Biology

/183

Synthetic Biology• Aims at designing, constructing and developing artificial biological systems •Offers new routes to ‘genetically modified’ organisms, synthetic living entities, smart drugs and hybrid computational-biological devices.

• Potentially enormous societal impact, e.g., healthcare, environmental protection and remediation, etc

• Synthetic Biology's basic assumption:•Methods readily used to build non-biological systems could also be use to specify, design, implement, verify, test and deploy novel synthetic biosystems. •These method come from computer science, engineering and maths.•Modeling and optimisation run through all of the above.

23

Page 26: Computational Synthetic Biology

/183

Sys

tem

s B

iolo

gy

Syn

thet

ic B

iolo

gy

Basic goal: to clarify current understandings by formalising what the constitutive elements of a system are and how they interact

Intermediate goal: to test current understandings against experimental data

Advanced goal: to predict beyond current understanding and available data

Dream goal: (1) to combinatorially combine in silico well-understood

components/models for the design and generation of novel experiments and hypothesis and ultimately

(2) to design, program, optimise & control (new) biological systems

24

Synthetic & Systems Biology: Sisters Disciplines

Page 27: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosaDiscosoma sp.

Aequoreavictoria

25

Page 28: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosaDiscosoma sp.

Aequoreavictoria

25

Page 29: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosa

plasmids

Discosoma sp.

Aequoreavictoria

25

Page 30: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosa

plasmids

DNA synthesis

Discosoma sp.

Aequoreavictoria

25

Page 31: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosa

plasmids

DNA synthesis

Discosoma sp.

Aequoreavictoria

Chassis

25

Page 32: Computational Synthetic Biology

/183 20

Top-Down Synthetic Biology: An Approach to Engineering Biology

Cells are information processors. DNA is their programming language. DNA sequencing and PCR: Identification and isolation of cellular parts.

Recombinant DNA and DNA synthesis : Combination of DNA and construction of new systems.

Tools to make biology easier to engineer: Standardisation, encapsulation and abstraction (blueprints).

E. coli

Vibrio fischeri

Pseudomonas aeruginosa

plasmids

DNA synthesis

Discosoma sp.

Aequoreavictoria

Circuit BlueprintChassis

25

Page 33: Computational Synthetic Biology

/183

Synthetic Biology’s Brick & Mortar (I)

26

D. Sprinzak & M.B. Elowitz (2005). Reconstruction of genetic circuits, Nature 438:24, 443-448.

Page 34: Computational Synthetic Biology

/18327

Synthetic Biology’s Brick & Mortar (II)

D. Sprinzak & M.B. Elowitz (2005). Reconstruction of genetic circuits, Nature 438:24, 443-448.

Page 35: Computational Synthetic Biology

/183

Example I: Elowitz & Leibler Represilator

28

M.B. Elowitz & S. Leibler (2000). A Synthetic Oscilatory Network of Transcriptional Regulators. Nature, 403:20, 335-338

Page 36: Computational Synthetic Biology

/18329

An Example: Elowitz & Leibler Represilator

M.B. Elowitz & S. Leibler (2000). A Synthetic Oscillatory Network of Transcriptional Regulators. Nature, 403:20, 335-338

Page 37: Computational Synthetic Biology

/183

Example II: Combinatorial Synthetic Logic

30

C.C. Guet et al., Combinatorial Synthesis of Genetic Networks, Science 296, 1466-1470, 2002

Page 38: Computational Synthetic Biology

/183

Example II: Combinatorial Synthetic Logic

31

C.C. Guet et al., Combinatorial Synthesis of Genetic Networks, Science 296, 1466-1470, 2002

Page 39: Computational Synthetic Biology

/18332

Example II: Combinatorial Synthetic Logic

C.C. Guet et al., Combinatorial Synthesis of Genetic Networks, Science 296, 1466-1470, 2002

Page 40: Computational Synthetic Biology

/183

Example III: Push-on/Push-off circuit

33

C. Lou et al., Synthesizing a novel genetic sequential logic circuit: a push-on push-off switch, Molecular Systems Biology 6; Article number 350

Page 41: Computational Synthetic Biology

/183

Example III: Push-on/Push-off circuit

34

C. Lou et al., Synthesizing a novel genetic sequential logic circuit: a push-on push-off switch, Molecular Systems Biology 6; Article number 350

Page 42: Computational Synthetic Biology

/183

• Two different bacterial strains carrying specific synthetic gene regulatory networks are used.

• The first strain produces a diffusible signal AHL.

• The second strain possesses a synthetic gene regulatory network which produces a pulse of GFP after AHL sensing within a range of values (Band Pass).

Axample IV: Ron Weiss' Pulse Generator

S. Basu, R. Mehreja, et al. (2004) Spatiotemporal control of gene expression with pulse generating networks, PNAS, 101, 6355-6360

35

Page 43: Computational Synthetic Biology

/18336

Page 44: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•Conclusions

37

Page 45: Computational Synthetic Biology

/183

What is modeling?

• Is an attempt at describing in a precise way an understanding of the elements of a system of interest, their states and interactions

• A model should be operational, i.e. it should be formal, detailed and “runnable” or “executable”.

38

Page 46: Computational Synthetic Biology

/183

•“feature selection” is the first issue one must confront when building a model

•One starts from a system of interest and then a decision should be taken as to what will the model include/leave out

•That is, at what level the model will be built

39

Page 47: Computational Synthetic Biology

/183

The goals of Modelling

•To capture the essential features of a biological entity/phenomenon•To disambiguate the understanding behind those features and their interactions•To move from qualitative knowledge towards quantitative knowledge

40

Page 48: Computational Synthetic Biology

/183

Cells

Colonies

Networks

Systems Biology Synthetic Biology

• Understanding• Integration• Prediction• Life as it is

•Control• Design• Engineering•Life as it could be

Computational modelling toelucidate and characterisemodular patterns exhibitingrobustness, signal filtering,amplification, adaption, error correction, etc.

Computational modelling toengineer and evaluate possible cellular designsexhibiting a desiredbehaviour by combining well studied and characterised cellular modules

41

Modeling in Systems & Synthetic Biology

Page 49: Computational Synthetic Biology

/183

There is potentially a distinction between modeling for Synthetic Biology vs Systems Biology:

•Systems Biology is concerned with Biology as it is•Synthetic Biology is concerned with Biology as it could be

“Our view of engineering biology focuses on the abstraction and standardization of biological components” by R. Rettberg @ MIT newsbite August 2006.

“Well-characterized components help lower the barriers to modeling. The use of control elements (such as temperature for a temperature-sensitive protein, or an exogenous small molecule affecting a reaction) helps model validation” by Di Ventura et al, Nature, 2006

Co-design of parts and their models hence improving and making both more reliable

42

Page 50: Computational Synthetic Biology

/183

Model DevelopmentFrom [E. Klipp et al, Systems Biology in Practice, 2005]:

1) Formulation of the problem2) Verification of available information3) Selection of model structure4) Establishing a simple model5) Sensitivity analysis6) Experimental tests of the model predictions7) Stating the agreements and divergences between

experimental and modelling results8) Iterative refinement of model

43

Page 51: Computational Synthetic Biology

/183

The Challenge of Scales

Within a cell the dissociation constants of DNA/ transcription factor binding to specific/non-specific sites differ by 4-6 orders of magnitude

DNA protein binding occurs at 1-10s time scale very fast in comparison to a cell’s life cycle.

44

R. Milo, et al., BioNumbers—the database of key numbers in molecular and cell biology. Nucleic Acids

Research, 1–4 (2009) http://bionumbers.hms.harvard.edu/default.aspx

Page 52: Computational Synthetic Biology

/183

Tem

pora

l sca

le (l

og)

Spatial scale (log)Couplings

The Scale Separation Map

45

nm µm mm

fsps

nsµs

ms

ss*

103

bond vibration

energy transfer

micelle dynamics

vesicle dynamicsdiff. small mol

transcription translation

QS mol diff in colony

DPD

mcssa

Page 53: Computational Synthetic Biology

/183

The Challenge of Small Numbers in Cellular Systems

Most commonly recognised sources of noise in cellular system are low number of molecules and slow molecular interactions.

Over 80% of genes in E. coli express fewer than a hundred proteins per cell.

Mesoscopic, discrete and stochastic approaches are more suitable: Only relevant molecules are taken into account. Focus on the statistics of the molecular interactions and how often they

take place.

Mads Karn et al. Stochasticity in Gene Expression: From Theories to Phenotypes. Nature Reviews, 6, 451-464 (2005)

Purnananda Guptasarma. Does replication-induced transcription regulate synthesis of the myriad low copy number poteins of E. Coli. BioEssays, 17, 11, 987-997

46

Page 54: Computational Synthetic Biology

/183

Modelling Approaches

There exist many modeling approaches, each with its advantages and disadvantages.

Macroscopic, Microscopic and MesoscopicQuantitative and qualitativeDiscrete and ContinuousDeterministic and StochasticTop-down or Bottom-up

47

Page 55: Computational Synthetic Biology

/183

Modelling Frameworks•Denotational Semantics Models:

Set of equations showing relationships between molecular quantities and how they change over time.They are approximated numerically. (I.e. Ordinary Differential Equations, PDEs, etc)

•Operational Semantics Models:

Algorithm (list of instructions) executable by an abstract machine whose computation resembles the behaviour of the system under study. (i.e. Finite State Machine)

Jasmin Fisher and Thomas Henzinger. Executable cell biology. Nature Biotechnology, 25, 11, 1239-1249 (2008)

48

A. Regev, E. Shapiro. The π-calculus as an abstraction for biomolecular systems. Modelling in Molecular Biology., pages 1–50. Springer Berlin., 2004.

D. Harel, "A Grand Challenge for Computing: Full Reactive Modeling of a Multi-Cellular Animal", Bulletin of the EATCS , European Association for Theoretical Computer Science, no. 81, 2003, pp. 226-235

Page 56: Computational Synthetic Biology

/183

•From [D.E Goldberg, 2002] (adapted): “Since science and math are in the description

business, the model is the thing…The engineer or inventor has much different motives. The engineered object is the thing”

ε, e

rror

C, cost of modelling

Synthetic Biologist

Computer Scientist/Mathematician

Tools Suitability and Cost

49

Page 57: Computational Synthetic Biology

/18350

Page 58: Computational Synthetic Biology

/183

There are good reasons to think that information processing is a key viewpoint to take when modeling

Life as we know is:• coded in discrete units (DNA, RNA, Proteins)• combinatorially assembles interactions (DNA-RNA, DNA-Proteins,RNA-Proteins , etc) through evolution and self-organisation• Life emerges from these interacting parts• Information is:

• transported in time (heredity, memory e.g. neural, immune system, etc)• transported in space (molecular transport processes, channels, pumps, etc)

• Transport in time = storage/memory a computational process• Transport in space = communication a computational process• Signal Transduction = processing a computational process

51

Page 59: Computational Synthetic Biology

/183

Computer Science ContributionsMethodologies designed to cope with: • Languages to cope with complex, concurrent, systems of parts:

• ∏-calculus• Process Calculi• P Systems

• Tools to analyse and optimise:• EA, ML• Model Checking

52

J.Twycross, L.R. Band, M. J. Bennett, J.R. King, and N. Krasnogor. Stochastic and deterministic multiscale models for systems biology: an auxin-transport case study. BMC Systems Biology, 4(:34), March 2010

Page 60: Computational Synthetic Biology

/18353

InfoBiotics Workbench and Dashboardwww.infobiotics.net

Page 61: Computational Synthetic Biology

/18353

InfoBiotics Workbench and Dashboard

Spec

ifica

tion www.infobiotics.net

Page 62: Computational Synthetic Biology

/183

Executable Biology with P systems

•Field of membrane computing initiated by Gheorghe Păun in 2000

•Inspired by the hierarchical membrane structure of eukaryotic cells

•A formal language: precisely defined and machine processable

•An executable biology methodology

54

Page 63: Computational Synthetic Biology

/183

Functional EntitiesContainer

• A boundary defining self/non-self (symmetry breaking).• Maintain concentration gradients and avoid environmental damage.

Metabolism

• Confining raw materials to be processed.• Maintenance of internal structures (autopoiesis).

Information

• Sensing environmental signals / release of signals.• Genetic information

55

Page 64: Computational Synthetic Biology

/183

Distributed and parallel rewritting systems in compartmentalised hierarchical structures.

Compartments

Objects

Rewriting Rules

• Computational universality and efficiency.

• Modelling Framework

56

Page 65: Computational Synthetic Biology

/183

Cell-like P systems

the classic P system diagram appearing in most papers (Păun)

Intuitive Visual representation as a Venn diagram with a unique superset and without intersected sets.

57

Page 66: Computational Synthetic Biology

/183

Cell-like P systems

1

23

4

75

8

6

9

formally equivalent to a tree:

the classic P system diagram appearing in most papers (Păun)

Intuitive Visual representation as a Venn diagram with a unique superset and without intersected sets.

57

Page 67: Computational Synthetic Biology

/183

Cell-like P systems

• a string of matching parentheses: [1 [2 ]2 [3 ]3 [4 [5 ]5 [6 [8 ]8 [9 ]9 ]6

[7 ]7 ]4 ]1

1

23

4

75

8

6

9

formally equivalent to a tree:

the classic P system diagram appearing in most papers (Păun)

Intuitive Visual representation as a Venn diagram with a unique superset and without intersected sets.

57

Page 68: Computational Synthetic Biology

/183

P-Systems: Modelling PrinciplesMoleculesStructured Molecules

ObjectsStrings

Molecular Species Multisets of objects/strings

Membranes/organelles Membrane

Biochemical activity rules

Biochemical transport Communication rules

58

Page 69: Computational Synthetic Biology

/183

Stochastic P Systems

59

Page 70: Computational Synthetic Biology

/183

Rewriting Rules

used by Multi-volume Gillespie’s algorithm

60

Page 71: Computational Synthetic Biology

/183

Molecular Species A molecular species can be represented using

individual objects.

A molecular species with relevant internal structure can be represented using a string.

61

Page 72: Computational Synthetic Biology

/183

Molecular Interactions Comprehensive and relevant rule-based schema

for the most common molecular interactions taking place in living cells.

Transformation/Degradation Complex Formation and Dissociation Diffusion in / out Binding and Debinding Recruitment and Releasing Transcription Factor Binding/Debinding Transcription/Translation

62

Page 73: Computational Synthetic Biology

/183

Compartments / Cells Compartments and regions are explicitly

specified using membrane structures.

63

Page 74: Computational Synthetic Biology

/183

Colonies / Tissues Colonies and tissues are representing as

collection of P systems distributed over a lattice.

Objects can travel around the lattice through translocation rules.

v

64

Page 75: Computational Synthetic Biology

/183

Molecular Interactions Inside Compartments

65

Page 76: Computational Synthetic Biology

/183

Passive Diffusion of Molecules

66

Page 77: Computational Synthetic Biology

/183

Signal Sensing and Active Transport

67

Page 78: Computational Synthetic Biology

/183

Specification of Transcriptional Regulatory Networks

68

Page 79: Computational Synthetic Biology

/183

Post-Transcriptional Processes For each protein in the system, post-transcriptional processes like

translational initiation, messenger and protein degradation, protein dimerisation, signal sensing, signal diffusion etc are represented using modules of rules.

Modules can have also as parameters the stochastic kinetic constants associated with the corresponding rules in order to allow us to explore possible mutations in the promoters and ribosome binding sites in order to optimise the behaviour of the system.

69

Page 80: Computational Synthetic Biology

/183

Scalability through Modularity

Cellular functions arise from orchestrated interactions between motifs consisting of many molecular interacting species.

A P System model is a set of rules representing molecular interactions motifs that appear in many cellular systems.

70

Page 81: Computational Synthetic Biology

/183

Basic P System Modules Used

71

Page 82: Computational Synthetic Biology

/183 21

Characterisation/Encapsulation of Cellular Parts: Gene Promoters

A modeling language for the design of synthetic bacterial colonies.

A module, set of rules describing the molecular interactions involving a cellular part, provides encapsulation and abstraction. Collection or libraries of reusable cellular parts and reusable models.

E. Davidson (2006) The Regulatory Genome, Gene Regulation Networks in Development and Evolution, Elsevier

72

Page 83: Computational Synthetic Biology

/183 21

Characterisation/Encapsulation of Cellular Parts: Gene Promoters

A modeling language for the design of synthetic bacterial colonies.

A module, set of rules describing the molecular interactions involving a cellular part, provides encapsulation and abstraction. Collection or libraries of reusable cellular parts and reusable models.

01101110100001010100011110001011101010100011010100

E. Davidson (2006) The Regulatory Genome, Gene Regulation Networks in Development and Evolution, Elsevier

72

Page 84: Computational Synthetic Biology

/183 21

Characterisation/Encapsulation of Cellular Parts: Gene Promoters

A modeling language for the design of synthetic bacterial colonies.

A module, set of rules describing the molecular interactions involving a cellular part, provides encapsulation and abstraction. Collection or libraries of reusable cellular parts and reusable models.

E. Davidson (2006) The Regulatory Genome, Gene Regulation Networks in Development and Evolution, Elsevier

72

Page 85: Computational Synthetic Biology

/183 21

Characterisation/Encapsulation of Cellular Parts: Gene Promoters

A modeling language for the design of synthetic bacterial colonies.

A module, set of rules describing the molecular interactions involving a cellular part, provides encapsulation and abstraction. Collection or libraries of reusable cellular parts and reusable models.

LuxRAHL

CI

E. Davidson (2006) The Regulatory Genome, Gene Regulation Networks in Development and Evolution, Elsevier

72

Page 86: Computational Synthetic Biology

/183 21

Characterisation/Encapsulation of Cellular Parts: Gene Promoters

A modeling language for the design of synthetic bacterial colonies.

A module, set of rules describing the molecular interactions involving a cellular part, provides encapsulation and abstraction. Collection or libraries of reusable cellular parts and reusable models.

LuxRAHL

CI

PluxOR1({X},{c1, c2, c3, c4, c5, c6, c7, c8, c9},{l}) = {

type: promoter

sequence: ACCTGTAGGATCGTACAGGTTTACGCAAGAA ATGGTTTGTATAGTCGAATACCTCTGGCGGTGATA

rules: r1: [ LuxR2 + PluxPR.X ]_l -c1-> [ PluxPR.LuxR2.X ]_l r2: [ PluxPR.LuxR2.X ]_l -c2-> [ LuxR2 + PluxPR.X ]_l ... r5: [ CI2 + PluxPR.X ]_l -c5-> [ PluxPR.CI2.X ]_l r6: [ PluxPR.CI2.X ]_l -c6-> [ CI2 + PluxPR.X ]_l ... r9: [ PluxPR.LuxR2.X ]_l -c9-> [ PluxPR.LuxR2.X + RNAP.X ]_l}

E. Davidson (2006) The Regulatory Genome, Gene Regulation Networks in Development and Evolution, Elsevier

72

Page 87: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection

73

Page 88: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

73

Page 89: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

73

Page 90: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

PluxOR1({X=tetR})

73

Page 91: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

73

Page 92: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

PluxOR1({X=GFP})

73

Page 93: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection Recombinant DNA: Objects variables can be instantiated with the name of specific

genes.

PluxOR1({X=GFP})

73

Page 94: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection

Directed evolution: Variables for stochastic constants can be instantiated with specific values.

Recombinant DNA: Objects variables can be instantiated with the name of specific genes.

PluxOR1({X=GFP})

73

Page 95: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection

A

Directed evolution: Variables for stochastic constants can be instantiated with specific values.

Recombinant DNA: Objects variables can be instantiated with the name of specific genes.

PluxOR1({X=GFP})

73

Page 96: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection

A

Directed evolution: Variables for stochastic constants can be instantiated with specific values.

Recombinant DNA: Objects variables can be instantiated with the name of specific genes.

PluxOR1({X=GFP})

PluxOR1({X=GFP},{...,c4=10,...})

73

Page 97: Computational Synthetic Biology

/183 22

Module Variables: Recombinant DNA, Directed Evolution, Chassis

selection

A

Directed evolution: Variables for stochastic constants can be instantiated with specific values.

Recombinant DNA: Objects variables can be instantiated with the name of specific genes.

PluxOR1({X=GFP})

PluxOR1({X=GFP},{...,c4=10,...}) Chassis Selection: The variable for the label can be instantiated with the name of a

chassis.

PluxOR1({X=GFP},{...,c4=10,...},{l=DH5α })

73

Page 98: Computational Synthetic Biology

/183 23

Characterisation/Encapsulation of Cellular Parts: Riboswitches

A riboswitch is a piece of RNA that folds in different ways depending on the presence of absence of specific molecules regulating translation.

ToppRibo({X},{c1, c2, c3, c4, c5, c6},{l}) = {

type: riboswitch

sequence:GGTGATACCAGCATCGTCTTGATGCCCTTGG CAGCACCCCGCTGCAAGACAACAAGATG rules: r1: [ RNAP.ToppRibo.X ]_l -c1-> [ ToppRibo.X ]_l r2: [ ToppRibo.X ]_l -c2-> [ ]_l r3: [ ToppRibo.X + theop ]_l –c3-> [ ToppRibo*.X ]_l r4: [ ToppRibo*.X ]_l –c4-> [ ToppRibo.X + theop ]_l r5: [ ToppRibo*.X ]_l –c5-> [ ]_l r6: [ ToppRibo*.X ]_l –c6-> [ToppRibo*.X + Rib.X ]_l}

74

Page 99: Computational Synthetic Biology

/183 24

Characterisation/Encapsulation of Cellular Parts: Degradation Tags

Degradation tags are amino acid sequences recognised by proteases. Once the corresponding DNA sequence is fused to a gene the half life of the protein is reduced considerably.

degLVA({X},{c1, c2},{l}) = {

type: degradation tag

sequence: CAGCAAACGACGAAAACTACGCTTTAGTAGCT

rules: r1: [ Rib.X.degLVA ]_l -c1-> [ X.degLVA ]_l r2: [ X.degLVA ]_l -c2-> [ ]_l}

75

Page 100: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

76

Page 101: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = {

76

Page 102: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α})

76

Page 103: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α})

76

Page 104: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α}) degLVA({X},{...},{l=DH5α})

76

Page 105: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α}) degLVA({X},{...},{l=DH5α})}

76

Page 106: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α}) degLVA({X},{...},{l=DH5α})}

X=GFP

76

Page 107: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α}) degLVA({X},{...},{l=DH5α})}

X=GFP

76

Page 108: Computational Synthetic Biology

/183 25

Higher Order Modules: Building Synthetic Gene Circuits

PluxOR1 geneXToppRibo degLVA

3OC6_repressible_sensor({X}) = { PluxOR1({X=ToppRibo.geneX.degLVA},{...},{l=DH5α}) ToppRibo({X=geneX.degLVA},{...},{l=DH5α}) degLVA({X},{...},{l=DH5α})}

X=GFP

Plux({X=ToppRibo.geneCI.degLVA},{...},{l=DH5α}) ToppRibo({X=geneCI.degLVA},{...},{l=DH5α}) degLVA({CI},{...},{l=DH5α})

PtetR({X=ToppRibo.geneLuxR.degLVA},{...},{l=DH5α}) Weiss_RBS({X=LuxR},{...},{l=DH5α}) Deg({X=LuxR},{...},{l=DH5α})

76

Page 109: Computational Synthetic Biology

/183

luxIPconst

LuxI AHL

AHL

luxRPconst

cIPlux

gfpPluxOR1

LuxR

CI

GFPAHL

AHLSpecification of Multi-cellular

Systems: LPP systems

77

Page 110: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Cellular Parts

78

Page 111: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

Cellular Parts

78

Page 112: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

Cellular Parts

Module Combinations

78

Page 113: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 114: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 115: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

P systems

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 116: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

P systems

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 117: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

P systems

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 118: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Libraries of Modules

P systems LPP systems

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 119: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 120: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 121: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

A compiler based on a BNF grammar

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 122: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

Multi Compartmental Stochastic Simulations

based on Gillespie’s algorithm

A compiler based on a BNF grammar

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 123: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

Multi Compartmental Stochastic Simulations

based on Gillespie’s algorithm

Spatio-temporal Dynamics Analysis

using Model Checking with PRISM and MC2

A compiler based on a BNF grammar

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 124: Computational Synthetic Biology

/183 29

Infobiotics: An Integrated Frameworkhttp://www.infobiotics.org/infobiotics-workbench/

Synthetic Multi-cellular Systems

Libraries of Modules

P systems LPP systems

Multi Compartmental Stochastic Simulations

based on Gillespie’s algorithm

Spatio-temporal Dynamics Analysis

using Model Checking with PRISM and MC2

Automatic Design of Synthetic Gene

Regulatory Circuits using Evolutionary Algorithms

A compiler based on a BNF grammar

Single Cells

Cellular Parts

Synthetic Circuits

Module Combinations

78

Page 125: Computational Synthetic Biology

/183

Stochastic P Systems Are Executable Programs

The virtual machine running these programs is a “Gillespie Algorithm (SSA)”. It generates trajectories of a stochastic syste:

A stochastic constant is associated with each rule.A propensity is computed for each rule by multiplying the stochastic constant by the number of distinct possible combinations of the elements on the left hand side of the rule.

F. J. Romero-Campero, J. Twycross, M. Camara, M. Bennett, M. Gheorghe, and N. Krasnogor. Modular assembly of cell systems biology models using p systems. International Journal of Foundations of Computer Science, 2009

79

Page 126: Computational Synthetic Biology

/183

Multicompartmental Gillespie Algorithm

1

2

3 r11,…,r1

n1

M1

r21,…,r2

n2

M2

r31,…,r3

n3

M3

( 1, τ1, r01)

( 2, τ2, r02)

( 3, τ3, r03)

( 2, τ2, r02)

( 1, τ1, r01)

( 3, τ3, r03)

Sort Compartments τ2 < τ1 < τ3

Local Gillespie

( 1, τ1-τ2, r01)

( 3, τ3-τ2, r03)

Update Waiting Times

( 2, τ2’, r02)( 1, τ1-τ2, r0

1)

( 2, τ2’, r02)

( 3, τ3-τ2, r03)

Insert new triplet τ1-τ2 <τ2’ < τ3-τ2

80

Page 127: Computational Synthetic Biology

/183

An Important Difference with “Normal” Programs

•Executable Stochastic P systems are not programs with stochastic behavior

•A cell is a living example of distributed stochastic computing.

81

function f1(p1,p2,p3,p4){if (p1<p2) RND print p3 RNDelse RND print p4 RND}

function f1(p1,p2,p3,p4){if (p1<p2) and (rand<0.5) print p3else print p4}

Page 128: Computational Synthetic Biology

/183

•Using P systems modules one can model a large variety of commonly occurring BRN:

•Gene Regulatory Networks•Signaling Networks•Metabolic Networks

•This can be done in an incremental way.

F. J. Romero-Campero, J. Twycross, M. Camara, M. Bennett, M. Gheorghe, and N. Krasnogor. Modular assembly of cell systems biology models using p systems. International Journal of Foundations of Computer Science, 2009

82

RapidModelPrototyping

Page 129: Computational Synthetic Biology

/18383 InfoBiotics Workbench and Dashboard

Spec

ifica

tion

Page 130: Computational Synthetic Biology

/18383 InfoBiotics Workbench and Dashboard

Spec

ifica

tion

Sim

ulat

ion

Analysis

Page 131: Computational Synthetic Biology

/183

• Two different bacterial strains carrying specific synthetic gene regulatory networks are used.

• The first strain produces a diffusible signal AHL.

• The second strain possesses a synthetic gene regulatory network which produces a pulse of GFP after AHL sensing within a range of values (Band Pass).

An example: A Pulse Generator

S. Basu, R. Mehreja, et al. (2004) Spatiotemporal control of gene expression with pulse generating networks, PNAS, 101, 6355-6360

84

Page 132: Computational Synthetic Biology

/183

Sender Cells

Pconst

LuxI AHL

AHL

SenderCell()=

{

Pconst({X = luxI },…)

PostTransc({X=LuxI},{c1=3.2,…})

Diff({X=AHL},{c=0.1})

}

luxI

85

Page 133: Computational Synthetic Biology

/183

luxRPconst

cIPlux

gfpPluxOR1

LuxR

CI

GFPAHL

AHL

PulseGenerator()=

{

Pconst({X=luxR},…)

PluxOR1({X=gfp},…)

Plux({X=cI},…)

Diff({X=AHL},…)

}

86

Pulse Generating Cells

Page 134: Computational Synthetic Biology

/183

Spatial Distribution of Senders and Pulse Generators

luxIPconst

LuxI AHL

AHL

AHL

87

luxRPconst

cIPlux

gfpPluxOR1

LuxR

CI

GFPAHL

AHL

Page 135: Computational Synthetic Biology

/183

Pulse propagation - simulation I

88

Simulation I

Page 136: Computational Synthetic Biology

/183

Pulse Generating Cells With Relay

luxRPconst

cIPlux

PluxOR1

LuxR

CI

AHL

AHL

luxIPlux

LuxI

AHL

PulseGenerator(X ) =

{ Pconst({X=luxR},…)

PluxOR1({X},…) ,

Plux({X=cI},…) ,

Diff({X=AHL},…) ,

Plux({X=luxI},…)

}89

Page 137: Computational Synthetic Biology

/183 3690

Simulation II

Pulse propagation & Rely- simulation II

Page 138: Computational Synthetic Biology

/183

A Signal Translatorfor Pattern Formation

act1Prep2

act2Prep1

rep1Pact1

rep2Pact2

rep3Prep1

rep4Prep2

I2Prep3

I1Prep4

FP2Pact2

FP1Pact1

91

Page 139: Computational Synthetic Biology

/183

Uniform Spatial Distribution of Signal Translators for Pattern Formation

92

Page 140: Computational Synthetic Biology

/183

Pattern Formation in synthetic bacterial colonies

93

Simulation III

Page 141: Computational Synthetic Biology

/183

Pattern Formation in synthetic bacterial colonies

94

Page 142: Computational Synthetic Biology

/183

Spatial Distribution of Signal Translators and propagators

luxRPconst

cIPlux

gfpPluxOR1

LuxR GFPSi

Si

luxI

LuxI

Si

Plux

95

CI

Page 143: Computational Synthetic Biology

/183

Alternating signal pulses in synthetic bacterial colonies

96

Simulation IV

Page 144: Computational Synthetic Biology

/183

Outline•Brief Introduction to Computational Modeling

•Modeling for Top Down SB•Executable Biology•A pinch of Model Checking

•Modeling for the Bottom Up SB•Dissipative Particle Dynamics

•Automated Model Synthesis and Optimisation

•Conclusions97

Page 145: Computational Synthetic Biology

/18398 InfoBiotics Workbench and Dashboard

Spec

ifica

tion

Sim

ulat

ion

Analysis

Page 146: Computational Synthetic Biology

/18398 InfoBiotics Workbench and Dashboard

Spec

ifica

tion

Sim

ulat

ion

Analysis

Optim

isation

Page 147: Computational Synthetic Biology

/183

Automated Model Synthesis and Optimisation

• Modeling is an intrinsically difficult process

• It involves “feature selection” and disambiguation

• Model Synthesis requires• design the topology or structure of the

system in terms of molecular interactions• estimate the kinetic parameters associated

with each molecular interaction

• All the above iterated99

Page 148: Computational Synthetic Biology

/183

• Once a model has been prototyped, whether derived from existing literature or “ab initio” ➡ Use some optimisation method to fine tune parameters/model structure

• adopts an incremental methodology, namely starting from very simple P system modules (BioBricks) specifying basic molecular interactions, more complicated modules are produced to model more complex molecular systems.

100

Page 149: Computational Synthetic Biology

/183

Large Literature on Model Synthesis• Mason et al. use a random Local Search (LS) as the mutation to

evolve electronic networks with desired dynamics

• Chickarmane et al. use a standard GA to optimize the kinetic parameters of a population of ODE-based reaction networks having the desired topology.

• Spieth et al. propose a memetic algorithm to find gene regulatory networks from experimental DNA microarray data where the network structure is optimized with a GA and the parameters are optimized with an Evolution Strategy (ES).

• Etc

101

Page 150: Computational Synthetic Biology

/183

Evolutionary Algorithms for Automated Model Synthesis and Optimisation

EA are potentially very useful for AMSO• There’s a substantial amount of work on:• using GP-like systems to evolve executable

structures• using EAs for continuous/discrete

optimisation• An EA population represents alternative

models (could lead to different experimental setups)

• EAs have the potential to capture, rather than avoid, evolvability of models

102

Page 151: Computational Synthetic Biology

/183

• The main idea is to use a nested evolutionary algorithm where the first layer evolves model structures while the inner layer acts as a local search for the parameters of the model. • It uses stochastic P systems as a computational, modular and discrete-stochastic modelling framework.

• It adopts an incremental methodology, namely starting from very simple P system modules specifying basic molecular interactions, more complicated modules are produced to model more complex molecular systems.

•Successfully validated evolved models can then be added to the models library

Evolving Executable Biology Models

103

Page 152: Computational Synthetic Biology

/183

Nested EA for Model Synthesis

F. Romero-Campero, H.Cao, M. Camara, and N. Krasnogor. Structure and parameter estimation for cell systems biology models. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2008), pages 331-338. ACM Publisher, 2008.

H. Cao, F.J. Romero-Campero, S. Heeb, M. Camara, and N. Krasnogor. Evolving cell models for systems and synthetic biology. Systems and Synthetic Biology , 2009

104

Page 153: Computational Synthetic Biology

/183

Fitness Evaluation

105

Page 154: Computational Synthetic Biology

/183

Representation

106

Page 155: Computational Synthetic Biology

/183107

Page 156: Computational Synthetic Biology

/183

Structural Operators

108

Page 157: Computational Synthetic Biology

/183

Structural Operators

109

Page 158: Computational Synthetic Biology

/183

Parameter Optimisation

110

Page 159: Computational Synthetic Biology

/183

Parameter Optimisation

111

Page 160: Computational Synthetic Biology

/183

Simple Illustrative Case studies

• Case 1: molecular complexation

• Case 2: enzymatic reaction

• Case 3: autoregulation in transcriptional networks

•Case 3.1. negative autoregulation•Case 3.2. positive autoregulation

112

Page 161: Computational Synthetic Biology

/183

Case 1: Molecular ComplexationTarget P System Model Structure

Target P System Model Parameters

c1

c2

113

Page 162: Computational Synthetic Biology

/183

Case 1: Experimental Results

50/50 runs found the same P system model structure as the target one

Target and Evolved Time Series 114

Page 163: Computational Synthetic Biology

/183

Case 1: Experimental Results

50/50 runs found the same P system model structure as the target one

Best Model Parameters

Target Model Parameters

Target and Evolved Time Series 115

Page 164: Computational Synthetic Biology

/183

Case 2: Enzymatic ReactionTarget P System Model Structure

Target P System Model Parameters

c1

c2

c3

116

Page 165: Computational Synthetic Biology

/183

Case 2: Experimental ResultsTarget and Evolved Time Series

117

Page 166: Computational Synthetic Biology

/183

Case 2: Experimental Results

Best Model Parameters

Target Model Parameters Target and Evolved Time Series

Two Approaches Using Basic Modules

Adding the Newly Found in Case 1

Number of runs the target P system model was

found

12/50 39/50

Mean RMSE 3.8 2.85118

Page 167: Computational Synthetic Biology

/183

Case 3.1: Negative AutoregulationTarget P System Model Structure Target P System Model Parameters

c1

c2

c3

c4

c5

c6

119

Page 168: Computational Synthetic Biology

/183

Case 3.1: Experimental Results

Best Model in Design 1

Design Design 1 Design 2

Model

Number of runs 46/50 4/50

Mean +/- STD 4.11+/- 11.73 4.63 +/- 2.91

Best Model in Design 2

120

Page 169: Computational Synthetic Biology

/183

Case 3.2: Positive AutoregulationTarget P System Model Structure Target P System Model Parameters

c1

c2

c3

c4

c5

c6

c7

121

Page 170: Computational Synthetic Biology

/183

Case 3.2: Experimental ResultsDesign Design 1 Design 2

Model

Number of runs 30/50 20/50

Mean +/- STD 16.36 +/- 3.03 20.14 +/- 5.13

Best Model in Design 1

122

Page 171: Computational Synthetic Biology

/183

The Fitness Function• Multiple time-series per target

• Different time series have very different profiles, e.g., response time or maxima occur at different times/places

• Transient states (sometimes) as important as steady states

•RMSE will mislead search

H. Cao, F. Romero-Campero, M.Camara, N.Krasnogor (2009). Analysis of Alternative Fitness Methods for the Evolutionary Synthesis of Cell Systems Biology Models.

123

Page 172: Computational Synthetic Biology

/183

Four Fitness FunctionsN time seriesM time points in each time series Simulated data Target data

F1

F2

F2 normalises whithin a time series between [0,1]

124

Page 173: Computational Synthetic Biology

/183

Four Fitness FunctionsN time seriesM time points in each time series Simulated data Target data Randomly generated normalised

vector with:

F3

F3, unlike F1, does not assume an equal weighting of all the errors. It produces a randomised average of weights.

125

Page 174: Computational Synthetic Biology

/183

Four Fitness FunctionsN time seriesM time points in each time series Simulated data Target data

F4

F4 simply multiplies errors hence even small ones within a set with multiple orders of maginitudes can still have an effect. Indeed, this might lead to numerical instabilities due to nonlinear effects.

126

Page 175: Computational Synthetic Biology

/183

Problem Specification

127

Page 176: Computational Synthetic Biology

/183

Case Studies

128

Page 177: Computational Synthetic Biology

/183129

Page 178: Computational Synthetic Biology

/183130

Page 179: Computational Synthetic Biology

/183131

Page 180: Computational Synthetic Biology

/183132

Page 181: Computational Synthetic Biology

/183133

Page 182: Computational Synthetic Biology

/183134

Page 183: Computational Synthetic Biology

/183135

Page 184: Computational Synthetic Biology

/183

Results Study Case 1

136

Page 185: Computational Synthetic Biology

/183

Results Study Case 2

137

Page 186: Computational Synthetic Biology

/183

Results Study Case 3

138

Page 187: Computational Synthetic Biology

/183

Alternative, biologically valid, modules

Target model

139

Page 188: Computational Synthetic Biology

/183140

Page 189: Computational Synthetic Biology

/183

Results Study Case 4

141

Page 190: Computational Synthetic Biology

/183

Comparisons of the constants between the best fitness models obtained by F1, F4 and the target model for Test Case 4 (Values different from the target are in bold and underlined)

142

Page 191: Computational Synthetic Biology

/183143

Page 192: Computational Synthetic Biology

/183144

Page 193: Computational Synthetic Biology

/183145

Page 194: Computational Synthetic Biology

/183146

Page 195: Computational Synthetic Biology

/183147

Page 196: Computational Synthetic Biology

/183

Target

148

Page 197: Computational Synthetic Biology

/183

Target

149

Page 198: Computational Synthetic Biology

/183

The evolving curve of average fitness by four fitness methods (F1 ∼ F4) in 20 runs for Test Case 4.

150

Page 199: Computational Synthetic Biology

/183

The evolving curve of average model diversity by four fitness methods (F1 ∼ F4) in 20 runs for Test Case 4

151

Page 200: Computational Synthetic Biology

/183152

Page 201: Computational Synthetic Biology

/183

Multi-Objective Optimisation in Morphogenesis

The following slides are based on

Rui Dilão, Daniele Muraro, Miguel Nicolau, Marc Schoenauer. Validation of a morphogenesis model of Drosophila early development by a multi-objective evolutionary optimization algorithm. Proc. 7th European Conference on Evolutionary Computation, ML and Data Mining in BioInformatics (EvoBIO'09), April 2009.

153

Page 202: Computational Synthetic Biology

/183

• The authors investigate the use of MOEA for modeling morphogenesis

• The model organism is drosophila embrios Rui Dilão, Daniele Muraro, Miguel Nicolau, Marc Schoenauer. Validation of a morphogenesis model of Drosophila early development by a multi-objective evolutionary optimization algorithm. Proc. 7th European Conference on Evolutionary Computation, ML and Data Mining in BioInformatics (EvoBIO'09), April 2009

154

Page 203: Computational Synthetic Biology

/183

Initial Conditions

PDE model

155

Page 204: Computational Synthetic Biology

/183

Target

By Evolving:abcd, acad, Dbcd/Dcad, r, mRNA distributions and t

However: Goal is not perfect fit but rather robust fit

156

Page 205: Computational Synthetic Biology

/183

• The authors use both Single and Multi-objective optimisation• CMA-ES is at the core of both• CMA-ES is a (µ,λ)-ES that employs multivariate Gaussian

distributions • Uses “cumulative path” for co-variantly adapting this

distribution• For the MO case it uses the global pareto dominance based

selection

Bi-Objective Function

Objective Function

157

Page 206: Computational Synthetic Biology

/183158

Page 207: Computational Synthetic Biology

/183159

Page 208: Computational Synthetic Biology

/183160

Page 209: Computational Synthetic Biology

/183

Parameter Optimisation in Metabolic Models

161

A. Drager et al. (2009). Modeling metabolic networks in C. glutamicum: a comparison of rate laws in combination with various parameter optimization strategies. BMC Systems Biol ogy 2009, 3:5

Page 210: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•A Note on Ethical, Social and Legal Issues

•Conclusions162

Page 211: Computational Synthetic Biology

/183

Synthetic BiologyThe new science of synthetic biology aims to re-engineer life at the molecular level and even create completely new forms of life. It has the potential to create new medicines, biofuels, assist climate change through carbon capture, and develop solutions to help clean up the environment.

163

Page 212: Computational Synthetic Biology

/183

What is Synthetic Biology?Synthetic Biology isA) the design and construction of new biological parts, devices, and systems, andB) the re-design of existing, natural biological systems for useful purposes.

http://syntheticbiology.org/

164

Page 213: Computational Synthetic Biology

/183

Been There, Done That

165

Page 214: Computational Synthetic Biology

/183

The Unjustified Fear of Chimera and Foreign Genes

TransplantationItaya, M., Tsuge, K. Koizumi, M., and Fujita, K. Combining two genomes in one C e l l : S t a b l e c l o n i n g o f t h e Synechosystis PCC6803 genome in the Bacillus subtilis 168 genome.Proc. Natl. Acad. Sci., USA, 102, 15971-15976 (2005)

+ =

150 times larger than the human genome

166

Page 215: Computational Synthetic Biology

/183

A Technology Not a Product“ The problem of deaths and injury as a result of road accidents is now acknowledge to be a global phenomenon.... publications show that in 1990 road accidents as a cause of death or dissability were in ninth place out of a total of over 100 identified causes.... by 2020 forecasts suggest... road accidents will move up to sixth...”

Estimating global road fatalities. G. Jacobs & A. Aeron-Thomas. Global Road Safety Partnership

167

Page 216: Computational Synthetic Biology

/183

A Technology Not a Product“ The problem of deaths and injury as a result of road accidents is now acknowledge to be a global phenomenon.... publications show that in 1990 road accidents as a cause of death or dissability were in ninth place out of a total of over 100 identified causes.... by 2020 forecasts suggest... road accidents will move up to sixth...”

And yet nobody seriously considers banning mechanical engineering

Estimating global road fatalities. G. Jacobs & A. Aeron-Thomas. Global Road Safety Partnership

167

Page 217: Computational Synthetic Biology

/183

A Technology Not a Product

168

Page 218: Computational Synthetic Biology

/183

A Technology Not a Product

And yet nobody seriously considers banning printing technology

168

Page 219: Computational Synthetic Biology

/183

But ....•Technologies are regulated:

•Cars have seat belts and laws establish speed limits•Main Kampf is banned in Germany and by, e.g., restricting google, China bans uncountable written material of all sorts!

•Societies must establish an informed dialogue involving:•tax payers•Scientists•Lobbies of all sorts•Government

169

Page 220: Computational Synthetic Biology

/183

What IS Synthetic Biology?Synthetic Biology isA) the design and construction of new biological parts, devices, and systems, andB) the re-design of existing, natural biological systems for useful purposes.

http://syntheticbiology.org/

C) Through rigorous mathematical, computational engineering routes

170

Page 221: Computational Synthetic Biology

/183

Biology only smarter, safer and clearer

171

Page 222: Computational Synthetic Biology

/183

Outline

•Essential Systems Biology

•Synthetic Biology

•Computational Modeling for Synthetic Biology

•Automated Model Synthesis and Optimisation

•A Note on Ethical, Social and Legal Issues

•Conclusions172

Page 223: Computational Synthetic Biology

/183

Summary & Conclusions•This talk has focused on an integrative methodology for Systems & Synthetic Biology

•Executable Biology•Parameter and Model Structure Discovery

•Computational models (or executable in Fisher & Henzinger’s jargon) adhere to (a degree) to an operational semantics.

•Refer to the excellent review [Fisher & Henzinger, Nature Biotechnology, 2007]

173

Page 224: Computational Synthetic Biology

/183

•The gap present in mathematical models between the model and its algorithmic implementation disappears in computational models as all of them are algorithms.•A new gap appears between the biology and the modeling technique and this can be solved by a judicious “feature selection”, i.e. the selection of the correct abstractions•Good computational models are more intuitive and analysable

Summary & Conclusions

174

Page 225: Computational Synthetic Biology

/183

•Computational models can thus be executed (quite a few tools out there, lots still missing)•Quantitative VS qualitative modelling: computational models can be very useful even when not every detail about a system is known.•Missing Parameters/model structures can sometimes be fitted with of-the-shelf optimisation strategies (e.g. COPASI, GAs, etc)•Computational models can be analysed by model checking: thus they can be used for testing hypothesis and expanding experimental data in a principled way

Summary & Conclusions

175

Page 226: Computational Synthetic Biology

/183

A nested evolutionary algorithm is proposed to automatically develop and optimise the modular structure and parameters of cellular models based on stochastic P systems. Several case studies with incremental model complexity demonstrate the effectiveness of our algorithm.

The fact that this algorithm produces alternative models for a specific biological signature is very encouraging as it could help biologists to design new experiments to discriminate among competing hypothesis (models).

Comparing results by only using the elementary modules and by adding newly found modules to the library shows the obvious advantage of the incremental methodology with modules. This points out the great potential to automatically design more complex cellular models in the future by using a modular approach.

Summary & Conclusions

176

Page 227: Computational Synthetic Biology

/183

•Synthetising Synthetic Biology Models is more like evolving general GP programs and less like fitting regresion or inter/extra-polation•We evolve executable structures, distributed programs(!)•These are noisy and expensive to execute•Like in GP programs, executable biology models might achieve similar behaviour through different program “structure”•Prone to bloat•Like in GP, complex relation between diversity and solution quality•However, diverse solutions of similar fit might lead to interesting experimental routes•Co-desig of models and wetware.

Summary & Conclusions

177

Page 228: Computational Synthetic Biology

/183

Summary & Conclusions•Some really nice tutorials and other sources:

•Luca Caderlli’s BraneCalculus & BioAmbients•Simulating Biological Systems in the Stochastic π−calculus by Phillips and Cardelli•From Pathway Databases to Network Models by Aguda and Goryachev•Modeling and analysis of biological processes by Brane Calculi and Membrane Systems by Busi and Zandron•D. Gilbert’s website contain several nice papers with related methods and tutorials

178

Page 229: Computational Synthetic Biology

/183

Other SourcesF. J. Romero-Campero, J. Twycross, M. Camara, M. Bennett, M. Gheorghe, and N. Krasnogor. Modular assembly of cell systems biology models using p systems. International Journal of Foundations of Computer Science, (to appear), 2009.

F.J. Romero-Camero and N. Krasnogor. An approach to biomodel engineering based on p systems. In Proceedings of Computation In Europe (CIE 2009), 2009.

J. Smaldon, N. Krasnogor, M. Gheorghe, and A. Cameron. Liposome logic. In Proceedings of the 2009 Genetic and Evolutionary Computation Conference (GECCO 2009), 2009

F. Romero-Campero, H.Cao, M. Camara, and N. Krasnogor. Structure and parameter estimation for cell systems biology models. In Maarten Keijzer et.al, editor, Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2008), pages 331-338. ACM Publisher, 2008. This paper won the Best Paper award at the Bioinformatics track.

J. Smaldon, J. Blake, D. Lancet, and N. Krasnogor. A multi-scaled approach to artificial life simulation with p systems and dissipative particle dynamics. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2008). ACM Publisher, 2008.

179

Page 230: Computational Synthetic Biology

/183

Other SourcesPăun, Gh. Computing with membranes. Journal of Computer and System Sciences 61 (2000) 108-143

P Systems Web Page http://psystems.disco.unimib.it/ Bianco L. Membrane Models of Biological Systems PhD thesis 2007

Bernardini F, Gheorghe M, Krasnogor N, Terrazas G. Membrane Computing - Current Results and Future Problems. CiE 2005 49-53

Bernardini F, Gheorghe M, Krasnogor N. Quorum sensing P systems. Theoretical Computer Science 371 (2007) 20-33

Miguel Nicolau, Marc Schoenauer. Evolving Specific Network Statistical Properties using a Gene Regulatory Network Model. In ECCS 2008, 5th European Conference on Complex Systems, 2008. Miguel Nicolau, Marc Schoenauer. Evolving Scale-Free Topologies using a Gene Regulatory Network Model. CEC 2008, IEEE Congress on Evolutionary Computation, pp. 3748-3755, IEEE Press, 2008.

180

Page 231: Computational Synthetic Biology

/183

A. Ridwan. A parallel implementation of Gillespie's Direct Method. Proc. of the International Conference on Computational Science, p.284-291, Krakow, Poland, June 2004.

G. C. Ewing et al. Akaroa2: Exploiting network computing by distributing stochastic simulation. Proc. of the European Simulation Multiconference, p.175-181, Warsaw, June 1999.

P. Hellekalek. Don't trust parallel Monte Carlo! ACM SIGSIM Simulation Digest, 28(1):82-89, 1998.

M. Schwehm. Parallel stochastic simulation of whole cell models. Proc. of the Second International Conference on Systems Biology, p.333-341, CalTech, C.A., November 2001

Other Sources

181

Page 232: Computational Synthetic Biology

/183

Acknowledgements

•Jonathan Blake

•Claudio Lima

•Francisco Romero-Campero

•Karima Righetti

•Jamie Twycross

Integrated Environment

Machine Learning & Optimisation

Modeling & Model Checking

Molecular Micro-Biology

Stochastic Simulations

Members of my team working on SB2

EP/E017215/1

EP/H024905/1

BB/F01855X/1

BB/D019613/1

University of NottinghamProf. M. Camara, Dr. S. Heeb, Dr. G. Rampioni, Prof. P. WilliamsWeizmann Institute of ScienceProf. D. Lancet, Prof. I. Pilpel

GECCO 2010 organisers for inviting this tutorial.

You for listening!

182

Page 233: Computational Synthetic Biology

/183

Any Questions?

www.synbiont.orgBecome a member and have access to a largeinternational community of Synthetic Biologists

183