27
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University

Inferring Quantitative Models of Regulatory Networks From Expression Data

  • Upload
    gomer

  • View
    19

  • Download
    0

Embed Size (px)

DESCRIPTION

Inferring Quantitative Models of Regulatory Networks From Expression Data. Aviv Regev Harvard. Nir Friedman Hebrew University. Iftach Nachman Hebrew University. Conditions. Genes. Common approach: Interaction Networks Different semantics for networks - PowerPoint PPT Presentation

Citation preview

Page 1: Inferring Quantitative Models of Regulatory Networks From Expression Data

Inferring Quantitative Models of Regulatory Networks From

Expression Data

Iftach NachmanHebrew University

Aviv RegevHarvard

Nir FriedmanHebrew University

Page 2: Inferring Quantitative Models of Regulatory Networks From Expression Data

Goal: Reconstruct Cellular Networks

Biocarta. http://www.biocarta.com/

Structure Function Dynamics

Conditions

Ge

ne

s

Common approach:Interaction Networks

Different semantics for networks Boolean, probabilistic, differential

equations, …

Page 3: Inferring Quantitative Models of Regulatory Networks From Expression Data

A Major Assumption…

mRNA tr. rate

protein

active protein

mRNA

mRNA degradation

TF

G

TF

G

TF

G

TF

Activation signal

Hidden

mRNA

Observed

Page 4: Inferring Quantitative Models of Regulatory Networks From Expression Data

Realistic Regulation Modeling Model the closest

connection

Active protein levels are not measured

Transcript rates are computed from expression data and mRNA decay rates

Realistic biochemical model of transcription rates

TF

G

TF

G

TFHidden

Observed

proteinmRNA

mRNA tr. rate

active protein

Activation signal

mRNA degradation

HiddenObserved

Page 5: Inferring Quantitative Models of Regulatory Networks From Expression Data

OnOff

Modeling Transcription Rate

Simplest case: one activator

G

TF

mRNA transcripts

G

TF

On

[McAdams & Arkin, 1997; Ronen et al, 2002]

P( )• Avg rate = + P( )•

Page 6: Inferring Quantitative Models of Regulatory Networks From Expression Data

Modeling Transcription Rate

Steady state equations:G

TF

1tf][S[S]

[S][tf]κtf][Sκ bd

Concentration of free promoters

Concentration of bound promoters

Concentration of TF

[tf]1[tf]

[tf])| P(

[tf]11

[tf])| P(

][1][

([tf])Ratetf

tf

0

db

d

b

κκ

Page 7: Inferring Quantitative Models of Regulatory Networks From Expression Data

Modeling Transcription Rate

G

TF

0

db

= 1

= 4 = 20

= 250

TF activity

Tra

nsc

rip

tio

n

rate

TF

acti

vity

Time

= 1

= 4 = 20 = 250

Tra

ns

rate

Time

][1][

([tf])Ratetf

tf

Page 8: Inferring Quantitative Models of Regulatory Networks From Expression Data

[Buchler et al, 2003; Setty et al, 2002]

General Two Regulator FunctionTF2TF1

G

0 0.2 0.4 0.6

a

b

c

d

P(State)

a

b

c

d

1

X

2

G

TF

Similar models for other modes of binding: Competitive binding Cooperative binding

Page 9: Inferring Quantitative Models of Regulatory Networks From Expression Data

0 0.2 0.4 0.6

a

b

c

d

P(State)

0 0.2 0.4 0.6

a

b

c

d

P(State)

General Two Regulator FunctionTF2TF1

G

b= 0

a= 0

c= 0

d =1

b= 1

c= 1

b

a

c

d

X

X

X

X

= Average Rate

Rate

“AND” gate“OR” gate

a

b

c

d

[Buchler et al, 2003; Setty et al, 2002]

Avg rate = function of TF concentrations

Few parameters: Affinity parameters Rate parameters

Page 10: Inferring Quantitative Models of Regulatory Networks From Expression Data

Models of Regulatory NetworksRegulators(activity)

Target Genes(trans. rate)

G4

TF2TF1

G3G2G1

TF3

G5 G6 G7

Noise

Observed rates

?Predicted rates

TF

acti

vity

Time

Tra

ns

rate

Time

Page 11: Inferring Quantitative Models of Regulatory Networks From Expression Data

Learning

Learning From Data

Transcriptionrates

Expressiondata

mRNA decay rates

Kinetic parameters

G4

TF2TF1

G3G2G1

TF1

TF2

+

Gradient

ascent

Page 12: Inferring Quantitative Models of Regulatory Networks From Expression Data

TF1

TF2

G4

TF2TF1

G3G2G1

Learning

Cell Cycle Experiment

Transcriptionrates

Expressiondata

mRNA decay rates

Kinetic parameters

+

Biological Databases [YPD]

ChIP location [Lee et. al]

7 regulators & 141 target genes

Cell cycle gene expression

[Spellman et. al]

+

mRNA decay rates[Wang et al] Transcription rates

Page 13: Inferring Quantitative Models of Regulatory Networks From Expression Data

M/G1

G1

S

S/G2

G2/M

predictionsinput

par

amet

ers

0

2

1

Cell Cycle Experiment

17x141 = 2397Data points

466parameters

17x7 = 119Regulator activity

values

Page 14: Inferring Quantitative Models of Regulatory Networks From Expression Data

G1 G2 G1 G2

FKH1 FKH2

G1 G2 G1 G2

SWI5

ACE2

Regulator Activity Profiles

When are they active?

Known biology: SWI4 & MBP1:

mid-late G1 FHK1: S/G2 FKH2: G2/M SWI5: M/G1

G1 G2 G1 G2

MBP1

SWI4

Reconstructed activity profiles match direct experimental knowledge

Page 15: Inferring Quantitative Models of Regulatory Networks From Expression Data

Regulator Activity Profiles

When are they active? Could we reconstruct these

from mRNA profiles?

Known biology: SWI5 is transcriptionally

regulated MCM1 is notRegulator’s own mRNA is not sufficient to

reconstruct activity levels

mRNA profile

SWI5

Activity

mRNA

MCM1

Activity

mRNA

Page 16: Inferring Quantitative Models of Regulatory Networks From Expression Data

M/G1

G1

S

S/G2

G2/M

input predictions

Cell Cycle Experiment

How well are we doing?

residue

1predictedinput

r

Page 17: Inferring Quantitative Models of Regulatory Networks From Expression Data

ModelLearning

ab initio Learning

Transcriptionrates

Learning

Expressiondata

mRNA decay rates

Kinetic parameters

G4

TF2TF1

G3G2G1

TF1

TF2

+Big assumption: Network topology is given Unrealistic, even for well understood

systems

+

Challenge:

Reconstruct network topology? Number of regulators Their joint effect on target genes

Page 18: Inferring Quantitative Models of Regulatory Networks From Expression Data

How Do We Learn Structure?Standard approach: hill climbing search

G4

TF2TF1

G3G2G1

G4

TF2TF1

G3G2G1G4

TF2TF1

G3G2G1

G4

TF2TF1

G3G2G1

G4

TF2TF1

G3G2G1

G4

TF2TF1

G3G2G1

-17.23

-23.13-19.19

G4

TF2TF1

G3G2G1

TF3

Problem:

Scoring structures is costly Requires non-linear parameter

optimization Impractical on real data

Page 19: Inferring Quantitative Models of Regulatory Networks From Expression Data

Pred(G|TF,Y)

Ideal regulator

TimePred(G|TF)

TF

G

Y

Step 1:Compute optimal

hypothetical regulator

Time

reg

ula

tors

Step 2:Search for

“similar” regulator

TF1

TF2

TF3

TF4

Activity level

Target Profile

Ideal Regulator MethodGoal: Consider adding edges

Idea: Score only promising candidates

Page 20: Inferring Quantitative Models of Regulatory Networks From Expression Data

Parent(s) activity

Predicted(G|TF,TF2)

Time

reg

ula

tors

TF1

TF2

TF3

TF4

Step 3:Add new parent

and optimize parameters

Time

Step 1:Compute optimal

hypothetical regulator

Step 2:Search for

“similar” regulator

Pred(G|TF,Y)

Ideal regulator

Y

Target Profile

TF

G

TF2

Crucial point: Choice of similarity measure Principled approach see [Nachman et al UAI04]

Provides approximation to Δlikelihood

Ideal Regulator MethodGoal: Consider adding edges

Idea: Score only promising candidates

Page 21: Inferring Quantitative Models of Regulatory Networks From Expression Data

New regulator: “centroid” of selected ideal regulators

Adding New Regulator

Ideal regulators

Idea: Introduce hidden regulator for geneswith similar ideal regulator

TFnew

G1 G2 G4

G1

G2

G3

G4

G5

Y1

Y2

Y3

Y4

Y5

Time

Page 22: Inferring Quantitative Models of Regulatory Networks From Expression Data

M/G1

G1

S

S/G2

G2/M

Inputrates

0

2

1

Curatedprior knowledge

466 params

ab initiofrom scratch461 params

Ab initio Structure Learning

Page 23: Inferring Quantitative Models of Regulatory Networks From Expression Data

Inputrates

Curatedprior knowledge

466 params

ab initiofrom scratch461 params

M/G1

G1

S

S/G2

G2/M

0

2

1

0

200

400

600

800

1000

1200

1400

1600

Curated Ab Initio

log likelihood

BIC

Ab initio Structure Learning

Page 24: Inferring Quantitative Models of Regulatory Networks From Expression Data

0 20 40 60 80 100 120

H2

SWI5

H4SWI4

Significant target overlap & correlated activity

Significant target overlap & weak correlation

H1

MBP1

H3

FKH2

curated

ab initio

targetgenes

regulators

regulators

Regulators: ab initio vs. curated

H1 H2H4 H3H5 H6 H7

SWI4 MBP1 ACE2 FKH1 SWI5 MCM1 FKH2

Page 25: Inferring Quantitative Models of Regulatory Networks From Expression Data

curated

ab initio

targetgenes

regulators

regulators

Significant agreement with “known” topology Both in structure & dynamics

Improved predictions

Regulators: ab initio vs. curatedSWI4 MBP1 ACE2 FKH1 SWI5 MCM1 FKH2

H1 H2H4 H3H5 H6 H7

Page 26: Inferring Quantitative Models of Regulatory Networks From Expression Data

ModelLearning

Conclusions

Kinetic parameters

G4

TF2TF1

G3G2G1

TF1

TF2

+

+

Transcriptionrates

Network(prior knowledge)

G4

TF2TF1

G3G2G1

Realistic model, based on first principles

Learning procedure Reconstruct unobserved activity profiles Reconstruct network topology

Insights into Structure & Dynamics Function

Page 27: Inferring Quantitative Models of Regulatory Networks From Expression Data

Future Directions

Prior knowledge ChIP location Cis-regulatory

elements

External perturbations

Internal feedback

G4

TF2TF1

G3G2G1

TF3

G5 G6 G7