27
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network construction from RNAi data Tamer Kahveci

CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

  • Upload
    alodie

  • View
    38

  • Download
    0

Embed Size (px)

DESCRIPTION

CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014. Network construction from RNAi data Tamer Kahveci. Signaling Networks. MAPK network. Signal reachability. Luciferase. Reporter. Receptor. Signaling and RNA Interference. Luciferase. X. Not critical. X. Reporter. - PowerPoint PPT Presentation

Citation preview

Page 1: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

1

CIS 4930/6930 – Recent Advances in Bioinformatics

Spring 2014

Network construction from RNAi data

Tamer Kahveci

Page 2: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Signaling Networks

2

MAPK network

Page 3: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Signal reachability

3

Receptor Reporter

Luciferase

Page 4: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Signaling and RNA Interference

4

Receptor Reporter

Luciferase

X Not critical

X Critical

Page 5: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Signaling Network Reconstruction from RNAi data

5

Receptor Reporter

Not criticalCritical

Page 6: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

RNAi data and Reference Network

6

Receptor Reporter

Not criticalCritical

Reference networkInsert

Delete

Not consistent !Consistent

!

Page 7: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Overview

7

GR = (VR, ER)

Reference network

Constraints

1 10 0 0

GT = (VT, ET)

Target network

1 0

SiNeC (Signal Network Constructor)

S-SiNeC (Scalable Signal Network Constructor)

Given Fin

d

Goal: Minimize the number of edit

operations to make the reference

consistent.

NP-Complete !

Page 8: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

SiNeC algorithm

Three steps

1. Order the critical genes left to right based on the topology of GR. [Sloan, 1986]

– v1, v2, …, vc

2. Edge deletion phase

3. Edge insertion phase

8

Page 9: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Step 1: Order critical genes

9

Receptor Reporter3

1

2

Prioritize based on distance to the reporter + degree

Page 10: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Step 2: Edge deletion

10

Purpose: Eliminate detours around critical genes

Receptor Reportervi vkvj

• Find all (undesirable) paths between non-consecutive critical genes.

• i.e., Paths which go through only noncritical genes• Edges are weighted with the number of such paths they

belong to.• Remove greedily starting from the largest weight until al

paths are disrupted.

Bypassed !!!

Page 11: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Step 3: Edge insertion

11

Purpose: Make sure that critical are connected + noncritical genes are consistent

Receptor Reportervi-1 vi+1vi

Insert an edge from vi-1 to vi if 1. There is no path from vi-1 to vi.2. There is a noncritical gene on all paths from vi-1

to vi.

Page 12: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Overview

12

GR = (VR, ER)

Reference network

Constraints

1 10 0 0

GT = (VT, ET)

Target network

1 0

SiNeC (Signal Network Constructor)

S-SiNeC (Scalable Signal Network Constructor)

Given Fin

d

Finding all the paths can be too tim

e

consuming for large networks

Page 13: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC algorithm

13

Edge insertion0 0 0 None0 0 1 None0 1 0 None0 1 1 A1

1 0 0 A2 + A3 + A4

1 0 1 A2 + A4

1 1 0 A3 + A4

1 1 1 A4

Criti

cal

Left

reac

habl

e

Righ

t rea

chab

le

Edge deletion

Reference network

vs vtvi

Page 14: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC: Edge insertion (A1)

14

Reference network

vs vtvi

L R

Purpose: Make sure that noncritical genes are consistent

Page 15: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC: Edge insertion (A2)

15

Reference network

vs vtvi

L R

Purpose: Make sure that critical genes are left reachable

Page 16: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC: Edge insertion (A3)

16

Reference network

vs vtvi

L R

Purpose: Make sure that critical genes are right reachable

Page 17: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC: Edge insertion

17

L/R e1 e2 e3

1 X2 X X3 X X4 X X5 X X6 X X7 X8 X

Page 18: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

S-SiNeC: Edge deletion (A4)

18

Reference network

vs vtvi

L R

Purpose: Make sure that no detours exist around critical genes

Solve minimum cut between L &

R

Page 19: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Dataset

19

• Reference networks are obtained by random edge shuffling at 5% to 40% mutation rates.

• 200 references per target network & per mutation rate.

Page 20: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Average distance to the true network

20

Page 21: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Accuracy based on edge class

21

vs vt

Hot

Cold

Page 22: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Running time results

22SiNeC > 1 hour per reference network.

Page 23: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Success rate on constraints

23

Page 24: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

24

Accuracy

Page 25: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

25

Functional Enrichment of the Pathway

Page 26: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Last Remarks

• Constructing very large signaling networks from RNAi data is possible in practical running time.

• Both SiNeC and S-SiNeC are robust to errors in reference network.

• We recommend– S-SiNeC for very large OR dense networks.– SiNeC otherwise.

26

Page 27: CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014

Acknowledgements

27

CCF - 0829867 IIS - 0845439260429