A PARALLEL ALGORITHM FOR EXTRACTING TRANSCRIPTIONAL REGULATORY NETWORK MOTIFS Fu Rong Wu

OUTLINE

Preliminary
Previous Work
Method
Experimental Result
Conclusion

BIOLOGICAL MOTIFS

Sequence motif
  a sequence pattern of nucleotides in a DNA sequence or of amino acids in a protein

Structural motif
  a pattern in a protein structure formed by the spatial arrangement of amino acids

Network motif
  a pattern (subgraph) that recurs within a network much more often than expected at random

TRANSCRIPTIONAL REGULATORY NETWORK

Describes the interactions between transcription factor proteins and the genes that they regulate

BIOLOGICAL NETWORK MOTIFS EXAMPLE

Autoregulation (AR)

Feed Forward Loops (FFL)

Regulating and Regulated Feedback Loops (RFL)

BiFan

Diamond
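
As a rough illustration of how some of these motifs can be recognized in a directed edge list, the sketch below counts autoregulation, feed-forward loop, and bifan occurrences. The function names and the toy edges are hypothetical, and the counts are of non-induced occurrences (extra edges between the same nodes are ignored); this is not the algorithm presented in these slides.

```python
from math import comb
from itertools import combinations


def successors(edges):
    """Map each regulator to the set of nodes it regulates (self-loops dropped)."""
    succ = {}
    for u, v in edges:
        if u != v:
            succ.setdefault(u, set()).add(v)
    return succ


def count_autoregulation(edges):
    """Autoregulation: a node with an edge to itself."""
    return len({u for u, v in edges if u == v})


def count_feed_forward_loops(edges):
    """Feed-forward loop: a->b, b->c and a->c for distinct a, b, c."""
    succ = successors(edges)
    count = 0
    for a, targets in succ.items():
        for b in targets:
            for c in succ.get(b, ()):
                if c != a and c in targets:
                    count += 1
    return count


def count_bifans(edges):
    """Bifan: two regulators that both regulate the same two targets."""
    succ = successors(edges)
    count = 0
    for a, b in combinations(sorted(succ), 2):
        shared = (succ[a] & succ[b]) - {a, b}
        count += comb(len(shared), 2)
    return count


# Hypothetical toy network, not taken from the slides.
toy_edges = [("x", "x"), ("x", "y"), ("y", "z"), ("x", "z")]
toy_bifan = [("a", "c"), ("a", "d"), ("b", "c"), ("b", "d")]
```

On the toy data, `toy_edges` contains one self-loop and one feed-forward loop, and `toy_bifan` contains one bifan.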

PREVIOUS WORK

Exhaustive search algorithm
  Runtime increases dramatically for subgraphs of size ≥ 4.
  Impractical for finding high-order motifs because of its time complexity.

Random sampling algorithm
  Improves the running time.
  Only estimates the frequency of subgraphs; cannot provide an exact solution.
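
To make the cost of exhaustive search concrete, here is a minimal sketch of a naive enumerator that tests every k-node subset for connectivity. It is an assumed baseline for illustration only (the slides do not say which exhaustive algorithm was compared), and the names `exhaustive_connected_subsets` and `candidate_count` are hypothetical. The number of candidate subsets is the binomial coefficient C(n, k), which is what makes larger k impractical.

```python
from itertools import combinations
from math import comb


def is_connected(nodes, adj):
    """Check whether the given nodes induce a connected subgraph of adj."""
    nodes = set(nodes)
    start = next(iter(nodes))
    seen = {start}
    stack = [start]
    while stack:
        u = stack.pop()
        for w in adj.get(u, ()):
            if w in nodes and w not in seen:
                seen.add(w)
                stack.append(w)
    return seen == nodes


def exhaustive_connected_subsets(adj, k):
    """Naive baseline: try every k-node subset and keep the connected ones."""
    return [s for s in combinations(sorted(adj), k) if is_connected(s, adj)]


def candidate_count(n, k):
    """Number of k-node subsets the naive baseline has to inspect."""
    return comb(n, k)


# Hypothetical undirected path graph 1-2-3-4 (adjacency sets).
path = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
```

For the path graph above, only (1, 2, 3) and (2, 3, 4) are connected 3-node subsets, out of `candidate_count(4, 3)` candidates.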

METHOD

Goal: find motifs in a given graph G(V,E)

One master processor
  Sort all nodes by degree
  Partition the nodes among the slave processors

Slave processors
  Find neighborhoods in the network
  Find subgraphs within each neighborhood
  Gather the subgraph sets to the master processor
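
The master's two steps can be sketched as follows. The slides state only that nodes are sorted by degree and partitioned; the descending order and the round-robin assignment used here are assumptions chosen to spread high-degree nodes across workers, and the function names are hypothetical. The sketch is plain Python rather than the C and MPI code used in the experiments.

```python
def sort_by_degree(adj):
    """Order nodes from highest to lowest degree (ties broken by label).

    Descending order is an assumption; the slides do not give the direction.
    """
    return sorted(adj, key=lambda v: (-len(adj[v]), v))


def partition_round_robin(nodes, num_workers):
    """Deal the sorted nodes out to workers one at a time (assumed scheme)."""
    parts = [[] for _ in range(num_workers)]
    for i, v in enumerate(nodes):
        parts[i % num_workers].append(v)
    return parts


# Hypothetical undirected graph as adjacency sets.
graph = {1: {2, 3, 4}, 2: {1, 3}, 3: {1, 2}, 4: {1}}
ordered = sort_by_degree(graph)
parts = partition_round_robin(ordered, 2)
```

Each worker would then process only the nodes in its own part and send its subgraph set back to the master.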

FINDING NEIGHBORHOODS FROM A NETWORK

REVIEW OF BFS
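
For reference, a minimal breadth-first search that records the BFS tree as a parent map. This is the textbook procedure, not a reconstruction of the slide's figure; the function name and the sample graph are hypothetical.

```python
from collections import deque


def bfs_tree(adj, root):
    """Return {node: parent} for the BFS tree rooted at root."""
    parent = {root: None}
    queue = deque([root])
    while queue:
        u = queue.popleft()
        # Visit neighbors in sorted order so the tree is deterministic.
        for w in sorted(adj.get(u, ())):
            if w not in parent:
                parent[w] = u
                queue.append(w)
    return parent


# Hypothetical 4-node cycle 1-2-4-3-1.
square = {1: {2, 3}, 2: {1, 4}, 3: {1, 4}, 4: {2, 3}}
tree = bfs_tree(square, 1)
```

Nodes are reached level by level, so each node's tree depth equals its shortest-path distance from the root.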

EXAMPLE OF BFS TREE

ALGORITHM 1 NBR(G,V)

EXAMPLE OF ALGORITHM 1

(a) A graph G with 8 nodes, labeled from 1 to 8
(b) The neighborhood of node 1 in G, Nbr(1), for motif size k = 4
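
The pseudocode of Algorithm 1 did not survive in this transcript. One plausible reading, offered only as an assumption, is that Nbr(v) for motif size k collects the nodes within k - 1 hops of v, since any connected k-node subgraph containing v lies inside that radius. The sketch below implements that reading with a depth-limited BFS; the function name and the sample graph are hypothetical and are not the graph from the slide.

```python
from collections import deque


def neighborhood(adj, v, k):
    """Nodes within k - 1 hops of v (assumed definition of Nbr(v))."""
    depth = {v: 0}
    queue = deque([v])
    while queue:
        u = queue.popleft()
        if depth[u] == k - 1:
            continue  # do not expand beyond the radius
        for w in adj.get(u, ()):
            if w not in depth:
                depth[w] = depth[u] + 1
                queue.append(w)
    return set(depth)


# Hypothetical undirected path 1-2-3-4-5-6.
chain = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4, 6}, 6: {5}}
nbr_1 = neighborhood(chain, 1, 4)
```

With k = 4 the search stops three hops from node 1, so nodes 5 and 6 of the path are excluded.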

EXAMPLE FOR ALGORITHM 2

EXAMPLE FOR ALGORITHM 3

Subgraph from (c)

EXPERIMENTAL RESULT

The cluster has 32 machines, each with two 2.4 GHz processors.

The programs are written in C with the MPI library.

Real data set of interactions between transcription factors and operons in an E. coli network from the RegulonDB database

Each protein complex of a transcription factor or a gene is represented by a node.

Precision / Recall

Given the numbers of true positives (TP), false positives (FP), and false negatives (FN):

Recall = TP / (TP + FN)
Precision = TP / (TP + FP)
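
A small worked sketch of these two formulas, comparing a set of reported items against a reference set. The function name and the sample sets are hypothetical and are not results from the experiments.

```python
def precision_recall(predicted, truth):
    """Compute (precision, recall) from two collections of items."""
    predicted, truth = set(predicted), set(truth)
    tp = len(predicted & truth)   # reported and correct
    fp = len(predicted - truth)   # reported but not in the reference
    fn = len(truth - predicted)   # in the reference but missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical example: TP = 2, FP = 1, FN = 2.
p, r = precision_recall({"a", "b", "c"}, {"b", "c", "d", "e"})
```

Here precision is 2/3 and recall is 2/4. An exact method that reports every true item and nothing else gives 1.0 for both, whereas a sampling method can miss items (lower recall) or report spurious ones (lower precision).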

For k = 6: total number 15747, motif number 22532584

CONCLUSION

This parallel algorithm can find all high-order network motifs exactly and with a short running time.

High-order motifs provide important information on biological system design.