14
STRING Modeling of biological systems through cross-species data integration Lars Juhl Jensen EMBL Heidelberg

STRING - Modeling of biological systems through cross-species data integration

Embed Size (px)

DESCRIPTION

12th European Congress on Biotechnology, Technical University of Denmark, Lyngby, Denmark, August 21-24, 2005

Citation preview

Page 1: STRING - Modeling of biological systems through cross-species data integration

STRINGModeling of biological systems through

cross-species data integration

Lars Juhl JensenEMBL Heidelberg

Page 2: STRING - Modeling of biological systems through cross-species data integration

Jensen et al., Drug Discovery Today: Targets, 2004

Page 3: STRING - Modeling of biological systems through cross-species data integration

de Lichtenberg et al., Science, 2005

Page 4: STRING - Modeling of biological systems through cross-species data integration

TOM

MRPLRibosome Related

MRPS

Vacuolar Acidification

Fatty Acid Biosynth.

Secondary RCC_Asy

RCC_Asy

RCCII

RCCIV

RCCVRCC_Asy

HAP Complex

Arg Biosynth.

PDH/KGD/GCV

Cell Wall & pH Reg.

DNA Repair

Glucose sensing and CH remodelling

APC

Fission/Fusion

rRNAProcessing

mRNAProcessing

TFIIICComplex

m-AAA Complex

TCA Cycle

Iron Homeostasis/Chaperone Activity

RCCI

rRNAProcessing

Leu/Val/IleBiosynth.

DNARepair

GARP Complex

Cytosolic Ribosome

TIM

RCC_Asy

Actin

tRNA Splicing

RCCIII

NUP

Replication/ DNA Repair

Page 5: STRING - Modeling of biological systems through cross-species data integration

TOM

MRPLRibosome Related

MRPS

Vacuolar Acidification

Fatty Acid Biosynth.

Secondary RCC_Asy

RCC_Asy

RCCII

RCCIV

RCCVRCC_Asy

HAP Complex

Arg Biosynth.

PDH/KGD/GCV

Cell Wall & pH Reg.

DNA Repair

Glucose sensing and CH remodelling

APC

Fission/Fusion

rRNAProcessing

mRNAProcessing

TFIIICComplex

m-AAA Complex

TCA Cycle

Iron Homeostasis/Chaperone Activity

RCCI

rRNAProcessing

Leu/Val/IleBiosynth.

DNARepair

GARP Complex

Cytosolic Ribosome

TIM

RCC_Asy

Actin

tRNA Splicing

RCCIII

NUP

Replication/ DNA Repair

Protobacterialorthologs

Page 6: STRING - Modeling of biological systems through cross-species data integration

TOM

MRPLRibosome Related

MRPS

Vacuolar Acidification

Fatty Acid Biosynth.

Secondary RCC_Asy

RCCII

RCCIVRCCV

RCC_Asy

HAP Complex

Arg Biosynth.

PDH/KGD/GCV

Cell Wall & pH Reg.

DNA Repair

Glucose sensing and CH remodelling

APC

Fission/Fusion

rRNAProcessing

mRNAProcessing

TFIIICComplex

m-AAA Complex

TCA Cycle

Iron Homeostasis/Chaperone Activity

RCCI

rRNAProcessing

Leu/Val/IleBiosynth.

DNARepair

GARP Complex

Cytosolic Ribosome

TIM

RCC_Asy

Actin

tRNA Splicing

RCCIII

NUP

Replication/ DNA Repair

Human diseaseorthologs

RCC_Asy

Page 7: STRING - Modeling of biological systems through cross-species data integration

Genomic neighborhood

Species co-occurrence

Gene fusions

Database imports

Exp. interaction data

Microarray expression data

Literature co-mentioning

von Mering et al., Nucleic Acids Res., 2005

Page 8: STRING - Modeling of biological systems through cross-species data integration

Restingprotuberances

Protractedprotuberance

Cellulose

© Trends Microbiol, 1999

CellCell wall

Anchoring proteins

Cellulosomes

Cellulose

The “Cellulosome”

Page 9: STRING - Modeling of biological systems through cross-species data integration
Page 10: STRING - Modeling of biological systems through cross-species data integration

Jensen et al., to appear in Nat. Rev. Genet., 2005

Page 11: STRING - Modeling of biological systems through cross-species data integration

Networ-Kin™

Phospho-peptidedata (from MS)

Predict the kinase class(NetPhosK and Scansite)

Page 12: STRING - Modeling of biological systems through cross-species data integration

Summary

• Quality control is crucial for large-scale data integration– The raw data sets from high-throughput experiments are dirty– Scoring, benchmarking, and filtering can greatly improve the

quality

• Data integration should be done across multiple species

• Automated literature mining methods are (finally) maturing– Restricted types of information can be extracted with high precision– Text mining methods can lead to novel discoveries

• Protein networks are more than just pretty pictures– Highly specific hypotheses can be made from high-quality

networks

Page 13: STRING - Modeling of biological systems through cross-species data integration

Acknowledgments

• The STRING team– Christian von Mering

– Berend Snel

– Martijn Huynen

– Daniel Jaeggi

– Steffen Schmidt

– Sean Hooper

– Mathilde Foglierini

– Julien Lagarde

– Peer Bork

• Text mining project– Jasmin Saric

– Rossitza Ouzounova

– Isabel Rojas

• Networ-Kin– Rune Linding

– Tony Pawson

• Analysis of yeast mitochondria– Fabiana Perocchi

– Lars Steinmetz

• Analysis of yeast cell cycle– Ulrik de Lichtenberg

– Thomas Skøt

– Anders Fausbøll

– Søren Brunak

Page 14: STRING - Modeling of biological systems through cross-species data integration

Thank you!