21
A C2B2 Activity! aps surprisingly, a concise definition of systems biology that most agree upon has yet to emerge. (Ruedi Aebersold, Faculty Member ISB)

A Systems and Structural Biology approach to the dissection of complex biological phenotypes

  • Upload
    gloria

  • View
    40

  • Download
    1

Embed Size (px)

DESCRIPTION

A C2B2 Activity!. A Systems and Structural Biology approach to the dissection of complex biological phenotypes. Perhaps surprisingly, a concise definition of systems biology that most of us Can agree upon has yet to emerge. ( Ruedi Aebersold, Faculty Member ISB ). MAGNet: Organization. - PowerPoint PPT Presentation

Citation preview

Page 1: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

A C2B2 Activity!

Perhaps surprisingly, a concise definition of systems biology that most of usCan agree upon has yet to emerge. (Ruedi Aebersold, Faculty Member ISB)

Page 2: A Systems and Structural Biology approach to the dissection of complex biological phenotypes
Page 3: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Andrea Califano, PIBarry Honig, co-PI

Core 1Coord: D. Waltz

C. FriedmanG. KaiserC. LeslieY. OfranK. RossR. ServedioC. WigginsY. Yemini4 Postdoc/GRA

Executive Steering CommmitteeA.Califano, B.Honig, D.Waltz, D.Murray,

A.Floratos, S.Sechi

Executive DirectorAris Floratos

Program DirectorDan Gallahan

Scientific LiaisonSalvatore SechiValerie Florance

Admin AssistantDesi Tahiraj

Core 2Coord: D. Murray

H. BussemakerA. CalifanoAris FloratorsB. HonigY. LussierD. PetreyA. RzhetskyD. Vitkup5 Postdoc/GRA4 Developers

Core 3Coord: L. Shapiro

L. ShapiroB. HonigR. Dalla FaveraA. CalifanoC. Gilliami3 Postdoc/GRAR. Mann

Core 4Coord: P. Glick

2.5 SysadminsDBA

Core 5Coord: D. Vitkup

R. Friedman

Core 6Coord: A.Floratos

Release EngineerTech Writer

Scientific Advisory BoardJ. Wooley (Chair), S. Kasif, D. Lauffenburger,

M. Levitt, L. Stein, M. Waterman,

Contacted Alfonso Valencia, Dafne Koller, and Soren Brunak to join the SAB

Page 4: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

To study the organization of the complex networks of biochemical interactions whose concerted activity determines cellular processes at increasing levels of granularity.

To provide an integrative computational framework to organize molecular interactions in the cell into manageable context dependent components.

To develop interoperable computational models and tools that can leverage such a map of cellular interactions to elucidate important biological processes and to address a variety of biomedical applications.

Page 5: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

CORE 1 Project 1: Machine Learning Project 2: Intermediating Models for Natural Language Processing and

Ontology Project 3: Meta-Ontologies for Bioinformatics Component Interoperability Project 4: Novel Tools for Integration of Biological Database and Analyses Project 5: GenAtomy: Algorithmic Approaches for the Integration of Genetic

Variation and Expression Data in the Study of Regulation

CORE 2 Project 1: Sequence and Structure Based Annotation of Protein Interactions Project 2: Reverse Engineering Gene Regulatory Networks from Genomic Data Project 3: Sharing, Querying, Filtering, and Classifying Heterogeneous

Phenoypes Project 4: geWorkbench

CORE 3 Project 1: Structural and Energetic Basis of Cadhering Binding Specficity Project 2: Regulatory Modules in Normal and Transformed B Cells Project 3: Genetic Determinants of Common Heritable Disorders Project 4: Understanding and Predicting Transcription Factor Specificity

Page 6: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Th

e E

volu

tion o

f Syst

em

s B

iolo

gy

<>

Regulatory Regulatory Network: Network:

Mesenchymal Mesenchymal Signature of Signature of High-grade High-grade

GliomaGlioma

Disease Models

Physiologic / PathologicPhenotype Regulation

Signaling

Transcriptional

Protein-Protein Complexes

Literature

StructureMol. Profiles

Evolution

Topology

Dynamics

Genomic

Page 7: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Identification of Transcription Factor (TF) targetsIdentification of Transcription Factor (TF) targets Identification of Protein-Protein Interactions in ComplexesIdentification of Protein-Protein Interactions in Complexes Identification of post-translational TF modifiers (Kinases, …)Identification of post-translational TF modifiers (Kinases, …) Identification of post-transcriptional TF modifiers (miRs)Identification of post-transcriptional TF modifiers (miRs) Polymorphysms’ role in molecular interactionsPolymorphysms’ role in molecular interactions

Can it be done computationally from high-throughput data?Can it be done computationally from high-throughput data? Can it be done accurately?Can it be done accurately? Can it be biochemically/biologically validated?Can it be biochemically/biologically validated?

TranscriptionTranscriptionFactorFactor

Target 1Target 1

RNARNARNARNA

PP

MMKinaseKinase

DegradationDegradationRNARNA RNARNA

Target 2Target 2

PP PP ComplexComplex

Target 3Target 3

miRmiR

PolymorphysmsPolymorphysms

Page 8: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Transcriptional Networks ARACNe: Transcriptional Networks (Mammalian) REDUCE: Transcriptional Networks (Yeast/Drosophila) MEDUSA: Transcriptional Programs (Yeast/Drosophila)

Signaling/Co-factors/SNPs MINDy: Signaling/co-factor Networks (Mammalian) Genatomy: Transcription/Variability Networks (Yeast)

MicroRNA Pipeline for the inference of microRNA from deep sequencing data Inference of microRNA Regulatory Targets

Structure MarkUs: Functional Annotation of Protein Surfaces Skyline: Homology-based Structural Protein Models

Page 9: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Interactome Assembly and Analysis GeneWays: Literature Based, Integrated Interactomes Mixed Bayesian Interactomes: Prot.-Prot./Prot.-DNA/microRNA-

mRNA B cells, Breast Cancer, Acute Myeloid Leukemia, Follicular

Lymphoma, etc... IDEA: Interactome Dysregulation Enrichment Analysis

Oncogenic Lesions, Drug/Biochemical Perturbations MOA TIPhEn: Target Identification for Phenotypic Endpoint

Topological Properties NetClass: Identification of Generative Models InfoMod: Modularity Analysis

(Many others …)

Page 10: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Integrated Genomics Platform Support for gene expression data,

sequences, pathways, structure, etc. (40+ visualization and analysis modules).

Access to local and remote data sources and analytical services.

Support for workflow scripting. Integration with caGRID.

Based on caWorkbench, an NCI/caBIG-funded effort

Open source, Java based platform

Development framework Open source development. Modular/extensible architecture, supporting

pluggable components with configurable user interface.

Formal (caBIG-registered) data models for multitude of bioinformatics concepts.

Easy integration of 3rd party components.

Page 11: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

DBPs: The molecular basis of cell-cell adhesion (Cadherin specificity) Dysregulated pathways in lymphomagenesis Polymorphisms dysregulated pathways in neurodegenerative diseases (U.Chicago) Combinatorial transcriptional regulation in early drosophila development.

Large Projects (Collaborative Projects RFA) caBIG Knowledge Center for geWorkbench, caIntegrator, and GenePattern. (joint with

Broad). The Serious Adverse Event Consortium (SAEC) Cancer Center P30: Excellent (Biomedical Informatics Shared Resource: Outstading)

Collaborative Projects Genome wide Dissection of metabolic pathways (Los Alamos) microRNA role in Prostate Cancer (MSK) The molecular basis of addiction (Scripps) (R01 in funding range)

Minority Supplement (recommended for funding) Others

Next-generation mouse models of prostate cancer progression and metastasis (submitted to the PCF), Systems-Based Dissection of Breast Cancer with Poor Prognosis (submitted to Avon), Master regulators of the Mesenchymal transformation of Glioblastoma Multiforme (R01 submitted), Transcriptional Networks determining stem cell differentiation (MSK), DLBCL Connectivity Map (BROAD), Methotraxate and PDX mechanism of action in T cell lymphoma (CC Pilot).

Page 12: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Jake M. Hofman, Chris H. Wiggins. A Bayesian Approach to Network Modularity. 2007. Submitted; URL

Lawrence David, Chris H. Wiggins. Benchmarking of Dynamic Bayesian Networks Inferred From Stochastic Time-Series Data. Annals of The New York Academy of Sciences, 2007. In press.

Etay Ziv, Ilya Nemenman, Chris H. Wiggins. Optimal Signal Processing in Small Stochastic Biochemical Networks. PLoS ONE, 2(10):e1077, Oct 2007. URL http://dx.doi.org/10.1371%2Fjournal.pone.0001077.

Kundaje A, …, Leslie C, and Zhang L. A predictive model of the oxygen and heme regulatory network in yeast. In review.

Kundaje A., …, Wiggins C, …, and Leslie C. Learning regulatory programs that accurately predict differential expression with MEDUSA. Annals of the New York Academy of Sciences, in press.

Christian Murphy, …, Gail Kaiser and Adam Cannon. Backstop: Debugging Tools for Novice Java Programmers. To appear in ACM SIGCSE Technical Symposium on Computer Science Education, March 2008.

Christian Murphy, Gail Kaiser and Marta Arias. Parameterizing Random Test Data According to Equivalence Classes. 2nd International Workshop on Random Testing, November 2007.

Rean Griffith, …, and Gail Kaiser. The Role of Reliability, Availability and Serviceability (RAS) Models in the Design and Evaluation of Self-Healing Systems. 3rd International Conference on Self-Organization and Autonomous Systems in Computing and Communications, September 2007

Rean Griffith, …, and Gail Kaiser. RAS Models: A Building Block for Self-Healing Benchmarks. 8th International Workshop on Performability Modeling of Computer and Communication Systems, September 2007.

Christian Murphy, Gail Kaiser and Marta Arias. An Approach to Software Testing of Machine Learning Applications. 19th International Conference on Software Engineering and Knowledge Engineering, July 2007.

Stoyanovich J et al., "EntityAuthority: Semantically Enriched Graph-Based Authority Propagation". Proceedings of the Tenth International Workshop on the Web and Databases (WebDB) June 2007.

Page 13: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Pe'er I and J Stoyanovich. "MutaGeneSys: Making Diagnostic Predictions Based on Genome-Wide Genotype Data in Association Studies", Columbia University technical report CUCS-012-07, 2007.

Ross KA et al. "A Faceted Query Engine Applied to Archaeology". Internet Archaeology 21, April 2007. Kenneth A. Ross and Julia Stoyanovich. "Schema Polynomials and Applications". Proceeding of the 11th International Conference on Extending Database Technology (to appear in March 2008).

Itsik Pe’er et al., “Estimation of the Multiple Testing Burden for Genomewide Association Studies of Nearly All Common Variants” Accepted Genetic Epidemiology.

Nair, R. and Rost, B. (2007) Predicting protein subcellular localization using intelligent systems. In Leon, D. and Markel, S. (eds), In Silico Technology in Drug Target Identification and Validation. Marcel Dekker.

Rost, B. (2007) Prediction of protein structure in 1D - Secondary structure, membrane regions, and solvent accessibility. In Bourne, P.E. and Weissig, H. (eds), Structural Bioinformatics - 2nd Edition. Wiley.

Schlessinger, …, and Rost, B. (2007) Natively unstructured loops differ from other loops, PLoS Comput Biol, 3, e140.

Schlessinger, …, and Rost, B. (2007) Natively unstructured regions in proteins identified from contact predictions, Bioinformatics, 23, 2376-2384.

Fasnacht, M., ... and Honig, B. (2007) Local Quality Assessment in Homology Models Using Statistical Potentials and Support Vector Machines. Prot. Sci. 6:1557-1568.

Boorsma, X, …, and H.J. Bussemaker. Inferring transcription factor co-modulation networks through regulon-based expression analysis. Proceedings of RECOMB Sattellite Meeting on Systems Biology, San Diego, CA, November 2007.

Stolovitzky GA, Monroe D, and Califano A (2007), Dialogue on Reverse Engineering Assessment and Methods: the DREAM of high throughput pathway inference, Ann NY Acad Sci. 2007 Oct 9.

Margolin AA, and Califano A (2007), Theory and limitations of genetic network inference from microarray data, Ann NY Acad Sci. 2007 Oct 9;

Page 14: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Mani K, …, Dalla-Favera R, and Califano A, (2007) A Systems Biology Approach to the Prediction of Causal Oncogenic Mechanisms and Drug Mechanism-of-Action Profiles in Cancer Phenotypes, Molecular Systems Biology, in press.

Wang K, …, Dalla-Favera R, and Califano A, (2007), Genome-wide identification of transcriptional network modulators in human B cells, submitted to Nature.

Carro MS, …, Califano A*, Iavarone A* (2007), A transcriptional regulatory network initiates and maintains the mesenchymal phenotype of human malignant glioma, submitted to Science.

Margolin AA, …, Califano A*, and Stolovitzky G* (2007), ChIP-on-chip significance analysis reveals large scale transcription factor activity, submitted to Nature Biotechnology.

Lussier YA* and Bodenreider O*. Clinical Ontologies for Discovery Applications. P.101-119 Chritopher Baker and Kei Cheung, Editors. Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences. Springer Verlag 2007, XXII, 450 p.

Lussier YA. Ontology Similarity Networks and Database Interoperability. Interface 2007, May 23-26, Philadelphia.

Tao Y, Li J, Friedman C*, Lussier YA*. Information Theory Applied to the Sparse Gene Ontology Annotation Network to Predict Novel Gene Function. Bioinformatics, 2007, 23(13)i529-38.

Tao Y, Patel C, Friedman C, Lussier YA*. PIE: A Phenotype Interface Engine for Automated Representation in PATO and “Semantic Web”-Based Exchange of Clinical Data and Narratives. AMIA Annu Symp Proc. 2007 (in press).

Payne PRO, Lussier YA et al. Re-engineering Translational Research: Are We Taking A Truly Translational Approach? (Submitted to the AMIA Summit in Biomedical Informatics).

Liu Y, …, Lussier YA*. Robust Methods for Accurate Diagnosis Using Microbiological Arrays. (Submitted to the AMIA Summit in Biomedical Informatics).

Sam L. , …, Friedman C and Lussier YA*. PhenoGO: A Resource for Multiscale Biological Data Integration. (Submitted to the AMIA Summit in Biomedical Informatics).

Page 15: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Ciatto, C., … , Honig, B. and Shapiro, L. Crystal structures and adhesive binding mechanism of T-cadherin. Structure, submitted.

Koehnke, J, …, Honig, B., …. and Shapiro, L., Crystal structures of the neureoligin-binding domains from neurexin 1 and neurexin 2. Structure, in press.

Koehnke, J,… , Honig, B. and Shapiro, L., Crystal structure of the ectodomain from Nlgn2. PNAS, in press.

Posy, S., Shapiro, L., and Honig, B., J. Mol. Biol, submitted.

J. Sebat, …, Gilliam TC, …, and M. Wigler (2007). Strong association of de novo copy number mutations with autism. Science, 316: 445-449.

Autism Genome Project Consortium. (2007). Mapping autism risk loci using genetic linkage and chromosomal rearrangements. Nat. Genet. 39: 319-328.

Ivan Iossifov, …, Gilliam TC and A Rzhetsky. Genetic-linkage mapping of complex hereditary disorders to a whole-genome molecular-interaction network. Submitted.

R.A. Kumar, …, Gilliam TC, et al. Discovery and Characterization of a Recurrent 6p11.2 Microdeletion Identified in 0.6% of Autism Patients. In review (Nature Genetics )

RA Kumar, …, Gilliam TC, et al. (2008). Recurrent 16p11.2 microdeletions in autism. Human Molecular Genetics, In press.

R Suresh, …, and Gilliam TC. Sex-specific interaction between ITGβ3 and SLC6A4 in Autism Spectrum Disorder. Submitted.

Wellcome Trust Case Control Consortium. (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661-678.

Rzhetsky A, et al.. (2007). Probing genetic overlap among complex human phenotypes. Proc Natl Acad Sci, 104, 11694.

Page 16: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

4 very high quality proposals

Riccardo Dalla-Favera and Andrea Califano: microRNA regulatory networks in human B cells.

Levi Garraway and Dana Pe'er: Therapeutic Opportunities in Melanomals

Antonio Iavarone and Andrea Califano: A transcriptional regulatory network for the mesenchymal phenotype in malignant gliomas.

Richard Losick and Dennis Vitkup: Computational and experimental analysis of bacterial developmental (sporulation) network.

Full proposals due on April 30th Review and funding decision by Aug. 1st

Page 17: A Systems and Structural Biology approach to the dissection of complex biological phenotypes
Page 18: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

http://www.nyas.org/ebriefreps/splash.asp?intEbriefID=705

Page 19: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

Wed Oct 295pm

Sun Nov 21pm

Martha Bulyk (RG), Andrea Califano (SB), Manolis Kellis (RG), Gustavo Stolovitzky (DR)

Page 20: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

BroadInstitute

CSAIL

Cambridge MA

Page 21: A Systems and Structural Biology approach to the dissection of complex biological phenotypes

We have created the infrastructure in Years 1-3 Over the next 7 years, we can concentrate on the

science!