5
Review Systems biology from a yeast omics perspective Michael Snyder * , Jennifer E.G. Gallagher Department of Genetics, 300 Pasteur Dr, Stanford University, Stanford, CA 94305, United States article info Article history: Received 23 October 2009 Revised 5 November 2009 Accepted 5 November 2009 Available online 11 November 2009 Edited by Stefan Hohmann Keywords: Systems biology Genomics Proteomics Transcriptome Phosphorylome Saccharomyces cerevisiae abstract Systems biology represents a paradigm shift from the study of individual genes, proteins or other components to that of the analysis of entire pathways, cellular, developmental, or organismal pro- cesses. Large scale studies, primarily initiated in Saccharomyces cerevisiae, have allowed the identi- fication and characterization of components on an unprecedented level. Large scale interaction, transcription factor binding and phosphorylation data have enabled the elucidation of global regu- latory networks. These studies have helped provide an understanding of cellular pathways and pro- cesses at a global and systems level. Ó 2009 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved. 1. Introduction Systems Biology represents the study of entire pathways, pro- cesses or even organismal interactions usually at a molecular level. This larger analysis has allowed elucidation of basic principles that may not be apparent at the level of individual components. Such information is expected to be valuable to understanding how an entire system operates. 2. A paradigm shift Until the early 1990s nearly all biological research was focused on the detailed study of individual components in which proteins or genes were studied one at a time, a very laborious and ineffi- cient process. This approach shifted in the early and mid-1990s with several types of large-scale studies that allow the analysis of large numbers of components systematically and/or simulta- neously for the first time (Fig. 1 and Table 1). As the field matured, analysis moved from qualitative to quantitative experiments as a prerequisite to compare and draw meaningful conclusions from large datasets. The first study was our transposon tagging project which tagged a large number of yeast genes allowing for large- scale gene expression, protein localization, and disruption pheno- type analyzes as well as identification of unannotated sequences [1]. The second was DNA microarrays, which allowed the monitor- ing of gene expression of large numbers of, and ultimately all, known and potential genes within an organism [2]. Subsequently, studies to systematically disrupt gene function and biochemically characterize proteins on large-scale emerged (see Table 1). Each of these projects usually required development of a new method (RNAi for multicellular organisms; protein expression collections for biochemical assays). The vast majority of these projects were first pioneered in yeast because of (a) its relatively small genome and numbers of genes (approximately 6000), (b) the genome was one of the first se- quenced (completed in 1996; [3]) and (c) its facile genetics. Table 1 summarizes the different large-scale projects performed for yeast. Projects for characterizing gene expression, protein localiza- tion, gene disruption, transcription factor binding, biochemical studies and protein profiling have been carried out providing a wealth of information about each gene, and its corresponding RNA and protein in the cell. Perhaps most importantly, these stud- ies have generated valuable collections of tagged strains and mu- tants that have proven very valuable to the scientific community. The culture of sharing reagents and information within the yeast community has greatly enhanced the entire field and community. Subsequent to the launch of the different yeast studies, parallel projects have been performed for Caenorhabditis elegans, Drosoph- ila, Arabidopsis and vertebrates and enabled extensive gene charac- terization in those organisms [4–8]. Many of these have been performed as systematic efforts by consortia e.g. modENCODE to systematically analyze the genomic association of all transcription factors [9,10]. 0014-5793/$36.00 Ó 2009 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved. doi:10.1016/j.febslet.2009.11.011 * Corresponding author. E-mail address: [email protected] (M. Snyder). FEBS Letters 583 (2009) 3895–3899 journal homepage: www.FEBSLetters.org

Systems biology from a yeast omics perspective

Embed Size (px)

Citation preview

Page 1: Systems biology from a yeast omics perspective

FEBS Letters 583 (2009) 3895–3899

journal homepage: www.FEBSLetters .org

Review

Systems biology from a yeast omics perspective

Michael Snyder *, Jennifer E.G. GallagherDepartment of Genetics, 300 Pasteur Dr, Stanford University, Stanford, CA 94305, United States

a r t i c l e i n f o a b s t r a c t

Article history:Received 23 October 2009Revised 5 November 2009Accepted 5 November 2009Available online 11 November 2009

Edited by Stefan Hohmann

Keywords:Systems biologyGenomicsProteomicsTranscriptomePhosphorylomeSaccharomyces cerevisiae

0014-5793/$36.00 � 2009 Federation of European Biodoi:10.1016/j.febslet.2009.11.011

* Corresponding author.E-mail address: [email protected] (M. Snyd

Systems biology represents a paradigm shift from the study of individual genes, proteins or othercomponents to that of the analysis of entire pathways, cellular, developmental, or organismal pro-cesses. Large scale studies, primarily initiated in Saccharomyces cerevisiae, have allowed the identi-fication and characterization of components on an unprecedented level. Large scale interaction,transcription factor binding and phosphorylation data have enabled the elucidation of global regu-latory networks. These studies have helped provide an understanding of cellular pathways and pro-cesses at a global and systems level.� 2009 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

1. Introduction

Systems Biology represents the study of entire pathways, pro-cesses or even organismal interactions usually at a molecular level.This larger analysis has allowed elucidation of basic principles thatmay not be apparent at the level of individual components. Suchinformation is expected to be valuable to understanding how anentire system operates.

2. A paradigm shift

Until the early 1990s nearly all biological research was focusedon the detailed study of individual components in which proteinsor genes were studied one at a time, a very laborious and ineffi-cient process. This approach shifted in the early and mid-1990swith several types of large-scale studies that allow the analysisof large numbers of components systematically and/or simulta-neously for the first time (Fig. 1 and Table 1). As the field matured,analysis moved from qualitative to quantitative experiments as aprerequisite to compare and draw meaningful conclusions fromlarge datasets. The first study was our transposon tagging projectwhich tagged a large number of yeast genes allowing for large-scale gene expression, protein localization, and disruption pheno-type analyzes as well as identification of unannotated sequences[1]. The second was DNA microarrays, which allowed the monitor-

chemical Societies. Published by E

er).

ing of gene expression of large numbers of, and ultimately all,known and potential genes within an organism [2]. Subsequently,studies to systematically disrupt gene function and biochemicallycharacterize proteins on large-scale emerged (see Table 1). Eachof these projects usually required development of a new method(RNAi for multicellular organisms; protein expression collectionsfor biochemical assays).

The vast majority of these projects were first pioneered in yeastbecause of (a) its relatively small genome and numbers of genes(approximately 6000), (b) the genome was one of the first se-quenced (completed in 1996; [3]) and (c) its facile genetics. Table1 summarizes the different large-scale projects performed foryeast. Projects for characterizing gene expression, protein localiza-tion, gene disruption, transcription factor binding, biochemicalstudies and protein profiling have been carried out providing awealth of information about each gene, and its correspondingRNA and protein in the cell. Perhaps most importantly, these stud-ies have generated valuable collections of tagged strains and mu-tants that have proven very valuable to the scientific community.The culture of sharing reagents and information within the yeastcommunity has greatly enhanced the entire field and community.Subsequent to the launch of the different yeast studies, parallelprojects have been performed for Caenorhabditis elegans, Drosoph-ila, Arabidopsis and vertebrates and enabled extensive gene charac-terization in those organisms [4–8]. Many of these have beenperformed as systematic efforts by consortia e.g. modENCODE tosystematically analyze the genomic association of all transcriptionfactors [9,10].

lsevier B.V. All rights reserved.

Page 2: Systems biology from a yeast omics perspective

1994 1996 1998 2000 2002 2004 2006 2008

geno

me seq

uenc

ing fin

ished

trans

poso

n gen

e tag

ging:

protei

n

locali

zatio

n: ge

ne di

srupti

on

expre

ssion

micr

oarra

ys

geno

mic kn

ocko

ut co

llecti

on: m

etablo

me

geno

me-wide

GFP ta

gging

protei

n chip

s: glo

bal c

omple

x puri

ficati

on: tw

o-hyb

rid

synth

etic g

eneti

c arra

y: ChIP

-chip

phos

phory

lome:

Gateway

expre

ssion

colle

ction

:

E-MAPs

RNA-Seq

ChIP-S

eq

2010

protei

n loc

alozo

me

Fig. 1. Timeline of yeast projects.

Table 1Large scale yeast projects and methods.

Project/method Goal Reference

Transposon tagging Gene expression; protein localization; disruption phenotypes [1,28,55]DNA microarrays Gene expression [22]Systematic knockouts Phenotypes [56,57]Protein localization using directed tagging Subcellular protein localization [1,28,55,58]Biochemical protein characterization Biochemical activities [59,60]Protein microarrays Biochemical activities; protein modifications; interactions [21,45,60]Mass spectrometry Protein profiling; Quantitative protein levels [61–63]Protein–protein Interactions two hybrid Protein interaction maps; protein function prediction [14–16]Protein–protein interactions complex purification Protein interaction maps; protein function predication [19,20,64]Genetic interactions Global synthetic screens [37,39,65,66]Phosphorylation Kinase substrate map; large-scale phosophylation mapping [41,42,46,67]Other post-translational modifications Mapping of glycosylation; SUMO, acetylation, ubiquitination [68–72]

3896 M. Snyder, J.E.G. Gallagher / FEBS Letters 583 (2009) 3895–3899

These studies enabled assignment of functions to genes at a ra-pid pace. For example, when the yeast genome sequenced wascompleted in 1996 the function of two-thirds of the encoded geneswas not known [3]. Now 13 years later this figure has been reducedto 10% [11]. Through the integration of different types of large-scale data [12,13], some level of function can be inferred aboutmost yeast genes, although this information is by no means com-prehensive, and new function of proteins continue to emerge. Asa consequence of these various global studies, we also have a muchdeeper understanding of entire biological processes and pathways.

3. Analysis of networks and regulatory circuits

In conjunction with the advent of large-scale characterization ofgenes and proteins, came the large-scale analysis of their interac-tions and regulation. To date a number of interaction networkshave been generated including protein–protein interaction andtranscription factor binding to DNA. These global interaction stud-ies have help elucidate entire pathways as well as regulatory net-works controlling biological processes.

3.1. Protein–protein interactions

The first of the interaction studies were protein–protein inter-action projects. Initial research focused on high throughput yeasttwo-hybrid studies; these studies were initially incompletealthough a more recent study and related protein complementa-tion method generated much larger datasets [14–18]. Thesestudies generally identify direct interactions among protein. Sub-sequent to the initial two-hybrid studies large-scale studies usingaffinity purification of tagged proteins and identification of associ-

ated proteins by mass spectrometry were performed [19,20]. Thisapproach tends to identify members of a complex, usually at phys-iological levels in vivo and was a dramatic shift away from assign-ing protein function to pathways from protein–proteinassociations rather than classical mutant characterization. Smallerscale in vitro interaction studies using protein microarrays havealso been performed [21] and allowed computational detectionof non-canonical small molecule binding motifs. In general, theoverlap between the approaches is rather modest. This is partly be-cause these approaches themselves are incomplete and are rarelyperformed to saturation. Moreover, each method has its biasesand limitations and this likely also contributes to the observedincomplete overlap.

3.2. Transcription factors

A second major area for the analysis of regulatory networks isthe monitoring of gene expression using DNA microarrays. Expres-sion profiling has now been performed for a large number oforganisms and now thousands of microarray experiments havebeen performed for yeast, C. elegans, Drosophila, Arabidopsis, miceand humans [2,22–24]. More recent studies avoid DNA microarraysand involve direct sequencing of RNA (RNA-Seq; [25,26]), which ismuch more sensitive and accurate due to lack of cross hybridiza-tion [27]. These different studies have allowed the profiling of allannotated genes under diverse conditions, different tissues and/or different developmental stages. By correlating gene patterns, ithas been possible to determine which genes work together and of-ten identify common DNA sequence motifs in promoter regions.

Other large-scale studies have been performed to characterizetranscription factor binding sites. We have found that 27% of

Page 3: Systems biology from a yeast omics perspective

M. Snyder, J.E.G. Gallagher / FEBS Letters 583 (2009) 3895–3899 3897

epitope-tagged proteins localize to the nucleus and the majority ofthese exhibit punctuate patterns of staining of chromosomes usingmeiotic chromosome spreads [28]. Together with Dr. PatrickBrown’s laboratory, we invented the ChIP-chip method for large-scale identification of binding sites throughout the yeast genome[29,30]. This method involves immunoprecipitation of a transcrip-tion factor along with its associated DNA, followed by probing ofDNA microarrays containing genomic sequences. This method ledto several large-scale studies to map DNA binding sites of manyfactors during either vegetative growth or under different condi-tions [30–32]. A modification of the ChIP method is ChIP-Seq,which uses high-throughput DNA sequencing as its readout[33,34]. A version for yeast uses bar coding of samples so that largenumbers of samples can be analyzed at once [35]. ChIP-chip andChIP-Seq, when combined with gene expression studies, are a par-ticularly powerful method for dissecting regulatory circuits. Todate most of the global DNA binding studies have been performedat a single time point; however recent studies have demonstratedthat transcription factor binding sites can have vary temporallypresumably reflected the combinatorial effects of multiple bindingpartners [32]. After an initial challenge to yeast, changes in thetranscription factor binding can be graphed over time (Fig. 2). Pos-sible outcomes are no change, a rapid increase/decrease, a lag inchange transcription factor binding or a transient binding dissoci-ation of the transcription factor. Adding temporal component tomapping transcription factor binding allows a dynamic picture ofcellular response to environmental changes.

3.3. Genetic interactions

Genetic screens uncover interactions that are not necessarilyphysical, but rather indicate functional relationships between pro-teins at the pathway, cellular, or even organismal level. The mostcommon of these are synthetic lethal screen in which strains con-taining combinations of gene mutations in have a much more se-vere phenotype (typically death) than that of mutations inindividual genes [36]. The first screens systematically examinedthe effects of combining different null mutations for the ‘‘knockoutcollections” [37]. More recent studies using a collection of DAMPalleles, which contain hypomorphs of essential proteins [38], haverevealed synthetic interactions of essential genes. These types ofstudies have been extended to not only examine negative interac-tions (synthetic lethal interactions), but also positive interactions(suppression) between mutated genes. The analysis of multipleglobal screens can also reveal parallel pathways by clusteringinteractions by the similarities in of genetic interactions (E-MAPs)

Fig. 2. Types of temporal patterns of transcription factors binding to DNA. (A) Constitutivsignal). (C) Delayed binding (lag in binding and then no change in binding). (D) Trandissociation (immediate response and no reassociation). (F) Delayed dissociation (lag breassociation). (Reproduced from Ref. [32].)

that would otherwise be missed by analysis of only physical inter-actions [39,40].

3.4. Phosphorylome

A fourth area of global analysis is protein phosphorylation. Ini-tial estimates have suggested that 30% of proteins in yeast and hu-mans are phosphoproteins; although, this estimate is likely to be asignificant underestimate. However, from the limited number ofrecent studies, over 12 000 high confidence phosphorylation siteson �50% of yeast proteins have been identified in yeast using massspectrometry [41–44] and the number of phosphorylated sites isexpected to grow even higher as additional sites continue to bemapped. A significant challenge has been linking kinases to theirsubstrates: yeast have 122 putative kinases whereas humans have518 [45]. Thus far several methods have been employed for this. Tobegin to systematically identify 95 kinase substrates, we used pro-tein microarray containing 4400 yeast proteins to find in vitro tar-gets for 95 yeast kinases [46,47]. From the first 87 kinases, 4200substrates were identified, an average of 47 targets per kinases.Among the many interesting findings from this study was theobservation that closely related kinases such as the three proteinkinase A homologs, Tpk1, Tpk2 and Tpk3, each have different tar-gets. Likewise, the cyclin-dependent kinase, Pho85, phosphory-lated different targets when associated with different cyclins. Arelated approach has been to incubate purified kinases with GSTfusion proteins or to use modified kinases that only use a modifiedATP [48–50]. Another strategy has been to identify differences inphosphorylation patterns in the presence and absence of a proteinkinase [51]. Phosphoproteins are purified from cells lacking a pro-tein kinase and identified using mass spectrometry. Phosphoryla-tion is a dynamic post-translational modification crucial for rapidresponse in cell signaling pathways and mapping these sites canprovide connections between pathways.

3.5. Integration of datasets

Individually, these studies have provided a wealth of informa-tion for specific types of regulation. However, considerableemphasis is now being placed on integration of different datatypes. Initial studies integrated gene expression and transcriptionfactor binding data to determine the roles of transcription factors(e.g. positive or negative regulation). Moreover, studies have beenperformed to integrate additional data types such as protein–pro-tein interaction data, phosphorylation data [46], and even metab-olite data [52] into large ‘‘meta network” or ‘‘ridiculograms,”

e binding (no change). (B) Rapid binding (immediate response and no attenuation ofsient binding (immediate response then return to steady-state levels). (E) Rapidefore complete dissociation). (G) Transient dissociation (initial dissociation before

Page 4: Systems biology from a yeast omics perspective

Fig. 3. Simple motifs/modules found in complex regulatory networks. Seven different motifs of interactions described between kinates: kinases, their substrates andtranscription factors. (Reproduced from Ref. [53].)

3898 M. Snyder, J.E.G. Gallagher / FEBS Letters 583 (2009) 3895–3899

which are visually stunning networks containing extensive inter-actions. The combined data in these meta networks can besearched for motifs or modules that are overrepresented in thenetworks. Examples of overrepresented modules are shown inFig. 3, including the scaffold module, in which protein kinaseand their substrates each interacting with a third proteins andthe ‘‘ménage à trois” module in which kinases interact with twointeraction partners [46,53].

4. Interdisciplinary research

System Biology by its very nature operates at the interface ofBiology and Computational Biology. Biologists are typically respon-sible for collecting large data sets often involving many data pointsand computational biologist are typically responsible for both ini-tial scoring of results and more general analyses. The production ofthese dataset also provides materials to propose mathematicalmodels for pathways that can identify and/or predict key regula-tors. Furthermore, engineers are having a profound impact onreducing sample size therefore saving reagents and increasingthroughput. In some areas of systems biology, chemists and biolo-gist are designing small molecular probes or inhibitors to specificproteins or a class of proteins to probe function.

Regardless of the project, Systems Biology by necessity isbecoming more quantitative, both in data collecting and modelingpathways and biological processes. These models make predictionswhich in turn can be tested through additional experimentationwith respect to data collection as well as with regard to the mod-eling of biological pathways and processes.

5. Conclusion

Science today is different from that of 20 years ago. Whole gen-omes have been sequenced and the opportunity for comprehensiveanalysis of biological pathways has arrived. Not only is it possibleto uncover the biological components and events that occur, butthe ability to obtain a quantitative description is now possible. Thisinformation will be extremely valuable in medicine for under-standing how pathways go awry in human disease. Often commondiseases arise from complex interactions between genetics (poly-morphisms in coding and non-coding regions) influencing tran-scription and translation the proteome and the environment.Global analysis of molecular components and states during humandiseases is expected to be valuable for diagnostics and monitoringtreatments [54].

References

[1] Burns, N., Grimwade, B., Ross-Macdonald, P.B., Choi, E.Y., Finberg, K., Roeder,G.S. and Snyder, M. (1994) Large-scale analysis of gene expression, protein

localization, and gene disruption in Saccharomyces cerevisiae. Gene Dev. 8,1087–1105.

[2] Schena, M., Shalon, D., Davis, R.W. and Brown, P.O. (1995) Quantitativemonitoring of gene expression patterns with a complementary DNAmicroarray. Science 270, 467–470.

[3] Goffeau, A. et al. (1996) Life with 6000 genes. Science 274 (546), 563–567.[4] Fraser, A.G., Kamath, R.S., Zipperlen, P., Martinez-Campos, M., Sohrmann, M.

and Ahringer, J. (2000) Functional genomic analysis of C. elegans chromosome Iby systematic RNA interference. Nature 408, 325–330.

[5] Yamada, K. et al. (2003) Empirical analysis of transcriptional activity in theArabidopsis genome. Science 302, 842–846.

[6] Reboul, J. et al. (2001) Open-reading-frame sequence tags (OSTs) support theexistence of at least 17 300 genes in C. elegans. Nat. Genet. 27, 332–336.

[7] Bertone, P. et al. (2004) Global identification of human transcribed sequenceswith genome tiling arrays. Science 306, 2242–2246.

[8] Abdellah, Z. et al. (2004) Finishing the euchromatic sequence of the humangenome. Nature 431, 931–945.

[9] Celniker, S.E. et al. (2009) Unlocking the secrets of the genome. Nature 459,927–930.

[10] Birney, E. et al. (2007) Identification and analysis of functional elements in 1%of the human genome by the ENCODE pilot project. Nature 447, 799–816.

[11] Christie, K.R., Hong, E.L. and Cherry, J.M. (2009) Functional annotations for theSaccharomyces cerevisiae genome: the knowns and the known unknowns.Trends Microbiol. 17, 286–294.

[12] von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S.G., Fields, S. and Bork,P. (2002) Comparative assessment of large-scale data sets of protein–proteininteractions. Nature 417, 399–403.

[13] Jansen, R. et al. (2003) A Bayesian networks approach for predicting protein–protein interactions from genomic data. Science 302, 449–453.

[14] Uetz, P. et al. (2000) A comprehensive analysis of protein–protein interactionsin Saccharomyces cerevisiae. Nature 403, 623–627.

[15] Ito, T. et al. (2000) Toward a protein–protein interaction map of the buddingyeast: a comprehensive system to examine two-hybrid interactions in allpossible combinations between the yeast proteins. Proc. Natl. Acad. Sci. USA97, 1143–1147.

[16] Yu, H. et al. (2008) High-quality binary protein interaction map of the yeastinteractome network. Science 322, 104–110.

[17] Tarassov, K. et al. (2008) An in vivo map of the yeast protein interactome.Science 320, 1465–1470.

[18] Simonis, N. et al. (2009) Empirically controlled mapping of the Caenorhabditiselegans protein–protein interactome network. Nat. Meth. 6, 47–54.

[19] Gavin, A.C. et al. (2002) Functional organization of the yeast proteome bysystematic analysis of protein complexes. Nature 415, 141–147.

[20] Ho, Y. et al. (2002) Systematic identification of protein complexes inSaccharomyces cerevisiae by mass spectrometry. Nature 415, 180–183.

[21] Zhu, H. et al. (2001) Global analysis of protein activities using proteome chips.Science 293, 2101–2105.

[22] DeRisi, J.L., Iyer, V.R. and Brown, P.O. (1997) Exploring the metabolic andgenetic control of gene expression on a genomic scale. Science 278, 680–686.

[23] Kim, S.K. et al. (2001) A gene expression map for Caenorhabditis elegans.Science 293, 2087–2092.

[24] White, K.P., Rifkin, S.A., Hurban, P. and Hogness, D.S. (1999) Microarrayanalysis of Drosophila development during metamorphosis. Science 286,2179–2184.

[25] Nagalakshmi, U., Wang, Z., Waern, K., Shou, C., Raha, D., Gerstein, M. andSnyder, M. (2008) The transcriptional landscape of the yeast genome definedby RNA sequencing. Science 320, 1344–1349.

[26] Wilhelm, B.T. et al. (2008) Dynamic repertoire of a eukaryotic transcriptomesurveyed at single-nucleotide resolution. Nature 453, 1239–1243.

[27] Wang, Z., Gerstein, M. and Snyder, M. (2009) RNA-Seq: a revolutionary tool fortranscriptomics. Nat. Rev. Genet. 10, 57–63.

[28] Ross-Macdonald, P. et al. (1999) Large-scale analysis of the yeast genome bytransposon tagging and gene disruption. Nature 402, 413–418.

[29] Iyer, V.R., Horak, C.E., Scafe, C.S., Botstein, D., Snyder, M. and Brown, P.O.(2001) Genomic binding sites of the yeast cell-cycle transcription factors SBFand MBF. Nature 409, 533–538.

Page 5: Systems biology from a yeast omics perspective

M. Snyder, J.E.G. Gallagher / FEBS Letters 583 (2009) 3895–3899 3899

[30] Horak, C.E., Luscombe, N.M., Qian, J., Bertone, P., Piccirrillo, S., Gerstein, M. andSnyder, M. (2002) Complex transcriptional circuitry at the G1/S transition inSaccharomyces cerevisiae. Gene Dev. 16, 3017–3033.

[31] Harbison, C.T. et al. (2004) Transcriptional regulatory code of a eukaryoticgenome. Nature 431, 99–104.

[32] Ni, L., Bruce, C., Hart, C., Leigh-Bell, J., Gelperin, D., Umansky, L., Gerstein, M.B.and Snyder, M. (2009) Dynamic and complex transcription factor bindingduring an inducible response in yeast. Gene Dev. 23, 1351–1363.

[33] Johnson, D.S., Mortazavi, A., Myers, R.M. and Wold, B. (2007) Genome-widemapping of in vivo protein-DNA interactions. Science 316, 1497–1502.

[34] Robertson, G. et al. (2007) Genome-wide profiles of STAT1 DNA associationusing chromatin immunoprecipitation and massively parallel sequencing. Nat.Meth. 4, 651–657.

[35] Lefrancois, P., Euskirchen, G.M., Auerbach, R.K., Rozowsky, J., Gibson, T.,Yellman, C.M., Gerstein, M. and Snyder, M. (2009) Efficient yeast ChIP-Sequsing multiplex short-read DNA sequencing. BMC Genom. 10, 37.

[36] Bender, A. and Pringle, J.R. (1991) Use of a screen for synthetic lethal andmulticopy suppressee mutants to identify two new genes involved inmorphogenesis in Saccharomyces cerevisiae. Mol. Cell Biol. 11, 1295–1305.

[37] Tong, A.H. et al. (2001) Systematic genetic analysis with ordered arrays ofyeast deletion mutants. Science 294, 2364–2368.

[38] Breslow, D.K. et al. (2008) A comprehensive strategy enabling high-resolutionfunctional analysis of the yeast genome. Nat. Meth. 5, 711–718.

[39] Collins, S.R. et al. (2007) Functional dissection of protein complexes involvedin yeast chromosome biology using a genetic interaction map. Nature 446,806–810.

[40] Schuldiner, M. et al. (2005) Exploration of the function and organization of theyeast early secretory pathway through an epistatic miniarray profile. Cell 123,507–519.

[41] Gerber, S.A., Rush, J., Stemman, O., Kirschner, M.W. and Gygi, S.P. (2003)Absolute quantification of proteins and phosphoproteins from cell lysates bytandem MS. Proc. Natl. Acad. Sci. USA 100, 6940–6945.

[42] Ficarro, S.B., McCleland, M.L., Stukenberg, P.T., Burke, D.J., Ross, M.M.,Shabanowitz, J., Hunt, D.F. and White, F.M. (2002) Phosphoproteomeanalysis by mass spectrometry and its application to Saccharomycescerevisiae. Nat. Biotechnol. 20, 301–305.

[43] Li, X., Gerber, S.A., Rudner, A.D., Beausoleil, S.A., Haas, W., Villen, J., Elias, J.E.and Gygi, S.P. (2007) Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces cerevisiae. J. Proteome Res. 6, 1190–1197.

[44] Holt, L.J., Tuch, B.B., Villen, J., Johnson, A.D., Gygi, S.P. and Morgan, D.O. (2009)Global analysis of Cdk1 substrate phosphorylation sites provides insights intoevolution. Science 325, 1682–1686.

[45] Zhu, H. et al. (2000) Analysis of yeast protein kinases using protein chips. Nat.Genet. 26, 283–289.

[46] Ptacek, J. et al. (2005) Global analysis of protein phosphorylation in yeast.Nature 438, 679–684.

[47] Devgan, G.a.S.M. (2007) Kinase substrate identification using yeast proteinmicroarrays in: Functional Microarrays in Drug Discovery (Predki, P.F., Ed.),CRC, pp. 351–360.

[48] Loog, M. and Morgan, D.O. (2005) Cyclin specificity in the phosphorylation ofcyclin-dependent kinase substrates. Nature 434, 104–108.

[49] Holt, K. (2009) Identifying adaptive differences could provide insight. Nature458, 145.

[50] Ubersax, J.A., Woodbury, E.L., Quang, P.N., Paraz, M., Blethrow, J.D., Shah, K.,Shokat, K.M. and Morgan, D.O. (2003) Targets of the cyclin-dependent kinaseCdk1. Nature 425, 859–864.

[51] Huber, A., Bodenmiller, B., Uotila, A., Stahl, M., Wanka, S., Gerrits, B., Aebersold,R. and Loewith, R. (2009) Characterization of the rapamycin-sensitive

phosphoproteome reveals that Sch9 is a central coordinator of proteinsynthesis. Gene Dev. 23, 1929–1943.

[52] Li, X., Gianoulis, T., Yip K., Gerstein M. and Snyder M. (2009) Extensivemetabolite–protein interactions revealed by large-scale systematicanalysesed. Nature, Submitted for publication.

[53] Zhu, X., Gerstein, M. and Snyder, M. (2007) Getting connected: analysis andprinciples of biological networks. Gene Dev. 21, 1010–1024.

[54] Snyder, M., Weissman, S. and Gerstein, M. (2009) Personal phenotypes to gowith personal genomes. Mol. Syst. Biol. 5, 273.

[55] Kumar, A. et al. (2004) Large-scale mutagenesis of the yeast genome using aTn7-derived multipurpose transposon. Genome Res. 14, 1975–1986.

[56] Winzeler, E.A. et al. (1999) Functional characterization of the Saccharomycescerevisiae genome by gene deletion and parallel analysis. Science 285, 901–906.

[57] Giaever, G. et al. (2002) Functional profiling of the Saccharomyces cerevisiaegenome. Nature 418, 387–391.

[58] Huh, W.K., Falvo, J.V., Gerke, L.C., Carroll, A.S., Howson, R.W., Weissman, J.S.and O’Shea, E.K. (2003) Global analysis of protein localization in buddingyeast. Nature 425, 686–691.

[59] Martzen, M.R., McCraith, S.M., Spinelli, S.L., Torres, F.M., Fields, S., Grayhack,E.J. and Phizicky, E.M. (1999) A biochemical genomics approach for identifyinggenes by the activity of their products. Science 286, 1153–1155.

[60] Gelperin, D.M. et al. (2005) Biochemical and genetic analysis of the yeastproteome with a movable ORF collection. Gene Dev. 19, 2816–2826.

[61] Washburn, M.P., Wolters, D. and Yates 3rd, J.R. (2001) Large-scale analysis ofthe yeast proteome by multidimensional protein identification technology.Nat. Biotechnol. 19, 242–247.

[62] de Godoy, L.M., Olsen, J.V., Cox, J., Nielsen, M.L., Hubner, N.C., Frohlich, F.,Walther, T.C. and Mann, M. (2008) Comprehensive mass-spectrometry-basedproteome quantification of haploid versus diploid yeast. Nature 455, 1251–1254.

[63] Picotti, P., Bodenmiller, B., Mueller, L.N., Domon, B. and Aebersold, R. (2009)Full dynamic range proteome analysis of Saccharomyces cerevisiae by targetedproteomics. Cell 138, 795–806.

[64] Krogan, N.J. et al. (2006) Global landscape of protein complexes in the yeastSaccharomyces cerevisiae. Nature 440, 637–643.

[65] Pan, X., Ye, P., Yuan, D.S., Wang, X., Bader, J.S. and Boeke, J.D. (2006) A DNAintegrity network in the yeast Saccharomyces cerevisiae. Cell 124, 1069–1081.

[66] Tong, A.H. et al. (2004) Global mapping of the yeast genetic interactionnetwork. Science 303, 808–813.

[67] Gnad, F., de Godoy, L.M., Cox, J., Neuhauser, N., Ren, S., Olsen, J.V. and Mann, M.(2009) High-accuracy identification and bioinformatic analysis of in vivoprotein phosphorylation sites in yeast. Proteomics 9, 4642–4652.

[68] Kung, L.A., Tao, S.C., Qian, J., Smith, M.G., Snyder, M. and Zhu, H. (2009) Globalanalysis of the glycoproteome in Saccharomyces cerevisiae reveals new rolesfor protein glycosylation in eukaryotes. Mol.Syst.Biol. 5, 308.

[69] Wohlschlegel, J.A., Johnson, E.S., Reed, S.I. and Yates 3rd, J.R. (2004) Globalanalysis of protein sumoylation in Saccharomyces cerevisiae. J. Biol. Chem. 279,45662–45668.

[70] Peng, J. et al. (2003) A proteomics approach to understanding proteinubiquitination. Nat. Biotechnol. 21, 921–926.

[71] Choudhary, C., Kumar, C., Gnad, F., Nielsen, M.L., Rehman, M., Walther, T.C.,Olsen, J.V. and Mann, M. (2009) Lysine acetylation targets protein complexesand co-regulates major cellular functions. Science 325, 834–840.

[72] Denison, C., Rudner, A.D., Gerber, S.A., Bakalarski, C.E., Moazed, D. and Gygi,S.P. (2005) A proteomic strategy for gaining insights into protein sumoylationin yeast. Mol. Cell Proteomics 4, 246–254.