Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.


Science 26 November 2004:
Vol. 306. no. 5701, pp. 1555 - 1558
DOI: 10.1126/science.1099511

Reports

A Probabilistic Functional Network of Yeast Genes

Insuk Lee,1 Shailesh V. Date,1* Alex T. Adai,1{dagger} Edward M. Marcotte1,2{ddagger}

A conceptual framework for integrating diverse functional genomics data was developed by reinterpreting experiments to provide numerical likelihoods that genes are functionally linked. This allows direct comparison and integration of different classes of data. The resulting probabilistic gene network estimates the functional coupling between genes. Within this framework, we reconstructed an extensive, high-quality functional gene network for Saccharomyces cerevisiae, consisting of 4681 (~81%) of the known yeast genes linked by ~34,000 probabilistic linkages comparable in accuracy to small-scale interaction assays. The integrated linkages distinguish true from false-positive interactions in earlier data sets; new interactions emerge from genes' network contexts, as shown for genes in chromatin modification and ribosome biogenesis.

1 Center for Systems and Synthetic Biology, Institute for Molecular Biology, University of Texas at Austin, Austin, TX 78712–1064, USA.
2 Department of Chemistry and Biochemistry, Institute for Molecular Biology, University of Texas at Austin, Austin, TX 78712–1064, USA.



* Present address: Center for Bioinformatics, 423 Guardian Drive, University of Pennsylvania, Philadelphia, PA 19104, USA.

{dagger} Present address: Mission Bay Genentech Hall, 600 16th Street, Suite N472D, University of California at San Francisco, San Francisco, CA 94143–2240, USA.

{ddagger} To whom correspondence should be addressed. E-mail: marcotte{at}icmb.utexas.edu

Read the Full Text


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
Gene divergence and pathway duplication in the metabolic network of yeast and digital organisms.
P. Gerlee, T. Lundh, B. Zhang, and A. R. A. Anderson (2009)
J R Soc Interface 6, 1233-1245
   Abstract »    Full Text »    PDF »
Mining gene functional networks to improve mass-spectrometry-based protein identification.
S. R. Ramakrishnan, C. Vogel, T. Kwon, L. O. Penalva, E. M. Marcotte, and D. P. Miranker (2009)
Bioinformatics 25, 2955-2961
   Abstract »    Full Text »    PDF »
Data integration for plant genomics--exemplars from the integration of Arabidopsis thaliana databases.
A. Lysenko, M. M. Hindle, J. Taubert, M. Saqi, and C. J. Rawlings (2009)
Brief Bioinform 10, 676-693
   Abstract »    Full Text »    PDF »
Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis.
K. Kinoshita and T. Obayashi (2009)
Bioinformatics 25, 2677-2684
   Abstract »    Full Text »    PDF »
Detection of Mammalian Virulence Determinants in Highly Pathogenic Avian Influenza H5N1 Viruses: Multivariate Analysis of Published Data.
S. J. Lycett, M. J. Ward, F. I. Lewis, A. F. Y. Poon, S. L. Kosakovsky Pond, and A. J. L. Brown (2009)
J. Virol. 83, 9901-9910
   Abstract »    Full Text »    PDF »
Predicting eukaryotic transcriptional cooperativity by Bayesian network integration of genome-wide data.
Y. Wang, X.-S. Zhang, and Y. Xia (2009)
Nucleic Acids Res. 37, 5943-5958
   Abstract »    Full Text »    PDF »
Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.
M. A. Grobei, E. Qeli, E. Brunner, H. Rehrauer, R. Zhang, B. Roschitzki, K. Basler, C. H. Ahrens, and U. Grossniklaus (2009)
Genome Res. 19, 1786-1800
   Abstract »    Full Text »    PDF »
The impact of incomplete knowledge on evaluation: an experimental benchmark for protein function prediction.
C. Huttenhower, M. A. Hibbs, C. L. Myers, A. A. Caudy, D. C. Hess, and O. G. Troyanskaya (2009)
Bioinformatics 25, 2404-2410
   Abstract »    Full Text »    PDF »
Global networks of functional coupling in eukaryotes from comprehensive data integration.
A. Alexeyenko and E. L.L. Sonnhammer (2009)
Genome Res. 19, 1107-1116
   Abstract »    Full Text »    PDF »
Exploring the human genome with functional maps.
C. Huttenhower, E. M. Haley, M. A. Hibbs, V. Dumeaux, D. R. Barrett, H. A. Coller, and O. G. Troyanskaya (2009)
Genome Res. 19, 1093-1106
   Abstract »    Full Text »    PDF »
Network-based multiple locus linkage analysis of expression traits.
W. Pan (2009)
Bioinformatics 25, 1390-1396
   Abstract »    Full Text »    PDF »
Human microRNAs co-silence in well-separated groups and have different predicted essentialities.
G. Boross, K. Orosz, and I. J. Farkas (2009)
Bioinformatics 25, 1063-1069
   Abstract »    Full Text »    PDF »
Combining multiple positive training sets to generate confidence scores for protein-protein interactions.
J. Yu and R. L. Finley Jr (2009)
Bioinformatics 25, 105-111
   Abstract »    Full Text »    PDF »
Functional coherence in domain interaction networks.
J. Pandey, M. Koyuturk, S. Subramaniam, and A. Grama (2008)
Bioinformatics 24, i28-i34
   Abstract »    Full Text »    PDF »
Assessing the functional structure of genomic data.
C. Huttenhower and O.G. Troyanskaya (2008)
Bioinformatics 24, i330-i338
   Abstract »    Full Text »    PDF »
Network-guided genetic screening: building, testing and using gene networks to predict gene function.
B. Lehner and I. Lee (2008)
Brief Funct Genomic Proteomic 7, 217-227
   Abstract »    Full Text »    PDF »
Protein networks in disease.
T. Ideker and R. Sharan (2008)
Genome Res. 18, 644-652
   Abstract »    Full Text »    PDF »
Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution.
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen (2008)
J R Soc Interface 5, 151-170
   Abstract »    Full Text »    PDF »
Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model.
P. Wei and W. Pan (2008)
Bioinformatics 24, 404-411
   Abstract »    Full Text »    PDF »
Bacteriome.org an integrated protein interaction database for E. coli.
C. Su, J. M. Peregrin-Alvarez, G. Butland, S. Phanse, V. Fong, A. Emili, and J. Parkinson (2008)
Nucleic Acids Res. 36, D632-D636
   Abstract »    Full Text »    PDF »
Gene Ontology annotations at SGD: new data sources and annotation methods.
E. L. Hong, R. Balakrishnan, Q. Dong, K. R. Christie, J. Park, G. Binkley, M. C. Costanzo, S. S. Dwight, S. R. Engel, D. G. Fisk, et al. (2008)
Nucleic Acids Res. 36, D577-D581
   Abstract »    Full Text »    PDF »
An efficient strategy for extensive integration of diverse biological data for protein function prediction.
H. N. Chua, W.-K. Sung, and L. Wong (2007)
Bioinformatics 23, 3364-3373
   Abstract »    Full Text »    PDF »
Graph sharpening plus graph integration: a synergy that improves protein functional classification.
H. Shin, A. M. Lisewski, and O. Lichtarge (2007)
Bioinformatics 23, 3217-3224
   Abstract »    Full Text »    PDF »
Network Inference, Analysis, and Modeling in Systems Biology.
R. Albert (2007)
PLANT CELL 19, 3327-3338
   Full Text »    PDF »
Automated data integration for developmental biological research.
W. Zhong and P. W. Sternberg (2007)
Development 134, 3227-3238
   Abstract »    Full Text »    PDF »
Current progress in network research: toward reference networks for key model organisms.
B. S. Srinivasan, N. H. Shah, J. A. Flannick, E. Abeliuk, A. F. Novak, and S. Batzoglou (2007)
Brief Bioinform 8, 318-332
   Abstract »    Full Text »    PDF »
Context-sensitive data integration and prediction of biological networks.
C. L. Myers and O. G. Troyanskaya (2007)
Bioinformatics 23, 2322-2330
   Abstract »    Full Text »    PDF »
Computational modeling of Caenorhabditis elegans vulval induction.
X. Sun and P. Hong (2007)
Bioinformatics 23, i499-i507
   Abstract »    Full Text »    PDF »
Alignment of molecular networks by integer quadratic programming.
Z. Li, S. Zhang, Y. Wang, X.-S. Zhang, and L. Chen (2007)
Bioinformatics 23, 1631-1639
   Abstract »    Full Text »    PDF »
Modelling genotype-phenotype relationships and human disease with genetic interaction networks.
B. Lehner (2007)
J. Exp. Biol. 210, 1559-1566
   Abstract »    Full Text »    PDF »
Bayesian methods in bioinformatics and computational systems biology.
D. J. Wilkinson (2007)
Brief Bioinform
   Abstract »    Full Text »    PDF »
Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae.
S. R. Collins, P. Kemmeren, X.-C. Zhao, J. F. Greenblatt, F. Spencer, F. C. P. Holstege, J. S. Weissman, and N. J. Krogan (2007)
Mol. Cell. Proteomics 6, 439-450
   Abstract »    Full Text »    PDF »
Comparison of human protein protein interaction maps.
M. E. Futschik, G. Chaurasia, and H. Herzel (2007)
Bioinformatics 23, 605-611
   Abstract »    Full Text »    PDF »
Transcriptional regulation of protein complexes within and across species.
K. Tan, T. Shlomi, H. Feizi, T. Ideker, and R. Sharan (2007)
PNAS 104, 1283-1288
   Abstract »    Full Text »    PDF »
CellCircuits: a database of protein network models.
H. C. Mak, M. Daly, B. Gruebel, and T. Ideker (2007)
Nucleic Acids Res. 35, D538-D545
   Abstract »    Full Text »    PDF »
Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights.
P. M. Kim, L. J. Lu, Y. Xia, and M. B. Gerstein (2006)
Science 314, 1938-1941
   Abstract »    Full Text »    PDF »
A scalable method for integration and functional analysis of multiple microarray datasets.
C. Huttenhower, M. Hibbs, C. Myers, and O. G. Troyanskaya (2006)
Bioinformatics 22, 2890-2897
   Abstract »    Full Text »    PDF »
Systems interface biology.
F. J Doyle III and J. Stelling (2006)
J R Soc Interface 3, 603-616
   Abstract »    Full Text »    PDF »
Graemlin: General and robust alignment of multiple large interaction networks.
J. Flannick, A. Novak, B. S. Srinivasan, H. H. McAdams, and S. Batzoglou (2006)
Genome Res. 16, 1169-1181
   Abstract »    Full Text »    PDF »
A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana.
J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith (2006)
Bioinformatics 22, 2037-2043
   Abstract »    Full Text »    PDF »
A molecular-properties-based approach to understanding PDZ domain proteins and PDZ ligands.
C. Giallourakis, Z. Cao, T. Green, H. Wachtel, X. Xie, M. Lopez-Illasaca, M. Daly, J. Rioux, and R. Xavier (2006)
Genome Res. 16, 1056-1072
   Abstract »    Full Text »    PDF »
VIRGO: computational prediction of gene functions..
N. Massjouni, C. G. Rivera, and T. M. Murali (2006)
Nucleic Acids Res. 34, W340-W344
   Abstract »    Full Text »    PDF »
Protein complex compositions predicted by structural similarity.
F. P. Davis, H. Braberg, M.-Y. Shen, U. Pieper, A. Sali, and M.S. Madhusudhan (2006)
Nucleic Acids Res. 34, 2943-2952
   Abstract »    Full Text »    PDF »
Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale.
S. V. Date and C. J. Stoeckert Jr. (2006)
Genome Res. 16, 542-549
   Abstract »    Full Text »    PDF »
Modelling in molecular biology: describing transcription regulatory networks at different scales.
T. Schlitt and A. Brazma (2006)
Phil Trans R Soc B 361, 483-494
   Abstract »    Full Text »    PDF »
Genome-wide prediction of C. elegans genetic interactions..
W. Zhong and P. W. Sternberg (2006)
Science 311, 1481-1484
   Abstract »    Full Text »    PDF »
The Splicing Factor Prp43p, a DEAH Box ATPase, Functions in Ribosome Biogenesis.
N. B. Leeds, E. C. Small, S. L. Hiley, T. R. Hughes, and J. P. Staley (2006)
Mol. Cell. Biol. 26, 513-522
   Abstract »    Full Text »    PDF »
Prp43p Is a DEAH-Box Spliceosome Disassembly Factor Essential for Ribosome Biogenesis.
D. J. Combs, R. J. Nagel, M. Ares Jr., and S. W. Stevens (2006)
Mol. Cell. Biol. 26, 523-534
   Abstract »    Full Text »    PDF »
Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations..
X. Wu, L. Zhu, J. Guo, D.-Y. Zhang, and K. Lin (2006)
Nucleic Acids Res. 34, 2137-2150
   Abstract »    Full Text »    PDF »
Inferring functional information from domain co-evolution.
Y. Kim, M. Koyuturk, U. Topkara, A. Grama, and S. Subramaniam (2006)
Bioinformatics 22, 40-49
   Abstract »    Full Text »    PDF »
Changing perspectives in yeast research nearly a decade after the genome sequence.
K. Dolinski and D. Botstein (2005)
Genome Res. 15, 1611-1619
   Abstract »    Full Text »    PDF »
The Splicing ATPase Prp43p Is a Component of Multiple Preribosomal Particles.
S. Lebaron, C. Froment, M. Fromont-Racine, J.-C. Rain, B. Monsarrat, M. Caizergues-Ferrer, and Y. Henry (2005)
Mol. Cell. Biol. 25, 9269-9282
   Abstract »    Full Text »    PDF »
Assessing the limits of genomic data integration for predicting protein networks.
L. J. Lu, Y. Xia, A. Paccanaro, H. Yu, and M. Gerstein (2005)
Genome Res. 15, 945-953
   Abstract »    Full Text »    PDF »
Extension of the Visualization Tool MapMan to Allow Statistical Analysis of Arrays, Display of Coresponding Genes, and Comparison with Known Responses.
B. Usadel, A. Nagel, O. Thimm, H. Redestig, O. E. Blaesing, N. Palacios-Rojas, J. Selbig, J. Hannemann, M. C. Piques, D. Steinhauser, et al. (2005)
Plant Physiology 138, 1195-1204
   Abstract »    Full Text »    PDF »
Prediction of functional modules based on comparative genome analysis and Gene Ontology application.
H. Wu, Z. Su, F. Mao, V. Olman, and Y. Xu (2005)
Nucleic Acids Res. 33, 2822-2837
   Abstract »    Full Text »    PDF »
Uncovering transcriptional regulation of metabolism by using metabolic network topology.
K. R. Patil and J. Nielsen (2005)
PNAS 102, 2685-2689
   Abstract »    Full Text »    PDF »



To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)