Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.
GoGreen Membership

Site Tools

  • AAAS
  • Subscribe
  • Feedback

Site Search

Search Advanced

Science 17 October 2003:
Vol. 302. no. 5644, pp. 449 - 453
DOI: 10.1126/science.1087361

Reports

A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data

Ronald Jansen,1* Haiyuan Yu,1 Dov Greenbaum,1 Yuval Kluger,1 Nevan J. Krogan,4 Sambath Chung,1,2 Andrew Emili,4 Michael Snyder,2 Jack F. Greenblatt,4 Mark Gerstein1,3{dagger}

We have developed an approach using Bayesian networks to predict protein-protein interactions genome-wide in yeast. Our method naturally weights and combines into reliable predictions genomic features only weakly associated with interaction (e.g., messenger RNAcoexpression, coessentiality, and colocalization). In addition to de novo predictions, it can integrate often noisy, experimental interaction data sets. We observe that at given levels of sensitivity, our predictions are more accurate than the existing high-throughput experimental data sets. We validate our predictions with TAP (tandem affinity purification) tagging experiments. Our analysis, which gives a comprehensive view of yeast interactions, is available at genecensus.org/intint.

1 Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
2 Department of Molecular, Cellular and Developmental Biology, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
3 Department of Computer Science, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
4 Banting and Best Department of Medical Research, Department of Molecular and Medical Research, University of Toronto, Toronto, M5G 1L6, Ontario, Canada.

* Present address: Computational Biology Center, Memorial Sloan-Kettering Cancer Center, 307 West 63rd Street, New York, NY 10021, USA.

{dagger} To whom correspondence should be addressed. E-mail: mark.gerstein{at}yale.edu

Read the Full Text


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
VisANT: an integrative framework for networks in systems biology.
Z. Hu, E. S. Snitkin, and C. DeLisi (2008)
Brief Bioinform 9, 317-325
   Abstract »    Full Text »    PDF »
Discerning static and causal interactions in genome-wide reverse engineering problems.
M. Zampieri, N. Soranzo, and C. Altafini (2008)
Bioinformatics 24, 1510-1515
   Abstract »    PDF »
Assessing the functional structure of genomic data.
C. Huttenhower and O.G. Troyanskaya (2008)
Bioinformatics 24, i330-i338
   Abstract »    PDF »
Protein complex identification by supervised graph local clustering.
Y. Qi, F. Balem, C. Faloutsos, J. Klein-Seetharaman, and Z. Bar-Joseph (2008)
Bioinformatics 24, i250-i268
   Abstract »    PDF »
PRINCESS, a Protein Interaction Confidence Evaluation System with Multiple Data Sources.
D. Li, W. Liu, Z. Liu, J. Wang, Q. Liu, Y. Zhu, and F. He (2008)
Mol. Cell. Proteomics 7, 1043-1052
   Abstract »    Full Text »    PDF »
Network-guided genetic screening: building, testing and using gene networks to predict gene function.
B. Lehner and I. Lee (2008)
Brief Funct Genomic Proteomic 7, 217-227
   Abstract »    Full Text »    PDF »
The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site.
K. Tharakaraman, O. Bodenreider, D. Landsman, J. L. Spouge, and L. Marino-Ramirez (2008)
Nucleic Acids Res. 36, 2777-2786
   Abstract »    Full Text »    PDF »
A review on models and algorithms for motif discovery in protein-protein interaction networks.
G. Ciriello and C. Guerra (2008)
Brief Funct Genomic Proteomic
   Abstract »    Full Text »    PDF »
Protein networks in disease.
T. Ideker and R. Sharan (2008)
Genome Res. 18, 644-652
   Abstract »    Full Text »    PDF »
Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset.
J. Guo, X. Wu, D.-Y. Zhang, and K. Lin (2008)
Nucleic Acids Res. 36, 2002-2011
   Abstract »    Full Text »    PDF »
An assessment of the uses of homologous interactions.
R. Saeed and C. Deane (2008)
Bioinformatics 24, 689-695
   Abstract »    Full Text »    PDF »
Genome-wide B1 retrotransposon binds the transcription factors dioxin receptor and Slug and regulates gene expression in vivo.
A. C. Roman, D. A. Benitez, J. M. Carvajal-Gonzalez, and P. M. Fernandez-Salguero (2008)
PNAS 105, 1632-1637
   Abstract »    Full Text »    PDF »
AtPID: Arabidopsis thaliana protein interactome database an integrative platform for plant systems biology.
J. Cui, P. Li, G. Li, F. Xu, C. Zhao, Y. Li, Z. Yang, G. Wang, Q. Yu, Y. Li, et al. (2008)
Nucleic Acids Res. 36, D999-D1008
   Abstract »    Full Text »    PDF »
Gene Ontology annotations at SGD: new data sources and annotation methods.
E. L. Hong, R. Balakrishnan, Q. Dong, K. R. Christie, J. Park, G. Binkley, M. C. Costanzo, S. S. Dwight, S. R. Engel, D. G. Fisk, et al. (2008)
Nucleic Acids Res. 36, D577-D581
   Abstract »    Full Text »    PDF »
Host pathogen protein interactions predicted by comparative modeling.
F. P. Davis, D. T. Barkan, N. Eswar, J. H. McKerrow, and A. Sali (2007)
Protein Sci. 16, 2585-2596
   Abstract »    Full Text »    PDF »
Computational Prediction and Experimental Verification of the Gene Encoding the NAD+/NADP+-Dependent Succinate Semialdehyde Dehydrogenase in Escherichia coli.
T. Fuhrer, L. Chen, U. Sauer, and D. Vitkup (2007)
J. Bacteriol. 189, 8073-8078
   Abstract »    Full Text »    PDF »
Automated data integration for developmental biological research.
W. Zhong and P. W. Sternberg (2007)
Development 134, 3227-3238
   Abstract »    Full Text »    PDF »
Current progress in network research: toward reference networks for key model organisms.
B. S. Srinivasan, N. H. Shah, J. A. Flannick, E. Abeliuk, A. F. Novak, and S. Batzoglou (2007)
Brief Bioinform 8, 318-332
   Abstract »    Full Text »    PDF »
Context-sensitive data integration and prediction of biological networks.
C. L. Myers and O. G. Troyanskaya (2007)
Bioinformatics 23, 2322-2330
   Abstract »    Full Text »    PDF »
Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications.
H. Yu, R. Jansen, G. Stolovitzky, and M. Gerstein (2007)
Bioinformatics 23, 2163-2173
   Abstract »    Full Text »    PDF »
PDZ Domain Binding Selectivity Is Optimized Across the Mouse Proteome.
M. A. Stiffler, J. R. Chen, V. P. Grantcharova, Y. Lei, D. Fuchs, J. E. Allen, L. A. Zaslavskaia, and G. MacBeath (2007)
Science 317, 364-369
   Abstract »    Full Text »    PDF »
3D-partner: a web server to infer interacting partners and binding models.
Y.-C. Chen, Y.-S. Lo, W.-C. Hsu, and J.-M. Yang (2007)
Nucleic Acids Res. 35, W561-W567
   Abstract »    Full Text »    PDF »
Defining functional distance using manifold embeddings of gene ontology annotations.
G. Lerman and B. E. Shakhnovich (2007)
PNAS 104, 11334-11339
   Abstract »    Full Text »    PDF »
Supervised reconstruction of biological networks with local models.
K. Bleakley, G. Biau, and J.-P. Vert (2007)
Bioinformatics 23, i57-i65
   Abstract »    Full Text »    PDF »
Computational prediction of host-pathogen protein protein interactions.
M. D. Dyer, T. M. Murali, and B. W. Sobral (2007)
Bioinformatics 23, i159-i166
   Abstract »    Full Text »    PDF »
Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks.
W. K. Lim, K. Wang, C. Lefebvre, and A. Califano (2007)
Bioinformatics 23, i282-i288
   Abstract »    Full Text »    PDF »
Modelling genotype-phenotype relationships and human disease with genetic interaction networks.
B. Lehner (2007)
J. Exp. Biol. 210, 1559-1566
   Abstract »    Full Text »    PDF »
Getting connected: analysis and principles of biological networks.
X. Zhu, M. Gerstein, and M. Snyder (2007)
Genes & Dev. 21, 1010-1024
   Abstract »    Full Text »    PDF »
Bayesian methods in bioinformatics and computational systems biology.
D. J. Wilkinson (2007)
Brief Bioinform
   Abstract »    Full Text »    PDF »
Inferring genome-wide functional linkages in E. coli by combining improved genome context methods: Comparison with high-throughput experimental data.
S. Yellaboina, K. Goyal, and S. C. Mande (2007)
Genome Res. 17, 527-535
   Abstract »    Full Text »    PDF »
Comparison of human protein protein interaction maps.
M. E. Futschik, G. Chaurasia, and H. Herzel (2007)
Bioinformatics 23, 605-611
   Abstract »    Full Text »    PDF »
Transcriptional regulation of protein complexes within and across species.
K. Tan, T. Shlomi, H. Feizi, T. Ideker, and R. Sharan (2007)
PNAS 104, 1283-1288
   Abstract »    Full Text »    PDF »
CellCircuits: a database of protein network models.
H. C. Mak, M. Daly, B. Gruebel, and T. Ideker (2007)
Nucleic Acids Res. 35, D538-D545
   Abstract »    Full Text »    PDF »
Constrained models of evolution lead to improved prediction of functional linkage from correlated gain and loss of genes.
D. Barker, A. Meade, and M. Pagel (2007)
Bioinformatics 23, 14-20
   Abstract »    Full Text »    PDF »
Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights.
P. M. Kim, L. J. Lu, Y. Xia, and M. B. Gerstein (2006)
Science 314, 1938-1941
   Abstract »    Full Text »    PDF »
Colloquium Papers: Genomic analysis of the hierarchical structure of regulatory networks.
H. Yu and M. Gerstein (2006)
PNAS 103, 14724-14731
   Abstract »    Full Text »    PDF »
Colloquium Papers: Characterization and prediction of protein-protein interactions within and between complexes.
E. Sprinzak, Y. Altuvia, and H. Margalit (2006)
PNAS 103, 14718-14723
   Abstract »    Full Text »    PDF »
A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana.
J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith (2006)
Bioinformatics 22, 2037-2043
   Abstract »    Full Text »    PDF »
Protein complex compositions predicted by structural similarity.
F. P. Davis, H. Braberg, M.-Y. Shen, U. Pieper, A. Sali, and M.S. Madhusudhan (2006)
Nucleic Acids Res. 34, 2943-2952
   Abstract »    Full Text »    PDF »
Capturing expert knowledge with argumentation: a case study in bioinformatics.
B. R. Jefferys, L. A. Kelley, M. J. Sergot, J. Fox, and M. J. E. Sternberg (2006)
Bioinformatics 22, 924-933
   Abstract »    Full Text »    PDF »
Assessing semantic similarity measures for the characterization of human regulatory pathways.
X. Guo, R. Liu, C. D. Shriver, H. Hu, and M. N. Liebman (2006)
Bioinformatics 22, 967-973
   Abstract »    Full Text »    PDF »
Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale.
S. V. Date and C. J. Stoeckert Jr. (2006)
Genome Res. 16, 542-549
   Abstract »    Full Text »    PDF »
Hierarchical multi-label prediction of gene function.
Z. Barutcuoglu, R. E. Schapire, and O. G. Troyanskaya (2006)
Bioinformatics 22, 830-836
   Abstract »    Full Text »    PDF »
Predicting interactions in protein networks by completing defective cliques.
H. Yu, A. Paccanaro, V. Trifonov, and M. Gerstein (2006)
Bioinformatics 22, 823-829
   Abstract »    Full Text »    PDF »
The Effect of Multifunctionality on the Rate of Evolution in Yeast.
M. Salathe, M. Ackermann, and S. Bonhoeffer (2006)
Mol. Biol. Evol. 23, 721-722
   Abstract »    Full Text »    PDF »
Genome-wide prediction of C. elegans genetic interactions..
W. Zhong and P. W. Sternberg (2006)
Science 311, 1481-1484
   Abstract »    Full Text »    PDF »
Comprehensive Mutational Analysis of Yeast DEXD/H Box RNA Helicases Involved in Large Ribosomal Subunit Biogenesis.
K. A. Bernstein, S. Granneman, A. V. Lee, S. Manickam, and S. J. Baserga (2006)
Mol. Cell. Biol. 26, 1195-1208
   Abstract »    Full Text »    PDF »
Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations..
X. Wu, L. Zhu, J. Guo, D.-Y. Zhang, and K. Lin (2006)
Nucleic Acids Res. 34, 2137-2150
   Abstract »    Full Text »    PDF »
MPact: the MIPS protein interaction resource on yeast.
U. Guldener, M. Munsterkotter, M. Oesterheld, P. Pagel, A. Ruepp, H.-W. Mewes, and V. Stumpflen (2006)
Nucleic Acids Res. 34, D436-D441
   Abstract »    Full Text »    PDF »
Changing perspectives in yeast research nearly a decade after the genome sequence.
K. Dolinski and D. Botstein (2005)
Genome Res. 15, 1611-1619
   Abstract »    Full Text »    PDF »
A data integration methodology for systems biology.
D. Hwang, A. G. Rust, S. Ramsey, J. J. Smith, D. M. Leslie, A. D. Weston, P. de Atauri, J. D. Aitchison, L. Hood, A. F. Siegel, et al. (2005)
PNAS 102, 17296-17301
   Abstract »    Full Text »    PDF »
Interactome: gateway into systems biology.
M. E. Cusick, N. Klitgord, M. Vidal, and D. E. Hill (2005)
Hum. Mol. Genet. 14, R171-R181
   Abstract »    Full Text »    PDF »
Identifying cooperative transcriptional regulations using protein-protein interactions.
N. Nagamine, Y. Kawada, and Y. Sakakibara (2005)
Nucleic Acids Res. 33, 4828-4837
   Abstract »    Full Text »    PDF »
Large-scale identification of yeast integral membrane protein interactions.
J. P. Miller, R. S. Lo, A. Ben-Hur, C. Desmarais, I. Stagljar, W. S. Noble, and S. Fields (2005)
PNAS 102, 12123-12128
   Abstract »    Full Text »    PDF »
Inferring protein-protein interactions through high-throughput interaction data from diverse organisms.
Y. Liu, N. Liu, and H. Zhao (2005)
Bioinformatics 21, 3279-3285
   Abstract »    Full Text »    PDF »
A latent variable model for chemogenomic profiling.
P. Flaherty, G. Giaever, J. Kumm, M. I. Jordan, and A. P. Arkin (2005)
Bioinformatics 21, 3286-3293
   Abstract »    Full Text »    PDF »
Assessing the limits of genomic data integration for predicting protein networks.
L. J. Lu, Y. Xia, A. Paccanaro, H. Yu, and M. Gerstein (2005)
Genome Res. 15, 945-953
   Abstract »    Full Text »    PDF »
Mapping Molecular Networks Using Proteomics: A Vision for Patient-Tailored Combination Therapy.
E. F. Petricoin III, V. E. Bichsel, V. S. Calvert, V. Espina, M. Winters, L. Young, C. Belluco, B. J. Trock, M. Lippman, D. A. Fishman, et al. (2005)
J. Clin. Oncol. 23, 3614-3621
   Abstract »    Full Text »    PDF »
Prediction of functional modules based on comparative genome analysis and Gene Ontology application.
H. Wu, Z. Su, F. Mao, V. Olman, and Y. Xu (2005)
Nucleic Acids Res. 33, 2822-2837
   Abstract »    Full Text »    PDF »
Integrative data analysis for functional prediction: a multi-objective optimization approach.
F. Azuaje (2005)
Bioinformatics 21, 2099-2100
   Abstract »    Full Text »    PDF »
Force sensing and generation in cell phases: analyses of complex functions.
H.-G. Dobereiner, B. J. Dubin-Thaler, G. Giannone, and M. P. Sheetz (2005)
J Appl Physiol 98, 1542-1546
   Abstract »    Full Text »    PDF »
Predicting protein-protein interactions using signature products.
S. Martin, D. Roe, and J.-L. Faulon (2005)
Bioinformatics 21, 218-226
   Abstract »    Full Text »    PDF »
Predicting protein functions with message passing algorithms.
M. Leone and A. Pagnani (2005)
Bioinformatics 21, 239-247
   Abstract »    Full Text »    PDF »
Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.
Y. Chen and D. Xu (2004)
Nucleic Acids Res. 32, 6414-6424
   Abstract »    Full Text »    PDF »
A Probabilistic Functional Network of Yeast Genes.
I. Lee, S. V. Date, A. T. Adai, and E. M. Marcotte (2004)
Science 306, 1555-1558
   Abstract »    Full Text »    PDF »
Interaction Networks of the Molecular Machines That Decode, Replicate, and Maintain the Integrity of the Human Genome.
B. Coulombe, C. Jeronimo, M.-F. Langelier, M. Cojocaru, and D. Bergeron (2004)
Mol. Cell. Proteomics 3, 851-856
   Abstract »    Full Text »    PDF »
The NEF4 Complex Regulates Rad4 Levels and Utilizes Snf2/Swi2-Related ATPase Activity for Nucleotide Excision Repair.
K. L. Ramsey, J. J. Smith, A. Dasgupta, N. Maqani, P. Grant, and D. T. Auble (2004)
Mol. Cell. Biol. 24, 6362-6378
   Abstract »    Full Text »    PDF »
Coevolution of gene expression among interacting proteins.
H. B. Fraser, A. E. Hirsh, D. P. Wall, and M. B. Eisen (2004)
PNAS 101, 9033-9038
   Abstract »    Full Text »    PDF »
Annotation Transfer Between Genomes: Protein-Protein Interologs and Protein-DNA Regulogs.
H. Yu, N. M. Luscombe, H. X. Lu, X. Zhu, Y. Xia, J.-D. J. Han, N. Bertin, S. Chung, M. Vidal, and M. Gerstein (2004)
Genome Res. 14, 1107-1118
   Abstract »    Full Text »    PDF »
Predicting Protein Complex Membership Using Probabilistic Network Reliability.
S. Asthana, O. D. King, F. D. Gibbons, and F. P. Roth (2004)
Genome Res. 14, 1170-1175
   Abstract »    Full Text »    PDF »
Role of the cytoskeleton in signaling networks.
G. Forgacs, S. H. Yook, P. A. Janmey, H. Jeong, and C. G. Burd (2004)
J. Cell Sci. 117, 2769-2775
   Abstract »    Full Text »    PDF »



ADVERTISEMENT
Click Me!

ADVERTISEMENT
Click Me!

To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)