Related Content
Search Google Scholar for:
More Information
Related Jobs from ScienceCareers
|
|
Science 17 October 2003: Vol. 302. no. 5644, pp. 449 - 453 DOI: 10.1126/science.1087361
|
|
Reports
A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data
Ronald Jansen,1*
Haiyuan Yu,1
Dov Greenbaum,1
Yuval Kluger,1
Nevan J. Krogan,4
Sambath Chung,1,2
Andrew Emili,4
Michael Snyder,2
Jack F. Greenblatt,4
Mark Gerstein1,3
We have developed an approach using Bayesian networks to predict protein-protein interactions genome-wide in yeast. Our method naturally weights and combines into reliable predictions genomic features only weakly associated with interaction (e.g., messenger RNAcoexpression, coessentiality, and colocalization). In addition to de novo predictions, it can integrate often noisy, experimental interaction data sets. We observe that at given levels of sensitivity, our predictions are more accurate than the existing high-throughput experimental data sets. We validate our predictions with TAP (tandem affinity purification) tagging experiments. Our analysis, which gives a comprehensive view of yeast interactions, is available at genecensus.org/intint.
1 Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
2 Department of Molecular, Cellular and Developmental Biology, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
3 Department of Computer Science, Yale University, 266 Whitney Avenue, Post Office Box 208114, New Haven, CT 06520, USA.
4 Banting and Best Department of Medical Research, Department of Molecular and Medical Research, University of Toronto, Toronto, M5G 1L6, Ontario, Canada.
* Present address: Computational Biology Center, Memorial Sloan-Kettering Cancer Center, 307 West 63rd Street, New York, NY 10021, USA.
To whom correspondence should be addressed. E-mail: mark.gerstein{at}yale.edu
Read the Full Text
THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
- VisANT: an integrative framework for networks in systems biology.
- Z. Hu, E. S. Snitkin, and C. DeLisi (2008)
Brief Bioinform
9, 317-325
| Abstract »
| Full Text »
| PDF »
- Discerning static and causal interactions in genome-wide reverse engineering problems.
- M. Zampieri, N. Soranzo, and C. Altafini (2008)
Bioinformatics
24, 1510-1515
| Abstract »
| PDF »
- Assessing the functional structure of genomic data.
- C. Huttenhower and O.G. Troyanskaya (2008)
Bioinformatics
24, i330-i338
| Abstract »
| PDF »
- Protein complex identification by supervised graph local clustering.
- Y. Qi, F. Balem, C. Faloutsos, J. Klein-Seetharaman, and Z. Bar-Joseph (2008)
Bioinformatics
24, i250-i268
| Abstract »
| PDF »
- PRINCESS, a Protein Interaction Confidence Evaluation System with Multiple Data Sources.
- D. Li, W. Liu, Z. Liu, J. Wang, Q. Liu, Y. Zhu, and F. He (2008)
Mol. Cell. Proteomics
7, 1043-1052
| Abstract »
| Full Text »
| PDF »
- Network-guided genetic screening: building, testing and using gene networks to predict gene function.
- B. Lehner and I. Lee (2008)
Brief Funct Genomic Proteomic
7, 217-227
| Abstract »
| Full Text »
| PDF »
- The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site.
- K. Tharakaraman, O. Bodenreider, D. Landsman, J. L. Spouge, and L. Marino-Ramirez (2008)
Nucleic Acids Res.
36, 2777-2786
| Abstract »
| Full Text »
| PDF »
- A review on models and algorithms for motif discovery in protein-protein interaction networks.
- G. Ciriello and C. Guerra (2008)
Brief Funct Genomic Proteomic
| Abstract »
| Full Text »
| PDF »
- Protein networks in disease.
- T. Ideker and R. Sharan (2008)
Genome Res.
18, 644-652
| Abstract »
| Full Text »
| PDF »
- Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset.
- J. Guo, X. Wu, D.-Y. Zhang, and K. Lin (2008)
Nucleic Acids Res.
36, 2002-2011
| Abstract »
| Full Text »
| PDF »
- An assessment of the uses of homologous interactions.
- R. Saeed and C. Deane (2008)
Bioinformatics
24, 689-695
| Abstract »
| Full Text »
| PDF »
- Genome-wide B1 retrotransposon binds the transcription factors dioxin receptor and Slug and regulates gene expression in vivo.
- A. C. Roman, D. A. Benitez, J. M. Carvajal-Gonzalez, and P. M. Fernandez-Salguero (2008)
PNAS
105, 1632-1637
| Abstract »
| Full Text »
| PDF »
- AtPID: Arabidopsis thaliana protein interactome database an integrative platform for plant systems biology.
- J. Cui, P. Li, G. Li, F. Xu, C. Zhao, Y. Li, Z. Yang, G. Wang, Q. Yu, Y. Li, et al. (2008)
Nucleic Acids Res.
36, D999-D1008
| Abstract »
| Full Text »
| PDF »
- Gene Ontology annotations at SGD: new data sources and annotation methods.
- E. L. Hong, R. Balakrishnan, Q. Dong, K. R. Christie, J. Park, G. Binkley, M. C. Costanzo, S. S. Dwight, S. R. Engel, D. G. Fisk, et al. (2008)
Nucleic Acids Res.
36, D577-D581
| Abstract »
| Full Text »
| PDF »
- Host pathogen protein interactions predicted by comparative modeling.
- F. P. Davis, D. T. Barkan, N. Eswar, J. H. McKerrow, and A. Sali (2007)
Protein Sci.
16, 2585-2596
| Abstract »
| Full Text »
| PDF »
- Computational Prediction and Experimental Verification of the Gene Encoding the NAD+/NADP+-Dependent Succinate Semialdehyde Dehydrogenase in Escherichia coli.
- T. Fuhrer, L. Chen, U. Sauer, and D. Vitkup (2007)
J. Bacteriol.
189, 8073-8078
| Abstract »
| Full Text »
| PDF »
- Automated data integration for developmental biological research.
- W. Zhong and P. W. Sternberg (2007)
Development
134, 3227-3238
| Abstract »
| Full Text »
| PDF »
- Current progress in network research: toward reference networks for key model organisms.
- B. S. Srinivasan, N. H. Shah, J. A. Flannick, E. Abeliuk, A. F. Novak, and S. Batzoglou (2007)
Brief Bioinform
8, 318-332
| Abstract »
| Full Text »
| PDF »
- Context-sensitive data integration and prediction of biological networks.
- C. L. Myers and O. G. Troyanskaya (2007)
Bioinformatics
23, 2322-2330
| Abstract »
| Full Text »
| PDF »
- Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications.
- H. Yu, R. Jansen, G. Stolovitzky, and M. Gerstein (2007)
Bioinformatics
23, 2163-2173
| Abstract »
| Full Text »
| PDF »
- PDZ Domain Binding Selectivity Is Optimized Across the Mouse Proteome.
- M. A. Stiffler, J. R. Chen, V. P. Grantcharova, Y. Lei, D. Fuchs, J. E. Allen, L. A. Zaslavskaia, and G. MacBeath (2007)
Science
317, 364-369
| Abstract »
| Full Text »
| PDF »
- 3D-partner: a web server to infer interacting partners and binding models.
- Y.-C. Chen, Y.-S. Lo, W.-C. Hsu, and J.-M. Yang (2007)
Nucleic Acids Res.
35, W561-W567
| Abstract »
| Full Text »
| PDF »
- Defining functional distance using manifold embeddings of gene ontology annotations.
- G. Lerman and B. E. Shakhnovich (2007)
PNAS
104, 11334-11339
| Abstract »
| Full Text »
| PDF »
- Supervised reconstruction of biological networks with local models.
- K. Bleakley, G. Biau, and J.-P. Vert (2007)
Bioinformatics
23, i57-i65
| Abstract »
| Full Text »
| PDF »
- Computational prediction of host-pathogen protein protein interactions.
- M. D. Dyer, T. M. Murali, and B. W. Sobral (2007)
Bioinformatics
23, i159-i166
| Abstract »
| Full Text »
| PDF »
- Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks.
- W. K. Lim, K. Wang, C. Lefebvre, and A. Califano (2007)
Bioinformatics
23, i282-i288
| Abstract »
| Full Text »
| PDF »
- Modelling genotype-phenotype relationships and human disease with genetic interaction networks.
- B. Lehner (2007)
J. Exp. Biol.
210, 1559-1566
| Abstract »
| Full Text »
| PDF »
- Getting connected: analysis and principles of biological networks.
- X. Zhu, M. Gerstein, and M. Snyder (2007)
Genes & Dev.
21, 1010-1024
| Abstract »
| Full Text »
| PDF »
- Bayesian methods in bioinformatics and computational systems biology.
- D. J. Wilkinson (2007)
Brief Bioinform
| Abstract »
| Full Text »
| PDF »
- Inferring genome-wide functional linkages in E. coli by combining improved genome context methods: Comparison with high-throughput experimental data.
- S. Yellaboina, K. Goyal, and S. C. Mande (2007)
Genome Res.
17, 527-535
| Abstract »
| Full Text »
| PDF »
- Comparison of human protein protein interaction maps.
- M. E. Futschik, G. Chaurasia, and H. Herzel (2007)
Bioinformatics
23, 605-611
| Abstract »
| Full Text »
| PDF »
- Transcriptional regulation of protein complexes within and across species.
- K. Tan, T. Shlomi, H. Feizi, T. Ideker, and R. Sharan (2007)
PNAS
104, 1283-1288
| Abstract »
| Full Text »
| PDF »
- CellCircuits: a database of protein network models.
- H. C. Mak, M. Daly, B. Gruebel, and T. Ideker (2007)
Nucleic Acids Res.
35, D538-D545
| Abstract »
| Full Text »
| PDF »
- Constrained models of evolution lead to improved prediction of functional linkage from correlated gain and loss of genes.
- D. Barker, A. Meade, and M. Pagel (2007)
Bioinformatics
23, 14-20
| Abstract »
| Full Text »
| PDF »
- Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights.
- P. M. Kim, L. J. Lu, Y. Xia, and M. B. Gerstein (2006)
Science
314, 1938-1941
| Abstract »
| Full Text »
| PDF »
- Colloquium Papers: Genomic analysis of the hierarchical structure of regulatory networks.
- H. Yu and M. Gerstein (2006)
PNAS
103, 14724-14731
| Abstract »
| Full Text »
| PDF »
- Colloquium Papers: Characterization and prediction of protein-protein interactions within and between complexes.
- E. Sprinzak, Y. Altuvia, and H. Margalit (2006)
PNAS
103, 14718-14723
| Abstract »
| Full Text »
| PDF »
- A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana.
- J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith (2006)
Bioinformatics
22, 2037-2043
| Abstract »
| Full Text »
| PDF »
- Protein complex compositions predicted by structural similarity.
- F. P. Davis, H. Braberg, M.-Y. Shen, U. Pieper, A. Sali, and M.S. Madhusudhan (2006)
Nucleic Acids Res.
34, 2943-2952
| Abstract »
| Full Text »
| PDF »
- Capturing expert knowledge with argumentation: a case study in bioinformatics.
- B. R. Jefferys, L. A. Kelley, M. J. Sergot, J. Fox, and M. J. E. Sternberg (2006)
Bioinformatics
22, 924-933
| Abstract »
| Full Text »
| PDF »
- Assessing semantic similarity measures for the characterization of human regulatory pathways.
- X. Guo, R. Liu, C. D. Shriver, H. Hu, and M. N. Liebman (2006)
Bioinformatics
22, 967-973
| Abstract »
| Full Text »
| PDF »
- Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale.
- S. V. Date and C. J. Stoeckert Jr. (2006)
Genome Res.
16, 542-549
| Abstract »
| Full Text »
| PDF »
- Hierarchical multi-label prediction of gene function.
- Z. Barutcuoglu, R. E. Schapire, and O. G. Troyanskaya (2006)
Bioinformatics
22, 830-836
| Abstract »
| Full Text »
| PDF »
- Predicting interactions in protein networks by completing defective cliques.
- H. Yu, A. Paccanaro, V. Trifonov, and M. Gerstein (2006)
Bioinformatics
22, 823-829
| Abstract »
| Full Text »
| PDF »
- The Effect of Multifunctionality on the Rate of Evolution in Yeast.
- M. Salathe, M. Ackermann, and S. Bonhoeffer (2006)
Mol. Biol. Evol.
23, 721-722
| Abstract »
| Full Text »
| PDF »
- Genome-wide prediction of C. elegans genetic interactions..
- W. Zhong and P. W. Sternberg (2006)
Science
311, 1481-1484
| Abstract »
| Full Text »
| PDF »
- Comprehensive Mutational Analysis of Yeast DEXD/H Box RNA Helicases Involved in Large Ribosomal Subunit Biogenesis.
- K. A. Bernstein, S. Granneman, A. V. Lee, S. Manickam, and S. J. Baserga (2006)
Mol. Cell. Biol.
26, 1195-1208
| Abstract »
| Full Text »
| PDF »
- Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations..
- X. Wu, L. Zhu, J. Guo, D.-Y. Zhang, and K. Lin (2006)
Nucleic Acids Res.
34, 2137-2150
| Abstract »
| Full Text »
| PDF »
- MPact: the MIPS protein interaction resource on yeast.
- U. Guldener, M. Munsterkotter, M. Oesterheld, P. Pagel, A. Ruepp, H.-W. Mewes, and V. Stumpflen (2006)
Nucleic Acids Res.
34, D436-D441
| Abstract »
| Full Text »
| PDF »
- Changing perspectives in yeast research nearly a decade after the genome sequence.
- K. Dolinski and D. Botstein (2005)
Genome Res.
15, 1611-1619
| Abstract »
| Full Text »
| PDF »
- A data integration methodology for systems biology.
- D. Hwang, A. G. Rust, S. Ramsey, J. J. Smith, D. M. Leslie, A. D. Weston, P. de Atauri, J. D. Aitchison, L. Hood, A. F. Siegel, et al. (2005)
PNAS
102, 17296-17301
| Abstract »
| Full Text »
| PDF »
- Interactome: gateway into systems biology.
- M. E. Cusick, N. Klitgord, M. Vidal, and D. E. Hill (2005)
Hum. Mol. Genet.
14, R171-R181
| Abstract »
| Full Text »
| PDF »
- Identifying cooperative transcriptional regulations using protein-protein interactions.
- N. Nagamine, Y. Kawada, and Y. Sakakibara (2005)
Nucleic Acids Res.
33, 4828-4837
| Abstract »
| Full Text »
| PDF »
- Large-scale identification of yeast integral membrane protein interactions.
- J. P. Miller, R. S. Lo, A. Ben-Hur, C. Desmarais, I. Stagljar, W. S. Noble, and S. Fields (2005)
PNAS
102, 12123-12128
| Abstract »
| Full Text »
| PDF »
- Inferring protein-protein interactions through high-throughput interaction data from diverse organisms.
- Y. Liu, N. Liu, and H. Zhao (2005)
Bioinformatics
21, 3279-3285
| Abstract »
| Full Text »
| PDF »
- A latent variable model for chemogenomic profiling.
- P. Flaherty, G. Giaever, J. Kumm, M. I. Jordan, and A. P. Arkin (2005)
Bioinformatics
21, 3286-3293
| Abstract »
| Full Text »
| PDF »
- Assessing the limits of genomic data integration for predicting protein networks.
- L. J. Lu, Y. Xia, A. Paccanaro, H. Yu, and M. Gerstein (2005)
Genome Res.
15, 945-953
| Abstract »
| Full Text »
| PDF »
- Mapping Molecular Networks Using Proteomics: A Vision for Patient-Tailored Combination Therapy.
- E. F. Petricoin III, V. E. Bichsel, V. S. Calvert, V. Espina, M. Winters, L. Young, C. Belluco, B. J. Trock, M. Lippman, D. A. Fishman, et al. (2005)
J. Clin. Oncol.
23, 3614-3621
| Abstract »
| Full Text »
| PDF »
- Prediction of functional modules based on comparative genome analysis and Gene Ontology application.
- H. Wu, Z. Su, F. Mao, V. Olman, and Y. Xu (2005)
Nucleic Acids Res.
33, 2822-2837
| Abstract »
| Full Text »
| PDF »
- Integrative data analysis for functional prediction: a multi-objective optimization approach.
- F. Azuaje (2005)
Bioinformatics
21, 2099-2100
| Abstract »
| Full Text »
| PDF »
- Force sensing and generation in cell phases: analyses of complex functions.
- H.-G. Dobereiner, B. J. Dubin-Thaler, G. Giannone, and M. P. Sheetz (2005)
J Appl Physiol
98, 1542-1546
| Abstract »
| Full Text »
| PDF »
- Predicting protein-protein interactions using signature products.
- S. Martin, D. Roe, and J.-L. Faulon (2005)
Bioinformatics
21, 218-226
| Abstract »
| Full Text »
| PDF »
- Predicting protein functions with message passing algorithms.
- M. Leone and A. Pagnani (2005)
Bioinformatics
21, 239-247
| Abstract »
| Full Text »
| PDF »
- Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.
- Y. Chen and D. Xu (2004)
Nucleic Acids Res.
32, 6414-6424
| Abstract »
| Full Text »
| PDF »
- A Probabilistic Functional Network of Yeast Genes.
- I. Lee, S. V. Date, A. T. Adai, and E. M. Marcotte (2004)
Science
306, 1555-1558
| Abstract »
| Full Text »
| PDF »
- Interaction Networks of the Molecular Machines That Decode, Replicate, and Maintain the Integrity of the Human Genome.
- B. Coulombe, C. Jeronimo, M.-F. Langelier, M. Cojocaru, and D. Bergeron (2004)
Mol. Cell. Proteomics
3, 851-856
| Abstract »
| Full Text »
| PDF »
- The NEF4 Complex Regulates Rad4 Levels and Utilizes Snf2/Swi2-Related ATPase Activity for Nucleotide Excision Repair.
- K. L. Ramsey, J. J. Smith, A. Dasgupta, N. Maqani, P. Grant, and D. T. Auble (2004)
Mol. Cell. Biol.
24, 6362-6378
| Abstract »
| Full Text »
| PDF »
- Coevolution of gene expression among interacting proteins.
- H. B. Fraser, A. E. Hirsh, D. P. Wall, and M. B. Eisen (2004)
PNAS
101, 9033-9038
| Abstract »
| Full Text »
| PDF »
- Annotation Transfer Between Genomes: Protein-Protein Interologs and Protein-DNA Regulogs.
- H. Yu, N. M. Luscombe, H. X. Lu, X. Zhu, Y. Xia, J.-D. J. Han, N. Bertin, S. Chung, M. Vidal, and M. Gerstein (2004)
Genome Res.
14, 1107-1118
| Abstract »
| Full Text »
| PDF »
- Predicting Protein Complex Membership Using Probabilistic Network Reliability.
- S. Asthana, O. D. King, F. D. Gibbons, and F. P. Roth (2004)
Genome Res.
14, 1170-1175
| Abstract »
| Full Text »
| PDF »
- Role of the cytoskeleton in signaling networks.
- G. Forgacs, S. H. Yook, P. A. Janmey, H. Jeong, and C. G. Burd (2004)
J. Cell Sci.
117, 2769-2775
| Abstract »
| Full Text »
| PDF »
|
|