An integrated approach to the prediction of domain-domain interactions
- PMID: 16725050
- PMCID: PMC1481624
- DOI: 10.1186/1471-2105-7-269
Abstract
Background: The development of high-throughput technologies has produced several large-scale protein interaction data sets for multiple species, and significant efforts have been made to analyze these data sets in order to understand protein activities. Because the basic units of protein interactions are domain interactions, it is crucial to understand protein interactions at the level of domains. The availability of many diverse biological data sets provides an opportunity to discover the domain interactions underlying protein interactions by integrating these data sets.
Results: We combine protein interaction data sets from multiple species, molecular sequences, and gene ontology to construct a set of high-confidence domain-domain interactions. First, we propose a new measure, the expected number of interactions for each pair of domains, to score domain interactions based on protein interaction data in one species, and show that its performance is similar to that of the E-value defined by Riley et al. We apply the new measure to the protein interaction data sets from yeast, worm, fruit fly, and human. Second, information on pairs of domains that coexist in known proteins and on pairs of domains with the same gene ontology function annotations is incorporated to construct a high-confidence set of domain-domain interactions using a Bayesian approach. Finally, we evaluate the set of domain-domain interactions by comparing predicted domain interactions with those defined in the iPfam database, which were derived from protein structures. The accuracy of the predicted domain interactions is also confirmed by comparison with experimentally obtained domain interactions from H. pylori. As a result, a total of 2,391 high-confidence domain interactions are obtained, and these domain interactions are used to unravel detailed protein and domain interactions in several protein complexes.
Conclusion: Our study shows that integration of multiple biological data sets based on the Bayesian approach provides a reliable framework to predict domain interactions. By integrating multiple data sources, the coverage and accuracy of predicted domain interactions can be significantly increased.
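The abstract does not spell out how the Bayesian integration is computed. As a rough illustration only, the sketch below shows one common way to combine several evidence sources for a domain pair: multiplying per-source likelihood ratios into the prior odds, which assumes the sources are conditionally independent. This is not necessarily the authors' model; the function name `posterior_probability`, the prior, and the likelihood ratios are all hypothetical values chosen for the example.

```python
from math import prod


def posterior_probability(prior, likelihood_ratios):
    """Combine independent evidence sources for one domain pair.

    prior: assumed prior probability that the pair interacts (hypothetical).
    likelihood_ratios: one ratio per evidence source,
        P(evidence | interacting) / P(evidence | not interacting).
    Assumes the sources are conditionally independent (naive Bayes).
    """
    prior_odds = prior / (1.0 - prior)
    # Under independence, posterior odds = prior odds * product of ratios.
    posterior_odds = prior_odds * prod(likelihood_ratios)
    return posterior_odds / (1.0 + posterior_odds)


# Hypothetical ratios for three evidence types: a protein-interaction-based
# score, domain coexistence in known proteins, and shared function annotation.
example_ratios = [20.0, 3.0, 2.0]
example_posterior = posterior_probability(0.01, example_ratios)
```

With no evidence the posterior equals the prior, and each additional source with a ratio above 1 raises it. This is the sense in which integrating more data sources can increase both coverage and confidence.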