Statistical analysis of domains in interacting protein pairs
- PMID: 15509600
- DOI: 10.1093/bioinformatics/bti086
Statistical analysis of domains in interacting protein pairs
Abstract
Motivation: Several methods have recently been developed to analyse large-scale sets of physical interactions between proteins in terms of physical contacts between the constituent domains, often with a view to predicting new pairwise interactions. Our aim is to combine genomic interaction data, in which domain-domain contacts are not explicitly reported, with the domain-level structure of individual proteins, in order to learn about the structure of interacting protein pairs. Our approach is driven by the need to assess the evidence for physical contacts between domains in a statistically rigorous way.
Results: We develop a statistical approach that assigns p-values to pairs of domain superfamilies, measuring the strength of evidence within a set of protein interactions that domains from these superfamilies form contacts. A set of p-values is calculated for SCOP superfamily pairs, based on a pooled data set of interactions from yeast. These p-values can be used to predict which domains come into contact in an interacting protein pair. This predictive scheme is tested against protein complexes in the Protein Quaternary Structure (PQS) database, and is used to predict domain-domain contacts within 705 interacting protein pairs taken from our pooled data set.
Similar articles
-
PIBASE: a comprehensive database of structurally defined protein interfaces.Bioinformatics. 2005 May 1;21(9):1901-7. doi: 10.1093/bioinformatics/bti277. Epub 2005 Jan 18. Bioinformatics. 2005. PMID: 15657096
-
Structure-templated predictions of novel protein interactions from sequence information.PLoS Comput Biol. 2007 Sep;3(9):1783-9. doi: 10.1371/journal.pcbi.0030182. PLoS Comput Biol. 2007. PMID: 17892321 Free PMC article.
-
Bayesian methods for predicting interacting protein pairs using domain information.Biometrics. 2007 Sep;63(3):824-33. doi: 10.1111/j.1541-0420.2007.00755.x. Biometrics. 2007. PMID: 17825014
-
Predicting protein function from sequence and structural data.Curr Opin Struct Biol. 2005 Jun;15(3):275-84. doi: 10.1016/j.sbi.2005.04.003. Curr Opin Struct Biol. 2005. PMID: 15963890 Review.
-
Deciphering protein-protein interactions. Part II. Computational methods to predict protein and domain interaction partners.PLoS Comput Biol. 2007 Apr 27;3(4):e43. doi: 10.1371/journal.pcbi.0030043. PLoS Comput Biol. 2007. PMID: 17465672 Free PMC article. Review.
Cited by
-
Characterization of protein hubs by inferring interacting motifs from protein interactions.PLoS Comput Biol. 2007 Sep;3(9):1761-71. doi: 10.1371/journal.pcbi.0030178. Epub 2007 Jul 30. PLoS Comput Biol. 2007. PMID: 17941705 Free PMC article.
-
The effects of incomplete protein interaction data on structural and evolutionary inferences.BMC Biol. 2006 Nov 3;4:39. doi: 10.1186/1741-7007-4-39. BMC Biol. 2006. PMID: 17081312 Free PMC article.
-
Embryonic stem cell interactomics: the beginning of a long road to biological function.Stem Cell Rev Rep. 2012 Dec;8(4):1138-54. doi: 10.1007/s12015-012-9400-9. Stem Cell Rev Rep. 2012. PMID: 22847281 Review.
-
The role of exon shuffling in shaping protein-protein interaction networks.BMC Genomics. 2010 Dec 22;11 Suppl 5(Suppl 5):S11. doi: 10.1186/1471-2164-11-S5-S11. BMC Genomics. 2010. PMID: 21210967 Free PMC article.
-
Triangle network motifs predict complexes by complementing high-error interactomes with structural information.BMC Bioinformatics. 2009 Jun 27;10:196. doi: 10.1186/1471-2105-10-196. BMC Bioinformatics. 2009. PMID: 19558694 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases