Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages
- PMID: 24489849
- PMCID: PMC3904977
- DOI: 10.1371/journal.pone.0087107
Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages
Abstract
Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).
Conflict of interest statement
Figures
Similar articles
-
STING Contacts: a web-based application for identification and analysis of amino acid contacts within protein structure and across protein interfaces.Bioinformatics. 2004 Sep 1;20(13):2145-7. doi: 10.1093/bioinformatics/bth203. Epub 2004 Apr 8. Bioinformatics. 2004. PMID: 15073001
-
STING Millennium: A web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence.Nucleic Acids Res. 2003 Jul 1;31(13):3386-92. doi: 10.1093/nar/gkg578. Nucleic Acids Res. 2003. PMID: 12824333 Free PMC article.
-
STING Report: convenient web-based application for graphic and tabular presentations of protein sequence, structure and function descriptors from the STING database.Nucleic Acids Res. 2005 Jan 1;33(Database issue):D269-74. doi: 10.1093/nar/gki111. Nucleic Acids Res. 2005. PMID: 15608194 Free PMC article.
-
Structural protein descriptors in 1-dimension and their sequence-based predictions.Curr Protein Pept Sci. 2011 Sep;12(6):470-89. doi: 10.2174/138920311796957711. Curr Protein Pept Sci. 2011. PMID: 21787299 Review.
-
Bridging the gap between structural bioinformatics and receptor research: the membrane-embedded, ligand-gated, P2X glycoprotein receptor.Curr Top Med Chem. 2004;4(16):1657-705. doi: 10.2174/1568026043387197. Curr Top Med Chem. 2004. PMID: 15579102 Review.
Cited by
-
A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites.Front Genet. 2021 Oct 26;12:752732. doi: 10.3389/fgene.2021.752732. eCollection 2021. Front Genet. 2021. PMID: 34764983 Free PMC article.
-
Study of specific nanoenvironments containing α-helices in all-α and (α+β)+(α/β) proteins.PLoS One. 2018 Jul 10;13(7):e0200018. doi: 10.1371/journal.pone.0200018. eCollection 2018. PLoS One. 2018. PMID: 29990352 Free PMC article.
-
Comparison of Algorithms for Prediction of Protein Structural Features from Evolutionary Data.PLoS One. 2016 Mar 10;11(3):e0150769. doi: 10.1371/journal.pone.0150769. eCollection 2016. PLoS One. 2016. PMID: 26963911 Free PMC article.
-
Algorithmic approaches to protein-protein interaction site prediction.Algorithms Mol Biol. 2015 Feb 15;10:7. doi: 10.1186/s13015-015-0033-9. eCollection 2015. Algorithms Mol Biol. 2015. PMID: 25713596 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials