Optimizing long intrinsic disorder predictors with protein evolutionary information
- PMID: 15751111
- DOI: 10.1142/s0219720005000886
Optimizing long intrinsic disorder predictors with protein evolutionary information
Abstract
Protein existing as an ensemble of structures, called intrinsically disordered, has been shown to be responsible for a wide variety of biological functions and to be common in nature. Here we focus on improving sequence-based predictions of long (>30 amino acid residues) regions lacking specific 3-D structure by means of four new neural-network-based Predictors Of Natural Disordered Regions (PONDRs): VL3, VL3H, VL3P, and VL3E. PONDR VL3 used several features from a previously introduced PONDR VL2, but benefitted from optimized predictor models and a slightly larger (152 vs. 145) set of disordered proteins that were cleaned of mislabeling errors found in the smaller set. PONDR VL3H utilized homologues of the disordered proteins in the training stage, while PONDR VL3P used attributes derived from sequence profiles obtained by PSI-BLAST searches. The measure of accuracy was the average between accuracies on disordered and ordered protein regions. By this measure, the 30-fold cross-validation accuracies of VL3, VL3H, and VL3P were, respectively, 83.6 +/- 1.4%, 85.3 +/- 1.4%, and 85.2 +/- 1.5%. By combining VL3H and VL3P, the resulting PONDR VL3E achieved an accuracy of 86.7 +/- 1.4%. This is a significant improvement over our previous PONDRs VLXT (71.6 +/- 1.3%) and VL2 (80.9 +/- 1.4%). The new disorder predictors with the corresponding datasets are freely accessible through the web server at http://www.ist.temple.edu/disprot.
Similar articles
-
Length-dependent prediction of protein intrinsic disorder.BMC Bioinformatics. 2006 Apr 17;7:208. doi: 10.1186/1471-2105-7-208. BMC Bioinformatics. 2006. PMID: 16618368 Free PMC article.
-
FoldUnfold: web server for the prediction of disordered regions in protein chain.Bioinformatics. 2006 Dec 1;22(23):2948-9. doi: 10.1093/bioinformatics/btl504. Epub 2006 Oct 4. Bioinformatics. 2006. PMID: 17021161
-
PONDR-FIT: a meta-predictor of intrinsically disordered amino acids.Biochim Biophys Acta. 2010 Apr;1804(4):996-1010. doi: 10.1016/j.bbapap.2010.01.011. Epub 2010 Jan 25. Biochim Biophys Acta. 2010. PMID: 20100603 Free PMC article.
-
Predicting mostly disordered proteins by using structure-unknown protein data.BMC Bioinformatics. 2007 Mar 6;8:78. doi: 10.1186/1471-2105-8-78. BMC Bioinformatics. 2007. PMID: 17338828 Free PMC article.
-
Natively disordered proteins: functions and predictions.Appl Bioinformatics. 2004;3(2-3):105-13. doi: 10.2165/00822942-200403020-00005. Appl Bioinformatics. 2004. PMID: 15693736 Review.
Cited by
-
The Pathophysiological Significance of Fibulin-3.Biomolecules. 2020 Sep 8;10(9):1294. doi: 10.3390/biom10091294. Biomolecules. 2020. PMID: 32911658 Free PMC article. Review.
-
Role of the activation peptide in the mechanism of protein C activation.Sci Rep. 2020 Jul 6;10(1):11079. doi: 10.1038/s41598-020-68078-z. Sci Rep. 2020. PMID: 32632109 Free PMC article.
-
The mysterious unfoldome: structureless, underappreciated, yet vital part of any given proteome.J Biomed Biotechnol. 2010;2010:568068. doi: 10.1155/2010/568068. J Biomed Biotechnol. 2010. PMID: 20011072 Free PMC article. Review.
-
Exploring structural features and potential lipid interactions of Pseudomonas aeruginosa type three secretion effector PemB by spectroscopic and calorimetric experiments.Protein Sci. 2023 Apr;32(4):e4627. doi: 10.1002/pro.4627. Protein Sci. 2023. PMID: 36916835 Free PMC article.
-
Structural and functional analysis of "non-smelly" proteins.Cell Mol Life Sci. 2020 Jun;77(12):2423-2440. doi: 10.1007/s00018-019-03292-1. Epub 2019 Sep 5. Cell Mol Life Sci. 2020. PMID: 31486849 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials