pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination
- PMID: 19429599
- DOI: 10.1093/bioinformatics/btp302
Abstract
Motivation: Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Improving the sensitivity and selectivity of methods designed for these two tasks therefore has downstream benefits for many other bioinformatics applications.
Results: We describe the latest implementation of the GenTHREADER method for structure prediction on a genomic scale. The method combines profile-profile alignments with secondary-structure-specific gap penalties and classic pair and solvation potentials, using a linear combination optimized with a regression SVM model. We find that this combination significantly improves both the detection of useful templates and the accuracy of sequence-structure alignments relative to other competitive approaches. We further present a second implementation of the protocol, designed for the task of discriminating superfamilies from one another. This method, pDomTHREADER, is the first to incorporate both sequence and structural data directly in this task, and it improves sensitivity and selectivity over the standard version of pGenTHREADER and three other standard methods for remote homology detection.
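The linear combination of scoring terms described in the Results can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names, the weights, the bias, and the example values below are all hypothetical placeholders, and the sign convention (negative weights for the two potentials, so that lower energies raise the score) is an assumption. In the published method the weights would come from the regression SVM; no training is performed here.

```python
from typing import Dict, List, Tuple

# Hypothetical weights for illustration only. In the described method the
# linear combination is optimized with a regression SVM; these numbers are
# placeholders, not values from the paper.
WEIGHTS: Dict[str, float] = {
    "profile_alignment": 0.6,     # profile-profile alignment score (higher is better)
    "pair_potential": -0.25,      # pair potential energy (assumed: lower is better)
    "solvation_potential": -0.15, # solvation potential energy (assumed: lower is better)
}
BIAS = 0.0  # hypothetical intercept


def combined_score(features: Dict[str, float]) -> float:
    """Return the weighted sum of the feature scores for one template."""
    return BIAS + sum(WEIGHTS[name] * features[name] for name in WEIGHTS)


def rank_templates(
    candidates: Dict[str, Dict[str, float]]
) -> List[Tuple[str, float]]:
    """Score every candidate template and sort from best to worst."""
    scored = [(tid, combined_score(feats)) for tid, feats in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up feature values for two hypothetical templates.
candidates = {
    "template_A": {
        "profile_alignment": 50.0,
        "pair_potential": -20.0,
        "solvation_potential": -4.0,
    },
    "template_B": {
        "profile_alignment": 30.0,
        "pair_potential": -5.0,
        "solvation_potential": -1.0,
    },
}

ranking = rank_templates(candidates)
for template_id, score in ranking:
    print(f"{template_id}: {score:.2f}")
```

With these placeholder values, template_A ranks above template_B because it has both the stronger alignment score and the more favorable potentials. A fitted model would replace the hand-set weights with learned ones.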