How many potentially secreted proteins are contained in a bacterial genome?
- PMID: 10524242
- DOI: 10.1016/s0378-1119(99)00310-8
How many potentially secreted proteins are contained in a bacterial genome?
Abstract
Artificial neural networks were trained on the prediction of the subcellular location of bacterial proteins. A cross-validated average prediction accuracy of 93% was reached for distinction between cytoplasmic and non-cytoplasmic proteins, based on the analysis of protein amino-acid composition. Principal component analysis and self-organizing maps were used to create graphical representations of amino-acid sequence space. A clear separation of cytoplasmic, periplasmic, and extracellular proteins was observed. The neural network system was applied to predicting potentially secreted proteins in 15 complete genomes. For mesophile bacteria the predicted fractions of non-cytoplasmic proteins agree with previously published estimates, ranging between 15% and 30%. Characteristics of thermophile genomes might lead to an under-estimation of the fraction of secreted proteins by presently available prediction systems. A self-organizing map was constructed from all 15 bacterial genomes. This technique can reveal additional sequence features independent from exhaustive pair-wise sequence alignment. The Treponema pallidum and Mycobacterium tuberculosis data formed separate clusters indicating unusual characteristics of these genomes.
Similar articles
-
Identification of putative exported/secreted proteins in prokaryotic proteomes.Gene. 2001 May 16;269(1-2):195-204. doi: 10.1016/s0378-1119(01)00436-x. Gene. 2001. PMID: 11376951
-
Prediction of the subcellular location of prokaryotic proteins based on the hydrophobicity index of amino acids.Int J Biol Macromol. 2001 Mar 14;28(3):255-61. doi: 10.1016/s0141-8130(01)00121-0. Int J Biol Macromol. 2001. PMID: 11251233
-
Artificial neural network model for predicting protein subcellular location.Comput Chem. 2002 Jan;26(2):179-82. doi: 10.1016/s0097-8485(01)00106-1. Comput Chem. 2002. PMID: 11778941
-
Comparing genomes in terms of protein structure: surveys of a finite parts list.FEMS Microbiol Rev. 1998 Oct;22(4):277-304. doi: 10.1111/j.1574-6976.1998.tb00371.x. FEMS Microbiol Rev. 1998. PMID: 10357579 Review.
-
Methods for predicting bacterial protein subcellular localization.Nat Rev Microbiol. 2006 Oct;4(10):741-51. doi: 10.1038/nrmicro1494. Epub 2006 Sep 11. Nat Rev Microbiol. 2006. PMID: 16964270 Review.
Cited by
-
Economical evolution: microbes reduce the synthetic cost of extracellular proteins.mBio. 2010 Aug 24;1(3):e00131-10. doi: 10.1128/mBio.00131-10. mBio. 2010. PMID: 20824102 Free PMC article.
-
Archaeal signal peptides--a comparative survey at the genome level.Protein Sci. 2003 Sep;12(9):1833-43. doi: 10.1110/ps.03148703. Protein Sci. 2003. PMID: 12930983 Free PMC article.
-
Protein export systems of Mycobacterium tuberculosis: novel targets for drug development?Future Microbiol. 2010 Oct;5(10):1581-97. doi: 10.2217/fmb.10.112. Future Microbiol. 2010. PMID: 21073315 Free PMC article. Review.
-
Comparative genomics using data mining tools.J Biosci. 2002 Feb;27(1 Suppl 1):15-25. doi: 10.1007/BF02703680. J Biosci. 2002. PMID: 11927774
-
The Plasmodium export element revisited.PLoS One. 2008 Feb 6;3(2):e1560. doi: 10.1371/journal.pone.0001560. PLoS One. 2008. PMID: 18253504 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources