A knowledge base for predicting protein localization sites in eukaryotic cells
- PMID: 1478671
- PMCID: PMC7134799
- DOI: 10.1016/s0888-7543(05)80111-9
A knowledge base for predicting protein localization sites in eukaryotic cells
Abstract
To automate examination of massive amounts of sequence data for biological function, it is important to computerize interpretation based on empirical knowledge of sequence-function relationships. For this purpose, we have been constructing a knowledge base by organizing various experimental and computational observations as a collection of if-then rules. Here we report an expert system, which utilizes this knowledge base, for predicting localization sites of proteins only from the information on the amino acid sequence and the source origin. We collected data for 401 eukaryotic proteins with known localization sites (subcellular and extracellular) and divided them into training data and testing data. Fourteen localization sites were distinguished for animal cells and 17 for plant cells. When sorting signals were not well characterized experimentally, various sequence features were computationally derived from the training data. It was found that 66% of the training data and 59% of the testing data were correctly predicted by our expert system. This artificial intelligence approach is powerful and flexible enough to be used in genome analyses.
Similar articles
-
Expert system for predicting protein localization sites in gram-negative bacteria.Proteins. 1991;11(2):95-110. doi: 10.1002/prot.340110203. Proteins. 1991. PMID: 1946347
-
Protein subcellular localization prediction using artificial intelligence technology.Methods Mol Biol. 2008;484:435-63. doi: 10.1007/978-1-59745-398-1_27. Methods Mol Biol. 2008. PMID: 18592195
-
SubCellProt: predicting protein subcellular localization using machine learning approaches.In Silico Biol. 2009;9(1-2):35-44. In Silico Biol. 2009. PMID: 19537160
-
Predicting Subcellular Localization of Proteins by Bioinformatic Algorithms.Curr Top Microbiol Immunol. 2017;404:129-158. doi: 10.1007/82_2015_5006. Curr Top Microbiol Immunol. 2017. PMID: 26728066 Review.
-
An overview on predicting the subcellular location of a protein.In Silico Biol. 2002;2(3):291-303. In Silico Biol. 2002. PMID: 12542414 Review.
Cited by
-
Prediction of mitochondrial proteins using discrete wavelet transform.Protein J. 2006 Jun;25(4):241-9. doi: 10.1007/s10930-006-9007-6. Protein J. 2006. PMID: 16703470
-
A RING-H2 zinc-finger protein gene RIE1 is essential for seed development in Arabidopsis.Plant Mol Biol. 2003 Sep;53(1-2):37-50. doi: 10.1023/b:plan.0000009256.01620.a6. Plant Mol Biol. 2003. PMID: 14756305
-
Cohen syndrome is caused by mutations in a novel gene, COH1, encoding a transmembrane protein with a presumed role in vesicle-mediated sorting and intracellular protein transport.Am J Hum Genet. 2003 Jun;72(6):1359-69. doi: 10.1086/375454. Epub 2003 May 2. Am J Hum Genet. 2003. PMID: 12730828 Free PMC article.
-
Ninjurin2, a novel homophilic adhesion molecule, is expressed in mature sensory and enteric neurons and promotes neurite outgrowth.J Neurosci. 2000 Jan 1;20(1):187-95. doi: 10.1523/JNEUROSCI.20-01-00187.2000. J Neurosci. 2000. PMID: 10627596 Free PMC article.
-
pH signaling in Sclerotinia sclerotiorum: identification of a pacC/RIM1 homolog.Appl Environ Microbiol. 2001 Jan;67(1):75-81. doi: 10.1128/AEM.67.1.75-81.2001. Appl Environ Microbiol. 2001. PMID: 11133430 Free PMC article.
References
-
- Adams M.D., Dubnick M., Kerlavage A.R., Moreno R., Kelley J.M., Utterback T.R., Nagle J.W., Fields C., Venter J.C. Sequence identification of 2,375 human brain genes. Nature. 1992;355:632–634. - PubMed
-
- Baker K.P., Schatz G. Mitochondrial proteins essential for viability mediate protein import into yeast mitochondria. Nature. 1991;349:205–208. - PubMed
-
- Baranski T.J., Faust P.L., Kornfeld S. Generation of a lysosomal enzyme targeting signal in the secretory protein pepsinogen. Cell. 1990;63:281–291. - PubMed
-
- Barker W.C., George D.G., Hunt L.T. Protein sequence database. Methods Enzymol. 1990;183:31–49. - PubMed
-
- Bendiak B. A common peptide stretch among enzymes localized to the Golgi apparatus: Structural similarity of Golgi-associated glycosyltransferases. Biochem. Biophys. Res. Commun. 1990;170:879–882. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources