Search algorithm for pattern match analysis of nucleic acid sequences
- PMID: 6344023
- PMCID: PMC325935
- DOI: 10.1093/nar/11.9.2943
Search algorithm for pattern match analysis of nucleic acid sequences
Abstract
A new type of search algorithm to find biological information inherited in nucleic acid sequences was developed. The algorithm is of pattern match type and is based on the fact that genetic information often is a function of a predictable statistical occurrence of the four bases within parts of the sequence. The search algorithm compares the known statistical pattern of bases in e.g. a promoter, with an unknown sequence and calculates the statistical significance of the match at all positions in the unknown sequence. The program was tested on 54 published prokaryotic promoters. 44 or 49 could be found with 1 or 4 false answers, respectively. The program was also used on plasmid pBR322. All promoters functioning in an in vitro transcription system were found (tet, anti-tet, p4, bla and ori) except the so called p5 promoter. A search for donor and acceptor sites was performed in a human HLA genomic sequence that contains six introns. Five of the possible six donor and acceptor sites were found.
Similar articles
-
A novel method for promoter search enhanced by function-specific subgrouping of promoters--developed and tested on E.coli system.Nucleic Acids Res. 1989 Jun 26;17(12):4799-815. doi: 10.1093/nar/17.12.4799. Nucleic Acids Res. 1989. PMID: 2664710 Free PMC article.
-
Escherichia coli promoters. II. A spacing class-dependent promoter search protocol.J Biol Chem. 1989 Apr 5;264(10):5531-4. J Biol Chem. 1989. PMID: 2647721
-
Analysis of the occurrence of promoter-sites in DNA.Nucleic Acids Res. 1986 Jan 10;14(1):109-26. doi: 10.1093/nar/14.1.109. Nucleic Acids Res. 1986. PMID: 2935785 Free PMC article.
-
Nucleotide sequence of an Escherichia coli tRNA (Leu 1) operon and identification of the transcription promoter signal.Nucleic Acids Res. 1981 May 11;9(9):2121-39. doi: 10.1093/nar/9.9.2121. Nucleic Acids Res. 1981. PMID: 6272226 Free PMC article.
-
Analysis of E.coli promoter structures using neural networks.Nucleic Acids Res. 1994 Jun 11;22(11):2158-65. doi: 10.1093/nar/22.11.2158. Nucleic Acids Res. 1994. PMID: 8029027 Free PMC article.
Cited by
-
Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites.Nucleic Acids Res. 2005 Apr 22;33(7):2290-301. doi: 10.1093/nar/gki519. Print 2005. Nucleic Acids Res. 2005. PMID: 15849315 Free PMC article.
-
Genetic transformation in Streptococcus pneumoniae: nucleotide sequence and predicted amino acid sequence of recP.J Bacteriol. 1990 Jul;172(7):3669-74. doi: 10.1128/jb.172.7.3669-3674.1990. J Bacteriol. 1990. PMID: 2361942 Free PMC article.
-
Explainability in transformer models for functional genomics.Brief Bioinform. 2021 Sep 2;22(5):bbab060. doi: 10.1093/bib/bbab060. Brief Bioinform. 2021. PMID: 33834200 Free PMC article.
-
Characterization of the ColE1 mobilization region and its protein products.Mol Gen Genet. 1989 Jun;217(2-3):488-98. doi: 10.1007/BF02464922. Mol Gen Genet. 1989. PMID: 2671664
-
The lcrE gene is part of an operon in the lcr region of Yersinia enterocolitica O:3.J Bacteriol. 1990 Jun;172(6):3152-62. doi: 10.1128/jb.172.6.3152-3162.1990. J Bacteriol. 1990. PMID: 2160939 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous