New approaches for computer analysis of nucleic acid sequences
- PMID: 6577449
- PMCID: PMC384318
- DOI: 10.1073/pnas.80.18.5660
Abstract
A new high-speed computer algorithm is outlined that ascertains within and between nucleic acid and protein sequences all direct repeats, dyad symmetries, and other structural relationships. Large repeats, repeats of high frequency, dyad symmetries of specified stem length and loop distance, and their distributions are determined. Significance of homologies is assessed by a hierarchy of permutation procedures. Applications are made to papovaviruses, the human papillomavirus HPV, lambda phage, the human and mouse mitochondrial genomes, and the human and mouse immunoglobulin kappa-chain genes.
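The abstract names three operations without giving details: finding direct repeats, finding dyad symmetries with a given stem length and loop distance, and assessing significance by permutation. The sketch below illustrates each with a simple word-indexing approach in Python. It is not the authors' algorithm. All function names and parameters (`direct_repeats`, `dyad_symmetries`, `stem`, `max_loop`, `trials`) are hypothetical, and the single-base shuffle shown is only the simplest permutation scheme one might use, not the hierarchy of procedures the paper refers to.

```python
import random
from collections import defaultdict

# Watson-Crick complement table for DNA (assumes an A/C/G/T alphabet only).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def direct_repeats(seq, k):
    """Map every k-letter word that occurs at least twice to its start positions."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {word: pos for word, pos in positions.items() if len(pos) > 1}


def dyad_symmetries(seq, stem, max_loop):
    """Find (left_start, right_start) pairs whose right arm is the reverse
    complement of the left arm, with 0..max_loop bases between the arms."""
    hits = []
    n = len(seq)
    for i in range(n - 2 * stem + 1):
        target = reverse_complement(seq[i:i + stem])
        for loop in range(max_loop + 1):
            j = i + stem + loop
            if j + stem > n:
                break  # right arm would run past the end of the sequence
            if seq[j:j + stem] == target:
                hits.append((i, j))
    return hits


def longest_repeat_length(seq):
    """Length of the longest word occurring at least twice (overlaps allowed)."""
    k = 0
    # If a (k+1)-letter word repeats, so does a k-letter word, so a linear scan works.
    while direct_repeats(seq, k + 1):
        k += 1
    return k


def permutation_p_value(seq, trials=200, seed=0):
    """Estimate how often a base-shuffled sequence has a longest repeat at
    least as long as the observed one (shuffling preserves base composition)."""
    rng = random.Random(seed)
    observed = longest_repeat_length(seq)
    letters = list(seq)
    at_least = 0
    for _ in range(trials):
        rng.shuffle(letters)
        if longest_repeat_length("".join(letters)) >= observed:
            at_least += 1
    # Add-one correction keeps the estimate strictly positive.
    return (at_least + 1) / (trials + 1)
```

For example, `direct_repeats("ACGTACGT", 4)` reports the word `ACGT` at positions 0 and 4, and `dyad_symmetries("GAATCTTTGATTC", 5, 3)` reports one stem pair starting at positions 0 and 8 with a three-base loop. The same indexing idea extends to comparisons between two sequences by building the word table from one sequence and looking up words from the other.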
Similar articles
- Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries. Nucleic Acids Res. 1982 Jan 11;10(1):247-63. doi: 10.1093/nar/10.1.247. PMID: 6801626. Free PMC article.
- Comparative statistics for DNA and protein sequences: single sequence analysis. Proc Natl Acad Sci U S A. 1985 Sep;82(17):5800-4. doi: 10.1073/pnas.82.17.5800. PMID: 2994049. Free PMC article.
- The frequency of matching sequences in DNA. J Theor Biol. 1984 May 7;108(1):111-22. doi: 10.1016/s0022-5193(84)80172-1. PMID: 6748676.
- Statistical significance of symmetrical and repetitive segments in DNA. Nucleic Acids Res. 1982 Dec 20;10(24):8323-39. doi: 10.1093/nar/10.24.8323. PMID: 7162993. Free PMC article.
- [Computer analysis of nucleotide sequences]. Tanpakushitsu Kakusan Koso. 1983 Sep;28(10):1165-86. PMID: 6314435. Review. Japanese. No abstract available.
Cited by
- An accurate approximation to the distribution of the length of the longest matching word between two random DNA sequences. Bull Math Biol. 1990;52(6):773-84. doi: 10.1007/BF02460808. PMID: 2279194.
- The use of multiple alphabets in kappa-gene immunoglobulin DNA sequence comparisons. EMBO J. 1985 May;4(5):1217-23. doi: 10.1002/j.1460-2075.1985.tb03763.x. PMID: 3924599. Free PMC article.
- Phase transitions in sequence matches and nucleic acid structure. Proc Natl Acad Sci U S A. 1987 Mar;84(5):1239-43. doi: 10.1073/pnas.84.5.1239. PMID: 3469666. Free PMC article.
- Heuristic informational analysis of sequences. Nucleic Acids Res. 1986 Jan 10;14(1):179-96. doi: 10.1093/nar/14.1.179. PMID: 3753763. Free PMC article.
- An efficient algorithm for identifying matches with errors in multiple long molecular sequences. J Mol Biol. 1991 Oct 20;221(4):1367-78. doi: 10.1016/0022-2836(91)90938-3. PMID: 1942056. Free PMC article.