Review
Nucleic Acids Res. 1992 Dec 25;20(24):6441-50.
doi: 10.1093/nar/20.24.6441.

Assessment of protein coding measures

Free PMC article

J W Fickett et al. Nucleic Acids Res. 1992.

Abstract

A number of methods for recognizing protein coding genes in DNA sequence have been published over the last 13 years, and new, more comprehensive algorithms, drawing on the repertoire of existing techniques, continue to be developed. To optimize continued development, it is valuable to systematically review and evaluate published techniques. At the core of most gene recognition algorithms are one or more coding measures: functions that produce, given any sample window of sequence, a number or vector intended to measure the degree to which the sample resembles a window of 'typical' exonic DNA. In this paper we review and synthesize the underlying coding measures from published algorithms. A standardized benchmark is described, and each of the measures is evaluated according to this benchmark. Our main conclusion is that a very simple and obvious measure, counting oligomers, is more effective than any of the more sophisticated measures. Different measures contain different information. However, there is a great deal of redundancy in the current suite of measures. We show that in future development of gene recognition algorithms, attention can probably be limited to six of the twenty or so measures proposed to date.
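
To make the idea of a coding measure concrete, the following is a minimal sketch of oligomer counting: a function that maps a sequence window to a vector of counts, one entry per possible oligomer. It is an illustration only, not the procedure evaluated in the paper. The function name `oligomer_counts`, the default `k=3`, the overlapping-window counting, and the lexicographic ordering of the output are all assumptions made for this example.

```python
from collections import Counter
from itertools import product


def oligomer_counts(window, k=3):
    """Map a DNA window to a vector of overlapping k-mer counts.

    Hypothetical helper for illustration. The vector has 4**k entries,
    ordered lexicographically over the alphabet A, C, G, T. Oligomers
    containing any other character (for example N) are ignored.
    """
    window = window.upper()
    # Count every overlapping substring of length k in the window.
    counts = Counter(window[i:i + k] for i in range(len(window) - k + 1))
    # Fix the output order so vectors from different windows are comparable.
    alphabet = ["".join(p) for p in product("ACGT", repeat=k)]
    return [counts[oligo] for oligo in alphabet]


if __name__ == "__main__":
    # Example window (made-up sequence, not data from the paper).
    vector = oligomer_counts("ATGGCCATTGTAATGGGCCGC", k=2)
    print(len(vector), sum(vector))
```

A vector like this would still need a decision rule, for instance a comparison against counts from known coding and noncoding windows, before it could separate the two classes; that step is outside the scope of this sketch.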
