Genome Res. 1998 Jul;8(7):748-54.
doi: 10.1101/gr.8.7.748.

Overlapping genomic sequences: a treasure trove of single-nucleotide polymorphisms

P Taillon-Miller et al. Genome Res. 1998 Jul.

Abstract

An efficient strategy to develop a dense set of single-nucleotide polymorphism (SNP) markers is to take advantage of the human genome sequencing effort currently under way. Our approach is based on the fact that bacterial artificial chromosomes (BACs) and P1-based artificial chromosomes (PACs) used in long-range sequencing projects come from diploid libraries. If the overlapping clones sequenced are from different lineages, one is comparing the sequences from two homologous chromosomes in the overlapping region. We have analyzed in detail every SNP identified while sequencing three sets of overlapping clones found on chromosome 5p15.2, 7q21-7q22, and 13q12-13q13. In the 200.6 kb of DNA sequence analyzed in these overlaps, 153 SNPs were identified. Computer analysis for repetitive elements and suitability for STS development yielded 44 STSs containing 68 SNPs for further study. All 68 SNPs were confirmed to be present in at least one of the three (Caucasian, African-American, Hispanic) populations studied. Furthermore, 42 of the SNPs tested (62%) were informative in at least one population, 32 (47%) were informative in two or more populations, and 23 (34%) were informative in all three populations. These results clearly indicate that developing SNP markers from overlapping genomic sequence is highly efficient and cost-effective, requiring only the two simple steps of developing STSs around the known SNPs and characterizing them in the appropriate populations.
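The figures in the abstract can be checked with a few lines of arithmetic, and the core idea (reading mismatches between two overlapping clone sequences as candidate SNPs) can be sketched in the same place. The Python below is an illustrative sketch, not the authors' pipeline: the function names `percent` and `candidate_snps` and the toy sequences are hypothetical, the sequences are assumed to be already aligned and of equal length, and the SNP density is derived here from the quoted totals rather than stated in the abstract.

```python
# Totals quoted in the abstract.
OVERLAP_KB = 200.6   # overlapping sequence analyzed, in kb
SNPS_FOUND = 153     # SNPs identified in the overlaps
SNPS_TESTED = 68     # SNPs carried forward in 44 STSs
INFORMATIVE = {
    "at least one population": 42,
    "two or more populations": 32,
    "all three populations": 23,
}


def percent(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


def candidate_snps(seq_a, seq_b):
    """Return 0-based positions where two pre-aligned sequences differ.

    Hypothetical helper: assumes equal-length, gap-free alignment and
    ignores positions where either base is not A, C, G, or T.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return [
        i
        for i, (a, b) in enumerate(zip(seq_a, seq_b))
        if a != b and a in "ACGT" and b in "ACGT"
    ]


# Derived density: kb of overlap per SNP found (not stated in the abstract).
kb_per_snp = OVERLAP_KB / SNPS_FOUND
print(f"about one SNP per {kb_per_snp:.2f} kb of overlap")

for label, count in INFORMATIVE.items():
    print(f"{count}/{SNPS_TESTED} informative in {label}: "
          f"{percent(count, SNPS_TESTED)}%")

# Toy overlap (made-up sequences) with a single mismatch.
print(candidate_snps("ACGTACGT", "ACGTTCGT"))
```

The rounded percentages reproduce the 62%, 47%, and 34% quoted above. Real overlaps would also need alignment and base-quality filtering before a mismatch is treated as a candidate SNP, which this sketch leaves out.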


