Hum Genet. 2006 Aug;120(1):58-68.
doi: 10.1007/s00439-006-0182-5. Epub 2006 May 6.

Efficient selection of tagging single-nucleotide polymorphisms in multiple populations

Bryan N Howie et al. Hum Genet. 2006 Aug.

Abstract

Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
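
The two ideas in the abstract, greedy binning of SNPs by pairwise association and choosing among nearly equivalent tagSNPs so that the union across populations stays small, can be illustrated with a minimal sketch. This is not the published LD-select or MultiPop-TagSelect code: the function names (`symmetric`, `greedy_bins`, `greedy_union`), the r² threshold of 0.8, the toy r² values, and the use of a plain greedy hitting-set heuristic for the multi-population step are all assumptions made for illustration.

```python
def symmetric(pairs, snps):
    """Build a symmetric r^2 lookup from {(snp1, snp2): r2} (hypothetical helper)."""
    r2 = {s: {} for s in snps}
    for (s, t), value in pairs.items():
        r2[s][t] = value
        r2[t][s] = value
    return r2


def greedy_bins(r2, threshold=0.8):
    """Greedy binning in the spirit of LD-select (simplified sketch).

    Repeatedly seed a bin with the unbinned SNP that exceeds the r^2
    threshold with the most other unbinned SNPs. Returns a list of
    (members, tags), where tags are the members that exceed the
    threshold with every other member of the bin.
    """
    unbinned = set(r2)
    bins = []
    while unbinned:
        def partners(s):
            return {t for t in unbinned
                    if t != s and r2[s].get(t, 0.0) >= threshold}

        # sorted() makes tie-breaking deterministic (alphabetical order).
        seed = max(sorted(unbinned), key=lambda s: len(partners(s)))
        members = partners(seed) | {seed}
        tags = {s for s in members
                if all(t == s or r2[s].get(t, 0.0) >= threshold
                       for t in members)}
        bins.append((members, tags))
        unbinned -= members
    return bins


def greedy_union(candidate_sets):
    """Pick SNPs so that every bin gets at least one of its candidate tags.

    Assumption: a standard greedy hitting-set heuristic, which repeatedly
    takes the SNP that appears in the most still-untagged bins. It stands
    in for, and is not, the MultiPop-TagSelect procedure.
    """
    remaining = [set(c) for c in candidate_sets if c]
    chosen = set()
    while remaining:
        counts = {}
        for candidates in remaining:
            for snp in candidates:
                counts[snp] = counts.get(snp, 0) + 1
        best = max(sorted(counts), key=counts.get)
        chosen.add(best)
        remaining = [c for c in remaining if best not in c]
    return chosen


# Toy data (invented): three SNPs with different LD patterns in two populations.
snps = ["a", "b", "c"]
pop_a = symmetric({("a", "b"): 0.9, ("a", "c"): 0.1, ("b", "c"): 0.1}, snps)
pop_b = symmetric({("a", "b"): 0.1, ("a", "c"): 0.1, ("b", "c"): 0.95}, snps)

# Collect the interchangeable tag candidates of every bin in every population.
candidate_sets = [tags for pop in (pop_a, pop_b)
                  for _, tags in greedy_bins(pop)]
multi_pop_tags = greedy_union(candidate_sets)
```

In this toy case each population needs two tagSNPs on its own, and an unlucky independent choice (for example `a`, `c` in the first population and `a`, `b` in the second) would require genotyping all three SNPs. Exploiting the freedom to choose between equivalent tags lets the combined set cover both populations with two.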
