The sampling distribution of linkage disequilibrium under an infinite allele model without selection
- PMID: 3979817
- PMCID: PMC1216291
- DOI: 10.1093/genetics/109.3.611
The sampling distribution of linkage disequilibrium under an infinite allele model without selection
Abstract
The sampling distributions of several statistics that measure the association of alleles on gametes (linkage disequilibrium) are estimated under a two-locus neutral infinite allele model using an efficient Monte Carlo method. An often used approximation for the mean squared linkage disequilibrium is shown to be inaccurate unless the proper statistical conditioning is used. The joint distribution of linkage disequilibrium and the allele frequencies in the sample is studied. This estimated joint distribution is sufficient for obtaining an approximate maximum likelihood estimate of C = 4Nc, where N is the population size and c is the recombination rate. It has been suggested that observations of high linkage disequilibrium might be a good basis for rejecting a neutral model in favor of a model in which natural selection maintains genetic variation. It is found that a single sample of chromosomes, examined at two loci cannot provide sufficient information for such a test if C less than 10, because with C this small, very high levels of linkage disequilibrium are not unexpected under the neutral model. In samples of size 50, it is found that, even when C is as large as 50, the distribution of linkage disequilibrium conditional on the allele frequencies is substantially different from the distribution when there is no linkage between the loci. When conditioned on the number of alleles at each locus in the sample, all of the sample statistics examined are nearly independent of theta = 4N mu, where mu is the neutral mutation rate.
Similar articles
-
The sampling distribution of linkage disequilibrium.Genetics. 1984 Sep;108(1):257-74. doi: 10.1093/genetics/108.1.257. Genetics. 1984. PMID: 6479585 Free PMC article.
-
Variance in estimated pairwise genetic distance under high versus low coverage sequencing: The contribution of linkage disequilibrium.Theor Popul Biol. 2017 Oct;117:51-63. doi: 10.1016/j.tpb.2017.08.001. Epub 2017 Aug 24. Theor Popul Biol. 2017. PMID: 28842178
-
Approximating genealogies for partially linked neutral loci under a selective sweep.J Math Biol. 2007 Sep;55(3):299-330. doi: 10.1007/s00285-007-0085-7. Epub 2007 Mar 30. J Math Biol. 2007. PMID: 17396267
-
On selecting markers for association studies: patterns of linkage disequilibrium between two and three diallelic loci.Genet Epidemiol. 2003 Jan;24(1):57-67. doi: 10.1002/gepi.10217. Genet Epidemiol. 2003. PMID: 12508256 Review.
-
Statistical genetics in and out of quasi-linkage equilibrium.Rep Prog Phys. 2023 Apr 4;86(5). doi: 10.1088/1361-6633/acc5fa. Rep Prog Phys. 2023. PMID: 36944245 Review.
Cited by
-
Divergence between human populations estimated from linkage disequilibrium.Am J Hum Genet. 2008 Dec;83(6):737-43. doi: 10.1016/j.ajhg.2008.10.019. Epub 2008 Nov 13. Am J Hum Genet. 2008. PMID: 19012875 Free PMC article.
-
Mathematical Properties of Linkage Disequilibrium Statistics Defined by Normalization of the Coefficient D = pAB - pApB.Hum Hered. 2019;84(3):127-143. doi: 10.1159/000504171. Epub 2020 Feb 11. Hum Hered. 2019. PMID: 32045910 Free PMC article.
-
Molecular and phenotypic variation of the white locus region in Drosophila melanogaster.Genetics. 1988 Sep;120(1):199-212. doi: 10.1093/genetics/120.1.199. Genetics. 1988. PMID: 2906026 Free PMC article.
-
Theory of the effects of population structure and sampling on patterns of linkage disequilibrium applied to genomic data from humans.Genetics. 2003 Jul;164(3):1043-53. doi: 10.1093/genetics/164.3.1043. Genetics. 2003. PMID: 12871914 Free PMC article.
-
Which locus has the oldest allele?J Math Biol. 1991;29(8):763-77. doi: 10.1007/BF00160191. J Math Biol. 1991. PMID: 1940668
References
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials