Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies
- PMID: 16415888
- DOI: 10.1038/ng1706
Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies
Erratum in
- Nat Genet. 2006 Mar;38(3):390
Abstract
Genome-wide association is a promising approach to identify common genetic variants that predispose to human disease. Because of the high cost of genotyping hundreds of thousands of markers on thousands of subjects, genome-wide association studies often follow a staged design in which a proportion (pi(samples)) of the available samples are genotyped on a large number of markers in stage 1, and a proportion (pi(samples)) of these markers are later followed up by genotyping them on the remaining samples in stage 2. The standard strategy for analyzing such two-stage data is to view stage 2 as a replication study and focus on findings that reach statistical significance when stage 2 data are considered alone. We demonstrate that the alternative strategy of jointly analyzing the data from both stages almost always results in increased power to detect genetic association, despite the need to use more stringent significance levels, even when effect sizes differ between the two stages. We recommend joint analysis for all two-stage genome-wide association studies, especially when a relatively large proportion of the samples are genotyped in stage 1 (pi(samples) >or= 0.30), and a relatively large proportion of markers are selected for follow-up in stage 2 (pi(markers) >or= 0.01).
Similar articles
-
Optimal designs for two-stage genome-wide association studies.Genet Epidemiol. 2007 Nov;31(7):776-88. doi: 10.1002/gepi.20240. Genet Epidemiol. 2007. PMID: 17549752
-
Optimal two-stage strategy for detecting interacting genes in complex diseases.BMC Genet. 2006 Jun 15;7:39. doi: 10.1186/1471-2156-7-39. BMC Genet. 2006. PMID: 16776843 Free PMC article.
-
A mixed two-stage method for detecting interactions in genomewide association studies.J Theor Biol. 2010 Feb 21;262(4):576-83. doi: 10.1016/j.jtbi.2009.10.029. Epub 2009 Nov 6. J Theor Biol. 2010. PMID: 19896954
-
Common statistical issues in genome-wide association studies: a review on power, data quality control, genotype calling and population structure.Curr Opin Lipidol. 2008 Apr;19(2):133-43. doi: 10.1097/MOL.0b013e3282f5dd77. Curr Opin Lipidol. 2008. PMID: 18388693 Review.
-
Meta-analysis of genetic association studies: methodologies, between-study heterogeneity and winner's curse.J Hum Genet. 2009 Nov;54(11):615-23. doi: 10.1038/jhg.2009.95. Epub 2009 Oct 23. J Hum Genet. 2009. PMID: 19851339 Review.
Cited by
-
Common risk variants in NPHS1 and TNFSF15 are associated with childhood steroid-sensitive nephrotic syndrome.Kidney Int. 2020 Nov;98(5):1308-1322. doi: 10.1016/j.kint.2020.05.029. Epub 2020 Jun 14. Kidney Int. 2020. PMID: 32554042 Free PMC article.
-
Evaluation of common genetic variants in 82 candidate genes as risk factors for neural tube defects.BMC Med Genet. 2012 Aug 2;13:62. doi: 10.1186/1471-2350-13-62. BMC Med Genet. 2012. PMID: 22856873 Free PMC article.
-
Common genetic variants in the CLDN2 and PRSS1-PRSS2 loci alter risk for alcohol-related and sporadic pancreatitis.Nat Genet. 2012 Dec;44(12):1349-54. doi: 10.1038/ng.2466. Epub 2012 Nov 11. Nat Genet. 2012. PMID: 23143602 Free PMC article.
-
Genetic Variants Associated with Colorectal Adenoma Susceptibility.PLoS One. 2016 Apr 14;11(4):e0153084. doi: 10.1371/journal.pone.0153084. eCollection 2016. PLoS One. 2016. PMID: 27078840 Free PMC article.
-
A multicenter study confirms CD226 gene association with systemic sclerosis-related pulmonary fibrosis.Arthritis Res Ther. 2012 Apr 24;14(2):R85. doi: 10.1186/ar3809. Arthritis Res Ther. 2012. PMID: 22531499 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous