A population model for genotyping indels from next-generation sequence data
- PMID: 23221639
- PMCID: PMC3562001
- DOI: 10.1093/nar/gks1143
A population model for genotyping indels from next-generation sequence data
Abstract
Insertion and deletion polymorphisms (indels) are an important source of genomic variation in plant and animal genomes, but accurate genotyping from low-coverage and exome next-generation sequence data remains challenging. We introduce an efficient population clustering algorithm for diploids and polyploids which was tested on a dataset of 2000 exomes. Compared with existing methods, we report a 4-fold reduction in overall indel genotype error rates with a 9-fold reduction in low coverage regions.
Figures
Similar articles
-
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017. PLoS One. 2017. PMID: 28792971 Free PMC article.
-
mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development.BMC Genomics. 2016 Apr 14;17:290. doi: 10.1186/s12864-016-2614-5. BMC Genomics. 2016. PMID: 27079510 Free PMC article.
-
Development of genome-wide insertion and deletion markers for maize, based on next-generation sequencing data.BMC Genomics. 2015 Aug 13;16(1):601. doi: 10.1186/s12864-015-1797-5. BMC Genomics. 2015. PMID: 26269146 Free PMC article.
-
The distribution and mutagenesis of short coding INDELs from 1,128 whole exomes.BMC Genomics. 2015 Feb 28;16(1):143. doi: 10.1186/s12864-015-1333-7. BMC Genomics. 2015. PMID: 25765891 Free PMC article.
-
High throughput genotyping of structural variations in a complex plant genome using an original Affymetrix® axiom® array.BMC Genomics. 2019 Nov 13;20(1):848. doi: 10.1186/s12864-019-6136-9. BMC Genomics. 2019. PMID: 31722668 Free PMC article.
Cited by
-
The Concept of Immunogenetics.Adv Exp Med Biol. 2022;1367:1-17. doi: 10.1007/978-3-030-92616-8_1. Adv Exp Med Biol. 2022. PMID: 35286690
-
Development of genetic markers in Eucalyptus species by target enrichment and exome sequencing.PLoS One. 2015 Jan 20;10(1):e0116528. doi: 10.1371/journal.pone.0116528. eCollection 2015. PLoS One. 2015. PMID: 25602379 Free PMC article.
-
Pathogenic Variants in Cancer Predisposition Genes and Prostate Cancer Risk in Men of African Ancestry.JCO Precis Oncol. 2020;4:32-43. doi: 10.1200/po.19.00179. Epub 2020 Jan 31. JCO Precis Oncol. 2020. PMID: 32832836 Free PMC article.
-
Population genetic analysis of bi-allelic structural variants from low-coverage sequence data with an expectation-maximization algorithm.BMC Bioinformatics. 2014 May 29;15:163. doi: 10.1186/1471-2105-15-163. BMC Bioinformatics. 2014. PMID: 24884587 Free PMC article.
-
npInv: accurate detection and genotyping of inversions using long read sub-alignment.BMC Bioinformatics. 2018 Jul 13;19(1):261. doi: 10.1186/s12859-018-2252-9. BMC Bioinformatics. 2018. PMID: 30001702 Free PMC article.
References
-
- Li Y, Zheng H, Luo R, Wu H, Zhu H, Li R, Cao H, Wu B, Huang S, Shao H, et al. Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly. Nat. Biotechnol. 2011;29:723–730. - PubMed
-
- Cao J, Schneeberger K, Ossowski S, Gunther T, Bender S, Fitz J, Koenig D, Lanz C, Stegle O, Lippert C, et al. Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat. Genet. 2011;43:956–963. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources