iCall: a genotype-calling algorithm for rare, low-frequency and common variants on the Illumina exome array
- PMID: 24567545
- DOI: 10.1093/bioinformatics/btu107
iCall: a genotype-calling algorithm for rare, low-frequency and common variants on the Illumina exome array
Abstract
Motivation: Next-generation genotyping microarrays have been designed with insights from 1000 Genomes Project and whole-exome sequencing studies. These arrays additionally include variants that are typically present at lower frequencies. Determining the genotypes of these variants from hybridization intensities is challenging because there is less support to locate the presence of the minor alleles when the allele counts are low. Existing algorithms are mainly designed for calling common variants and are notorious for failing to generate accurate calls for low-frequency and rare variants. Here, we introduce a new calling algorithm, iCall, to call genotypes for variants across the whole spectrum of allele frequencies.
Results: We benchmarked iCall against four of the most commonly used algorithms, GenCall, optiCall, illuminus and GenoSNP, as well as a post-processing caller zCall that adopted a two-stage calling design. Normalized hybridization intensities for 12 370 individuals genotyped on the Illumina HumanExome BeadChip were considered, of which 81 individuals were also whole-genome sequenced. The sequence calls were used to benchmark the accuracy of the genotype calling, and our comparisons indicated that iCall outperforms all four single-stage calling algorithms in terms of call rates and concordance, particularly in the calling accuracy of minor alleles, which is the principal concern for rare and low-frequency variants. The application of zCall to post-process the output from iCall also produced marginally improved performance to the combination of zCall and GenCall.
Availability and implementation: iCall is implemented in C++ for use on Linux operating systems and is available for download at http://www.statgen.nus.edu.sg/∼software/icall.html.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Similar articles
-
optiCall: a robust genotype-calling algorithm for rare, low-frequency and common variants.Bioinformatics. 2012 Jun 15;28(12):1598-603. doi: 10.1093/bioinformatics/bts180. Epub 2012 Apr 12. Bioinformatics. 2012. PMID: 22500001 Free PMC article.
-
zCall: a rare variant caller for array-based genotyping: genetics and population analysis.Bioinformatics. 2012 Oct 1;28(19):2543-5. doi: 10.1093/bioinformatics/bts479. Epub 2012 Jul 27. Bioinformatics. 2012. PMID: 22843986 Free PMC article.
-
KRLMM: an adaptive genotype calling method for common and low frequency variants.BMC Bioinformatics. 2014 May 23;15:158. doi: 10.1186/1471-2105-15-158. BMC Bioinformatics. 2014. PMID: 24886250 Free PMC article.
-
A Novel Quality-Control Procedure to Improve the Accuracy of Rare Variant Calling in SNP Arrays.Front Genet. 2021 Oct 26;12:736390. doi: 10.3389/fgene.2021.736390. eCollection 2021. Front Genet. 2021. PMID: 34764980 Free PMC article. Review.
-
The role and challenges of exome sequencing in studies of human diseases.Front Genet. 2013 Aug 26;4:160. doi: 10.3389/fgene.2013.00160. Front Genet. 2013. PMID: 24032039 Free PMC article. Review.
Cited by
-
Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets.BMC Med Genomics. 2022 Mar 14;15(1):56. doi: 10.1186/s12920-022-01199-8. BMC Med Genomics. 2022. PMID: 35287663 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous