Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 1;10 Suppl 2(Suppl 2):47.
doi: 10.1186/s12918-016-0300-5.

SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations

Affiliations

SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations

Yongchao Liu et al. BMC Syst Biol. .

Abstract

Background: Various approaches to calling single-nucleotide variants (SNVs) or insertion-or-deletion (indel) mutations have been developed based on next-generation sequencing (NGS). However, most of them are dedicated to a particular type of mutation, e.g. germline SNVs in normal cells, somatic SNVs in cancer/tumor cells, or indels only. In the literature, efficient and integrated callers for both germline and somatic SNVs/indels have not yet been extensively investigated.

Results: We present SNVSniffer, an efficient and integrated caller identifying both germline and somatic SNVs/indels from NGS data. In this algorithm, we propose the use of Bayesian probabilistic models to identify SNVs and investigate a multiple ungapped alignment approach to call indels. For germline variant calling, we model allele counts per site to follow a multinomial conditional distribution. For somatic variant calling, we rely on paired tumor-normal pairs from identical individuals and introduce a hybrid subtraction and joint sample analysis approach by modeling tumor-normal allele counts per site to follow a joint multinomial conditional distribution. A comprehensive performance evaluation has been conducted using a diversity of variant calling benchmarks. For germline variant calling, SNVSniffer demonstrates highly competitive accuracy with superior speed in comparison with the state-of-the-art FaSD, GATK and SAMtools. For somatic variant calling, our algorithm achieves comparable or even better accuracy, at fast speed, than the leading VarScan2, SomaticSniper, JointSNVMix2 and MuTect.

Conclusions: SNVSniffers demonstrates the feasibility to develop integrated solutions to fast and efficient identification of germline and somatic variants. Nonetheless, accurate discovery of genetic variations is critical yet challenging, and still requires substantially more research efforts being devoted. SNVSniffer and synthetic samples are publicly available at http://snvsniffer.sourceforge.net .

Keywords: Bayesian model; Indel calling; SNP calling; Somatic SNV calling.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Recall on virtual tumors in the function of tumor purity
Fig. 2
Fig. 2
Precision on virtual tumors in the function of tumor purity
Fig. 3
Fig. 3
F-score on virtual tumors in the function of tumor purity
Fig. 4
Fig. 4
Program diagram of SNVSniffer for variant calling

Similar articles

Cited by

References

    1. Li H, Durbin R. Fast and accurate long-read alignment with burrows–wheeler transform. Bioinformatics. 2010;26(5):589–95. doi: 10.1093/bioinformatics/btp698. - DOI - PMC - PubMed
    1. Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9(4):357–9. doi: 10.1038/nmeth.1923. - DOI - PMC - PubMed
    1. Liu Y, Schmidt B. Long read alignment based on maximal exact match seeds. Bioinformatics. 2012;28(18):318–24. doi: 10.1093/bioinformatics/bts414. - DOI - PMC - PubMed
    1. Liu Y, Popp B, Schmidt B. CUSHAW3: sensitive and accurate base-space and color-space short-read alignment with hybrid seeding. PloS ONE. 2014;9(1):86869. doi: 10.1371/journal.pone.0086869. - DOI - PMC - PubMed
    1. Li H, Ruan J, Durbin R. Mapping short dna sequencing reads and calling variants using mapping quality scores. Genome Res. 2008;18(11):1851–8. doi: 10.1101/gr.078212.108. - DOI - PMC - PubMed

Publication types