HapKled: a haplotype-aware structural variant calling approach for Oxford nanopore sequencing data
- PMID: 39045321
- PMCID: PMC11263161
- DOI: 10.3389/fgene.2024.1435087
HapKled: a haplotype-aware structural variant calling approach for Oxford nanopore sequencing data
Abstract
Introduction: Structural Variants (SVs) are a type of variation that can significantly influence phenotypes and cause diseases. Thus, the accurate detection of SVs is a vital part of modern genetic analysis. The advent of long-read sequencing technology ushers in a new era of more accurate and comprehensive SV calling, and many tools have been developed to call SVs using long-read data. Haplotype-tagging is a procedure that can tag haplotype information on reads and can thus potentially improve the SV detection; nevertheless, few methods make use of this information. In this article, we introduce HapKled, a new SV detection tool that can accurately detect SVs from Oxford Nanopore Technologies (ONT) long-read alignment data. Methods: HapKled utilizes haplotype information underlying alignment data by conducting haplotype-tagging using Whatshap on the reads to improve the detection performance, with three unique calling mechanics including altering clustering conditions according to haplotype information of signatures, determination of similar SVs based on haplotype information, and slack filtering conditions based on haplotype quality. Results: In our evaluations, HapKled outperformed state-of-the-art tools and can deliver better SV detection results on both simulated and real sequencing data. The code and experiments of HapKled can be obtained from https://github.com/CoREse/HapKled. Discussion: With the superb SV detection performance that HapKled can deliver, HapKled could be useful in bioinformatics research, clinical diagnosis, and medical research and development.
Keywords: Oxford nanopore sequencing; haplotype-tagging; long-read sequencing; structural variant; variant calling.
Copyright © 2024 Zhang, Liu, Li, Liu, Wang and Jiang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
Similar articles
-
Duet: SNP-assisted structural variant calling and phasing using Oxford nanopore sequencing.BMC Bioinformatics. 2022 Nov 7;23(1):465. doi: 10.1186/s12859-022-05025-x. BMC Bioinformatics. 2022. PMID: 36344913 Free PMC article.
-
Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data.Front Genet. 2021 Nov 18;12:761791. doi: 10.3389/fgene.2021.761791. eCollection 2021. Front Genet. 2021. PMID: 34868242 Free PMC article.
-
Kled: an ultra-fast and sensitive structural variant detection tool for long-read sequencing data.Brief Bioinform. 2024 Jan 22;25(2):bbae049. doi: 10.1093/bib/bbae049. Brief Bioinform. 2024. PMID: 38385878 Free PMC article.
-
Application of long-read sequencing to the detection of structural variants in human cancer genomes.Comput Struct Biotechnol J. 2021 Jul 28;19:4207-4216. doi: 10.1016/j.csbj.2021.07.030. eCollection 2021. Comput Struct Biotechnol J. 2021. PMID: 34527193 Free PMC article. Review.
-
Resolving complex structural variants via nanopore sequencing.Front Genet. 2023 Aug 16;14:1213917. doi: 10.3389/fgene.2023.1213917. eCollection 2023. Front Genet. 2023. PMID: 37674481 Free PMC article. Review.
References
-
- Bennett E. P., Petersen B. L., Johansen I. E., Niu Y., Yang Z., Chamberlain C. A., et al. (2020). INDEL detection, the ‘Achilles heel’ of precise genome editing: a survey of methods for accurate profiling of gene editing induced indels. Nucleic Acids Res. 48, 11958–11981. 10.1093/nar/gkaa975 - DOI - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous