Towards accurate, contiguous and complete alignment-based polyploid phasing algorithms
- PMID: 35483655
- DOI: 10.1016/j.ygeno.2022.110369
Towards accurate, contiguous and complete alignment-based polyploid phasing algorithms
Abstract
Phasing, and in particular polyploid phasing, have been challenging problems held back by the limited read length of high-throughput short read sequencing methods which can't overcome the distance between heterozygous sites and labor high cost of alternative methods such as the physical separation of chromosomes for example. Recently developed single molecule long-read sequencing methods provide much longer reads which overcome this previous limitation. Here we review the alignment-based methods of polyploid phasing that rely on four main strategies: population inference methods, which leverage the genetic information of several individuals to phase a sample; objective function minimization methods, which minimize a function such as the Minimum Error Correction (MEC); graph partitioning methods, which represent the read data as a graph and split it into k haplotype subgraphs; cluster building methods, which iteratively grow clusters of similar reads into a final set of clusters that represent the haplotypes. We discuss the advantages and limitations of these methods and the metrics used to assess their performance, proposing that accuracy and contiguity are the most meaningful metrics. Finally, we propose the field of alignment-based polyploid phasing would greatly benefit from the use of a well-designed benchmarking dataset with appropriate evaluation metrics. We consider that there are still significant improvements which can be achieved to obtain more accurate and contiguous polyploid phasing results which reflect the complexity of polyploid genome architectures.
Keywords: Algorithms; Heterozygosity; Phased genome; Polyploid; Population genomics.
Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.
Similar articles
-
flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning.J Comput Biol. 2022 Feb;29(2):195-211. doi: 10.1089/cmb.2021.0436. Epub 2022 Jan 17. J Comput Biol. 2022. PMID: 35041529 Free PMC article.
-
Efficient algorithms for polyploid haplotype phasing.BMC Genomics. 2018 May 9;19(Suppl 2):110. doi: 10.1186/s12864-018-4464-9. BMC Genomics. 2018. PMID: 29764364 Free PMC article.
-
nPhase: an accurate and contiguous phasing method for polyploids.Genome Biol. 2021 Apr 29;22(1):126. doi: 10.1186/s13059-021-02342-x. Genome Biol. 2021. PMID: 33926549 Free PMC article.
-
Sequencing and Assembly of Polyploid Genomes.Methods Mol Biol. 2023;2545:429-458. doi: 10.1007/978-1-0716-2561-3_23. Methods Mol Biol. 2023. PMID: 36720827 Review.
-
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.Methods Mol Biol. 2023;2590:161-182. doi: 10.1007/978-1-0716-2819-5_11. Methods Mol Biol. 2023. PMID: 36335499 Review.
Cited by
-
Recent Advances in Assembly of Complex Plant Genomes.Genomics Proteomics Bioinformatics. 2023 Jun;21(3):427-439. doi: 10.1016/j.gpb.2023.04.004. Epub 2023 Apr 25. Genomics Proteomics Bioinformatics. 2023. PMID: 37100237 Free PMC article. Review.
-
Phased chromosome-scale genome assembly of an asexual, allopolyploid root-knot nematode reveals complex subgenomic structure.PLoS One. 2024 Jun 6;19(6):e0302506. doi: 10.1371/journal.pone.0302506. eCollection 2024. PLoS One. 2024. PMID: 38843263 Free PMC article.
-
A roadmap of phylogenomic methods for studying polyploid plant genera.Appl Plant Sci. 2024 Apr 22;12(4):e11580. doi: 10.1002/aps3.11580. eCollection 2024 Jul-Aug. Appl Plant Sci. 2024. PMID: 39184196 Free PMC article.
-
GCphase: an SNP phasing method using a graph partition and error correction algorithm.BMC Bioinformatics. 2024 Aug 19;25(1):267. doi: 10.1186/s12859-024-05901-8. BMC Bioinformatics. 2024. PMID: 39160480 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources