Haplotype-aware pantranscriptome analyses using spliced pangenome graphs
- PMID: 36646895
- DOI: 10.1038/s41592-022-01731-9
Haplotype-aware pantranscriptome analyses using spliced pangenome graphs
Abstract
Pangenomics is emerging as a powerful computational paradigm in bioinformatics. This field uses population-level genome reference structures, typically consisting of a sequence graph, to mitigate reference bias and facilitate analyses that were challenging with previous reference-based methods. In this work, we extend these methods into transcriptomics to analyze sequencing data using the pantranscriptome: a population-level transcriptomic reference. Our toolchain, which consists of additions to the VG toolkit and a standalone tool, RPVG, can construct spliced pangenome graphs, map RNA sequencing data to these graphs, and perform haplotype-aware expression quantification of transcripts in a pantranscriptome. We show that this workflow improves accuracy over state-of-the-art RNA sequencing mapping methods, and that it can efficiently quantify haplotype-specific transcript expression without needing to characterize the haplotypes of a sample beforehand.
© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.
Comment in
-
Capturing haplotype variation in populations using pangenome references.Nat Biotechnol. 2023 Feb;41(2):194. doi: 10.1038/s41587-023-01691-1. Nat Biotechnol. 2023. PMID: 36792708 No abstract available.
Similar articles
-
Haplotype-aware sequence alignment to pangenome graphs.Genome Res. 2024 Oct 11;34(9):1265-1275. doi: 10.1101/gr.279143.124. Genome Res. 2024. PMID: 39013594
-
Comparing methods for constructing and representing human pangenome graphs.Genome Biol. 2023 Nov 30;24(1):274. doi: 10.1186/s13059-023-03098-2. Genome Biol. 2023. PMID: 38037131 Free PMC article.
-
GRAFIMO: Variant and haplotype aware motif scanning on pangenome graphs.PLoS Comput Biol. 2021 Sep 27;17(9):e1009444. doi: 10.1371/journal.pcbi.1009444. eCollection 2021 Sep. PLoS Comput Biol. 2021. PMID: 34570769 Free PMC article.
-
Pangenome Graphs.Annu Rev Genomics Hum Genet. 2020 Aug 31;21:139-162. doi: 10.1146/annurev-genom-120219-080406. Epub 2020 May 26. Annu Rev Genomics Hum Genet. 2020. PMID: 32453966 Free PMC article. Review.
-
Characterizing and annotating the genome using RNA-seq data.Sci China Life Sci. 2017 Feb;60(2):116-125. doi: 10.1007/s11427-015-0349-4. Epub 2016 Jun 13. Sci China Life Sci. 2017. PMID: 27294835 Review.
Cited by
-
A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study.BMC Genomics. 2024 Oct 31;25(1):1022. doi: 10.1186/s12864-024-10931-w. BMC Genomics. 2024. PMID: 39482604 Free PMC article. Review.
-
Efficient indexing and querying of annotations in a pangenome graph.bioRxiv [Preprint]. 2024 Oct 15:2024.10.12.618009. doi: 10.1101/2024.10.12.618009. bioRxiv. 2024. PMID: 39464141 Free PMC article. Preprint.
-
Integrated proteome and pangenome analysis revealed the variation of microalga Isochrysis galbana and associated bacterial community to 2,6-Di-tert-butyl-p-cresol (BHT) stress.World J Microbiol Biotechnol. 2024 Oct 24;40(11):364. doi: 10.1007/s11274-024-04171-z. World J Microbiol Biotechnol. 2024. PMID: 39446252
-
SAFARI: Pangenome Alignment of Ancient DNA Using Purine/Pyrimidine Encodings.bioRxiv [Preprint]. 2024 Oct 8:2024.08.12.607489. doi: 10.1101/2024.08.12.607489. bioRxiv. 2024. PMID: 39415996 Free PMC article. Preprint.
-
Ornaments for efficient allele-specific expression estimation with bias correction.Am J Hum Genet. 2024 Aug 8;111(8):1770-1781. doi: 10.1016/j.ajhg.2024.06.014. Epub 2024 Jul 23. Am J Hum Genet. 2024. PMID: 39047729
References
-
- Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinform. 12, 1–16 (2011). - DOI
Publication types
MeSH terms
Grants and funding
- U01HG010961/U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
- R01HG010485/U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
- U41HG010972/U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
- U24HG011853/U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
- OT2 OD026682/OD/NIH HHS/United States
LinkOut - more resources
Full Text Sources