Characterizing and annotating the genome using RNA-seq data
- PMID: 27294835
- DOI: 10.1007/s11427-015-0349-4
Characterizing and annotating the genome using RNA-seq data
Abstract
Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technologies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic variants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Genome- guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses.
Keywords: RNA-seq; de novo assembly; genetic variants; genome-guided transcriptome reconstruction; long noncoding RNA.
Similar articles
-
De Novo Plant Transcriptome Assembly and Annotation Using Illumina RNA-Seq Reads.Methods Mol Biol. 2019;1933:265-275. doi: 10.1007/978-1-4939-9045-0_16. Methods Mol Biol. 2019. PMID: 30945191
-
A high-quality annotated transcriptome of swine peripheral blood.BMC Genomics. 2017 Jun 24;18(1):479. doi: 10.1186/s12864-017-3863-7. BMC Genomics. 2017. PMID: 28646867 Free PMC article.
-
Comparative study of de novo assembly and genome-guided assembly strategies for transcriptome reconstruction based on RNA-Seq.Sci China Life Sci. 2013 Feb;56(2):143-55. doi: 10.1007/s11427-013-4442-z. Epub 2013 Feb 8. Sci China Life Sci. 2013. PMID: 23393030
-
Designing a transcriptome next-generation sequencing project for a nonmodel plant species.Am J Bot. 2012 Feb;99(2):257-66. doi: 10.3732/ajb.1100292. Epub 2012 Jan 19. Am J Bot. 2012. PMID: 22268224 Review.
-
Overview of available methods for diverse RNA-Seq data analyses.Sci China Life Sci. 2011 Dec;54(12):1121-8. doi: 10.1007/s11427-011-4255-x. Epub 2012 Jan 7. Sci China Life Sci. 2011. PMID: 22227904 Review.
Cited by
-
Integrative analyses of long and short-read RNA sequencing reveal the spliced isoform regulatory network of seedling growth dynamics in upland cotton.Funct Integr Genomics. 2024 Sep 4;24(5):156. doi: 10.1007/s10142-024-01420-0. Funct Integr Genomics. 2024. PMID: 39230785
-
Water stress modulates terpene biosynthesis and morphophysiology at different ploidal levels in Lippia alba (Mill.) N. E. Brown (Verbenaceae).Protoplasma. 2024 Mar;261(2):227-243. doi: 10.1007/s00709-023-01890-2. Epub 2023 Sep 4. Protoplasma. 2024. PMID: 37665420
-
Serum Long Noncoding RNA H19 and CKD Progression in IgA Nephropathy.J Nephrol. 2023 Mar;36(2):397-406. doi: 10.1007/s40620-022-01536-1. Epub 2022 Dec 27. J Nephrol. 2023. PMID: 36574208
-
Differentially expressed genes against Colletotrichum lindemuthiamum in a bean genotype carrying the Co-2 gene revealed by RNA-sequencing analysis.Front Plant Sci. 2022 Sep 15;13:981517. doi: 10.3389/fpls.2022.981517. eCollection 2022. Front Plant Sci. 2022. PMID: 36311094 Free PMC article.
-
Comprehensive annotation of the Chinese tree shrew genome by large-scale RNA sequencing and long-read isoform sequencing.Zool Res. 2021 Nov 18;42(6):692-709. doi: 10.24272/j.issn.2095-8137.2021.272. Zool Res. 2021. PMID: 34581030 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources