StringTie enables improved reconstruction of a transcriptome from RNA-seq reads
- PMID: 25690850
- PMCID: PMC4643835
- DOI: 10.1038/nbt.3122
StringTie enables improved reconstruction of a transcriptome from RNA-seq reads
Abstract
Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.
Conflict of interest statement
The authors declare no competing financial interests.
Figures



Similar articles
-
Improved transcriptome assembly using a hybrid of long and short reads with StringTie.PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun. PLoS Comput Biol. 2022. PMID: 35648784 Free PMC article.
-
TransComb: genome-guided transcriptome assembly via combing junctions in splicing graphs.Genome Biol. 2016 Oct 19;17(1):213. doi: 10.1186/s13059-016-1074-1. Genome Biol. 2016. PMID: 27760567 Free PMC article.
-
STAble: a novel approach to de novo assembly of RNA-seq data and its application in a metabolic model network based metatranscriptomic workflow.BMC Bioinformatics. 2018 Jul 9;19(Suppl 7):184. doi: 10.1186/s12859-018-2174-6. BMC Bioinformatics. 2018. PMID: 30066630 Free PMC article.
-
Protocol for transcriptome assembly by the TransBorrow algorithm.Biol Methods Protoc. 2023 Nov 1;8(1):bpad028. doi: 10.1093/biomethods/bpad028. eCollection 2023. Biol Methods Protoc. 2023. PMID: 38023349 Free PMC article. Review.
-
Mapping RNA-seq reads to transcriptomes efficiently based on learning to hash method.Comput Biol Med. 2020 Jan;116:103539. doi: 10.1016/j.compbiomed.2019.103539. Epub 2019 Nov 13. Comput Biol Med. 2020. PMID: 31765913 Review.
Cited by
-
Microprotein-encoding RNA regulation in cells treated with pro-inflammatory and pro-fibrotic stimuli.BMC Genomics. 2024 Nov 5;25(1):1034. doi: 10.1186/s12864-024-10948-1. BMC Genomics. 2024. PMID: 39497054 Free PMC article.
-
Fine mapping of a major QTL, qECQ8, for rice taste quality.BMC Plant Biol. 2024 Oct 31;24(1):1034. doi: 10.1186/s12870-024-05744-8. BMC Plant Biol. 2024. PMID: 39478453 Free PMC article.
-
HBEGF-TNF induce a complex outer retinal pathology with photoreceptor cell extrusion in human organoids.Nat Commun. 2022 Oct 19;13(1):6183. doi: 10.1038/s41467-022-33848-y. Nat Commun. 2022. PMID: 36261438 Free PMC article.
-
Implications for preeclampsia: hypoxia-induced Notch promotes trophoblast migration.Reproduction. 2021 May 14;161(6):681-696. doi: 10.1530/REP-20-0483. Reproduction. 2021. PMID: 33784241 Free PMC article.
-
Using online tools at the Bovine Genome Database to manually annotate genes in the new reference genome.Anim Genet. 2020 Oct;51(5):675-682. doi: 10.1111/age.12962. Epub 2020 Jun 14. Anim Genet. 2020. PMID: 32537769 Free PMC article.
References
Publication types
MeSH terms
Substances
Associated data
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials