StringTie enables improved reconstruction of a transcriptome from RNA-seq reads
- PMID: 25690850
- PMCID: PMC4643835
- DOI: 10.1038/nbt.3122
StringTie enables improved reconstruction of a transcriptome from RNA-seq reads
Abstract
Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.
Conflict of interest statement
The authors declare no competing financial interests.
Figures
Similar articles
-
Improved transcriptome assembly using a hybrid of long and short reads with StringTie.PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun. PLoS Comput Biol. 2022. PMID: 35648784 Free PMC article.
-
TransComb: genome-guided transcriptome assembly via combing junctions in splicing graphs.Genome Biol. 2016 Oct 19;17(1):213. doi: 10.1186/s13059-016-1074-1. Genome Biol. 2016. PMID: 27760567 Free PMC article.
-
STAble: a novel approach to de novo assembly of RNA-seq data and its application in a metabolic model network based metatranscriptomic workflow.BMC Bioinformatics. 2018 Jul 9;19(Suppl 7):184. doi: 10.1186/s12859-018-2174-6. BMC Bioinformatics. 2018. PMID: 30066630 Free PMC article.
-
Protocol for transcriptome assembly by the TransBorrow algorithm.Biol Methods Protoc. 2023 Nov 1;8(1):bpad028. doi: 10.1093/biomethods/bpad028. eCollection 2023. Biol Methods Protoc. 2023. PMID: 38023349 Free PMC article. Review.
-
Mapping RNA-seq reads to transcriptomes efficiently based on learning to hash method.Comput Biol Med. 2020 Jan;116:103539. doi: 10.1016/j.compbiomed.2019.103539. Epub 2019 Nov 13. Comput Biol Med. 2020. PMID: 31765913 Review.
Cited by
-
FEAtl: a comprehensive web-based expression atlas for functional genomics in tropical and subtropical fruit crops.BMC Plant Biol. 2024 Sep 30;24(1):890. doi: 10.1186/s12870-024-05595-3. BMC Plant Biol. 2024. PMID: 39343895
-
ACSL4-mediated lipid rafts prevent membrane rupture and inhibit immunogenic cell death in melanoma.Cell Death Dis. 2024 Sep 29;15(9):695. doi: 10.1038/s41419-024-07098-3. Cell Death Dis. 2024. PMID: 39343834
-
ANAgdb: a multi-omics and taxonomy database for ANA-grade.BMC Plant Biol. 2024 Sep 28;24(1):882. doi: 10.1186/s12870-024-05613-4. BMC Plant Biol. 2024. PMID: 39342076 Free PMC article.
-
Chromosome-level genome assembly of the bay scallop Argopecten irradians.Sci Data. 2024 Sep 28;11(1):1057. doi: 10.1038/s41597-024-03904-x. Sci Data. 2024. PMID: 39341805 Free PMC article.
-
Whole-transcriptome analyses of ovine lung microvascular endothelial cells infected with bluetongue virus.Vet Res. 2024 Sep 27;55(1):122. doi: 10.1186/s13567-024-01372-0. Vet Res. 2024. PMID: 39334220 Free PMC article.
References
Publication types
MeSH terms
Substances
Associated data
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials