Abstract
We describe Trans-ABySS, a de novo short-read transcriptome assembly and analysis pipeline that addresses variation in local read densities by assembling read substrings with varying stringencies and then merging the resulting contigs before analysis. Analyzing 7.4 gigabases of 50-base-pair paired-end Illumina reads from an adult mouse liver poly(A) RNA library, we identified known, new and alternative structures in expressed transcripts, and achieved high sensitivity and specificity relative to reference-based assembly methods.
Similar content being viewed by others
References
Pepke, S., Wold, B. & Mortazavi, A. Nat. Methods 6, S22–S32 (2009).
Griffith, M. et al. Nat. Methods 7, 843–847 (2010).
Ameur, A. et al. Genome Biol. 11, R34 (2010).
Au, K.F. et al. Nucleic Acids Res. 38, 4570–4578 (2010).
De Bona, F. et al. Bioinformatics 24, i174–i180 (2008).
Trapnell, C., Pachter, L. & Salzberg, S.L. Bioinformatics 25, 1105–1111 (2009).
Wu, T.D. & Nacu, S. Bioinformatics 26, 873–881 (2010).
Guttman, M. et al. Nat. Biotechnol. 28, 503–510 (2010).
Trapnell, C. et al. Nat. Biotechnol. 28, 511–515 (2010).
Li, B. et al. Bioinformatics 26, 493–500 (2010).
Li, J., Jiang, H. & Wong, W.H. Genome Biol. 11, R50 (2010).
Krawitz, P. et al. Bioinformatics 26, 722–729 (2010).
Cartwright, R.A. Mol. Biol. Evol. 26, 473–480 (2009).
Degner, J.F. et al. Bioinformatics 25, 3207–3212 (2009).
Birzele, F. et al. Nucleic Acids Res. 38, 3999–4010 (2010).
Simpson, J.T. et al. Genome Res. 19, 1117–1123 (2009).
Flicek, P. & Birney, E. Nat. Methods 6 (Suppl.), S6–S12 (2009).
Birol, I. et al. Bioinformatics 25, 2872–2877 (2009).
Slater, G.S. & Birney, E. BMC Bioinformatics 6, 31 (2005).
Li, H. & Durbin, R. Bioinformatics 25, 1754–1760 (2009).
Hubbard, T.J. et al. Nucleic Acids Res. 37, D690–D697 (2009).
Kent, W.J. Genome Res. 12, 656–664 (2002).
Hsu, F. et al. Bioinformatics 22, 1036–1046 (2006).
Pruitt, K.D., Tatusova, T. & Maglott, D.R. Nucleic Acids Res. 35, D61–D65 (2007).
Thierry-Mieg, D. & Thierry-Mieg, J. Genome Biol. 7 (Suppl.), 11–14 (2006).
Melamud, E. & Moult, J. Nucleic Acids Res. 37, 4873–4886 (2009).
Nagalakshmi, U. et al. Science 320, 1344–1349 (2008).
Jackman, S.D. & Birol, I. Genome Biol. 11, 202 (2010).
Sheth, N. et al. Nucleic Acids Res. 34, 3955–3967 (2006).
Rhead, B. et al. Nucleic Acids Res. 38 Database issue, D613–D619 (2010).
Koscielny, G. et al. Genomics 93, 213–220 (2009).
Trapnell, C. & Salzberg, S.L. Nat. Biotechnol. 27, 455–457 (2009).
Acknowledgements
Funding for this work was provided in part by Genome Canada, Genome British Columbia, Michael Smith Foundation for Health Research and the Canadian Institute of Health Research (CIHR), including the CIHR Bioinformatics Training Program for Health Research. We thank S. Morrissy and G. Taylor for insightful discussions, A. He for technical assistance, A. Fejes for assistance with coverage bias calculations, and A. Tuin and N. Watkins (DNA Software) for assistance with primer design.
Author information
Authors and Affiliations
Contributions
G.R. and J.S. wrote the paper. J.S., G.R. and K.M. reviewed predictions and recommended analysis methods. G.R. coordinated analysis and validation. B.K., A.-L.P. and A.T. constructed libraries under the supervision of YJ.Z. S.L. generated biological material and performed RT-PCR validation. R.A.M. supervised sequencing activities. Y.S.B., T.C., R. Corbett, R. Chiu, M.F., M.G., J.Q.Q., R.N., H.M.O., N.T., R.V., S.K.C. and R.S. developed analysis methods and code and performed analyses. R. Corbett and R. Chiu performed comparisons with reference-based methods. S.D.J. develops and maintains ABySS and generated the ABySS assemblies. A.R. contributed algorithms and code for ABySS. M.A.M., S.J.M.J. and P.A.H. directed research. S.J.M.J. suggested analysis methods. YJ.Z. and M.H. developed the WTSS protocol. J.S. supervised activities. P.A.H. supervised validation. I.B. developed ABySS and Trans-ABySS and directed bioinformatics work.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–21, Supplementary Tables 1–4, Supplementary Note (PDF 2262 kb)
Rights and permissions
About this article
Cite this article
Robertson, G., Schein, J., Chiu, R. et al. De novo assembly and analysis of RNA-seq data. Nat Methods 7, 909–912 (2010). https://doi.org/10.1038/nmeth.1517
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.1517
- Springer Nature America, Inc.