Illumina Synthetic Long Read Sequencing Allows Recovery of Missing Sequences even in the "Finished" C. elegans Genome
- PMID: 26039588
- PMCID: PMC4650653
- DOI: 10.1038/srep10814
Illumina Synthetic Long Read Sequencing Allows Recovery of Missing Sequences even in the "Finished" C. elegans Genome
Abstract
Most next-generation sequencing platforms permit acquisition of high-throughput DNA sequences, but the relatively short read length limits their use in genome assembly or finishing. Illumina has recently released a technology called Synthetic Long-Read Sequencing that can produce reads of unusual length, i.e., predominately around 10 Kb. However, a systematic assessment of their use in genome finishing and assembly is still lacking. We evaluate the promise and deficiency of the long reads in these aspects using isogenic C. elegans genome with no gap. First, the reads are highly accurate and capable of recovering most types of repetitive sequences. However, the presence of tandem repetitive sequences prevents pre-assembly of long reads in the relevant genomic region. Second, the reads are able to reliably detect missing but not extra sequences in the C. elegans genome. Third, the reads of smaller size are more capable of recovering repetitive sequences than those of bigger size. Fourth, at least 40 Kbp missing genomic sequences are recovered in the C. elegans genome using the long reads. Finally, an N50 contig size of at least 86 Kbp can be achieved with 24 × reads but with substantial mis-assembly errors, highlighting a need for novel assembly algorithm for the long reads.
Figures
Similar articles
-
MinION-based long-read sequencing and assembly extends the Caenorhabditis elegans reference genome.Genome Res. 2018 Feb;28(2):266-274. doi: 10.1101/gr.221184.117. Epub 2017 Dec 22. Genome Res. 2018. PMID: 29273626 Free PMC article.
-
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9. BMC Genomics. 2023. PMID: 36927511 Free PMC article.
-
Pseudo-Sanger sequencing: massively parallel production of long and near error-free reads using NGS technology.BMC Genomics. 2013 Oct 17;14(1):711. doi: 10.1186/1471-2164-14-711. BMC Genomics. 2013. PMID: 24134808 Free PMC article.
-
Genome assembly using Nanopore-guided long and error-free DNA reads.BMC Genomics. 2015 Apr 20;16(1):327. doi: 10.1186/s12864-015-1519-z. BMC Genomics. 2015. PMID: 25927464 Free PMC article.
-
Retrieval of long DNA reads from herbarium specimens.AoB Plants. 2023 Nov 8;15(6):plad074. doi: 10.1093/aobpla/plad074. eCollection 2023 Dec. AoB Plants. 2023. PMID: 38130422 Free PMC article. Review.
Cited by
-
The Application of Metagenomics to Study Microbial Communities and Develop Desirable Traits in Fermented Foods.Foods. 2022 Oct 21;11(20):3297. doi: 10.3390/foods11203297. Foods. 2022. PMID: 37431045 Free PMC article. Review.
-
Genetic exchange with an outcrossing sister species causes severe genome-wide dysregulation in a selfing Caenorhabditis nematode.Genome Res. 2022 Nov-Dec;32(11-12):2015-2027. doi: 10.1101/gr.277205.122. Epub 2022 Nov 9. Genome Res. 2022. PMID: 36351773 Free PMC article.
-
Genomic architecture of 5S rDNA cluster and its variations within and between species.BMC Genomics. 2022 Mar 27;23(1):238. doi: 10.1186/s12864-022-08476-x. BMC Genomics. 2022. PMID: 35346033 Free PMC article.
-
G-quadruplexes in helminth parasites.Nucleic Acids Res. 2022 Mar 21;50(5):2719-2735. doi: 10.1093/nar/gkac129. Nucleic Acids Res. 2022. PMID: 35234933 Free PMC article.
-
Comprehensive Wet-Bench and Bioinformatics Workflow for Complex Microbiota Using Oxford Nanopore Technologies.mSystems. 2021 Aug 31;6(4):e0075021. doi: 10.1128/mSystems.00750-21. Epub 2021 Aug 24. mSystems. 2021. PMID: 34427527 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous