De novo assembly of human genomes with massively parallel short read sequencing
- PMID: 20019144
- PMCID: PMC2813482
- DOI: 10.1101/gr.097261.109
De novo assembly of human genomes with massively parallel short read sequencing
Abstract
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.
Figures
Similar articles
-
State of the art de novo assembly of human genomes from massively parallel sequencing data.Hum Genomics. 2010 Apr;4(4):271-7. doi: 10.1186/1479-7364-4-4-271. Hum Genomics. 2010. PMID: 20511140 Free PMC article. Review.
-
Long-read sequencing and de novo assembly of a Chinese genome.Nat Commun. 2016 Jun 30;7:12065. doi: 10.1038/ncomms12065. Nat Commun. 2016. PMID: 27356984 Free PMC article.
-
Fine de novo sequencing of a fungal genome using only SOLiD short read data: verification on Aspergillus oryzae RIB40.PLoS One. 2013 May 7;8(5):e63673. doi: 10.1371/journal.pone.0063673. Print 2013. PLoS One. 2013. PMID: 23667655 Free PMC article.
-
Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly.Nat Biotechnol. 2011 Jul 24;29(8):723-30. doi: 10.1038/nbt.1904. Nat Biotechnol. 2011. PMID: 21785424
-
Genome structural variation discovery and genotyping.Nat Rev Genet. 2011 May;12(5):363-76. doi: 10.1038/nrg2958. Epub 2011 Mar 1. Nat Rev Genet. 2011. PMID: 21358748 Free PMC article. Review.
Cited by
-
Genomic data provides insights into the evolutionary history and adaptive differentiation of two tetraploid strawberries.Hortic Res. 2024 Jul 11;11(9):uhae194. doi: 10.1093/hr/uhae194. eCollection 2024 Sep. Hortic Res. 2024. PMID: 39257537 Free PMC article.
-
Whole-genome de novo sequencing reveals genomic variants associated with differences of sex development in SRY negative pigs.Biol Sex Differ. 2024 Sep 2;15(1):68. doi: 10.1186/s13293-024-00644-w. Biol Sex Differ. 2024. PMID: 39223676 Free PMC article.
-
Differential Strategies of Ectomycorrhizal Development between Suillus luteus and Pinus massoniana in Response to Nutrient Changes.J Fungi (Basel). 2024 Aug 19;10(8):587. doi: 10.3390/jof10080587. J Fungi (Basel). 2024. PMID: 39194913 Free PMC article.
-
Retrospective analysis of molecular characteristics, risk factors, and outcomes in carbapenem-resistant Klebsiella pneumoniae bloodstream infections.BMC Microbiol. 2024 Aug 22;24(1):309. doi: 10.1186/s12866-024-03465-4. BMC Microbiol. 2024. PMID: 39174950 Free PMC article.
-
GCphase: an SNP phasing method using a graph partition and error correction algorithm.BMC Bioinformatics. 2024 Aug 19;25(1):267. doi: 10.1186/s12859-024-05901-8. BMC Bioinformatics. 2024. PMID: 39160480 Free PMC article.
References
-
- Bentley DR. Whole-genome re-sequencing. Curr Opin Genet Dev. 2006;16:545–552. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases