Genome assembly comparison identifies structural variants in the human genome
- PMID: 17115057
- PMCID: PMC2674632
- DOI: 10.1038/ng1921
Abstract
Numerous types of DNA variation exist, ranging from SNPs to larger structural alterations such as copy number variants (CNVs) and inversions. Alignment of DNA sequence from different sources has been used to identify SNPs and intermediate-sized variants (ISVs). However, only a small proportion of total heterogeneity is characterized, and little is known of the characteristics of most smaller-sized (<50 kb) variants. Here we show that genome assembly comparison is a robust approach for identification of all classes of genetic variation. Through comparison of two human assemblies (Celera's R27c compilation and the Build 35 reference sequence), we identified megabases of sequence (in the form of 13,534 putative non-SNP events) that were absent, inverted or polymorphic in one assembly. Database comparison and laboratory experimentation further demonstrated overlap or validation for 240 variable regions and confirmed >1.5 million SNPs. Some differences were simple insertions and deletions, but in regions containing CNVs, segmental duplication and repetitive DNA, they were more complex. Our results uncover substantial undescribed variation in humans, highlighting the need for comprehensive annotation strategies to fully interpret genome scanning and personalized sequencing projects.
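The idea of comparing two assemblies can be illustrated with a toy sketch. It is not the authors' pipeline, and it assumes the two assemblies have already been aligned into ordered blocks. The `Block` record, the `classify_differences` function, and the event labels are hypothetical names introduced here for illustration only. Gaps that appear on one assembly but not the other are reported as sequence absent from the other assembly, and reverse-strand blocks are reported as inversion candidates.

```python
from typing import NamedTuple


class Block(NamedTuple):
    """One aligned segment shared by assembly A and assembly B (hypothetical record)."""
    a_start: int  # half-open coordinates on assembly A
    a_end: int
    b_start: int  # half-open coordinates on assembly B
    b_end: int
    strand: str   # '+' if collinear, '-' if B aligns in reverse orientation


def classify_differences(blocks, min_size=1):
    """Return (kind, position_on_A, size) tuples, sorted by position on A."""
    ordered = sorted(blocks, key=lambda b: b.a_start)
    events = []

    # A block aligned in reverse orientation is a candidate inversion.
    for blk in ordered:
        if blk.strand == "-":
            events.append(("inversion", blk.a_start, blk.a_end - blk.a_start))

    # Compare the unaligned gap between consecutive forward-strand blocks.
    for prev, nxt in zip(ordered, ordered[1:]):
        if prev.strand != "+" or nxt.strand != "+":
            continue
        gap_a = nxt.a_start - prev.a_end
        gap_b = nxt.b_start - prev.b_end
        if gap_a >= min_size and gap_b <= 0:
            events.append(("absent_in_B", prev.a_end, gap_a))
        elif gap_b >= min_size and gap_a <= 0:
            events.append(("absent_in_A", prev.a_end, gap_b))
        elif gap_a >= min_size and gap_b >= min_size:
            # Both assemblies carry unaligned sequence: a more complex difference.
            events.append(("divergent", prev.a_end, max(gap_a, gap_b)))

    return sorted(events, key=lambda e: e[1])


# Made-up coordinates, not data from the study.
example_blocks = [
    Block(0, 1000, 0, 1000, "+"),
    Block(1500, 3000, 1000, 2500, "+"),   # 500 bp present only in A
    Block(3000, 4000, 2900, 3900, "+"),   # 400 bp present only in B
    Block(4000, 5000, 3900, 4900, "-"),   # reverse-oriented segment
    Block(5000, 6000, 4900, 5900, "+"),
]

for event in classify_differences(example_blocks):
    print(event)
```

A real comparison would also need to handle repeats, segmental duplications, and assembly gaps, which the abstract notes give rise to more complex differences than simple insertions and deletions.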