Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;8(12):e1002821.
doi: 10.1371/journal.pcbi.1002821. Epub 2012 Dec 27.

Chapter 6: Structural variation and medical genomics

Affiliations

Chapter 6: Structural variation and medical genomics

Benjamin J Raphael. PLoS Comput Biol. 2012.

Abstract

Differences between individual human genomes, or between human and cancer genomes, range in scale from single nucleotide variants (SNVs) through intermediate and large-scale duplications, deletions, and rearrangements of genomic segments. The latter class, called structural variants (SVs), have received considerable attention in the past several years as they are a previously under appreciated source of variation in human genomes. Much of this recent attention is the result of the availability of higher-resolution technologies for measuring these variants, including both microarray-based techniques, and more recently, high-throughput DNA sequencing. We describe the genomic technologies and computational techniques currently used to measure SVs, focusing on applications in human and cancer genomics.

PubMed Disclaimer

Conflict of interest statement

The author has declared that no competing interests exist.

Figures

Figure 1
Figure 1. An inversion resulting from non-allelic homologous recombination (NAHR) between two nearly identical segmental duplications (blue boxes) with opposite orientations (arrows).
The inversion flips the orientation of the subsequence, or block, formula image in one genome relative to the other genome.
Figure 2
Figure 2. Two major approaches to detect structural variants in an individual genome from next-generation sequencing data are de novo assembly and resequencing.
In de novo assembly, the individual genome sequence is constructed by examining overlaps between reads. In resequencing approaches, reads from the individual genome are aligned to a closely related reference genome. Examination of the resulting alignments reveals differences between the individual genome and the reference genome.
Figure 3
Figure 3. A strobe with 3 subreads.
Figure 4
Figure 4. Identification of a deletion in an individual genome by split read analysis (middle), and by depth of coverage analysis (bottom).
Figure 5
Figure 5. Paired end mapping (PEM).
Fragments from an individual genome are sequenced from both ends and the resulting paired reads are aligned to a reference genome. Most paired reads correspond to concordant pairs, where the distance between the alignment of each read agrees with the distribution of fragment lengths (right). The remaining discordant pairs suggest structural variants (here a deletion) that distinguish the individual and reference genomes.
Figure 6
Figure 6
(Top) A discordant pair (arc) indicates a deletion with unknown breakpoints formula image and formula image located in orange blocks. Positions formula image, formula image and the minimum formula image and maximum formula image length of end-sequenced fragments constrain breakpoints formula image to lie within the indicated orange blocks, and are governed by the indicated linear inequalities. (Bottom) A polygon in 2D genome space expresses the linear dependency between breakpoints formula image and formula image and records the uncertainty in the location of the breakpoints.
Figure 7
Figure 7. Mutation, selection, and clonal expansion in tumor development leads to genomic heterogeneity between cells in a tumor.
Current DNA sequencing approaches sequence DNA from many cells and thus result in a heterogenous mixture of mutations, with varying numbers of both passenger mutations (black) and driver mutations (red).

Similar articles

Cited by

References

    1. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753. - PMC - PubMed
    1. Stratton MR (2011) Exploring the genomes of cancer cells: progress and promise. Science 331: 1553–1558. - PubMed
    1. Frazer K, Ballinger D, Cox D, Hinds D, Stuve L, et al. (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449: 851–861. - PMC - PubMed
    1. Sharp AJ, Cheng Z, Eichler EE (2006) Structural variation of the human genome. Annu Rev Genomics Hum Genet 7: 407–442. - PubMed
    1. Iafrate A, Feuk L, Rivera M, Listewnik M, Donahoe P, et al. (2004) Detection of large-scale variation in the human genome. Nat Genet 36: 949–951. - PubMed

Publication types