Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Sep 26;501(7468):506-11.
doi: 10.1038/nature12531. Epub 2013 Sep 15.

Transcriptome and genome sequencing uncovers functional variation in humans

Collaborators, Affiliations

Transcriptome and genome sequencing uncovers functional variation in humans

Tuuli Lappalainen et al. Nature. .

Abstract

Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project--the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Transcriptome variation
a) Spearman rank correlation of replicate samples, based on mRNA exon and miRNA quantifications of 5 individuals sequenced 8 and 7 times for mRNA and miRNA, respectively, and separated by the individual or the sequencing lab being the same or different. The quantifications have been normalized only for the total number of mapped reads (see Fig. S11 for correlations after normalization). b) The proportion of expression level variation (as opposed to splicing) of the total transcription variation between individuals in each population, measured per gene. c) Proportion of genes with differential expression levels and/or transcript usage between population pairs, out of the total listed on the right-hand side. d) Network of significant miRNA families (P<0.001; yellow) and their significantly associated mRNA targets (P<0.05; purple). The edges display negative (green) and positive (red) associations.
Figure 2
Figure 2. Transcriptome QTLs
a) Enrichment of EUR exon eQTLs in functional annotations for the 1st, 2nd, 5th and 10th best associating eQTL variant per gene, relative to a matched null set of variants denoted by the horizontal line. The numbers are −log10 p-values of a Fisher test between the best eQTL and the null. b) Classification of changes caused by transcript ratio QTLs. c) The rank of the best Omni2.5M SNP among the significant EUR eQTL variants per gene. d) DGKD gene locus where an intronic SNP rs838705 is associated to calcium levels (red), and the top eQTL variant 21 kb downstream (blue) is a very likely causal variant, close the TSS of two transcripts in the MEF2A,C binding region.
Figure 3
Figure 3. Allele-specific effects on expression and transcript structure
a) Sharing of allele-specific expression (ASE) and transcript structure (ASTS) signals: the distribution of ASTS p-value of the sites with significant (p<0.005) ASE in the same individual, and vice versa. The ASE p-values are calculated from sites sampled to exactly 30 reads. The numbers denote the pi1 statistic measuring the enrichment of low p-values. b) Frequency of significant ASE event in the population (x-axis) and their effect size (|0.5 – REF/TOTAL|), calculated per ASE SNP. Only ASE SNPs with >=20 heterozygote individuals with >=30 reads were included, and the estimates were corrected for coverage bias and false positives by sampling and permutations. c) Enrichment of variants in regulatory annotations relative to a matched null distribution for the most significant eQTL variants, and for the subset of these that are also rSNPs. Categories with highest amount of data are shown (see Fig. S36 for all categories, see also Fig. 2a).
Figure 4
Figure 4. Transcriptome effects of loss-of-function variants
A) Nonsense-mediated decay due to premature stop codon variants was measured using allele-specific expression. The distribution of non-reference allele ratios (on the y-axis) for premature stop variants sorted on the x-axis according to derived allele frequency, split to sites predicted to trigger and escape NMD. The dots denote the median across individuals, and the vertical lines show the range of ratios for variants carried by several individuals. The grey vertical lines denote derived allele frequencies of 0, 0.001 and 0.01. B) Exon inclusion scores for variable exons for individuals that carry 0, 1 or 2 copies of variants that destroy a splice motif, with p-value from Mann-Whitney test.

Similar articles

Cited by

References

    1. Abecasis GR, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi:nature11632 [pii] 10.1038/nature11632. - PMC - PubMed
    1. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661–678. doi:nature05911 [pii] 10.1038/nature05911. - PMC - PubMed
    1. Dunham I, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi:nature11247 [pii] 10.1038/nature11247. - PMC - PubMed
    1. Emilsson V, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–428. doi:nature06758 [pii] 10.1038/nature06758. - PubMed
    1. Stranger BE, et al. Population genomics of human gene expression. Nat Genet. 2007;39:1217–1224. doi:ng2142 [pii] 10.1038/ng2142. - PMC - PubMed

Publication types