Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Dec;1(6):e78.
doi: 10.1371/journal.pgen.0010078. Epub 2005 Dec 16.

Genome-wide associations of gene expression variation in humans

Affiliations

Genome-wide associations of gene expression variation in humans

Barbara E Stranger et al. PLoS Genet. 2005 Dec.

Abstract

The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs) with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis-) to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

PubMed Disclaimer

Conflict of interest statement

Competing interests. The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. QQ Plot of cis versus trans HSA20 −log10 p-Values
The figure shows the contrast of −log10 p-values deriving from associations of SNPs and genes within the 10-Mb region of HSA20 with −log10 p-values deriving from associations between genes on the 10-Mb region HSA20 with SNPs in one of ten ENCODE regions. Note that the distribution falls off the diagonal around −log10 p = 4, which we consider the borderline for the high enrichment of cis significant effects. A similar pattern is observed with any set of trans −log10 p-values on HSA20 or any other cis vs. trans contrast in any region we tested.
Figure 2
Figure 2. Cis- Signals of SNP−Gene Associations in the Human Genome
(A) The relationship between statistical significance and distance from gene. Each data point represents the maximum −log10 p for a single gene and SNPs located cis- to its coding locus. The −log10 p-values from the additive model are plotted as a function of distance between the center of the genomic span of the gene and cis- located SNPs (cis- < 4 Mb). Only those gene-SNP associations that have −log10 p > 4 are shown. SNPs are from the 5-kb HapMap. This plot includes data for 101 genes (129 probes). (B) Cis- SNPs with −log10 p ≥ 4 from the 688 probes analyzed are plotted against their chromosomal location on NCBI34 coordinates of the human genome.
Figure 3
Figure 3. Examples of cis- Associations from the Genome-Wide and High-Density SNP Maps
(A) Genomic location of associated SNPs close to the SERPINB10 gene. Custom tracks in the UCSC genome browser (http://genome.ucsc.edu) show the location of the Illumina probe and proximal SNPs in the context of genome annotation. The lower horizontal black line indicates the −log10 p threshold where the corresponding q-value is 0.05 (i.e., any SNPs with values −log10 p that meet or exceed this threshold are significant at the q = 0.05 level), and the upper line is the Bonferroni genome-wide threshold. Additional tracks describe known genes, first-exon and promoter predictions, conserved transcription factor binding sites, Gencode genes, RNA polymerase 2, and Transcription factor 2 binding sites, identified by Affymetrix ChIP/chip experiments, and Sp1 and Sp3 binding sites identified by Stanford's ChIP/chip experiments. Consensus conserved elements are shown in the final track. HapMap LD information below is for the CEU individuals and suggests that there are two conserved haplotype clusters in this region.
Figure 3
Figure 3. Continued
(B) Genomic location of associated SNPs close to the TMEM8 and MRPL28 genes. Note the correlation between the p-values for the two genes. Custom tracks in the UCSC genome browser show the location of the Illumina probe and proximal SNPs in the context of genome annotation. The lower horizontal black line indicates the −log10 p threshold where the corresponding q-value is 0.05 (i.e., any SNPs with values −log10 p that meet or exceed this threshold are significant at the q = 0.05 level), and the upper line is the Bonferroni genome-wide threshold. Additional tracks describe known genes, first-exon and promoter predictions, conserved transcription factor binding sites, Gencode genes, RNA polymerase 2, and Transcription factor 2 binding sites, identified by Affymetrix ChIP/chip experiments, and Sp1 and Sp3 binding sites identified by Stanford's ChIP/chip experiments. Consensus conserved elements are shown in the final track. HapMap LD information below is for the CEU individuals (http://genome.ucsc.edu) [39].

Similar articles

Cited by

References

    1. Stranger BE, Dermitzakis ET. The genetics of regulatory variation in the human genome. Hum Genomics. 2005;2:126–131. - PMC - PubMed
    1. Storey JD, Akey JM, Kruglyak L. Multiple locus linkage analysis of genome-wide expression in yeast. PLoS Biol. 2005;3:e267. DOI: 10.1371/journal.pbio.0030267. - DOI - PMC - PubMed
    1. Brem RB, Kruglyak L. The landscape of genetic complexity across 5,700 gene expression traits in yeast. Proc Natl Acad Sci U S A. 2005;102:1572–1577. - PMC - PubMed
    1. Brem RB, Yvert G, Clinton R, Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. - PubMed
    1. Yvert G, Brem RB, Whittle J, Akey JM, Foss E, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet. 2003;35:57–64. - PubMed

Publication types

Associated data