Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May;12(5):433-8.
doi: 10.1038/nmeth.3329. Epub 2015 Mar 23.

Identification of active transcriptional regulatory elements from GRO-seq data

Affiliations

Identification of active transcriptional regulatory elements from GRO-seq data

Charles G Danko et al. Nat Methods. 2015 May.

Abstract

Modifications to the global run-on and sequencing (GRO-seq) protocol that enrich for 5'-capped RNAs can be used to reveal active transcriptional regulatory elements (TREs) with high accuracy. Here, we introduce discriminative regulatory-element detection from GRO-seq (dREG), a sensitive machine learning method that uses support vector regression to identify active TREs from GRO-seq data without requiring cap-based enrichment (https://github.com/Danko-Lab/dREG/). This approach allows TREs to be assayed together with gene expression levels and other transcriptional features in a single experiment. Predicted TREs are more enriched for several marks of transcriptional activation—including expression quantitative trait loci, disease-associated polymorphisms, acetylated histone 3 lysine 27 (H3K27ac) and transcription factor binding—than those identified by alternative functional assays. Using dREG, we surveyed TREs in eight human cell types and provide new insights into global patterns of TRE function.

PubMed Disclaimer

Figures

Figure 1
Figure 1
dREG schematic and validation. (a) High PRO-seq signal intensity marks TREs (highlighted with pink background) and gene bodies (yellow background). dREG is a shape detector trained to recognize the characteristic pattern of TREs in PRO-seq data (#1). After training, dREG can be used to identify TREs using a new PRO-seq data set (red peaks) (#2). Browser shot compares dREG-predicted TREs to ChromHMM-predicted promoters (red), enhancers (yellow), and insulators (green) (#3). (b) Bar charts (left) represent the genome-wide sensitivity of dREG for various classes of TRE at a 5% (line) or 10% (bar) false discovery rate in K562 (pink) and GM12878 (blue) cells. Classes of regulatory elements represent GRO-cap transcribed DHS (Transcribed DHS), transcription start sites identified by CAGE (CAGE TSS), histone acetylation associated with DHS (Acetyl DHS), GRO-cap transcribed ChromHMM promoters (Promoters), GRO-cap transcribed chromHMM enhancers (Enhancers), GRO-cap TSS inside annotated Gene Bodies (Gene Body), and GRO-cap pairs (GRO-cap Pairs). Pie charts (right) represent the fraction of sites aligning within RefSeq transcription start sites (TSS), introns, or intergenic regions in each validation set.
Figure 2
Figure 2
Comparison of putative TREs detected using dREG, DNase-I, and ChromHMM. (a) Four-way Venn diagram depicting the relationships among separate genomic assays, which support the existence of four distinct classes of regulatory element. Numbers give the rounded overall fraction of TREs that fall into the specified intersection. TREs discovered using multiple assays were classified as +dREG, −dREG, Insulator, or as modified chromatin only (MCO). (b) Comparison of read-densities for H3K27ac (left) and H3K4me1 (right) in each class of functional element. (c) Distribution of MNase-seq reads in the +dREG (red), −dREG (black), and modified chromatin only classes (MCO; blue). (d) Histogram compares the number of transcription factors found in each of the four functional classes.
Figure 3
Figure 3
Sequence-specific transcription factors identified using dREG transcribed TREs. ROC plot shows the accuracy of predicting ELF1 binding to strong matches to the ELF1 consensus binding motif (sequence logo shown) using PIQ (black; AUC= 0.92), dREG (red; AUC= 0.88), the DNA sequence motif (gray; AUC= 0.67), or a joint logistic regression model considering all three variables (blue; AUC= 0.94). Motif matches that intersect ENCODE ChIP-seq peak calls were used as the set of true binding sites.
Figure 4
Figure 4
eQTL and GWAS SNP enrichments in the four classes of functional element. (a) The density of eQTL (n= 755) per site found in +dREG (further divided into promoters and enhancers using ChromHMM), −dREG, modified chromatin only (MCO), and Insulator classes. The asterisk indicates significantly lower eQTL densities than in dREG enhancers by a Fisher’s exact test (P<2×10−5). (b) The density of GWAS SNPs that correlate with cell-type specific phenotypes (autoimmune disorders) in GM12878, a B-cell line (blue), and primary CD4+ T-cells (red).

Similar articles

Cited by

References

    1. Calo E, Wysocka J. Modification of enhancer chromatin: what, how, and why? Mol. Cell. 2013;49:825–837. - PMC - PubMed
    1. Bernstein BE, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. - PMC - PubMed
    1. Giresi PG, Lieb JD. Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements) Methods. 2009;48:233–239. - PMC - PubMed
    1. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods. 2013;10:1213–1218. - PMC - PubMed
    1. Cusanovich DA, Pavlovic B, Pritchard JK, Gilad Y. The Functional Consequences of Variation in Transcription Factor Binding. PLoS Genet. 2014;10:e1004226. - PMC - PubMed

References (supplemental)

    1. Dimitriadou E, Hornik K, Leisch F, Meyer D, Weingessel A. e1071: Misc Functions of the Department of Statistics (e1071), TU Wien. 2010
    1. Chang C-C, Lin C-J. LIBSVM. ACM Trans. Intell. Syst. Technol. 2011;2:1–27.
    1. Hoffman MM, et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 2013;41:827–841. - PMC - PubMed
    1. Ernst J, Kellis M. Discovery and characterization of chromatin states for systematic annotation of the human genome. Nat. Biotechnol. 2010;28:817–825. - PMC - PubMed
    1. Neph S, et al. BEDOPS: high-performance genomic feature operations. Bioinformatics. 2012;28:1919–1920. - PMC - PubMed

Publication types

Associated data