Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Aug 2;488(7409):116-20.
doi: 10.1038/nature11243.

A map of the cis-regulatory sequences in the mouse genome

Affiliations

A map of the cis-regulatory sequences in the mouse genome

Yin Shen et al. Nature. .

Abstract

The laboratory mouse is the most widely used mammalian model organism in biomedical research. The 2.6 × 10(9) bases of the mouse genome possess a high degree of conservation with the human genome, so a thorough annotation of the mouse genome will be of significant value to understanding the function of the human genome. So far, most of the functional sequences in the mouse genome have yet to be found, and the cis-regulatory sequences in particular are still poorly annotated. Comparative genomics has been a powerful tool for the discovery of these sequences, but on its own it cannot resolve their temporal and spatial functions. Recently, ChIP-Seq has been developed to identify cis-regulatory elements in the genomes of several organisms including humans, Drosophila melanogaster and Caenorhabditis elegans. Here we apply the same experimental approach to a diverse set of 19 tissues and cell types in the mouse to produce a map of nearly 300,000 murine cis-regulatory sequences. The annotated sequences add up to 11% of the mouse genome, and include more than 70% of conserved non-coding sequences. We define tissue-specific enhancers and identify potential transcription factors regulating gene expression in each tissue or cell type. Finally, we show that much of the mouse genome is organized into domains of coordinately regulated enhancers and promoters. Our results provide a resource for the annotation of functional elements in the mammalian genome and for the study of mechanisms regulating tissue-specific gene expression.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests. Readers are welcome to comment on the online version of this article at www.nature.com/nature.

Figures

Figure 1
Figure 1. Identification of cis-regulatory elements in the mouse genome
a, UCSC genome browser views of ChIP-Seq and RNA-Seq data for mESC, heart and liver (chromosome 4). The values on the y axis for ChIP-Seq data are input normalized intensities. kb, kilobases. b, An overview of the predicted regulatory elements in the 19 tissue and cell types. E14.5, embryonic day 14.5; MEF, murine embryonic fibroblast. c, Percentages of known cis-regulatory elements recovered in this study.
Figure 2
Figure 2. Evolutionary conservation of the identified cis-regulatory elements
a, Evolutionary conservation of cis-regulatory elements, in comparison with exons and random genomic sequences. Asterisk, P < 0.001, Fisher's exact test. b, UCSC genome browser views of chromatin state and CTCF-binding sites at Sox2 loci for mESCs and human ESCs (hESCs) on chromosome 3. DNA sequences, chromatin states and CTCF binding are all conserved in this region. c, Number of hESC regulatory elements that are conserved and predicted as regulatory elements in mESCs. d, Number of mESC regulatory elements that are conserved and predicted as regulatory elements in hESCs. e, Functional annotation of the conserved non-coding sequences based on the cis-regulatory elements identified in this study. The asterisk in c, d and e indicates CTCF-binding sites that do not overlap with either promoters or enhancers.
Figure 3
Figure 3. Genomic organization of co-regulated promoters and enhancers
a, Tissue specificity of the usages of promoters (H3K4me3 and polII), enhancers (H3K4me1) and CTCF-binding sites. b, Distribution of the Spearman correlation coefficient of H3K4me1 at enhancers and polII at promoters of random permutation, the nearest TSS model, and the CTCF block model. c, Enhancers and promoters form co-regulated clusters of different sizes, as shown by the Spearman correlation coefficient of H3K4me1 at enhancers and polII at promoters on chromosome 19. d, Hi-C interaction heatmap showing that the physical partitioning of the genome is highly correlated with the EPUs that encompass Pcdhα, Pcdhβ and Pcdhγ gene clusters on chromosome 18. Top: normalized Hi-C interaction frequencies in mouse cortex as a two-dimensional heatmap. Bottom: UCSC genome browser views of the same regions, including the identified EPUs and the ChIP-Seq data (H3K27ac, H3K4me1, H3K4me3, polII and CTCF) in cortex. e, The average normalized Hi-C interaction frequencies for enhancer–promoter pairs within EPUs, between EPUs, and expected by random chance.
Figure 4
Figure 4. Motif analysis of tissue-specific enhancers
a, Classification of development stage-specific enhancers based on their chromatin state (H3K4me1) between embryonic (embryonic day 14.5; E14.5) and adult brain. b and c, Gene Ontology analysis for the genes associated with embryonic brain-specific enhancers and adult cortex-specific enhancers. d, Classification of tissue-specific enhancers on the basis of their chromatin state (H3K4me1) among different tissue and cell types. The first 19 tissue-specific clusters were used for further motif analysis. The last cluster contains enhancers enriched in multiple tissues with no clear patterns. e, Enrichment of three transcription factor recognition motifs in the predicted enhancers. REST, RE1-silencing transcription factor. f, Heatmap showing the clustering of 270 transcription factor motifs on the basis of their enrichment in the various groups of enhancers as identified in e.g, Boxplot showing that the de novo motifs found in tissue-specific enhancers are evolutionarily conserved. hk, Examples of motifs that show high sequence conservation: h, REST motif in cortex-specific enhancers; i, Hnf1 motif in kidney-specific enhancers; j, Oct4 motif in mESC-specific enhancers; k, Atoh1 motif in cerebellum-specific enhancers.

Similar articles

  • A comparative encyclopedia of DNA elements in the mouse genome.
    Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD, Shen Y, Pervouchine DD, Djebali S, Thurman RE, Kaul R, Rynes E, Kirilusha A, Marinov GK, Williams BA, Trout D, Amrhein H, Fisher-Aylor K, Antoshechkin I, DeSalvo G, See LH, Fastuca M, Drenkow J, Zaleski C, Dobin A, Prieto P, Lagarde J, Bussotti G, Tanzer A, Denas O, Li K, Bender MA, Zhang M, Byron R, Groudine MT, McCleary D, Pham L, Ye Z, Kuan S, Edsall L, Wu YC, Rasmussen MD, Bansal MS, Kellis M, Keller CA, Morrissey CS, Mishra T, Jain D, Dogan N, Harris RS, Cayting P, Kawli T, Boyle AP, Euskirchen G, Kundaje A, Lin S, Lin Y, Jansen C, Malladi VS, Cline MS, Erickson DT, Kirkup VM, Learned K, Sloan CA, Rosenbloom KR, Lacerda de Sousa B, Beal K, Pignatelli M, Flicek P, Lian J, Kahveci T, Lee D, Kent WJ, Ramalho Santos M, Herrero J, Notredame C, Johnson A, Vong S, Lee K, Bates D, Neri F, Diegel M, Canfield T, Sabo PJ, Wilken MS, Reh TA, Giste E, Shafer A, Kutyavin T, Haugen E, Dunn D, Reynolds AP, Neph S, Humbert R, Hansen RS, De Bruijn M, Selleri L, Rudensky A, Josefowicz S, Samstein R, Eichler EE, Orkin SH, Levasseur D, Papayannopoulou T, Chang KH, Skoultchi A, Gosh S, Disteche C, Treuting P,… See abstract for full author list ➔ Yue F, et al. Nature. 2014 Nov 20;515(7527):355-64. doi: 10.1038/nature13992. Nature. 2014. PMID: 25409824 Free PMC article.
  • A map of cis-regulatory elements and 3D genome structures in zebrafish.
    Yang H, Luan Y, Liu T, Lee HJ, Fang L, Wang Y, Wang X, Zhang B, Jin Q, Ang KC, Xing X, Wang J, Xu J, Song F, Sriranga I, Khunsriraksakul C, Salameh T, Li D, Choudhary MNK, Topczewski J, Wang K, Gerhard GS, Hardison RC, Wang T, Cheng KC, Yue F. Yang H, et al. Nature. 2020 Dec;588(7837):337-343. doi: 10.1038/s41586-020-2962-9. Epub 2020 Nov 25. Nature. 2020. PMID: 33239788 Free PMC article.
  • A cis-regulatory map of the Drosophila genome.
    Nègre N, Brown CD, Ma L, Bristow CA, Miller SW, Wagner U, Kheradpour P, Eaton ML, Loriaux P, Sealfon R, Li Z, Ishii H, Spokony RF, Chen J, Hwang L, Cheng C, Auburn RP, Davis MB, Domanus M, Shah PK, Morrison CA, Zieba J, Suchy S, Senderowicz L, Victorsen A, Bild NA, Grundstad AJ, Hanley D, MacAlpine DM, Mannervik M, Venken K, Bellen H, White R, Gerstein M, Russell S, Grossman RL, Ren B, Posakony JW, Kellis M, White KP. Nègre N, et al. Nature. 2011 Mar 24;471(7339):527-31. doi: 10.1038/nature09990. Nature. 2011. PMID: 21430782 Free PMC article.
  • Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence.
    Wittkopp PJ, Kalay G. Wittkopp PJ, et al. Nat Rev Genet. 2011 Dec 6;13(1):59-69. doi: 10.1038/nrg3095. Nat Rev Genet. 2011. PMID: 22143240 Review.
  • Beyond the ENCODE project: using genomics and epigenomics strategies to study enhancer evolution.
    Sakabe NJ, Nobrega MA. Sakabe NJ, et al. Philos Trans R Soc Lond B Biol Sci. 2013 Nov 11;368(1632):20130022. doi: 10.1098/rstb.2013.0022. Print 2013 Dec 19. Philos Trans R Soc Lond B Biol Sci. 2013. PMID: 24218635 Free PMC article. Review.

Cited by

References

    1. Waterston RH, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. - PubMed
    1. Visel A, Rubin EM, Pennacchio LA. Genomic views of distant-acting enhancers. Nature. 2009;461:199–205. - PMC - PubMed
    1. The ENCODE Project Consortium. A user's guide to the encyclopedia of DNA elements (ENCODE) PLoS Biol. 2011:9–e1001046. - PMC - PubMed
    1. Gerstein MB, et al. Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project. Science. 2010;330:1775–1787. - PMC - PubMed
    1. Roy S, et al. Identification of functional elements and regulatory circuits by Drosophila modENCODE. Science. 2010;330:1787–1797. - PMC - PubMed

Publication types

MeSH terms

Associated data