Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2018 Sep 18;46(19):9895–9906. doi: 10.1093/nar/gky763

Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations

Peter R Cook 1, Davide Marenduzzo 2,
PMCID: PMC6212781  PMID: 30239812

Abstract

Current models for the folding of the human genome see a hierarchy stretching down from chromosome territories, through A/B compartments and topologically-associating domains (TADs), to contact domains stabilized by cohesin and CTCF. However, molecular mechanisms underlying this folding, and the way folding affects transcriptional activity, remain obscure. Here we review physical principles driving proteins bound to long polymers into clusters surrounded by loops, and present a parsimonious yet comprehensive model for the way the organization determines function. We argue that clusters of active RNA polymerases and their transcription factors are major architectural features; then, contact domains, TADs and compartments just reflect one or more loops and clusters. We suggest tethering a gene close to a cluster containing appropriate factors—a transcription factory—increases the firing frequency, and offer solutions to many current puzzles concerning the actions of enhancers, super-enhancers, boundaries and eQTLs (expression quantitative trait loci). As a result, the activity of any gene is directly influenced by the activity of other transcription units around it in 3D space, and this is supported by Brownian-dynamics simulations of transcription factors binding to cognate sites on long polymers.

INTRODUCTION

Current reviews of DNA folding in interphase human nuclei focus on levels in the hierarchy between looped nucleosomal fibers and chromosome territories (1,2). Hi-C—a high-throughput variant of chromosome conformation capture (3C)—provides much of our knowledge in this area. The first Hi-C maps had low resolution (∼1 Mb), and revealed plaid-like patterns of A (active) and B (inactive) compartments that often contact others of the same type (3). Higher-resolution (∼40 kb) uncovered topologically-associating domains (TADs); intra-TAD contacts were more frequent than inter-TAD ones (4,5). Still higher-resolution (∼1 kbp) gave contact loops delimited by cohesin and CTCF bound to cognate motifs in convergent orientations (6), as well as domains not associated with CTCF, called ‘ordinary’ or ‘compartmental’ domains (6,7). [Nomenclature can be confusing, as domains of different types are generally defined using different algorithms.]

Despite these advances, critical features of the organization remain obscure. For example, Hi-C still has insufficient resolution to detect many loops seen earlier (Supplementary Note 1). Moreover, most mouse domains defined using the Arrowhead algorithm persist when CTCF is degraded (8) (see also bioRxiv: https://doi.org/10.1101/118737). and many other organisms get by without the protein, (e.g. Caenorhabditis elegans (9), Neurospora (10), budding (11) and fission yeast (12), Arabidopsis thaliana (13), and Caulobacter crescentus (14)). Therefore, it seems likely that loops stabilized by CTCF are a recent arrival in evolutionary history.

The relationship between structure and function is also obscure (15). For example, cohesin—which is a member of a conserved family—plays an important structural role in stabilizing CTCF loops (Supplementary Note 2), but only a minor functional role in human gene regulation as its degradation affects levels of nascent messenger RNAs (mRNAs) encoded by only 64 genes (16). Widespread use of vague terms like ‘regulatory neighborhood’ and ‘context’ reflects this deficit in understanding. Here, we discuss physical principles constraining the system, and describe a parsimonious model where clusters of active RNA polymerases and its transcription factors are major structural organizers—with contact domains, TADs, and compartments just reflecting this underlying framework. This model naturally explains how genes are regulated, and provides solutions to many current puzzles.

SOME PHYSICAL PRINCIPLES

Chromatin mobility

Time-lapse imaging of a GFP-tagged gene in a living mammalian cell is consistent with it diffusing for ∼1 min through a ‘corral’ in chromatin, ‘jumping’ to a nearby corral the next and bouncing back to the original one (17). Consequently, a gene explores a volume with a diameter of ∼250 nm in a min, ∼750 nm in 1 h and ∼1.4 μm in 24 h (18); therefore, it inspects only part of one territory in ∼24 h, as a yeast gene—which diffuses as fast—ranges throughout its smaller nucleus.

Entropic forces

Monte Carlo simulations of polymers confined in a sphere uncovered several entropic effects depending solely on excluded volume (19,20). Flexible thin polymers (‘euchromatin’) spontaneously move to the interior, and stiff thick ones (‘heterochromatin’) to the periphery—as seen in human nuclei (Supplementary Figure S1Ai); ‘euchromatin’ loses more configurations (and so entropy) than ‘heterochromatin’ when squashed against the lamina, and so ends up internally. Stiff polymers also contact each other more than flexible ones; this favors phase separation and formation of distinct A and B compartments. Additionally, linear polymers intermingle, but looped ones segregate into discrete territories (Supplementary Figure S1Aii).

Ellipsoidal territories and trans contacts

Whether a typical human gene diffuses within its own territory and makes cis contacts (i.e. involving contacts with the same chromosome), or visits others to make trans ones depends significantly on territory shape. Children who buy MInline graphicMs and Smarties sense ellipsoids pack more tightly than spheres of similar volume; packed ellipsoids also touch more neighbours than spheres (Supplementary Figure S1B). As territories found in cells and simulations are ellipsoidal, and as much of the volume of ellipsoids is near the surface, genes should make many cis contacts plus some trans ones (Supplementary Figure S1).

Some processes driving looping

If human chromosomes were a polymer melt in a sphere, two loci 40 Mb distant on the genetic map would be ∼4 μm apart in 3D space and interact as infrequently as loci on different chromosomes. If the two were 10, 1 or 0.1 Mb apart, they would interact with probabilities of ∼2 × 10−5, ∼5 × 10−4 and ∼1.5 × 10−2, respectively (calculated using a 20 nm fiber, 50 bp/nm and a threshold of 50 nm for contact detection; see also (1)). Hi-C shows some contacts occur more frequently; this begs the question—what drives looping?

One process is the classical one involving promoter-enhancer contacts (21). We discuss later that contacting partners are often transcriptionally active. We also use the term ‘promoter’ to describe the 5′ end of both genic and non-genic units, and ‘factor’ to include both activators and repressors. Many factors (often bound to polymerases) can bind to DNA and each other (e.g. YY1 (22)). Binding to two cognate sites spaced 10 kb apart creates a high local concentration, and—when two bound factors collide—dimerization stabilizes a loop if entropic looping costs are not prohibitive (Figure 1A). Such loops persist as long as factors remain bound (typically ∼10 s).

Figure 1.

Figure 1.

Some drivers of looping. (A) Dimerizing factors (equilibrium constant ∼10−7 M). (i) If present at a typical concentration (∼1 nM), <1% factors dimerize. (ii) Binding to cognate sites 10 kbp apart on DNA increases local concentrations, and ∼67% are now dimers stabilizing loops (21). (B) The depletion attraction. (i) In crowded nuclei, small brown molecules (diameter <5 nm) bombard (grey arrows) larger red complexes (5–25 nm). If large complexes collide, smaller molecules are sterically excluded from the green volume between the two and cannot knock them apart; consequently, small molecules exert a force on opposite sides of larger complexes keeping them together. (ii) If large complexes are bound to DNA, this force stabilizes a loop. (C) Cohesin. After loading, a cohesin ring embraces two fibers to stabilize a mini loop; this loop enlarges as the ring uses an inbuilt motor to move down the fiber until stalled by CTCF bound to convergent sites.

Another mechanism—the ‘depletion attraction’—is non-specific. It originates from the increase in entropy of macromolecules in a crowded cell when large complexes come together (Figure 1Bi (23)). Modeling indicates this attraction can cluster bound polymerases and stabilize loops (Figure 1Bii) that persist for as long as polymerases remain bound (i.e. seconds to hours; below).

A third mechanism involves cohesin—a ring-like complex that clips on to a fiber like a carabiner on a climber’s rope. In Hi-C maps, many human domains are contained in loops apparently delimited by CTCF bound to cognate sites in convergent orientations (6). Such ‘contact loops’—many with contour lengths of >1 Mb—are thought to arise as follows. A cohesin ring binds at a ‘loading site’ to form a tiny loop, this loop enlarges as an in-built motor translocates the ring down the fiber, and enlargement ceases when CTCF bound to convergent sites blocks further extrusion (Figure 1C (24,25)). This is known as the ‘loop-extrusion model’. We note that other mechanisms could enlarge such loops (including one not involving a motor; Supplementary Note 2), and that loop extrusion (by whatever mechanism) and its blocking by convergent CTCF sites can be readily incorporated into the model that follows.

A transcription-factor model

We now review results of simulations involving what we will call the ‘transcription-factor model’. This incorporates the few assumptions implicit in the classical model illustrated in Figure 1A: spheres (‘factors’) bind to selected beads in a string (‘cognate sites’ on ‘chromatin fibers’) to form molecular bridges stabilizing loops (2630). This superficially simple model yields several unexpected results.

First, and extraordinarily, bound factors cluster spontaneously in the absence of any specified DNA–DNA or protein–protein interactions (Figure 2A (27)). This clustering requires bi- or multi-valency (so factors can bridge different regions and make loops) plus reversible binding (otherwise the system does not evolve), and it occurs robustly with respect to changes in DNA–protein affinity and factor number. The process driving it was dubbed the ‘bridging-induced attraction’ (27). We stress this attraction occurs spontaneously without the need to specify any additional forces between one bead and another, or between one protein and another.

Figure 2.

Figure 2.

A process driving the spontaneous clustering of multivalent factors (a.k.a., the ‘bridging-induced attraction’). (A) Overview of one Brownian-dynamics simulation. Red and green ‘factors’ (colored spheres) bind reversibly to ‘chromatin’ (a string of beads); red factors bind only to pink beads, green factors only to light-green ones (non-binding beads shown as black dots). Bound factors spontaneously cluster—red with red and green with green—despite any specified interactions between proteins or between beads. (B) Explanation. Local concentrations create positive-feedback loops driving growth of nascent clusters; bound factors and binding beads rarely escape, and additional factors/beads are caught as they diffuse by. Red and green clusters are inevitably separate in 3D space because their cognate binding sites are separate in 1D sequence space. Cluster growth is limited by entropic costs of crowding together ever-more loops. (C) Comparison of contact maps obtained from 10 simulations (28) and Hi-C (6). (i) The model. The whole of chromosome 19 (red box) in GM12878 cells was simulated, and the zoom shows the region around RAD23A, which is active in these cells. Each bead in the fiber is colored according to whether the corresponding region is transcriptionally highly active (pink), weakly active (green) or silent (grey) on the Broad ChromHMM track on the UCSC browser; one bead carries both active and silent marks and so bears two colors. Pink (activating) and black (repressing) factors bind to cognate beads as indicated (the doubly-colored bead binds both factors); all other beads (black dots) are non-binding. (ii, iii) Contact maps are similar. Black double-headed arrows: limits of prominent TADs on diagonal. Red double-headed arrows: centers of off-diagonal blocks marking compartments.

The basic mechanism yielding clustering is a simple positive feedback loop which works as sketched in Figure 2A and B. First, proteins bind to chromatin (Figure 2A). Then, once a bridge forms, the local density of binding sites (e.g. pink spheres in Figure 2A) inevitably increases. This attracts further factors from the soluble pool (like 2 in Figure 2B): their binding further increases the local chromatin concentration (through bridging) creating a virtuous cycle which repeats. This triggers the self-assembly of stable protein clusters, where growth is eventually limited by entropic crowding costs (28). Several factors cluster in nuclei (e.g. Sox2 in living mouse cells (31)) and the bridging-induced attraction provides a simple and general explanation for this phenomenon.

This process drives local phase separation of polymerases and factors, and so naturally explains how super-enhancer (SE) clusters form (Supplementary Figure S2Ai (32)). This generic tendency to cluster will be augmented by specific protein–protein and DNA–protein interactions, with their balance determining whether protein or DNA lies at the core. Similarly, the same process—this time augmented by HP1, a multivalent protein that staples together histones carrying certain modifications—could drive phase separation and compaction of inactive heterochromatin (Supplementary Figure S2Aii (33,34)).

Creating stable clusters of different types, TADs and compartments

This transcription-factor model yields a second remarkable result: red and green factors binding to distinct sites on the string self-assemble into distinct clusters containing only red factors or only green ones (Figure 2A (28)). This has a simple basis: the model specifies that red and green binding sites are separate in 1D sequence space (as they are in vivo), so they are inevitably in different places in 3D space (Figure 2B).

A third result is that clusters and loops self-assemble into ‘TADs’ and ‘A/B compartments’ (26–28). Thus, if chromosome 19 in human GM12878 cells is modeled as a string of beads colored according to whether corresponding regions are active or inactive, binding of just red and black spheres (‘activators’ and ‘repressors’) yields contact maps much like Hi-C ones (Figure 2C). As neither TADs, compartments, nor experimental Hi-C data are used as inputs, this points to polymerases and their factors driving the organization without the need to invoke roles for higher-order features (see also (7)). We suggest TADs arise solely by aggregation of pre-existing loops/clusters (note that degradation of cohesin or its loader induces TAD disappearance and the emergence of complex sub-structures, as A/B compartments persist and become more prominent (16,35)).

The simple transcription-factor model has been extended to explain how pre-existing red clusters can evolve into green clusters, or persist for hours as individual factors exchange with the soluble pool in seconds—as in photo-bleaching experiments (Supplementary Figure S3A,B (28,36)). Additionally, introducing ‘bookmarking’ factors that bind selected beads (genomic sequences), as well as ‘writers’ that ‘mark’ chromatin beads and ‘readers’ which bind beads with specific marks, can create local ‘epigenetic states’ and epigenetic domains (e.g. domains of red and green marks, representing for instance active or inactive histone modifications). Such domains spontaneously establish around bookmarks, and are stably inherited through ‘semi-conservative replication’, when half of the marks are erased (and/or some of the bookmarks are lost due to dilution (37,38); Supplementary Figure S3C).

A PARSIMONIOUS MODEL: CLUSTERS OF POLYMERASES AND FACTORS

These physical principles lead naturally to a model in which a central architectural feature is a cluster of active polymerases/factors surrounded by loops—a ‘transcription factory’. A factory was defined as a site containing ≥2 polymerases active on ≥2 templates, just to distinguish it from cases where two enzymes are active on one (Figure 3A (39,40)). Much as car factories contain high local concentrations of parts required to make cars efficiently, these factories contain machinery that acts through the law of mass action to drive efficient RNA production. For RNA polymerase II in HeLa, the concentration in a factory (i.e. ∼1 mM) is ∼1000-fold higher than the soluble pool; consequently, essentially all transcription occurs in factories (Supplementary Note 3; Supplementary Note 4 describes some properties of factories).

Figure 3.

Figure 3.

Transcription factories in human cells. (A) Clusters organize loops stabilized by polymerases (ovals) and factors (lozenges). There are ∼16 loops per factory, but only a few are shown here and subsequently. Red and green factories specialize in transcribing different gene sets. Promoters tend to be transcribed in factories of the same color (because they are rich in appropriate factors); here, p and s can often visit the pink factory, but only p is likely to initiate there. (B) A transcription cycle. Promoter e collides with a polymerase in the factory (shown as a solid sphere from now on), initiates, and the fixed polymerase reels in the template as it extrudes a transcript; the template detaches on termination. (C) ‘Miller’ spreads. (i) A Christmas tree. (ii) If the polymerase tracks, it rotates about the template once for every 10-bp transcribed to give an entwined transcript. (iii) If immobile, the template rotates and the transcript is not entwined. Topoisomerases remove twin domains of supercoiling in both (ii) and (iii) (41). (D) Tether length affects how often a promoter visits a factory. Top: a 77-kbp loop tethered to a 75-nm sphere; intuition suggests p visits the green volume more than q. Bottom: results of Monte-Carlo simulations confirm this intuition. Adapted from (42) with permission; copyright 2006 Elsevier.

In all models, a gene only becomes active if appropriate polymerases (i.e. I, II or III) and factors are present; in this one, there are three more requirements. First, active polymerases are transiently immobile when active; they reel in their templates as they extrude their transcripts (Figure 3B). This contrasts with the traditional view where they track like locomotives down templates. Arguably, the best (perhaps only) evidence supporting the traditional view comes from iconic images of ‘Christmas trees’; a 3D structure is spread in 2D, and imaged in an electron microscope—polymerases are caught in the act of making RNA (Figure 3Ci). However, polymerases moving along helical templates generate entwined transcripts (Figure 3Cii), but these transcripts appear as un-entwined ‘branches’ in ‘Christmas trees’. How could such structures arise? As transcription requires lateral and rotational movement along/around the helix, we suggest templates move (not polymerases) to give un-entwined transcripts (Figure 3Ciii). Consequently, these images provide strong evidence against the traditional model, not for it (see also Supplementary Note 5, Supplementary Figure S4).

Second, in order to initiate, a promoter must have a high probability of colliding with a polymerase, and—as the highest polymerase concentrations are found in/around factories—this means the enzyme must first diffuse into/near a factory. [We remain agnostic as to the order with which promoter, polymerase, factors and factory bind to each other, and note that the participants in nucleotide excision repair—a process arguably better understood than transcription (43)—are not assembled one after the other; instead the productive complex forms once all participants happen to collide simultaneously into each other.] In Figure 3D, intuition suggests p often visits the nearby green volume, whereas q mainly roams ‘outer space’; simulations and experiments confirm this (42,44). Consequently, active genes tend to be tethered close to a factory, and inactive genes further away. Promoter-factory distances also seem to remain constant as nuclear volume changes; when mouse ES cells differentiate and their nuclei become 2-fold larger or 2-fold smaller, experiments show the system spontaneously adapts to ensure these distances remain roughly constant, and new simulations confirm this (Supplementary Figure S6; Supplementary Note 6).

Third, there are different types of factory (red and green clusters in Figure 3A), and a gene must visit an appropriate one to initiate. Just as some car factories make Toyotas and others Teslas, different factories specialize in transcribing different sets of genes. For example, distinct ‘ERα’, ‘KLF1’ and ‘NFκB’ factories specialize in transcribing genes involved in the estrogen response, globin production, and inflammation, respectively (45–47).

These three principles combine to ensure the structure is probabilistic and dynamic, with current shape depending on past and present environments. For example, as e in Figure 3D is transcribed, loop length changes continuously. And when e terminates, it dissociates; then, its diffusional path may take it back to the same factory where it may (or may not) re-initiate to reform a loop. Alternatively, e may spend some time diffusing through outer space before rebinding to the same or a different factory. Consequently, as factors and polymerase bind and dissociate, factories morph, loops appear and disappear—and the looping pattern of every chromosomal segment changes from moment to moment. Then, it is unlikely the 3D structure of any chromosome is like that of its homolog, either in the same cell or any other cell in a clonal population.

These physical principles also lead naturally to an explanation of how genes become inactive. Thus, q in Figure 3D is inactive because it lies far away from an appropriate factory and is unlikely to collide with a polymerase there. We speculate that inactivity results in histone modifications that thicken the fiber, so entropic effects collapse it with other heterochromatic fibers into B compartments and the nuclear periphery (as in Supplementary Figure S1Ai).

SOME DIFFICULT-TO-EXPLAIN OBSERVATIONS

We now describe results easily explained by this model, but difficult or impossible to explain by others without additional complicated assumptions (see also Supplementary Note 7).

Most contacts are between active transcription units

Contacts seen by 3C-based approaches often involve active promoters and enhancers; for example, FIRES (frequently-interacting regions) in 14 different human tissues and 7 human cell lines are usually active enhancers (48). Similarly, contacts detected by an independent method—genome architecture mapping—again involve enhancers and/or genic transcription start/end sites (49). Why should active sequences lie together? As factories nucleate local concentrations of active units, we expect promoters and enhancers to dominate contact lists.

While 3C focuses on contacts between two DNA sequences, the ligation involved can join >2 together (24 is the current record), and these again generally encode active sequences (50,51). Why do so many active sequences contact each other? We expect to see co-ligations involving some/all of the many anchors in a typical factory.

Early studies also point to a correlation between transcription and structure. For example, switching on/off many mammalian genes correlates with their attachment/detachment (40). What underlies this? Our model requires that units must attach before they can be transcribed.

Frequencies of cis and trans contacts

Cis Hi-C contacts fall off rapidly with increasing genetic distance, whereas trans ones are so rare they are often treated as background. However, ChIA-PET yields more trans than cis contacts when active sequences are selected by pulling down ERα or polymerase II (45,47). Our model again predicts this—active genes on different chromosomes are often co-transcribed in the same specialized factory (as genes diffuse out of one ellipsoidal territory into another).

In addition, cis:trans ratios can change rapidly, and we explain this by reference to ‘NFκB’ factories (47) (see also Supplementary Figure S5A). TNFα induces phosphorylation of NFκB, nuclear import of phospho-NFκB, and transcriptional initiation of many inflammatory genes including SAMD4A. Before induction, the SAMD4A promoter makes only a few local cis contacts (shown by 4C and ChIA-PET applied with a ‘pull-down’ of polymerase II); it spends most time roaming ‘outer space’ making a few chance contacts with nearby segments of its own loop, and—if it visits a factory—it cannot initiate in the absence of phospho-NFκB. But once phospho-NFκB appears (10 min after adding TNFα), it initiates. Then, NFκB binding sites in SAMD4A become tethered to the factory, these bind phospho-NFκB, exchange of the factor increases the local concentration, and this increases the chances that other inflammatory genes initiate when they pass by. And once they do, this creates a virtuous cycle; as more inflammatory genes initiate, more NFκB binding sites become tethered to the factory, the local NFκB concentration rises, this further increases the chances that passing responsive genes initiate, and the factory evolves into one specializing in transcribing inflammatory genes. As a result, the rapid concentration of inflammatory genes around the resulting ‘NFκB’ factory yields the rapid increase in cis and trans contacts between them seen by 3C-based methods and RNA-FISH (47).

TADs exist at all scales

Intra- and inter-TAD contact frequencies differ only ∼2-fold; therefore, it is unsurprising that TAD calling depends on which algorithm is used, and the resolution achieved (5255). However, it is surprising that TADs become more elusive as algorithms and resolution improve. For example, CaTCH (Caller of Topological Chromosomal Hierarchies) identifies a continuous spectrum of domains covering all scales; TADs do not stand out as distinct structures at any level in the hierarchy (55). Moreover, TADs are sometimes invisible in single-cell data (56,57), and—if detected—their borders weaken as cells progress through G1 into S phase (58). In our model, TADs do not exist as distinct entities representing anything other than one or more loops around one or more factories. [TADs are said to be major architectural features because they are invariant between cell types (4,5) and highly conserved (59). However, there are always slight differences between cell types that could reflect slight differences in expression profile, and the conservation could just reflect the conserved transcriptional pattern encoded by the underlying DNA sequence.]

The relationship between TADs and transcription

Various studies address this issue, and give conflicting results. For example, in mouse neural progenitor cells, one of the two X chromosomes is moderately compacted and largely inactive. Inactive regions do not assemble into A/B compartments or TADs, unlike active ones. Moreover, in different clones, different regions in the inactive X escape inactivation, and these form TADs (60). Here, structure and activity are tightly correlated (in accord with our model). Similarly, inhibiting transcription in the fly leads to a general reorganization of TAD structure, and a weakening of border strength (61).

Another study points to some TADs appearing even though transcription is inhibited (62). After fertilization, the zygotic nucleus in the fly egg is transcriptionally inactive. As the embryo divides, zygotic genome activation occurs so that by nuclear cycle 8 (nc8), ∼180 genes are active, and these seem to nucleate a few TADs detected at nc12 (so transcriptional onset and the appearance of loops/TADs correlate—again in accord with our model). As more genes become active at nc13, 3-fold more TADs develop by nc14, and polymerase II plus Zelda (a zinc-finger transcription factor) are at boundaries (again a positive correlation). If transcriptional inhibitors are injected into embryos before nc8, boundaries and TADs seen at nc14 are less prominent, but some TADs still develop (implying loops/TADs appear independently of transcription, which is inconsistent with our model). However, interpretation is complicated. Although inhibitors reduce levels of five mRNAs already being expressed, they only slightly affect levels of polymerase II bound at the 5′ end of genes expressed at nc14; this indicates that inhibition is inefficient, so it remains possible that the remaining transcription stabilizes the loops/TADs seen.

Studies on mouse eggs and embryos also provide conflicting data. Thus, activity is lost as oocytes mature, and TADs plus A/B compartments disappear (56,63,64); therefore, loss of structure and activity again correlate (consistent with our model). After fertilization, the zygote contains two nuclei with different conformations; both contain TADs, but the maternal one lacks A/B compartments. Then, as transcription begins, TADs appear (again a positive correlation), but α-amanitin (a transcriptional inhibitor) does not prevent this (63,64)—which is inconsistent with our model. However, interpretation is again complicated: α-amanitin acts notoriously slowly (65), and inhibition was demonstrated indirectly (levels of steady-state poly(A)+ RNA fall, but reduction of intronic RNA would be a more direct indicator of inhibition).

Data from zebrafish make unified interpretation even more difficult. In contrast to some cases cited earlier, TADs and compartments exist before zygotic gene activation, and many of each are lost when transcription begins (66). Clearly, TAD-centric models will find it difficult to explain such conflicting data. In ours, TADs are not major architectural features determining function; they just reflect the underlying network of loops, and—even if all polymerases are inactive—bound factors can still stabilize some loops (and so TADs).

Enhancers and super-enhancers

Enhancers are important regulatory motifs, but there remains little agreement on how they work (67). They were originally defined as motifs stimulating firing of genic promoters when inserted in either orientation upstream or downstream. However, their molecular marks are so like those of their targets (68) that FANTOM5 now defines them solely as promoters firing to yield eRNAs (enhancer RNAs) rather than mRNAs (69). Then, is it eRNA production or some role of the eRNA product that underlies function? Studies of the Sfmbt2 enhancer in mouse ES cells indicates it is the former (70). Thus, deleting the eRNA promoter (but not downstream sequences) impairs enhancer activity; this points to the promoter being required. Moreover, inserting a poly(A) site just 40 bp down-stream of the eRNA promoter abolishes enhancer activity, and amounts of polymerase on the enhancer (and enhancer activity) increase as the insert is moved progressively 3′; this points to a reduction in transcription correlating with reduced enhancer activity.

Our model suggests a simple mechanism for enhancer function: transcription of e in Figure 4Ai ensures p is tethered close to an appropriate factory. In other words, e is an enhancer of p because close tethering increases the probability that p collides with a polymerase in the factory (and so often initiates). The model also explains how enhancers can act over such great distances (Supplementary Figure S5B,C). Thus, a typical factory in a human cell is associated with ∼10 loops each with an average contour length of ∼86 kbp (Supplementary Note 1), so an enhancer anchored to it can (indirectly) tether a target promoter in any one of these other loops to the same factory. As we will see, enhancers can act over even greater distances to tether targets in a nuclear region containing an appropriate factory.

Figure 4.

Figure 4.

Enhancers and boundaries. (A) Enhancer action. (i) p is tethered by enhancer e close to a factory—so p is likely to collide with the factory. (ii) p has initiated, and the polymerase is about to transcribe 1. (iii) The same polymerase will now transcribe 2; then, e-p contacts apparently track with the polymerase away from p. (iv) Both polymerases now terminate, e and p detach, and e reinitiates. As p is still tethered close to the factory, it is likely to initiate again and continue the transcriptional burst. (v) Both polymerases have terminated, and the fiber has diffused away from the factory; both e and p enter a silent period, as both are far from the factory. (B) SEs increase the time p is close to a factory. (i) The structure is as Ai, but now the enhancer contains 3 promoters; as before, p is tethered close to a factory and likely to initiate. (ii) The polymerase transcribing e has terminated; as there are 3 SE promoters, there is a 3-fold higher chance one will collide with the factory (here e’) compared to A. (iii) e’ has initiated, so p remains closely-tethered for longer and likely to initiate more often than in A. (C) Boundaries. (i) a, b, and c have initiated in different factories. (ii) a has terminated, and is more likely to visit the upper green factory compared to the distant lower one. (iii) a has re-initiated in the nearby green factory. We call b a boundary because it apparently prevents a from contacting c.

This model provides solutions to many conundrums associated with enhancers, including the following. (i) Enhancer activity depends on contact with its target promoter (71,72). We suggest the two often share a factory, and so are often in contact. (ii) Enhancers can act on two targets simultaneously, and coordinate their firing (73,74)—impossible according to classical models. In Figure 4Ai, e acts on both d and p, and it is easy to imagine that d and p initiate coordinately because the two polymerases involved sit side-by-side in the same factory. (iii) Promoters of protein-coding genes are often enhancers of other protein-coding genes (70,75,76). In our model, e is an enhancer irrespective of whether it encodes an mRNA or eRNA. (iv) Enhancers act both promiscuously and selectively. They interact with many other enhancers and targets (7779), with ≥4 controlling a typical gene expressed during fly embryogenesis (80). At the same time, they are selective; thousands have the potential to activate a fly gene encoding an ubiquitously-expressed ribosomal-protein, whilst a different set can act on a developmentally-regulated factor (81). In our model, ‘red’ enhancers tether ‘red’ genic promoters close to ‘red’ factories, as ‘green’ ones do the same with a different set. (v) Enhancer-target contacts apparently track with the polymerase down the target (82). Thus, when mouse Kit becomes active, the enhancer first touches the Kit promoter before contacts move progressively 3′ at the speed of the pioneering polymerase. This is impossible with conventional models, but simply explained if polymerases transcribing enhancer and target are attached to one factory (Figure 4Aii,iii). (vi) Single-molecule RNA FISH shows forced looping of the β-globin enhancer to its target increases transcriptional burst frequency but not burst size (83), and this general effect is confirmed by live-cell imaging of Drosophila embryos (73,74). Such bursting arises because many ‘active’ genes are silent much of the time, and when active they are associated with only one elongating polymerase (Supplementary Note 8). Periods of activity do not occur randomly; rather, short bursts are interspersed by long silent periods. Bursting is usually explained by an equilibrium between ill-defined permissive and restrictive states; we explain it as follows. In Figure 4A, p often fires when tethered near the factory (giving a burst). Then, once e terminates, close tethering is lost—and p remains silent for as long as it remains far from an appropriate factory. RNA FISH experiments on human SAMD4A support this explanation; the promoter is usually silent, but adding TNFα induces successive attachments/detachments to/from a factory (44).

A related conundrum concerns how SEs work. SEs are groups of enhancers that are closely-spaced on the genetic map and often target genes determining cell identity (32,84). In Figure 4Bi, increasing the number of closely-spaced promoters (e, e’, e’’) in the SE increases the time p spends near a factory (to increase its firing probability).

Boundaries

TAD boundaries in higher eukaryotes are often marked by CTCF; however, they are also rich in active units marked by polymerase II, nascent RNA, and factors like YY1 (4,6,22). Similarly, fly boundaries are rich in constitutively-active genes but de-enriched for insulators dCTCF and Su(Hw) (7,85). Additionally, in yeast (which lacks CTCF), boundaries are often active promoters (11). Then, does the act of transcription create a boundary? Studies in Caulobacter crescentus—which lacks CTCF but possesses TADs—shows it does (14). For example, in a rich medium, a rDNA gene is a strong boundary; however, this boundary disappears in a poor medium when rRNA synthesis subsides. Inserting active rsaA in the middle of a TAD also creates a new boundary, and boundary strength progressively falls when the length of the transcribed insert is reduced. We imagine ongoing transcription underlies boundary activity (Figure 4C).

A GREAT MYSTERY: GENE REGULATION IS WIDELY DISTRIBUTED

Classical studies on bacterial repressors (lambda, lac) inform our thinking on how regulators work: they act locally as binary switches. We assume eukaryotes are more complicated, with more local switches, plus a few global ones (e.g. Oct3/4, Sox2, c-Myc, Klf4). We are encouraged to think this by studies on some diseases (86). For example, KLF1 regulates β globin expression by binding to its cognate site upstream of the β-globin gene (HBB); a C to G substitution at position -87 reduces binding, and this reduces HBB expression and causes β-thalassaemia. Therefore, we might expect binding of factors to promoters of coding genes drives phenotypic variation. However, results obtained using GWAS (genome-wide association studies)—an unbiased way of finding which genetic loci affect a phenotype—lead to a different view for many diseases; they are so unexpected that only general explanations are proffered for them (86–88).

eQTLs

Quantitative trait loci (QTLs) are sequence variants (usually single-nucleotide changes) occurring naturally in populations that influence phenotypes. Most QTLs affecting disease do not encode transcription factors or global regulators; instead, they map to non-coding regions, especially enhancers (77,88). eQTLs are QTLs affecting transcript levels, and were also expected to encode transcription factors; but again, many do not (88,89). They also map to enhancers (88) and regulate distant genes both cis and trans (9092). Additionally, eQTLs and their targets are often in contact (77), and one trans-eQTL can act on hundreds of genes around the genome—which often encode functionally-related proteins regulated by similar factors (88,90,92,93). In summary, eukaryotic gene regulation involves distant and distributed eQTLs that look like enhancers. Moreover, copy number of a transcript is a polygenic trait much like susceptibility to type II diabetes or human height—traits where hundreds of regulatory loci have been identified and where many more await discovery (91). This complexity is captured by the ‘omnigenic’ model, where eQTLs affect levels of target mRNAs indirectly; they modulate levels, locations, and post-translational modifications of unrelated proteins, and these changes percolate throughout the cellular network before feeding back into nuclei to affect transcription of targets (88). We suggest another—very direct—mechanism.

A model for direct eQTL action

In Figure 5A, all units in the volume determine network structure, and how often each unit visits an appropriate factory; consequently, all units directly affect production of all other transcripts. In other words, gene regulation is widely distributed. A single nucleotide change in enhancer b (perhaps an eQTL) might reduce binding of a ‘yellow’ factor and b’s firing frequency, and this has consequential effects on how often d and a are tethered close to the yellow factory—and so can initiate. But this change influences the whole network. By altering positions relative to appropriate factories, an eQTL ‘communicates’ directly with functionally-related targets, and indirectly (but still at the level of transcription) with all other genes around it in nuclear space. This neatly reconciles how eQTLs target functionally-related genes whilst having omnigenic effects (because targets often share the same specialized factory and nuclear volume, respectively).

Figure 5.

Figure 5.

Regulation is widely distributed—an omnigenic model. (A) Activity of every transcription unit (small circles) in the volume depends on the activity of neighbours. b acts simultaneously as an enhancer of a and d (by tethering them close to the yellow factory) and a silencer of c (by tethering it far from a pink factory). r acts as a boundary between different TADs containing p and s; it also silences q, by preventing it from accessing a gray factory. Purple units are promiscuous, often initiating in factories of another color. (B) Molecular-dynamics simulations of eQTL action. (i) Overview. One simulation in a set of 200 involves 5 ‘factors’ (colored 30-nm spheres) binding reversibly to cognate beads of similar color randomly distributed along a ‘wild-type’ string (30-nm bead—3 kb). Factors can be ‘de-phosphorylated/phosphorylated’ to lose/gain affinity at equal rates (∼0.00001 inverse Brownian times, or ∼0.001 s−1). Another set involves a ‘mutant’ string with an ‘eQTL’ where yellow bead 1983 becomes non-binding. (iiiii) Snapshots of ‘wild-type’ and ‘mutant’ fibers (bead 1983 shown black, arrowed; factors not shown). Boxes: magnifications of regions around bead 1983 with/without non-binding beads (grey). (iv) Positions and colors of all binding beads with altered transcription probabilities. We assume a chromatin bead is transcribed if it is within 54 nm of a factor of the corresponding color—when transcribed a bead is also typically in a cluster. Statistical significance for changes in histograms for binding beads shown is calculated assuming Gaussian statistics; histograms are different with p-value p < 0.009, and <2 beads are expected to change this much by chance.

The idea that altering one loop in a network has global effects was tested using simulations of five factors binding to cognate sites in a 5000-bead string (Figure 5Bi; Supplementary Note 6 gives details); as expected, bound factors spontaneously cluster (Figure 5Bii). We next create an ‘eQTL’ in the middle of the (‘wild-type’) string by abolishing binding to one yellow bead. This ‘mutant’ bead is now rarely in a cluster (Figure 5Biii, arrow), and it increases or decreases clustering probabilities of many other genes on the string (Figure 5Biv). As clustering determines activity, these simulations provide a physical basis for direct omnigenic effects, and open up the possibility of modeling their action. Results are robust, as, for instance, simulations with different binding affinity, or with factors and binding sites of only a single color, lead to qualitatively similar conclusions.

LIMITATIONS OF THE MODEL

Whilst we have seen that the transcription-factory and transcription-factor models can explain many disparate observations, from phase separation of active and inactive chromatin through to eQTL action, this review would not be complete without a critical discussion of their limitations. Besides the complicated relation between TADs and transcription already reviewed, we list here some other challenges to our model.

First, the simplest version of our model does not immediately account for the bias in favor of convergent CTCF loops (over divergent ones)—which is naturally explained by the ‘loop-extrusion’ model (24,25,94,95) (see also Supplementary Note 2). However, the loop-extrusion and transcription-factor model are not alternative to one another, but complementary, so convergent loops are naturally recovered by a combined model where chromosomes are organized by both transcription factors and cohesin (bioRxiv: https://doi.org/10.1101/305359). Additionally, the motor activity behind loop extrusion, if present, may be provided by transcription itself (96) (Supplementary Note 2).

Second, the structures of mitotic and sperm chromatin pose a challenge to all models (Supplementary Notes 9 and 10). For ours, it is difficult to reconcile the persistence of loops during these stages with the common assumption that all factors are lost from chromatin. However, recent results suggest this assumption is incorrect, and that many factors do actually remain bound in mitosis (97) (Supplementary Note 9). The case of sperm is harder to explain. We speculate cohesin and other factors may still operate, and this might be sufficient to explain the observations (Supplementary Note 10).

CONCLUSION

Seeing is believing. While clusters of RNA polymerase II tagged with GFP are seen in images of living cells (98102), decisive experiments confirming ideas presented here will probably involve high-resolution temporal and spatial imaging of single polymerases active on specified templates. But these are demanding experiments because it is so difficult to know which kinetic population is being imaged. For example, an inactive pool of polymerase constitutes a high background; ∼80% is in a rapidly-exchanging pool, and so soluble or bound non-specifically (103). If mammalian polymerases are like bacterial ones, most at promoters fails to initiate, and—of ones that do initiate—99% abort within ∼10 nucleotides to yield transcripts too short to be seen by RNA-seq (104). Then, eukaryotic enzymes on both strands abort within 20–500 nucleotides to give products seen by RNA-seq as promoter-proximal peaks (105). On top of this, ∼60% further into genes pause for unknown periods (106). We may also think that active and inactive polymerases are easily distinguished using inhibitors, but DRB and flavopiridol do not block some polymerases at promoters (e.g. ones phosphorylated at Ser5 of the C-terminal domain), α-amanitin takes hours to act, and both α-amanitin and triptolide trigger polymerase destruction (65).

In biology, structure and function are inter-related. Here, we suggest that many individual acts of transcription determine global genome conformation, and this—in turn—feeds back to directly influence the firing of each individual transcription unit. Consequently, ‘omnigenic’ effects work both ways. [Note the term ‘omnigenic’ is used here to include both genic and non-genic transcription units.] In other words, transcription is the most ancient and basic driver of the organization in all kingdoms, with recently-evolved factors like CTCF modulating this basic structure. It also seems likely that transcription factories nucleate related ones involved in replication, repair, and recombination (40), as well as organizing mitotic chromosomes (Supplementary Note 9). They may also play important roles in other mysterious processes like meiotic chromosome pairing and transvection (107).

Supplementary Material

Supplementary Data

ACKNOWLEDGEMENTS

We thank Robert Beagrie, Chris A. Brackley, Davide Michieletto and Akis Papantonis for helpful discussions.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

European Research Council [CoG 648050, THREEDCELLPHYSICS; DM]; Medical Research Council [MR/KO10867/1; PRC]. Funding for open access charge: European Research Council [CoG 648050, THREEDCELLPHYSICS].

Conflict of interest statement. None declared.

REFERENCES

  • 1. Dekker J., Mirny L.. The 3D genome as moderator of chromosomal communication. Cell. 2016; 164:1110–1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Dixon J.R., Gorkin D.U., Ren B.. Chromatin domains: the unit of chromosome organization. Mol. Cell. 2016; 62:668–680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Lieberman-Aiden E., van Berkum N.L., Williams L., Imakaev M., Ragoczy T., Telling A., Amit I., Lajoie B.R., Sabo P.J., Dorschner M.O. et al. Comprehensive mapping of Long-Range interactions reveals folding principles of the human genome. Science. 2009; 326:289–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Dixon J.R., Selvaraj S., Yue F., Kim A., Li Y., Shen Y., Hu M., Liu J.S., Ren B.. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012; 485:376–380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Nora E.P., Lajoie B.R., Schulz E.G., Giorgetti L., Okamoto I., Servant N., Piolot T., van Berkum N.L., Meisig J., Sedat J. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012; 485:381–385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Rao S.S., Huntley M.H., Durand N.C., Stamenova E.K., Bochkov I.D., Robinson J.T., Sanborn A.L., Machol I., Omer A.D., Lander E.S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014; 159:1665–1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Rowley M.J., Nichols M.H., Lyu X., Ando-Kuri M., Rivera I.S.M., Hermetz K., Wang P., Ruan Y., Corces V.G.. Evolutionarily conserved principles predict 3D chromatin organization. Mol. Cell. 2017; 67:837–852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Nora E.P., Goloborodko A., Valton A.-L., Gibcus J.H., Uebersohn A., Abdennur N., Dekker J., Mirny L.A., Bruneau B.G.. Targeted degradation of CTCF decouples local insulation of chromosome domains from genomic compartmentalization. Cell. 2017; 169:930–944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Crane E., Bian Q., McCord R.P., Lajoie B.R., Wheeler B.S., Ralston E.J., Uzawa S., Dekker J., Meyer B.J.. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature. 2015; 523:240–244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Galazka J.M., Klocko A.D., Uesaka M., Honda S., Selker E.U., Freitag M.. Neurospora chromosomes are organized by blocks of importin alpha-dependent heterochromatin that are largely independent of H3K9me3. Genome Res. 2016; 26:1069–1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Hsieh T.-H.S., Weiner A., Lajoie B., Dekker J., Friedman N., Rando O.J.. Mapping nucleosome resolution chromosome folding in yeast by micro-C. Cell. 2015; 162:108–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Mizuguchi T., Fudenberg G., Mehta S., Belton J.-M., Taneja N., Folco H.D., FitzGerald P., Dekker J., Mirny L., Barrowman J. et al. Cohesin-dependent globules and heterochromatin shape 3D genome architecture in S. pombe. Nature. 2014; 516:432–435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Liu C., Wang C., Wang G., Becker C., Zaidem M., Weigel D.. Genome-wide analysis of chromatin packing in Arabidopsis thaliana at single-gene resolution. Genome Res. 2016; 26:1057–1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Le T.B., Laub M.T.. Transcription rate and transcript length drive formation of chromosomal interaction domain boundaries. EMBO J. 2016; 35:1582–1595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Dekker J., Belmont A.S., Guttman M., Leshyk V.O., Lis J.T., Lomvardas S., Mirny L.A., Oshea C.C., Park P.J., Ren B. et al. The 4D nucleome project. Nature. 2017; 549:219–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Rao S.S., Huang S.-C., St Hilaire B.G., Engreitz J.M., Perez E.M., Kieffer-Kwon K.-R., Sanborn A.L., Johnstone S.E., Bascom G.D., Bochkov I.D. et al. Cohesin loss eliminates all loop domains. Cell. 2017; 171:305–320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Levi V., Ruan Q., Plutz M., Belmont A.S., Gratton E.. Chromatin dynamics in interphase cells revealed by tracking in a two-photon excitation microscope. Biophys. J. 2005; 89:4275–4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Lucas J.S., Zhang Y., Dudko O.K., Murre C.. 3D trajectories adopted by coding and regulatory DNA elements: first-passage times for genomic interactions. Cell. 2014; 158:339–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Cook P.R., Marenduzzo D.. Entropic organization of interphase chromosomes. J. Cell. Biol. 2009; 186:825–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Jun S., Wright A.. Entropy as the driver of chromosome segregation. Nat. Rev. Microbiol. 2010; 8:600–607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Rippe K. Making contacts on a nucleic acid polymer. Trends Biochem. Sci. 2001; 26:733–740. [DOI] [PubMed] [Google Scholar]
  • 22. Weintraub A.S., Li C.H., Zamudio A.V., Sigova A.A., Hannett N.M., Day D.S., Abraham B.J., Cohen M.A., Nabet B., Buckley D.L. et al. YY1 is a structural regulator of enhancer-promoter loops. Cell. 2017; 171:1573–1588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Marenduzzo D., Finan K., Cook P.R.. The depletion attraction: an underappreciated force driving cellular organization. J. Cell Biol. 2006; 175:681–686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Sanborn A.L., Rao S. S.P., Huang S.-C., Durand N.C., Huntley M.H., Jewett A.I., Bochkov I.D., Chinnappan D., Cutkosky A., Lia J. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl. Acad. Sci. U.S.A. 2015; 112:E6456–E6465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Fudenberg G., Imakaev M., Lu C., Goloborodko A., Abdennur N., Mirny L.A.. Formation of chromosomal domains by loop extrusion. Cell Rep. 2016; 15:2038–2049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Barbieri M., Chotalia M., Fraser J., Lavitas L.-M., Dostie J., Pombo A., Nicodemi M.. Complexity of chromatin folding is captured by the strings and binders switch model. Proc. Natl. Acad. Sci. U.S.A. 2012; 109:16173–16178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Brackley C.A., Taylor S., Papantonis A., Cook P.R., Marenduzzo D.. Nonspecific bridging-induced attraction drives clustering of DNA-binding proteins and genome organization. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:E3605–E3611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Brackley C.A., Johnson J., Kelly S., Cook P.R., Marenduzzo D.. Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domains. Nucleic Acids Res. 2016; 44:3503–3512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Bianco S., Chiariello A.M., Annunziatella C., Esposito A., Nicodemi M.. Predicting chromatin architecture from models of polymer physics. Chromosome Res. 2017; 25:25–34. [DOI] [PubMed] [Google Scholar]
  • 30. Haddad N., Jost D., Vaillant C.. Perspectives: using polymer modeling to understand the formation and function of nuclear compartments. Chromosome Res. 2017; 25:35–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Liu Z., Legant W.R., Chen B.C., Li L., Grimm J.B., Lavis L.D., Betzig E., Tjian R.. 3D imaging of Sox2 enhancer clusters in embryonic stem cells. Elife. 2014; 3:e04236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Hnisz D., Shrinivas K., Young R.A., Chakraborty A.K., Sharp P.A.. A phase separation model for transcriptional control. Cell. 2017; 169:13–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Larson A.G., Elnatan D., Keenen M.M., Trnka M.J., Johnston J.B., Burlingame A.L., Agard D.A., Redding S., Narlikar G.J.. Liquid droplet formation by HP1α suggests a role for phase separation in heterochromatin. Nature. 2017; 547:236–240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Strom A.R., Emelyanov A.V., Mir M., Fyodorov D.V., Darzacq X., Karpen G.H.. Phase separation drives heterochromatin domain formation. Nature. 2017; 547:241–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Schwarzer W., Abdennur N., Goloborodko A., Pekowska A., Fudenberg G., Loe-Mie Y., Fonseca N.A., Huber W., Haering C.H., Mirny L. et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature. 2017; 551:51–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Brackley C.A., Liebchen B., Michieletto D., Mouvet F.L., Cook P.R., Marenduzzo D.. Ephemeral protein binding to DNA shapes stable nuclear bodies and chromatin domains. Biophys. J. 2017; 28:1085–1093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Michieletto D., Orlandini E., Marenduzzo D.. Polymer model with epigenetic recolouring reveals a pathway for the de novo establishment and 3D organisation of chromatin domains. Phys. Rev. X. 2016; 6:041047. [Google Scholar]
  • 38. Michieletto D., Chiang M., Coli D., Papantonis A., Orlandini E., Cook P.R., Marenduzzo D.. Shaping epigenetic memory via genomic bookmarking. Nucleic Acids Res. 2017; 46:83–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Rieder D., Trajanoski Z., McNally J.. Transcription factories. Front. Genetics. 2012; 3:221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Papantonis A., Cook P.R.. Transcription factories: genome organization and gene regulation. Chem. Rev. 2013; 113:8683–8705. [DOI] [PubMed] [Google Scholar]
  • 41. Ahmed W., Sala C., Hegde S.R., Jha R.K., Cole S.T., Nagaraja V.. Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase. PLoS Genet. 2017; 13:e1006754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Bon M., Marenduzzo D., Cook P.R.. Modeling a self-avoiding chromatin loop: relation to the packing problem, action-at-a-distance, and nuclear context. Structure. 2006; 14:197–204. [DOI] [PubMed] [Google Scholar]
  • 43. Dinant C., Luijsterburg M., Hofer T., von Bornstaedt G., Vermeulen W., Houtsmuller A., van Driel R.. Assembly of multiprotein complexes that control genome function. J. Cell. Biol. 2009; 185:21–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Larkin J.D., Papantonis A., Cook P.R., Marenduzzo D.. Space exploration by the promoter of a long human gene during one transcription cycle. Nucleic Acids Res. 2013; 41:2216–2227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Fullwood M.J., Liu M.H., Pan Y.F., Liu J., Xu H., Mohamed Y.B., Orlov Y.L., Velkov S., Ho A., Mei P.H. et al. An oestrogen-receptor-α-bound human chromatin interactome. Nature. 2009; 462:58–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Schoenfelder S., Sexton T., Chakalova L., Cope N.F., Horton A., Andrews S., Kurukuti S., Mitchell J.A., Umlauf D., Dimitrova D.S. et al. Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells. Nat. Genet. 2010; 42:53–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Papantonis A., Kohro T., Baboo S., Larkin J.D., Deng B., Short P., Tsutsumi S., Taylor S., Kanki Y., Kobayashi M. et al. TNFα signals through specialized factories where responsive coding and miRNA genes are transcribed. EMBO J. 2012; 31:4404–4414. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Schmitt A.D., Hu M., Jung I., Xu Z., Qiu Y., Tan C.L., Li Y., Lin S., Lin Y., Barr C.L. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 2016; 17:2042–2059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Beagrie R.A., Scialdone A., Schueler M., Kraemer D. C.A., Chotalia M., Xie S.Q., Barbieri M., de Santiago I., Lavitas L.-M., Branco M.R. et al. Complex multi-enhancer contacts captured by genome architecture mapping. Nature. 2017; 543:519–524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Ay F., Vu T.H., Zeitz M.J., Varoquaux N., Carette J.E., Vert J.-P., Hoffman A.R., Noble W.S.. Identifying multi-locus chromatin contacts in human cells using tethered multiple 3C. BMC Genomics. 2015; 16:121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Olivares-Chauvet P., Mukamel Z., Lifshitz A., Schwartzman O., Elkayam N.O., Lubling Y., Deikus G., Sebra R.P., Tanay A.. Capturing pairwise and multi-way chromosomal conformations using chromosomal walks. Nature. 2016; 540:296–300. [DOI] [PubMed] [Google Scholar]
  • 52. Schmitt A.D., Hu M., Ren B.. Genome-wide mapping and analysis of chromosome architecture. Nat. Rev. Mol. Cell Biol. 2016; 17:743–755. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Dali R., Blanchette M.. A critical assessment of topologically associating domain prediction tools. Nucleic Acids Res. 2017; 45:2994–3005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Forcato M., Nicoletti C., Pal K., Livi C.M., Ferrari F., Bicciato S.. Comparison of computational methods for Hi-C data analysis. Nat. Methods. 2017; 14:679–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Zhan Y., Mariani L., Barozzi I., Schulz E.G., Blüthgen N., Stadler M., Tiana G., Giorgetti L.. Reciprocal insulation analysis of Hi-C data shows that TADs represent a functionally but not structurally privileged scale in the hierarchical folding of chromosomes. Genome Res. 2017; 27:479–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Flyamer I.M., Gassler J., Imakaev M., Brandao H.B., Ulianov S.V., Abdennur N., Razin S.V., Mirny L.A., Tachibana-Konwalski K.. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature. 2017; 544:110–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Stevens T.J., Lando D., Basu S., Atkinson L.P., Cao Y., Lee S.F., Leeb M., Wohlfahrt K.J., Boucher W., O’Shaughnessy-Kirwan et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature. 2017; 544:59–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Nagano T., Lubling Y., Varnai C., Dudley C., Leung W., Baran Y., Mendelson-Cohen N., Wingett S., Fraser P., Tanay A.. Cell-cycle dynamics of chromosomal organisation at single-cell resolution. Nature. 2017; 547:61–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Harmston N., Ing-Simmons E., Tan G., Perry M., Merkenschlager M., Lenhard B.. Topologically associating domains are ancient features that coincide with Metazoan clusters of extreme noncoding conservation. Nat. Commun. 2017; 8:441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Giorgetti L., Lajoie B.R., Carter A.C., Attia M., Zhan Y., Xu J., Chen C.J., Kaplan N., Chang H.Y., Heard E. et al. Structural organization of the inactive X chromosome in the mouse. Nature. 2016; 535:575–579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Li L., Lyu X., Hou C., Takenaka N., Nguyen H.Q., Ong C.-T., Cubeñas-Potts C., Hu M., Lei E.P., Bosco G. et al. Widespread rearrangement of 3D chromatin organization underlies polycomb-mediated stress-induced silencing. Mol. Cell. 2015; 58:216–231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Hug C.B., Grimaldi A.G., Kruse K., Vaquerizas J.M.. Chromatin architecture emerges during zygotic genome activation independent of transcription. Cell. 2017; 169:216–228. [DOI] [PubMed] [Google Scholar]
  • 63. Du Z., Zheng H., Huang B., Ma R., Wu J., Zhang X., He J., Xiang Y., Wang Q., Li Y. et al. Allelic reprogramming of 3D chromatin architecture during early mammalian development. Nature. 2017; 547:232–235. [DOI] [PubMed] [Google Scholar]
  • 64. Ke Y., Xu Y., Chen X., Feng S., Liu Z., Sun Y., Yao X., Li F., Zhu W., Gao L. et al. 3D chromatin structures of mature gametes and structural reprogramming during mammalian embryogenesis. Cell. 2017; 170:367–381. [DOI] [PubMed] [Google Scholar]
  • 65. Bensaude O. Inhibiting eukaryotic transcription. Which compound to choose? How to evaluate its activity. Transcription. 2011; 2:103–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Kaaij L.J., van der Weide R.H., Ketting R.F., de Wit E.. Systemic loss and gain of chromatin architecture throughout zebrafish development. Cell Rep. 2018; 24:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Long H.K., Prescott S.L., Wysocka J.. Ever-changing landscapes: transcriptional enhancers in development and evolution. Cell. 2016; 167:1170–1187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Kim T.-K., Shiekhattar R.. Architectural and functional commonalities between enhancers and promoters. Cell. 2015; 162:948–959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Andersson R., Gebhard C., Miguel-Escalada I., Hoof I., Bornholdt J., Boyd M., Chen Y., Zhao X., Schmidl C., Suzuki T. et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014; 507:455–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Engreitz J.M., Haines J.E., Perez E.M., Munson G., Chen J., Kane M., McDonel P.E., Guttman M., Lander E.S.. Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature. 2016; 539:452–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Deng W., Rupon J.W., Krivega I., Breda L., Motta I., Jahn K.S., Reik A., Gregory P.D., Rivella S., Dean A. et al. Reactivation of developmentally silenced globin genes by forced chromatin looping. Cell. 2014; 158:849–860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Levine M., Cattoglio C., Tjian R.. Looping back to leap forward: transcription enters a new era. Cell. 2014; 157:13–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Fukaya T., Lim B., Levine M.. Enhancer control of transcriptional bursting. Cell. 2016; 166:358–368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Muerdter F., Stark A.. Gene regulation: activation through space. Curr. Biol. 2016; 26:R895–R898. [DOI] [PubMed] [Google Scholar]
  • 75. Dao L.T., Galindo-Albarrán A.O., Castro-Mondragon J.A., Andrieu-Soler C., Medina-Rivera A., Souaid C., Charbonnier G., Griffon A., Vanhille L., Stephen T. et al. Genome-wide characterization of mammalian promoters with distal enhancer functions. Nat. Genet. 2017; 49:1073–1081. [DOI] [PubMed] [Google Scholar]
  • 76. Diao Y., Fang R., Li B., Meng Z., Yu J., Qiu Y., Lin K.C., Huang H., Liu T., Marina R.J. et al. A tiling-deletion-based genetic screen for cis-regulatory element identification in mammalian cells. Nat. Methods. 2017; 14:629–635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Javierre B.M., Burren O.S., Wilder S.P., Kreuzhuber R., Hill S.M., Sewitz S., Cairns J., Wingett S.W., Várnai C., Thiecke M.J. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell. 2016; 167:1369–1384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Pancaldi V., Carrillo-de Santa-Pau E., Javierre B.M., Juan D., Fraser P., Spivakov M., Valencia A., Rico D.. Integrating epigenomic data and 3D genomic structure with a new measure of chromatin assortativity. Genome Biol. 2016; 17:152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Whalen S., Truty R.M., Pollard K.S.. Enhancer–promoter interactions are encoded by complex genomic signatures on looping chromatin. Nat. Genet. 2016; 48:488–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Kvon E.Z., Kazmar T., Stampfel G., Yáñez-Cuna J.O., Pagani M., Schernhuber K., Dickson B.J., Stark A.. Genome-scale functional characterization of Drosophila developmental enhancers in vivo. Nature. 2014; 512:91–95. [DOI] [PubMed] [Google Scholar]
  • 81. Zabidi M.A., Arnold C.D., Schernhuber K., Pagani M., Rath M., Frank O., Stark A.. Enhancer–core-promoter specificity separates developmental and housekeeping gene regulation. Nature. 2015; 518:556–559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Lee K., Hsiung C. C.-S., Huang P., Raj A., Blobel G.A.. Dynamic enhancer–gene body contacts during transcription elongation. Genes Dev. 2015; 29:1992–1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Bartman C.R., Hsu S.C., Hsiung C. C.-S., Raj A., Blobel G.A.. Enhancer regulation of transcriptional bursting parameters revealed by forced chromatin looping. Mol. Cell. 2016; 62:237–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Whyte W.A., Orlando D.A., Hnisz D., Abraham B.J., Lin C.Y., Kagey M.H., Rahl P.B., Lee T.I., Young R.A.. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013; 153:307–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Ulianov S.V., Khrameeva E.E., Gavrilov A.A., Flyamer I.M., Kos P., Mikhaleva E.A., Penin A.A., Logacheva M.D., Imakaev M.V., Chertovich A. et al. Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res. 2016; 26:70–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Deplancke B., Alpern D., Gardeux V.. The genetics of transcription factor DNA binding variation. Cell. 2016; 166:538–554. [DOI] [PubMed] [Google Scholar]
  • 87. Albert F.W., Kruglyak L.. The role of regulatory variation in complex traits and disease. Nat. Rev. Genet. 2015; 16:197–212. [DOI] [PubMed] [Google Scholar]
  • 88. Boyle E.A., Li Y.I., Pritchard J.K.. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017; 169:1177–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Yvert G., Brem R.B., Whittle J., Akey J.M., Foss E., Smith E.N., Mackelprang R., Kruglyak L.. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat. Genet. 2003; 35:57–64. [DOI] [PubMed] [Google Scholar]
  • 90. Brynedal B., Choi J., Raj T., Bjornson R., Stranger B.E., Neale B.M., Voight B.F., Cotsapas C.. Large-scale trans-eQTLs affect hundreds of transcripts and mediate patterns of transcriptional co-regulation. Am. J. Hum. Genet. 2017; 100:581–591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. The GTEx Consortium. Genetic effects on gene expression across human tissues. Nature. 2017; 550:204–213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92. Yao C., Joehanes R., Johnson A.D., Huan T., Liu C., Freedman J.E., Munson P.J., Hill D.E., Vidal M., Levy D.. Dynamic role of trans regulation of gene expression in relation to complex traits. Am. J. Hum. Genet. 2017; 100:571–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Platig J., Castaldi P.J., DeMeo D., Quackenbush J.. Bipartite community structure of eQTLs. PLoS Comput. Biol. 2016; 12:e1005033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Nasmyth K. Cohesin: a catenase with separate entry and exit gates. Nat. Cell Biol. 2011; 13:1170–1177. [DOI] [PubMed] [Google Scholar]
  • 95. Brackley C.A., Johnson J., Michieletto D., Morozov A.N., Nicodemi M., Cook P.R., Marenduzzo D.. Non-equilibrium chromosome looping via molecular slip-links. Phys. Rev. Lett. 2017; 119:138101. [DOI] [PubMed] [Google Scholar]
  • 96. Racko D., Benedetti F., Dorier J., Stasiak A.. Transcription-induced supercoiling as the driving force of chromatin loop extrusion during formation of TADs in interphase chromosomes. Nucleic Acids Res. 2018; 46:1648–1660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Teves S.S., An L., Hansen A.S., Xie L., Darzacq X., Tjian R.. A dynamic mode of mitotic bookmarking by transcription factors. Elife. 2016; 5:1–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Sugaya K., Vigneron M., Cook P.R.. Mammalian cell lines expressing functional RNA polymerase II tagged with the green fluorescent protein. J. Cell Sci. 2000; 113:2679–2683. [DOI] [PubMed] [Google Scholar]
  • 99. Cisse I.I., Izeddin I., Causse S.Z., Boudarene L., Senecal A., Muresan L., Dugast-Darzacq C., Hajj B., Dahan M., Darzacq X.. Real-time dynamics of RNA polymerase II clustering in live human cells. Science. 2013; 341:664–667. [DOI] [PubMed] [Google Scholar]
  • 100. Chen X., Wei M., Zheng M.M., Zhao J., Hao H., Chang L., Xi P., Sun Y.. Study of RNA polymerase II clustering inside live-cell nuclei using Bayesian nanoscopy. ACS Nano. 2016; 10:2447–2454. [DOI] [PubMed] [Google Scholar]
  • 101. Cho W.-K., Jayanth N., English B.P., Inoue T., Andrews J.O., Conway W., Grimm J.B., Spille J.-H., Lavis L.D., Lionnet T. et al. RNA Polymerase II cluster dynamics predict mRNA output in living cells. Elife. 2016; 5:e13617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102. Cho W.-K., Jayanth N., Mullen S., Tan T.H., Jung Y.J., Cissé I.I.. Super-resolution imaging of fluorescently labeled, endogenous RNA Polymerase II in living cells with CRISPR/Cas9-mediated gene editing. Sci. Rep. 2016; 6:35949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103. Kimura H., Tao Y., Roeder R.G., Cook P.R.. Quantitation of RNA polymerase II and its transcription factors in an HeLa cell: little soluble holoenzyme but significant amounts of polymerases attached to the nuclear substructure. Mol. Cell. Biol. 1999; 19:5383–5392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Goldman S.R., Ebright R.H., Nickels B.E.. Direct detection of abortive RNA transcripts in vivo. Science. 2009; 324:927–928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Ehrensberger A.H., Kelly G.P., Svejstrup J.Q.. Mechanistic interpretation of promoter-proximal peaks and RNAPII density maps. Cell. 2013; 154:713–715. [DOI] [PubMed] [Google Scholar]
  • 106. Day D.S., Zhang B., Stevens S.M., Ferrari F., Larschan E.N., Park P.J., Pu W.T.. Comprehensive analysis of promoter-proximal RNA polymerase II pausing across mammalian cell types. Genome Biol. 2016; 17:120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Xu M., Cook P.R.. The role of specialized transcription factories in chromosome pairing. Biochim. Biophys. Acta. 2008; 1783:2155–2160. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES