Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep 30;6(9):e1000952.
doi: 10.1371/journal.pcbi.1000952.

HIV promoter integration site primarily modulates transcriptional burst size rather than frequency

Affiliations

HIV promoter integration site primarily modulates transcriptional burst size rather than frequency

Ron Skupsky et al. PLoS Comput Biol. .

Abstract

Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. HIV LTR expression distributions are wide and highly skewed.
A) Experimental system: Sample clonal histograms, spanning a range of ‘dim’ and ‘bright’ integrations, with autofluorescence included for comparison, represent fluorescence measurements on approximately 4000 cells. Smooth curves through each are the result of an optimized low-pass Fourier filter, and are used for model fitting. All measurements are given in cytometer-based relative fluorescent units (RFU). B) Trend in distribution-shape variation: Log-log linear regression coefficients quantify trends in distribution-shape variation as a power-law relationship between distribution variance (σ2) and mean (μ): formula image; formula image formula image (R2 = 0.89, coefficients ±95% confidence). Dashed lines demonstrate Poisson-like scaling (‘Poisson’, α = 1) and over-all distribution scaling (‘Scaling’, α = 2). C) Characteristic distribution shape: Smoothed, autoflorescence-deconvolved histograms were shifted by a constant fluorescence to a fixed mean (specified by the median over the set of integration clones, μ0), and fluorescence values were scaled about that mean according to the variance regression in 1B (inset), by a factor formula image with α = 1.7 (thin curves). The grey curve averages the transformed distributions and represents a ‘typical’ HIV-LTR expression profile that is wide (coefficient of variation = σ0/μ0∼60%) and highly skewed.
Figure 2
Figure 2. Transcript production in bursts qualitatively explains HIV LTR distribution shapes and variation with integration site.
A) Model schematic. φa/r = active/repressed gene state, T = Transcript, P = Protein, X = Degraded, κ = Probability/unit time. Bold parameters are considered to be integration-site dependent, while others were measured separately and fixed for all clones. All rates are measured relative to the transcript degradation rate, κt. B) Model regimes depend on the ratio of gene-state to transcriptional dynamics: ‘Fast’ (formula image), ‘Slow’ (formula image), ‘Intermediate’ (formula image), ‘Bursting’ (formula image). Transcript production rates (κt+) for sample distributions are set to reproduce the same mean number of transcript copies (left) and protein copies (right) at steady state as predicted for the ‘typical’ experimental distribution in Fig. 1C; autoflourescence is not included, and distributions are binned on a linear RFU scale for comparison. C) Distribution shape-variation in the bursting regime: burst-frequency variation (κa) leads to an approximate Poisson-like shape-variation, burst-size (b) yields an approximate distribution-scaling shape variation, and the combination with formula image (combined) gives a shape-variation most closely resembling the experimental data (compare Fig. 1B). Insets are sample, log-binned distributions with varying burst size or frequency. The fixed parameter in each sample is set to the value that approximately reproduces the ‘typical’ distribution shape in Fig. 1C.
Figure 3
Figure 3. Transcriptional bursts are short, but only an upper bound on their duration can be resolved.
A) Sample fits for several fixed values of burst duration, τ (measured relative to the transcript degradation time): experimental distribution (solid curve with 95% confidence region in grey); optimal fit at small τ (long dash); fits for larger formula image (short dash), demonstrating increased deviations (τ increasing along arrow in for inset). B) Relative fit deviation, Devr decreases for shorter active duration τ, for each clone (solid lines), and are optimal (Devr = DevrOpt) when burst durations are shortest (i.e. in the bursting regime). DevrDevrOpt = 1 (dashed line) is considered a cut-off, beyond which fit quality is significantly worse than the optimum, specifying a distinguishability cut-off upper-bound on τ, marked by the intersection of dashed line and the solid lines and referred to as formula image for each clone. C) Resolution of bursting dynamics: Calculated upper-bound active-duration (formula image) and optimal transcriptional burst size (bOpt) for each clone. Predicted large transcriptional bursts (bOpt≫1) identify clones for which the inferred transcriptional dynamics differ significantly from continuous transcription at a single fixed rate, and small formula image indicates good resolution of short bursts from less ‘noisy’ dynamics.
Figure 4
Figure 4. Modulation of transcriptional bursts by integration site.
Best-fit transcriptional burst size (b) and burst frequency (κa) that minimize the relative fit deviation (Devr) were calculated at formula image (which specifies a short active duration that was nearly optimal for all clones). Error bars represent the maximum 95% confidence interval for simultaneous parameter variations that increase Devr by 1. Log-log regression coefficients represent power-law scaling of fit parameters with distribution mean (μ, measured in cytometer RFU), of the form formula image (x = b or κa), and are given with 95% confidence intervals. A) b: α = 0.76±0.14; β = 0.13±0.2; R2 = 0.66. B) κa: α = 0.2±0.15; β = −0.5±0.2; R2 = 0.2.
Figure 5
Figure 5. Active-fraction variation distinguishes modes of transcriptional regulation.
The best-fit restriction of τ below τMax in Fig. 3 specifies an upper bound on the predicted fraction of cells with the LTR in the active state (formula image), which is marked by bars for each clone. Each Mode of integration-site modulation of transcriptional dynamics leads to a different expected variation of formula image that distinguishes them. For Mode 1, active-state stability varies over integration clones, with the active-state transcription rate fixed (formula image was used for this example), while for Mode 2 the active-state transcription rate varies over integrations, with the active duration fixed (formula image was used for this example).
Figure 6
Figure 6. Basal promoter fluctuations as determinants of infected-cell fate.
Possible decomposition of the ‘space’ of basal burst-parameters inferred by the current analysis into ranges of parameter combinations that, in the presence of positive feedback from Tat, may lead to active viral replication vs. latent fates. Region I: Basal transcription pattern never leads to Tat-transactivation. Region II: Fast transactivation always leads to a stable highly expressing state. Region III: Bimodal expression patterns, where large fluctuations in basal transcriptional bursting can infrequently drive transitions from basal to transactivated states. Inset histograms demonstrate representative expression patterns for single-integration clones of a similar vector that includes Tat , , and region boundaries are hypothetical.

Similar articles

Cited by

References

    1. Tuckwell HC, Shipman PD, Perelson AS. The probability of HIV infection in a new host and its reduction with microbicides. Math Biosci. 2008;214:81–86. - PubMed
    1. Salazar-Gonzalez JF, Salazar MG, Keele BF, Learn GH, Giorgi EE, et al. Genetic identity, biological phenotype, and evolutionary pathways of transmitted/founder viruses in acute and early HIV-1 infection. J Exp Med. 2009;206:1273–1289. - PMC - PubMed
    1. Wang GP, Ciuffi A, Leipzig J, Berry CC, Bushman FD. HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res. 2007;17:1186–1194. - PMC - PubMed
    1. Weinberger LS, Burnett JC, Toettcher JE, Arkin AP, Schaffer DV. Stochastic gene expression in a lentiviral positive-feedback loop: HIV-1 Tat fluctuations drive phenotypic diversity. Cell. 2005;122:169–182. - PubMed
    1. Marks AJ, Pillay D, McLean AR. The effect of intrinsic stochasticity on transmitted HIV drug resistance patterns. J Theor Biol. 2009;262:1–13. - PubMed

Publication types