Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 9;2(11):100334.
doi: 10.1016/j.crmeth.2022.100334. eCollection 2022 Nov 21.

Systematic generation and imaging of tandem repeats reveal base-pairing properties that promote RNA aggregation

Affiliations

Systematic generation and imaging of tandem repeats reveal base-pairing properties that promote RNA aggregation

Atagun U Isiktas et al. Cell Rep Methods. .

Abstract

A common pathological feature of RNAs containing expanded repeat sequences is their propensity to aggregate in cells. While some repeat RNA aggregates have been shown to cause toxicity by sequestering RNA-binding proteins, the molecular mechanism of aggregation remains unclear. Here, we devised an efficient method to generate long tandem repeat DNAs de novo and applied it to systematically determine the sequence features underlying RNA aggregation. Live-cell imaging of repeat RNAs indicated that aggregation was promoted by multivalent RNA-RNA interactions via either canonical or noncanonical base pairs. While multiple runs of two consecutive base pairs were sufficient, longer runs of base pairs such as those formed by GGGGCC hexanucleotide repeats further enhanced aggregation. In summary, our study provides a unifying model for the molecular basis of repeat RNA aggregation and a generalizable approach for identifying the sequence and structural determinants underlying the distinct properties of repeat DNAs and RNAs.

Keywords: G-quadruplex; RNA aggregation; RNA base-pairing; RNA imaging; RNA structure; microsatellite; multivalent interactions; repeat expansion disorder; short tandem repeat.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Generation of long tandem repeats by RepEx-PCR (A) Schematic illustration of RepEx-PCR. (B) Length distribution of RepEx-PCR products. (C) Sanger sequencing results of a ∼100x CAG construct, showing the junction between 5ʹ flanking region and repeat insert (top) and a magnified view of CAG repeats (bottom). (D) Schematic illustration of a dual-luciferase reporter for detecting frameshifted CAG repeats. (E) Relative RLuc/FLuc activities of out-of-frame 0x or 100x CAG constructs. Data are shown as mean ± SD, #p > 0.1; ∗p < 0.05, two-tailed Student’s t test. (F) Percentage of clones with CAG or GGGGCC repeats inserted in the sense and antisense direction. (G) Restriction enzyme digestion of clones containing antisense (GGCCCC) repeats inserted in the original or ori-inverted vector.
Figure 2
Figure 2
Sequence determinants of CAG repeat RNA aggregation (A) Schematic illustration of the 12x MS2-tagged doxycycline-inducible repeat RNA imaging construct. (B) Representative images of cells expressing 20x (left) and ∼200x (right) CAG repeat RNAs. Scale bar, 10 μm. (C) Potential base-pairing patterns (top) and representative images (bottom) of CAG sequence variants. Substituted nucleotides are highlighted in orange. See Figure 3C for quantification. Scale bar, 10 μm.
Figure 3
Figure 3
Expanded analysis of trinucleotide repeat RNA aggregation (A) List of all of 20 non-redundant triplet repeats. Homopolymers are indicated in gray. (B) Representative images of cells expressing each of 15 triplet repeats not included in Figure 2. Scale bar, 10 μm. (C) Relationship between predicted base pairs and foci forming ability. p value, two-tailed Mann-Whitney U test.
Figure 4
Figure 4
Base-pairing properties in repeat RNA aggregation (A) Potential base-pairing pattern of ACAG repeats. The added A nucleotides are highlighted in orange. (B) Representative image (left) and quantification (right) of cells expressing 150x ACAG repeat RNA. Scale bar, 10 μm. Quantification data are shown as mean ± SD, ∗p < 0.05, two-tailed Student’s t test. (C) Predicted G-quadruplex structure of GGGAA repeats. (D) Representative image (left) and quantification (right) of cells expressing 120x GGGAA or 200x CAG repeat RNAs, treated with DMSO or PDS. Scale bar, 10 μm. Quantification data are shown as mean ± SD, #p > 0.1; ∗p < 0.05, two-tailed Student’s t test.
Figure 5
Figure 5
Aggregation of ALS/FTD-associated GGGGCC repeat RNAs (A) Potential base-pairing patterns of GGGGCC repeats. (B) Representative images of cells expressing 10x (left) or 90x (right) GGGGCC repeat RNAs. Scale bar, 10 μm. (C) Potential base-pairing patterns (top) and representative images (bottom) of cells expressing each GGGGCC variant. Substituted nucleotides are highlighted in orange. Scale bar, 10 μm. (D) Quantification of foci + cells expressing each GGGGCC variant. Quantification data are shown as mean ± SD, #p > 0.1; ∗p < 0.05, two-tailed Student’s t test.

Similar articles

Cited by

References

    1. Biffi G., Di Antonio M., Tannahill D., Balasubramanian S. Visualization and selective chemical targeting of RNA G-quadruplex structures in the cytoplasm of human cells. Nat. Chem. 2014;6:75–80. doi: 10.1038/nchem.1805. - DOI - PMC - PubMed
    1. Conlon E.G., Lu L., Sharma A., Yamazaki T., Tang T., Shneider N.A., Manley J.L. The C9ORF72 GGGGCC expansion forms RNA G-quadruplex inclusions and sequesters hnRNP H to disrupt splicing in ALS brains. Elife. 2016;5:e17820. doi: 10.7554/eLife.17820. - DOI - PMC - PubMed
    1. Dansithong W., Paul S., Comai L., Reddy S. MBNL1 is the primary determinant of focus formation and aberrant insulin receptor splicing in DM1. J. Biol. Chem. 2005;280:5773–5780. doi: 10.1074/jbc.M410781200. - DOI - PubMed
    1. DeJesus-Hernandez M., Mackenzie I.R., Boeve B.F., Boxer A.L., Baker M., Rutherford N.J., Nicholson A.M., Finch N.A., Flynn H., Adamson J., et al. Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron. 2011;72:245–256. doi: 10.1016/j.neuron.2011.09.011. - DOI - PMC - PubMed
    1. Didiot M.C., Ferguson C.M., Ly S., Coles A.H., Smith A.O., Bicknell A.A., Hall L.M., Sapp E., Echeverria D., Pai A.A., et al. Nuclear localization of huntingtin mRNA is specific to cells of neuronal origin. Cell Rep. 2018;24:2553–2560.e5. doi: 10.1016/j.celrep.2018.07.106. - DOI - PMC - PubMed

Publication types