Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 20:12:e84043.
doi: 10.7554/eLife.84043.

Native functions of short tandem repeats

Affiliations

Native functions of short tandem repeats

Shannon E Wright et al. Elife. .

Abstract

Over a third of the human genome is comprised of repetitive sequences, including more than a million short tandem repeats (STRs). While studies of the pathologic consequences of repeat expansions that cause syndromic human diseases are extensive, the potential native functions of STRs are often ignored. Here, we summarize a growing body of research into the normal biological functions for repetitive elements across the genome, with a particular focus on the roles of STRs in regulating gene expression. We propose reconceptualizing the pathogenic consequences of repeat expansions as aberrancies in normal gene regulation. From this altered viewpoint, we predict that future work will reveal broader roles for STRs in neuronal function and as risk alleles for more common human neurological diseases.

Keywords: Fragile X; autism; disease mechanism; gene expression; genetics; genomic instability; genomics; neurodegeneration.

PubMed Disclaimer

Conflict of interest statement

SW No competing interests declared, PT Dr Todd served as a consultant to Denali Therapeutics and holds a shared patent on ASOs developed with Ionis Pharmaceuticals

Figures

Figure 1.
Figure 1.. Native functions of genomic repeats.
Repeats in DNA influence larger 3D chromatin structures, regulate binding of nucleosomes and (de)acetylases and (de)methylases. They also influence transcription factor binding and polymerase processivity to affect downstream RNA production. Repeats in RNA can affect pre-mRNA processing such as alternative splicing and can affect RNA binding protein function through direct or indirect sequestration. Repeats in 3’ UTRs serve as localization signals, directing mRNA transport. Repeats in 5’ UTRs regulate translational output by impeding ribosome processivity. Repeating units in proteins can provide structural flexibility within a protein or serve as binding sites for the formation of multi-protein complexes.
Figure 2.
Figure 2.. STR-associated toxicity in Repeat Expansion Disorders.
Repeat expansions can alter global 3D chromatin structure, and influence transcription via blocking or enhancing binding of nucleosomes, (de)acetylases, (de)methylases, and transcription factors. Expanded repeats may also impede polymerase processivity. In some cases, elevated transcription of repeat expansion RNA can lead to depletion of RNA-binding proteins. Depletion of these proteins can impact many processes to which they contribute, including pre-mRNA splicing and processing, and mRNA localization. Expanded repeat RNA and bound RBPs can also aggregate into RNA foci, causing toxicity. Expanded repeat RNA can stall translational complexes, leading to repeat-associated non-AUG (RAN) translation, and contribute to the production of polymeric proteins. Polymeric proteins are aggregate prone. Longer polymeric stretches in native proteins may also cause dysfunction by preventing proper protein folding or causing the folded protein to mis-localize within the cell.
Figure 3.
Figure 3.. The effects of STR variation on local gene expressivity.
(A) Bi-allelic variation in a gene through single nucleotide polymorphisms often result in small and discrete differences in gene expression, offering limited phenotypic differences across a population with slow evolutionary timescales. In contrast, STRs in promoters and 5’UTRs can influence protein expression over a broader dynamic range, with an inverse correlation between repeat length and protein output within transcribed regions and with differential effects on transcription dependent on the repeat and local epigenetic context. Unstable repeats change rapidly from generation to generation (and even within an individual through somatic variation), creating a mechanism by which mRNA or protein expression can vary broadly and subtly across a population, offering greater genetic and phenotypic diversity and a greater propensity for disease-causing aberrancies at the extremes. (B) Predicted effects of CGG repeat length on FMR1 gene expression. CGG repeat length influences FMR1 promoter epigenetic state (more open chromatin with initial expansion, then DNA methylation and closed chromatin at >200 CGG repeats), FMR1 mRNA expression, and FMRP protein expression across the polymorphic range.

Similar articles

Cited by

References

    1. Abruzzi KC, Lacadie S, Rosbash M. Biochemical analysis of TREX complex recruitment to intronless and intron-containing yeast genes. The EMBO Journal. 2004;23:2620–2631. doi: 10.1038/sj.emboj.7600261. - DOI - PMC - PubMed
    1. Allingham-Hawkins DJ, Babul-Hirji R, Chitayat D, Holden JJ, Yang KT, Lee C, Hudson R, Gorwill H, Nolin SL, Glicksman A, Jenkins EC, Brown WT, Howard-Peebles PN, Becchi C, Cummings E, Fallon L, Seitz S, Black SH, Vianna-Morgante AM, Costa SS, Otto PA, Mingroni-Netto RC, Murray A, Webb J, Vieri F. Fragile X premutation is a significant risk factor for premature ovarian failure: the International collaborative POF in fragile X study -- preliminary data. American Journal of Medical Genetics. 1999;83:322–325. - PMC - PubMed
    1. Andken BB, Lim I, Benson G, Vincent JJ, Ferenc MT, Heinrich B, Jarzylo LA, Man HY, Deshler JO. 3’-Utr SIRF: a database for identifying clusters of whort interspersed repeats in 3’ untranslated regions. BMC Bioinformatics. 2007;8:274. doi: 10.1186/1471-2105-8-274. - DOI - PMC - PubMed
    1. Andrade MA, Perez-Iratxeta C, Ponting CP. Protein repeats: structures, functions, and evolution. Journal of Structural Biology. 2001;134:117–131. doi: 10.1006/jsbi.2001.4392. - DOI - PubMed
    1. Annear DJ, Vandeweyer G, Sanchis-Juan A, Raymond FL, Kooy RF. Non-Mendelian inheritance patterns and extreme deviation rates of CGG repeats in autism. Genome Research. 2022;32:1967–1980. doi: 10.1101/gr.277011.122. - DOI - PMC - PubMed

Publication types