Intrinsically unstructured proteins evolve by repeat expansion
- PMID: 12938174
- DOI: 10.1002/bies.10324
Intrinsically unstructured proteins evolve by repeat expansion
Abstract
The proportion of the genome encoding intrinsically unstructured proteins increases with the complexity of organisms, which demands specific mechanism(s) for generating novel genetic material of this sort. Here it is suggested that one such mechanism is the expansion of internal repeat regions, i.e., coding micro- and minisatellites. An analysis of 126 known unstructured sequences shows the preponderance of repeats: the percentage of proteins with tandemly repeated short segments is much higher in this class (39%) than earlier reported for all Swiss-Prot (14%), yeast (18%) or human (28%) proteins. Furthermore, prime examples, such as salivary proline-rich proteins, titin, eukaryotic RNA polymerase II, the prion protein and several others, demonstrate that the repetitive segments carry fundamental function in these proteins. In addition, their repeat numbers show functionally significant interspecies variation and polymorphism, which underlines that these regions have been shaped by intense evolutionary activity. In all, the major point of this paper is that the genetic instability of repetitive regions combined with the structurally and functionally permissive nature of unstructured proteins has powered the extension and possible functional expansion of this newly recognized protein class.
Copyright 2003 Wiley Periodicals, Inc.
Similar articles
-
Genomic and evolutionary insights into genes encoding proteins with single amino acid repeats.Mol Biol Evol. 2006 Jul;23(7):1357-69. doi: 10.1093/molbev/msk022. Epub 2006 Apr 17. Mol Biol Evol. 2006. PMID: 16618963
-
A census of protein repeats.J Mol Biol. 1999 Oct 15;293(1):151-60. doi: 10.1006/jmbi.1999.3136. J Mol Biol. 1999. PMID: 10512723
-
Evolution of tRNA-like sequences and genome variability.Gene. 2004 Jun 23;335:57-71. doi: 10.1016/j.gene.2004.03.005. Gene. 2004. PMID: 15194190
-
Molecular evolution of the RNA polymerase II CTD.Trends Genet. 2008 Jun;24(6):289-96. doi: 10.1016/j.tig.2008.03.010. Epub 2008 May 9. Trends Genet. 2008. PMID: 18472177 Review.
-
Simple sequence repeats in proteins and their significance for network evolution.Gene. 2005 Jan 17;345(1):113-8. doi: 10.1016/j.gene.2004.11.023. Epub 2004 Dec 15. Gene. 2005. PMID: 15716087 Review.
Cited by
-
Patterned sequence in the transcriptome of vascular plants.BMC Genomics. 2007 Jun 15;8:173. doi: 10.1186/1471-2164-8-173. BMC Genomics. 2007. PMID: 17573970 Free PMC article.
-
Natural selection drives the accumulation of amino acid tandem repeats in human proteins.Genome Res. 2010 Jun;20(6):745-54. doi: 10.1101/gr.101261.109. Epub 2010 Mar 24. Genome Res. 2010. PMID: 20335526 Free PMC article.
-
Natural variation of the amino-terminal glutamine-rich domain in Drosophila argonaute2 is not associated with developmental defects.PLoS One. 2010 Dec 17;5(12):e15264. doi: 10.1371/journal.pone.0015264. PLoS One. 2010. PMID: 21253006 Free PMC article.
-
A comparative proteomic analysis of the simple amino acid repeat distributions in Plasmodia reveals lineage specific amino acid selection.PLoS One. 2009 Jul 14;4(7):e6231. doi: 10.1371/journal.pone.0006231. PLoS One. 2009. PMID: 19597555 Free PMC article.
-
Classification of intrinsically disordered regions and proteins.Chem Rev. 2014 Jul 9;114(13):6589-631. doi: 10.1021/cr400525m. Epub 2014 Apr 29. Chem Rev. 2014. PMID: 24773235 Free PMC article. Review. No abstract available.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases