Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids

doi:10.4161/cc.8.11.8580

. 2009 Jun 1;8(11):1698-710.

doi: 10.4161/cc.8.11.8580. Epub 2009 Jun 27.

Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids

Lakshminarayan M Iyer¹, Mamta Tahiliani, Anjana Rao, L Aravind

Affiliations

PMID: 19411852
PMCID: PMC2995806
DOI: 10.4161/cc.8.11.8580

Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids

Lakshminarayan M Iyer et al. Cell Cycle. 2009.

. 2009 Jun 1;8(11):1698-710.

doi: 10.4161/cc.8.11.8580. Epub 2009 Jun 27.

Authors

Lakshminarayan M Iyer¹, Mamta Tahiliani, Anjana Rao, L Aravind

Affiliation

¹ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

PMID: 19411852
PMCID: PMC2995806
DOI: 10.4161/cc.8.11.8580

Abstract

Modified bases in nucleic acids present a layer of information that directs biological function over and beyond the coding capacity of the conventional bases. While a large number of modified bases have been identified, many of the enzymes generating them still remain to be discovered. Recently, members of the 2-oxoglutarate- and iron(II)-dependent dioxygenase super-family, which modify diverse substrates from small molecules to biopolymers, were predicted and subsequently confirmed to catalyze oxidative modification of bases in nucleic acids. Of these, two distinct families, namely the AlkB and the kinetoplastid base J binding proteins (JBP) catalyze in situ hydroxylation of bases in nucleic acids. Using sensitive computational analysis of sequences, structures and contextual information from genomic structure and protein domain architectures, we report five distinct families of 2-oxoglutarate- and iron(II)-dependent dioxygenase that we predict to be involved in nucleic acid modifications. Among the DNA-modifying families, we show that the dioxygenase domains of the kinetoplastid base J-binding proteins belong to a larger family that includes the Tet proteins, prototyped by the human oncogene Tet1, and proteins from basidiomycete fungi, chlorophyte algae, heterolobosean amoeboflagellates and bacteriophages. We present evidence that some of these proteins are likely to be involved in oxidative modification of the 5-methyl group of cytosine leading to the formation of 5-hydroxymethylcytosine. The Tet/JBP homologs from basidiomycete fungi such as Laccaria and Coprinopsis show large lineage-specific expansions and a tight linkage with genes encoding a novel and distinct family of predicted transposases, and a member of the Maelstrom-like HMG family. We propose that these fungal members are part of a mobile transposon. To the best of our knowledge, this is the first report of a eukaryotic transposable element that encodes its own DNA-modification enzyme with a potential regulatory role. Through a wider analysis of other poorly characterized DNA-modifying enzymes we also show that the phage Mu Mom-like proteins, which catalyze the N6-carbamoylmethylation of adenines, are also linked to diverse families of bacterial transposases, suggesting that DNA modification by transposable elements might have a more general presence than previously appreciated. Among the other families of 2-oxoglutarate- and iron(II)-dependent dioxygenases identified in this study, one which is found in algae, is predicted to mainly comprise of RNA-modifying enzymes and shows a striking diversity in protein domain architectures suggesting the presence of RNA modifications with possibly unique adaptive roles. The results presented here are likely to provide the means for future investigation of unexpected epigenetic modifications, such as hydroxymethyl cytosine, that could profoundly impact our understanding of gene regulation and processes such as DNA demethylation.

PubMed Disclaimer

Figures

**Figure 1**
Multiple alignment of selected examples of the newly predicted families of the nucleic-acid-modifying 2-oxoglutarate- and iron(II)-dependent dioxygenase superfamily. Protein sequences are represented by their gene names, species names and GenBank index numbers (where available). Temporary gene names were assigned for predicted proteins from Naegleria, Aureococcus, Daphnia and Micromonas. The full length protein sequences from these are available in the Supplementary material. The coloring scheme and consensus abbreviations are shown in the key. Family names are shown to the right of the alignment. The distinct inserts of the TET/JBP and the AlkB families are shown within boxes. The key conserved residues defining the 2OGFeDO protein have been marked below the alignment. The consensus secondary structure derived from crystal structures of characterized members of the superfamily is shown above.

**Figure 2**
Representative domain architectures of the newly identified versions of nucleic-acid-modifying 2OGFeDO proteins. Architectures are arranged by their phylogenetic and family affinities, and are labeled by their gene and species names. Domain architectures within a family are boxed. Domains are typically denoted by their standard names. Non-standard domain nomenclatures are clarified in the inset key at the bottom of the figure. Operons are shown as arrows where the arrow head points from the 5′ to the 3′ direction of the coding frame of the gene. Gene neighborhoods are labeled by the gene coding for the 2OGFeDO protein.

**Figure 3**
Genomic organization and domain architectures of predicted transposons encoding DNA-modifying enzymes. Genes are depicted as arrows with the arrow head pointing from the 5′ to the 3′ direction of the coding sequence. Gene neighborhoods of the predicted transposase are typically labeled with the gene name of the 2OGFeDO containing protein, the species name and the gi number. In potential fragmentary elements where the 2OGFeDO is absent, the gene neighborhood is labeled with the gene name of the predicted transposase-containing gene. The key at the bottom of the figure explains non-standard domain and gene names, while other gene and domains names are as commonly used in literature.

**Figure 4**
Multiple alignment of the proposed catalytic domain of the transposase of the novel predicted transposon encoding 2OGFeDO proteins. Protein sequences are represented by their gene names, species names and GenBank index numbers. The predicted secondary structure is shown above the alignment. The coloring scheme and consensus abbreviations are shown in the key. Conserved residues defining the catalytic site of the predicted transposase are marked below the alignment.

See this image and copyright information in PMC

Cited by

Tet family of 5-methylcytosine dioxygenases in mammalian development.
Zhao H, Chen T. Zhao H, et al. J Hum Genet. 2013 Jul;58(7):421-7. doi: 10.1038/jhg.2013.63. Epub 2013 May 30. J Hum Genet. 2013. PMID: 23719188 Free PMC article. Review.
Distinct and stage-specific contributions of TET1 and TET2 to stepwise cytosine oxidation in the transition from naive to primed pluripotency.
Mulholland CB, Traube FR, Ugur E, Parsa E, Eckl EM, Schönung M, Modic M, Bartoschek MD, Stolz P, Ryan J, Carell T, Leonhardt H, Bultmann S. Mulholland CB, et al. Sci Rep. 2020 Jul 21;10(1):12066. doi: 10.1038/s41598-020-68600-3. Sci Rep. 2020. PMID: 32694513 Free PMC article.
Alterations of metabolic genes and metabolites in cancer.
Oermann EK, Wu J, Guan KL, Xiong Y. Oermann EK, et al. Semin Cell Dev Biol. 2012 Jun;23(4):370-80. doi: 10.1016/j.semcdb.2012.01.013. Epub 2012 Jan 28. Semin Cell Dev Biol. 2012. PMID: 22306135 Free PMC article. Review.
DNA methylation, its mediators and genome integrity.
Meng H, Cao Y, Qin J, Song X, Zhang Q, Shi Y, Cao L. Meng H, et al. Int J Biol Sci. 2015 Apr 8;11(5):604-17. doi: 10.7150/ijbs.11218. eCollection 2015. Int J Biol Sci. 2015. PMID: 25892967 Free PMC article. Review.
Tet family proteins and 5-hydroxymethylcytosine in development and disease.
Tan L, Shi YG. Tan L, et al. Development. 2012 Jun;139(11):1895-902. doi: 10.1242/dev.070771. Development. 2012. PMID: 22569552 Free PMC article. Review.

See all "Cited by" articles

References

1. Bloomfield VA, Crothers DM, Tinoco I., Jr . Nucleic Acids: Structures, Properties and Functions. Sausalito, CA: University Science Books; 2000.
1. Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res. 2002;30:1427–64. - PMC - PubMed
1. Czerwoniec A, Dunin-Horkawicz S, Purta E, Kaminska KH, Kasprzak JM, Bujnicki JM, et al. MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res. 2009;37:118–21. - PMC - PubMed
1. Roberts RJ, Vincze T, Posfai J, Macelis D. REBASE—enzymes and genes for DNA restriction and modification. Nucleic Acids Res. 2007;35:269–70. - PMC - PubMed
1. Pfeifer GP. Mutagenesis at methylated CpG sequences. Curr Top Microbiol Immunol. 2006;301:259–81. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Molecular Biology Databases
- REBASE - The Restriction Enzyme Database
Research Materials
- NCI CPTC Antibody Characterization Program

[1] Bloomfield VA, Crothers DM, Tinoco I., Jr . Nucleic Acids: Structures, Properties and Functions. Sausalito, CA: University Science Books; 2000.

[2] Bloomfield VA, Crothers DM, Tinoco I., Jr . Nucleic Acids: Structures, Properties and Functions. Sausalito, CA: University Science Books; 2000.

[3] Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res. 2002;30:1427–64. - PMC - PubMed

[4] Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res. 2002;30:1427–64. - PMC - PubMed

[5] Czerwoniec A, Dunin-Horkawicz S, Purta E, Kaminska KH, Kasprzak JM, Bujnicki JM, et al. MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res. 2009;37:118–21. - PMC - PubMed

[6] Czerwoniec A, Dunin-Horkawicz S, Purta E, Kaminska KH, Kasprzak JM, Bujnicki JM, et al. MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res. 2009;37:118–21. - PMC - PubMed

[7] Roberts RJ, Vincze T, Posfai J, Macelis D. REBASE—enzymes and genes for DNA restriction and modification. Nucleic Acids Res. 2007;35:269–70. - PMC - PubMed

[8] Roberts RJ, Vincze T, Posfai J, Macelis D. REBASE—enzymes and genes for DNA restriction and modification. Nucleic Acids Res. 2007;35:269–70. - PMC - PubMed

[9] Pfeifer GP. Mutagenesis at methylated CpG sequences. Curr Top Microbiol Immunol. 2006;301:259–81. - PubMed

[10] Pfeifer GP. Mutagenesis at methylated CpG sequences. Curr Top Microbiol Immunol. 2006;301:259–81. - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids

Affiliation

Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Research Materials

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Research Materials