Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;5(8):1443-56.
doi: 10.1093/gbe/evt103.

Cross-species analysis of genic GC3 content and DNA methylation patterns

Affiliations

Cross-species analysis of genic GC3 content and DNA methylation patterns

Tatiana Tatarinova et al. Genome Biol Evol. 2013.

Abstract

The GC content in the third codon position (GC(3)) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC(3) was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC(3) from 5' to 3'. Moreover, GC(3)-rich genes are predominant in certain gene classes and are enriched in CpG dinucleotides that are potential targets for methylation. Based on the GC(3) bimodal distribution we hypothesize that GC(3) has a regulatory role involving methylation and gene expression. To test that hypothesis, we selected diverse taxa (rice, thale cress, bee, and human) that varied in the modality of their GC(3) distribution and tested the association between GC(3), DNA methylation, and gene expression. We examine the relationship between cytosine methylation levels and GC(3), gene expression, genome signature, gene length, and other gene compositional features. We find a strong negative correlation (Pearson's correlation coefficient r = -0.67, P value < 0.0001) between GC(3) and genic CpG methylation. The comparison between 5'-3' gradients of CG(3)-skew and genic methylation for the taxa in the study suggests interplay between gene-body methylation and transcription-coupled cytosine deamination effect. Compositional features are correlated with methylation levels of genes in rice, thale cress, human, bee, and fruit fly (which acts as an unmethylated control). These patterns allow us to generate evolutionary hypotheses about the relationships between GC(3) and methylation and how these affect expression patterns. Specifically, we propose that the opposite effects of methylation and compositional gradients along coding regions of GC(3)-poor and GC(3)-rich genes are the products of several competing processes.

Keywords: Apis mellifera; Arabidopsis thaliana; DNA methylation; GC3; Homo sapiens; Oryza sativa; gene expression; grasses; homeotherms.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Distributions of GC3 content for rice, Arabidopsis, bee, and human.
F<sc>ig</sc>. 2.—
Fig. 2.—
GC3 vs expression for four organisms: bee (green), rice (blue), Arabidopsis (red), and human (purple). (A) Relationship between standardized values of GC3 and average expression. (B) Gene expression variability as a function of GC3. Every point represents a mean across at least 100 genes and the standard error of the mean does not exceed 0.1 (plot A) and 0.06 (plot B).
F<sc>ig</sc>. 3.—
Fig. 3.—
Oryza sativa: Relationship between GC3 (purple triangles), gene expression strength (blue diamonds), expression variability (red squares), and methylation. Standard error of the mean is below 0.03 (GC3), 0.11 (expression), and 0.03 (expression variability).
F<sc>ig</sc>. 4.—
Fig. 4.—
Arabidopsis thaliana: Methylation level in Arabidopsis as a function of GC3 (A) and differential methylation between shoots and roots (B). Blue diamond: CG; red square: C; green triangle: CHG; violet cross: CHH. Every point represents an average across 100 or more genes. The absolute relative difference is calculated as formula image. Standard error of the mean does not exceed 0.05 for the mean methylation levels calculation.
F<sc>ig</sc>. 5.—
Fig. 5.—
Apis mellifera: (A) Relative difference in gene-body methylation levels formula image as a function of GC3 between worker and queen bee. (B) Difference in gene-body methylation between worker and queen bee as a function of GC3. (C) Queen and (D) worker bee methylation as a function of GC3. Every point represents an average of at least 228 genes. Standard error of the mean for methylation levels was below 0.006.
F<sc>ig</sc>. 6.—
Fig. 6.—
Homo sapiens: Methylation as a function of GC3. Every point represents a mean across at least 100 genes, with standard error of the mean not exceeding 10% of the mean. Methylation as a function of GC3.

Similar articles

Cited by

References

    1. Aamodt RM. Age-and caste-dependent decrease in expression of genes maintaining DNA and RNA quality and mitochondrial integrity in the honeybee wing muscle. Exp Gerontol. 2009;44(9):586–593. - PubMed
    1. Aceituno FF, Moseyko N, Rhee SY, Gutiérrez RA. The rules of gene expression in plants: organ identity and gene body methylation are key factors for regulation of gene expression in Arabidopsis thaliana. BMC Genomics. 2008;9:438. - PMC - PubMed
    1. Adams J. Imprinting and genetic disease: Angelman, Prader-Willi and Beckwith-Weidemann syndromes. Nat Educ. 2008;1(1)
    1. Ahmad T, et al. Evaluation of codon biology in citrus and Poncirus trifoliata based on genomic features and frame corrected expressed sequence tags. DNA Res. 2013;20:135–150. - PMC - PubMed
    1. Anastasiadou C, Malousi A, Maglaveras N, Kouidou S. Human epigenome data reveal increased CpG methylation in alternatively spliced sites and putative exonic splicing enhancers. DNA Cell Biol. 2011;30(5):267–275. - PubMed

Publication types

LinkOut - more resources