Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Feb;10(2):128-32.
doi: 10.1038/nmeth.2330. Epub 2013 Jan 6.

Identifying RNA editing sites using RNA sequencing data alone

Affiliations

Identifying RNA editing sites using RNA sequencing data alone

Gokul Ramaswami et al. Nat Methods. 2013 Feb.

Abstract

We show that RNA editing sites can be called with high confidence using RNA sequencing data from multiple samples across either individuals or species, without the need for matched genomic DNA sequence. We identified many previously unidentified editing sites in both humans and Drosophila; our results nearly double the known number of human protein recoding events. We also found that human genes harboring conserved editing sites within Alu repeats are enriched for neuronal functions.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Identification and validation of A-to-I RNA editing sites using RNA-seq data from human brain tissues. (a,b) Overview of the separate samples method (a) and the pooled samples method (b). (c,d) Identification of editing sites using the separate samples method. Minimum number of samples containing each variant relative to the proportion of variants that are either A-to-G or T-to-C mismatches (c). Variants were required to be supported by at least one read in each sample in c. Percentage of all 12 mismatch types (d); variants in Alu and non-Alu regions were required to be present in least one or two samples, respectively. (e,f) Identification of editing sites using the pooled samples method. Minimum number of reads supporting each variant relative to the proportion of variants that are either A-to-G or T-to-C mismatches (e). Percentage of all 12 mismatch types (f); variants in Alu and non-Alu regions were required to be supported by at least one or two reads, respectively. (g) RNA editing levels of 115 nonsynonymous nonrepetitive editing sites measured using the pooled alignments for all 50 brain data samples. Editing sites covered by less than 20 reads are identified as open diamonds. Measurements of RNA editing levels for each site in each sample (where ≥ 10 reads were available) are shown by error bars, with highest and lowest editing levels observed. Previously unidentified sites that were validated are marked with an ‘×’ on the x axis; previously unidentified sites that were not validated by PCR are marked with open circles on the x axis.
Figure 2
Figure 2
Accurate identification of RNA editing sites in the primate lineage. (a–c) Relationship between the A-to-G proportion of detected mismatches and the minimum editing level in Alu (a), repetitive non-Alu (b) and nonrepetitive regions (c). (d) Phylogenetic relationships among the four analyzed mammalian species (left); Myr: million years ago. Number of conserved A-to-G variants that were identified in human and each of the other three selected species (right), with a magnification for variants in non-Alu regions (bottom). (e) Functional enrichment in transcripts with edited Alu repeats. The conserved editing sites in Alu repeats, sites edited in human and chimpanzee and/or rhesus macaque brains, occur in 1,400 genes. As controls, we collected two groups of genes (1,065 and 831, respectively) with editing sites in Alu repeats that were edited in human only (Online Methods). The P values (Expression Analysis Systematic Explorer (EASE) scores) shown on the x axis were corrected for multiple hypotheses testing using the Benjamini-Hochberg method.
Figure 3
Figure 3
RNA editing site identification in Drosophila. (a) A-to-G proportion of detected mismatches relative to the minimum editing level for the species labeled in b. (b) Number of A-to-G variants relative to the minimum editing level. (c) RNA editing levels of 863 editing sites measured from the heads of male wild-type and Adar5G1 mutant flies. (d) Phylogenetic relationships among the four analyzed Drosophila species (left). Myr: Million years ago. Numbers of A-to-G variants identified in D. melanogaster and each of the other three selected Drosophila species analyzed (right).

Similar articles

Cited by

References

    1. Nishikura K. Annu. Rev. Biochem. 2010;79:321–349. - PMC - PubMed
    1. Kim U, Wang Y, Sanford T, Zeng Y, Nishikura K. Proc. Natl. Acad. Sci. USA. 1994;91:11457–11461. - PMC - PubMed
    1. Levanon EY, et al. Nat. Biotechnol. 2004;22:1001–1005. - PubMed
    1. Eisenberg E, Li JB, Levanon EY. RNA Biol. 2010;7:248–252. - PubMed
    1. Rosenthal JJ, Seeburg PH. Neuron. 2012;74:432–439. - PMC - PubMed

Publication types

Associated data