Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jul;42(Web Server issue):W361-7.
doi: 10.1093/nar/gku406. Epub 2014 May 14.

RBPmap: a web server for mapping binding sites of RNA-binding proteins

Affiliations

RBPmap: a web server for mapping binding sites of RNA-binding proteins

Inbal Paz et al. Nucleic Acids Res. 2014 Jul.

Abstract

Regulation of gene expression is executed in many cases by RNA-binding proteins (RBPs) that bind to mRNAs as well as to non-coding RNAs. RBPs recognize their RNA target via specific binding sites on the RNA. Predicting the binding sites of RBPs is known to be a major challenge. We present a new webserver, RBPmap, freely accessible through the website http://rbpmap.technion.ac.il/ for accurate prediction and mapping of RBP binding sites. RBPmap has been developed specifically for mapping RBPs in human, mouse and Drosophila melanogaster genomes, though it supports other organisms too. RBPmap enables the users to select motifs from a large database of experimentally defined motifs. In addition, users can provide any motif of interest, given as either a consensus or a PSSM. The algorithm for mapping the motifs is based on a Weighted-Rank approach, which considers the clustering propensity of the binding sites and the overall tendency of regulatory regions to be conserved. In addition, RBPmap incorporates a position-specific background model, designed uniquely for different genomic regions, such as splice sites, 5' and 3' UTRs, non-coding RNA and intergenic regions. RBPmap was tested on high-throughput RNA-binding experiments and was proved to be highly accurate.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A pipeline summarizing RBPmap algorithm. (A) The mandatory input parameters for RBPmap run; a query sequence and a motif of interest to be mapped to the sequence. (B) A match score for the motif is calculated for each site in the query sequence, in overlapping windows of the motif size. (C) The match scores are compared to the average match score that is calculated for each motif in a background of randomly chosen regulatory regions. This step uses two different thresholds; a significant threshold for the anchor site (default P-value<0.005) and a suboptimal threshold for the secondary sites (default P-value<0.01) used to evaluate the clustering propensity. (D) A WR score is calculated for a window of 50 nts around each significant site. This score reflects the propensity of suboptimal sites to cluster around the significant site, weighted by their match score to the motif of interest. (E) To reduce false-positive predictions, the WR scores are compared to a region-specific background model that is generated independently per each motif for different genomic regions, removing non-significant results (P-value≥0.05). The figure exemplifies the procedure conducted for a query sequence spanning three different genomic regions (mid-intron, intronic region flanking a splice site and an internal exon). (F) Finally, a conservation-based filtering step is applied only to sites mapped to mid-intron/intergenic regions, filtering out sites which fall in non-conserved regions (below the average conservation level calculated for intronic regulatory regions).
Figure 2.
Figure 2.
A view of RBPmap input and output pages. (A) An example of RBPmap home page demonstrating the mandatory input parameters needed for RBPmap run. (B) By clicking the link ‘Click here to select motifs from RBPmap full list’, a sorted list of all motifs in the RBPmap database is opened and the user is prompted to select the proteins/motifs of interest. (C) An example of RBPmap output page. In the example presented the job includes more than one query sequence. The results per each sequence are shown followed by a link to a text file summarizes the binding sites predictions for all the input sequences. (D) An example of the output summary of all predicted binding sites within one query sequence in a web-based presentation. The results are provided for each of the proteins selected by the user, where all the occurrences of motifs belonging to the same protein are listed together. (E) A visualized presentation of the predicted binding sites as custom tracks in the UCSC Genome Browser.

Similar articles

Cited by

References

    1. Li X., Kazan H., Lipshitz H.D., Morris Q.D. Finding the target sites of RNA-binding proteins. Wiley Interdiscip. Rev. RNA. 2014;5:111–130. - PMC - PubMed
    1. Ray D., Kazan H., Chan E.T., Peña Castillo L., Chaudhry S., Talukder S., Blencowe B.J., Morris Q., Hughes T.R. Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins. Nat. Biotechnol. 2009;27:667–670. - PubMed
    1. Ray D., Kazan H., Cook K.B., Weirauch M.T., Najafabadi H.S., Li X., Gueroussov S., Albu M., Zheng H., Yang A., et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499:172–177. - PMC - PubMed
    1. Tenenbaum S.A., Carson C.C., Lager P.J., Keene J.D. Identifying mRNA subsets in messenger ribonucleoprotein complexes by using cDNA arrays. PNAS. 2000;97:14085–14090. - PMC - PubMed
    1. Keene J.D., Komisarow J.M., Friedersdorf M.B. RIP-Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts. Nat. Protoc. 2006;1:302–307. - PubMed

Publication types