Discovery of Protein-lncRNA Interactions by Integrating Large-Scale CLIP-Seq and RNA-Seq Datasets
- PMID: 25642422
- PMCID: PMC4294205
- DOI: 10.3389/fbioe.2014.00088
Discovery of Protein-lncRNA Interactions by Integrating Large-Scale CLIP-Seq and RNA-Seq Datasets
Abstract
Long non-coding RNAs (lncRNAs) are emerging as important regulatory molecules in developmental, physiological, and pathological processes. However, the precise mechanism and functions of most of lncRNAs remain largely unknown. Recent advances in high-throughput sequencing of immunoprecipitated RNAs after cross-linking (CLIP-Seq) provide powerful ways to identify biologically relevant protein-lncRNA interactions. In this study, by analyzing millions of RNA-binding protein (RBP) binding sites from 117 CLIP-Seq datasets generated by 50 independent studies, we identified 22,735 RBP-lncRNA regulatory relationships. We found that one single lncRNA will generally be bound and regulated by one or multiple RBPs, the combination of which may coordinately regulate gene expression. We also revealed the expression correlation of these interaction networks by mining expression profiles of over 6000 normal and tumor samples from 14 cancer types. Our combined analysis of CLIP-Seq data and genome-wide association studies data discovered hundreds of disease-related single nucleotide polymorphisms resided in the RBP binding sites of lncRNAs. Finally, we developed interactive web implementations to provide visualization, analysis, and downloading of the aforementioned large-scale datasets. Our study represented an important step in identification and analysis of RBP-lncRNA interactions and showed that these interactions may play crucial roles in cancer and genetic diseases.
Keywords: CLIP-Seq; GWAS; RNA-Seq; RNA-binding protein; long non-coding RNA.
Figures
Similar articles
-
starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.Nucleic Acids Res. 2014 Jan;42(Database issue):D92-7. doi: 10.1093/nar/gkt1248. Epub 2013 Dec 1. Nucleic Acids Res. 2014. PMID: 24297251 Free PMC article.
-
CLIPdb: a CLIP-seq database for protein-RNA interactions.BMC Genomics. 2015 Feb 5;16(1):51. doi: 10.1186/s12864-015-1273-2. BMC Genomics. 2015. PMID: 25652745 Free PMC article.
-
Large-Scale Profiling of RBP-circRNA Interactions from Public CLIP-Seq Datasets.Genes (Basel). 2020 Jan 3;11(1):54. doi: 10.3390/genes11010054. Genes (Basel). 2020. PMID: 31947823 Free PMC article.
-
[Advances in technologies for large-scale enrichment and identification of ribonucleic acid-protein complexes].Se Pu. 2021 Feb;39(2):105-111. doi: 10.3724/SP.J.1123.2020.07019. Se Pu. 2021. PMID: 34227341 Free PMC article. Review. Chinese.
-
New insights into the interplay between long non-coding RNAs and RNA-binding proteins in cancer.Cancer Commun (Lond). 2022 Feb;42(2):117-140. doi: 10.1002/cac2.12254. Epub 2022 Jan 12. Cancer Commun (Lond). 2022. PMID: 35019235 Free PMC article. Review.
Cited by
-
Design and bioinformatics analysis of genome-wide CLIP experiments.Nucleic Acids Res. 2015 Jun 23;43(11):5263-74. doi: 10.1093/nar/gkv439. Epub 2015 May 9. Nucleic Acids Res. 2015. PMID: 25958398 Free PMC article. Review.
-
Application of sequence semantic and integrated cellular geography approach to study alternative biogenesis of exonic circular RNA.BMC Bioinformatics. 2023 Apr 17;24(1):148. doi: 10.1186/s12859-023-05279-z. BMC Bioinformatics. 2023. PMID: 37069509 Free PMC article.
-
Long Non-Coding RNAs: A Novel Paradigm for Toxicology.Toxicol Sci. 2017 Jan;155(1):3-21. doi: 10.1093/toxsci/kfw203. Epub 2016 Nov 17. Toxicol Sci. 2017. PMID: 27864543 Free PMC article. Review.
-
Interactome determination of a Long Noncoding RNA implicated in Embryonic Stem Cell Self-Renewal.Sci Rep. 2018 Dec 4;8(1):17568. doi: 10.1038/s41598-018-34864-z. Sci Rep. 2018. PMID: 30514857 Free PMC article.
-
Long noncoding RNA study: Genome-wide approaches.Genes Dis. 2022 Nov 29;10(6):2491-2510. doi: 10.1016/j.gendis.2022.10.024. eCollection 2023 Nov. Genes Dis. 2022. PMID: 37554208 Free PMC article. Review.
References
-
- Alwohhaib M., Alwaheeb S., Alyatama N., Dashti A. A., Abdelghani A., Hussain N. (2014). Single nucleotide polymorphisms at erythropoietin, superoxide dismutase 1, splicing factor, arginine/serin-rich 15 and plasmacytoma variant translocation genes association with diabetic nephropathy. Saudi J. Kidney Dis. Transpl. 25, 577–581. - PubMed
-
- Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B Stat. Methodol. 57, 289–300.
LinkOut - more resources
Full Text Sources
Other Literature Sources