Defining Essentiality Score of Protein-Coding Genes and Long Noncoding RNAs
- PMID: 30356729
- PMCID: PMC6189311
- DOI: 10.3389/fgene.2018.00380
Defining Essentiality Score of Protein-Coding Genes and Long Noncoding RNAs
Abstract
Measuring the essentiality of genes is critically important in biology and medicine. Here we proposed a computational method, GIC (Gene Importance Calculator), which can efficiently predict the essentiality of both protein-coding genes and long noncoding RNAs (lncRNAs) based on only sequence information. For identifying the essentiality of protein-coding genes, GIC outperformed well-established computational scores. In an independent mouse lncRNA dataset, GIC also achieved an exciting performance (AUC = 0.918). In contrast, the traditional computational methods are not applicable to lncRNAs. Moreover, we explored several potential applications of GIC score. Firstly, we revealed a correlation between gene GIC score and research hotspots of genes. Moreover, GIC score can be used to evaluate whether a gene in mouse is representative for its homolog in human by dissecting its cross-species difference. This is critical for basic medicine because many basic medical studies are performed in animal models. Finally, we showed that GIC score can be used to identify candidate genes from a transcriptomics study. GIC is freely available at http://www.cuilab.cn/gic/.
Keywords: essentiality; lncRNAs; machine learning; prediction; protein-coding genes.
Figures
Similar articles
-
miES: predicting the essentiality of miRNAs with machine learning and sequence features.Bioinformatics. 2019 Mar 15;35(6):1053-1054. doi: 10.1093/bioinformatics/bty738. Bioinformatics. 2019. PMID: 30165607
-
LncTar: a tool for predicting the RNA targets of long noncoding RNAs.Brief Bioinform. 2015 Sep;16(5):806-12. doi: 10.1093/bib/bbu048. Epub 2014 Dec 17. Brief Bioinform. 2015. PMID: 25524864
-
CRlncRC: a machine learning-based method for cancer-related long noncoding RNA identification using integrated features.BMC Med Genomics. 2018 Dec 31;11(Suppl 6):120. doi: 10.1186/s12920-018-0436-9. BMC Med Genomics. 2018. PMID: 30598114 Free PMC article.
-
Long non-coding RNAs and complex diseases: from experimental results to computational models.Brief Bioinform. 2017 Jul 1;18(4):558-576. doi: 10.1093/bib/bbw060. Brief Bioinform. 2017. PMID: 27345524 Free PMC article. Review.
-
LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property.Brief Bioinform. 2019 Nov 27;20(6):2009-2027. doi: 10.1093/bib/bby065. Brief Bioinform. 2019. PMID: 30084867 Free PMC article. Review.
Cited by
-
Pig-specific RNA editing during early embryo development revealed by genome-wide comparisons.FEBS Open Bio. 2020 Jul;10(7):1389-1402. doi: 10.1002/2211-5463.12900. Epub 2020 Jun 25. FEBS Open Bio. 2020. PMID: 32433824 Free PMC article.
-
A New Metric Quantifying Chemical and Biological Property of Small Molecule Metabolites and Drugs.Front Mol Biosci. 2020 Dec 15;7:594800. doi: 10.3389/fmolb.2020.594800. eCollection 2020. Front Mol Biosci. 2020. PMID: 33385011 Free PMC article.
-
dbEssLnc: A manually curated database of human and mouse essential lncRNA genes.Comput Struct Biotechnol J. 2022 May 23;20:2657-2663. doi: 10.1016/j.csbj.2022.05.043. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 35685362 Free PMC article.
-
Ribosomal modification protein rimK-like family member A activates betaine-homocysteine S-methyltransferase 1 to ameliorate hepatic steatosis.Signal Transduct Target Ther. 2024 Aug 8;9(1):214. doi: 10.1038/s41392-024-01914-0. Signal Transduct Target Ther. 2024. PMID: 39117631 Free PMC article.
-
Long Non-Coding RNA in the Pathogenesis of Cancers.Cells. 2019 Sep 1;8(9):1015. doi: 10.3390/cells8091015. Cells. 2019. PMID: 31480503 Free PMC article. Review.
References
LinkOut - more resources
Full Text Sources