Leveraging multiple gene networks to prioritize GWAS candidate genes via network representation learning
- PMID: 29874547
- DOI: 10.1016/j.ymeth.2018.06.002
Abstract
Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering the genetic basis of human inherited diseases. However, extracting biological knowledge from GWAS data remains challenging, owing to issues such as missing heritability and weak interpretability. Indeed, the majority of discovered loci fall into noncoding regions without clear links to genes, which has hindered the characterization of their functions and calls for a sophisticated approach to bridge genetic and genomic studies. To address this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and demonstrated the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT.
Copyright © 2018 Elsevier Inc. All rights reserved.
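The idea of embedding a gene network into a compact feature space, as described in the abstract, can be illustrated with a minimal sketch. This is not the REGENT implementation: it swaps in a simple spectral embedding (top eigenvectors of the normalized adjacency matrix) as a stand-in for network representation learning, and the function name `spectral_embedding` and the six-gene toy network are hypothetical, introduced only for illustration.

```python
import numpy as np


def spectral_embedding(adj, dim):
    """Embed the nodes of an undirected network into `dim` dimensions.

    Assumption: a spectral embedding of D^{-1/2} A D^{-1/2} is used here as a
    simple stand-in for network representation learning; it is not REGENT.
    """
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    # Inverse square root of the degree, leaving isolated nodes at zero.
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = 1.0 / np.sqrt(deg[nonzero])
    norm_adj = adj * inv_sqrt[:, None] * inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order; keep the `dim` largest.
    _, eigvecs = np.linalg.eigh(norm_adj)
    return eigvecs[:, -dim:][:, ::-1]


# Hypothetical toy network: two gene modules (0-2 and 3-5) joined by one edge.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = np.zeros((6, 6))
for i, j in edges:
    adj[i, j] = adj[j, i] = 1.0

emb = spectral_embedding(adj, dim=2)


def dist(a, b):
    """Euclidean distance between two genes in the embedded space."""
    return float(np.linalg.norm(emb[a] - emb[b]))


print("within-module distance (gene 0 vs 1):", dist(0, 1))
print("between-module distance (gene 0 vs 5):", dist(0, 5))
```

In this toy case, genes in the same module land closer together in the embedded space than genes in different modules, which is the kind of compact feature a downstream statistical model could combine with GWAS evidence.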