A literature network of human genes for high-throughput analysis of gene expression
- PMID: 11326270
- DOI: 10.1038/ng0501-21
A literature network of human genes for high-throughput analysis of gene expression
Abstract
We have carried out automated extraction of explicit and implicit biomedical knowledge from publicly available gene and text databases to create a gene-to-gene co-citation network for 13,712 named human genes by automated analysis of titles and abstracts in over 10 million MEDLINE records. The associations between genes have been annotated by linking genes to terms from the medical subject heading (MeSH) index and terms from the gene ontology (GO) database. The extracted database and accompanying web tools for gene-expression analysis have collectively been named 'PubGene'. We validated the extracted networks by three large-scale experiments showing that co-occurrence reflects biologically meaningful relationships, thus providing an approach to extract and structure known biology. We validated the applicability of the tools by analyzing two publicly available microarray data sets.
Comment in
-
Community watch.Nat Genet. 2001 May;28(1):1-2. doi: 10.1038/ng0501-1. Nat Genet. 2001. PMID: 11326259 No abstract available.
-
Linking microarray data to the literature.Nat Genet. 2001 May;28(1):9-10. doi: 10.1038/ng0501-9. Nat Genet. 2001. PMID: 11326264 No abstract available.
Similar articles
-
MILANO--custom annotation of microarray results using automatic literature searches.BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12. BMC Bioinformatics. 2005. PMID: 15661078 Free PMC article.
-
SNOMAD (Standardization and NOrmalization of MicroArray Data): web-accessible gene expression data analysis.Bioinformatics. 2002 Nov;18(11):1540-1. doi: 10.1093/bioinformatics/18.11.1540. Bioinformatics. 2002. PMID: 12424128
-
Biosphere: the interoperation of web services in microarray cluster analysis.Appl Bioinformatics. 2004;3(4):253-6. doi: 10.2165/00822942-200403040-00007. Appl Bioinformatics. 2004. PMID: 15702956
-
Bioinformatics methods for the analysis of expression arrays: data clustering and information extraction.J Biotechnol. 2002 Sep 25;98(2-3):269-83. doi: 10.1016/s0168-1656(02)00137-2. J Biotechnol. 2002. PMID: 12141992 Review.
-
Interpreting microarray results with gene ontology and MeSH.Methods Mol Biol. 2007;377:223-42. doi: 10.1007/978-1-59745-390-5_14. Methods Mol Biol. 2007. PMID: 17634620 Review.
Cited by
-
Establishing a baseline for literature mining human genetic variants and their relationships to disease cohorts.BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):68. doi: 10.1186/s12911-016-0294-3. BMC Med Inform Decis Mak. 2016. PMID: 27454860 Free PMC article.
-
EGAN: exploratory gene association networks.Bioinformatics. 2010 Jan 15;26(2):285-6. doi: 10.1093/bioinformatics/btp656. Epub 2009 Nov 23. Bioinformatics. 2010. PMID: 19933825 Free PMC article.
-
TopoGSA: network topological gene set analysis.Bioinformatics. 2010 May 1;26(9):1271-2. doi: 10.1093/bioinformatics/btq131. Epub 2010 Mar 24. Bioinformatics. 2010. PMID: 20335277 Free PMC article.
-
Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data.BMC Bioinformatics. 2011 Aug 31;12:359. doi: 10.1186/1471-2105-12-359. BMC Bioinformatics. 2011. PMID: 21884587 Free PMC article.
-
Hot and Cold Theory: Evidence in Systems Biology.Adv Exp Med Biol. 2021;1343:135-160. doi: 10.1007/978-3-030-80983-6_9. Adv Exp Med Biol. 2021. PMID: 35015281
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources