H-InvDB in 2013: an omics study platform for human functional gene and transcript discovery
- PMID: 23197657
- PMCID: PMC3531145
- DOI: 10.1093/nar/gks1245
H-InvDB in 2013: an omics study platform for human functional gene and transcript discovery
Abstract
H-InvDB (http://www.h-invitational.jp/) is a comprehensive human gene database started in 2004. In the latest version, H-InvDB 8.0, a total of 244 709 human complementary DNA was mapped onto the hg19 reference genome and 43 829 gene loci, including nonprotein-coding ones, were identified. Of these loci, 35 631 were identified as potential protein-coding genes, and 22 898 of these were identical to known genes. In our analysis, 19 309 annotated genes were specific to H-InvDB and not found in RefSeq and Ensembl. In fact, 233 genes of the 19 309 turned out to have protein functions in this version of H-InvDB; they were annotated as unknown protein functions in the previous version. Furthermore, 11 genes were identified as known Mendelian disorder genes. It is advantageous that many biologically functional genes are hidden in the H-InvDB unique genes. As large-scale proteomic projects have been conducted to elucidate the functions of all human proteins, we have enhanced the proteomic information with an advanced protein view and new subdatabase of protein complexes (Protein Complex Database with quality index). We propose that H-InvDB is an important resource for finding novel candidate targets for medical care and drug development.
Figures
Similar articles
-
The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts.Nucleic Acids Res. 2008 Jan;36(Database issue):D793-9. doi: 10.1093/nar/gkm999. Epub 2007 Dec 18. Nucleic Acids Res. 2008. PMID: 18089548 Free PMC article.
-
Investigation of protein functions through data-mining on integrated human transcriptome database, H-Invitational database (H-InvDB).Gene. 2005 Dec 30;364:99-107. doi: 10.1016/j.gene.2005.05.036. Epub 2005 Sep 26. Gene. 2005. PMID: 16185827
-
H-InvDB in 2009: extended database and data mining resources for human genes and transcripts.Nucleic Acids Res. 2010 Jan;38(Database issue):D626-32. doi: 10.1093/nar/gkp1020. Epub 2009 Nov 23. Nucleic Acids Res. 2010. PMID: 19933760 Free PMC article.
-
Multi-omics annotation of human long non-coding RNAs.Biochem Soc Trans. 2020 Aug 28;48(4):1545-1556. doi: 10.1042/BST20191063. Biochem Soc Trans. 2020. PMID: 32756901 Review.
-
An Experimental Approach to Genome Annotation: This report is based on a colloquium sponsored by the American Academy of Microbiology held July 19-20, 2004, in Washington, DC.Washington (DC): American Society for Microbiology; 2004. Washington (DC): American Society for Microbiology; 2004. PMID: 33001599 Free Books & Documents. Review.
Cited by
-
Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes.Hum Mol Genet. 2014 Nov 15;23(22):5866-78. doi: 10.1093/hmg/ddu309. Epub 2014 Jun 16. Hum Mol Genet. 2014. PMID: 24939910 Free PMC article.
-
Gateways to the FANTOM5 promoter level mammalian expression atlas.Genome Biol. 2015 Jan 5;16(1):22. doi: 10.1186/s13059-014-0560-6. Genome Biol. 2015. PMID: 25723102 Free PMC article.
-
Transmembrane protein 208: a novel ER-localized protein that regulates autophagy and ER stress.PLoS One. 2013 May 14;8(5):e64228. doi: 10.1371/journal.pone.0064228. Print 2013. PLoS One. 2013. PMID: 23691174 Free PMC article.
-
aLeaves facilitates on-demand exploration of metazoan gene family trees on MAFFT sequence alignment server with enhanced interactivity.Nucleic Acids Res. 2013 Jul;41(Web Server issue):W22-8. doi: 10.1093/nar/gkt389. Epub 2013 May 15. Nucleic Acids Res. 2013. PMID: 23677614 Free PMC article.
-
Review: Alternative Splicing (AS) of Genes As An Approach for Generating Protein Complexity.Curr Genomics. 2013 May;14(3):182-94. doi: 10.2174/1389202911314030004. Curr Genomics. 2013. PMID: 24179441 Free PMC article.
References
-
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. - PubMed
-
- Ota T, Suzuki Y, Nishikawa T, Otsuki T, Sugiyama T, Irie R, Wakamatsu A, Hayashi K, Sato H, Nagai K, et al. Complete sequencing and characterization of 21,243 full-length human cDNAs. Nat. Genet. 2004;36:40–45. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Research Materials