Kernel-based data fusion and its application to protein function prediction in yeast
- PMID: 14992512
- DOI: 10.1142/9789812704856_0029
Kernel-based data fusion and its application to protein function prediction in yeast
Abstract
Kernel methods provide a principled framework in which to represent many types of data, including vectors, strings, trees and graphs. As such, these methods are useful for drawing inferences about biological phenomena. We describe a method for combining multiple kernel representations in an optimal fashion, by formulating the problem as a convex optimization problem that can be solved using semidefinite programming techniques. The method is applied to the problem of predicting yeast protein functional classifications using a support vector machine (SVM) trained on five types of data. For this problem, the new method performs better than a previously-described Markov random field method, and better than the SVM trained on any single type of data.
Similar articles
-
Learning gene functional classifications from multiple data types.J Comput Biol. 2002;9(2):401-11. doi: 10.1089/10665270252935539. J Comput Biol. 2002. PMID: 12015889
-
Protein function prediction with the shortest path in functional linkage graph and boosting.Int J Bioinform Res Appl. 2008;4(4):375-84. doi: 10.1504/IJBRA.2008.021175. Int J Bioinform Res Appl. 2008. PMID: 19008182
-
A statistical framework for genomic data fusion.Bioinformatics. 2004 Nov 1;20(16):2626-35. doi: 10.1093/bioinformatics/bth294. Epub 2004 May 6. Bioinformatics. 2004. PMID: 15130933
-
ENZPRED-enzymatic protein class predicting by machine learning.Curr Top Med Chem. 2013;13(14):1674-80. doi: 10.2174/15680266113139990118. Curr Top Med Chem. 2013. PMID: 23889047 Review.
-
Advances in the prediction of protein targeting signals.Proteomics. 2004 Jun;4(6):1571-80. doi: 10.1002/pmic.200300786. Proteomics. 2004. PMID: 15174127 Review.
Cited by
-
Bayesian Markov Random Field analysis for protein function prediction based on network data.PLoS One. 2010 Feb 24;5(2):e9293. doi: 10.1371/journal.pone.0009293. PLoS One. 2010. PMID: 20195360 Free PMC article.
-
Towards region-specific propagation of protein functions.Bioinformatics. 2019 May 15;35(10):1737-1744. doi: 10.1093/bioinformatics/bty834. Bioinformatics. 2019. PMID: 30304483 Free PMC article.
-
Gene function prediction using labeled and unlabeled data.BMC Bioinformatics. 2008 Jan 28;9:57. doi: 10.1186/1471-2105-9-57. BMC Bioinformatics. 2008. PMID: 18221567 Free PMC article.
-
Supervised regularized canonical correlation analysis: integrating histologic and proteomic measurements for predicting biochemical recurrence following prostate surgery.BMC Bioinformatics. 2011 Dec 19;12:483. doi: 10.1186/1471-2105-12-483. BMC Bioinformatics. 2011. PMID: 22182303 Free PMC article.
-
Functional protein representations from biological networks enable diverse cross-species inference.Nucleic Acids Res. 2019 May 21;47(9):e51. doi: 10.1093/nar/gkz132. Nucleic Acids Res. 2019. PMID: 30847485 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials