Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data
- PMID: 12967959
- DOI: 10.1093/bioinformatics/btg210
Abstract
Motivation: Novel methods, both molecular and statistical, are urgently needed to take advantage of recent advances in biotechnology and the Human Genome Project for disease diagnosis and prognosis. Mass spectrometry (MS) holds great promise for biomarker identification and genome-wide protein profiling. Previous studies have shown that biomarkers identified from MS data can distinguish cancer patients from normal individuals. Such progress is especially exciting for the detection of early-stage ovarian cancer. Although various statistical methods have been used to identify biomarkers from MS data, there has been no systematic comparison of their relative ability to analyze such data.
Results: We compare the performance of several classes of statistical methods for classifying cancer based on MS spectra. These methods include linear discriminant analysis, quadratic discriminant analysis, the k-nearest neighbor classifier, bagged and boosted classification trees, support vector machines, and random forest (RF). The methods are applied to ovarian cancer and control serum samples from the National Ovarian Cancer Early Detection Program clinic at Northwestern University Hospital. We found that RF outperforms the other methods in the analysis of MS data.
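The abstract does not state how the classifiers were scored, so the following is only a minimal sketch of one common way to run such a comparison: estimate each method's misclassification rate by k-fold cross-validation on the same samples. Everything in it is an assumption made for illustration. The data are synthetic stand-ins for MS peak intensities, the function names (`make_spectra`, `knn_predict`, `centroid_predict`, `cv_error`) are hypothetical, and only the k-nearest neighbor rule comes from the list above. The nearest-centroid rule is a simplified stand-in for linear discriminant analysis (it matches LDA when the shared covariance is the identity and the class priors are equal) and is not one of the methods the paper evaluates.

```python
import random
from collections import Counter


def make_spectra(n_per_class, n_peaks=20, shift=1.0, seed=0):
    """Synthetic stand-in for MS peak intensities (not real data).

    Class 1 has the first three peaks shifted upward by `shift`.
    """
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_peaks)]
            for j in range(3):
                row[j] += shift * label
            X.append(row)
            y.append(label)
    return X, y


def sq_dist(a, b):
    """Squared Euclidean distance between two spectra."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


def knn_predict(X_train, y_train, x, k=3):
    """k-nearest neighbor rule: majority label among the k closest spectra."""
    nearest = sorted(range(len(X_train)),
                     key=lambda i: sq_dist(X_train[i], x))[:k]
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]


def centroid_predict(X_train, y_train, x):
    """Nearest-centroid rule: assign x to the class with the closest mean."""
    centroids = {}
    for label in set(y_train):
        rows = [r for r, lab in zip(X_train, y_train) if lab == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return min(centroids, key=lambda lab: sq_dist(centroids[lab], x))


def cv_error(predict, X, y, n_folds=5, seed=0):
    """Misclassification rate of `predict` under n-fold cross-validation."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[f::n_folds] for f in range(n_folds)]
    errors = 0
    for fold in folds:
        held_out = set(fold)
        X_tr = [X[i] for i in idx if i not in held_out]
        y_tr = [y[i] for i in idx if i not in held_out]
        errors += sum(predict(X_tr, y_tr, X[i]) != y[i] for i in fold)
    return errors / len(X)


X, y = make_spectra(n_per_class=40, shift=2.0)
results = {
    "3-nearest neighbor": cv_error(knn_predict, X, y),
    "nearest centroid": cv_error(centroid_predict, X, y),
}
# Report methods from lowest to highest cross-validated error.
for name, err in sorted(results.items(), key=lambda kv: kv[1]):
    print(f"{name}: CV error = {err:.3f}")
```

Because every method is scored on the same folds, the error rates are directly comparable. Further classifiers can be added to `results` as long as they follow the same `predict(X_train, y_train, x)` signature.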