A flexible R package for nonnegative matrix factorization
- PMID: 20598126
- PMCID: PMC2912887
- DOI: 10.1186/1471-2105-11-367
A flexible R package for nonnegative matrix factorization
Abstract
Background: Nonnegative Matrix Factorization (NMF) is an unsupervised learning technique that has been applied successfully in several fields, including signal processing, face recognition and text mining. Recent applications of NMF in bioinformatics have demonstrated its ability to extract meaningful information from high-dimensional data such as gene expression microarrays. Developments in NMF theory and applications have resulted in a variety of algorithms and methods. However, most NMF implementations have been on commercial platforms, while those that are freely available typically require programming skills. This limits their use by the wider research community.
Results: Our objective is to provide the bioinformatics community with an open-source, easy-to-use and unified interface to standard NMF algorithms, as well as with a simple framework to help implement and test new NMF methods. For that purpose, we have developed a package for the R/BioConductor platform. The package ports public code to R, and is structured to enable users to easily modify and/or add algorithms. It includes a number of published NMF algorithms and initialization methods and facilitates the combination of these to produce new NMF strategies. Commonly used benchmark data and visualization methods are provided to help in the comparison and interpretation of the results.
Conclusions: The NMF package helps realize the potential of Nonnegative Matrix Factorization, especially in bioinformatics, providing easy access to methods that have already yielded new insights in many applications. Documentation, source code and sample data are available from CRAN.
Figures
Similar articles
-
Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: a case study.Infect Genet Evol. 2012 Jul;12(5):913-21. doi: 10.1016/j.meegid.2011.08.014. Epub 2011 Sep 10. Infect Genet Evol. 2012. PMID: 21930246
-
NMF-mGPU: non-negative matrix factorization on multi-GPU systems.BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4. BMC Bioinformatics. 2015. PMID: 25887585 Free PMC article.
-
Convex nonnegative matrix factorization with manifold regularization.Neural Netw. 2015 Mar;63:94-103. doi: 10.1016/j.neunet.2014.11.007. Epub 2014 Dec 4. Neural Netw. 2015. PMID: 25523040
-
Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.PLoS Comput Biol. 2008 Jul 25;4(7):e1000029. doi: 10.1371/journal.pcbi.1000029. PLoS Comput Biol. 2008. PMID: 18654623 Free PMC article. Review.
-
Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R and GenePattern Notebook implementations of CoGAPS.Nat Protoc. 2023 Dec;18(12):3690-3731. doi: 10.1038/s41596-023-00892-x. Epub 2023 Nov 21. Nat Protoc. 2023. PMID: 37989764 Free PMC article. Review.
Cited by
-
Tankyrase inhibition promotes endocrine commitment of hPSC-derived pancreatic progenitors.Nat Commun. 2024 Oct 9;15(1):8754. doi: 10.1038/s41467-024-53068-w. Nat Commun. 2024. PMID: 39384787 Free PMC article.
-
Testicular large B-cell lymphoma is genetically similar to PCNSL and distinct from nodal DLBCL.Hemasphere. 2024 Oct 7;8(10):e70024. doi: 10.1002/hem3.70024. eCollection 2024 Oct. Hemasphere. 2024. PMID: 39380845 Free PMC article.
-
Heparin-enriched plasma proteome is significantly altered in Alzheimer's disease.Mol Neurodegener. 2024 Oct 8;19(1):67. doi: 10.1186/s13024-024-00757-1. Mol Neurodegener. 2024. PMID: 39380021 Free PMC article.
-
The estrogen response in fibroblasts promotes ovarian metastases of gastric cancer.Nat Commun. 2024 Sep 30;15(1):8447. doi: 10.1038/s41467-024-52615-9. Nat Commun. 2024. PMID: 39349474 Free PMC article.
-
Progression from cardiomyopathy to heart failure with reduced ejection fraction: A CORIN deficient course.Heliyon. 2024 Sep 11;10(18):e37838. doi: 10.1016/j.heliyon.2024.e37838. eCollection 2024 Sep 30. Heliyon. 2024. PMID: 39315128 Free PMC article.
References
-
- Paatero P, Tapper U. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics. 1994;5(2):111–126. doi: 10.1002/env.3170050203. http://dx.doi.org/10.1002/env.3170050203 - DOI - DOI
-
- Lee D, Seung H. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401:788–791. doi: 10.1038/44565. http://www.nature.com/nature/journal/v401/n6755/abs/401788a0.html - DOI - PubMed
-
- Devarajan K. Nonnegative matrix factorization: an analytical and interpretive tool in computational biology. PLoS computational biology. 2008;4:e1000029. doi: 10.1371/journal.pcbi.1000029. http://www.ncbi.nlm.nih.gov/pubmed/18654623 - DOI - PMC - PubMed
-
- Brunet JP, Tamayo P, Golub TR, Mesirov JP. Metagenes and molecular pattern discovery using matrix factorization. Proceedings of the National Academy of Sciences of the United States of America. 2004;101:4164–9. doi: 10.1073/pnas.0308531101. http://www.ncbi.nlm.nih.gov/pubmed/15016911 - DOI - PMC - PubMed
-
- Pehkonen P, Wong G, Toronen P. Theme discovery from gene lists for identification and viewing of multiple functional groups. BMC Bioinformatics. 2005;6:162. doi: 10.1186/1471-2105-6-162. http://www.biomedcentral.com/1471-2105/6/162 - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources