Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis
Open Access
- 5 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (12) , 1495-1502
- https://doi.org/10.1093/bioinformatics/btm134
Abstract
Motivation: Many practical pattern recognition problems require non-negativity constraints. For example, pixels in digital images and chemical concentrations in bioinformatics are non-negative. Sparse non-negative matrix factorizations (NMFs) are useful when the degree of sparseness in the non-negative basis matrix or the non-negative coefficient matrix in an NMF needs to be controlled in approximating high-dimensional data in a lower dimensional space.

Results: In this article, we introduce a novel formulation of sparse NMF and show how the new formulation leads to a convergent sparse NMF algorithm via alternating non-negativity-constrained least squares. We apply our sparse NMF algorithm to cancer-class discovery and gene expression data analysis and offer biological analysis of the results obtained. Our experimental results illustrate that the proposed sparse NMF algorithm often achieves better clustering performance with shorter computing time compared to other existing NMF algorithms.

Availability: The software is available as supplementary material.

Contact: hskim@cc.gatech.edu, hpark@acc.gatech.edu

Supplementary information: Supplementary data are available at Bioinformatics online.
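The abstract names the technique but does not state the objective function, so the following is only a minimal sketch of how a sparse NMF can be computed by alternating non-negativity-constrained least squares. It assumes one common form of the problem: minimize ||A - WH||_F^2 + eta ||W||_F^2 + beta * sum_j ||H(:,j)||_1^2 subject to W, H >= 0, where the penalties are folded into each least squares subproblem by appending extra rows. The function names, the parameters `beta` and `eta`, the random initialization, and the fixed iteration count are all illustrative assumptions, not the authors' supplementary software.

```python
import numpy as np
from scipy.optimize import nnls


def nnls_columns(C, B):
    """Solve min_X ||C X - B||_F subject to X >= 0, one column at a time."""
    X = np.zeros((C.shape[1], B.shape[1]))
    for j in range(B.shape[1]):
        X[:, j], _ = nnls(C, B[:, j])
    return X


def sparse_nmf_anls(A, k, beta=0.1, eta=0.1, n_iter=50, seed=0):
    """Illustrative sparse NMF A ~= W H via alternating NNLS.

    Assumed objective (not taken from the abstract):
        ||A - W H||_F^2 + eta * ||W||_F^2 + beta * sum_j ||H[:, j]||_1^2
    with W >= 0 and H >= 0. The L1 term promotes sparse columns of H.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    W = rng.random((m, k))  # hypothetical choice: uniform random start
    H = np.zeros((k, n))
    for _ in range(n_iter):
        # H step: a row of sqrt(beta) ones adds beta * (sum of each column)^2,
        # which equals the squared L1 norm because H is non-negative.
        C = np.vstack([W, np.sqrt(beta) * np.ones((1, k))])
        B = np.vstack([A, np.zeros((1, n))])
        H = nnls_columns(C, B)
        # W step: sqrt(eta) * I rows add eta * ||W||_F^2, keeping W bounded.
        C = np.vstack([H.T, np.sqrt(eta) * np.eye(k)])
        B = np.vstack([A.T, np.zeros((k, m))])
        W = nnls_columns(C, B).T
    return W, H
```

Calling `sparse_nmf_anls(A, k)` on a non-negative genes-by-samples matrix `A` returns a basis matrix `W` (genes by `k`) and a coefficient matrix `H` (`k` by samples). In a class-discovery setting, one simple convention is to assign each sample to the row of `H` holding its largest coefficient. Larger `beta` values push `H` toward sparser columns at the cost of a looser fit.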