bioNMF: a versatile tool for non-negative matrix factorization in biology
Open Access
- 28 July 2006
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 7 (1) , 366
- https://doi.org/10.1186/1471-2105-7-366
Abstract
In the Bioinformatics field, a great deal of interest has been given to Non-negative matrix factorization technique (NMF), due to its capability of providing new insights and relevant information about the complex latent relationships in experimental data sets. This method, and some of its variants, has been successfully applied to gene expression, sequence analysis, functional characterization of genes and text mining. Even if the interest on this technique by the bioinformatics community has been increased during the last few years, there are not many available simple standalone tools to specifically perform these types of data analysis in an integrated environment. In this work we propose a versatile and user-friendly tool that implements the NMF methodology in different analysis contexts to support some of the most important reported applications of this new methodology. This includes clustering and biclustering gene expression data, protein sequence analysis, text mining of biomedical literature and sample classification using gene expression. The tool, which is named bioNMF, also contains a user-friendly graphical interface to explore results in an interactive manner and facilitate in this way the exploratory data analysis process. bioNMF is a standalone versatile application which does not require any special installation or libraries. It can be used for most of the multiple applications proposed in the bioinformatics field or to support new research using this method. This tool is publicly available at .Keywords
This publication has 26 references indexed in Scilit:
- GenePattern 2.0Nature Genetics, 2006
- High-resolution genomic profiles define distinct clinico-pathogenetic subgroups of multiple myeloma patientsCancer Cell, 2006
- Dimension Reduction for Classification with Gene Expression Microarray DataStatistical Applications in Genetics and Molecular Biology, 2006
- Improving molecular cancer class discovery through sparse non-negative matrix factorizationBioinformatics, 2005
- Two subclasses of lung squamous cell carcinoma with different gene expression profiles and prognosis identified by hierarchical clustering and non-negative matrix factorizationOncogene, 2005
- Biclustering algorithms for biological data analysis: a surveyIEEE/ACM Transactions on Computational Biology and Bioinformatics, 2004
- Biologically valid linear factor models of gene expressionBioinformatics, 2004
- Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression DataGenome Research, 2003
- Coupled two-way clustering analysis of gene microarray dataProceedings of the National Academy of Sciences, 2000
- Functional Discovery via a Compendium of Expression ProfilesCell, 2000