A multi-step approach to time series analysis and gene expression clustering
Open Access
- 5 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (5) , 589-596
- https://doi.org/10.1093/bioinformatics/btk026
Abstract
Motivation: The huge growth in gene expression data calls for the implementation of automatic tools for data processing and interpretation. Results: We present a new and comprehensive machine learning data mining framework consisting in a non-linear PCA neural network for feature extraction, and probabilistic principal surfaces combined with an agglomerative approach based on Negentropy aimed at clustering gene microarray data. The method, which provides a user-friendly visualization interface, can work on noisy data with missing points and represents an automatic procedure to get, with no a priori assumptions, the number of clusters present in the data. Cell-cycle dataset and a detailed analysis confirm the biological nature of the most significant clusters. Availability: The software described here is a subpackage part of the ASTRONEURAL package and is available upon request from the corresponding author. Contact:robtag@unisa.it Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 35 references indexed in Scilit:
- Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networksNature Biotechnology, 2005
- Error Distribution for Gene Expression DataStatistical Applications in Genetics and Molecular Biology, 2005
- Comparison of computational methods for the identification of cell cycle-regulated genesBioinformatics, 2004
- A multifrequency analysis of radio variability of blazarsAstronomy & Astrophysics, 2004
- Fuzzy Neural Network Applied to Gene Expression Profiling for Predicting the Prognosis of Diffuse Large B‐cell LymphomaJapanese Journal of Cancer Research, 2002
- Interactive Exploration of Microarray Gene Expression Patterns in a Reduced Dimensional SpaceGenome Research, 2002
- Assessing Gene Significance from cDNA Microarray Expression Data via Mixed ModelsJournal of Computational Biology, 2001
- Functional Discovery via a Compendium of Expression ProfilesCell, 2000
- Spectral analysis of stellar light curves by means of neural networksAstronomy and Astrophysics Supplement Series, 1999
- Representation and separation of signals using nonlinear PCA type learningNeural Networks, 1994