Graphical Exploration of Gene Expression Data: A Comparative Study of Three Multivariate Methods
- 11 December 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 59 (4) , 1131-1139
- https://doi.org/10.1111/j.0006-341x.2003.00130.x
Abstract
Summary. This article describes three multivariate projection methods and compares them for their ability to identify clusters of biological samples and genes using real‐life data on gene expression levels of leukemia patients. It is shown that principal component analysis (PCA) has the disadvantage that the resulting principal factors are not very informative, while correspondence factor analysis (CFA) has difficulties interpreting distances between objects. Spectral map analysis (SMA) is introduced as an alternative approach to the analysis of microarray data. Weighted SMA outperforms PCA, and is at least as powerful as CFA, in finding clusters in the samples, as well as identifying genes related to these clusters. SMA addresses the problem of data analysis in microarray experiments in a more appropriate manner than CFA, and allows more flexible weighting to the genes and samples. Proper weighting is important, since it enables less reliable data to be down‐weighted and more reliable information to be emphasized.Keywords
This publication has 19 references indexed in Scilit:
- Molecular characterisation of antidepressant effects in the mouse brain using gene expression profilingJournal of Psychiatric Research, 2002
- Using biplots to interpret gene expression patterns in plantsBioinformatics, 2002
- Correspondence analysis applied to microarray dataProceedings of the National Academy of Sciences, 2001
- Identification of the TCL1 gene involved in T-cell malignancies.Proceedings of the National Academy of Sciences, 1994
- Multivariate Ratio Analysis: A Graphical Method for Ecological OrdinationEcology, 1991
- Similarities and differences among multivariate display techniques illustrated by belgian cancer mortality distribution dataChemometrics and Intelligent Laboratory Systems, 1988
- Toward an objective classification of cells in the immune system.Proceedings of the National Academy of Sciences, 1988
- The biplot graphic display of matrices with application to principal component analysisBiometrika, 1971
- Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 1933
- LIII. On lines and planes of closest fit to systems of points in spaceJournal of Computers in Education, 1901