A new optimization criterion for generalized discriminant analysis on undersampled problems
- 23 April 2004
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
A new optimization criterion for discriminant analysis is presented. The new criterion extends the optimization criteria of the classical linear discriminant analysis (LDA) by introducing the pseudo-inverse when the scatter matrices are singular. It is applicable regardless of the relative sizes of the data dimension and sample size, overcoming a limitation of the classical LDA. Recently, a new algorithm called LDA/GSVD for structure-preserving dimension reduction has been introduced, which extends the classical LDA to very high-dimensional undersampled problems by using the generalized singular value decomposition (GSVD). The solution from the LDA/GSVD algorithm is a special case of the solution for our generalized criterion, which is also based on GSVD. We also present an approximate solution for our GSVD-based solution, which reduces computational complexity by finding subclusters of each cluster, and using their centroids to capture the structure of each cluster. This reduced problem yields much smaller matrices of which the GSVD can be applied efficiently. Experiments on text data, with up to 7000 dimensions, show that the approximation algorithm produces results that are close to those produced by the exact algorithm.Keywords
This publication has 8 references indexed in Scilit:
- A new optimization criterion for generalized discriminant analysis on undersampled problemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Structure Preserving Dimension Reduction for Clustered Text Data Based on the Generalized Singular Value DecompositionSIAM Journal on Matrix Analysis and Applications, 2003
- Lower Dimensional Representation of Text Data Based on Centroids and Least SquaresBIT Numerical Mathematics, 2003
- Concept Decompositions for Large Sparse Text Data Using ClusteringMachine Learning, 2001
- Using Linear Algebra for Intelligent Information RetrievalSIAM Review, 1995
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- Towards a Generalized Singular Value DecompositionSIAM Journal on Numerical Analysis, 1981
- Generalizing the Singular Value DecompositionSIAM Journal on Numerical Analysis, 1976