Bayesian learning of sparse classifiers
- 25 August 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (10636919)
- https://doi.org/10.1109/cvpr.2001.990453
Abstract
Bayesian approaches to supervised learning use priors on the classifier parameters. However, few priors aim at achieving "sparse" classifiers, where irrelevant/redundant parameters are automatically set to zero. Two well-known ways of obtaining sparse classifiers are: use a zero-mean Laplacian prior on the parameters, and the "support vector machine" (SVM). Whether one uses a Laplacian prior or an SVM, one still needs to specify/estimate the parameters that control the degree of sparseness of the resulting classifiers. We propose a Bayesian approach to learning sparse classifiers which does not involve any parameters controlling the degree of sparseness. This is achieved by a hierarchical-Bayes interpretation of the Laplacian prior, followed by the adoption of a Jeffreys' non-informative hyper-prior Implementation is carried out by an EM algorithm. Experimental evaluation of the proposed method shows that it performs competitively with (often better than) the best classification techniques available.Keywords
This publication has 22 references indexed in Scilit:
- Regression Shrinkage and Selection via The Lasso: A RetrospectiveJournal of the Royal Statistical Society Series B: Statistical Methodology, 2011
- Wavelet-based image estimation: an empirical Bayes approach using Jeffrey's noninformative priorIEEE Transactions on Image Processing, 2001
- Statistical pattern recognition: a reviewPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Atomic Decomposition by Basis PursuitSIAM Journal on Scientific Computing, 1998
- Bayesian classification with Gaussian processesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1998
- Bayesian Regularization and Pruning Using a Laplace PriorNeural Computation, 1995
- Normal/Independent Distributions and Their Applications in Robust RegressionJournal of Computational and Graphical Statistics, 1993
- Bayesian Analysis of Binary and Polychotomous Response DataJournal of the American Statistical Association, 1993
- Networks for approximation and learningProceedings of the IEEE, 1990
- A Correspondence Between Bayesian Estimation on Stochastic Processes and Smoothing by SplinesThe Annals of Mathematical Statistics, 1970