Consistency of data-driven histogram methods for density estimation and classification
Open Access
- 1 April 1996
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 24 (2) , 687-706
- https://doi.org/10.1214/aos/1032894460
Abstract
We present general sufficient conditions for the almost sure $L_1$-consistency of histogram density estimates based on data-dependent partitions. Analogous conditions guarantee the almost-sure risk consistency of histogram classification schemes based on data-dependent partitions. Multivariate data are considered throughout. In each case, the desired consistency requires shrinking cells, subexponential growth of a combinatorial complexity measure and sublinear growth of the number of cells. It is not required that the cells of every partition be rectangles with sides parallel to the coordinate axis or that each cell contain a minimum number of points. No assumptions are made concerning the common distribution of the training vectors. We apply the results to establish the consistency of several known partitioning estimates, including the $k_n$-spacing density estimate, classifiers based on statistically equivalent blocks and classifiers based on multivariate clustering schemes.Keywords
This publication has 15 references indexed in Scilit:
- Histogram regression estimation using data-dependent partitionsThe Annals of Statistics, 1996
- Rates of convergence in the source coding theorem, in empirical quantizer design, and in universal lossy source codingIEEE Transactions on Information Theory, 1994
- Automatic pattern recognition: a study of the probability of errorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1988
- Almost sure L1-norm convergence for data-based histogram density estimatesJournal of Multivariate Analysis, 1987
- Almost surely consistent nonparametric regression from recursive partitioning schemesJournal of Multivariate Analysis, 1984
- Consistent nonparametric regression from recursive partitioning schemesJournal of Multivariate Analysis, 1980
- Asymptotically Efficient Solutions to the Classification ProblemThe Annals of Statistics, 1978
- Consistent Nonparametric RegressionThe Annals of Statistics, 1977
- A histogram method of density estimationCommunications in Statistics, 1973
- A Consistent Nonparametric Multivariate Density Estimator Based on Statistically Equivalent BlocksThe Annals of Mathematical Statistics, 1970