On Optimal and Data-Based Histograms
- 1 December 1979
- journal article
- Published by JSTOR in Biometrika
- Vol. 66 (3) , 605-610
- https://doi.org/10.2307/2335182
Abstract
In this paper the formula for the optimal histogram bin width is derived which asymptotically minimizes the integrated mean squared error. Monte Carlo methods are used to verify the usefulness of this formula for small samples. A data-based procedure for choosing the bin width parameter is proposed, which assumes a Gaussian reference standard and requires only the sample size and an estimate of the standard deviation. The sensitivity of the procedure is investigated using several probability models which violate the Gaussian assumption.Keywords
This publication has 0 references indexed in Scilit: