Data analysis recipes: Choosing the binning for a histogram
Data Analysis, Statistics and Probability
2008-07-31 v1 Astrophysics
Abstract
Data points are placed in bins when a histogram is created, but there is always a decision to be made about the number or width of the bins. This decision is often made arbitrarily or subjectively, but it need not be. A jackknife or leave-one-out cross-validation likelihood is defined and employed as a scalar objective function for optimization of the locations and widths of the bins. The objective is justified as being related to the histogram's usefulness for predicting future data. The method works for data or histograms of any dimensionality.
Keywords
Cite
@article{arxiv.0807.4820,
title = {Data analysis recipes: Choosing the binning for a histogram},
author = {David W. Hogg},
journal= {arXiv preprint arXiv:0807.4820},
year = {2008}
}
Comments
not submitted anywhere but here