English

Data analysis recipes: Choosing the binning for a histogram

Data Analysis, Statistics and Probability 2008-07-31 v1 Astrophysics

Abstract

Data points are placed in bins when a histogram is created, but there is always a decision to be made about the number or width of the bins. This decision is often made arbitrarily or subjectively, but it need not be. A jackknife or leave-one-out cross-validation likelihood is defined and employed as a scalar objective function for optimization of the locations and widths of the bins. The objective is justified as being related to the histogram's usefulness for predicting future data. The method works for data or histograms of any dimensionality.

Keywords

Cite

@article{arxiv.0807.4820,
  title  = {Data analysis recipes: Choosing the binning for a histogram},
  author = {David W. Hogg},
  journal= {arXiv preprint arXiv:0807.4820},
  year   = {2008}
}

Comments

not submitted anywhere but here

R2 v1 2026-06-21T11:05:48.759Z