English

From Data to Probability Densities without Histograms

Data Analysis, Statistics and Probability 2009-11-13 v1 Statistical Mechanics High Energy Physics - Lattice Computational Physics

Abstract

When one deals with data drawn from continuous variables, a histogram is often inadequate to display their probability density. It deals inefficiently with statistical noise, and binsizes are free parameters. In contrast to that, the empirical cumulative distribution function (obtained after sorting the data) is parameter free. But it is a step function, so that its differentiation does not give a smooth probability density. Based on Fourier series expansion and Kolmogorov tests, we introduce a simple method, which overcomes this problem. Error bars on the estimated probability density are calculated using a jackknife method. We give several examples and provide computer code reproducing them. You may want to look at the corresponding figures 4 to 9 first.

Keywords

Cite

@article{arxiv.0712.3852,
  title  = {From Data to Probability Densities without Histograms},
  author = {Bernd A. Berg and Robert C. Harris},
  journal= {arXiv preprint arXiv:0712.3852},
  year   = {2009}
}

Comments

9 pages, 9 figures

R2 v1 2026-06-21T09:57:06.302Z