English
Related papers

Related papers: Optimal Data-Based Binning for Histograms

200 papers

The histogram is an analysis tool in widespread use within many sciences, with high energy physics as a prime example. However, there exists an inherent bias in the choice of binning for the histogram, with different choices potentially…

Data Analysis, Statistics and Probability · Physics 2014-05-21 Abram Krislock , Nathan Krislock

Rank histograms are popular tools for assessing the reliability of meteorological ensemble forecast systems. A reliable forecast system leads to a uniform rank histogram, and deviations from uniformity can indicate miscalibrations. However,…

Applications · Statistics 2022-09-30 Claudio Heinrich

The histogram method is a powerful non-parametric approach for estimating the probability density function of a continuous variable. But the construction of a histogram, compared to the parametric approaches, demands a large number of…

Machine Learning · Statistics 2015-12-29 Hideaki Kim , Hiroshi Sawada

Data points are placed in bins when a histogram is created, but there is always a decision to be made about the number or width of the bins. This decision is often made arbitrarily or subjectively, but it need not be. A jackknife or…

Data Analysis, Statistics and Probability · Physics 2008-07-31 David W. Hogg

The histogram is widely used as a simple, exploratory display of data, but it is usually not clear how to choose the number and size of bins. We construct a confidence set of distribution functions that optimally address the two main tasks…

Statistics Theory · Mathematics 2020-02-13 Housen Li , Axel Munk , Hannes Sieling , Guenther Walther

This paper presents a quantitative user study to evaluate how well users can visually perceive the underlying data distribution from a histogram representation. We used different sample and bin sizes and four different distributions…

Human-Computer Interaction · Computer Science 2021-09-15 Raphael Sahann , Torsten Möller , Johanna Schmidt

We present sparse tree-based and list-based density estimation methods for binary/categorical data. Our density estimation models are higher dimensional analogies to variable bin width histograms. In each leaf of the tree (or list), the…

Machine Learning · Statistics 2023-11-16 Siong Thye Goh , Lesia Semenova , Cynthia Rudin

When one deals with data drawn from continuous variables, a histogram is often inadequate to display their probability density. It deals inefficiently with statistical noise, and binsizes are free parameters. In contrast to that, the…

Data Analysis, Statistics and Probability · Physics 2009-11-13 Bernd A. Berg , Robert C. Harris

The simplicity and expressiveness of a histogram render it a useful feature in different contexts including deep learning. Although the process of computing a histogram is non-differentiable, researchers have proposed differentiable…

Machine Learning · Computer Science 2020-12-14 Ibrahim Yusuf , George Igwegbe , Oluwafemi Azeez

Context. Visualization of 2D distributions is an essential task, commonly done with a 2D histogram. The histogram is built by subdividing the sample space into regions and color-coding the number of samples in each region. Aims. We aim to…

Instrumentation and Methods for Astrophysics · Physics 2026-04-02 Igor Vaiman

The histogram is a key method for visualizing data and estimating the underlying probability distribution. Incorrect conclusions about the data result from over or under-binning. A new method based on the Shannon entropy of the histogram…

Data Analysis, Statistics and Probability · Physics 2022-10-07 Stephen Watts , Lisa Crow

Accurate calibration of probabilistic predictive models learned is critical for many practical prediction and decision-making tasks. There are two main categories of methods for building calibrated classifiers. One approach is to develop…

Machine Learning · Statistics 2014-01-16 Mahdi Pakdaman Naeini , Gregory F. Cooper , Milos Hauskrecht

We propose a new method of histogram construction, providing a fully Bayesian approach to irregular histograms. Our procedure applies Bayesian model selection to a piecewise constant model of the underlying distribution, resulting in a…

Methodology · Statistics 2026-03-13 Oskar Høgberg Simensen , Dennis Christensen , Nils Lid Hjort

Density Estimation is one of the central areas of statistics whose purpose is to estimate the probability density function underlying the observed data. It serves as a building block for many tasks in statistical inference, visualization,…

Machine Learning · Statistics 2019-04-02 Zhipeng Wang , David W. Scott

Weighted histogram in Monte-Carlo simulations is often used for the estimation of a probability density function. It is obtained as a result of random experiment with random events that have weights. In this paper the bin contents of…

Data Analysis, Statistics and Probability · Physics 2008-11-28 N. D. Gagunashvili

When reading peer-reviewed scientific literature describing any analysis of empirical data, it is natural and correct to proceed with the underlying assumption that experiments have made good faith efforts to ensure that their analyses…

Data Analysis, Statistics and Probability · Physics 2012-09-13 S. Towers

In this work, we investigate the statistical computation of the Boltzmann entropy of statistical samples. For this purpose, we use both histogram and kernel function to estimate the probability density function of statistical samples. We…

Methodology · Statistics 2015-06-23 Ning Sui , Min Li , Ping He

Predictions are often probabilities; e.g., a prediction could be for precipitation tomorrow, but with only a 30% chance. Given such probabilistic predictions together with the actual outcomes, "reliability diagrams" help detect and diagnose…

Statistics Theory · Mathematics 2022-11-15 Imanol Arrieta-Ibarra , Paman Gujral , Jonathan Tannen , Mark Tygert , Cherie Xu

We investigate the problem of testing whether a discrete probability distribution over an ordered domain is a histogram on a specified number of bins. One of the most common tools for the succinct approximation of data, $k$-histograms over…

Data Structures and Algorithms · Computer Science 2022-07-15 Clément L. Canonne , Ilias Diakonikolas , Daniel M. Kane , Sihan Liu

Reliable density estimation is fundamental for numerous applications in statistics and machine learning. In many practical scenarios, data are best modeled as mixtures of component densities that capture complex and multimodal patterns.…

Machine Learning · Computer Science 2025-09-30 Mustafa Musab , Joseph K. Chege , Arie Yeredor , Martin Haardt
‹ Prev 1 2 3 10 Next ›