Related papers: Density estimation using the perceptron

Density Estimation via Discrepancy

Given i.i.d samples from some unknown continuous density on hyper-rectangle $[0, 1]^d$, we attempt to learn a piecewise constant function that approximates this underlying density non-parametrically. Our density estimate is defined on a…

Machine Learning · Statistics 2015-09-24 Kun Yang , Hao Su , Wing Hung Wang

Density Estimation via Discrepancy Based Adaptive Sequential Partition

Given $iid$ observations from an unknown absolute continuous distribution defined on some domain $\Omega$, we propose a nonparametric method to learn a piecewise constant function to approximate the underlying probability density function.…

Machine Learning · Statistics 2018-03-13 Dangna Li , Kun Yang , Wing Hung Wong

Generative diffusion for perceptron problems: statistical physics analysis and efficient algorithms

We consider random instances of non-convex perceptron problems in the high-dimensional limit of a large number of examples $M$ and weights $N$, with finite load $\alpha = M/N$. We develop a formalism based on replica theory to predict the…

Disordered Systems and Neural Networks · Physics 2026-02-11 Elizaveta Demyanenko , Davide Straziota , Carlo Baldassi , Carlo Lucibello

Relative Density-Ratio Estimation for Robust Distribution Comparison

Divergence estimators based on direct approximation of density-ratios without going through separate approximation of numerator and denominator densities have been successfully applied to machine learning tasks that involve distribution…

Machine Learning · Statistics 2011-06-24 Makoto Yamada , Taiji Suzuki , Takafumi Kanamori , Hirotaka Hachiya , Masashi Sugiyama

Statistical Inference for Generative Models with Maximum Mean Discrepancy

While likelihood-based inference and its variants provide a statistically efficient and widely applicable approach to parametric inference, their application to models involving intractable likelihoods poses challenges. In this work, we…

Methodology · Statistics 2019-06-17 Francois-Xavier Briol , Alessandro Barp , Andrew B. Duncan , Mark Girolami

Diffusion Models are Minimax Optimal Distribution Estimators

While efficient distribution learning is no doubt behind the groundbreaking success of diffusion modeling, its theoretical guarantees are quite limited. In this paper, we provide the first rigorous analysis on approximation and…

Machine Learning · Statistics 2023-03-06 Kazusato Oko , Shunta Akiyama , Taiji Suzuki

The Perceptron with Dynamic Margin

The classical perceptron rule provides a varying upper bound on the maximum margin, namely the length of the current weight vector divided by the total number of updates up to that time. Requiring that the perceptron updates its internal…

Machine Learning · Computer Science 2011-05-31 Constantinos Panagiotakopoulos , Petroula Tsampouka

On the Observability of Gaussian Models using Discrete Density Approximations

This paper proposes a novel method for testing observability in Gaussian models using discrete density approximations (deterministic samples) of (multivariate) Gaussians. Our notion of observability is defined by the existence of the…

Systems and Control · Electrical Eng. & Systems 2022-08-19 Ariane Hanebeck , Claudia Czado

Estimation of time series by Maximum Mean Discrepancy

We define two minimum distance estimators for dependent data by minimizing some approximated Maximum Mean Discrepancy distances between the true empirical distribution of observations and their assumed (parametric) model distribution. When…

Methodology · Statistics 2026-01-19 Pierre Alquier , Jean-David Fermanian , Benjamin Poignard

Bias Detection via Maximum Subgroup Discrepancy

Bias evaluation is fundamental to trustworthy AI, both in terms of checking data quality and in terms of checking the outputs of AI systems. In testing data quality, for example, one may study the distance of a given dataset, viewed as a…

Machine Learning · Computer Science 2025-06-12 Jiří Němeček , Mark Kozdoba , Illia Kryvoviaz , Tomáš Pevný , Jakub Mareček

A new method for estimation and model selection: $\rho$-estimation

The aim of this paper is to present a new estimation procedure that can be applied in many statistical frameworks including density and regression and which leads to both robust and optimal (or nearly optimal) estimators. In density…

Statistics Theory · Mathematics 2017-01-23 Yannick Baraud , Lucien Birgé , Mathieu Sart

Sample-Optimal Density Estimation in Nearly-Linear Time

We design a new, fast algorithm for agnostically learning univariate probability distributions whose densities are well approximated by piecewise polynomial functions. Let $f$ be the density function of an arbitrary univariate distribution,…

Data Structures and Algorithms · Computer Science 2015-06-03 Jayadev Acharya , Ilias Diakonikolas , Jerry Li , Ludwig Schmidt

Minimum Local Distance Density Estimation

We present a local density estimator based on first order statistics. To estimate the density at a point, $x$, the original sample is divided into subsets and the average minimum sample distance to $x$ over all such subsets is used to…

Methodology · Statistics 2014-12-10 Vikram V. Garg , Luis Tenorio , Karen Willcox

Robust and Efficient Estimation in Ordinal Response Models using the Density Power Divergence

In real life, we frequently come across data sets that involve some independent explanatory variable(s) generating a set of ordinal responses. These ordinal responses may correspond to an underlying continuous latent variable, which is…

Methodology · Statistics 2024-01-08 Arijit Pyne , Subhrajyoty Roy , Abhik Ghosh , Ayanendranath Basu

Optimal Estimation under a Semiparametric Density Ratio Model

In many statistical and econometric applications, we gather individual samples from various interconnected populations that undeniably exhibit common latent structures. Utilizing a model that incorporates these latent structures for such…

Methodology · Statistics 2023-09-19 Archer Gong Zhang , Jiahua Chen

Optimal estimation of the null distribution in large-scale inference

The advent of large-scale inference has spurred reexamination of conventional statistical thinking. In a Gaussian model for $n$ many $z$-scores with at most $k < \frac{n}{2}$ nonnulls, Efron suggests estimating the location and scale…

Statistics Theory · Mathematics 2025-01-15 Subhodh Kotekal , Chao Gao

Density estimation via mixture discrepancy and moments

With the aim of generalizing histogram statistics to higher dimensional cases, density estimation via discrepancy based sequential partition (DSP) has been proposed to learn an adaptive piecewise constant approximation defined on a binary…

Machine Learning · Statistics 2025-12-23 Zhengyang Lei , Lirong Qu , Sihong Shao , Yunfeng Xiong

Efficient Discrepancy Testing for Learning with Distribution Shift

A fundamental notion of distance between train and test distributions from the field of domain adaptation is discrepancy distance. While in general hard to compute, here we provide the first set of provably efficient algorithms for testing…

Data Structures and Algorithms · Computer Science 2024-06-14 Gautam Chandrasekaran , Adam R. Klivans , Vasilis Kontonis , Konstantinos Stavropoulos , Arsen Vasilyan

Efficient Density Estimation via Piecewise Polynomial Approximation

We give a highly efficient "semi-agnostic" algorithm for learning univariate probability distributions that are well approximated by piecewise polynomial density functions. Let $p$ be an arbitrary distribution over an interval $I$ which is…

Machine Learning · Computer Science 2013-05-15 Siu-On Chan , Ilias Diakonikolas , Rocco A. Servedio , Xiaorui Sun

Storage capacity of perceptron with variable selection

A central challenge in machine learning is to distinguish genuine structure from chance correlations in high-dimensional data. In this work, we address this issue for the perceptron, a foundational model of neural computation. Specifically,…

Information Theory · Computer Science 2025-12-02 Yingying Xu , Masayuki Ohzeki , Yoshiyuki Kabashima