Related papers: Efficient Density Estimation via Piecewise Polynom…

Sample-Optimal Density Estimation in Nearly-Linear Time

We design a new, fast algorithm for agnostically learning univariate probability distributions whose densities are well approximated by piecewise polynomial functions. Let $f$ be the density function of an arbitrary univariate distribution,…

Data Structures and Algorithms · Computer Science 2015-06-03 Jayadev Acharya, Ilias Diakonikolas, Jerry Li, Ludwig Schmidt

Density Estimation via Discrepancy Based Adaptive Sequential Partition

Given i.i.d. observations from an unknown absolutely continuous distribution defined on some domain $\Omega$, we propose a nonparametric method to learn a piecewise constant function to approximate the underlying probability density function.…

Machine Learning · Statistics 2018-03-13 Dangna Li, Kun Yang, Wing Hung Wong

Near-Optimal Density Estimation in Near-Linear Time Using Variable-Width Histograms

Let $p$ be an unknown and arbitrary probability distribution over $[0,1)$. We consider the problem of density estimation, in which a learning algorithm is given i.i.d. draws from $p$ and must (with high probability) output a…

Machine Learning · Computer Science 2014-11-04 Siu-On Chan, Ilias Diakonikolas, Rocco A. Servedio, Xiaorui Sun

Approximating L1-distances between mixture distributions using random projections

We consider the problem of computing L1-distances between every pair of probability densities from a given family. We point out that the technique of Cauchy random projections (Indyk'06) in this context turns into stochastic integrals with…

Data Structures and Algorithms · Computer Science 2008-04-09 Satyaki Mahalanabis, Daniel Stefankovic

Piecewise polynomial approximation of probability density functions with application to uncertainty quantification for stochastic PDEs

The probability density function (PDF) associated with a given set of samples is approximated by a piecewise-linear polynomial constructed with respect to a binning of the sample space. The kernel functions are a compactly supported basis…

Numerical Analysis · Mathematics 2020-08-04 Giacomo Capodaglio, Max Gunzburger

Faster and Sample Near-Optimal Algorithms for Proper Learning Mixtures of Gaussians

We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians without any separability assumptions. Given $\tilde{O}(1/\varepsilon^2)$ samples from an unknown mixture, our algorithm outputs a mixture that is…

Data Structures and Algorithms · Computer Science 2014-05-20 Constantinos Daskalakis, Gautam Kamath

Efficient Robust Proper Learning of Log-concave Distributions

We study the robust proper learning of univariate log-concave distributions (over continuous and discrete domains). Given a set of samples drawn from an unknown target distribution, we want to compute a log-concave hypothesis…

Data Structures and Algorithms · Computer Science 2016-06-10 Ilias Diakonikolas, Daniel M. Kane, Alistair Stewart

SURF: A Simple, Universal, Robust, Fast Distribution Learning Algorithm

Sample- and computationally-efficient distribution estimation is a fundamental tenet in statistics and machine learning. We present SURF, an algorithm for approximating distributions by piecewise polynomials. SURF is: simple, replacing…

Machine Learning · Statistics 2021-02-15 Yi Hao, Ayush Jain, Alon Orlitsky, Vaishakh Ravindrakumar

Contrastive Moments: Unsupervised Halfspace Learning in Polynomial Time

We give a polynomial-time algorithm for learning high-dimensional halfspaces with margins in $d$-dimensional space to within desired TV distance when the ambient distribution is an unknown affine transformation of the $d$-fold product of an…

Machine Learning · Computer Science 2023-11-03 Xinyuan Cao, Santosh S. Vempala

Density Estimation via Discrepancy

Given i.i.d. samples from some unknown continuous density on the hyper-rectangle $[0, 1]^d$, we attempt to learn a piecewise constant function that approximates this underlying density non-parametrically. Our density estimate is defined on a…

Machine Learning · Statistics 2015-09-24 Kun Yang, Hao Su, Wing Hung Wong

Density Estimation with Distribution Element Trees

The estimation of probability densities based on available data is a central task in many statistical applications. Especially in the case of large ensembles with many samples or high-dimensional sample spaces, computationally efficient…

Methodology · Statistics 2017-05-04 Daniel W. Meyer

Almost-Uniform Sampling of Points on High-Dimensional Algebraic Varieties

We consider the problem of uniform sampling of points on an algebraic variety. Specifically, we develop a randomized algorithm that, given a small set of multivariate polynomials over a sufficiently large finite field, produces a common…

Data Structures and Algorithms · Computer Science 2009-02-10 Mahdi Cheraghchi, Amin Shokrollahi

Learning $k$-Modal Distributions via Testing

A $k$-modal probability distribution over the discrete domain $\{1,...,n\}$ is one whose histogram has at most $k$ "peaks" and "valleys." Such distributions are natural generalizations of monotone ($k=0$) and unimodal ($k=1$) probability…

Data Structures and Algorithms · Computer Science 2014-09-16 Constantinos Daskalakis, Ilias Diakonikolas, Rocco A. Servedio

Rapid parametric density estimation

Parametric density estimation, for example as a Gaussian distribution, is the basis of the field of statistics. Machine learning requires inexpensive estimation of much more complex densities, and the basic approach is relatively costly…

Machine Learning · Computer Science 2017-02-21 Jarek Duda

Settling the Polynomial Learnability of Mixtures of Gaussians

Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate the mixture parameters. We give an algorithm for this problem whose running time and data requirement are polynomial in the dimension and…

Machine Learning · Computer Science 2010-04-27 Ankur Moitra, Gregory Valiant

Unified Sample-Optimal Property Estimation in Near-Linear Time

We consider the fundamental learning problem of estimating properties of distributions over large domains. Using a novel piecewise-polynomial approximation technique, we derive the first unified methodology for constructing sample- and…

Machine Learning · Computer Science 2020-03-18 Yi Hao, Alon Orlitsky

On efficient algorithms for computing near-best polynomial approximations to high-dimensional, Hilbert-valued functions from limited samples

Sparse polynomial approximation has become indispensable for approximating smooth, high- or infinite-dimensional functions from limited samples. This is a key task in computational science and engineering, e.g., surrogate modelling in…

Numerical Analysis · Mathematics 2023-11-08 Ben Adcock, Simone Brugiapaglia, Nick Dexter, Sebastian Moraga

Piecewise Analysis of Probabilistic Programs via $k$-Induction

In probabilistic program analysis, quantitative analysis aims at deriving tight numerical bounds for probabilistic properties such as expectation and assertion probability. Most previous works consider numerical bounds over the whole…

Programming Languages · Computer Science 2026-01-06 Tengshun Yang, Shenghua Feng, Hongfei Fu, Naijun Zhan, Jingyu Ke, Shiyang Wu

Computing the Invariant Measure and the Lyapunov Exponent for One-Dimensional Maps using a Measure-Preserving Polynomial Basis

We consider a generalisation of Ulam's method for approximating invariant densities of one-dimensional chaotic maps. Rather than use piecewise constant polynomials to approximate the density, we use polynomials of degree n which are defined…

Numerical Analysis · Mathematics 2011-11-28 Philip J. Aston, Oliver Junge

Optimal Algorithms and Lower Bounds for Testing Closeness of Structured Distributions

We give a general unified method that can be used for $L_1$ closeness testing of a wide range of univariate structured distribution families. More specifically, we design a sample optimal and computationally efficient algorithm for…

Data Structures and Algorithms · Computer Science 2015-08-25 Ilias Diakonikolas, Daniel M. Kane, Vladimir Nikishkin