Related papers: The Optimal Approximation Factor in Density Estima…

Statistically Near-Optimal Hypothesis Selection

Hypothesis Selection is a fundamental distribution learning problem where given a comparator-class $Q=\{q_1,\ldots, q_n\}$ of distributions, and a sampling access to an unknown target distribution $p$, the goal is to output a distribution…

Machine Learning · Computer Science 2021-08-19 Olivier Bousquet , Mark Braverman , Klim Efremenko , Gillat Kol , Shay Moran

Near-Optimal Density Estimation in Near-Linear Time Using Variable-Width Histograms

Let $p$ be an unknown and arbitrary probability distribution over $[0,1)$. We consider the problem of {\em density estimation}, in which a learning algorithm is given i.i.d. draws from $p$ and must (with high probability) output a…

Machine Learning · Computer Science 2014-11-04 Siu-On Chan , Ilias Diakonikolas , Rocco A. Servedio , Xiaorui Sun

Nearly-Linear Time Private Hypothesis Selection with the Optimal Approximation Factor

Estimating the density of a distribution from its samples is a fundamental problem in statistics. Hypothesis selection addresses the setting where, in addition to a sample set, we are given $n$ candidate distributions -- referred to as…

Data Structures and Algorithms · Computer Science 2025-10-23 Maryam Aliakbarpour , Zhan Shi , Ria Stevens , Vincent X. Wang

Optimal Algorithms for Augmented Testing of Discrete Distributions

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour , Piotr Indyk , Ronitt Rubinfeld , Sandeep Silwal

Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity

The current paper studies the problem of agnostic $Q$-learning with function approximation in deterministic systems where the optimal $Q$-function is approximable by a function in the class $\mathcal{F}$ with approximation error $\delta \ge…

Machine Learning · Computer Science 2020-02-18 Simon S. Du , Jason D. Lee , Gaurav Mahajan , Ruosong Wang

How fast can you find a good hypothesis?

In the hypothesis selection problem, we are given sample and query access to finite set of candidate distributions (hypotheses), $\mathcal{H} = \{H_1, \ldots, H_n\}$, and samples from an unknown distribution $P$, both over a domain…

Data Structures and Algorithms · Computer Science 2025-11-12 Anders Aamand , Maryam Aliakbarpour , Justin Y. Chen , Sandeep Silwal

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Algorithms often carry out equally many computations for "easy" and "hard" problem instances. In particular, algorithms for finding nearest neighbors typically have the same running time regardless of the particular problem instance. In…

Data Structures and Algorithms · Computer Science 2020-03-25 Daniel LeJeune , Richard G. Baraniuk , Reinhard Heckel

Density estimation with quadratic loss: a confidence intervals method

In a previous article, a least square regression estimation procedure was proposed: first, we condiser a family of functions and study the properties of an estimator in every unidimensionnal model defined by one of these functions; we then…

Statistics Theory · Mathematics 2007-06-13 Pierre Alquier

Near-Optimal Relative Error Streaming Quantile Estimation via Elastic Compactors

Computing the approximate quantiles or ranks of a stream is a fundamental task in data monitoring. Given a stream of elements $x_1, x_2, \dots, x_n$ and a query $x$, a relative-error quantile estimation algorithm can estimate the rank of…

Data Structures and Algorithms · Computer Science 2024-11-05 Elena Gribelyuk , Pachara Sawettamalya , Hongxun Wu , Huacheng Yu

Instance-Optimal Private Density Estimation in the Wasserstein Distance

Estimating the density of a distribution from samples is a fundamental problem in statistics. In many practical settings, the Wasserstein distance is an appropriate error metric for density estimation. For example, when estimating…

Machine Learning · Computer Science 2024-07-01 Vitaly Feldman , Audra McMillan , Satchit Sivakumar , Kunal Talwar

Improved Algorithms for Adaptive Compressed Sensing

In the problem of adaptive compressed sensing, one wants to estimate an approximately $k$-sparse vector $x\in\mathbb{R}^n$ from $m$ linear measurements $A_1 x, A_2 x,\ldots, A_m x$, where $A_i$ can be chosen based on the outcomes $A_1…

Data Structures and Algorithms · Computer Science 2018-04-26 Vasileios Nakos , Xiaofei Shi , David P. Woodruff , Hongyang Zhang

An adaptive procedure for Fourier estimators: illustration to deconvolution and decompounding

We introduce a new procedure to select the optimal cutoff parameter for Fourier density estimators that leads to adaptive rate optimal estimators, up to a logarithmic factor. This adaptive procedure applies for different inverse problems.…

Statistics Theory · Mathematics 2018-02-15 Céline Duval , Johanna Kappus

Approximating Dense Max 2-CSPs

In this paper, we present a polynomial-time algorithm that approximates sufficiently high-value Max 2-CSPs on sufficiently dense graphs to within $O(N^{\varepsilon})$ approximation ratio for any constant $\varepsilon > 0$. Using this…

Data Structures and Algorithms · Computer Science 2015-07-31 Pasin Manurangsi , Dana Moshkovitz

Optimal Recovery of Local Truth

Probability mass curves the data space with horizons. Let f be a multivariate probability density function with continuous second order partial derivatives. Consider the problem of estimating the true value of f(z) > 0 at a single point z,…

Data Analysis, Statistics and Probability · Physics 2009-11-06 Carlos C. Rodriguez

Optimal Algorithms and Lower Bounds for Testing Closeness of Structured Distributions

We give a general unified method that can be used for $L_1$ {\em closeness testing} of a wide range of univariate structured distribution families. More specifically, we design a sample optimal and computationally efficient algorithm for…

Data Structures and Algorithms · Computer Science 2015-08-25 Ilias Diakonikolas , Daniel M. Kane , Vladimir Nikishkin

Probability Density Estimation via Optimal Control

We employ optimal control theory to study the problem of estimating the probability density function from a data set originating from an unknown probability distribution. The original variational problem is reformulated as a multi-stage…

Optimization and Control · Mathematics 2025-10-02 Markus Hegland , C. Yalçın Kaya

Optimal and Near-Optimal Adaptive Vector Quantization

Quantization is a fundamental optimization for many machine-learning use cases, including compressing gradients, model weights and activations, and datasets. The most accurate form of quantization is \emph{adaptive}, where the error is…

Machine Learning · Computer Science 2025-08-01 Ran Ben-Basat , Yaniv Ben-Itzhak , Michael Mitzenmacher , Shay Vargaftik

Constant-Factor Approximation for the Uniform Decision Tree

We resolve a long-standing open question, about the existence of a constant-factor approximation algorithm for the average-case \textsc{Decision Tree} problem with uniform probability distribution over the hypotheses. We answer the question…

Data Structures and Algorithms · Computer Science 2026-04-29 Michał Szyfelbein

TURF: A Two-factor, Universal, Robust, Fast Distribution Learning Algorithm

Approximating distributions from their samples is a canonical statistical-learning problem. One of its most powerful and successful modalities approximates every distribution to an $\ell_1$ distance essentially at most a constant times…

Machine Learning · Statistics 2022-06-22 Yi Hao , Ayush Jain , Alon Orlitsky , Vaishakh Ravindrakumar

A Faster Algorithm for Max Cut in Dense Graphs

We design an algorithm for approximating the size of \emph{Max Cut} in dense graphs. Given a proximity parameter $\varepsilon \in (0,1)$, our algorithm approximates the size of \emph{Max Cut} of a graph $G$ with $n$ vertices, within an…

Data Structures and Algorithms · Computer Science 2021-12-21 Arijit Ghosh , Gopinath Mishra , Rahul Raychaudhury , Sayantan Sen