Related papers: Almost Optimal Proper Learning and Testing Polynom…

Properly Learning Poisson Binomial Distributions in Almost Polynomial Time

We give an algorithm for properly learning Poisson binomial distributions. A Poisson binomial distribution (PBD) of order $n$ is the discrete probability distribution of the sum of $n$ mutually independent Bernoulli random variables. Given…

Data Structures and Algorithms · Computer Science 2015-11-13 Ilias Diakonikolas , Daniel M. Kane , Alistair Stewart

Learning stochastic decision trees

We give a quasipolynomial-time algorithm for learning stochastic decision trees that is optimally resilient to adversarial noise. Given an $\eta$-corrupted set of uniform random samples labeled by a size-$s$ stochastic decision tree, our…

Machine Learning · Computer Science 2021-05-11 Guy Blanc , Jane Lange , Li-Yang Tan

Faster and Sample Near-Optimal Algorithms for Proper Learning Mixtures of Gaussians

We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians without any separability assumptions. Given $\tilde{O}(1/\varepsilon^2)$ samples from an unknown mixture, our algorithm outputs a mixture that is…

Data Structures and Algorithms · Computer Science 2014-05-20 Constantinos Daskalakis , Gautam Kamath

Efficient active learning of sparse halfspaces with arbitrary bounded noise

We study active learning of homogeneous $s$-sparse halfspaces in $\mathbb{R}^d$ under the setting where the unlabeled data distribution is isotropic log-concave and each label is flipped with probability at most $\eta$ for a parameter $\eta…

Machine Learning · Computer Science 2021-08-16 Chicheng Zhang , Jie Shen , Pranjal Awasthi

Properly learning monotone functions via local reconstruction

We give a $2^{\tilde{O}(\sqrt{n}/\epsilon)}$-time algorithm for properly learning monotone Boolean functions under the uniform distribution over $\{0,1\}^n$. Our algorithm is robust to adversarial label noise and has a running time nearly…

Data Structures and Algorithms · Computer Science 2023-03-29 Jane Lange , Ronitt Rubinfeld , Arsen Vasilyan

Sample-Optimal Density Estimation in Nearly-Linear Time

We design a new, fast algorithm for agnostically learning univariate probability distributions whose densities are well approximated by piecewise polynomial functions. Let $f$ be the density function of an arbitrary univariate distribution,…

Data Structures and Algorithms · Computer Science 2015-06-03 Jayadev Acharya , Ilias Diakonikolas , Jerry Li , Ludwig Schmidt

A Near-optimal SQ Lower Bound for Smoothed Agnostic Learning of Boolean Halfspaces

We study the complexity of smoothed agnostic learning of halfspaces on $\{\pm 1\}^n$ under uniform marginals in the model of~\cite{KM25}, where each input coordinate is independently flipped with probability $\sigma \in (0, {1}/{2})$. We…

Machine Learning · Computer Science 2026-05-14 Tim Sinen

Robust Sparse Mean Estimation via Sum of Squares

We study the problem of high-dimensional sparse mean estimation in the presence of an $\epsilon$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance…

Data Structures and Algorithms · Computer Science 2024-07-08 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise

We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces with Random Classification Noise under the Gaussian distribution. We establish nearly-matching algorithmic and Statistical Query (SQ) lower bound…

Machine Learning · Computer Science 2023-07-18 Ilias Diakonikolas , Jelena Diakonikolas , Daniel M. Kane , Puqian Wang , Nikos Zarifis

Probabilistic Polynomials and Hamming Nearest Neighbors

We show how to compute any symmetric Boolean function on $n$ variables over any field (as well as the integers) with a probabilistic polynomial of degree $O(\sqrt{n \log(1/\epsilon)})$ and error at most $\epsilon$. The degree dependence on…

Data Structures and Algorithms · Computer Science 2016-11-18 Josh Alman , Ryan Williams

Nearly optimal algorithms to learn sparse quantum Hamiltonians in physically motivated distances

We study the problem of learning Hamiltonians $H$ that are $s$-sparse in the Pauli basis, given access to their time evolution. Although Hamiltonian learning has been extensively investigated, two issues recur in much of the existing…

Quantum Physics · Physics 2025-09-15 Amira Abbas , Nunzia Cerrato , Francisco Escudero Gutiérrez , Dmitry Grinko , Francesco Anna Mele , Pulkit Sinha

Faster Algorithms for High-Dimensional Robust Covariance Estimation

We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem…

Machine Learning · Computer Science 2019-06-12 Yu Cheng , Ilias Diakonikolas , Rong Ge , David Woodruff

Efficient Algorithms for Outlier-Robust Regression

We give the first polynomial-time algorithm for performing linear or polynomial regression resilient to adversarial corruptions in both examples and labels. Given a sufficiently large (polynomial-size) training set drawn i.i.d. from…

Machine Learning · Computer Science 2020-06-05 Adam Klivans , Pravesh K. Kothari , Raghu Meka

Toward Attribute Efficient Learning Algorithms

We make progress on two important problems regarding attribute efficient learnability. First, we give an algorithm for learning decision lists of length $k$ over $n$ variables using $2^{\tilde{O}(k^{1/3})} \log n$ examples and time…

Machine Learning · Computer Science 2007-05-23 Adam R. Klivans , Rocco A. Servedio

Algorithmic Polynomials

The approximate degree of a Boolean function $f(x_{1},x_{2},\ldots,x_{n})$ is the minimum degree of a real polynomial that approximates $f$ pointwise within $1/3$. Upper bounds on approximate degree have a variety of applications in…

Computational Complexity · Computer Science 2018-01-16 Alexander A. Sherstov

A Fully Polynomial-Time Algorithm for Robustly Learning Halfspaces over the Hypercube

We give the first fully polynomial-time algorithm for learning halfspaces with respect to the uniform distribution on the hypercube in the presence of contamination, where an adversary may corrupt some fraction of examples and labels…

Data Structures and Algorithms · Computer Science 2025-11-11 Gautam Chandrasekaran , Adam R. Klivans , Konstantinos Stavropoulos , Arsen Vasilyan

Efficiently Testing Sparse GF(2) Polynomials

We give the first algorithm that is both query-efficient and time-efficient for testing whether an unknown function $f: \{0,1\}^n \to \{0,1\}$ is an $s$-sparse GF(2) polynomial versus $\eps$-far from every such polynomial. Our algorithm…

Computational Complexity · Computer Science 2008-05-14 Ilias Diakonikolas , Homin K. Lee , Kevin Matulef , Rocco A. Servedio , Andrew Wan

Robust Sparse Regression with Non-Isotropic Designs

We develop a technique to design efficiently computable estimators for sparse linear regression in the simultaneous presence of two adversaries: oblivious and adaptive. We design several robust algorithms that outperform the state of the…

Machine Learning · Computer Science 2024-11-01 Chih-Hung Liu , Gleb Novikov

Small Covers for Near-Zero Sets of Polynomials and Learning Latent Variable Models

Let $V$ be any vector space of multivariate degree-$d$ homogeneous polynomials with co-dimension at most $k$, and $S$ be the set of points where all polynomials in $V$ {\em nearly} vanish. We establish a qualitatively optimal upper bound on…

Machine Learning · Computer Science 2020-12-15 Ilias Diakonikolas , Daniel M. Kane

Near-Optimal Algorithms for Gaussians with Huber Contamination: Mean Estimation and Linear Regression

We study the fundamental problems of Gaussian mean estimation and linear regression with Gaussian covariates in the presence of Huber contamination. Our main contribution is the design of the first sample near-optimal and almost linear-time…

Data Structures and Algorithms · Computer Science 2023-12-05 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia , Thanasis Pittas