Related papers: A Sub-Quadratic Time Algorithm for Robust Sparse M…

Robust Sparse Mean Estimation via Sum of Squares

We study the problem of high-dimensional sparse mean estimation in the presence of an $\epsilon$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance…

Data Structures and Algorithms · Computer Science 2024-07-08 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination

We study Gaussian sparse estimation tasks in Huber's contamination model with a focus on mean estimation, PCA, and linear regression. For each of these tasks, we give the first sample and computationally efficient robust estimators with…

Machine Learning · Computer Science 2024-03-18 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

List-Decodable Sparse Mean Estimation

Robust mean estimation is one of the most important problems in statistics: given a set of samples in $\mathbb{R}^d$ where an $\alpha$ fraction are drawn from some distribution $D$ and the rest are adversarially corrupted, we aim to…

Machine Learning · Computer Science 2022-12-07 Shiwei Zeng , Jie Shen

Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering

We study high-dimensional sparse estimation tasks in a robust setting where a constant fraction of the dataset is adversarially corrupted. Specifically, we focus on the fundamental problems of robust sparse mean estimation and robust sparse…

Data Structures and Algorithms · Computer Science 2019-11-20 Ilias Diakonikolas , Sushrut Karmalkar , Daniel Kane , Eric Price , Alistair Stewart

Faster Algorithms for High-Dimensional Robust Covariance Estimation

We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem…

Machine Learning · Computer Science 2019-06-12 Yu Cheng , Ilias Diakonikolas , Rong Ge , David Woodruff

High-Dimensional Robust Mean Estimation in Nearly-Linear Time

We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with…

Machine Learning · Computer Science 2018-11-26 Yu Cheng , Ilias Diakonikolas , Rong Ge

Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions

We study the fundamental task of outlier-robust mean estimation for heavy-tailed distributions in the presence of sparsity. Specifically, given a small number of corrupted samples from a high-dimensional heavy-tailed distribution whose mean…

Data Structures and Algorithms · Computer Science 2022-11-30 Ilias Diakonikolas , Daniel M. Kane , Jasper C. H. Lee , Ankit Pensia

How Hard Is Robust Mean Estimation?

Robust mean estimation is the problem of estimating the mean $\mu \in \mathbb{R}^d$ of a $d$-dimensional distribution $D$ from a list of independent samples, an $\epsilon$-fraction of which have been arbitrarily corrupted by a malicious…

Computational Complexity · Computer Science 2019-06-05 Samuel B. Hopkins , Jerry Li

A spectral algorithm for robust regression with subgaussian rates

We study a new linear up to quadratic time algorithm for linear regression in the absence of strong assumptions on the underlying distributions of samples, and in the presence of outliers. The goal is to design a procedure which comes with…

Machine Learning · Statistics 2020-07-14 Jules Depersin

Sparse Mean Estimation in Adversarial Settings via Incremental Learning

In this paper, we study the problem of sparse mean estimation under adversarial corruptions, where the goal is to estimate the $k$-sparse mean of a heavy-tailed distribution from samples contaminated by adversarial noise. Existing methods…

Machine Learning · Computer Science 2025-08-26 Jianhao Ma , Rui Ray Chen , Yinghui He , Salar Fattahi , Wei Hu

Robust Sparse Regression with Non-Isotropic Designs

We develop a technique to design efficiently computable estimators for sparse linear regression in the simultaneous presence of two adversaries: oblivious and adaptive. We design several robust algorithms that outperform the state of the…

Machine Learning · Computer Science 2024-11-01 Chih-Hung Liu , Gleb Novikov

Robust Sparse Estimation Tasks in High Dimensions

In this paper we initiate the study of whether or not sparse estimation tasks can be performed efficiently in high dimensions, in the robust setting where an $\eps$-fraction of samples are corrupted adversarially. We study the natural…

Machine Learning · Computer Science 2017-03-02 Jerry Li

High Dimensional Robust Sparse Regression

We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse…

Machine Learning · Computer Science 2019-05-31 Liu Liu , Yanyao Shen , Tianyang Li , Constantine Caramanis

Sparse PCA: Algorithms, Adversarial Perturbations and Certificates

We study efficient algorithms for Sparse PCA in standard statistical models (spiked covariance in its Wishart form). Our goal is to achieve optimal recovery guarantees while being resilient to small perturbations. Despite a long history of…

Machine Learning · Computer Science 2020-11-13 Tommaso d'Orsi , Pravesh K. Kothari , Gleb Novikov , David Steurer

Efficient Sparse PCA via Block-Diagonalization

Sparse Principal Component Analysis (Sparse PCA) is a pivotal tool in data analysis and dimensionality reduction. However, Sparse PCA is a challenging problem in both theory and practice: it is known to be NP-hard and current exact methods…

Machine Learning · Computer Science 2025-03-06 Alberto Del Pia , Dekun Zhou , Yinglun Zhu

Outlier-robust Mean Estimation near the Breakdown Point via Sum-of-Squares

We revisit the problem of estimating the mean of a high-dimensional distribution in the presence of an $\varepsilon$-fraction of adversarial outliers. When $\varepsilon$ is at most some sufficiently small constant, previous works can…

Data Structures and Algorithms · Computer Science 2024-11-22 Hongjie Chen , Deepak Narayanan Sridharan , David Steurer

Computational-Statistical Gaps for Improper Learning in Sparse Linear Regression

We study computational-statistical gaps for improper learning in sparse linear regression. More specifically, given $n$ samples from a $k$-sparse linear model in dimension $d$, we ask what is the minimum sample complexity to efficiently (in…

Machine Learning · Computer Science 2024-06-26 Rares-Darius Buhai , Jingqiu Ding , Stefan Tiegel

Efficient Algorithms for Outlier-Robust Regression

We give the first polynomial-time algorithm for performing linear or polynomial regression resilient to adversarial corruptions in both examples and labels. Given a sufficiently large (polynomial-size) training set drawn i.i.d. from…

Machine Learning · Computer Science 2020-06-05 Adam Klivans , Pravesh K. Kothari , Raghu Meka

Polynomial-Time Sum-of-Squares Can Robustly Estimate Mean and Covariance of Gaussians Optimally

In this work, we revisit the problem of estimating the mean and covariance of an unknown $d$-dimensional Gaussian distribution in the presence of an $\varepsilon$-fraction of adversarial outliers. The pioneering work of [DKK+16] gave a…

Data Structures and Algorithms · Computer Science 2021-10-25 Pravesh K. Kothari , Peter Manohar , Brian Hu Zhang

Robust subgaussian estimation of a mean vector in nearly linear time

We construct an algorithm, running in time $\tilde{\mathcal O}(N d + uK d)$, which is robust to outliers and heavy-tailed data and which achieves the subgaussian rate from [Lugosi, Mendelson] \begin{equation}\label{eq:intro_subgaus_rate}…

Statistics Theory · Mathematics 2019-06-28 Jules Depersin , Guillaume Lecué