English
Related papers

Related papers: Efficient Algorithms for Outlier-Robust Regression

200 papers

We consider the task of privately obtaining prediction error guarantees in ordinary least-squares regression problems with Gaussian covariates (with unknown covariance structure). We provide the first sample-optimal polynomial time…

Data Structures and Algorithms · Computer Science 2025-04-01 Prashanti Anderson , Ainesh Bakshi , Mahbod Majid , Stefan Tiegel

We study the problem of high-dimensional sparse mean estimation in the presence of an $\epsilon$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance…

Data Structures and Algorithms · Computer Science 2024-07-08 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

We study a new linear up to quadratic time algorithm for linear regression in the absence of strong assumptions on the underlying distributions of samples, and in the presence of outliers. The goal is to design a procedure which comes with…

Machine Learning · Statistics 2020-07-14 Jules Depersin

We give the first polynomial-time algorithm for robust regression in the list-decodable setting where an adversary can corrupt a greater than $1/2$ fraction of examples. For any $\alpha < 1$, our algorithm takes as input a sample…

Data Structures and Algorithms · Computer Science 2019-05-31 Sushrut Karmalkar , Adam R. Klivans , Pravesh K. Kothari

We develop a technique to design efficiently computable estimators for sparse linear regression in the simultaneous presence of two adversaries: oblivious and adaptive. We design several robust algorithms that outperform the state of the…

Machine Learning · Computer Science 2024-11-01 Chih-Hung Liu , Gleb Novikov

We study the problem of high-dimensional linear regression in a robust model where an $\epsilon$-fraction of the samples can be adversarially corrupted. We focus on the fundamental setting where the covariates of the uncorrupted samples are…

Machine Learning · Computer Science 2018-06-04 Ilias Diakonikolas , Weihao Kong , Alistair Stewart

We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with…

Machine Learning · Computer Science 2018-11-26 Yu Cheng , Ilias Diakonikolas , Rong Ge

We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem…

Machine Learning · Computer Science 2019-06-12 Yu Cheng , Ilias Diakonikolas , Rong Ge , David Woodruff

We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse…

Machine Learning · Computer Science 2019-05-31 Liu Liu , Yanyao Shen , Tianyang Li , Constantine Caramanis

Subsampling methods have been recently proposed to speed up least squares estimation in large scale settings. However, these algorithms are typically not robust to outliers or corruptions in the observed covariates. The concept of influence…

Machine Learning · Statistics 2014-06-20 Brian McWilliams , Gabriel Krummenacher , Mario Lucic , Joachim M. Buhmann

In today's era of big data, robust least-squares regression becomes a more challenging problem when considering the adversarial corruption along with explosive growth of datasets. Traditional robust methods can handle the noise but suffer…

Data Structures and Algorithms · Computer Science 2017-10-04 Xuchao Zhang , Liang Zhao , Arnold P. Boedihardjo , Chang-Tien Lu

We obtain robust and computationally efficient estimators for learning several linear models that achieve statistically optimal convergence rate under minimal distributional assumptions. Concretely, we assume our data is drawn from a…

Machine Learning · Statistics 2020-12-07 Ainesh Bakshi , Adarsh Prasad

High-dimensional linear regression under heavy-tailed noise or outlier corruption is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs,…

Statistics Theory · Mathematics 2023-05-11 Yinan Shen , Jingyang Li , Jian-Feng Cai , Dong Xia

We study high-dimensional least-squares regression within a subgaussian statistical learning framework with heterogeneous noise. It includes $s$-sparse and $r$-low-rank least-squares regression when a fraction $\epsilon$ of the labels are…

Statistics Theory · Mathematics 2023-11-01 Philip Thompson

We study the fundamental problems of Gaussian mean estimation and linear regression with Gaussian covariates in the presence of Huber contamination. Our main contribution is the design of the first sample near-optimal and almost linear-time…

Data Structures and Algorithms · Computer Science 2023-12-05 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia , Thanasis Pittas

We develop efficient algorithms for estimating low-degree moments of unknown distributions in the presence of adversarial outliers. The guarantees of our algorithms improve in many cases significantly over the best previous ones, obtained…

Data Structures and Algorithms · Computer Science 2017-12-27 Pravesh K. Kothari , David Steurer

We revisit the problem of robust linear regression under Gaussian covariates with an unknown covariance matrix of condition number $\kappa$. For this fundamental problem, significant gaps remain in our understanding of the trade-offs among…

Data Structures and Algorithms · Computer Science 2026-05-19 Deeksha Adil , Jarosław Błasiok , Hongjie Chen , Deepak Narayanan Sridharan

We use the Sum of Squares method to develop new efficient algorithms for learning well-separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that substantially improve upon the statistical guarantees achieved…

Data Structures and Algorithms · Computer Science 2017-11-21 Samuel B. Hopkins , Jerry Li

We give the first polynomial-time algorithm to estimate the mean of a $d$-variate probability distribution with bounded covariance from $\tilde{O}(d)$ independent samples subject to pure differential privacy. Prior algorithms for this…

Data Structures and Algorithms · Computer Science 2022-06-06 Samuel B. Hopkins , Gautam Kamath , Mahbod Majid

Learning from data in the presence of outliers is a fundamental problem in statistics. Until recently, no computationally efficient algorithms were known to compute the mean of a high dimensional distribution under natural assumptions in…

Data Structures and Algorithms · Computer Science 2021-01-22 Yeshwanth Cherapanamjeri , Sidhanth Mohanty , Morris Yau
‹ Prev 1 2 3 10 Next ›