English
Related papers

Related papers: Outlier-Robust High-Dimensional Sparse Estimation …

200 papers

We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse…

Machine Learning · Computer Science 2019-05-31 Liu Liu , Yanyao Shen , Tianyang Li , Constantine Caramanis

We explore the connection between outlier-robust high-dimensional statistics and non-convex optimization in the presence of sparsity constraints, with a focus on the fundamental tasks of robust sparse mean estimation and robust sparse PCA.…

Machine Learning · Computer Science 2022-11-15 Yu Cheng , Ilias Diakonikolas , Rong Ge , Shivam Gupta , Daniel M. Kane , Mahdi Soltanolkotabi

In this paper we initiate the study of whether or not sparse estimation tasks can be performed efficiently in high dimensions, in the robust setting where an $\eps$-fraction of samples are corrupted adversarially. We study the natural…

Machine Learning · Computer Science 2017-03-02 Jerry Li

We study Gaussian sparse estimation tasks in Huber's contamination model with a focus on mean estimation, PCA, and linear regression. For each of these tasks, we give the first sample and computationally efficient robust estimators with…

Machine Learning · Computer Science 2024-03-18 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

The problem of robust mean estimation in high dimensions is studied, in which a certain fraction (less than half) of the datapoints can be arbitrarily corrupted. Motivated by compressive sensing, the robust mean estimation problem is…

Applications · Statistics 2022-12-08 Aditya Deshmukh , Jing Liu , Venugopal V. Veeravalli

We consider high dimensional sparse regression, and develop strategies able to deal with arbitrary -- possibly, severe or coordinated -- errors in the covariance matrix $X$. These may come from corrupted data, persistent experimental…

Machine Learning · Statistics 2013-01-15 Yudong Chen , Constantine Caramanis , Shie Mannor

We study the algorithmic problem of sparse mean estimation in the presence of adversarial outliers. Specifically, the algorithm observes a \emph{corrupted} set of samples from $\mathcal{N}(\mu,\mathbf{I}_d)$, where the unknown mean $\mu \in…

Data Structures and Algorithms · Computer Science 2024-03-08 Ankit Pensia

We consider the problem of outlier robust PCA (OR-PCA) where the goal is to recover principal directions despite the presence of outlier data points. That is, given a data matrix $M^*$, where $(1-\alpha)$ fraction of the points are noisy…

Machine Learning · Computer Science 2017-02-21 Yeshwanth Cherapanamjeri , Prateek Jain , Praneeth Netrapalli

Sparse estimation methods capable of tolerating outliers have been broadly investigated in the last decade. We contribute to this research considering high-dimensional regression problems contaminated by multiple mean-shift outliers which…

Methodology · Statistics 2025-10-21 Luca Insolia , Ana Kenney , Francesca Chiaromonte , Giovanni Felici

Principal component analysis (PCA) is widely used for dimensionality reduction, with well-documented merits in various applications involving high-dimensional data, including computer vision, preference measurement, and bioinformatics. In…

Machine Learning · Statistics 2013-10-01 Gonzalo Mateos , Georgios B. Giannakis

We study the fundamental task of outlier-robust mean estimation for heavy-tailed distributions in the presence of sparsity. Specifically, given a small number of corrupted samples from a high-dimensional heavy-tailed distribution whose mean…

Data Structures and Algorithms · Computer Science 2022-11-30 Ilias Diakonikolas , Daniel M. Kane , Jasper C. H. Lee , Ankit Pensia

We study the problem of high-dimensional sparse mean estimation in the presence of an $\epsilon$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance…

Data Structures and Algorithms · Computer Science 2024-07-08 Ilias Diakonikolas , Daniel M. Kane , Sushrut Karmalkar , Ankit Pensia , Thanasis Pittas

Sparse and outlier-robust Principal Component Analysis (PCA) has been a very active field of research recently. Yet, most existing methods apply PCA to a single dataset whereas multi-source data-i.e. multiple related datasets requiring…

Methodology · Statistics 2026-02-26 Patricia Puchhammer , Ines Wilms , Peter Filzmoser

Many conventional statistical procedures are extremely sensitive to seemingly minor deviations from modeling assumptions. This problem is exacerbated in modern high-dimensional settings, where the problem dimension can grow with and…

Machine Learning · Statistics 2017-02-27 Simon S. Du , Sivaraman Balakrishnan , Aarti Singh

Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all known efficient unsupervised learning algorithms were very sensitive to outliers in high dimensions. In particular, even for the task of robust…

Data Structures and Algorithms · Computer Science 2019-11-15 Ilias Diakonikolas , Daniel M. Kane

We develop machinery to design efficiently computable and consistent estimators, achieving estimation error approaching zero as the number of observations grows, when facing an oblivious adversary that may corrupt responses in all but an…

Machine Learning · Computer Science 2021-11-05 Tommaso d'Orsi , Chih-Hung Liu , Rajai Nasser , Gleb Novikov , David Steurer , Stefan Tiegel

We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with…

Machine Learning · Computer Science 2018-11-26 Yu Cheng , Ilias Diakonikolas , Rong Ge

Singular Value Decomposition (and Principal Component Analysis) is one of the most widely used techniques for dimensionality reduction: successful and efficiently computable, it is nevertheless plagued by a well-known, well-documented…

Machine Learning · Computer Science 2011-01-04 Huan Xu , Constantine Caramanis , Sujay Sanghavi

Fully robust versions of the elastic net estimator are introduced for linear and logistic regression. The algorithms to compute the estimators are based on the idea of repeatedly applying the non-robust classical estimators to data subsets…

Methodology · Statistics 2017-03-16 Fatma Sevinc Kurnaz , Irene Hoffmann , Peter Filzmoser

Robust statistical estimators offer resilience against outliers but are often computationally challenging, particularly in high-dimensional sparse settings. Modern optimization techniques are utilized for robust sparse association…

Computation · Statistics 2025-02-03 Pia Pfeiffer , Andreas Alfons , Peter Filzmoser
‹ Prev 1 2 3 10 Next ›