English
Related papers

Related papers: Robust Methods for High-Dimensional Linear Learnin…

200 papers

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu , Bo Li , Yevgeniy Vorobeychik , Alina Oprea

This paper considers the problem of supervised learning with linear methods when both features and labels can be corrupted, either in the form of heavy tailed data and/or corrupted rows. We introduce a combination of coordinate gradient…

Machine Learning · Statistics 2022-02-01 Stéphane Gaïffas , Ibrahim Merad

We propose a residual randomization procedure designed for robust Lasso-based inference in the high-dimensional setting. Compared to earlier work that focuses on sub-Gaussian errors, the proposed procedure is designed to work robustly in…

Methodology · Statistics 2021-08-20 Y. Samuel Wang , Si Kai Lee , Panos Toulis , Mladen Kolar

We propose a distributionally robust approach to learning hyperparameters for first-order methods in convex optimization. Given a dataset of problem instances, we minimize a Wasserstein distributionally robust version of the performance…

Machine Learning · Computer Science 2026-05-08 Vinit Ranjan , Jisun Park , Bartolomeo Stellato

High-dimensional linear regression under heavy-tailed noise or outlier corruption is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs,…

Statistics Theory · Mathematics 2023-05-11 Yinan Shen , Jingyang Li , Jian-Feng Cai , Dong Xia

This paper introduces a two-stage framework designed to enhance long-tail class incremental learning, enabling the model to progressively learn new classes, while mitigating catastrophic forgetting in the context of long-tailed data…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Jayateja Kalla , Soma Biswas

Real-world data is laden with outlying values. The challenge for machine learning is that the learner typically has no prior knowledge of whether the feedback it receives (losses, gradients, etc.) will be heavy-tailed or not. In this work,…

Machine Learning · Statistics 2020-12-16 Matthew J. Holland

We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with…

Machine Learning · Computer Science 2018-11-26 Yu Cheng , Ilias Diakonikolas , Rong Ge

Robust loss minimization is an important strategy for handling robust learning issue on noisy labels. Current robust loss functions, however, inevitably involve hyperparameter(s) to be tuned, manually or heuristically through cross…

Machine Learning · Computer Science 2020-02-18 Jun Shu , Qian Zhao , Keyu Chen , Zongben Xu , Deyu Meng

We present an adaptive approach for robust learning from corrupted training sets. We identify corrupted and non-corrupted samples with latent Bernoulli variables and thus formulate the learning problem as maximization of the likelihood…

Machine Learning · Statistics 2024-06-17 Aleksandr Karakulev , Dave Zachariah , Prashant Singh

In many applications, data is collected in batches, some of which are corrupt or even adversarial. Recent work derived optimal robust algorithms for estimating discrete distributions in this setting. We consider a general framework of…

Machine Learning · Statistics 2020-02-26 Ayush Jain , Alon Orlitsky

We adapt a manifold sampling algorithm for the nonsmooth, nonconvex formulations of learning that arise when imposing robustness to outliers present in the training data. We demonstrate the approach on objectives based on trimmed loss.…

Optimization and Control · Mathematics 2018-07-10 Matt Menickelly , Stefan M. Wild

Robust learning methods aim to learn a clean target distribution from noisy and corrupted training data where a specific corruption pattern is often assumed a priori. Our proposed method can not only successfully learn the clean target…

Machine Learning · Computer Science 2023-02-08 Jeongeun Park , Seungyoun Shin , Sangheum Hwang , Sungjoon Choi

In this work, we give a ${\rm poly}(d,k)$ time and sample algorithm for efficiently learning the parameters of a mixture of $k$ spherical distributions in $d$ dimensions. Unlike all previous methods, our techniques apply to heavy-tailed…

Data Structures and Algorithms · Computer Science 2026-05-22 Alkis Kalavasis , Pravesh K. Kothari , Shuchen Li , Manolis Zampetakis

We propose a framework for distributed robust statistical learning on {\em big contaminated data}. The Distributed Robust Learning (DRL) framework can reduce the computational time of traditional robust learning methods by several orders of…

Machine Learning · Statistics 2015-02-10 Jiashi Feng , Huan Xu , Shie Mannor

In this work, we consider learning sparse models in large scale settings, where the number of samples and the feature dimension can grow as large as millions or billions. Two immediate issues occur under such challenging scenario: (i)…

Machine Learning · Statistics 2023-01-31 Atul Dhingra , Jie Shen , Nicholas Kleene

We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse…

Machine Learning · Computer Science 2019-05-31 Liu Liu , Yanyao Shen , Tianyang Li , Constantine Caramanis

With the tremendous success of deep learning in visual tasks, the representations extracted from intermediate layers of learned models, that is, deep features, attract much attention of researchers. Previous empirical analysis shows that…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Qi Qian , Juhua Hu , Hao Li

Real-world data is often unbalanced and long-tailed, but deep models struggle to recognize rare classes in the presence of frequent classes. To address unbalanced data, most studies try balancing the data, the loss, or the classifier to…

Machine Learning · Computer Science 2021-11-02 Dvir Samuel , Gal Chechik

We study scalable alternatives to robust gradient descent (RGD) techniques that can be used when the losses and/or gradients can be heavy-tailed, though this will be unknown to the learner. The core technique is simple: instead of trying to…

Machine Learning · Statistics 2020-12-15 Matthew J. Holland
‹ Prev 1 2 3 10 Next ›