Related papers: Robust supervised learning with coordinate gradien…

200 papers

We consider a general statistical learning problem where an unknown fraction of the training data is corrupted. We develop a robust learning method that only requires specifying an upper bound on the corrupted data fraction. The method…

Machine Learning · Statistics 2020-02-10 Muhammad Osama, Dave Zachariah, Peter Stoica

We study the problem of semi-supervised learning on graphs in the regime where data labels are scarce or possibly corrupted. We propose an approach called $p$-conductance learning that generalizes the $p$-Laplace and Poisson learning…

Machine Learning · Computer Science 2025-02-14 Sawyer Jack Robertson, Chester Holtz, Zhengchao Wan, Gal Mishne, Alexander Cloninger

We propose statistically robust and computationally efficient linear learning methods in the high-dimensional batch setting, where the number of features $d$ may exceed the sample size $n$. We employ, in a generic learning setting, two…

Machine Learning · Statistics 2023-05-30 Ibrahim Merad, Stéphane Gaïffas

Graph Semi-Supervised learning is an important data analysis tool, where given a graph and a set of labeled nodes, the aim is to infer the labels to the remaining unlabeled nodes. In this paper, we start by considering an optimization-based…

Machine Learning · Computer Science 2023-09-26 Sara Venturini, Andrea Cristofari, Francesco Rinaldi, Francesco Tudisco

Training deep neural models in the presence of corrupted supervision is challenging as the corrupted data points may significantly impact the generalization performance. To alleviate this problem, we present an efficient robust algorithm…

Machine Learning · Computer Science 2021-02-16 Boyang Liu, Mengying Sun, Ding Wang, Pang-Ning Tan, Jiayu Zhou

Designing learning algorithms that are resistant to perturbations of the underlying data distribution is a problem of wide practical and theoretical importance. We present a general approach to this problem focusing on unsupervised…

Machine Learning · Computer Science 2021-02-22 Andreas Maurer, Daniela A. Parletta, Andrea Paudice, Massimiliano Pontil

Real-world data is laden with outlying values. The challenge for machine learning is that the learner typically has no prior knowledge of whether the feedback it receives (losses, gradients, etc.) will be heavy-tailed or not. In this work,…

Machine Learning · Statistics 2020-12-16 Matthew J. Holland

Distributed gradient descent algorithms have come to the fore in modern machine learning, especially in parallelizing the handling of large datasets that are distributed across several workers. However, scant attention has been paid to…

Signal Processing · Electrical Eng. & Systems 2025-02-06 Shuche Wang, Vincent Y. F. Tan

Self-supervised contrastive learning is a powerful tool to learn visual representation without labels. Prior work has primarily focused on evaluating the recognition accuracy of various pre-training algorithms, but has overlooked other…

Computer Vision and Pattern Recognition · Computer Science 2022-06-13 Yuanyi Zhong, Haoran Tang, Junkun Chen, Jian Peng, Yu-Xiong Wang

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu, Bo Li, Yevgeniy Vorobeychik, Alina Oprea

Semi-supervised learning (SSL) is the branch of machine learning that aims to improve learning performance by leveraging unlabeled data when labels are insufficient. Recently, SSL with deep models has proven to be successful on standard…

Machine Learning · Computer Science 2022-11-15 Lan-Zhe Guo, Zhi Zhou, Yu-Feng Li

Numerous modern optimization and machine learning algorithms rely on subgradient information being trustworthy and hence, they may fail to converge when such information is corrupted. In this paper, we consider the setting where subgradient…

Optimization and Control · Mathematics 2021-03-23 Berkay Turan, Cesar A. Uribe, Hoi-To Wai, Mahnoosh Alizadeh

To improve the off-sample generalization of classical procedures minimizing the empirical risk under potentially heavy-tailed data, new robust learning algorithms have been proposed in recent years, with generalized median-of-means…

Machine Learning · Statistics 2018-10-16 Matthew J. Holland

Robust learning from noisy demonstrations is a practical but highly challenging problem in imitation learning. In this paper, we first theoretically show that robust imitation learning can be achieved by optimizing a classification risk…

Machine Learning · Statistics 2021-02-22 Voot Tangkaratt, Nontawat Charoenphakdee, Masashi Sugiyama

Minimizing the empirical risk is a popular training strategy, but for learning tasks where the data may be noisy or heavy-tailed, one may require many observations in order to generalize well. To achieve better performance under less…

Machine Learning · Statistics 2018-10-16 Matthew J. Holland, Kazushi Ikeda

We study fast algorithms for statistical regression problems under the strong contamination model, where the goal is to approximately optimize a generalized linear model (GLM) given adversarially corrupted samples. Prior works in this line…

Data Structures and Algorithms · Computer Science 2021-06-23 Arun Jambulapati, Jerry Li, Tselil Schramm, Kevin Tian

The amount of manually labeled data is limited in medical applications, so semi-supervised learning and automatic labeling strategies can be an asset for training deep neural networks. However, the quality of the automatically generated…

Machine Learning · Computer Science 2022-03-04 Wenhui Cui, Haleh Akrami, Anand A. Joshi, Richard M. Leahy

Labeling is often expensive and is a fundamental limitation of supervised learning. In this paper, we study the importance labeling problem, in which we are given many unlabeled data and select a limited number of data to be labeled from…

Machine Learning · Computer Science 2021-04-13 Tomoya Murata, Taiji Suzuki

We present an adaptive approach for robust learning from corrupted training sets. We identify corrupted and non-corrupted samples with latent Bernoulli variables and thus formulate the learning problem as maximization of the likelihood…

Machine Learning · Statistics 2024-06-17 Aleksandr Karakulev, Dave Zachariah, Prashant Singh

Modern neural networks are typically trained in an over-parameterized regime where the parameters of the model far exceed the size of the training data. Such neural networks in principle have the capacity to (over)fit any set of labels…

Machine Learning · Computer Science 2019-07-05 Mingchen Li, Mahdi Soltanolkotabi, Samet Oymak