Related papers: Unshuffling Data for Improved Generalization

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a…

Machine Learning · Computer Science 2020-02-14 Vikas K. Garg, Adam Kalai, Katrina Ligett, Zhiwei Steven Wu

Exploiting non-i.i.d. data towards more robust machine learning algorithms

In the field of machine learning there is a growing interest towards more robust and generalizable algorithms. This is for example important to bridge the gap between the environment in which the training data was collected and the…

Machine Learning · Computer Science 2020-10-08 Wim Casteels, Peter Hellinckx

Out-of-Variable Generalization for Discriminative Models

The ability of an agent to do well in new environments is a critical aspect of intelligence. In machine learning, this ability is known as $\textit{strong}$ or $\textit{out-of-distribution}$ generalization. However, merely considering…

Machine Learning · Computer Science 2024-02-09 Siyuan Guo, Jonas Wildberger, Bernhard Schölkopf

Linear Regression with Distributed Learning: A Generalization Error Perspective

Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear…

Machine Learning · Statistics 2021-11-03 Martin Hellkvist, Ayça Özçelikkale, Anders Ahlén

Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Training models that perform well under distribution shifts is a central challenge in machine learning. In this paper, we introduce a modeling framework where, in addition to training data, we have partial structural knowledge of the…

Machine Learning · Computer Science 2021-10-28 Tobias Sutter, Andreas Krause, Daniel Kuhn

Out of Distribution Generalization in Machine Learning

Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In…

Machine Learning · Statistics 2021-03-05 Martin Arjovsky

Decorr: Environment Partitioning for Invariant Learning and OOD Generalization

Invariant learning methods, aimed at identifying a consistent predictor across multiple environments, are gaining prominence in out-of-distribution (OOD) generalization. Yet, when environments aren't inherent in the data, practitioners must…

Machine Learning · Computer Science 2024-05-24 Yufan Liao, Qi Wu, Zhaodi Wu, Xing Yan

Few-Shot Unsupervised Continual Learning through Meta-Examples

In real-world applications, data do not resemble those commonly used for neural network training, since they are usually few, unlabeled and can be available as a stream. Hence many existing deep learning solutions suffer from a limited…

Machine Learning · Computer Science 2020-11-18 Alessia Bertugli, Stefano Vincenzi, Simone Calderara, Andrea Passerini

Competitive Training of Mixtures of Independent Deep Generative Models

A common assumption in causal modeling posits that the data is generated by a set of independent mechanisms, and algorithms should aim to recover this structure. Standard unsupervised learning, however, is often concerned with training a…

Machine Learning · Computer Science 2019-03-05 Francesco Locatello, Damien Vincent, Ilya Tolstikhin, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf

Data Dropout: Optimizing Training Data for Convolutional Neural Networks

Deep learning models learn to fit training data while they are highly expected to generalize well to testing data. Most works aim at finding such models by creatively designing architectures and fine-tuning parameters. To adapt to…

Computer Vision and Pattern Recognition · Computer Science 2018-09-10 Tianyang Wang, Jun Huan, Bo Li

Invariant Meta Learning for Out-of-Distribution Generalization

Modern deep learning techniques have illustrated their excellent capabilities in many areas, but rely on large training data. Optimization-based meta-learning trains a model on a variety of tasks, such that it can solve new learning tasks…

Machine Learning · Computer Science 2023-02-23 Penghao Jiang, Ke Xin, Zifeng Wang, Chunxi Li

Uncertainty Modeling for Out-of-Distribution Generalization

Though remarkable progress has been achieved in various vision tasks, deep neural networks still suffer obvious performance degradation when tested in out-of-distribution scenarios. We argue that the feature statistics (mean and standard…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Xiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Ling-Yu Duan

Efficient Generalization via Multimodal Co-Training under Data Scarcity and Distribution Shift

This paper explores a multimodal co-training framework designed to enhance model generalization in situations where labeled data is limited and distribution shifts occur. We thoroughly examine the theoretical foundations of this framework,…

Machine Learning · Computer Science 2025-10-10 Tianyu Bell Pan, Damon L. Woodard

Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

Learning models that generalize under different distribution shifts in medical imaging has been a long-standing research challenge. There have been several proposals for efficient and robust visual representation learning among vision…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

Invariance Pair-Guided Learning: Enhancing Robustness in Neural Networks

Out-of-distribution generalization of machine learning models remains challenging since the models are inherently bound to the training data distribution. This manifests especially when the learned models rely on spurious correlations.…

Machine Learning · Computer Science 2025-02-27 Martin Surner, Abdelmajid Khelil, Ludwig Bothmann

Unrolling Particles: Unsupervised Learning of Sampling Distributions

Particle filtering is used to compute good nonlinear estimates of complex systems. It samples trajectories from a chosen distribution and computes the estimate as a weighted average. Easy-to-sample distributions often lead to degenerate…

Machine Learning · Computer Science 2021-10-07 Fernando Gama, Nicolas Zilberstein, Richard G. Baraniuk, Santiago Segarra

Assaying Out-Of-Distribution Generalization in Transfer Learning

Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs…

Machine Learning · Computer Science 2022-10-24 Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

Out-of-Distribution Generalization in Kernel Regression

In real-world applications, the data-generating process for training a machine learning model often differs from what the model encounters in the test stage. Understanding how and whether machine learning models generalize under such…

Machine Learning · Statistics 2022-02-08 Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Mixup is a highly successful technique to improve generalization of neural networks by augmenting the training data with combinations of random pairs. Selective mixup is a family of methods that apply mixup to specific pairs, e.g. only…

Machine Learning · Computer Science 2023-06-06 Damien Teney, Jindong Wang, Ehsan Abbasnejad

Convergence Analysis of Distributed Stochastic Gradient Descent with Shuffling

When using stochastic gradient descent to solve large-scale machine learning problems, a common practice of data processing is to shuffle the training data, partition the data across multiple machines if needed, and then perform several…

Machine Learning · Statistics 2017-10-02 Qi Meng, Wei Chen, Yue Wang, Zhi-Ming Ma, Tie-Yan Liu