Related papers: Regularizing Meta-Learning via Gradient Dropout

Dropout Reduces Underfitting

Introduced by Hinton et al. in 2012, dropout has stood the test of time as a regularizer for preventing overfitting in neural networks. In this study, we demonstrate that dropout can also mitigate underfitting when used at the start of…

Machine Learning · Computer Science 2023-06-01 Zhuang Liu, Zhiqiu Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell

Improving Generalization in Meta-Learning via Meta-Gradient Augmentation

Meta-learning methods typically follow a two-loop framework, where each loop potentially suffers from notorious overfitting, hindering rapid adaptation and generalization to new tasks. Existing schemes solve it by enhancing the…

Machine Learning · Computer Science 2023-06-16 Ren Wang, Haoliang Sun, Qi Wei, Xiushan Nie, Yuling Ma, Yilong Yin

Meta-Regularization: An Approach to Adaptive Choice of the Learning Rate in Gradient Descent

We propose Meta-Regularization, a novel approach for the adaptive choice of the learning rate in first-order gradient descent methods. Our approach modifies the objective function by adding a regularization term on the learning…

Machine Learning · Computer Science 2021-04-13 Guangzeng Xie, Hao Jin, Dachao Lin, Zhihua Zhang

Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm

Learning to learn is a powerful paradigm for enabling models to learn from data more effectively and efficiently. A popular approach to meta-learning is to train a recurrent model to read in a training dataset as input and output the…

Machine Learning · Computer Science 2018-02-16 Chelsea Finn, Sergey Levine

Reducing Variance in Meta-Learning via Laplace Approximation for Regression Tasks

Given a finite set of sample points, meta-learning algorithms aim to learn an optimal adaptation strategy for new, unseen tasks. Often, this data can be ambiguous as it might belong to different tasks concurrently. This is particularly the…

Machine Learning · Computer Science 2024-10-24 Alfredo Reichlin, Gustaf Tegnér, Miguel Vasco, Hang Yin, Mårten Björkman, Danica Kragic

Domain Generalization Guided by Gradient Signal to Noise Ratio of Parameters

Overfitting to the source domain is a common issue in gradient-based training of deep neural networks. To compensate for the over-parameterized models, numerous regularization techniques have been introduced such as those based on dropout.…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Mateusz Michalkiewicz, Masoud Faraki, Xiang Yu, Manmohan Chandraker, Mahsa Baktashmotlagh

Understanding Benign Overfitting in Gradient-Based Meta Learning

Meta learning has demonstrated tremendous success in few-shot learning with limited supervised data. In those settings, the meta model is usually overparameterized. While the conventional statistical learning theory suggests that…

Machine Learning · Computer Science 2022-11-10 Lisha Chen, Songtao Lu, Tianyi Chen

Meta-learning the Learning Trends Shared Across Tasks

Meta-learning stands for 'learning to learn' such that generalization to new tasks is achieved. Among these methods, gradient-based meta-learning algorithms are a specific sub-class that excel at quick adaptation to new tasks with limited…

Machine Learning · Computer Science 2020-10-20 Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

Towards Understanding Generalization in Gradient-Based Meta-Learning

In this work we study generalization of neural networks in gradient-based meta-learning by analyzing various properties of the objective landscapes. We experimentally demonstrate that as meta-training progresses, the meta-test solutions,…

Machine Learning · Computer Science 2019-07-18 Simon Guiroy, Vikas Verma, Christopher Pal

Meta-Learning with Warped Gradient Descent

Learning an efficient update rule from data that promotes rapid learning of new tasks from the same distribution remains an open problem in meta-learning. Typically, previous works have approached this issue either by attempting to train a…

Machine Learning · Computer Science 2020-02-19 Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell

Accelerating Meta-Learning by Sharing Gradients

The success of gradient-based meta-learning is primarily attributed to its ability to leverage related tasks to learn task-invariant information. However, the absence of interactions between different tasks in the inner loop leads to…

Machine Learning · Computer Science 2023-12-15 Oscar Chang, Hod Lipson

SoftTarget Regularization: An Effective Technique to Reduce Over-Fitting in Neural Networks

Deep neural networks are learning models with a very high capacity and therefore prone to over-fitting. Many regularization techniques such as Dropout, DropConnect, and weight decay all attempt to solve the problem of over-fitting by…

Machine Learning · Computer Science 2016-12-06 Armen Aghajanyan

Gradient Agreement as an Optimization Objective for Meta-Learning

This paper presents a novel optimization method for maximizing generalization over tasks in meta-learning. The goal of meta-learning is to learn a model for an agent adapting rapidly when presented with previously unseen tasks. Tasks are…

Machine Learning · Computer Science 2018-10-19 Amir Erfan Eshratifar, David Eigen, Massoud Pedram

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network…

Neural and Evolutionary Computing · Computer Science 2017-08-04 Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino

Meta Dropout: Learning to Perturb Features for Generalization

A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we know how to optimally perturb training examples to account for test examples, we may achieve better generalization performance.…

Machine Learning · Computer Science 2022-02-15 Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

PLACE dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization

Domain generalization (DG) aims to learn a generic model from multiple observed source domains that generalizes well to arbitrary unseen target domains without further training. The major challenge in DG is that the model inevitably faces a…

Machine Learning · Computer Science 2023-09-19 Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

Meta-Learning with Network Pruning

Meta-learning is a powerful paradigm for few-shot learning. Although with remarkable success witnessed in many applications, the existing optimization based meta-learning models with over-parameterized neural networks have been evidenced to…

Machine Learning · Computer Science 2020-07-23 Hongduan Tian, Bo Liu, Xiao-Tong Yuan, Qingshan Liu

A Survey on Dropout Methods and Experimental Verification in Recommendation

Overfitting is a common problem in machine learning, which means the model too closely fits the training data while performing poorly in the test data. Among various methods of coping with overfitting, dropout is one of the representative…

Machine Learning · Computer Science 2022-05-17 Yangkun Li, Weizhi Ma, Chong Chen, Min Zhang, Yiqun Liu, Shaoping Ma, Yuekui Yang

Analytic theory of dropout regularization

Dropout is a regularization technique widely used in training artificial neural networks to mitigate overfitting. It consists of dynamically deactivating subsets of the network during training to promote more robust representations. Despite…

Machine Learning · Statistics 2025-09-10 Francesco Mori, Francesca Mignacco

MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning

Equipping a deep model with the ability of few-shot learning, i.e., learning quickly from only a few examples, is a core challenge for artificial intelligence. Gradient-based meta-learning approaches effectively address the challenge by learning…

Machine Learning · Computer Science 2024-01-09 Baoquan Zhang, Chuyao Luo, Demin Yu, Huiwei Lin, Xutao Li, Yunming Ye, Bowen Zhang