Related papers: Multi-step Estimation for Gradient-based Meta-lear…

Meta-Learning with Implicit Gradients

A core capability of intelligent systems is the ability to quickly learn new tasks by drawing on prior experience. Gradient (or optimization) based meta-learning has recently emerged as an effective approach for few-shot learning. In this…

Machine Learning · Computer Science 2019-09-11 Aravind Rajeswaran, Chelsea Finn, Sham Kakade, Sergey Levine

Large-Scale Meta-Learning with Continual Trajectory Shifting

Meta-learning of shared initialization parameters has been shown to be highly effective in solving few-shot learning tasks. However, extending the framework to many-shot scenarios, which may further enhance its practicality, has been relatively…

Machine Learning · Computer Science 2022-02-17 Jaewoong Shin, Hae Beom Lee, Boqing Gong, Sung Ju Hwang

MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning

Equipping a deep model with the ability of few-shot learning, i.e., learning quickly from only a few examples, is a core challenge for artificial intelligence. Gradient-based meta-learning approaches effectively address the challenge by learning…

Machine Learning · Computer Science 2024-01-09 Baoquan Zhang, Chuyao Luo, Demin Yu, Huiwei Lin, Xutao Li, Yunming Ye, Bowen Zhang

Gradient-Based Meta-Learning Using Uncertainty to Weigh Loss for Few-Shot Learning

Model-Agnostic Meta-Learning (MAML) is one of the most successful meta-learning techniques for few-shot learning. It uses gradient descent to learn commonalities between various tasks, enabling the model to learn the meta-initialization of…

Machine Learning · Computer Science 2022-08-18 Lin Ding, Peng Liu, Wenfeng Shen, Weijia Lu, Shengbo Chen

Memory Efficient Meta-Learning with Large Images

Meta-learning approaches to few-shot classification are computationally efficient at test time, requiring just a few optimization steps or a single forward pass to learn a new task, but they remain highly memory-intensive to train. This…

Machine Learning · Statistics 2023-12-27 John Bronskill, Daniela Massiceti, Massimiliano Patacchiola, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace

Gradient-based meta-learning methods leverage gradient descent to learn the commonalities among various tasks. While previous such methods have been successful in meta-learning tasks, they resort to simple gradient descent during…

Machine Learning · Statistics 2018-06-15 Yoonho Lee, Seungjin Choi

Gradient-EM Bayesian Meta-learning

Bayesian meta-learning enables robust and fast adaptation to new tasks with uncertainty assessment. The key idea behind Bayesian meta-learning is empirical Bayes inference of a hierarchical model. In this work, we extend this framework to…

Machine Learning · Computer Science 2020-11-19 Yayi Zou, Xiaoqi Lu

Provable Guarantees for Gradient-Based Meta-Learning

We study the problem of meta-learning through the lens of online convex optimization, developing a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods.…

Machine Learning · Computer Science 2019-05-17 Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

Accelerating Meta-Learning by Sharing Gradients

The success of gradient-based meta-learning is primarily attributed to its ability to leverage related tasks to learn task-invariant information. However, the absence of interactions between different tasks in the inner loop leads to…

Machine Learning · Computer Science 2023-12-15 Oscar Chang, Hod Lipson

Meta-Learning Neural Procedural Biases

The goal of few-shot learning is to generalize and achieve high performance on new unseen learning tasks, where each task has only a limited number of examples available. Gradient-based meta-learning attempts to address this challenging…

Machine Learning · Computer Science 2024-06-13 Christian Raymond, Qi Chen, Bing Xue, Mengjie Zhang

Incremental Meta-Learning via Indirect Discriminant Alignment

The majority of modern meta-learning methods for few-shot classification tasks operate in two phases: a meta-training phase where the meta-learner learns a generic representation by solving multiple few-shot tasks sampled from a large…

Machine Learning · Computer Science 2020-04-23 Qing Liu, Orchid Majumder, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Probabilistic Model-Agnostic Meta-Learning

Meta-learning for few-shot learning entails acquiring a prior over previous tasks and experiences, such that new tasks can be learned from small amounts of data. However, a critical challenge in few-shot learning is task ambiguity: even when a…

Machine Learning · Computer Science 2019-10-18 Chelsea Finn, Kelvin Xu, Sergey Levine

Optimizing ML Training with Metagradient Descent

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom, Andrew Ilyas, Benjamin Chen, Axel Feldmann, William Moses, Aleksander Madry

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Subspace Adaptation Prior for Few-Shot Learning

Gradient-based meta-learning techniques aim to distill useful prior knowledge from a set of training tasks such that new tasks can be learned more efficiently with gradient descent. While these methods have achieved successes in various…

Machine Learning · Computer Science 2023-10-16 Mike Huisman, Aske Plaat, Jan N. van Rijn

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine tuning such models requires a large number of training examples for each target task.…

Computation and Language · Computer Science 2022-01-28 Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno

Meta-SGD: Learning to Learn Quickly for Few-Shot Learning

Few-shot learning is challenging for learning algorithms that learn each task in isolation and from scratch. In contrast, meta-learning learns from many related tasks a meta-learner that can learn a new task more accurately and faster with…

Machine Learning · Computer Science 2017-09-29 Zhenguo Li, Fengwei Zhou, Fei Chen, Hang Li

Few-Shot Learning by Dimensionality Reduction in Gradient Space

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that…

Machine Learning · Computer Science 2023-01-30 Martin Gauch, Maximilian Beck, Thomas Adler, Dmytro Kotsur, Stefan Fiel, Hamid Eghbal-zadeh, Johannes Brandstetter, Johannes Kofler, Markus Holzleitner, Werner Zellinger, Daniel Klotz, Sepp Hochreiter, Sebastian Lehner

Meta-learning the Learning Trends Shared Across Tasks

Meta-learning stands for 'learning to learn' such that generalization to new tasks is achieved. Among these methods, gradient-based meta-learning algorithms are a specific sub-class that excels at quick adaptation to new tasks with limited…

Machine Learning · Computer Science 2020-10-20 Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

Efficient Meta-Learning for Continual Learning with Taylor Expansion Approximation

Continual learning aims to alleviate catastrophic forgetting when handling consecutive tasks under non-stationary distributions. Gradient-based meta-learning algorithms have shown the capability to implicitly solve the transfer-interference…

Machine Learning · Computer Science 2022-10-04 Xiaohan Zou, Tong Lin