Related papers: A Bridge Between Hyperparameter Optimization and L…

Bilevel Programming for Hyperparameter Optimization and Meta-Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by taking into explicit account the…

Machine Learning · Statistics 2018-07-04 Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, Massimiliano Pontil

When Meta-Learning Meets Online and Continual Learning: A Survey

Over the past decade, deep neural networks have demonstrated significant success using the training scheme that involves mini-batch stochastic gradient descent on extensive datasets. Expanding upon this accomplishment, there has been a…

Machine Learning · Computer Science 2024-11-11 Jaehyeon Son, Soochan Lee, Gunhee Kim

Meta-Learning and representation learner: A short theoretical note

Meta-learning, or "learning to learn," is a subfield of machine learning where the goal is to develop models and algorithms that can learn from various tasks and improve their learning process over time. Unlike traditional machine learning…

Machine Learning · Computer Science 2024-07-23 Mouad El Bouchattaoui

Meta-Learning in Neural Networks: A Survey

The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent years. Contrary to conventional approaches to AI where tasks are solved from scratch using a fixed learning algorithm, meta-learning aims to…

Machine Learning · Computer Science 2020-11-10 Timothy Hospedales, Antreas Antoniou, Paul Micaelli, Amos Storkey

A Comprehensive Overview and Survey of Recent Advances in Meta-Learning

This article reviews meta-learning, also known as learning-to-learn, which seeks rapid and accurate model adaptation to unseen tasks with applications in highly automated AI, few-shot learning, natural language processing and robotics. Unlike…

Machine Learning · Computer Science 2020-10-27 Huimin Peng

Gradient Agreement as an Optimization Objective for Meta-Learning

This paper presents a novel optimization method for maximizing generalization over tasks in meta-learning. The goal of meta-learning is to learn a model for an agent adapting rapidly when presented with previously unseen tasks. Tasks are…

Machine Learning · Computer Science 2018-10-19 Amir Erfan Eshratifar, David Eigen, Massoud Pedram

Scalable Nested Optimization for Deep Learning

Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this, where we have a bilevel or…

Machine Learning · Computer Science 2024-07-02 Jonathan Lorraine

Meta-Learning with Differentiable Convex Optimization

Many meta-learning approaches for few-shot learning rely on simple base learners such as nearest-neighbor classifiers. However, even in the few-shot regime, discriminatively trained linear predictors can offer better generalization. We…

Computer Vision and Pattern Recognition · Computer Science 2019-04-24 Kwonjoon Lee, Subhransu Maji, Avinash Ravichandran, Stefano Soatto

Practical recommendations for gradient-based training of deep architectures

Learning algorithms related to artificial neural networks and in particular for Deep Learning may seem to involve many bells and whistles, called hyper-parameters. This chapter is meant as a practical guide with recommendations for some of…

Machine Learning · Computer Science 2012-09-18 Yoshua Bengio

Learned Optimizers that Scale and Generalize

Learning to learn has emerged as an important direction for achieving artificial intelligence. Two of the primary barriers to its adoption are an inability to scale to larger problems and a limited ability to generalize to new tasks. We…

Machine Learning · Computer Science 2017-09-11 Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Nando de Freitas, Jascha Sohl-Dickstein

A Survey on Multi-Objective based Parameter Optimization for Deep Learning

Deep learning models form one of the most powerful machine learning models for the extraction of important features. Most of the designs of deep neural models, i.e., the initialization of parameters, are still manually tuned. Hence,…

Machine Learning · Computer Science 2023-05-18 Mrittika Chakraborty, Wreetbhas Pal, Sanghamitra Bandyopadhyay, Ujjwal Maulik

Guarantees for Tuning the Step Size using a Learning-to-Learn Approach

Choosing the right parameters for optimization algorithms is often the key to their success in practice. Solving this problem using a learning-to-learn approach -- using meta-gradient descent on a meta-objective based on the trajectory that…

Machine Learning · Statistics 2021-06-14 Xiang Wang, Shuai Yuan, Chenwei Wu, Rong Ge

Unsupervised Learning for Combinatorial Optimization Needs Meta-Learning

A general framework of unsupervised learning for combinatorial optimization (CO) is to train a neural network (NN) whose output gives a problem solution by directly optimizing the CO objective. Albeit with some advantages over traditional…

Machine Learning · Computer Science 2023-01-24 Haoyu Wang, Pan Li

Reconciling meta-learning and continual learning with online mixtures of tasks

Learning-to-learn or meta-learning leverages data-driven inductive bias to increase the efficiency of learning on a novel task. This approach encounters difficulty when transfer is not advantageous, for instance, when tasks are considerably…

Machine Learning · Computer Science 2019-06-20 Ghassen Jerfel, Erin Grant, Thomas L. Griffiths, Katherine Heller

Learning Gradient Descent: Better Generalization and Longer Horizons

Training deep neural networks is a highly nontrivial task, involving carefully selecting appropriate training algorithms, scheduling step sizes and tuning other hyperparameters. Trying different combinations can be quite labor-intensive and…

Machine Learning · Computer Science 2017-06-13 Kaifeng Lv, Shunhua Jiang, Jian Li

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm

Learning to learn is a powerful paradigm for enabling models to learn from data more effectively and efficiently. A popular approach to meta-learning is to train a recurrent model to read in a training dataset as input and output the…

Machine Learning · Computer Science 2018-02-16 Chelsea Finn, Sergey Levine

Hierarchical Expert Networks for Meta-Learning

The goal of meta-learning is to train a model on a variety of learning tasks, such that it can adapt to new problems within only a few iterations. Here we propose a principled information-theoretic model that optimally partitions the…

Machine Learning · Statistics 2020-09-10 Heinke Hihn, Daniel A. Braun

Modeling and Optimization Trade-off in Meta-learning

By searching for shared inductive biases across tasks, meta-learning promises to accelerate learning on novel tasks, but with the cost of solving a complex bilevel optimization problem. We introduce and rigorously define the trade-off…

Machine Learning · Computer Science 2021-04-15 Katelyn Gao, Ozan Sener

Decision-Focused Learning: Through the Lens of Learning to Rank

In recent years, the decision-focused learning framework, also known as predict-and-optimize, has received increasing attention. In this setting, the predictions of a machine learning model are used as estimated cost coefficients in the…

Machine Learning · Computer Science 2022-06-20 Jayanta Mandi, Víctor Bucarey, Maxime Mulamba, Tias Guns