Related papers: Meta-Learning Parameterized First-Order Optimizers…

Overcoming Catastrophic Forgetting via Direction-Constrained Optimization

This paper studies a new design of the optimization algorithm for training deep learning models with a fixed architecture of the classification network in a continual learning framework. The training data is non-stationary and the…

Machine Learning · Computer Science 2022-07-05 Yunfei Teng , Anna Choromanska , Murray Campbell , Songtao Lu , Parikshit Ram , Lior Horesh

Unsupervised Learning for Combinatorial Optimization Needs Meta-Learning

A general framework of unsupervised learning for combinatorial optimization (CO) is to train a neural network (NN) whose output gives a problem solution by directly optimizing the CO objective. Albeit with some advantages over traditional…

Machine Learning · Computer Science 2023-01-24 Haoyu Wang , Pan Li

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

We address the challenge of optimizing meta-parameters (hyperparameters) in machine learning, a key factor for efficient training and high model performance. Rather than relying on expensive meta-parameter search methods, we introduce…

Machine Learning · Computer Science 2025-07-10 Arsalan Sharifnassab , Saber Salehkaleybar , Richard Sutton

Learning Algorithm Hyperparameters for Fast Parametric Convex Optimization

We introduce a machine-learning framework to learn the hyperparameter sequence of first-order methods (e.g., the step sizes in gradient descent) to quickly solve parametric convex optimization problems. Our computational architecture…

Optimization and Control · Mathematics 2024-12-23 Rajiv Sambharya , Bartolomeo Stellato

Investigation into the Training Dynamics of Learned Optimizers

Optimization is an integral part of modern deep learning. Recently, the concept of learned optimizers has emerged as a way to accelerate this optimization process by replacing traditional, hand-crafted algorithms with meta-learned…

Machine Learning · Computer Science 2023-12-13 Jan Sobotka , Petr Šimánek , Daniel Vašata

Feasibility-Aware Decision-Focused Learning for Predicting Parameters in the Constraints

When some parameters of a constrained optimization problem (COP) are uncertain, this gives rise to a predict-then-optimize (PtO) problem, comprising two stages: the prediction of the unknown parameters from contextual information and the…

Machine Learning · Computer Science 2025-10-28 Jayanta Mandi , Marianne Defresne , Senne Berden , Tias Guns

One-step learning algorithm selection for classification via convolutional neural networks

As with any task, the process of building machine learning models can benefit from prior experience. Meta-learning for classifier selection leverages knowledge about the characteristics of different datasets and/or the past performance of…

Machine Learning · Computer Science 2025-08-26 Sebastian Maldonado , Carla Vairetti , Ignacio Figueroa

Large-Scale Meta-Learning with Continual Trajectory Shifting

Meta-learning of shared initialization parameters has shown to be highly effective in solving few-shot learning tasks. However, extending the framework to many-shot scenarios, which may further enhance its practicality, has been relatively…

Machine Learning · Computer Science 2022-02-17 Jaewoong Shin , Hae Beom Lee , Boqing Gong , Sung Ju Hwang

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

Development of Deep Learning Optimizers: Approaches, Concepts, and Update Rules

Deep learning optimizers are optimization algorithms that enable deep neural networks to learn. The effectiveness of learning is highly dependent on the optimizer employed in the training process. Alongside the rapid advancement of deep…

Machine Learning · Computer Science 2025-09-24 Doğay Altınel

Meta-Regularization: An Approach to Adaptive Choice of the Learning Rate in Gradient Descent

We propose \textit{Meta-Regularization}, a novel approach for the adaptive choice of the learning rate in first-order gradient descent methods. Our approach modifies the objective function by adding a regularization term on the learning…

Machine Learning · Computer Science 2021-04-13 Guangzeng Xie , Hao Jin , Dachao Lin , Zhihua Zhang

Arbitrary Order Meta-Learning with Simple Population-Based Evolution

Meta-learning, the notion of learning to learn, enables learning systems to quickly and flexibly solve new tasks. This usually involves defining a set of outer-loop meta-parameters that are then used to update a set of inner-loop…

Machine Learning · Computer Science 2023-03-17 Chris Lu , Sebastian Towers , Jakob Foerster

Improving physics-informed neural networks with meta-learned optimization

We show that the error achievable using physics-informed neural networks for solving systems of differential equations can be substantially reduced when these networks are trained using meta-learned optimization methods rather than to using…

Machine Learning · Computer Science 2023-03-15 Alex Bihlo

A Review of Meta-level Learning in the Context of Multi-component, Multi-level Evolving Prediction Systems

The exponential growth of volume, variety and velocity of data is raising the need for investigations of automated or semi-automated ways to extract useful patterns from the data. It requires deep expert knowledge and extensive…

Machine Learning · Computer Science 2020-07-22 Abbas Raza Ali , Marcin Budka , Bogdan Gabrys

Greedy Learning to Optimize with Convergence Guarantees

Learning to optimize is an approach that leverages training data to accelerate the solution of optimization problems. Many approaches use unrolling to parametrize the update step and learn optimal parameters. Although L2O has shown…

Optimization and Control · Mathematics 2025-07-15 Patrick Fahy , Mohammad Golbabaee , Matthias J. Ehrhardt

Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation

Using machine learning to solve combinatorial optimization (CO) problems is challenging, especially when the data is unlabeled. This work proposes an unsupervised learning framework for CO problems. Our framework follows a standard…

Machine Learning · Computer Science 2022-10-25 Haoyu Wang , Nan Wu , Hang Yang , Cong Hao , Pan Li

Learning Acceleration Algorithms for Fast Parametric Convex Optimization with Certified Robustness

We develop a machine-learning framework to learn hyperparameter sequences for accelerated first-order methods (e.g., the step size and momentum sequences in accelerated gradient descent) to quickly solve parametric convex optimization…

Optimization and Control · Mathematics 2025-10-07 Rajiv Sambharya , Jinho Bok , Nikolai Matni , George Pappas

On First-Order Meta-Learning Algorithms

This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution.…

Machine Learning · Computer Science 2018-10-23 Alex Nichol , Joshua Achiam , John Schulman

Provable Guarantees for Gradient-Based Meta-Learning

We study the problem of meta-learning through the lens of online convex optimization, developing a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods.…

Machine Learning · Computer Science 2019-05-17 Mikhail Khodak , Maria-Florina Balcan , Ameet Talwalkar

Distributed Optimization for Over-Parameterized Learning

Distributed optimization often consists of two updating phases: local optimization and inter-node communication. Conventional approaches require working nodes to communicate with the server every one or few iterations to guarantee…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-17 Chi Zhang , Qianxiao Li