Related papers: Training Learned Optimizers with Randomly Initiali…

Investigation into the Training Dynamics of Learned Optimizers

Optimization is an integral part of modern deep learning. Recently, the concept of learned optimizers has emerged as a way to accelerate this optimization process by replacing traditional, hand-crafted algorithms with meta-learned…

Machine Learning · Computer Science 2023-12-13 Jan Sobotka , Petr Šimánek , Daniel Vašata

Reverse engineering learned optimizers reveals known and novel mechanisms

Learned optimizers are algorithms that can themselves be trained to solve optimization problems. In contrast to baseline optimizers (such as momentum or Adam) that use simple update rules derived from theoretical principles, learned…

Machine Learning · Computer Science 2021-12-09 Niru Maheswaranathan , David Sussillo , Luke Metz , Ruoxi Sun , Jascha Sohl-Dickstein

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Much as replacing hand-designed features with learned functions has revolutionized how we solve perceptual tasks, we believe learned algorithms will transform how we train models. In this work we focus on general-purpose learned optimizers…

Machine Learning · Computer Science 2020-09-24 Luke Metz , Niru Maheswaranathan , C. Daniel Freeman , Ben Poole , Jascha Sohl-Dickstein

Understanding and correcting pathologies in the training of learned optimizers

Deep learning has shown that learned functions can dramatically outperform hand-designed functions on perceptual tasks. Analogously, this suggests that learned optimizers may similarly outperform current hand-designed optimizers, especially…

Neural and Evolutionary Computing · Computer Science 2019-06-11 Luke Metz , Niru Maheswaranathan , Jeremy Nixon , C. Daniel Freeman , Jascha Sohl-Dickstein

Humans learn too: Better Human-AI Interaction using Optimized Human Inputs

Humans rely more and more on systems with AI components. The AI community typically treats human inputs as a given and optimizes AI models only. This thinking is one-sided and it neglects the fact that humans can learn, too. In this work,…

Human-Computer Interaction · Computer Science 2020-09-22 Johannes Schneider

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

Practical tradeoffs between memory, compute, and performance in learned optimizers

Optimization plays a costly and crucial role in developing machine learning systems. In learned optimizers, the few hyperparameters of commonly used hand-designed optimizers, e.g. Adam or SGD, are replaced with flexible parametric…

Machine Learning · Computer Science 2022-07-19 Luke Metz , C. Daniel Freeman , James Harrison , Niru Maheswaranathan , Jascha Sohl-Dickstein

A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases

Learned optimizers -- neural networks that are trained to act as optimizers -- have the potential to dramatically accelerate training of machine learning models. However, even when meta-trained across thousands of tasks at huge…

Machine Learning · Computer Science 2022-09-23 James Harrison , Luke Metz , Jascha Sohl-Dickstein

HUB: Guiding Learned Optimizers with Continuous Prompt Tuning

Learned optimizers are a crucial component of meta-learning. Recent advancements in scalable learned optimizers have demonstrated their superior performance over hand-designed optimizers in various tasks. However, certain characteristics of…

Machine Learning · Computer Science 2023-06-01 Gaole Dai , Wei Wu , Ziyu Wang , Jie Fu , Shanghang Zhang , Tiejun Huang

Automated Optimization Modeling via a Localizable Error-Driven Perspective

Automated optimization modeling via Large Language Models (LLMs) has emerged as a promising approach to assist complex human decision-making. While post-training has become a pivotal technique to enhance LLMs' capabilities in this domain,…

Machine Learning · Computer Science 2026-02-13 Weiting Liu , Han Wu , Yufei Kuang , Xiongwei Han , Tao Zhong , Jianfeng Feng , Wenlian Lu

Narrowing the Focus: Learned Optimizers for Pretrained Models

In modern deep learning, the models are learned by applying gradient updates using an optimizer, which transforms the updates based on various statistics. Optimizers are often hand-designed and tuning their hyperparameters is a big part of…

Machine Learning · Computer Science 2024-10-08 Gus Kristiansen , Mark Sandler , Andrey Zhmoginov , Nolan Miller , Anirudh Goyal , Jihwan Lee , Max Vladymyrov

Human-like machine learning: limitations and suggestions

This paper attempts to address the issues of machine learning in its current implementation. It is known that machine learning algorithms require a significant amount of data for training purposes, whereas recent developments in deep…

Machine Learning · Computer Science 2018-11-16 Georgios Mastorakis

Learning to Optimize for Reinforcement Learning

In recent years, by leveraging more data, computation, and diverse tasks, learned optimizers have achieved remarkable success in supervised learning, outperforming classical hand-designed optimizers. Reinforcement learning (RL) is…

Machine Learning · Computer Science 2024-06-05 Qingfeng Lan , A. Rupam Mahmood , Shuicheng Yan , Zhongwen Xu

Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach

Sampling-based model predictive control (MPC) has found significant success in optimal control problems with non-smooth system dynamics and cost function. Many machine learning-based works proposed to improve MPC by a) learning or…

Machine Learning · Computer Science 2024-01-08 Sungwook Yang , Chaoying Pei , Ran Dai , Chuangchuang Sun

Learning to Optimize in Swarms

Learning to optimize has emerged as a powerful framework for various optimization and machine learning tasks. Current such "meta-optimizers" often learn in the space of continuous optimization algorithms that are point-based and…

Machine Learning · Computer Science 2019-11-19 Yue Cao , Tianlong Chen , Zhangyang Wang , Yang Shen

Learning Randomized Algorithms with Transformers

Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large…

Machine Learning · Computer Science 2024-08-21 Johannes von Oswald , Seijin Kobayashi , Yassir Akram , Angelika Steger

Self-training Language Models for Arithmetic Reasoning

Recent language models achieve impressive results in tasks involving complex multistep reasoning, but scaling these capabilities further traditionally requires expensive collection of more annotated data. In this work, we explore the…

Computation and Language · Computer Science 2024-10-25 Marek Kadlčík , Michal Štefánik

Arbitrary Order Meta-Learning with Simple Population-Based Evolution

Meta-learning, the notion of learning to learn, enables learning systems to quickly and flexibly solve new tasks. This usually involves defining a set of outer-loop meta-parameters that are then used to update a set of inner-loop…

Machine Learning · Computer Science 2023-03-17 Chris Lu , Sebastian Towers , Jakob Foerster

Unsupervised Meta-Learning for Reinforcement Learning

Meta-learning algorithms use past experience to learn to quickly solve new tasks. In the context of reinforcement learning, meta-learning algorithms acquire reinforcement learning procedures to solve new problems more efficiently by…

Machine Learning · Computer Science 2020-05-01 Abhishek Gupta , Benjamin Eysenbach , Chelsea Finn , Sergey Levine

Learning to Optimize

Algorithm design is a laborious process and often requires many iterations of ideation and validation. In this paper, we explore automating algorithm design and present a method to learn an optimization algorithm, which we believe to be the…

Machine Learning · Computer Science 2016-06-07 Ke Li , Jitendra Malik