Related papers: Experience-based Optimization: A Coevolutionary Ap…

Accelerating exploration and representation learning with offline pre-training

Sequential decision-making agents struggle with long horizon tasks, since solving them requires multi-step reasoning. Most reinforcement learning (RL) algorithms address this challenge by improved credit assignment, introducing memory…

Machine Learning · Computer Science 2023-04-04 Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

Learning from Offline and Online Experiences: A Hybrid Adaptive Operator Selection Framework

In many practical applications, usually, similar optimisation problems or scenarios repeatedly appear. Learning from previous problem-solving experiences can help adjust algorithm components of meta-heuristics, e.g., adaptively selecting…

Neural and Evolutionary Computing · Computer Science 2024-04-17 Jiyuan Pei, Jialin Liu, Yi Mei

Offline Decision Transformers for Neural Combinatorial Optimization: Surpassing Heuristics on the Traveling Salesman Problem

Combinatorial optimization problems like the Traveling Salesman Problem are critical in industry yet NP-hard. Neural Combinatorial Optimization has shown promise, but its reliance on online reinforcement learning (RL) hampers deployment and…

Machine Learning · Computer Science 2026-03-27 Hironori Ohigashi, Shinichiro Hamada

Locally Adaptive Multi-Objective Learning

We consider the general problem of learning a predictor that satisfies multiple objectives of interest simultaneously, a broad framework that captures a range of specific learning goals including calibration, regret, and multiaccuracy. We…

Machine Learning · Computer Science 2026-02-17 Jivat Neet Kaur, Isaac Gibbs, Michael I. Jordan

Learning by Examples Based on Multi-level Optimization

Learning by examples, which learns to solve a new problem by looking into how similar problems are solved, is an effective learning method in human learning. When a student learns a new topic, he/she finds out exemplar topics that are…

Machine Learning · Computer Science 2021-09-23 Shentong Mo, Pengtao Xie

An Evolutionary Strategy based on Partial Imitation for Solving Optimization Problems

In this work we introduce an evolutionary strategy to solve combinatorial optimization tasks, i.e. problems characterized by a discrete search space. In particular, we focus on the Traveling Salesman Problem (TSP), i.e. a famous problem…

Disordered Systems and Neural Networks · Physics 2016-08-05 Marco Alberto Javarone

"Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations

Reinforcement Learning (RL) has achieved great success in sequential decision-making problems, but often at the cost of a large number of agent-environment interactions. To improve sample efficiency, methods like Reinforcement Learning from…

Artificial Intelligence · Computer Science 2024-10-04 Muhan Hou, Koen Hindriks, A. E. Eiben, Kim Baraka

Quality over Quantity: An Effective Large-Scale Data Reduction Strategy Based on Pointwise V-Information

In order to increase the effectiveness of model training, data reduction is essential to data-centric Artificial Intelligence (AI). It achieves this by locating the most instructive examples in massive datasets. To increase data quality and…

Machine Learning · Computer Science 2025-08-11 Fei Chen, Wenchi Zhou

Representation Matters: Offline Pretraining for Sequential Decision Making

The recent success of supervised learning methods on ever larger offline datasets has spurred interest in the reinforcement learning (RL) field to investigate whether the same paradigms can be translated to RL algorithms. This research…

Machine Learning · Computer Science 2021-02-12 Mengjiao Yang, Ofir Nachum

Using Offline Data to Speed Up Reinforcement Learning in Procedurally Generated Environments

One of the key challenges of Reinforcement Learning (RL) is the ability of agents to generalise their learned policy to unseen settings. Moreover, training RL agents requires large numbers of interactions with the environment. Motivated by…

Machine Learning · Computer Science 2024-12-10 Alain Andres, Lukas Schäfer, Stefano V. Albrecht, Javier Del Ser

Behavior-based Neuroevolutionary Training in Reinforcement Learning

In addition to their undisputed success in solving classical optimization problems, neuroevolutionary and population-based algorithms have become an alternative to standard reinforcement learning methods. However, evolutionary methods often…

Neural and Evolutionary Computing · Computer Science 2021-05-18 Jörg Stork, Martin Zaefferer, Nils Eisler, Patrick Tichelmann, Thomas Bartz-Beielstein, A. E. Eiben

A Reinforcement Learning Inspired Latent Yield Based Adaptive Algorithm Switching Mechanism

Selecting the most suitable algorithm for a given problem instance remains a challenging task, particularly in online or dynamic environments where problem characteristics evolve over time. Relying solely on instantaneous performance…

Multiagent Systems · Computer Science 2026-05-26 Jayprakash S. Nair, Jimson Mathew, Shivashankar B. Nair

Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies

Data collection is crucial for learning robust world models in model-based reinforcement learning. The most prevalent strategies are to actively collect trajectories by interacting with the environment during online training or training on…

Machine Learning · Computer Science 2025-09-09 Jiaqi Chen, Ji Shi, Cansu Sancaktar, Jonas Frey, Georg Martius

Local training and enrichment based on a residual localization strategy

To efficiently tackle parametrized multi and/or large scale problems, we propose an adaptive localized model order reduction framework combining both local offline training and local online enrichment with localized error control. For the…

Numerical Analysis · Mathematics 2024-04-26 Tim Keil, Mario Ohlberger, Felix Schindler, Julia Schleuß

Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation

We consider the problem of sequential recommendation, where the current recommendation is made based on past interactions. This recommendation task requires efficient processing of the sequential data and aims to provide recommendations…

Information Retrieval · Computer Science 2023-07-28 Xumei Xi, Yuke Zhao, Quan Liu, Liwen Ouyang, Yang Wu

Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

We consider reinforcement learning (RL) methods in offline domains without additional online data collection, such as mobile health applications. Most of existing policy optimization algorithms in the computer science literature are…

Machine Learning · Statistics 2022-07-28 Chengchun Shi, Shikai Luo, Yuan Le, Hongtu Zhu, Rui Song

Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization

Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it removes the need for expert knowledge or pre-solved instances. However, it is unrealistic to expect an agent to solve these (often NP-)hard…

Artificial Intelligence · Computer Science 2023-11-15 Nathan Grinsztajn, Daniel Furelos-Blanco, Shikha Surana, Clément Bonnet, Thomas D. Barrett

Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization

Reinforcement learning (RL) has shown significant promise for sequential portfolio optimization tasks, such as stock trading, where the objective is to maximize cumulative returns while minimizing risks using historical data. However,…

Machine Learning · Computer Science 2025-05-20 Haochen Yuan, Minting Pan, Yunbo Wang, Siyu Gao, Philip S. Yu, Xiaokang Yang

Online Experiential Learning for Language Models

The prevailing paradigm for improving large language models relies on offline training with human annotations or simulated environments, leaving the rich experience accumulated during real-world deployment entirely unexploited. We propose…

Computation and Language · Computer Science 2026-03-18 Tianzhu Ye, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei

Training Learned Optimizers with Randomly Initialized Learned Optimizers

Learned optimizers are increasingly effective, with performance exceeding that of hand-designed optimizers such as Adam (Kingma & Ba, 2014) on specific tasks (Metz et al., 2019). Despite the potential gains available, in…

Machine Learning · Computer Science 2021-01-20 Luke Metz, C. Daniel Freeman, Niru Maheswaranathan, Jascha Sohl-Dickstein