Related papers: Offline Learning for Planning: A Summary

Non-Stationary Off-Policy Optimization

Off-policy learning is a framework for evaluating and optimizing policies without deploying them, from data collected by another policy. Real-world environments are typically non-stationary and the offline learned policies should adapt to…

Machine Learning · Computer Science 2021-04-06 Joey Hong , Branislav Kveton , Manzil Zaheer , Yinlam Chow , Amr Ahmed

Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies

Data collection is crucial for learning robust world models in model-based reinforcement learning. The most prevalent strategies are to actively collect trajectories by interacting with the environment during online training or training on…

Machine Learning · Computer Science 2025-09-09 Jiaqi Chen , Ji Shi , Cansu Sancaktar , Jonas Frey , Georg Martius

Bayesian Inverse Transition Learning for Offline Settings

Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key…

Machine Learning · Computer Science 2023-08-10 Leo Benac , Sonali Parbhoo , Finale Doshi-Velez

Model-Based Offline Planning

Offline learning is a key part of making reinforcement learning (RL) useable in real systems. Offline RL looks at scenarios where there is data from a system's operation, but no direct access to the system when learning a policy. Recent…

Machine Learning · Computer Science 2021-03-18 Arthur Argenson , Gabriel Dulac-Arnold

Evaluation-Time Policy Switching for Offline Reinforcement Learning

Offline reinforcement learning (RL) looks at learning how to optimally solve tasks using a fixed dataset of interactions from the environment. Many off-policy algorithms developed for online learning struggle in the offline setting as they…

Machine Learning · Computer Science 2025-03-18 Natinael Solomon Neggatu , Jeremie Houssineau , Giovanni Montana

Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling

Recent advances in batch (offline) reinforcement learning have shown promising results in learning from available offline data and proved offline reinforcement learning to be an essential toolkit in learning control policies in a model-free…

Machine Learning · Computer Science 2022-12-19 Ashish Kumar , Ilya Kuzovkin

A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets

Recent Offline Reinforcement Learning methods have succeeded in learning high-performance policies from fixed datasets of experience. A particularly effective approach learns to first identify and then mimic optimal decision-making…

Machine Learning · Computer Science 2023-12-12 Jake Grigsby , Yanjun Qi

Revisiting Design Choices in Offline Model-Based Reinforcement Learning

Offline reinforcement learning enables agents to leverage large pre-collected datasets of environment transitions to learn control policies, circumventing the need for potentially expensive or unsafe online data collection. Significant…

Machine Learning · Computer Science 2022-03-17 Cong Lu , Philip J. Ball , Jack Parker-Holder , Michael A. Osborne , Stephen J. Roberts

On the Learnability of Offline Model-Based Optimization: A Ranking Perspective

Offline model-based optimization (MBO) seeks to discover high-performing designs using only a fixed dataset of past evaluations. Most existing methods rely on learning a surrogate model via regression and implicitly assume that good…

Machine Learning · Computer Science 2026-03-05 Shen-Huan Lyu , Rong-Xi Tan , Ke Xue , Yi-Xiao He , Yu Huang , Qingfu Zhang , Chao Qian

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Offline reinforcement learning seeks to utilize offline (observational) data to guide the learning of (causal) sequential decision making strategies. The hope is that offline reinforcement learning coupled with function approximation…

Machine Learning · Computer Science 2020-10-23 Ruosong Wang , Dean P. Foster , Sham M. Kakade

Benchmarks and Algorithms for Offline Preference-Based Reward Learning

Learning a reward function from human preferences is challenging as it typically requires having a high-fidelity simulator or using expensive and potentially unsafe actual physical rollouts in the environment. However, in many tasks the…

Machine Learning · Computer Science 2023-01-05 Daniel Shin , Anca D. Dragan , Daniel S. Brown

Online Off-policy Prediction

This paper investigates the problem of online prediction learning, where learning proceeds continuously as the agent interacts with an environment. The predictions made by the agent are contingent on a particular way of behaving,…

Machine Learning · Computer Science 2018-11-08 Sina Ghiassian , Andrew Patterson , Martha White , Richard S. Sutton , Adam White

Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is impractical when online interactions are costly. Offline RL provides an alternative solution by directly learning from the previously collected…

Machine Learning · Computer Science 2023-03-15 Han Zheng , Xufang Luo , Pengfei Wei , Xuan Song , Dongsheng Li , Jing Jiang

A Strong Baseline for Batch Imitation Learning

Imitation of expert behaviour is a highly desirable and safe approach to the problem of sequential decision making. We provide an easy-to-implement, novel algorithm for imitation learning under a strict data paradigm, in which the agent…

Machine Learning · Computer Science 2023-02-07 Matthew Smith , Lucas Maystre , Zhenwen Dai , Kamil Ciosek

Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning

Offline reinforcement learning (RL) presents distinct challenges as it relies solely on observational data. A central concern in this context is ensuring the safety of the learned policy by quantifying uncertainties associated with various…

Machine Learning · Computer Science 2025-07-03 Xiaocong Chen , Siyu Wang , Tong Yu , Lina Yao

F1tenth Autonomous Racing With Offline Reinforcement Learning Methods

Autonomous racing serves as a critical platform for evaluating automated driving systems and enhancing vehicle mobility intelligence. This work investigates offline reinforcement learning methods to train agents within the dynamic F1tenth…

Robotics · Computer Science 2024-08-09 Prajwal Koirala , Cody Fleming

Online Learning and Planning in Partially Observable Domains without Prior Knowledge

How an agent can act optimally in stochastic, partially observable domains is a challenge problem, the standard approach to address this issue is to learn the domain model firstly and then based on the learned model to find the (near)…

Artificial Intelligence · Computer Science 2019-06-13 Yunlong Liu , Jianyang Zheng

On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples

Offline reinforcement learning (offline RL) considers problems where learning is performed using only previously collected samples and is helpful for the settings in which collecting new data is costly or risky. In model-based offline RL,…

Machine Learning · Computer Science 2023-03-09 Mustafa O. Karabag , Ufuk Topcu

Active Learning for Control-Oriented Identification of Nonlinear Systems

Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Bruce D. Lee , Ingvar Ziemann , George J. Pappas , Nikolai Matni

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

Offline reinforcement learning endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the…

Machine Learning · Computer Science 2024-11-12 Yunpeng Qing , Shunyu liu , Jingyuan Cong , Kaixuan Chen , Yihe Zhou , Mingli Song