Related papers: Continuous-Time Model-Based Reinforcement Learning

Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs

We present two elegant solutions for modeling continuous-time dynamics, in a novel model-based reinforcement learning (RL) framework for semi-Markov decision processes (SMDPs), using neural ordinary differential equations (ODEs). Our models…

Machine Learning · Computer Science 2020-10-27 Jianzhun Du , Joseph Futoma , Finale Doshi-Velez

Sample-efficient and Scalable Exploration in Continuous-Time RL

Reinforcement learning algorithms are typically designed for discrete-time dynamics, even though the underlying real-world control systems are often continuous in time. In this paper, we study the problem of continuous-time reinforcement…

Machine Learning · Computer Science 2026-03-03 Klemens Iten , Lenart Treven , Bhavya Sukhija , Florian Dörfler , Andreas Krause

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Model-free reinforcement learning (RL) is a powerful, general tool for learning complex behaviors. However, its sample efficiency is often impractically large for solving challenging real-world problems, even with off-policy algorithms such…

Machine Learning · Computer Science 2020-02-25 Vitchyr Pong , Shixiang Gu , Murtaza Dalal , Sergey Levine

Continual Model-Based Reinforcement Learning with Hypernetworks

Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is…

Machine Learning · Computer Science 2026-05-27 Yizhou Huang , Kevin Xie , Homanga Bharadhwaj , Florian Shkurti

Output-feedback online optimal control for a class of nonlinear systems

In this paper an output-feedback model-based reinforcement learning (MBRL) method for a class of second-order nonlinear systems is developed. The control technique uses exact model knowledge and integrates a dynamic state estimator within…

Systems and Control · Computer Science 2021-07-07 Ryan Self , Michael Harlan , Rushikesh Kamalapurkar

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under…

Machine Learning · Computer Science 2026-04-03 Klemens Iten , Bruce Lee , Chenhao Li , Lenart Treven , Andreas Krause , Bhavya Sukhija

Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control

Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared to model-free algorithms by learning a predictive model of the environment. However, the performance of MBRL highly relies on the quality…

Machine Learning · Computer Science 2022-11-16 Xin-Yang Liu , Jian-Xun Wang

Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning

Offline reinforcement learning (RL) offers a powerful paradigm for data-driven control. Compared to model-free approaches, offline model-based RL (MBRL) explicitly learns a world model from a static dataset and uses it as a surrogate…

Machine Learning · Computer Science 2026-02-02 Jiayu Chen , Le Xu , Aravind Venugopal , Jeff Schneider

Model-based Lookahead Reinforcement Learning

Model-based Reinforcement Learning (MBRL) allows data-efficient learning which is required in real world applications such as robotics. However, despite the impressive data-efficiency, MBRL does not achieve the final performance of…

Machine Learning · Computer Science 2019-08-19 Zhang-Wei Hong , Joni Pajarinen , Jan Peters

Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control

We introduce $\mathcal{L}_1$-MBRL, a control-theoretic augmentation scheme for Model-Based Reinforcement Learning (MBRL) algorithms. Unlike model-free approaches, MBRL algorithms learn a model of the transition function using data and use…

Systems and Control · Electrical Eng. & Systems 2024-03-25 Minjun Sung , Sambhu H. Karumanchi , Aditya Gahlawat , Naira Hovakimyan

Offline Model-Based Reinforcement Learning with Anti-Exploration

Model-based reinforcement learning (MBRL) algorithms learn a dynamics model from collected data and apply it to generate synthetic trajectories to enable faster learning. This is an especially promising paradigm in offline reinforcement…

Machine Learning · Computer Science 2024-08-21 Padmanaba Srinivasan , William Knottenbelt

A Unified Framework for Alternating Offline Model Training and Policy Learning

In offline model-based reinforcement learning (offline MBRL), we learn a dynamic model from historically collected data, and subsequently utilize the learned model and fixed datasets for policy learning, without further interacting with the…

Machine Learning · Computer Science 2022-10-13 Shentao Yang , Shujian Zhang , Yihao Feng , Mingyuan Zhou

Accelerating Model-Based Reinforcement Learning with State-Space World Models

Reinforcement learning (RL) is a powerful approach for robot learning. However, model-free RL (MFRL) requires a large number of environment interactions to learn successful control policies. This is due to the noisy RL training updates and…

Robotics · Computer Science 2025-02-28 Maria Krinner , Elie Aljalbout , Angel Romero , Davide Scaramuzza

Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

Continuous-time reinforcement learning (CTRL) provides a natural framework for sequential decision-making in dynamic environments where interactions evolve continuously over time. While CTRL has shown growing empirical success, its ability…

Machine Learning · Computer Science 2025-12-04 Runze Zhao , Yue Yu , Ruhan Wang , Chunfeng Huang , Dongruo Zhou

Multi-Agent Reinforcement Learning with Multi-Step Generative Models

We consider model-based reinforcement learning (MBRL) in 2-agent, high-fidelity continuous control problems -- an important domain for robots interacting with other agents in the same workspace. For non-trivial dynamical systems, MBRL…

Machine Learning · Computer Science 2019-11-04 Orr Krupnik , Igor Mordatch , Aviv Tamar

Efficient Exploration in Continuous-time Model-based Reinforcement Learning

Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents…

Machine Learning · Computer Science 2023-11-01 Lenart Treven , Jonas Hübotter , Bhavya Sukhija , Florian Dörfler , Andreas Krause

Improving planning and MBRL with temporally-extended actions

Continuous time systems are often modeled using discrete time dynamics but this requires a small simulation step to maintain accuracy. In turn, this requires a large planning horizon which leads to computationally demanding planning…

Machine Learning · Computer Science 2025-10-23 Palash Chatterjee , Roni Khardon

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo , Allan Zhou , Archit Sharma , Chelsea Finn

Model Imitation for Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) aims to learn a dynamic model to reduce the number of interactions with real-world environments. However, due to estimation error, rollouts in the learned model, especially those of long horizons,…

Machine Learning · Computer Science 2020-03-17 Yueh-Hua Wu , Ting-Han Fan , Peter J. Ramadge , Hao Su