Related papers: Efficient Exploration in Continuous-time Model-bas…

Sample-efficient and Scalable Exploration in Continuous-Time RL

Reinforcement learning algorithms are typically designed for discrete-time dynamics, even though the underlying real-world control systems are often continuous in time. In this paper, we study the problem of continuous-time reinforcement…

Machine Learning · Computer Science 2026-03-03 Klemens Iten , Lenart Treven , Bhavya Sukhija , Florian Dörfler , Andreas Krause

Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs

We present two elegant solutions for modeling continuous-time dynamics, in a novel model-based reinforcement learning (RL) framework for semi-Markov decision processes (SMDPs), using neural ordinary differential equations (ODEs). Our models…

Machine Learning · Computer Science 2020-10-27 Jianzhun Du , Joseph Futoma , Finale Doshi-Velez

Reinforcement Learning with Fast Stabilization in Linear Dynamical Systems

In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems. When learning a dynamical system, one needs to stabilize the unknown dynamics in order to avoid system blow-ups. We propose an…

Machine Learning · Computer Science 2022-06-06 Sahin Lale , Kamyar Azizzadenesheli , Babak Hassibi , Anima Anandkumar

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under…

Machine Learning · Computer Science 2026-04-03 Klemens Iten , Bruce Lee , Chenhao Li , Lenart Treven , Andreas Krause , Bhavya Sukhija

Active Learning for Control-Oriented Identification of Nonlinear Systems

Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Bruce D. Lee , Ingvar Ziemann , George J. Pappas , Nikolai Matni

Continuous-Time Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models whereas physical systems and the vast majority of control tasks operate in continuous-time. To avoid time-discretization approximation of the…

Machine Learning · Computer Science 2021-06-14 Çağatay Yıldız , Markus Heinonen , Harri Lähdesmäki

Model-based Reinforcement Learning for Continuous Control with Posterior Sampling

Balancing exploration and exploitation is crucial in reinforcement learning (RL). In this paper, we study model-based posterior sampling for reinforcement learning (PSRL) in continuous state-action spaces theoretically and empirically.…

Machine Learning · Computer Science 2021-11-18 Ying Fan , Yifei Ming

Overcoming Model Bias for Robust Offline Deep Reinforcement Learning

State-of-the-art reinforcement learning algorithms mostly rely on being allowed to directly interact with their environment to collect millions of observations. This makes it hard to transfer their success to industrial control problems,…

Machine Learning · Computer Science 2021-07-23 Phillip Swazinna , Steffen Udluft , Thomas Runkler

Adaptive Discretization for Model-Based Reinforcement Learning

We introduce the technique of adaptive discretization to design an efficient model-based episodic reinforcement learning algorithm in large (potentially continuous) state-action spaces. Our algorithm is based on optimistic one-step value…

Machine Learning · Computer Science 2020-10-26 Sean R. Sinclair , Tianyu Wang , Gauri Jain , Siddhartha Banerjee , Christina Lee Yu

A closer look at temporal variability in dynamic online learning

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo , Francesco Orabona

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning

We study the problem of reinforcement learning in infinite-horizon discounted linear Markov decision processes (MDPs), and propose the first computationally efficient algorithm achieving rate-optimal regret guarantees in this setting. Our…

Machine Learning · Computer Science 2026-03-16 Antoine Moulin , Gergely Neu , Luca Viano

Memory-efficient model-based deep learning with convergence and robustness guarantees

Computational imaging has been revolutionized by compressed sensing algorithms, which offer guaranteed uniqueness, convergence, and stability properties. Model-based deep learning methods that combine imaging physics with learned…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Aniket Pramanik , M. Bridget Zimmerman , Mathews Jacob

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics. However, there are some remaining issues, such as planning efficient explorations to learn more…

Machine Learning · Computer Science 2021-07-06 Yao Yao , Li Xiao , Zhicheng An , Wanpeng Zhang , Dijun Luo

Online convex optimization for data-driven control of dynamical systems

We propose an algorithm based on online convex optimization for controlling discrete-time linear dynamical systems. The algorithm is data-driven, i.e., does not require a model of the system, and is able to handle a priori unknown and…

Optimization and Control · Mathematics 2022-11-17 Marko Nonhoff , Matthias A. Müller

Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning

This paper studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in \cite{zhou2020mv}, the discrete-time model makes more general assumptions about the asset's return…

Mathematical Finance · Quantitative Finance 2023-12-27 Xiangyu Cui , Xun Li , Yun Shi , Si Zhao

Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning

We propose Deterministic Sequencing of Exploration and Exploitation (DSEE) algorithm with interleaving exploration and exploitation epochs for model-based RL problems that aim to simultaneously learn the system model, i.e., a Markov…

Machine Learning · Computer Science 2022-12-21 Piyush Gupta , Vaibhav Srivastava

Efficient Model-Based Reinforcement Learning for Robot Control via Online Optimization

We present an online model-based reinforcement learning algorithm suitable for controlling complex robotic systems directly in the real world. Unlike prevailing sim-to-real pipelines that rely on extensive offline simulation and model-free…

Robotics · Computer Science 2026-05-07 Fang Nan , Hao Ma , Qinghua Guan , Josie Hughes , Michael Muehlebach , Marco Hutter

Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems

We study reinforcement learning (RL) for the same class of continuous-time stochastic linear--quadratic (LQ) control problems as in \cite{huang2024sublinear}, where volatilities depend on both states and controls while states are…

Machine Learning · Computer Science 2025-07-24 Yilie Huang , Xun Yu Zhou

Efficient Model-Free Reinforcement Learning Using Gaussian Process

Efficient Reinforcement Learning usually takes advantage of demonstration or good exploration strategy. By applying posterior sampling in model-free RL under the hypothesis of GP, we propose Gaussian Process Posterior Sampling Reinforcement…

Machine Learning · Computer Science 2018-12-12 Ying Fan , Letian Chen , Yizhou Wang

Online Constrained Model-based Reinforcement Learning

Applying reinforcement learning to robotic systems poses a number of challenging problems. A key requirement is the ability to handle continuous state and action spaces while remaining within a limited time and resource budget.…

Machine Learning · Computer Science 2020-06-29 Benjamin van Niekerk , Andreas Damianou , Benjamin Rosman