Related papers: Model-based Reinforcement Learning for Parameteriz…

Reinforcement Learning with Parameterized Actions

We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions-discrete actions with continuous parameters. At each step the agent must select both which action to use and which parameters to use…

Artificial Intelligence · Computer Science 2015-11-30 Warwick Masson , Pravesh Ranchod , George Konidaris

Learning Dynamics Models for Model Predictive Agents

Model-Based Reinforcement Learning involves learning a \textit{dynamics model} from data, and then using this model to optimise behaviour, most often with an online \textit{planner}. Much of the recent research along these lines presents a…

Machine Learning · Computer Science 2021-09-30 Michael Lutter , Leonard Hasenclever , Arunkumar Byravan , Gabriel Dulac-Arnold , Piotr Trochim , Nicolas Heess , Josh Merel , Yuval Tassa

Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes

Hybrid action models are widely considered an effective approach to reinforcement learning (RL) modeling. The current mainstream method is to train agents under Parameterized Action Markov Decision Processes (PAMDPs), which performs well in…

Machine Learning · Computer Science 2025-01-07 Zijian Wang , Bin Wang , Mingwen Shao , Hongbo Dou , Boxiang Tao

Predictable MDP Abstraction for Unsupervised Model-Based RL

A key component of model-based reinforcement learning (RL) is a dynamics model that predicts the outcomes of actions. Errors in this predictive model can degrade the performance of model-based controllers, and complex Markov decision…

Machine Learning · Computer Science 2023-06-06 Seohong Park , Sergey Levine

Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments

Motion planning of autonomous agents in partially known environments with incomplete information is a challenging problem, particularly for complex tasks. This paper proposes a model-free reinforcement learning approach to address this…

Artificial Intelligence · Computer Science 2023-05-02 Junchao Li , Mingyu Cai , Zhen Kan , Shaoping Xiao

Meta-Reinforcement Learning Using Model Parameters

In meta-reinforcement learning, an agent is trained in multiple different environments and attempts to learn a meta-policy that can efficiently adapt to a new environment. This paper presents RAMP, a Reinforcement learning Agent using Model…

Machine Learning · Computer Science 2022-10-28 Gabriel Hartmann , Amos Azaria

Delay-Aware Model-Based Reinforcement Learning for Continuous Control

Action delays degrade the performance of reinforcement learning in many real-world systems. This paper proposes a formal definition of delay-aware Markov Decision Process and proves it can be transformed into standard MDP with augmented…

Machine Learning · Computer Science 2021-05-10 Baiming Chen , Mengdi Xu , Liang Li , Ding Zhao

Model-based Policy Optimization with Unsupervised Model Adaptation

Model-based reinforcement learning methods learn a dynamics model with real data sampled from the environment and leverage it to generate simulated data to derive an agent. However, due to the potential distribution mismatch between…

Machine Learning · Computer Science 2020-10-29 Jian Shen , Han Zhao , Weinan Zhang , Yong Yu

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDP) with unknown transition probabilities over continuous state and action spaces. Linear temporal logic (LTL) is used to…

Machine Learning · Computer Science 2022-01-25 Mingyu Cai , Mohammadhosein Hasanbeig , Shaoping Xiao , Alessandro Abate , Zhen Kan

Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

Accurately predicting the dynamics of robotic systems is crucial for model-based control and reinforcement learning. The most common way to estimate dynamics is by fitting a one-step ahead prediction model and using it to recursively…

Machine Learning · Computer Science 2021-09-02 Nathan O. Lambert , Albert Wilcox , Howard Zhang , Kristofer S. J. Pister , Roberto Calandra

Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

In this paper, we present a robotic model-based reinforcement learning method that combines ideas from model identification and model predictive control. We use a feature-based representation of the dynamics that allows the dynamics model…

Machine Learning · Computer Science 2016-03-16 Christopher Xie , Sachin Patil , Teodor Moldovan , Sergey Levine , Pieter Abbeel

DiAReL: Reinforcement Learning with Disturbance Awareness for Robust Sim2Real Policy Transfer in Robot Control

Delayed Markov decision processes (DMDPs) fulfill the Markov property by augmenting the state space of agents with a finite time window of recently committed actions. In reliance on these state augmentations, delay-resolved reinforcement…

Robotics · Computer Science 2025-11-17 Mohammadhossein Malmir , Josip Josifovski , Noah Klarmann , Alois Knoll

Planning with Abstract Learned Models While Learning Transferable Subtasks

We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with…

Machine Learning · Computer Science 2020-06-15 John Winder , Stephanie Milani , Matthew Landen , Erebus Oh , Shane Parr , Shawn Squire , Marie desJardins , Cynthia Matuszek

Continuous Motion Planning with Temporal Logic Specifications using Deep Neural Networks

In this paper, we propose a model-free reinforcement learning method to synthesize control policies for motion planning problems with continuous states and actions. The robot is modelled as a labeled discrete-time Markov decision process…

Artificial Intelligence · Computer Science 2020-10-01 Chuanzheng Wang , Yinan Li , Stephen L. Smith , Jun Liu

Linear Dynamics meets Linear MDPs: Closed-Form Optimal Policies via Reinforcement Learning

Many applications -- including power systems, robotics, and economics -- involve a dynamical system interacting with a stochastic and hard-to-model environment. We adopt a reinforcement learning approach to control such systems.…

Optimization and Control · Mathematics 2025-08-26 Abed AlRahman Al Makdah , Oliver Kosut , Lalitha Sankar , Shaofeng Zou

Adaptive Reinforcement Learning for Unobservable Random Delays

In standard Reinforcement Learning (RL) settings, the interaction between the agent and the environment is typically modeled as a Markov Decision Process (MDP), which assumes that the agent observes the system state instantaneously, selects…

Machine Learning · Computer Science 2025-06-18 John Wikman , Alexandre Proutiere , David Broman

Reinforcement Learning with Partially Known World Dynamics

Reinforcement learning would enjoy better success on real-world problems if domain knowledge could be imparted to the algorithm by the modelers. Most problems have both hidden state and unknown dynamics. Partially observable Markov decision…

Machine Learning · Computer Science 2013-01-07 Christian R. Shelton

Model-based Reinforcement Learning: A Survey

Sequential decision making, commonly formalized as Markov Decision Process (MDP) optimization, is a important challenge in artificial intelligence. Two key approaches to this problem are reinforcement learning (RL) and planning. This paper…

Machine Learning · Computer Science 2022-04-01 Thomas M. Moerland , Joost Broekens , Aske Plaat , Catholijn M. Jonker

Action Pick-up in Dynamic Action Space Reinforcement Learning

Most reinforcement learning algorithms are based on a key assumption that Markov decision processes (MDPs) are stationary. However, non-stationary MDPs with dynamic action space are omnipresent in real-world scenarios. Yet problems of…

Machine Learning · Computer Science 2023-04-04 Jiaqi Ye , Xiaodong Li , Pangjing Wu , Feng Wang

Reinforcement Learning with History-Dependent Dynamic Contexts

We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a novel reinforcement learning framework for history-dependent environments that generalizes the contextual MDP framework to handle non-Markov environments, where contexts…

Machine Learning · Computer Science 2023-05-19 Guy Tennenholtz , Nadav Merlis , Lior Shani , Martin Mladenov , Craig Boutilier