Related papers: Estimation and Control Using Sampling-Based Bayesi…

Simultaneous active parameter estimation and control using sampling-based Bayesian reinforcement learning

Robots performing manipulation tasks must operate under uncertainty about both their pose and the dynamics of the system. In order to remain robust to modeling error and shifts in payload dynamics, agents must simultaneously perform…

Systems and Control · Computer Science 2017-07-31 Patrick Slade , Preston Culbertson , Zachary Sunberg , Mykel Kochenderfer

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal…

Machine Learning · Computer Science 2015-03-20 Arthur Guez , David Silver , Peter Dayan

Bayesian Optimized Monte Carlo Planning

Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces. Monte Carlo tree search with progressive widening attempts to improve scaling by sampling from the action space…

Artificial Intelligence · Computer Science 2021-11-04 John Mern , Anil Yildiz , Zachary Sunberg , Tapan Mukerji , Mykel J. Kochenderfer

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

Offline reinforcement learning (RL) is a powerful approach for data-driven decision-making and control. Compared to model-free methods, offline model-based reinforcement learning (MBRL) explicitly learns world models from a static dataset…

Machine Learning · Computer Science 2026-01-28 Jiayu Chen , Le Xu , Wentse Chen , Jeff Schneider

Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search

Bayes-optimal behavior, while well-defined, is often difficult to achieve. Recent advances in the use of Monte-Carlo tree search (MCTS) have shown that it is possible to act near-optimally in Markov Decision Processes (MDPs) with very large…

Artificial Intelligence · Computer Science 2012-02-20 John Asmuth , Michael L. Littman

Maneuver Decision-Making Through Proximal Policy Optimization And Monte Carlo Tree Search

Maneuver decision-making can be regarded as a Markov decision process and can be address by reinforcement learning. However, original reinforcement learning algorithms can hardly solve the maneuvering decision-making problem. One reason is…

Artificial Intelligence · Computer Science 2023-09-19 Zhang Hong-Peng

A Bayesian Approach to Online Planning

The combination of Monte Carlo tree search and neural networks has revolutionized online planning. As neural network approximations are often imperfect, we ask whether uncertainty estimates about the network outputs could be used to improve…

Artificial Intelligence · Computer Science 2024-06-05 Nir Greshler , David Ben Eli , Carmel Rabinovitz , Gabi Guetta , Liran Gispan , Guy Zohar , Aviv Tamar

Bayesian Policy Optimization for Model Uncertainty

Addressing uncertainty is critical for autonomous systems to robustly adapt to the real world. We formulate the problem of model uncertainty as a continuous Bayes-Adaptive Markov Decision Process (BAMDP), where an agent maintains a…

Robotics · Computer Science 2019-05-09 Gilwoo Lee , Brian Hou , Aditya Mandalika , Jeongseok Lee , Sanjiban Choudhury , Siddhartha S. Srinivasa

Nonparametric Adaptive Bayesian Stochastic Control Under Model Uncertainty

In this paper we propose a new methodology for solving a discrete time stochastic Markovian control problem under model uncertainty. By utilizing the Dirichlet process, we model the unknown distribution of the underlying stochastic process…

Optimization and Control · Mathematics 2022-03-29 Tao Chen , Jiyoun Myung

Sequential Bayesian Optimisation as a POMDP for Environment Monitoring with UAVs

Bayesian Optimisation has gained much popularity lately, as a global optimisation technique for functions that are expensive to evaluate or unknown a priori. While classical BO focuses on where to gather an observation next, it does not…

Robotics · Computer Science 2017-03-14 Philippe Morere , Roman Marchant , Fabio Ramos

Learning in POMDPs with Monte Carlo Tree Search

The POMDP is a powerful framework for reasoning under outcome and information uncertainty, but constructing an accurate POMDP model is difficult. Bayes-Adaptive Partially Observable Markov Decision Processes (BA-POMDPs) extend POMDPs to…

Artificial Intelligence · Computer Science 2018-06-15 Sammie Katt , Frans A. Oliehoek , Christopher Amato

Learning-Based Optimal Control with Performance Guarantees for Unknown Systems with Latent States

As control engineering methods are applied to increasingly complex systems, data-driven approaches for system identification appear as a promising alternative to physics-based modeling. While the Bayesian approaches prevalent for…

Systems and Control · Electrical Eng. & Systems 2024-08-07 Robert Lefringhausen , Supitsana Srithasan , Armin Lederer , Sandra Hirche

Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise

Controllers for autonomous systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modelled as process noise, and common assumptions are that the underlying distributions are…

Systems and Control · Electrical Eng. & Systems 2022-12-08 Thom S. Badings , Alessandro Abate , Nils Jansen , David Parker , Hasan A. Poonawala , Marielle Stoelinga

Cooperative Trajectory Planning in Uncertain Environments with Monte Carlo Tree Search and Risk Metrics

Automated vehicles require the ability to cooperate with humans for smooth integration into today's traffic. While the concept of cooperation is well known, developing a robust and efficient cooperative trajectory planning method is still a…

Multiagent Systems · Computer Science 2022-11-15 Philipp Stegmaier , Karl Kurzer , J. Marius Zöllner

Bayesian Optimization with Adaptive Kernels for Robot Control

Active policy search combines the trial-and-error methodology from policy search with Bayesian optimization to actively find the optimal policy. First, policy search is a type of reinforcement learning which has become very popular for…

Robotics · Computer Science 2024-02-13 Ruben Martinez-Cantin

Speeding Up Path Planning via Reinforcement Learning in MCTS for Automated Parking

In this paper, we address a method that integrates reinforcement learning into the Monte Carlo tree search to boost online path planning under fully observable environments for automated parking tasks. Sampling-based planning methods under…

Artificial Intelligence · Computer Science 2025-01-03 Xinlong Zheng , Xiaozhou Zhang , Donghao Xu

Multi-Objective and Model-Predictive Tree Search for Spatiotemporal Informative Planning

Adaptive sampling and planning in robotic environmental monitoring are challenging when the target environmental process varies over space and time. The underlying environmental dynamics require the planning module to integrate future…

Robotics · Computer Science 2023-06-19 Weizhe Chen , Lantao Liu

Bayesian learning of the optimal action-value function in a Markov decision process

The Markov Decision Process (MDP) is a popular framework for sequential decision-making problems, and uncertainty quantification is an essential component of it to learn optimal decision-making strategies. In particular, a Bayesian…

Machine Learning · Statistics 2025-05-06 Jiaqi Guo , Chon Wai Ho , Sumeetpal S. Singh

Inferring the Optimal Policy using Markov Chain Monte Carlo

This paper investigates methods for estimating the optimal stochastic control policy for a Markov Decision Process with unknown transition dynamics and an unknown reward function. This form of model-free reinforcement learning comprises…

Machine Learning · Computer Science 2019-12-06 Brandon Trabucco , Albert Qu , Simon Li , Ganeshkumar Ashokavardhanan

Risk-averse Behavior Planning for Autonomous Driving under Uncertainty

Autonomous vehicles have to navigate the surrounding environment with partial observability of other objects sharing the road. Sources of uncertainty in autonomous vehicle measurements include sensor fusion errors, limited sensor range due…

Artificial Intelligence · Computer Science 2018-12-05 Mohammad Naghshvar , Ahmed K. Sadek , Auke J. Wiggers