English
Related papers

Related papers: Estimation and Control Using Sampling-Based Bayesi…

200 papers

Robots performing manipulation tasks must operate under uncertainty about both their pose and the dynamics of the system. In order to remain robust to modeling error and shifts in payload dynamics, agents must simultaneously perform…

Systems and Control · Computer Science 2017-07-31 Patrick Slade , Preston Culbertson , Zachary Sunberg , Mykel Kochenderfer

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal…

Machine Learning · Computer Science 2015-03-20 Arthur Guez , David Silver , Peter Dayan

Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces. Monte Carlo tree search with progressive widening attempts to improve scaling by sampling from the action space…

Artificial Intelligence · Computer Science 2021-11-04 John Mern , Anil Yildiz , Zachary Sunberg , Tapan Mukerji , Mykel J. Kochenderfer

Offline reinforcement learning (RL) is a powerful approach for data-driven decision-making and control. Compared to model-free methods, offline model-based reinforcement learning (MBRL) explicitly learns world models from a static dataset…

Machine Learning · Computer Science 2026-01-28 Jiayu Chen , Le Xu , Wentse Chen , Jeff Schneider

Bayes-optimal behavior, while well-defined, is often difficult to achieve. Recent advances in the use of Monte-Carlo tree search (MCTS) have shown that it is possible to act near-optimally in Markov Decision Processes (MDPs) with very large…

Artificial Intelligence · Computer Science 2012-02-20 John Asmuth , Michael L. Littman

Maneuver decision-making can be regarded as a Markov decision process and can be address by reinforcement learning. However, original reinforcement learning algorithms can hardly solve the maneuvering decision-making problem. One reason is…

Artificial Intelligence · Computer Science 2023-09-19 Zhang Hong-Peng

The combination of Monte Carlo tree search and neural networks has revolutionized online planning. As neural network approximations are often imperfect, we ask whether uncertainty estimates about the network outputs could be used to improve…

Artificial Intelligence · Computer Science 2024-06-05 Nir Greshler , David Ben Eli , Carmel Rabinovitz , Gabi Guetta , Liran Gispan , Guy Zohar , Aviv Tamar

Addressing uncertainty is critical for autonomous systems to robustly adapt to the real world. We formulate the problem of model uncertainty as a continuous Bayes-Adaptive Markov Decision Process (BAMDP), where an agent maintains a…

In this paper we propose a new methodology for solving a discrete time stochastic Markovian control problem under model uncertainty. By utilizing the Dirichlet process, we model the unknown distribution of the underlying stochastic process…

Optimization and Control · Mathematics 2022-03-29 Tao Chen , Jiyoun Myung

Bayesian Optimisation has gained much popularity lately, as a global optimisation technique for functions that are expensive to evaluate or unknown a priori. While classical BO focuses on where to gather an observation next, it does not…

Robotics · Computer Science 2017-03-14 Philippe Morere , Roman Marchant , Fabio Ramos

The POMDP is a powerful framework for reasoning under outcome and information uncertainty, but constructing an accurate POMDP model is difficult. Bayes-Adaptive Partially Observable Markov Decision Processes (BA-POMDPs) extend POMDPs to…

Artificial Intelligence · Computer Science 2018-06-15 Sammie Katt , Frans A. Oliehoek , Christopher Amato

As control engineering methods are applied to increasingly complex systems, data-driven approaches for system identification appear as a promising alternative to physics-based modeling. While the Bayesian approaches prevalent for…

Systems and Control · Electrical Eng. & Systems 2024-08-07 Robert Lefringhausen , Supitsana Srithasan , Armin Lederer , Sandra Hirche

Controllers for autonomous systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modelled as process noise, and common assumptions are that the underlying distributions are…

Systems and Control · Electrical Eng. & Systems 2022-12-08 Thom S. Badings , Alessandro Abate , Nils Jansen , David Parker , Hasan A. Poonawala , Marielle Stoelinga

Automated vehicles require the ability to cooperate with humans for smooth integration into today's traffic. While the concept of cooperation is well known, developing a robust and efficient cooperative trajectory planning method is still a…

Multiagent Systems · Computer Science 2022-11-15 Philipp Stegmaier , Karl Kurzer , J. Marius Zöllner

Active policy search combines the trial-and-error methodology from policy search with Bayesian optimization to actively find the optimal policy. First, policy search is a type of reinforcement learning which has become very popular for…

Robotics · Computer Science 2024-02-13 Ruben Martinez-Cantin

In this paper, we address a method that integrates reinforcement learning into the Monte Carlo tree search to boost online path planning under fully observable environments for automated parking tasks. Sampling-based planning methods under…

Artificial Intelligence · Computer Science 2025-01-03 Xinlong Zheng , Xiaozhou Zhang , Donghao Xu

Adaptive sampling and planning in robotic environmental monitoring are challenging when the target environmental process varies over space and time. The underlying environmental dynamics require the planning module to integrate future…

Robotics · Computer Science 2023-06-19 Weizhe Chen , Lantao Liu

The Markov Decision Process (MDP) is a popular framework for sequential decision-making problems, and uncertainty quantification is an essential component of it to learn optimal decision-making strategies. In particular, a Bayesian…

Machine Learning · Statistics 2025-05-06 Jiaqi Guo , Chon Wai Ho , Sumeetpal S. Singh

This paper investigates methods for estimating the optimal stochastic control policy for a Markov Decision Process with unknown transition dynamics and an unknown reward function. This form of model-free reinforcement learning comprises…

Machine Learning · Computer Science 2019-12-06 Brandon Trabucco , Albert Qu , Simon Li , Ganeshkumar Ashokavardhanan

Autonomous vehicles have to navigate the surrounding environment with partial observability of other objects sharing the road. Sources of uncertainty in autonomous vehicle measurements include sensor fusion errors, limited sensor range due…

Artificial Intelligence · Computer Science 2018-12-05 Mohammad Naghshvar , Ahmed K. Sadek , Auke J. Wiggers
‹ Prev 1 2 3 10 Next ›