Related papers: Optimizing Sequential Experimental Design with Dee…

Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning

We present a mathematical framework and computational methods to optimally design a finite number of sequential experiments. We formulate this sequential optimal experimental design (sOED) problem as a finite-horizon partially observable…

Machine Learning · Computer Science 2024-03-28 Wanggang Shen, Xun Huan

Sequential Bayesian Optimal Experimental Design in Infinite Dimensions via Policy Gradient Reinforcement Learning

Sequential Bayesian optimal experimental design (SBOED) for PDE-governed inverse problems is computationally challenging, especially for infinite-dimensional random field parameters. High-fidelity approaches require repeated forward and…

Optimization and Control · Mathematics 2026-01-12 Kaichen Shen, Peng Chen

Sequential Bayesian experimental designs via reinforcement learning

Bayesian experimental design (BED) has been used as a method for conducting efficient experiments based on Bayesian inference. The existing methods, however, mostly focus on maximizing the expected information gain (EIG); the cost of…

Machine Learning · Computer Science 2022-02-16 Hikaru Asano

Sequential Bayesian optimal experimental design via approximate dynamic programming

The design of multiple experiments is commonly undertaken via suboptimal strategies, such as batch (open-loop) design that omits feedback or greedy (myopic) design that does not account for future effects. This paper introduces new…

Methodology · Statistics 2016-04-29 Xun Huan, Youssef M. Marzouk

Receding Horizon Curiosity

Sample-efficient exploration is crucial not only for discovering rewarding experiences but also for adapting to environment changes in a task-agnostic fashion. A principled treatment of the problem of optimal input synthesis for system…

Machine Learning · Computer Science 2019-10-10 Matthias Schultheis, Boris Belousov, Hany Abdulsamad, Jan Peters

Reinforcement Learning based Sequential Batch-sampling for Bayesian Optimal Experimental Design

Engineering problems that are modeled using sophisticated mathematical methods, or that are characterized by expensive-to-conduct tests or experiments, are encumbered with limited budget or finite computational resources. Moreover, practical…

Machine Learning · Computer Science 2021-12-24 Yonatan Ashenafi, Piyush Pandita, Sayan Ghosh

Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring

With the growing needs of online A/B testing to support the innovation in industry, the opportunity cost of running an experiment becomes non-negligible. Therefore, there is an increasing demand for an efficient continuous monitoring…

Machine Learning · Computer Science 2023-04-04 Runzhe Wan, Yu Liu, James McQueen, Doug Hains, Rui Song

Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design

We introduce Deep Adaptive Design (DAD), a method for amortizing the cost of adaptive Bayesian experimental design that allows experiments to be run in real-time. Traditional sequential Bayesian optimal experimental design approaches…

Machine Learning · Statistics 2021-06-14 Adam Foster, Desi R. Ivanova, Ilyas Malik, Tom Rainforth

Scaling Up Robust MDPs by Reinforcement Learning

We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handle uncertainty, can be solved using dynamic…

Machine Learning · Computer Science 2013-06-27 Aviv Tamar, Huan Xu, Shie Mannor

Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design

Recent developments in sequential experimental design look to construct a policy that can efficiently navigate the design space, in a way that maximises the expected information gain. Whilst there is work on achieving tractable policies for…

Machine Learning · Computer Science 2025-08-20 Yasir Zubayr Barlas, Kizito Salako

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

Bayesian learning of the optimal action-value function in a Markov decision process

The Markov Decision Process (MDP) is a popular framework for sequential decision-making problems, and uncertainty quantification is an essential component of it to learn optimal decision-making strategies. In particular, a Bayesian…

Machine Learning · Statistics 2025-05-06 Jiaqi Guo, Chon Wai Ho, Sumeetpal S. Singh

Probabilistic Inference in Reinforcement Learning Done Right

A popular perspective in reinforcement learning (RL) casts the problem as probabilistic inference on a graphical model of the Markov decision process (MDP). The core object of study is the probability of each state-action pair being visited…

Machine Learning · Computer Science 2023-11-23 Jean Tarbouriech, Tor Lattimore, Brendan O'Donoghue

Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts

Informed and robust decision making in the face of uncertainty is critical for robots that perform physical tasks alongside people. We formulate this as Bayesian Reinforcement Learning over latent Markov Decision Processes (MDPs). While…

Robotics · Computer Science 2020-02-11 Gilwoo Lee, Brian Hou, Sanjiban Choudhury, Siddhartha S. Srinivasa

Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization

The ability to accelerate the design of biological sequences can have a substantial impact on the progress of the medical field. The problem can be framed as a global optimization problem where the objective is an expensive black-box…

Machine Learning · Computer Science 2022-09-15 Leo Feng, Padideh Nouri, Aneri Muni, Yoshua Bengio, Pierre-Luc Bacon

Safe Policies for Reinforcement Learning via Primal-Dual Methods

In this paper, we study the learning of safe policies in the setting of reinforcement learning problems. That is, we aim to control a Markov Decision Process (MDP) of which we do not know the transition probabilities, but we have access to…

Systems and Control · Electrical Eng. & Systems 2022-01-14 Santiago Paternain, Miguel Calvo-Fullana, Luiz F. O. Chamon, Alejandro Ribeiro

A reinforcement learning approach to hybrid control design

In this paper we design hybrid control policies for hybrid systems whose mathematical models are unknown. Our contributions are threefold. First, we propose a framework for modelling the hybrid control design problem as a single Markov…

Systems and Control · Electrical Eng. & Systems 2020-09-03 Meet Gandhi, Atreyee Kundu, Shalabh Bhatnagar

Efficient Solution Algorithms for Factored MDPs

This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This…

Artificial Intelligence · Computer Science 2011-06-10 C. Guestrin, D. Koller, R. Parr, S. Venkataraman

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

In the reinforcement learning literature, there are many algorithms developed for either Contextual Bandit (CB) or Markov Decision Processes (MDP) environments. However, when deploying reinforcement learning algorithms in the real world,…

Machine Learning · Computer Science 2022-08-02 Kelly W. Zhang, Omer Gottesman, Finale Doshi-Velez

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning

In the predict-then-optimize framework, the objective is to train a predictive model, mapping from environment features to parameters of an optimization problem, which maximizes decision quality when the optimization is subsequently solved.…

Machine Learning · Computer Science 2022-07-19 Kai Wang, Sanket Shah, Haipeng Chen, Andrew Perrault, Finale Doshi-Velez, Milind Tambe