Related papers: Bayesian Reinforcement Learning via Deep, Sparse S…

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal…

Machine Learning · Computer Science 2015-03-20 Arthur Guez, David Silver, Peter Dayan

Cover Tree Bayesian Reinforcement Learning

This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian piecewise-linear models, which can be…

Machine Learning · Statistics 2014-05-05 Nikolaos Tziortziotis, Christos Dimitrakakis, Konstantinos Blekas

Model-Based Bayesian Reinforcement Learning in Large Structured Domains

Model-based Bayesian reinforcement learning has generated significant interest in the AI community as it provides an elegant solution to the optimal exploration-exploitation tradeoff in classical reinforcement learning. Unfortunately, the…

Artificial Intelligence · Computer Science 2012-06-18 Stephane Ross, Joelle Pineau

Local policy search with Bayesian optimization

Reinforcement learning (RL) aims to find an optimal policy by interaction with an environment. Consequently, learning complex behavior requires a vast number of samples, which can be prohibitive in practice. Nevertheless, instead of…

Machine Learning · Computer Science 2021-11-23 Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Tree Exploration for Bayesian RL Exploration

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, where optimality improves with increased computational time.…

Machine Learning · Statistics 2011-09-22 Christos Dimitrakakis

A Greedy Approximation of Bayesian Reinforcement Learning with Probably Optimistic Transition Model

Bayesian Reinforcement Learning (RL) is capable of not only incorporating domain knowledge, but also solving the exploration-exploitation dilemma in a natural way. As Bayesian RL is intractable except for special cases, previous work has…

Artificial Intelligence · Computer Science 2013-06-14 Kenji Kawaguchi, Mauricio Araya

Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm

Learning Bayesian networks is often cast as an optimization problem, where the computational task is to find a structure that maximizes a statistically motivated score. By and large, existing learning tools address this optimization problem…

Machine Learning · Computer Science 2013-01-30 Nir Friedman, Iftach Nachman, Dana Pe'er

A Bayesian Sampling Approach to Exploration in Reinforcement Learning

We present a modular approach to reinforcement learning that uses a Bayesian representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set), drives exploration by sampling multiple models from the posterior and…

Machine Learning · Computer Science 2012-05-14 John Asmuth, Lihong Li, Michael L. Littman, Ali Nouri, David Wingate

Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

Sparse deep learning aims to address the challenge of huge storage consumption by deep neural networks, and to recover the sparse structure of target functions. Although tremendous empirical successes have been achieved, most sparse deep…

Machine Learning · Statistics 2020-11-17 Jincheng Bai, Qifan Song, Guang Cheng

Bayesian Curiosity for Efficient Exploration in Reinforcement Learning

Balancing exploration and exploitation is a fundamental part of reinforcement learning, yet most state-of-the-art algorithms use a naive exploration protocol like $\epsilon$-greedy. This contributes to the problem of high sample complexity,…

Machine Learning · Computer Science 2019-11-21 Tom Blau, Lionel Ott, Fabio Ramos

On Efficient Bayesian Exploration in Model-Based Reinforcement Learning

In this work, we address the challenge of data-efficient exploration in reinforcement learning by examining existing principled, information-theoretic approaches to intrinsic motivation. Specifically, we focus on a class of exploration…

Machine Learning · Computer Science 2025-07-04 Alberto Caron, Chris Hicks, Vasilios Mavroudis

Better Optimism By Bayes: Adaptive Planning with Rich Models

The computational costs of inference and planning have confined Bayesian model-based reinforcement learning to one of two dismal fates: powerful Bayes-adaptive planning but only for simplistic models, or powerful, Bayesian non-parametric…

Artificial Intelligence · Computer Science 2014-02-11 Arthur Guez, David Silver, Peter Dayan

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning

Agents that interact with other agents often do not know a priori what the other agents' strategies are, but have to maximise their own online return while interacting with and learning about others. The optimal adaptive behaviour under…

Machine Learning · Computer Science 2022-04-19 Luisa Zintgraf, Sam Devlin, Kamil Ciosek, Shimon Whiteson, Katja Hofmann

Dynamic Subgoal-based Exploration via Bayesian Optimization

Reinforcement learning in sparse-reward navigation environments with expensive and limited interactions is challenging and poses a need for effective exploration. Motivated by complex navigation tasks that require real-world training (when…

Optimization and Control · Mathematics 2023-10-13 Yijia Wang, Matthias Poloczek, Daniel R. Jiang

A Bayesian Approach to Online Planning

The combination of Monte Carlo tree search and neural networks has revolutionized online planning. As neural network approximations are often imperfect, we ask whether uncertainty estimates about the network outputs could be used to improve…

Artificial Intelligence · Computer Science 2024-06-05 Nir Greshler, David Ben Eli, Carmel Rabinovitz, Gabi Guetta, Liran Gispan, Guy Zohar, Aviv Tamar

Bayesian Design Principles for Offline-to-Online Reinforcement Learning

Offline reinforcement learning (RL) is crucial for real-world applications where exploration can be costly or unsafe. However, offline learned policies are often suboptimal, and further online fine-tuning is required. In this paper, we…

Machine Learning · Computer Science 2024-06-03 Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang

Program-Based Strategy Induction for Reinforcement Learning

Typical models of learning assume incremental estimation of continuously-varying decision variables like expected rewards. However, this class of models fails to capture more idiosyncratic, discrete heuristics and strategies that people and…

Machine Learning · Computer Science 2024-02-27 Carlos G. Correa, Thomas L. Griffiths, Nathaniel D. Daw

Bayesian Batch Active Learning as Sparse Subset Approximation

Leveraging the wealth of unlabeled data produced in recent years provides great potential for improving supervised models. When the cost of acquiring labels is high, probabilistic active learning methods can be used to greedily select the…

Machine Learning · Statistics 2021-02-09 Robert Pinsler, Jonathan Gordon, Eric Nalisnick, José Miguel Hernández-Lobato

Adaptive, Robust and Scalable Bayesian Filtering for Online Learning

In this thesis, we introduce Bayesian filtering as a principled framework for tackling diverse sequential machine learning problems, including online (continual) learning, prequential (one-step-ahead) forecasting, and contextual bandits. To…

Machine Learning · Statistics 2025-05-13 Gerardo Duran-Martin

Bayesian Optimization for Policy Search via Online-Offline Experimentation

Online field experiments are the gold-standard way of evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces - such as those found in recommendation and…

Machine Learning · Statistics 2019-04-30 Benjamin Letham, Eytan Bakshy