
Related papers: Bayesian Reinforcement Learning via Deep, Sparse S…

200 papers

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal…

Machine Learning · Computer Science 2015-03-20 Arthur Guez, David Silver, Peter Dayan

This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian piecewise-linear models, which can be…

Machine Learning · Statistics 2014-05-05 Nikolaos Tziortziotis, Christos Dimitrakakis, Konstantinos Blekas

Model-based Bayesian reinforcement learning has generated significant interest in the AI community as it provides an elegant solution to the optimal exploration-exploitation tradeoff in classical reinforcement learning. Unfortunately, the…

Artificial Intelligence · Computer Science 2012-06-18 Stephane Ross, Joelle Pineau

Reinforcement learning (RL) aims to find an optimal policy by interaction with an environment. Consequently, learning complex behavior requires a vast number of samples, which can be prohibitive in practice. Nevertheless, instead of…

Machine Learning · Computer Science 2021-11-23 Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, where optimality improves with increased computational time.…

Machine Learning · Statistics 2011-09-22 Christos Dimitrakakis

Bayesian Reinforcement Learning (RL) is capable of not only incorporating domain knowledge, but also solving the exploration-exploitation dilemma in a natural way. As Bayesian RL is intractable except for special cases, previous work has…

Artificial Intelligence · Computer Science 2013-06-14 Kenji Kawaguchi, Mauricio Araya

Learning Bayesian networks is often cast as an optimization problem, where the computational task is to find a structure that maximizes a statistically motivated score. By and large, existing learning tools address this optimization problem…

Machine Learning · Computer Science 2013-01-30 Nir Friedman, Iftach Nachman, Dana Pe'er

We present a modular approach to reinforcement learning that uses a Bayesian representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set), drives exploration by sampling multiple models from the posterior and…

Machine Learning · Computer Science 2012-05-14 John Asmuth, Lihong Li, Michael L. Littman, Ali Nouri, David Wingate

Sparse deep learning aims to address the challenge of huge storage consumption by deep neural networks, and to recover the sparse structure of target functions. Although tremendous empirical successes have been achieved, most sparse deep…

Machine Learning · Statistics 2020-11-17 Jincheng Bai, Qifan Song, Guang Cheng

Balancing exploration and exploitation is a fundamental part of reinforcement learning, yet most state-of-the-art algorithms use a naive exploration protocol like $\epsilon$-greedy. This contributes to the problem of high sample complexity,…

Machine Learning · Computer Science 2019-11-21 Tom Blau, Lionel Ott, Fabio Ramos

In this work, we address the challenge of data-efficient exploration in reinforcement learning by examining existing principled, information-theoretic approaches to intrinsic motivation. Specifically, we focus on a class of exploration…

Machine Learning · Computer Science 2025-07-04 Alberto Caron, Chris Hicks, Vasilios Mavroudis

The computational costs of inference and planning have confined Bayesian model-based reinforcement learning to one of two dismal fates: powerful Bayes-adaptive planning but only for simplistic models, or powerful, Bayesian non-parametric…

Artificial Intelligence · Computer Science 2014-02-11 Arthur Guez, David Silver, Peter Dayan

Agents that interact with other agents often do not know a priori what the other agents' strategies are, but have to maximise their own online return while interacting with and learning about others. The optimal adaptive behaviour under…

Machine Learning · Computer Science 2022-04-19 Luisa Zintgraf, Sam Devlin, Kamil Ciosek, Shimon Whiteson, Katja Hofmann

Reinforcement learning in sparse-reward navigation environments with expensive and limited interactions is challenging and poses a need for effective exploration. Motivated by complex navigation tasks that require real-world training (when…

Optimization and Control · Mathematics 2023-10-13 Yijia Wang, Matthias Poloczek, Daniel R. Jiang

The combination of Monte Carlo tree search and neural networks has revolutionized online planning. As neural network approximations are often imperfect, we ask whether uncertainty estimates about the network outputs could be used to improve…

Artificial Intelligence · Computer Science 2024-06-05 Nir Greshler, David Ben Eli, Carmel Rabinovitz, Gabi Guetta, Liran Gispan, Guy Zohar, Aviv Tamar

Offline reinforcement learning (RL) is crucial for real-world applications where exploration can be costly or unsafe. However, offline learned policies are often suboptimal, and further online fine-tuning is required. In this paper, we…

Machine Learning · Computer Science 2024-06-03 Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang

Typical models of learning assume incremental estimation of continuously-varying decision variables like expected rewards. However, this class of models fails to capture more idiosyncratic, discrete heuristics and strategies that people and…

Machine Learning · Computer Science 2024-02-27 Carlos G. Correa, Thomas L. Griffiths, Nathaniel D. Daw

Leveraging the wealth of unlabeled data produced in recent years provides great potential for improving supervised models. When the cost of acquiring labels is high, probabilistic active learning methods can be used to greedily select the…

Machine Learning · Statistics 2021-02-09 Robert Pinsler, Jonathan Gordon, Eric Nalisnick, José Miguel Hernández-Lobato

In this thesis, we introduce Bayesian filtering as a principled framework for tackling diverse sequential machine learning problems, including online (continual) learning, prequential (one-step-ahead) forecasting, and contextual bandits. To…

Machine Learning · Statistics 2025-05-13 Gerardo Duran-Martin

Online field experiments are the gold-standard way of evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces - such as those found in recommendation and…

Machine Learning · Statistics 2019-04-30 Benjamin Letham, Eytan Bakshy