Related papers: Analytically Tractable Bayesian Deep Q-Learning

Analytically Tractable Inference in Deep Neural Networks

Since its inception, deep learning has been overwhelmingly reliant on backpropagation and gradient-based optimization algorithms in order to learn weight and bias parameter values. Tractable Approximate Gaussian Inference (TAGI) algorithm…

Machine Learning · Computer Science 2021-03-10 Luong-Ha Nguyen , James-A. Goulet

Analytically Tractable Hidden-States Inference in Bayesian Neural Networks

With few exceptions, neural networks have been relying on backpropagation and gradient descent as the inference engine in order to learn the model parameters, because the closed-form Bayesian inference for neural networks has been…

Machine Learning · Computer Science 2021-07-09 Luong-Ha Nguyen , James-A. Goulet

Tractable Approximate Gaussian Inference for Bayesian Neural Networks

In this paper, we propose an analytical method for performing tractable approximate Gaussian inference (TAGI) in Bayesian neural networks. The method enables the analytical Gaussian inference of the posterior mean vector and diagonal…

Machine Learning · Computer Science 2021-12-08 James-A. Goulet , Luong Ha Nguyen , Saeid Amiri

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

Deep reinforcement learning includes a broad family of algorithms that parameterise an internal representation, such as a value function or policy, by a deep neural network. Each algorithm optimises its parameters with respect to an…

Machine Learning · Computer Science 2020-07-17 Zhongwen Xu , Hado van Hasselt , Matteo Hessel , Junhyuk Oh , Satinder Singh , David Silver

Target-Aligned Reinforcement Learning

Many value-based deep reinforcement learning algorithms rely on target networks - lagged copies of the online network - to stabilize training. While effective, this mechanism introduces a fundamental stability-recency tradeoff: slower…

Machine Learning · Computer Science 2026-05-20 Leonard S. Pleiss , James Harrison , Maximilian Schiffer

Deep Reinforcement Learning With Macro-Actions

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation…

Machine Learning · Computer Science 2016-06-16 Ishan P. Durugkar , Clemens Rosenbaum , Stefan Dernbach , Sridhar Mahadevan

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on…

Neural and Evolutionary Computing · Computer Science 2018-04-24 Felipe Petroski Such , Vashisht Madhavan , Edoardo Conti , Joel Lehman , Kenneth O. Stanley , Jeff Clune

GAN Q-learning

Distributional reinforcement learning (distributional RL) has seen empirical success in complex Markov Decision Processes (MDPs) in the setting of nonlinear function approximation. However, there are many different ways in which one can…

Machine Learning · Statistics 2018-07-24 Thang Doan , Bogdan Mazoure , Clare Lyle

A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games

Reinforcement learning is concerned with identifying reward-maximizing behaviour policies in environments that are initially unknown. State-of-the-art reinforcement learning approaches, such as deep Q-networks, are model-free and learn to…

Artificial Intelligence · Computer Science 2017-08-18 Felix Leibfried , Nate Kushman , Katja Hofmann

Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger

Recent developments have established the vulnerability of deep Reinforcement Learning (RL) to policy manipulation attacks via adversarial perturbations. In this paper, we investigate the robustness and resilience of deep RL to training-time…

Artificial Intelligence · Computer Science 2017-12-29 Vahid Behzadan , Arslan Munir

Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari

Reinforcement learning (RL) has seen great advancements in the past few years. Nevertheless, the consensus among the RL community is that currently used methods, despite all their benefits, suffer from extreme data inefficiency, especially…

Machine Learning · Computer Science 2020-04-01 Kacper Kielak

Accelerated Methods for Deep Reinforcement Learning

Deep reinforcement learning (RL) has achieved many recent successes, yet experiment turn-around time remains a key bottleneck in research and in practice. We investigate how to optimize existing deep RL algorithms for modern computers,…

Machine Learning · Computer Science 2019-01-14 Adam Stooke , Pieter Abbeel

Retrieval-Augmented Reinforcement Learning

Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive,…

Machine Learning · Computer Science 2022-05-25 Anirudh Goyal , Abram L. Friesen , Andrea Banino , Theophane Weber , Nan Rosemary Ke , Adria Puigdomenech Badia , Arthur Guez , Mehdi Mirza , Peter C. Humphreys , Ksenia Konyushkova , Laurent Sifre , Michal Valko , Simon Osindero , Timothy Lillicrap , Nicolas Heess , Charles Blundell

Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes

The potential of offline reinforcement learning (RL) is that high-capacity models trained on large, heterogeneous datasets can lead to agents that generalize broadly, analogously to similar advances in vision and NLP. However, recent works…

Machine Learning · Computer Science 2023-04-19 Aviral Kumar , Rishabh Agarwal , Xinyang Geng , George Tucker , Sergey Levine

An Information-Theoretic Optimality Principle for Deep Reinforcement Learning

We methodologically address the problem of Q-value overestimation in deep reinforcement learning to handle high-dimensional state spaces efficiently. By adapting concepts from information theory, we introduce an intrinsic penalty signal…

Artificial Intelligence · Computer Science 2018-11-21 Felix Leibfried , Jordi Grau-Moya , Haitham Bou-Ammar

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies to optimize agent operation over time. RL algorithms have shown promising results in solving complicated problems in recent years. However, their application on…

Machine Learning · Computer Science 2021-09-29 Hamed Khorasgani , Haiyan Wang , Chetan Gupta , Susumu Serita

Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning

In order perform a large variety of tasks and to achieve human-level performance in complex real-world environments, Artificial Intelligence (AI) Agents must be able to learn from their past experiences and gain both knowledge and an…

Machine Learning · Computer Science 2019-05-13 Andrei Claudiu Roibu

Explainable Deep Reinforcement Learning Using Introspection in a Non-episodic Task

Explainable reinforcement learning allows artificial agents to explain their behavior in a human-like manner aiming at non-expert end-users. An efficient alternative of creating explanations is to use an introspection-based method that…

Machine Learning · Computer Science 2021-08-23 Angel Ayala , Francisco Cruz , Bruno Fernandes , Richard Dazeley

GRAC: Self-Guided and Self-Regularized Actor-Critic

Deep reinforcement learning (DRL) algorithms have successfully been demonstrated on a range of challenging decision making and control tasks. One dominant component of recent deep reinforcement learning algorithms is the target network…

Machine Learning · Computer Science 2020-11-12 Lin Shao , Yifan You , Mengyuan Yan , Qingyun Sun , Jeannette Bohg

Transformer Based Reinforcement Learning For Games

Recent times have witnessed sharp improvements in reinforcement learning tasks using deep reinforcement learning techniques like Deep Q Networks, Policy Gradients, Actor Critic methods which are based on deep learning based models and…

Machine Learning · Computer Science 2019-12-10 Uddeshya Upadhyay , Nikunj Shah , Sucheta Ravikanti , Mayanka Medhe