Related papers: Dyna Planning using a Feature Based Generative Mod…

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually simple approach to the decision-making and control problem: use learning for the purpose of estimating an approximate dynamics model, and offload the rest of the work to…

Machine Learning · Computer Science 2023-07-13 Michael Janner

Dynamic Planning Networks

We introduce Dynamic Planning Networks (DPN), a novel architecture for deep reinforcement learning, that combines model-based and model-free aspects for online planning. Our architecture learns to dynamically construct plans using a learned…

Machine Learning · Computer Science 2019-02-05 Norman Tasfi , Miriam Capretz

Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems have been solved in tasks such as game playing and robotics. Unfortunately, the sample complexity of most…

Machine Learning · Computer Science 2020-12-03 Aske Plaat , Walter Kosters , Mike Preuss

Improved Classification Based on Deep Belief Networks

For better classification generative models are used to initialize the model and model features before training a classifier. Typically it is needed to solve separate unsupervised and supervised learning problems. Generative restricted…

Machine Learning · Statistics 2019-08-13 Jaehoon Koo , Diego Klabjan

A New View on Planning in Online Reinforcement Learning

This paper investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with…

Machine Learning · Computer Science 2024-06-04 Kevin Roice , Parham Mohammad Panahi , Scott M. Jordan , Adam White , Martha White

Shaping Belief States with Generative Environment Models for RL

When agents interact with a complex environment, they must form and maintain beliefs about the relevant aspects of that environment. We propose a way to efficiently train expressive generative models in complex environments. We show that a…

Machine Learning · Computer Science 2019-06-25 Karol Gregor , Danilo Jimenez Rezende , Frederic Besse , Yan Wu , Hamza Merzic , Aaron van den Oord

Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees

In this work we present a preliminary investigation of a novel algorithm called Dyna-T. In reinforcement learning (RL) a planning agent has its own representation of the environment as a model. To discover an optimal policy to interact with…

Machine Learning · Computer Science 2022-01-20 Tarek Faycal , Claudio Zito

Learning Dynamics Models for Model Predictive Agents

Model-Based Reinforcement Learning involves learning a \textit{dynamics model} from data, and then using this model to optimise behaviour, most often with an online \textit{planner}. Much of the recent research along these lines presents a…

Machine Learning · Computer Science 2021-09-30 Michael Lutter , Leonard Hasenclever , Arunkumar Byravan , Gabriel Dulac-Arnold , Piotr Trochim , Nicolas Heess , Josh Merel , Yuval Tassa

The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces

Dyna is a fundamental approach to model-based reinforcement learning (MBRL) that interleaves planning, acting, and learning in an online setting. In the most typical application of Dyna, the dynamics model is used to generate one-step…

Artificial Intelligence · Computer Science 2019-04-01 G. Zacharias Holland , Erin J. Talvitie , Michael Bowling

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available after each interaction with the world. This paper develops an…

Artificial Intelligence · Computer Science 2012-06-18 Richard S. Sutton , Csaba Szepesvari , Alborz Geramifard , Michael P. Bowling

Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

We study how a Reinforcement Learning (RL) system can remain sample-efficient when learning from an imperfect model of the environment. This is particularly challenging when the learning system is resource-constrained and in continual…

Machine Learning · Computer Science 2024-07-01 Bradley Burega , John D. Martin , Luke Kapeluck , Michael Bowling

Deep Belief Network Training Improvement Using Elite Samples Minimizing Free Energy

Nowadays this is very popular to use deep architectures in machine learning. Deep Belief Networks (DBNs) are deep architectures that use stack of Restricted Boltzmann Machines (RBM) to create a powerful generative model using training data.…

Machine Learning · Computer Science 2015-08-21 Mohammad Ali Keyvanrad , Mohammad Mehdi Homayounpour

High-Accuracy Model-Based Reinforcement Learning, a Survey

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems from game playing and robotics have been solved with deep model-free methods. Unfortunately, the sample…

Machine Learning · Computer Science 2021-07-20 Aske Plaat , Walter Kosters , Mike Preuss

Should Models Be Accurate?

Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in…

Machine Learning · Computer Science 2022-05-24 Esra'a Saleh , John D. Martin , Anna Koop , Arash Pourzarabi , Michael Bowling

Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models

The potential benefits of model-free reinforcement learning to real robotics systems are limited by its uninformed exploration that leads to slow convergence, lack of data-efficiency, and unnecessary interactions with the environment. To…

Robotics · Computer Science 2020-11-04 Yuchen Wu , Melissa Mozifian , Florian Shkurti

Delta Schema Network in Model-based Reinforcement Learning

This work is devoted to unresolved problems of Artificial General Intelligence - the inefficiency of transfer learning. One of the mechanisms that are used to solve this problem in the area of reinforcement learning is a model-based…

Machine Learning · Computer Science 2020-07-09 Andrey Gorodetskiy , Alexandra Shlychkova , Aleksandr I. Panov

Learning and Inferring Movement with Deep Generative Model

Learning and inference movement is a very challenging problem due to its high dimensionality and dependency to varied environments or tasks. In this paper, we propose an effective probabilistic method for learning and inference of basic…

Machine Learning · Computer Science 2018-10-30 Mingxuan Jing , Xiaojian Ma , Fuchun Sun , Huaping Liu

Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a…

Machine Learning · Computer Science 2022-09-13 Anthony Dowling

Planning with Diffusion for Flexible Behavior Synthesis

Model-based reinforcement learning methods often use learning only for the purpose of estimating an approximate dynamics model, offloading the rest of the decision-making work to classical trajectory optimizers. While conceptually simple,…

Machine Learning · Computer Science 2022-12-22 Michael Janner , Yilun Du , Joshua B. Tenenbaum , Sergey Levine

Frequency-based Search-control in Dyna

Model-based reinforcement learning has been empirically demonstrated as a successful strategy to improve sample efficiency. In particular, Dyna is an elegant model-based architecture integrating learning and planning that provides huge…

Machine Learning · Computer Science 2020-02-17 Yangchen Pan , Jincheng Mei , Amir-massoud Farahmand