Related papers: Frequency-based Search-control in Dyna

Hill Climbing on Value Estimates for Search-control in Dyna

Dyna is an architecture for model-based reinforcement learning (RL), where simulated experience from a model is used to update policies or value functions. A key component of Dyna is search-control, the mechanism to generate the state and…

Machine Learning · Computer Science 2019-07-05 Yangchen Pan , Hengshuai Yao , Amir-massoud Farahmand , Martha White

Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

We study how a Reinforcement Learning (RL) system can remain sample-efficient when learning from an imperfect model of the environment. This is particularly challenging when the learning system is resource-constrained and in continual…

Machine Learning · Computer Science 2024-07-01 Bradley Burega , John D. Martin , Luke Kapeluck , Michael Bowling

Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains

Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a planning paradigm that naturally interleaves learning and planning, by simulating one-step experience to update the action-value function. This…

Artificial Intelligence · Computer Science 2018-06-13 Yangchen Pan , Muhammad Zaheer , Adam White , Andrew Patterson , Martha White

Dyna Planning using a Feature Based Generative Model

Dyna-style reinforcement learning is a powerful approach for problems where not much real data is available. The main idea is to supplement real trajectories, or sequences of sampled states over time, with simulated ones sampled from a…

Machine Learning · Computer Science 2018-05-28 Ryan Faulkner , Doina Precup

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

Active Learning for Control-Oriented Identification of Nonlinear Systems

Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Bruce D. Lee , Ingvar Ziemann , George J. Pappas , Nikolai Matni

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually simple approach to the decision-making and control problem: use learning for the purpose of estimating an approximate dynamics model, and offload the rest of the work to…

Machine Learning · Computer Science 2023-07-13 Michael Janner

Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning

This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based…

Systems and Control · Electrical Eng. & Systems 2022-12-07 Ramij R. Hossain , Tianzhixi Yin , Yan Du , Renke Huang , Jie Tan , Wenhao Yu , Yuan Liu , Qiuhua Huang

Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees

In this work we present a preliminary investigation of a novel algorithm called Dyna-T. In reinforcement learning (RL) a planning agent has its own representation of the environment as a model. To discover an optimal policy to interact with…

Machine Learning · Computer Science 2022-01-20 Tarek Faycal , Claudio Zito

Sample-efficient reinforcement learning using deep Gaussian processes

Reinforcement learning provides a framework for learning to control which actions to take towards completing a task through trial-and-error. In many applications observing interactions is costly, necessitating sample-efficient learning. In…

Machine Learning · Statistics 2020-11-04 Charles Gadd , Markus Heinonen , Harri Lähdesmäki , Samuel Kaski

Value function estimation using conditional diffusion models for control

A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative…

Machine Learning · Computer Science 2023-06-14 Bogdan Mazoure , Walter Talbott , Miguel Angel Bautista , Devon Hjelm , Alexander Toshev , Josh Susskind

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration

Recent advancements in deep reinforcement learning (RL) have demonstrated notable progress in sample efficiency, spanning both model-based and model-free paradigms. Despite the identification and mitigation of specific bottlenecks in prior…

Machine Learning · Computer Science 2024-04-02 Yibo Wang , Jiang Zhao

Model-specific Data Subsampling with Influence Functions

Model selection requires repeatedly evaluating models on a given dataset and measuring their relative performances. In modern applications of machine learning, the models being considered are increasingly more expensive to evaluate and the…

Machine Learning · Computer Science 2020-10-21 Anant Raj , Cameron Musco , Lester Mackey , Nicolo Fusi

High-Accuracy Model-Based Reinforcement Learning, a Survey

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems from game playing and robotics have been solved with deep model-free methods. Unfortunately, the sample…

Machine Learning · Computer Science 2021-07-20 Aske Plaat , Walter Kosters , Mike Preuss

Signal Reconstruction via H-infinity Sampled-Data Control Theory: Beyond the Shannon Paradigm

This paper presents a new method for signal reconstruction by leveraging sampled-data control theory. We formulate the signal reconstruction problem in terms of an analog performance optimization problem using a stable discrete-time filter.…

Information Theory · Computer Science 2015-06-16 Yutaka Yamamoto , Masaaki Nagahara , Pramod P. Khargonekar

State-Novelty Guided Action Persistence in Deep Reinforcement Learning

While a powerful and promising approach, deep reinforcement learning (DRL) still suffers from sample inefficiency, which can be notably improved by resorting to more sophisticated techniques to address the exploration-exploitation dilemma.…

Machine Learning · Computer Science 2024-09-10 Jianshu Hu , Paul Weng , Yutong Ban

Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies

Researchers have demonstrated that Deep Reinforcement Learning (DRL) is a powerful tool for finding policies that perform well on complex robotic systems. However, these policies are often unpredictable and can induce highly variable…

Robotics · Computer Science 2022-03-08 Sean Gillen , Asutay Ozmen , Katie Byl

Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems

Learning models or control policies from data has become a powerful tool to improve the performance of uncertain systems. While a strong focus has been placed on increasing the amount and quality of data to improve performance, data can…

Systems and Control · Electrical Eng. & Systems 2024-10-02 Ralf Römer , Lukas Brunke , Siqi Zhou , Angela P. Schoellig

Model-Based Data-Efficient and Robust Reinforcement Learning

A data-efficient learning-based control design method is proposed in this paper. It is based on learning a system dynamics model that is then leveraged in a two-level procedure. On the higher level, a simple but powerful optimization…

Systems and Control · Electrical Eng. & Systems 2026-02-03 Ludvig Svedlund , Constantin Cronrath , Jonas Fredriksson , Bengt Lennartson

Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments

Model-based next state prediction and state value prediction are slow to converge. To address these challenges, we do the following: i) Instead of a neural network, we do model-based planning using a parallel memory retrieval system (which…

Artificial Intelligence · Computer Science 2023-02-02 John Chong Min Tan , Mehul Motani