Related papers: Debiased Model-based Representations for Sample-ef…

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) has proven to be a data efficient approach for learning control tasks but is difficult to utilize in domains with complex observations such as images. In this paper, we present a method for learning…

Machine Learning · Computer Science 2019-06-25 Marvin Zhang, Sharad Vikram, Laura Smith, Pieter Abbeel, Matthew J. Johnson, Sergey Levine

Continuous Deep Q-Learning with Model-based Acceleration

Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free…

Machine Learning · Computer Science 2016-03-03 Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine

Distributed Q-Learning for Dynamically Decoupled Systems

Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents. However in many applications, building accurate models of agents or interactions amongst them might be…

Optimization and Control · Mathematics 2019-03-21 Siavash Alemzadeh, Mehran Mesbahi

Model Predictive Control with Self-supervised Representation Learning

Over the last few years, we have not seen any major developments in model-free or model-based learning methods that would make one obsolete relative to the other. In most cases, the used technique is heavily dependent on the use case…

Machine Learning · Computer Science 2023-04-17 Jonas Matthies, Muhammad Burhan Hafez, Mostafa Kotb, Stefan Wermter

Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

Distributed Distributional DrQ is a model-free and off-policy RL algorithm for continuous control tasks based on the state and observation of the agent, which is an actor-critic method with the data-augmentation and the distributional…

Machine Learning · Computer Science 2024-04-17 Zehao Zhou

Curious Meta-Controller: Adaptive Alternation between Model-Based and Model-Free Control in Deep Reinforcement Learning

Recent success in deep reinforcement learning for continuous control has been dominated by model-free approaches which, unlike model-based approaches, do not suffer from representational limitations in making assumptions about the world…

Machine Learning · Computer Science 2019-05-07 Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning

In state of the art model-free off-policy deep reinforcement learning, a replay memory is used to store past experience and derive all network updates. Even if both state and action spaces are continuous, the replay memory only holds a…

Machine Learning · Computer Science 2020-07-16 Sabrina Hoppe, Marc Toussaint

Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents

Training sophisticated agents for optimal decision-making under uncertainty has been key to the rapid development of modern autonomous systems across fields. Notably, model-free reinforcement learning (RL) has enabled decision-making agents…

Machine Learning · Computer Science 2025-07-21 Thomas Banker, Ali Mesbah

Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks

Continuous control Deep Reinforcement Learning (RL) approaches are known to suffer from estimation biases, leading to suboptimal policies. This paper introduces innovative methods in RL, focusing on addressing and exploiting estimation…

Machine Learning · Computer Science 2024-10-14 Niccolò Turcato, Alberto Sinigaglia, Alberto Dalla Libera, Ruggero Carli, Gian Antonio Susto

Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings

Learning continuous control in high-dimensional sparse reward settings, such as robotic manipulation, is a challenging problem due to the number of samples often required to obtain accurate optimal value and policy estimates. While many…

Robotics · Computer Science 2021-07-29 Sreehari Rammohan, Shangqun Yu, Bowen He, Eric Hsiung, Eric Rosen, Stefanie Tellex, George Konidaris

Mitigating Estimation Bias with Representation Learning in TD Error-Driven Regularization

Deterministic policy gradient algorithms for continuous control suffer from value estimation biases that degrade performance. While double critics reduce such biases, the exploration potential of double actors remains underexplored.…

Machine Learning · Computer Science 2025-11-21 Haohui Chen, Zhiyong Chen, Aoxiang Liu, Wentuo Fang

Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning

Training task-completion dialogue agents with reinforcement learning usually requires a large number of real user experiences. The Dyna-Q algorithm extends Q-learning by integrating a world model, and thus can effectively boost training…

Computation and Language · Computer Science 2018-11-20 Yuexin Wu, Xiujun Li, Jingjing Liu, Jianfeng Gao, Yiming Yang

Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems

This paper introduces and analyzes an improved Q-learning algorithm for discrete-time linear time-invariant systems. The proposed method does not require any knowledge of the system dynamics, and it enjoys significant efficiency advantages…

Systems and Control · Electrical Eng. & Systems 2023-04-03 Victor G. Lopez, Mohammad Alsalti, Matthias A. Müller

Information Theoretic Model Predictive Q-Learning

Model-free Reinforcement Learning (RL) works well when experience can be collected cheaply and model-based RL is effective when system dynamics can be modeled accurately. However, both assumptions can be violated in real world problems such…

Machine Learning · Computer Science 2020-05-07 Mohak Bhardwaj, Ankur Handa, Dieter Fox, Byron Boots

Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid

Autonomous and learning systems based on Deep Reinforcement Learning have firmly established themselves as a foundation for approaches to creating resilient and efficient Cyber-Physical Energy Systems. However, most current approaches…

Artificial Intelligence · Computer Science 2024-04-03 Eric MSP Veith, Torben Logemann, Aleksandr Berezin, Arlena Wellßow, Stephan Balduin

Distributed Deep Q-Learning

We propose a distributed deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is based on the deep Q-network, a convolutional neural network trained…

Machine Learning · Computer Science 2015-10-16 Hao Yi Ong, Kevin Chavez, Augustus Hong

Model-Free Quantum Control with Reinforcement Learning

Model bias is an inherent limitation of the current dominant approach to optimal quantum control, which relies on a system simulation for optimization of control policies. To overcome this limitation, we propose a circuit-based approach for…

Quantum Physics · Physics 2022-03-31 V. V. Sivak, A. Eickbusch, H. Liu, B. Royer, I. Tsioutsios, M. H. Devoret

Model-Augmented Actor-Critic: Backpropagating through Paths

Current model-based reinforcement learning approaches use the model simply as a learned black-box simulator to augment the data for policy optimization or value function learning. In this paper, we show how to make more effective use of the…

Machine Learning · Computer Science 2020-05-19 Ignasi Clavera, Violet Fu, Pieter Abbeel

Uncertainty-aware Model-based Policy Optimization

Model-based reinforcement learning has the potential to be more sample efficient than model-free approaches. However, existing model-based methods are vulnerable to model bias, which leads to poor generalization and asymptotic performance…

Machine Learning · Computer Science 2019-06-27 Tung-Long Vuong, Kenneth Tran

Deep Disentangled Representation Network for Treatment Effect Estimation

Estimating individual-level treatment effect from observational data is a fundamental problem in causal inference and has attracted increasing attention in the fields of education, healthcare, and public policy. In this work, we concentrate…

Machine Learning · Computer Science 2025-07-10 Hui Meng, Keping Yang, Xuyu Peng, Bo Zheng