Related papers: Q-DPM: An Efficient Model-Free Dynamic Power Manag…

Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems

This paper introduces and analyzes an improved Q-learning algorithm for discrete-time linear time-invariant systems. The proposed method does not require any knowledge of the system dynamics, and it enjoys significant efficiency advantages…

Systems and Control · Electrical Eng. & Systems 2023-04-03 Victor G. Lopez , Mohammad Alsalti , Matthias A. Müller

Continuous Deep Q-Learning with Model-based Acceleration

Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free…

Machine Learning · Computer Science 2016-03-03 Shixiang Gu , Timothy Lillicrap , Ilya Sutskever , Sergey Levine

Composable Deep Reinforcement Learning for Robotic Manipulation

Model-free deep reinforcement learning has been shown to exhibit good performance in domains ranging from video games to simulated robotic manipulation and locomotion. However, model-free methods are known to perform poorly when the…

Machine Learning · Computer Science 2018-03-20 Tuomas Haarnoja , Vitchyr Pong , Aurick Zhou , Murtaza Dalal , Pieter Abbeel , Sergey Levine

DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction

The DeeP-Mod framework builds an environment model using features from a Deep Dynamic Programming Network (DDPN), trained via a Deep Q-Network (DQN). While Deep Q-Learning is effective in decision-making, state information is lost in deeper…

Machine Learning · Computer Science 2025-08-26 Chris Child , Lam Ngo

Model-Based Data-Efficient and Robust Reinforcement Learning

A data-efficient learning-based control design method is proposed in this paper. It is based on learning a system dynamics model that is then leveraged in a two-level procedure. On the higher level, a simple but powerful optimization…

Systems and Control · Electrical Eng. & Systems 2026-02-03 Ludvig Svedlund , Constantin Cronrath , Jonas Fredriksson , Bengt Lennartson

Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning

In state of the art model-free off-policy deep reinforcement learning, a replay memory is used to store past experience and derive all network updates. Even if both state and action spaces are continuous, the replay memory only holds a…

Machine Learning · Computer Science 2020-07-16 Sabrina Hoppe , Marc Toussaint

Efficient Model-free Reinforcement Learning in Metric Spaces

Model-free Reinforcement Learning (RL) algorithms such as Q-learning [Watkins, Dayan 92] have been widely used in practice and can achieve human level performance in applications such as video games [Mnih et al. 15]. Recently, equipped with…

Machine Learning · Computer Science 2019-05-03 Zhao Song , Wen Sun

Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial

In this paper, a review of model-free reinforcement learning for learning of dynamical systems in uncertain environments has discussed. For this purpose, the Markov Decision Process (MDP) will be reviewed. Furthermore, some learning…

Machine Learning · Computer Science 2019-05-21 Mehran Attar , Mohammadreza Dabirian

Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning

This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based…

Systems and Control · Electrical Eng. & Systems 2022-12-07 Ramij R. Hossain , Tianzhixi Yin , Yan Du , Renke Huang , Jie Tan , Wenhao Yu , Yuan Liu , Qiuhua Huang

Performance-Aware Power Management in Embedded Controllers with Multiple-Voltage Processors

The goal of this work is to minimize the energy dissipation of embedded controllers without jeopardizing the quality of control (QoC). Taking advantage of the dynamic voltage scaling (DVS) technology, this paper develops a performance-aware…

Other Computer Science · Computer Science 2008-09-30 Feng Xia , Liping Liu , Longhua Ma , Youxian Sun , Jinxiang Dong

A study on a Q-Learning algorithm application to a manufacturing assembly problem

The development of machine learning algorithms has been gathering relevance to address the increasing modelling complexity of manufacturing decision-making problems. Reinforcement learning is a methodology with great potential due to the…

Machine Learning · Computer Science 2023-04-18 Miguel Neves , Miguel Vieira , Pedro Neto

Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional policy produces promising results. The Decision Transformer (DT) combines the conditional policy approach and a transformer architecture, showing…

Machine Learning · Computer Science 2023-05-26 Taku Yamagata , Ahmed Khalil , Raul Santos-Rodriguez

Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL

Reinforcement learning holds tremendous promise in accelerator controls. The primary goal of this paper is to show how this approach can be utilised on an operational level on accelerator physics problems. Despite the success of model-free…

Machine Learning · Computer Science 2022-01-27 Simon Hirlaender , Niky Bruchon

Modeling Time-Dependent Systems using Dynamic Quantum Bayesian Networks

Advances in data collection using inexpensive sensors have enabled monitoring the performance of dynamic systems, and to implement appropriate control actions to improve their performance. Moreover, engineering systems often operate under…

Quantum Physics · Physics 2021-07-05 Sima E. Borujeni , Saideep Nannapaneni

Model-Based Reinforcement Learning via Meta-Policy Optimization

Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic…

Machine Learning · Computer Science 2018-09-17 Ignasi Clavera , Jonas Rothfuss , John Schulman , Yasuhiro Fujita , Tamim Asfour , Pieter Abbeel

Energy-Aware Disk Storage Management: Online Approach with Application in DBMS

Energy consumption has become a first-class optimization goal in design and implementation of data-intensive computing systems. This is particularly true in the design of database management systems (DBMS), which was found to be the major…

Databases · Computer Science 2017-03-09 Peyman Behzadnia , Yi-Cheng Tu , Bo Zeng , Wei Yuan

Data-driven inventory management for new products: An adjusted Dyna-$Q$ approach with transfer learning

In this paper, we propose a novel reinforcement learning algorithm for inventory management of newly launched products with no historical demand information. The algorithm follows the classic Dyna-$Q$ structure, balancing the model-free and…

Machine Learning · Computer Science 2025-06-10 Xinye Qu , Longxiao Liu , Wenjie Huang

Q-learning-based Model-free Safety Filter

Ensuring safety via safety filters in real-world robotics presents significant challenges, particularly when the system dynamics is complex or unavailable. To handle this issue, learning-based safety filters recently gained popularity,…

Robotics · Computer Science 2024-12-02 Guo Ning Sue , Yogita Choudhary , Richard Desatnik , Carmel Majidi , John Dolan , Guanya Shi

Empirical learning of dynamical decoupling on quantum processors

Dynamical decoupling (DD) is a low-overhead method for quantum error suppression. Despite extensive work in DD design, finding pulse sequences that optimally decouple computational qubits on noisy quantum hardware is not well understood. In…

Quantum Physics · Physics 2026-04-22 Christopher Tong , Helena Zhang , Bibek Pokharel

Optimal control of batch processes via a deterministic Q-learning method

Dynamic optimization of nonlinear chemical systems -- such as batch reactors -- should be applied online, and the suitable control taken should be according to the current state of the system rather than the current time instant. The recent…

Systems and Control · Computer Science 2019-04-16 Abdelrahman ElMezain , Mohamed Saleh , Jie Zhang , Ahmed Soliman , Seif Fateen