Related papers: Model-Augmented Q-learning

Meta-Q-Learning

This paper introduces Meta-Q-Learning (MQL), a new off-policy algorithm for meta-Reinforcement Learning (meta-RL). MQL builds upon three simple ideas. First, we show that Q-learning is competitive with state-of-the-art meta-RL algorithms if…

Machine Learning · Computer Science 2020-04-07 Rasool Fakoor , Pratik Chaudhari , Stefano Soatto , Alexander J. Smola

MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments

Reinforcement learning (RL) is one of the most practical ways to learn from real-life use-cases. Motivated from the cognitive methods used by humans makes it a widely acceptable strategy in the field of artificial intelligence. Most of the…

Artificial Intelligence · Computer Science 2026-04-14 Abhishek Sawaika , Samuel Yen-Chi Chen , Udaya Parampalli , Rajkumar Buyya

BiCQL-ML: A Bi-Level Conservative Q-Learning Framework for Maximum Likelihood Inverse Reinforcement Learning

Offline inverse reinforcement learning (IRL) aims to recover a reward function that explains expert behavior using only fixed demonstration data, without any additional online interaction. We propose BiCQL-ML, a policy-free offline IRL…

Machine Learning · Computer Science 2025-12-01 Junsung Park

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

Many sequential decision-making tasks involve optimizing multiple conflicting objectives, requiring policies that adapt to different user preferences. In multi-objective reinforcement learning (MORL), one widely studied approach addresses…

Machine Learning · Computer Science 2026-04-28 Ying-Tu Chen , Wei Hung , Bing-Shu Wu , Zhang-Wei Hong , Ping-Chun Hsieh

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

Reinforcement Learning by Comparing Immediate Reward

This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate rewards using a variation of Q-Learning algorithm. Unlike the conventional Q-Learning, the proposed algorithm compares current reward with…

Machine Learning · Computer Science 2010-09-15 Punit Pandey , Deepshikha Pandey , Shishir Kumar

Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning

Model-based offline reinforcement learning (RL) is a compelling approach that addresses the challenge of learning from limited, static data by generating imaginary trajectories using learned models. However, these approaches often struggle…

Machine Learning · Computer Science 2024-12-04 Kwanyoung Park , Youngwoon Lee

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Incorporating high-level knowledge is an effective way to expedite reinforcement learning (RL), especially for complex tasks with sparse rewards. We investigate an RL problem where the high-level knowledge is in the form of reward machines,…

Artificial Intelligence · Computer Science 2022-02-10 Zhe Xu , Ivan Gavran , Yousef Ahmad , Rupak Majumdar , Daniel Neider , Ufuk Topcu , Bo Wu

Neural-Network-Driven Reward Prediction as a Heuristic: Advancing Q-Learning for Mobile Robot Path Planning

Q-learning is a widely used reinforcement learning technique for solving path planning problems. It primarily involves the interaction between an agent and its environment, enabling the agent to learn an optimal strategy that maximizes…

Robotics · Computer Science 2024-12-18 Yiming Ji , Kaijie Yun , Yang Liu , Zongwu Xie , Hong Liu

Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization

Reinforcement learning (RL) is a classical tool to solve network control or policy optimization problems in unknown environments. The original Q-learning suffers from performance and complexity challenges across very large networks. Herein,…

Machine Learning · Computer Science 2024-09-02 Talha Bozkus , Urbashi Mitra

Towards General-Purpose Model-Free Reinforcement Learning

Reinforcement learning (RL) promises a framework for near-universal problem-solving. In practice however, RL algorithms are often tailored to specific benchmarks, relying on carefully tuned hyperparameters and algorithmic choices. Recently,…

Machine Learning · Computer Science 2025-01-28 Scott Fujimoto , Pierluca D'Oro , Amy Zhang , Yuandong Tian , Michael Rabbat

Uniform State Abstraction For Reinforcement Learning

Potential Based Reward Shaping combined with a potential function based on appropriately defined abstract knowledge has been shown to significantly improve learning speed in Reinforcement Learning. MultiGrid Reinforcement Learning (MRL) has…

Machine Learning · Computer Science 2020-04-08 John Burden , Daniel Kudenko

Flow Q-Learning

We present flow Q-learning (FQL), a simple and performant offline reinforcement learning (RL) method that leverages an expressive flow-matching policy to model arbitrarily complex action distributions in data. Training a flow policy with RL…

Machine Learning · Computer Science 2025-05-27 Seohong Park , Qiyang Li , Sergey Levine

Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning (MARL) is useful in many problems that require the cooperation and coordination of multiple agents. Learning optimal policies using reinforcement learning in a multi-agent setting can be very difficult as…

Machine Learning · Computer Science 2022-05-31 Rafael Pina , Varuna De Silva , Joosep Hook , Ahmet Kondoz

Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression

Offline reinforcement learning (RL) enables policy learning from fixed datasets without further environment interaction, making it particularly valuable in high-risk or costly domains. Extreme Q-Learning (XQL) is a recent offline RL…

Machine Learning · Computer Science 2026-04-15 Xinming Gao , Shangzhe Li , Yujin Cai , Wenwu Yu

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo , Allan Zhou , Archit Sharma , Chelsea Finn

Boosting Offline Reinforcement Learning with Residual Generative Modeling

Offline reinforcement learning (RL) tries to learn the near-optimal policy with recorded offline experience without online exploration. Current offline RL research includes: 1) generative modeling, i.e., approximating a policy using fixed…

Machine Learning · Computer Science 2021-06-23 Hua Wei , Deheng Ye , Zhao Liu , Hao Wu , Bo Yuan , Qiang Fu , Wei Yang , Zhenhui Li

Active Measure Reinforcement Learning for Observation Cost Minimization

Standard reinforcement learning (RL) algorithms assume that the observation of the next state comes instantaneously and at no cost. In a wide variety of sequential decision making tasks ranging from medical treatment to scientific…

Artificial Intelligence · Computer Science 2020-05-27 Colin Bellinger , Rory Coles , Mark Crowley , Isaac Tamblyn

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

Recent reinforcement learning (RL) methods have substantially enhanced the planning capabilities of Large Language Models (LLMs), yet the theoretical basis for their effectiveness remains elusive. In this work, we investigate RL's benefits…

Artificial Intelligence · Computer Science 2026-03-04 Siwei Wang , Yifei Shen , Haoran Sun , Shi Feng , Shang-Hua Teng , Li Dong , Yaru Hao , Wei Chen

Conservative Q-Learning for Offline Reinforcement Learning

Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously-collected,…

Machine Learning · Computer Science 2020-08-20 Aviral Kumar , Aurick Zhou , George Tucker , Sergey Levine