Related papers: Transfer Q-learning

Target Transfer Q-Learning and Its Convergence Analysis

Q-learning is one of the most popular methods in Reinforcement Learning (RL). Transfer Learning aims to use knowledge learned from source tasks to improve the sample complexity of new tasks. Considering that…

Machine Learning · Computer Science 2018-09-25 Yue Wang, Qi Meng, Wei Cheng, Yuting Liu, Zhi-Ming Ma, Tie-Yan Liu

Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

In data-driven decision-making in marketing, healthcare, and education, it is desirable to utilize a large amount of data from existing ventures to navigate high-dimensional feature spaces and address data scarcity in new ventures. We…

Machine Learning · Computer Science 2026-01-13 Elynn Chen, Xi Chen, Wenbo Jing

Provably Efficient Multi-Task Reinforcement Learning with Model Transfer

We study multi-task reinforcement learning (RL) in tabular episodic Markov decision processes (MDPs). We formulate a heterogeneous multi-player RL problem, in which a group of players concurrently face similar but not necessarily identical…

Machine Learning · Computer Science 2022-01-19 Chicheng Zhang, Zhi Wang

Transition Transfer $Q$-Learning for Composite Markov Decision Processes

To bridge the gap between empirical success and theoretical understanding in transfer reinforcement learning (RL), we study a principled approach with provable performance guarantees. We introduce a novel composite MDP framework where…

Machine Learning · Statistics 2025-02-04 Jinhang Chai, Elynn Chen, Lin Yang

Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning

In dynamic decision-making scenarios across business and healthcare, leveraging sample trajectories from diverse populations can significantly enhance reinforcement learning (RL) performance for specific target populations, especially when…

Machine Learning · Statistics 2025-04-15 Jinhang Chai, Elynn Chen, Jianqing Fan

Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer

In this paper, we study the problem of transferring the available Markov Decision Process (MDP) models to learn and plan efficiently in an unknown but similar MDP. We refer to it as Model Transfer Reinforcement Learning (MTRL)…

Machine Learning · Computer Science 2023-02-21 Hannes Eriksson, Debabrota Basu, Tommy Tram, Mina Alibeigi, Christos Dimitrakakis

Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization

Reinforcement learning (RL) is a classical tool to solve network control or policy optimization problems in unknown environments. The original Q-learning suffers from performance and complexity challenges across very large networks. Herein,…

Machine Learning · Computer Science 2024-09-02 Talha Bozkus, Urbashi Mitra

Transfer from Multiple MDPs

Transfer reinforcement learning (RL) methods leverage the experience collected on a set of source tasks to speed up RL algorithms. A simple and effective approach is to transfer samples from source tasks and include them into the…

Artificial Intelligence · Computer Science 2011-09-02 Alessandro Lazaric, Marcello Restelli

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis

In Markov decision processes (MDPs), quantile risk measures such as Value-at-Risk are a standard metric for modeling RL agents' preferences for certain outcomes. This paper proposes a new Q-learning algorithm for quantile optimization in…

Machine Learning · Computer Science 2024-11-01 Jia Lin Hau, Erick Delage, Esther Derman, Mohammad Ghavamzadeh, Marek Petrik

Accelerated Reinforcement Learning for Temporal Logic Control Objectives

This paper addresses the problem of learning control policies for mobile robots, modeled as unknown Markov Decision Processes (MDPs), that are tasked with temporal logic missions, such as sequencing, coverage, or surveillance. The MDP…

Robotics · Computer Science 2022-07-13 Yiannis Kantaros

Periodic Q-Learning

The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning…

Machine Learning · Computer Science 2020-02-25 Donghwan Lee, Niao He

On The Transferability of Deep-Q Networks

Transfer Learning (TL) is an efficient machine learning paradigm that allows overcoming some of the hurdles that characterize the successful training of deep neural networks, ranging from long training times to the need for large datasets.…

Machine Learning · Computer Science 2021-11-24 Matthia Sabatelli, Pierre Geurts

Semi-Markov Offline Reinforcement Learning for Healthcare

Reinforcement learning (RL) tasks are typically framed as Markov Decision Processes (MDPs), assuming that decisions are made at fixed time intervals. However, many applications of great importance, including healthcare, do not satisfy this…

Machine Learning · Computer Science 2022-03-22 Mehdi Fatemi, Mary Wu, Jeremy Petch, Walter Nelson, Stuart J. Connolly, Alexander Benz, Anthony Carnicelli, Marzyeh Ghassemi

Transfer Entropy in MDPs with Temporal Logic Specifications

Emerging applications in autonomy require control techniques that take into account uncertain environments, communication and sensing constraints, while satisfying high-level mission specifications. Motivated by this need, we consider a…

Systems and Control · Computer Science 2018-09-19 Suda Bharadwaj, Mohamadreza Ahmadi, Takashi Tanaka, Ufuk Topcu

A Structure-aware Online Learning Algorithm for Markov Decision Processes

To overcome the curse of dimensionality and curse of modeling in Dynamic Programming (DP) methods for solving classical Markov Decision Process (MDP) problems, Reinforcement Learning (RL) algorithms are popular. In this paper, we consider…

Machine Learning · Computer Science 2018-11-29 Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar, José M. F. Moura, H. Vincent Poor

Transferring Knowledge Fragments for Learning Distance Metric from A Heterogeneous Domain

The goal of transfer learning is to improve the performance of a target learning task by leveraging information (or transferring knowledge) from other related tasks. In this paper, we examine the problem of transfer distance metric learning…

Machine Learning · Statistics 2019-04-09 Yong Luo, Yonggang Wen, Tongliang Liu, Dacheng Tao

Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes

Due to physiological variation, patients diagnosed with the same condition may exhibit divergent, but related, responses to the same treatments. Hidden Parameter Markov Decision Processes (HiP-MDPs) tackle this transfer-learning problem by…

Machine Learning · Statistics 2016-12-05 Taylor Killian, George Konidaris, Finale Doshi-Velez

Reinforcement Learning in Switching Non-Stationary Markov Decision Processes: Algorithms and Convergence Analysis

Reinforcement learning in non-stationary environments is challenging due to abrupt and unpredictable changes in dynamics, often causing traditional algorithms to fail to converge. However, in many real-world cases, non-stationarity has some…

Machine Learning · Computer Science 2025-03-25 Mohsen Amiri, Sindri Magnússon

Stochastic Lipschitz Q-Learning

In an episodic Markov Decision Process (MDP) problem, an online algorithm chooses from a set of actions in a sequence of $H$ trials, where $H$ is the episode length, in order to maximize the total payoff of the chosen actions. Q-learning,…

Machine Learning · Computer Science 2019-07-11 Xu Zhu