Related papers: Robust Q-learning

Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning

The $Q$-learning algorithm is a simple and widely-used stochastic approximation scheme for reinforcement learning, but the basic protocol can exhibit instability in conjunction with function approximation. Such instability can be observed…

Machine Learning · Computer Science 2022-06-03 Andrea Zanette, Martin J. Wainwright

Online Robust Reinforcement Learning with Model Uncertainty

Robust reinforcement learning (RL) is to find a policy that optimizes the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on model-free robust RL, where the uncertainty set is defined to be centering at a…

Machine Learning · Computer Science 2021-10-29 Yue Wang, Shaofeng Zou

$Q$- and $A$-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that…

Methodology · Statistics 2015-02-04 Phillip J. Schulte, Anastasios A. Tsiatis, Eric B. Laber, Marie Davidian

Penalized Q-Learning for Dynamic Treatment Regimes

A dynamic treatment regime effectively incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these become more and more popular in conjunction with longitudinal data from…

Methodology · Statistics 2011-08-29 Rui Song, Weiwei Wang, Donglin Zeng, Michael R. Kosorok

Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes

Precision medicine aims to tailor therapeutic decisions to individual patient characteristics. This objective is commonly formalized through dynamic treatment regimes, which use statistical and machine learning methods to derive sequential…

Machine Learning · Statistics 2026-03-23 Sophia Yazzourh, Erica E. M. Moodie

Towards Better Interpretability in Deep Q-Networks

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these…

Machine Learning · Computer Science 2018-11-16 Raghuram Mandyam Annasamy, Katia Sycara

Neural-Network-Driven Reward Prediction as a Heuristic: Advancing Q-Learning for Mobile Robot Path Planning

Q-learning is a widely used reinforcement learning technique for solving path planning problems. It primarily involves the interaction between an agent and its environment, enabling the agent to learn an optimal strategy that maximizes…

Robotics · Computer Science 2024-12-18 Yiming Ji, Kaijie Yun, Yang Liu, Zongwu Xie, Hong Liu

Regularized Q-learning through Robust Averaging

We propose a new Q-learning variant, called 2RA Q-learning, that addresses some weaknesses of existing Q-learning methods in a principled manner. One such weakness is an underlying estimation bias which cannot be controlled and often…

Optimization and Control · Mathematics 2024-05-30 Peter Schmitt-Förster, Tobias Sutter

Robust High-Dimensional Linear Regression

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu, Bo Li, Yevgeniy Vorobeychik, Alina Oprea

On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm

Robust reinforcement learning (RRL) aims at seeking a robust policy to optimize the worst case performance over an uncertainty set of Markov decision processes (MDPs). This set contains some perturbed MDPs from a nominal MDP (N-MDP) that…

Machine Learning · Computer Science 2023-11-21 Ukjo Hwang, Songnam Hong

A Flexible Framework for Incorporating Patient Preferences Into Q-Learning

In real-world healthcare settings, treatment decisions often involve optimizing for multivariate outcomes such as treatment efficacy and severity of side effects based on individual preferences. However, existing statistical methods for…

Machine Learning · Statistics 2025-09-03 Joshua P. Zitovsky, Yating Zou, Leslie Wilson, Michael R. Kosorok

Diagnosing Bottlenecks in Deep Q-learning Algorithms

Q-learning methods represent a commonly used class of algorithms in reinforcement learning: they are generally efficient and simple, and can be combined readily with function approximators for deep reinforcement learning (RL). However, the…

Machine Learning · Computer Science 2019-02-28 Justin Fu, Aviral Kumar, Matthew Soh, Sergey Levine

Robust Learning-Based Control via Bootstrapped Multiplicative Noise

Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties…

Machine Learning · Computer Science 2021-08-13 Benjamin Gravell, Tyler Summers

Linear Regression Using Quantum Annealing with Continuous Variables

Linear regression is a data analysis technique, which is categorized as supervised learning. By utilizing known data, we can predict unknown data. Recently, researchers have explored the use of quantum annealing (QA) to perform linear…

Quantum Physics · Physics 2024-10-14 Asuka Koura, Takashi Imoto, Katsuki Ura, Yuichiro Matsuzaki

Quantile Regression of Latent Longitudinal Trajectory Features

Quantile regression has demonstrated promising utility in longitudinal data analysis. Existing work is primarily focused on modeling cross-sectional outcomes, while outcome trajectories often carry more substantive information in practice.…

Methodology · Statistics 2018-06-19 Huijuan Ma, Limin Peng, Haoda Fu

Robust Learning of Recurrent Neural Networks in Presence of Exogenous Noise

Recurrent Neural networks (RNN) have shown promising potential for learning dynamics of sequential data. However, artificial neural networks are known to exhibit poor robustness in presence of input noise, where the sequential architecture…

Machine Learning · Computer Science 2021-05-05 Arash Amini, Guangyi Liu, Nader Motee

Robust Reinforcement Learning using Offline Data

The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the uncertainty in model parameters. Parameter uncertainty commonly occurs in many real-world RL applications due to simulator modeling errors,…

Machine Learning · Computer Science 2022-10-19 Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh

Towards Adapting Reinforcement Learning Agents to New Tasks: Insights from Q-Values

While contemporary reinforcement learning research and applications have embraced policy gradient methods as the panacea of solving learning problems, value-based methods can still be useful in many domains as long as we can wrangle with…

Machine Learning · Computer Science 2024-07-16 Ashwin Ramaswamy, Ransalu Senanayake

Reachable Sets of Classifiers and Regression Models: (Non-)Robustness Analysis and Robust Training

Neural networks achieve outstanding accuracy in classification and regression tasks. However, understanding their behavior still remains an open challenge that requires questions to be addressed on the robustness, explainability and…

Machine Learning · Computer Science 2021-05-13 Anna-Kathrin Kopetzki, Stephan Günnemann

Deep Q-learning: a robust control approach

In this paper, we place deep Q-learning into a control-oriented perspective and study its learning dynamics with well-established techniques from robust control. We formulate an uncertain linear time-invariant model by means of the neural…

Machine Learning · Computer Science 2022-11-08 Balazs Varga, Balazs Kulcsar, Morteza Haghir Chehreghani