English
Related papers

Related papers: Robust Q-learning

200 papers

The $Q$-learning algorithm is a simple and widely-used stochastic approximation scheme for reinforcement learning, but the basic protocol can exhibit instability in conjunction with function approximation. Such instability can be observed…

Machine Learning · Computer Science 2022-06-03 Andrea Zanette , Martin J. Wainwright

Robust reinforcement learning (RL) is to find a policy that optimizes the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on model-free robust RL, where the uncertainty set is defined to be centering at a…

Machine Learning · Computer Science 2021-10-29 Yue Wang , Shaofeng Zou

In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that…

Methodology · Statistics 2015-02-04 Phillip J. Schulte , Anastasios A. Tsiatis , Eric B. Laber , Marie Davidian

A dynamic treatment regime effectively incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these become more and more popular in conjunction with longitudinal data from…

Methodology · Statistics 2011-08-29 Rui Song , Weiwei Wang , Donglin Zeng , Michael R. Kosorok

Precision medicine aims to tailor therapeutic decisions to individual patient characteristics. This objective is commonly formalized through dynamic treatment regimes, which use statistical and machine learning methods to derive sequential…

Machine Learning · Statistics 2026-03-23 Sophia Yazzourh , Erica E. M. Moodie

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these…

Machine Learning · Computer Science 2018-11-16 Raghuram Mandyam Annasamy , Katia Sycara

Q-learning is a widely used reinforcement learning technique for solving path planning problems. It primarily involves the interaction between an agent and its environment, enabling the agent to learn an optimal strategy that maximizes…

Robotics · Computer Science 2024-12-18 Yiming Ji , Kaijie Yun , Yang Liu , Zongwu Xie , Hong Liu

We propose a new Q-learning variant, called 2RA Q-learning, that addresses some weaknesses of existing Q-learning methods in a principled manner. One such weakness is an underlying estimation bias which cannot be controlled and often…

Optimization and Control · Mathematics 2024-05-30 Peter Schmitt-Förster , Tobias Sutter

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu , Bo Li , Yevgeniy Vorobeychik , Alina Oprea

Robust reinforcement learning (RRL) aims at seeking a robust policy to optimize the worst case performance over an uncertainty set of Markov decision processes (MDPs). This set contains some perturbed MDPs from a nominal MDP (N-MDP) that…

Machine Learning · Computer Science 2023-11-21 Ukjo Hwang , Songnam Hong

In real-world healthcare settings, treatment decisions often involve optimizing for multivariate outcomes such as treatment efficacy and severity of side effects based on individual preferences. However, existing statistical methods for…

Machine Learning · Statistics 2025-09-03 Joshua P. Zitovsky , Yating Zou , Leslie Wilson , Michael R. Kosorok

Q-learning methods represent a commonly used class of algorithms in reinforcement learning: they are generally efficient and simple, and can be combined readily with function approximators for deep reinforcement learning (RL). However, the…

Machine Learning · Computer Science 2019-02-28 Justin Fu , Aviral Kumar , Matthew Soh , Sergey Levine

Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties…

Machine Learning · Computer Science 2021-08-13 Benjamin Gravell , Tyler Summers

Linear regression is a data analysis technique, which is categorized as supervised learning. By utilizing known data, we can predict unknown data. Recently, researchers have explored the use of quantum annealing (QA) to perform linear…

Quantum Physics · Physics 2024-10-14 Asuka Koura , Takashi Imoto , Katsuki Ura , Yuichiro Matsuzaki

Quantile regression has demonstrated promising utility in longitudinal data analysis. Existing work is primarily focused on modeling cross-sectional outcomes, while outcome trajectories often carry more substantive information in practice.…

Methodology · Statistics 2018-06-19 Huijuan Ma , Limin Peng , Haoda Fu

Recurrent Neural networks (RNN) have shown promising potential for learning dynamics of sequential data. However, artificial neural networks are known to exhibit poor robustness in presence of input noise, where the sequential architecture…

Machine Learning · Computer Science 2021-05-05 Arash Amini , Guangyi Liu , Nader Motee

The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the uncertainty in model parameters. Parameter uncertainty commonly occurs in many real-world RL applications due to simulator modeling errors,…

Machine Learning · Computer Science 2022-10-19 Kishan Panaganti , Zaiyan Xu , Dileep Kalathil , Mohammad Ghavamzadeh

While contemporary reinforcement learning research and applications have embraced policy gradient methods as the panacea of solving learning problems, value-based methods can still be useful in many domains as long as we can wrangle with…

Machine Learning · Computer Science 2024-07-16 Ashwin Ramaswamy , Ransalu Senanayake

Neural networks achieve outstanding accuracy in classification and regression tasks. However, understanding their behavior still remains an open challenge that requires questions to be addressed on the robustness, explainability and…

Machine Learning · Computer Science 2021-05-13 Anna-Kathrin Kopetzki , Stephan Günnemann

In this paper, we place deep Q-learning into a control-oriented perspective and study its learning dynamics with well-established techniques from robust control. We formulate an uncertain linear time-invariant model by means of the neural…

Machine Learning · Computer Science 2022-11-08 Balazs Varga , Balazs Kulcsar , Morteza Haghir Chehreghani
‹ Prev 1 2 3 10 Next ›