Related papers: Learning Robust Options

Robust Reinforcement Learning using Offline Data

The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the uncertainty in model parameters. Parameter uncertainty commonly occurs in many real-world RL applications due to simulator modeling errors,…

Machine Learning · Computer Science 2022-10-19 Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh

Reliable Policy Iteration: Performance Robustness Across Architecture and Environment Perturbations

In a recent work, we proposed Reliable Policy Iteration (RPI), that restores policy iteration's monotonicity-of-value-estimates property to the function approximation setting. Here, we assess the robustness of RPI's empirical performance on…

Artificial Intelligence · Computer Science 2025-12-16 S. R. Eshwar, Aniruddha Mukherjee, Kintan Saha, Krishna Agarwal, Gugan Thoppe, Aditya Gopalan, Gal Dalal

Robust Policy Learning over Multiple Uncertainty Sets

Reinforcement learning (RL) agents need to be robust to variations in safety-critical environments. While system identification methods provide a way to infer the variation from online experience, they can fail in settings where fast…

Machine Learning · Computer Science 2022-03-07 Annie Xie, Shagun Sodhani, Chelsea Finn, Joelle Pineau, Amy Zhang

Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning

Recently, it has been widely known that deep neural networks are highly vulnerable and easily broken by adversarial attacks. To mitigate the adversarial vulnerability, many defense algorithms have been proposed. Recently, to improve…

Computer Vision and Pattern Recognition · Computer Science 2023-06-28 Hong Joo Lee, Yong Man Ro

Model-Free Robust $\phi$-Divergence Reinforcement Learning Using Both Offline and Online Data

The robust $\phi$-regularized Markov Decision Process (RRMDP) framework focuses on designing control policies that are robust against parameter uncertainties due to mismatches between the simulator (nominal) model and real-world settings.…

Machine Learning · Computer Science 2024-05-10 Kishan Panaganti, Adam Wierman, Eric Mazumdar

Action Robust Reinforcement Learning via Optimal Adversary Aware Policy Optimization

Reinforcement Learning (RL) has achieved remarkable success in sequential decision tasks. However, recent studies have revealed the vulnerability of RL policies to different perturbations, raising concerns about their effectiveness and…

Machine Learning · Computer Science 2025-07-08 Buqing Nie, Yangqing Fu, Jingtian Ji, Yue Gao

Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization

The Robust Regularized Markov Decision Process (RRMDP) is proposed to learn policies robust to dynamics shifts by adding regularization to the transition dynamics in the value function. Existing methods mostly use unstructured…

Machine Learning · Computer Science 2025-11-03 Cheng Tang, Zhishuai Liu, Pan Xu

Robust Risk-Aware Option Hedging

The objectives of option hedging/trading extend beyond mere protection against downside risks, with a desire to seek gains also driving agent's strategies. In this study, we showcase the potential of robust risk-aware reinforcement learning…

Computational Finance · Quantitative Finance 2023-12-27 David Wu, Sebastian Jaimungal

Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation

This paper studies the robustness of reinforcement learning algorithms to errors in the learning process. Specifically, we revisit the benchmark problem of discrete-time linear quadratic regulation (LQR) and study the long-standing open…

Optimization and Control · Mathematics 2021-03-16 Bo Pang, Zhong-Ping Jiang

Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees

This paper addresses the problem of model-free reinforcement learning for Robust Markov Decision Process (RMDP) with large state spaces. The goal of the RMDP framework is to find a policy that is robust against the parameter uncertainties…

Machine Learning · Computer Science 2021-02-15 Kishan Panaganti, Dileep Kalathil

A Bayesian Approach to Robust Reinforcement Learning

Robust Markov Decision Processes (RMDPs) intend to ensure robustness with respect to changing or adversarial system behavior. In this framework, transitions are modeled as arbitrary elements of a known and properly structured uncertainty…

Machine Learning · Computer Science 2019-07-25 Esther Derman, Daniel Mankowitz, Timothy Mann, Shie Mannor

Robust Optimization using Machine Learning for Uncertainty Sets

Our goal is to build robust optimization problems for making decisions based on complex data from the past. In robust optimization (RO) generally, the goal is to create a policy for decision-making that is robust to our uncertainty about…

Optimization and Control · Mathematics 2014-07-07 Theja Tulabandhula, Cynthia Rudin

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

In order for reinforcement learning techniques to be useful in real-world decision making processes, they must be able to produce robust performance from limited data. Deep policy optimization methods have achieved impressive results on…

Machine Learning · Computer Science 2020-12-22 James Queeney, Ioannis Ch. Paschalidis, Christos G. Cassandras

Robust Reinforcement Learning with Distributional Risk-averse formulation

Robust Reinforcement Learning tries to make predictions more robust to changes in the dynamics or rewards of the system. This problem is particularly important when the dynamics and rewards of the environment are estimated from the data. In…

Machine Learning · Computer Science 2022-06-15 Pierre Clavier, Stéphanie Allassonière, Erwan Le Pennec

Robust Regularized Policy Iteration under Transition Uncertainty

Offline reinforcement learning (RL) enables data-efficient and safe policy learning without online exploration, but its performance often degrades under distribution shift. The learned policy may visit out-of-distribution state-action pairs…

Artificial Intelligence · Computer Science 2026-03-17 Hongqiang Lin, Zhenghui Fu, Weihao Tang, Pengfei Wang, Yiding Sun, Qixian Huang, Dongxu Zhang

Robust Offline Reinforcement Learning for Non-Markovian Decision Processes

Distributionally robust offline reinforcement learning (RL) aims to find a policy that performs the best under the worst environment within an uncertainty set using an offline dataset collected from a nominal model. While recent advances in…

Machine Learning · Computer Science 2025-01-07 Ruiquan Huang, Yingbin Liang, Jing Yang

Policy Gradient Method For Robust Reinforcement Learning

This paper develops the first policy gradient method with global optimality guarantee and complexity analysis for robust reinforcement learning under model mismatch. Robust reinforcement learning is to learn a policy robust to model…

Machine Learning · Computer Science 2022-05-17 Yue Wang, Shaofeng Zou

Robust Reinforcement Learning for Continuous Control with Model Misspecification

We provide a framework for incorporating robustness -- to perturbations in the transition dynamics which we refer to as model misspecification -- into continuous control Reinforcement Learning (RL) algorithms. We specifically focus on…

Machine Learning · Computer Science 2020-02-12 Daniel J. Mankowitz, Nir Levine, Rae Jeong, Yuanyuan Shi, Jackie Kay, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy Mann, Todd Hester, Martin Riedmiller

Provably Sample-Efficient Robust Reinforcement Learning with Average Reward

Robust reinforcement learning (RL) under the average-reward criterion is essential for long-term decision-making, particularly when the environment may differ from its specification. However, a significant gap exists in understanding the…

Machine Learning · Computer Science 2025-09-26 Zachary Roch, Chi Zhang, George Atia, Yue Wang

Robustness Reprogramming for Representation Learning

This work tackles an intriguing and fundamental open challenge in representation learning: Given a well-trained deep learning model, can it be reprogrammed to enhance its robustness against adversarial or noisy input perturbations without…

Machine Learning · Computer Science 2024-10-08 Zhichao Hou, MohamadAli Torkamani, Hamid Krim, Xiaorui Liu