Related papers: Composing Efficient, Robust Tests for Policy Selec…

Robust Policy Learning over Multiple Uncertainty Sets

Reinforcement learning (RL) agents need to be robust to variations in safety-critical environments. While system identification methods provide a way to infer the variation from online experience, they can fail in settings where fast…

Machine Learning · Computer Science 2022-03-07 Annie Xie, Shagun Sodhani, Chelsea Finn, Joelle Pineau, Amy Zhang

Learning Diverse Risk Preferences in Population-based Self-play

Among the great successes of Reinforcement Learning (RL), self-play algorithms play an essential role in solving competitive games. Current self-play algorithms optimize the agent to maximize expected win-rates against its current or…

Machine Learning · Computer Science 2023-12-18 Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao

Robust Subtask Learning for Compositional Generalization

Compositional reinforcement learning is a promising approach for training policies to perform complex long-horizon tasks. Typically, a high-level task is decomposed into a sequence of subtasks and a separate policy is trained to perform…

Machine Learning · Computer Science 2023-06-09 Kishor Jothimurugan, Steve Hsu, Osbert Bastani, Rajeev Alur

Learning Robust Controllers Via Probabilistic Model-Based Policy Search

Model-based Reinforcement Learning estimates the true environment through a world model in order to approximate the optimal policy. This family of algorithms usually benefits from better sample efficiency than their model-free counterparts.…

Machine Learning · Computer Science 2021-10-27 Valentin Charvet, Bjørn Sand Jensen, Roderick Murray-Smith

Reusable Test Suites for Reinforcement Learning

Reinforcement learning (RL) agents show great promise in solving sequential decision-making tasks. However, validating the reliability and performance of the agent policies' behavior for deployment remains challenging. Most reinforcement…

Software Engineering · Computer Science 2025-09-01 Jørn Eirik Betten, Quentin Mazouni, Dennis Gross, Pedro Lind, Helge Spieker

Robust Predictable Control

Many of the challenges facing today's reinforcement learning (RL) algorithms, such as robustness, generalization, transfer, and computational efficiency are closely related to compression. Prior work has convincingly argued why minimizing…

Machine Learning · Computer Science 2021-09-08 Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study

In recent years, on-policy reinforcement learning (RL) has been successfully applied to many different continuous control tasks. While RL algorithms are often conceptually simple, their state-of-the-art implementations take numerous low-…

Machine Learning · Computer Science 2020-06-11 Marcin Andrychowicz, Anton Raichuk, Piotr Stańczyk, Manu Orsini, Sertan Girgin, Raphael Marinier, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Marcin Michalski, Sylvain Gelly, Olivier Bachem

Constrained Reinforcement Learning Under Model Mismatch

Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied…

Machine Learning · Computer Science 2024-05-06 Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

Robust Algorithmic Collusion

This paper develops a formal framework to assess policies of learning algorithms in economic games. We investigate whether reinforcement-learning agents with collusive pricing policies can successfully extrapolate collusive behavior from…

General Economics · Economics 2022-01-06 Nicolas Eschenbaum, Filip Mellgren, Philipp Zahn

Policy Gradient Method For Robust Reinforcement Learning

This paper develops the first policy gradient method with global optimality guarantee and complexity analysis for robust reinforcement learning under model mismatch. Robust reinforcement learning is to learn a policy robust to model…

Machine Learning · Computer Science 2022-05-17 Yue Wang, Shaofeng Zou

Safe Model-based Reinforcement Learning with Stability Guarantees

Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world…

Machine Learning · Statistics 2017-11-15 Felix Berkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause

Robust Optimization using Machine Learning for Uncertainty Sets

Our goal is to build robust optimization problems for making decisions based on complex data from the past. In robust optimization (RO) generally, the goal is to create a policy for decision-making that is robust to our uncertainty about…

Optimization and Control · Mathematics 2014-07-07 Theja Tulabandhula, Cynthia Rudin

Learning from Scarce Experience

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each change of the target policy, its value is estimated from the…

Artificial Intelligence · Computer Science 2007-05-23 Leonid Peshkin, Christian R. Shelton

Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and…

Machine Learning · Statistics 2025-04-02 Eli Ben-Michael, D. James Greiner, Kosuke Imai, Zhichao Jiang

Robust Reinforcement Learning with Dynamic Distortion Risk Measures

In a reinforcement learning (RL) setting, the agent's optimal strategy heavily depends on her risk preferences and the underlying model dynamics of the training environment. These two aspects influence the agent's ability to make…

Machine Learning · Computer Science 2025-09-23 Anthony Coache, Sebastian Jaimungal

Robust Reinforcement Learning through Efficient Adversarial Herding

Although reinforcement learning (RL) is considered the gold standard for policy design, it may not always provide a robust solution in various scenarios. This can result in severe performance degradation when the environment is exposed to…

Machine Learning · Computer Science 2023-06-14 Juncheng Dong, Hao-Lun Hsu, Qitong Gao, Vahid Tarokh, Miroslav Pajic

Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

In high-stake scenarios like medical treatment and auto-piloting, it's risky or even infeasible to collect online experimental data to train the agent. Simulation-based training can alleviate this issue, but may suffer from its inherent…

Machine Learning · Computer Science 2022-03-16 Jialian Li, Tongzheng Ren, Dong Yan, Hang Su, Jun Zhu

Improving Fictitious Play Reinforcement Learning with Expanding Models

Fictitious play with reinforcement learning is a general and effective framework for zero-sum games. However, using the current deep neural network models, the implementation of fictitious play faces crucial challenges. Neural network model…

Machine Learning · Computer Science 2019-12-02 Rong-Jun Qin, Jing-Cheng Pang, Yang Yu

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

In order for reinforcement learning techniques to be useful in real-world decision making processes, they must be able to produce robust performance from limited data. Deep policy optimization methods have achieved impressive results on…

Machine Learning · Computer Science 2020-12-22 James Queeney, Ioannis Ch. Paschalidis, Christos G. Cassandras

Robust Safe Reinforcement Learning under Adversarial Disturbances

Safety is a primary concern when applying reinforcement learning to real-world control tasks, especially in the presence of external disturbances. However, existing safe reinforcement learning algorithms rarely account for external…

Machine Learning · Computer Science 2023-10-12 Zeyang Li, Chuxiong Hu, Shengbo Eben Li, Jia Cheng, Yunan Wang