Related papers: Probabilistic Performance Guarantees for Multi-Tas…

R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning

In this work, we address the problem of determining reliable policies in reinforcement learning (RL), with a focus on optimization under uncertainty and the need for performance guarantees. While classical RL algorithms aim at maximizing…

Machine Learning · Computer Science 2025-10-22 Nadir Farhi

Safe Reinforcement Learning with Probabilistic Guarantees Satisfying Temporal Logic Specifications in Continuous Action Spaces

Vanilla Reinforcement Learning (RL) can efficiently solve complex tasks but does not provide any guarantees on system behavior. To bridge this gap, we propose a three-step safe RL procedure for continuous action spaces that provides…

Robotics · Computer Science 2023-09-29 Hanna Krasowski, Prithvi Akella, Aaron D. Ames, Matthias Althoff

Reinforcement Learning with Probabilistic Guarantees for Autonomous Driving

Designing reliable decision strategies for autonomous urban driving is challenging. Reinforcement learning (RL) has been used to automatically derive suitable behavior in uncertain environments, but it does not provide any guarantee on the…

Robotics · Computer Science 2019-05-31 Maxime Bouton, Jesper Karlsson, Alireza Nakhaei, Kikuo Fujimura, Mykel J. Kochenderfer, Jana Tumova

Multi-Task Policy Search

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for…

Machine Learning · Statistics 2014-02-13 Marc Peter Deisenroth, Peter Englert, Jan Peters, Dieter Fox

Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning

In the field of reinforcement learning there has been recent progress towards safety and high-confidence bounds on policy performance. However, to our knowledge, no practical methods exist for determining high-confidence policy performance…

Artificial Intelligence · Computer Science 2018-06-26 Daniel S. Brown, Scott Niekum

Provably Good Batch Reinforcement Learning Without Great Exploration

Batch reinforcement learning (RL) is important to apply RL algorithms to many high stakes tasks. Doing batch RL in a way that yields a reliable new policy in large domains is challenging: a new decision policy may visit states and actions…

Machine Learning · Computer Science 2020-07-23 Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Recovery Guarantees for Continual Learning of Dependent Tasks: Memory, Data-Dependent Regularization, and Data-Dependent Weights

Continual learning (CL) is concerned with learning multiple tasks sequentially without forgetting previously learned tasks. Despite substantial empirical advances over recent years, the theoretical development of CL remains in its infancy.…

Machine Learning · Computer Science 2026-04-27 Liangzu Peng, Uday Kiran Reddy Tadipatri, Ziqing Xu, Eric Eaton, René Vidal

A Note on High-Probability versus In-Expectation Guarantees of Generalization Bounds in Machine Learning

Statistical machine learning theory often tries to give generalization guarantees of machine learning models. Those models naturally underlie some fluctuation, as they are based on a data sample. If we were unlucky, and gathered a sample…

Machine Learning · Computer Science 2022-11-21 Alexander Mey

Multi-CALF: A Policy Combination Approach with Statistical Guarantees

We introduce Multi-CALF, an algorithm that intelligently combines reinforcement learning policies based on their relative value improvements. Our approach integrates a standard RL policy with a theoretically-backed alternative policy,…

Machine Learning · Computer Science 2025-05-20 Georgiy Malaniya, Anton Bolychev, Grigory Yaremenko, Anastasia Krasnaya, Pavel Osinenko

Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach

In meta reinforcement learning (meta RL), an agent learns from a set of training tasks how to quickly solve a new task, drawn from the same task distribution. The optimal meta RL policy, a.k.a. the Bayes-optimal behavior, is well defined,…

Machine Learning · Computer Science 2024-04-01 Zohar Rimon, Aviv Tamar, Gilad Adler

Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty

Robust reinforcement learning (RL) aims to find a policy that optimizes the worst-case performance in the face of uncertainties. In this paper, we focus on action robust RL with the probabilistic policy execution uncertainty, in which,…

Machine Learning · Computer Science 2023-07-21 Guanlin Liu, Zhihan Zhou, Han Liu, Lifeng Lai

Towards Safe Continuing Task Reinforcement Learning

Safety is a critical feature of controller design for physical systems. When designing control policies, several approaches to guarantee this aspect of autonomy have been proposed, such as robust controllers or control barrier functions.…

Machine Learning · Computer Science 2021-02-26 Miguel Calvo-Fullana, Luiz F. O. Chamon, Santiago Paternain

Constrained Meta Reinforcement Learning with Provable Test-Time Safety

Meta reinforcement learning (RL) allows agents to leverage experience across a distribution of tasks on which the agent can train at will, enabling faster learning of optimal policies on new test tasks. Despite its success in improving…

Machine Learning · Computer Science 2026-05-27 Tingting Ni, Maryam Kamgarpour

Robust Policy Optimization with Baseline Guarantees

Our goal is to compute a policy that guarantees improved return over a baseline policy even when the available MDP model is inaccurate. The inaccurate model may be constructed, for example, by system identification techniques when the true…

Optimization and Control · Mathematics 2015-06-17 Yinlam Chow, Marek Petrik, Mohammad Ghavamzadeh

Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees

This research delves deeply into Meta Reinforcement Learning (Meta RL) through an exploration focusing on defining generalization limits and ensuring convergence. By employing an approach this article introduces an innovative theoretical…

Machine Learning · Computer Science 2024-05-24 Cangqing Wang, Mingxiu Sui, Dan Sun, Zecheng Zhang, Yan Zhou

Supervised Learning with Evolving Tasks and Performance Guarantees

Multiple supervised learning scenarios are composed by a sequence of classification tasks. For instance, multi-task learning and continual learning aim to learn a sequence of tasks that is either fixed or grows over time. Existing…

Machine Learning · Statistics 2025-01-10 Verónica Álvarez, Santiago Mazuelas, Jose A. Lozano

Reliable Off-policy Evaluation for Reinforcement Learning

In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative reward of a target policy using logged trajectory data generated from a different behavior policy, without execution of the target policy.…

Machine Learning · Computer Science 2022-11-04 Jie Wang, Rui Gao, Hongyuan Zha

Safe Reinforcement Learning via Confidence-Based Filters

Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constraints for nominal policies…

Machine Learning · Computer Science 2022-07-05 Sebastian Curi, Armin Lederer, Sandra Hirche, Andreas Krause

Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models

Safety is one of the biggest concerns to applying reinforcement learning (RL) to the physical world. In its core part, it is challenging to ensure RL agents persistently satisfy a hard state constraint without white-box or black-box…

Robotics · Computer Science 2023-10-19 Weiye Zhao, Tairan He, Changliu Liu

Stability-certified reinforcement learning: A control-theoretic perspective

We investigate the important problem of certifying stability of reinforcement learning policies when interconnected with nonlinear dynamical systems. We show that by regulating the input-output gradients of policies, strong guarantees of…

Systems and Control · Computer Science 2018-10-30 Ming Jin, Javad Lavaei