Related papers: Behaviour Suite for Reinforcement Learning

SuperSuit: Simple Microwrappers for Reinforcement Learning Environments

In reinforcement learning, wrappers are universally used to transform the information that passes between a model and an environment. Despite their ubiquity, no library exists with reasonable implementations of all popular preprocessing…

Machine Learning · Computer Science 2021-11-17 J. K. Terry , Benjamin Black , Ananth Hari

Pluralistic Behavior Suite: Stress-Testing Multi-Turn Adherence to Custom Behavioral Policies

Large language models (LLMs) are typically aligned to a universal set of safety and usage principles intended for broad public acceptability. Yet, real-world applications of LLMs often take place within organizational ecosystems shaped by…

Computation and Language · Computer Science 2025-11-10 Prasoon Varshney , Makesh Narsimhan Sreedhar , Liwei Jiang , Traian Rebedea , Christopher Parisien

RLInspect: An Interactive Visual Approach to Assess Reinforcement Learning Algorithm

Reinforcement Learning (RL) is a rapidly growing area of machine learning that finds its application in a broad range of domains, from finance and healthcare to robotics and gaming. Compared to other machine learning techniques, RL agents…

Artificial Intelligence · Computer Science 2024-11-14 Geetansh Kalra , Divye Singh , Justin Jose

Gym4ReaL: A Suite for Benchmarking Real-World Reinforcement Learning

In recent years, \emph{Reinforcement Learning} (RL) has made remarkable progress, achieving superhuman performance in a wide range of simulated environments. As research moves toward deploying RL in real-world applications, the field faces…

Machine Learning · Computer Science 2025-07-02 Davide Salaorni , Vincenzo De Paola , Samuele Delpero , Giovanni Dispoto , Paolo Bonetti , Alessio Russo , Giuseppe Calcagno , Francesco Trovò , Matteo Papini , Alberto Maria Metelli , Marco Mussi , Marcello Restelli

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi , Amir Hossein Moosavi , Dariush Ebrahimi

RRLS : Robust Reinforcement Learning Suite

Robust reinforcement learning is the problem of learning control policies that provide optimal worst-case performance against a span of adversarial environments. It is a crucial ingredient for deploying algorithms in real-world scenarios…

Machine Learning · Computer Science 2024-06-13 Adil Zouitine , David Bertoin , Pierre Clavier , Matthieu Geist , Emmanuel Rachelson

Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Reinforcement learning (RL) agents improve through trial-and-error, but when reward is sparse and the agent cannot discover successful action sequences, learning stagnates. This has been a notable problem in training deep RL agents to…

Artificial Intelligence · Computer Science 2018-02-27 Evan Zheran Liu , Kelvin Guu , Panupong Pasupat , Tianlin Shi , Percy Liang

Resilient Computing with Reinforcement Learning on a Dynamical System: Case Study in Sorting

Robots and autonomous agents often complete goal-based tasks with limited resources, relying on imperfect models and sensor measurements. In particular, reinforcement learning (RL) and feedback control can be used to help a robot achieve a…

Artificial Intelligence · Computer Science 2018-09-26 Aleksandra Faust , James B. Aimone , Conrad D. James , Lydia Tapia

Introduction to Reinforcement Learning

Reinforcement Learning (RL), a subfield of Artificial Intelligence (AI), focuses on training agents to make decisions by interacting with their environment to maximize cumulative rewards. This paper provides an overview of RL, covering its…

Artificial Intelligence · Computer Science 2024-12-04 Majid Ghasemi , Dariush Ebrahimi

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Reinforcement learning provides a general framework for flexible decision making and control, but requires extensive data collection for each new task that an agent needs to learn. In other machine learning fields, such as natural language…

Machine Learning · Computer Science 2020-11-20 Avi Singh , Huihan Liu , Gaoyue Zhou , Albert Yu , Nicholas Rhinehart , Sergey Levine

LExCI: A Framework for Reinforcement Learning with Embedded Systems

Advances in artificial intelligence (AI) have led to its application in many areas of everyday life. In the context of control engineering, reinforcement learning (RL) represents a particularly promising approach as it is centred around the…

Machine Learning · Computer Science 2024-06-28 Kevin Badalian , Lucas Koch , Tobias Brinkmann , Mario Picerno , Marius Wegener , Sung-Yong Lee , Jakob Andert

UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning

The recent DeepSeek-R1 has showcased the emergence of reasoning capabilities in LLMs through reinforcement learning (RL) with rule-based rewards. Despite its success in language models, its application in multi-modal domains, particularly…

Artificial Intelligence · Computer Science 2025-05-27 Zhengxi Lu , Yuxiang Chai , Yaxuan Guo , Xi Yin , Liang Liu , Hao Wang , Han Xiao , Shuai Ren , Guanjing Xiong , Hongsheng Li

Reusable Test Suites for Reinforcement Learning

Reinforcement learning (RL) agents show great promise in solving sequential decision-making tasks. However, validating the reliability and performance of the agent policies' behavior for deployment remains challenging. Most reinforcement…

Software Engineering · Computer Science 2025-09-01 Jørn Eirik Betten , Quentin Mazouni , Dennis Gross , Pedro Lind , Helge Spieker

An empirical investigation of the challenges of real-world reinforcement learning

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is beginning to show some successes in real-world scenarios. However, much of the research advances in RL are hard to leverage in real-world systems due…

Machine Learning · Computer Science 2021-03-05 Gabriel Dulac-Arnold , Nir Levine , Daniel J. Mankowitz , Jerry Li , Cosmin Paduraru , Sven Gowal , Todd Hester

DeepMind Control Suite

The DeepMind Control Suite is a set of continuous control tasks with a standardised structure and interpretable rewards, intended to serve as performance benchmarks for reinforcement learning agents. The tasks are written in Python and…

Artificial Intelligence · Computer Science 2018-01-03 Yuval Tassa , Yotam Doron , Alistair Muldal , Tom Erez , Yazhe Li , Diego de Las Casas , David Budden , Abbas Abdolmaleki , Josh Merel , Andrew Lefrancq , Timothy Lillicrap , Martin Riedmiller

CompoSuite: A Compositional Reinforcement Learning Benchmark

We present CompoSuite, an open-source simulated robotic manipulation benchmark for compositional multi-task reinforcement learning (RL). Each CompoSuite task requires a particular robot arm to manipulate one individual object to achieve a…

Machine Learning · Computer Science 2022-07-12 Jorge A. Mendez , Marcel Hussing , Meghna Gummadi , Eric Eaton

Progress Constraints for Reinforcement Learning in Behavior Trees

Behavior Trees (BTs) provide a structured and reactive framework for decision-making, commonly used to switch between sub-controllers based on environmental conditions. Reinforcement Learning (RL), on the other hand, can learn near-optimal…

Artificial Intelligence · Computer Science 2026-02-12 Finn Rietz , Mart Kartašev , Petter Ögren , Johannes A. Stork

GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Existing efforts in building Graphical User Interface (GUI) agents largely rely on the training paradigm of supervised fine-tuning on Large Vision-Language Models (LVLMs). However, this approach not only demands extensive amounts of…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Run Luo , Lu Wang , Wanwei He , Longze Chen , Jiaming Li , Xiaobo Xia

WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection

Search agents have achieved significant advancements in enabling intelligent information retrieval and decision-making within interactive environments. Although reinforcement learning has been employed to train agentic models capable of…

Computation and Language · Computer Science 2025-10-22 Guanzhong He , Zhen Yang , Jinxin Liu , Bin Xu , Lei Hou , Juanzi Li

WebSuite: Systematically Evaluating Why Web Agents Fail

We describe WebSuite, the first diagnostic benchmark for generalist web agents, designed to systematically evaluate why agents fail. Advances in AI have led to the rise of numerous web agents that autonomously operate a browser to complete…

Software Engineering · Computer Science 2024-06-05 Eric Li , Jim Waldo