Related papers: EnvPool: A Highly Parallel Reinforcement Learning …

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL) is bottlenecked by two challenges: the lack of scalable, robust execution environments and the scarcity of realistic training data that captures…

Computation and Language · Computer Science 2026-05-19 Minrui Xu , Zilin Wang , Mengyi DENG , Zhiwei Li , Zhicheng Yang , Xiao Zhu , Yinhong Liu , Boyu Zhu , Baiyu Huang , Chao Chen , Heyuan Deng , Fei Mi , Lifeng Shang , Xingshan Zeng , Zhijiang Guo

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Deep reinforcement learning (RL) is a powerful framework to train decision-making models in complex environments. However, RL can be slow as it requires repeated interaction with a simulation of the environment. In particular, there are key…

Machine Learning · Computer Science 2021-10-12 Tian Lan , Sunil Srinivasa , Huan Wang , Stephan Zheng

HetRL: Efficient Reinforcement Learning for LLMs in Heterogeneous Environments

As large language models (LLMs) continue to scale and new GPUs are released even more frequently, there is an increasing demand for LLM post-training in heterogeneous environments to fully leverage underutilized mid-range or…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-14 Yongjun He , Shuai Zhang , Jiading Gai , Xiyuan Zhang , Boran Han , Bernie Wang , Huzefa Rangwala , George Karypis

This letter compares the performance of four different, popular simulation environments for robotics and reinforcement learning (RL) through a series of benchmarks. The benchmarked scenarios are designed carefully with current industrial…

Robotics · Computer Science 2021-03-09 Marian Körber , Johann Lange , Stephan Rediske , Simon Steinmann , Roland Glück

Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning

We propose Pgx, a suite of board game reinforcement learning (RL) environments written in JAX and optimized for GPU/TPU accelerators. By leveraging JAX's auto-vectorization and parallelization over accelerators, Pgx can efficiently scale to…

Artificial Intelligence · Computer Science 2024-01-17 Sotetsu Koyamada , Shinri Okano , Soichiro Nishimori , Yu Murata , Keigo Habara , Haruka Kita , Shin Ishii

Automatic Generation of High-Performance RL Environments

Translating complex reinforcement learning (RL) environments into high-performance implementations has traditionally required months of specialized engineering. We present a closed-loop methodology that produces equivalent high-performance…

Machine Learning · Computer Science 2026-05-19 Seth Karten , Rahul Dev Appapogu , Chi Jin

MemPool: A Scalable Manycore Architecture with a Low-Latency Shared L1 Memory

Shared L1 memory clusters are a common architectural pattern (e.g., in GPGPUs) for building efficient and flexible multi-processing-element (PE) engines. However, it is a common belief that these tightly-coupled clusters would not scale…

Hardware Architecture · Computer Science 2023-11-29 Samuel Riedel , Matheus Cavalcante , Renzo Andri , Luca Benini

JaxMARL: Multi-Agent RL Environments and Algorithms in JAX

Benchmarks are crucial in the development of machine learning algorithms, with available environments significantly influencing reinforcement learning (RL) research. Traditionally, RL environments run on the CPU, which limits their…

Machine Learning · Computer Science 2024-11-05 Alexander Rutherford , Benjamin Ellis , Matteo Gallici , Jonathan Cook , Andrei Lupu , Gardar Ingvarsson , Timon Willi , Ravi Hammond , Akbir Khan , Christian Schroeder de Witt , Alexandra Souly , Saptarashmi Bandyopadhyay , Mikayel Samvelyan , Minqi Jiang , Robert Tjarko Lange , Shimon Whiteson , Bruno Lacerda , Nick Hawes , Tim Rocktaschel , Chris Lu , Jakob Nicolaus Foerster

EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning

Evolutionary Reinforcement Learning (EvoRL) has emerged as a promising approach to overcoming the limitations of traditional reinforcement learning (RL) by integrating the Evolutionary Computation (EC) paradigm with RL. However, the…

Neural and Evolutionary Computing · Computer Science 2025-07-22 Bowen Zheng , Ran Cheng , Kay Chen Tan

Evolutionary Reinforcement Learning: A Survey

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize cumulative rewards through interactions with environments. The integration of RL with deep learning has recently resulted in impressive achievements…

Neural and Evolutionary Computing · Computer Science 2023-08-31 Hui Bai , Ran Cheng , Yaochu Jin

Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Reinforcement learning (RL) post-training has proven effective at unlocking reasoning, self-reflection, and tool-use capabilities in large language models. As models extend to omni-modal inputs and agentic multi-turn workflows, RL training…

Computation and Language · Computer Science 2026-04-15 Liujie Zhang , Benzhe Ning , Rui Yang , Xiaoyan Yu , Jiaxing Li , Lumeng Wu , Jia Liu , Minghao Li , Weihang Chen , Weiqi Hu , Lei Zhang

Spreeze: High-Throughput Parallel Reinforcement Learning Framework

The promotion of large-scale applications of reinforcement learning (RL) requires efficient training computation. While existing parallel RL frameworks encompass a variety of RL algorithms and parallelization techniques, the excessively…

Machine Learning · Computer Science 2023-12-12 Jing Hou , Guang Chen , Ruiqi Zhang , Zhijun Li , Shangding Gu , Changjun Jiang

SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To…

Machine Learning · Computer Science 2024-05-08 Kailash Gogineni , Sai Santosh Dayapule , Juan Gómez-Luna , Karthikeya Gogineni , Peng Wei , Tian Lan , Mohammad Sadrosadati , Onur Mutlu , Guru Venkataramani

Open-Sourced Reinforcement Learning Environments for Surgical Robotics

Reinforcement Learning (RL) is a machine learning framework for artificially intelligent systems to solve a variety of complex problems. Recent years has seen a surge of successes solving challenging games and smaller domain problems,…

Robotics · Computer Science 2020-01-28 Florian Richter , Ryan K. Orosco , Michael C. Yip

Efficient Parallel Reinforcement Learning Framework using the Reactor Model

Parallel Reinforcement Learning (RL) frameworks are essential for mapping RL workloads to multiple computational resources, allowing for faster generation of samples, estimation of values, and policy improvement. These computational…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-06 Jacky Kwok , Marten Lohstroh , Edward A. Lee

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process

We present a novel reinforcement learning (RL) environment designed to both optimize industrial sorting systems and study agent behavior in evolving spaces. In simulating material flow within a sorting process our environment follows the…

Machine Learning · Computer Science 2025-03-14 Tom Maus , Nico Zengeler , Tobias Glasmachers

EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models

Reinforcement learning (RL) has become a pivotal component of large language model (LLM) post-training, and agentic RL extends this paradigm to operate as agents through multi-turn interaction and tool use. Scaling such systems exposes two…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-08 Zheyue Tan , Mustapha Abdullahi , Tuo Shi , Huining Yuan , Zelai Xu , Chao Yu , Boxun Li , Bo Zhao

FCPO: Federated Continual Policy Optimization for Real-Time High-Throughput Edge Video Analytics

The growing complexity of Edge Video Analytics (EVA) facilitates new kind of intelligent applications, but creates challenges in real-time inference serving systems. State-of-the-art (SOTA) scheduling systems optimize global workload…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-25 Lucas Liebe , Thanh-Tung Nguyen , Dongman Lee

Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation

Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the physical world. Gathering data for RL is known to be a laborious task, and real-world experiments can be risky. Simulators facilitate the…

Robotics · Computer Science 2024-10-28 Jack Saunders , Sajad Saeedi , Wenbin Li