English
Related papers

Related papers: EnvPool: A Highly Parallel Reinforcement Learning …

200 papers

Equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL) is bottlenecked by two challenges: the lack of scalable, robust execution environments and the scarcity of realistic training data that captures…

Deep reinforcement learning (RL) is a powerful framework to train decision-making models in complex environments. However, RL can be slow as it requires repeated interaction with a simulation of the environment. In particular, there are key…

Machine Learning · Computer Science 2021-10-12 Tian Lan , Sunil Srinivasa , Huan Wang , Stephan Zheng

As large language models (LLMs) continue to scale and new GPUs are released even more frequently, there is an increasing demand for LLM post-training in heterogeneous environments to fully leverage underutilized mid-range or…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-14 Yongjun He , Shuai Zhang , Jiading Gai , Xiyuan Zhang , Boran Han , Bernie Wang , Huzefa Rangwala , George Karypis

This letter compares the performance of four different, popular simulation environments for robotics and reinforcement learning (RL) through a series of benchmarks. The benchmarked scenarios are designed carefully with current industrial…

Robotics · Computer Science 2021-03-09 Marian Körber , Johann Lange , Stephan Rediske , Simon Steinmann , Roland Glück

We propose Pgx, a suite of board game reinforcement learning (RL) environments written in JAX and optimized for GPU/TPU accelerators. By leveraging JAX's auto-vectorization and parallelization over accelerators, Pgx can efficiently scale to…

Artificial Intelligence · Computer Science 2024-01-17 Sotetsu Koyamada , Shinri Okano , Soichiro Nishimori , Yu Murata , Keigo Habara , Haruka Kita , Shin Ishii

Translating complex reinforcement learning (RL) environments into high-performance implementations has traditionally required months of specialized engineering. We present a closed-loop methodology that produces equivalent high-performance…

Machine Learning · Computer Science 2026-05-19 Seth Karten , Rahul Dev Appapogu , Chi Jin

Shared L1 memory clusters are a common architectural pattern (e.g., in GPGPUs) for building efficient and flexible multi-processing-element (PE) engines. However, it is a common belief that these tightly-coupled clusters would not scale…

Hardware Architecture · Computer Science 2023-11-29 Samuel Riedel , Matheus Cavalcante , Renzo Andri , Luca Benini

Benchmarks are crucial in the development of machine learning algorithms, with available environments significantly influencing reinforcement learning (RL) research. Traditionally, RL environments run on the CPU, which limits their…

Evolutionary Reinforcement Learning (EvoRL) has emerged as a promising approach to overcoming the limitations of traditional reinforcement learning (RL) by integrating the Evolutionary Computation (EC) paradigm with RL. However, the…

Neural and Evolutionary Computing · Computer Science 2025-07-22 Bowen Zheng , Ran Cheng , Kay Chen Tan

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize cumulative rewards through interactions with environments. The integration of RL with deep learning has recently resulted in impressive achievements…

Neural and Evolutionary Computing · Computer Science 2023-08-31 Hui Bai , Ran Cheng , Yaochu Jin

Reinforcement learning (RL) post-training has proven effective at unlocking reasoning, self-reflection, and tool-use capabilities in large language models. As models extend to omni-modal inputs and agentic multi-turn workflows, RL training…

Computation and Language · Computer Science 2026-04-15 Liujie Zhang , Benzhe Ning , Rui Yang , Xiaoyan Yu , Jiaxing Li , Lumeng Wu , Jia Liu , Minghao Li , Weihang Chen , Weiqi Hu , Lei Zhang

The promotion of large-scale applications of reinforcement learning (RL) requires efficient training computation. While existing parallel RL frameworks encompass a variety of RL algorithms and parallelization techniques, the excessively…

Machine Learning · Computer Science 2023-12-12 Jing Hou , Guang Chen , Ruiqi Zhang , Zhijun Li , Shangding Gu , Changjun Jiang

Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To…

Reinforcement Learning (RL) is a machine learning framework for artificially intelligent systems to solve a variety of complex problems. Recent years has seen a surge of successes solving challenging games and smaller domain problems,…

Robotics · Computer Science 2020-01-28 Florian Richter , Ryan K. Orosco , Michael C. Yip

Parallel Reinforcement Learning (RL) frameworks are essential for mapping RL workloads to multiple computational resources, allowing for faster generation of samples, estimation of values, and policy improvement. These computational…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-06 Jacky Kwok , Marten Lohstroh , Edward A. Lee

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

We present a novel reinforcement learning (RL) environment designed to both optimize industrial sorting systems and study agent behavior in evolving spaces. In simulating material flow within a sorting process our environment follows the…

Machine Learning · Computer Science 2025-03-14 Tom Maus , Nico Zengeler , Tobias Glasmachers

Reinforcement learning (RL) has become a pivotal component of large language model (LLM) post-training, and agentic RL extends this paradigm to operate as agents through multi-turn interaction and tool use. Scaling such systems exposes two…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-08 Zheyue Tan , Mustapha Abdullahi , Tuo Shi , Huining Yuan , Zelai Xu , Chao Yu , Boxun Li , Bo Zhao

The growing complexity of Edge Video Analytics (EVA) facilitates new kind of intelligent applications, but creates challenges in real-time inference serving systems. State-of-the-art (SOTA) scheduling systems optimize global workload…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-25 Lucas Liebe , Thanh-Tung Nguyen , Dongman Lee

Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the physical world. Gathering data for RL is known to be a laborious task, and real-world experiments can be risky. Simulators facilitate the…

Robotics · Computer Science 2024-10-28 Jack Saunders , Sajad Saeedi , Wenbin Li
‹ Prev 1 2 3 10 Next ›