Related papers: High-Throughput Synchronous Deep RL

Hierarchical Reinforcement Learning with Timed Subgoals

Hierarchical reinforcement learning (HRL) holds great potential for sample-efficient learning on challenging long-horizon tasks. In particular, letting a higher level assign subgoals to a lower level has been shown to enable fast learning…

Machine Learning · Computer Science 2021-12-07 Nico Gürtler , Dieter Büchler , Georg Martius

An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search

Deep reinforcement learning (DRL) algorithms and evolution strategies (ES) have been applied to various tasks, showing excellent performances. These have the opposite properties, with DRL having good sample efficiency and poor stability,…

Machine Learning · Computer Science 2021-04-06 Kyunghyun Lee , Byeong-Uk Lee , Ukcheol Shin , In So Kweon

Hierarchical Reinforcement Learning with Hindsight

Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a…

Machine Learning · Computer Science 2019-03-11 Andrew Levy , Robert Platt , Kate Saenko

Deep RTS: A Game Environment for Deep Reinforcement Learning in Real-Time Strategy Games

Reinforcement learning (RL) is an area of research that has blossomed tremendously in recent years and has shown remarkable potential for artificial intelligence based opponents in computer games. This success is primarily due to the vast…

Artificial Intelligence · Computer Science 2018-08-16 Per-Arne Andersen , Morten Goodwin , Ole-Christoffer Granmo

Hierarchical Reinforcement Learning with Optimal Level Synchronization based on a Deep Generative Model

The high-dimensional or sparse reward task of a reinforcement learning (RL) environment requires a superior potential controller such as hierarchical reinforcement learning (HRL) rather than an atomic RL because it absorbs the complexity of…

Machine Learning · Computer Science 2021-07-20 JaeYoon Kim , Junyu Xuan , Christy Liang , Farookh Hussain

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Deep reinforcement learning (DRL) has significantly advanced the field of combinatorial optimization (CO). However, its practicality is hindered by the necessity for a large number of reward evaluations, especially in scenarios involving…

Machine Learning · Computer Science 2024-07-18 Hyeonah Kim , Minsu Kim , Sungsoo Ahn , Jinkyoo Park

Meta-Reinforcement Learning for Heuristic Planning

In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of tasks to prepare for and learn faster in new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution…

Artificial Intelligence · Computer Science 2021-07-07 Ricardo Luna Gutierrez , Matteo Leonetti

Hybrid Cross-domain Robust Reinforcement Learning

Robust reinforcement learning (RL) aims to learn policies that remain effective despite uncertainties in its environment, which frequently arise in real-world applications due to variations in environment dynamics. The robust RL methods…

Machine Learning · Computer Science 2025-05-30 Linh Le Pham Van , Minh Hoang Nguyen , Hung Le , Hung The Tran , Sunil Gupta

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning

Despite advances in reinforcement learning (RL)-based video reasoning with large language models (LLMs), data collection and fine-tuning remain significant challenges. These methods often rely on large-scale supervised fine-tuning (SFT)…

Computer Vision and Pattern Recognition · Computer Science 2025-10-27 Ziyang Wang , Jaehong Yoon , Shoubin Yu , Md Mohaiminul Islam , Gedas Bertasius , Mohit Bansal

Temporal Shift Reinforcement Learning

The function approximators employed by traditional image-based Deep Reinforcement Learning (DRL) algorithms usually lack a temporal learning component and instead focus on learning the spatial component. We propose a technique, Temporal…

Machine Learning · Computer Science 2021-10-28 Deepak George Thomas , Tichakorn Wongpiromsarn , Ali Jannesari

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data

Online Reinforcement learning (RL) typically requires high-stakes online interaction data to learn a policy for a target task. This prompts interest in leveraging historical data to improve sample efficiency. The historical data may come…

Machine Learning · Computer Science 2024-11-07 Chengrui Qu , Laixi Shi , Kishan Panaganti , Pengcheng You , Adam Wierman

Hierarchical Reinforcement Learning in Multi-Goal Spatial Navigation with Autonomous Mobile Robots

Hierarchical reinforcement learning (HRL) is hypothesized to be able to leverage the inherent hierarchy in learning tasks where traditional reinforcement learning (RL) often fails. In this research, HRL is evaluated and contrasted with…

Artificial Intelligence · Computer Science 2025-08-20 Brendon Johnson , Alfredo Weitzenfeld

RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$

Meta reinforcement learning (Meta-RL) methods such as RL$^2$ have emerged as promising approaches for learning data-efficient RL algorithms tailored to a given task distribution. However, they show poor asymptotic performance and struggle…

Machine Learning · Computer Science 2025-07-29 Abhinav Bhatia , Samer B. Nashed , Shlomo Zilberstein

Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation

Reinforcement Learning (RL) based methods have been increasingly explored for robot learning. However, RL based methods often suffer from low sampling efficiency in the exploration phase, especially for long-horizon manipulation tasks, and…

Robotics · Computer Science 2024-12-31 Hao Zhang , Hao Wang , Xiucai Huang , Wenrui Chen , Zhen Kan

Data-Efficient Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) is a promising approach to extend traditional reinforcement learning (RL) methods to solve more complex tasks. Yet, the majority of current HRL methods require careful task-specific design and…

Machine Learning · Computer Science 2018-10-08 Ofir Nachum , Shixiang Gu , Honglak Lee , Sergey Levine

A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs

With the growing popularity of deep reinforcement learning (DRL), human-in-the-loop (HITL) approach has the potential to revolutionize the way we approach decision-making problems and create new opportunities for human-AI collaboration. In…

Artificial Intelligence · Computer Science 2025-04-25 Jalal Arabneydi , Saiful Islam , Srijita Das , Sai Krishna Gottipati , William Duguay , Cloderic Mars , Matthew E. Taylor , Matthew Guzdial , Antoine Fagette , Younes Zerouali

Hysteresis-Based RL: Robustifying Reinforcement Learning-based Control Policies via Hybrid Control

Reinforcement learning (RL) is a promising approach for deriving control policies for complex systems. As we show in two control problems, the derived policies from using the Proximal Policy Optimization (PPO) and Deep Q-Network (DQN)…

Machine Learning · Computer Science 2022-04-05 Jan de Priester , Ricardo G. Sanfelice , Nathan van de Wouw

Hybrid Inverse Reinforcement Learning

The inverse reinforcement learning approach to imitation learning is a double-edged sword. On the one hand, it can enable learning from a smaller number of expert demonstrations with more robustness to error compounding than behavioral…

Machine Learning · Computer Science 2024-06-06 Juntao Ren , Gokul Swamy , Zhiwei Steven Wu , J. Andrew Bagnell , Sanjiban Choudhury

Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition

Deep reinforcement learning (deep RL) is a combination of deep learning with reinforcement learning principles to create efficient methods that can learn by interacting with its environment. This led to breakthroughs in many complex tasks…

Sound · Computer Science 2019-10-29 Thejan Rajapakshe , Rajib Rana , Siddique Latif , Sara Khalifa , Björn W. Schuller

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies to optimize agent operation over time. RL algorithms have shown promising results in solving complicated problems in recent years. However, their application on…

Machine Learning · Computer Science 2021-09-29 Hamed Khorasgani , Haiyan Wang , Chetan Gupta , Susumu Serita