Related papers: Asynchronous Methods for Deep Reinforcement Learni…

Efficient Parallel Methods for Deep Reinforcement Learning

We propose a novel framework for efficient parallelization of deep reinforcement learning algorithms, enabling these algorithms to learn from multiple actors on a single machine. The framework is algorithm agnostic and can be applied to…

Machine Learning · Computer Science · 2017-05-17 · Alfredo V. Clemente, Humberto N. Castejón, Arjun Chandra

Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning

Synchronizing decisions across multiple agents in realistic settings is problematic since it requires agents to wait for other agents to terminate and communicate about termination reliably. Ideally, agents should learn and execute…

Machine Learning · Computer Science · 2022-10-12 · Yuchen Xiao, Weihao Tan, Christopher Amato

Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU

We introduce a hybrid CPU/GPU version of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. We analyze its computational traits and concentrate…

Machine Learning · Computer Science · 2017-03-08 · Mohammad Babaeizadeh, Iuri Frosio, Stephen Tyree, Jason Clemons, Jan Kautz

Combining Deep Reinforcement Learning And Local Control For The Acrobot Swing-up And Balance Task

In this work we present a novel extension of soft actor critic, a state-of-the-art deep reinforcement algorithm. Our method allows us to combine traditional controllers with learned neural network policies. This combination allows us to…

Robotics · Computer Science · 2020-12-23 · Sean Gillen, Marco Molnar, Katie Byl

Accelerated Methods for Deep Reinforcement Learning

Deep reinforcement learning (RL) has achieved many recent successes, yet experiment turn-around time remains a key bottleneck in research and in practice. We investigate how to optimize existing deep RL algorithms for modern computers,…

Machine Learning · Computer Science · 2019-01-14 · Adam Stooke, Pieter Abbeel

Reinforcement Learning for Learning Rate Control

Stochastic gradient descent (SGD), which updates the model parameters by adding a local gradient times a learning rate at each step, is widely used in model training of machine learning algorithms such as neural networks. It is observed…

Machine Learning · Computer Science · 2017-06-01 · Chang Xu, Tao Qin, Gang Wang, Tie-Yan Liu

Asynchronous Advantage Actor-Critic Agent for Starcraft II

Deep reinforcement learning, and especially the Asynchronous Advantage Actor-Critic algorithm, has been successfully used to achieve super-human performance in a variety of video games. Starcraft II is a new challenge for the reinforcement…

Artificial Intelligence · Computer Science · 2018-07-25 · Basel Alghanem, Keerthana P G

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees

Actor-critic (AC) methods are widely used in reinforcement learning (RL) and benefit from the flexibility of using any policy gradient method as the actor and value-based method as the critic. The critic is usually trained by minimizing the…

Machine Learning · Computer Science · 2023-11-01 · Sharan Vaswani, Amirreza Kazemi, Reza Babanezhad, Nicolas Le Roux

Atari games and Intel processors

The asynchronous nature of state-of-the-art reinforcement learning algorithms such as the Asynchronous Advantage Actor-Critic algorithm makes them exceptionally suitable for CPU computations. However, given the fact that deep…

Distributed, Parallel, and Cluster Computing · Computer Science · 2018-04-17 · Robert Adamski, Tomasz Grel, Maciej Klimek, Henryk Michalewski

Broad Critic Deep Actor Reinforcement Learning for Continuous Control

In the domain of continuous control, deep reinforcement learning (DRL) demonstrates promising results. However, the dependence of DRL on deep neural networks (DNNs) results in the demand for extensive data and increased computational cost.…

Machine Learning · Computer Science · 2025-04-15 · Shiron Thalagala, Pak Kin Wong, Xiaozheng Wang, Tianang Sun

Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games

Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm…

Machine Learning · Computer Science · 2019-08-14 · Philip Bontrager, Ahmed Khalifa, Damien Anderson, Matthew Stephenson, Christoph Salge, Julian Togelius

Agent Modeling as Auxiliary Task for Deep Reinforcement Learning

In this paper we explore how actor-critic methods in deep reinforcement learning, in particular Asynchronous Advantage Actor-Critic (A3C), can be extended with agent modeling. Inspired by recent works on representation learning and…

Multiagent Systems · Computer Science · 2019-07-24 · Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor

A Deep Actor-Critic Reinforcement Learning Framework for Dynamic Multichannel Access

To make efficient use of limited spectral resources, in this work we propose a deep actor-critic reinforcement learning based framework for dynamic multichannel access. We consider both a single-user case and a scenario in which multiple…

Machine Learning · Computer Science · 2019-08-23 · Chen Zhong, Ziyang Lu, M. Cenk Gursoy, Senem Velipasalar

Actor-Critic based Improper Reinforcement Learning

We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform…

Machine Learning · Computer Science · 2022-07-20 · Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes

We present a study in Distributed Deep Reinforcement Learning (DDRL) focused on scalability of a state-of-the-art Deep Reinforcement Learning algorithm known as Batch Asynchronous Advantage ActorCritic (BA3C). We show that using the Adam…

Artificial Intelligence · Computer Science · 2018-04-10 · Igor Adamski, Robert Adamski, Tomasz Grel, Adam Jędrych, Kamil Kaczmarek, Henryk Michalewski

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations

Pretraining with expert demonstrations has been found useful in speeding up the training process of deep reinforcement learning algorithms since less online simulation data is required. Some people use supervised learning to speed up the…

Artificial Intelligence · Computer Science · 2018-02-12 · Xiaoqin Zhang, Huimin Ma

Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training

Actor-critic deep reinforcement learning (DRL) algorithms have recently achieved prominent success in tackling various challenging reinforcement learning (RL) problems, particularly complex control tasks with high-dimensional continuous…

Machine Learning · Computer Science · 2023-05-04 · Gang Chen, Victoria Huang

Human-Level Control without Server-Grade Hardware

Deep Q-Network (DQN) marked a major milestone for reinforcement learning, demonstrating for the first time that human-level control policies could be learned directly from raw visual inputs via reward maximization. Even years after its…

Machine Learning · Computer Science · 2021-11-03 · Brett Daley, Christopher Amato

Asynchrony begets Momentum, with an Application to Deep Learning

Asynchronous methods are widely used in deep learning, but have limited theoretical justification when applied to non-convex problems. We show that running stochastic gradient descent (SGD) in an asynchronous manner can be viewed as adding…

Machine Learning · Statistics · 2016-11-28 · Ioannis Mitliagkas, Ce Zhang, Stefan Hadjis, Christopher Ré

Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup

Asynchronous and parallel implementation of standard reinforcement learning (RL) algorithms is a key enabler of the tremendous success of modern RL. Among many asynchronous RL algorithms, arguably the most popular and effective one is the…

Machine Learning · Computer Science · 2023-08-02 · Han Shen, Kaiqing Zhang, Mingyi Hong, Tianyi Chen