English
Related papers

Related papers: Asynchronous Methods for Deep Reinforcement Learni…

200 papers

We propose a novel framework for efficient parallelization of deep reinforcement learning algorithms, enabling these algorithms to learn from multiple actors on a single machine. The framework is algorithm agnostic and can be applied to…

Machine Learning · Computer Science 2017-05-17 Alfredo V. Clemente , Humberto N. Castejón , Arjun Chandra

Synchronizing decisions across multiple agents in realistic settings is problematic since it requires agents to wait for other agents to terminate and communicate about termination reliably. Ideally, agents should learn and execute…

Machine Learning · Computer Science 2022-10-12 Yuchen Xiao , Weihao Tan , Christopher Amato

We introduce a hybrid CPU/GPU version of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. We analyze its computational traits and concentrate…

Machine Learning · Computer Science 2017-03-08 Mohammad Babaeizadeh , Iuri Frosio , Stephen Tyree , Jason Clemons , Jan Kautz

In this work we present a novel extension of soft actor critic, a state of the art deep reinforcement algorithm. Our method allows us to combine traditional controllers with learned neural network policies. This combination allows us to…

Robotics · Computer Science 2020-12-23 Sean Gillen , Marco Molnar , Katie Byl

Deep reinforcement learning (RL) has achieved many recent successes, yet experiment turn-around time remains a key bottleneck in research and in practice. We investigate how to optimize existing deep RL algorithms for modern computers,…

Machine Learning · Computer Science 2019-01-14 Adam Stooke , Pieter Abbeel

Stochastic gradient descent (SGD), which updates the model parameters by adding a local gradient times a learning rate at each step, is widely used in model training of machine learning algorithms such as neural networks. It is observed…

Machine Learning · Computer Science 2017-06-01 Chang Xu , Tao Qin , Gang Wang , Tie-Yan Liu

Deep reinforcement learning, and especially the Asynchronous Advantage Actor-Critic algorithm, has been successfully used to achieve super-human performance in a variety of video games. Starcraft II is a new challenge for the reinforcement…

Artificial Intelligence · Computer Science 2018-07-25 Basel Alghanem , Keerthana P G

Actor-critic (AC) methods are widely used in reinforcement learning (RL) and benefit from the flexibility of using any policy gradient method as the actor and value-based method as the critic. The critic is usually trained by minimizing the…

Machine Learning · Computer Science 2023-11-01 Sharan Vaswani , Amirreza Kazemi , Reza Babanezhad , Nicolas Le Roux

The asynchronous nature of the state-of-the-art reinforcement learning algorithms such as the Asynchronous Advantage Actor-Critic algorithm, makes them exceptionally suitable for CPU computations. However, given the fact that deep…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-17 Robert Adamski , Tomasz Grel , Maciej Klimek , Henryk Michalewski

In the domain of continuous control, deep reinforcement learning (DRL) demonstrates promising results. However, the dependence of DRL on deep neural networks (DNNs) results in the demand for extensive data and increased computational cost.…

Machine Learning · Computer Science 2025-04-15 Shiron Thalagala , Pak Kin Wong , Xiaozheng Wang , Tianang Sun

Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm…

Machine Learning · Computer Science 2019-08-14 Philip Bontrager , Ahmed Khalifa , Damien Anderson , Matthew Stephenson , Christoph Salge , Julian Togelius

In this paper we explore how actor-critic methods in deep reinforcement learning, in particular Asynchronous Advantage Actor-Critic (A3C), can be extended with agent modeling. Inspired by recent works on representation learning and…

Multiagent Systems · Computer Science 2019-07-24 Pablo Hernandez-Leal , Bilal Kartal , Matthew E. Taylor

To make efficient use of limited spectral resources, we in this work propose a deep actor-critic reinforcement learning based framework for dynamic multichannel access. We consider both a single-user case and a scenario in which multiple…

Machine Learning · Computer Science 2019-08-23 Chen Zhong , Ziyang Lu , M. Cenk Gursoy , Senem Velipasalar

We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform…

Machine Learning · Computer Science 2022-07-20 Mohammadi Zaki , Avinash Mohan , Aditya Gopalan , Shie Mannor

We present a study in Distributed Deep Reinforcement Learning (DDRL) focused on scalability of a state-of-the-art Deep Reinforcement Learning algorithm known as Batch Asynchronous Advantage ActorCritic (BA3C). We show that using the Adam…

Artificial Intelligence · Computer Science 2018-04-10 Igor Adamski , Robert Adamski , Tomasz Grel , Adam Jędrych , Kamil Kaczmarek , Henryk Michalewski

Pretraining with expert demonstrations have been found useful in speeding up the training process of deep reinforcement learning algorithms since less online simulation data is required. Some people use supervised learning to speed up the…

Artificial Intelligence · Computer Science 2018-02-12 Xiaoqin Zhang , Huimin Ma

Actor-critic deep reinforcement learning (DRL) algorithms have recently achieved prominent success in tackling various challenging reinforcement learning (RL) problems, particularly complex control tasks with high-dimensional continuous…

Machine Learning · Computer Science 2023-05-04 Gang Chen , Victoria Huang

Deep Q-Network (DQN) marked a major milestone for reinforcement learning, demonstrating for the first time that human-level control policies could be learned directly from raw visual inputs via reward maximization. Even years after its…

Machine Learning · Computer Science 2021-11-03 Brett Daley , Christopher Amato

Asynchronous methods are widely used in deep learning, but have limited theoretical justification when applied to non-convex problems. We show that running stochastic gradient descent (SGD) in an asynchronous manner can be viewed as adding…

Machine Learning · Statistics 2016-11-28 Ioannis Mitliagkas , Ce Zhang , Stefan Hadjis , Christopher Ré

Asynchronous and parallel implementation of standard reinforcement learning (RL) algorithms is a key enabler of the tremendous success of modern RL. Among many asynchronous RL algorithms, arguably the most popular and effective one is the…

Machine Learning · Computer Science 2023-08-02 Han Shen , Kaiqing Zhang , Mingyi Hong , Tianyi Chen
‹ Prev 1 2 3 10 Next ›