Machine Learning · Computer Science
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, Zhaoran Wang +2
2019-11-20
Machine Learning · Computer Science
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob, Doina Precup
2019-12-12
Machine Learning · Computer Science
Optimal Actor-Critic Policy with Optimized Training Datasets
Chayan Banerjee, Zhiyong Chen, Nasimul Noman, Mohsen Zamani
2021-12-02
Machine Learning · Computer Science
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
Naman Saxena, Subhojyoti Khastigir, Shishir Kolathaya, Shalabh Bhatnagar
2023-07-20
Machine Learning · Computer Science
Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Raghuram Bharadwaj Diddigi, Prateek Jain, Prabuchandran K. J., Shalabh Bhatnagar
2022-06-16
Machine Learning · Computer Science
Variance Penalized On-Policy and Off-Policy Actor-Critic
Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal +1
2021-02-04
Statistics Theory · Mathematics
Batch Policy Learning in Average Reward Markov Decision Processes
Peng Liao, Zhengling Qi, Runzhe Wan, Predrag Klasnja +1
2022-09-20
Machine Learning · Computer Science
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
Jiachen Li, Shuo Cheng, Zhenyu Liao, Huayan Wang +2
2022-11-23
Machine Learning · Computer Science
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov, Kumar Krishna Agrawal, Debidatta Dwibedi, Sergey Levine +1
2018-10-16
Machine Learning · Computer Science
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine
2018-08-10
Machine Learning · Computer Science
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat +3
2020-06-15
Machine Learning · Computer Science
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains
James Bannon, Brad Windsor, Wenbo Song, Tao Li
2020-06-05