Related papers: Multi-agent Reinforcement Learning Accelerated MCM…

A Method for Fast Autonomy Transfer in Reinforcement Learning

This paper introduces a novel reinforcement learning (RL) strategy designed to facilitate rapid autonomy transfer by utilizing pre-trained critic value functions from multiple environments. Unlike traditional methods that require extensive…

Machine Learning · Computer Science 2024-07-31 Dinuka Sahabandu, Bhaskar Ramasubramanian, Michail Alexiou, J. Sukarno Mertoguno, Linda Bushnell, Radha Poovendran

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a recent approach that applies single-agent AIRL to multi-agent problems where we seek to recover both policies for our agents and reward functions that promote expert-like…

Multiagent Systems · Computer Science 2020-02-26 Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method

Reinforcement learning has gathered much attention in recent years due to its rapid development and rich applications, especially in control systems and robotics. When tackling real-world applications with reinforcement learning methods, the…

Machine Learning · Computer Science 2025-10-02 Andy Wu, Chun-Cheng Lin, Rung-Tzuo Liaw, Yuehua Huang, Chihjung Kuo, Chia Tong Weng

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

Many existing reinforcement learning (RL) methods employ stochastic gradient iteration on the back end, whose stability hinges upon a hypothesis that the data-generating process mixes exponentially fast with a rate parameter that appears in…

Machine Learning · Computer Science 2023-02-02 Wesley A. Suttle, Amrit Singh Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha

On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation

Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search. Actor-critic algorithms combine the merits of both approaches by alternating between steps…

Machine Learning · Computer Science 2023-01-31 Harshat Kumar, Alec Koppel, Alejandro Ribeiro

Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator

This paper presents a novel approach to multi-agent reinforcement learning (RL) for linear systems with convex polytopic constraints. Existing work on RL has demonstrated the use of model predictive control (MPC) as a function approximator…

Systems and Control · Electrical Eng. & Systems 2025-01-06 Samuel Mallick, Filippo Airaldi, Azita Dabiri, Bart De Schutter

Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method

Actor-critic RL is widely used in various robotic control tasks. By viewing actor-critic RL from the perspective of variational inference (VI), the policy network is trained to obtain the approximate posterior of actions given the…

Machine Learning · Computer Science 2022-01-04 Duo Xu, Faramarz Fekri

Automation and Feature Selection Enhancement with Reinforcement Learning (RL)

Effective feature selection, representation and transformation are principal steps in machine learning to improve prediction accuracy, model generalization and computational efficiency. Reinforcement learning provides a new perspective…

Machine Learning · Computer Science 2025-03-18 Sumana Sanyasipura Nagaraju

Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning

For those seeking healthcare advice online, AI-based dialogue agents capable of interacting with patients to perform automatic disease diagnosis are a viable option. This application necessitates efficient inquiry of relevant disease…

Machine Learning · Computer Science 2022-06-09 Weijie He, Ting Chen

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

Many real-world tasks require multiple agents to work together. Multi-agent reinforcement learning (RL) methods have been proposed in recent years to solve these tasks, but current methods often fail to efficiently learn policies. We thus…

Machine Learning · Computer Science 2019-12-03 Johannes Ackermann, Volker Gabler, Takayuki Osa, Masashi Sugiyama

Multi-Agent Inverse Q-Learning from Demonstrations

When reward functions are hand-designed, deep reinforcement learning algorithms often suffer from reward misspecification, causing them to learn suboptimal policies in terms of the intended task objectives. In the single-agent case, inverse…

Multiagent Systems · Computer Science 2025-03-07 Nathaniel Haynam, Adam Khoja, Dhruv Kumar, Vivek Myers, Erdem Bıyık

Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems

This paper describes a purely data-driven solution to a class of sequential decision-making problems with a large number of concurrent online decisions, with applications to computing systems and operations research. We assume that while…

Artificial Intelligence · Computer Science 2019-10-02 Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran, Harshad Khadilkar

Multi-Agent Adversarial Inverse Reinforcement Learning

Reinforcement learning agents are prone to undesired behaviors due to reward mis-specification. Finding a set of reward functions to properly guide agent behaviors is particularly challenging in multi-agent scenarios. Inverse reinforcement…

Machine Learning · Computer Science 2019-08-01 Lantao Yu, Jiaming Song, Stefano Ermon

Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning

Policy gradient methods have become popular in multi-agent reinforcement learning, but they suffer from high variance due to the presence of environmental stochasticity and exploring agents (i.e., non-stationarity), which is potentially…

Machine Learning · Computer Science 2021-12-21 Yuchen Xiao, Xueguang Lyu, Christopher Amato

Efficient and Scalable Deep Reinforcement Learning for Mean Field Control Games

Mean Field Control Games (MFCGs) provide a powerful theoretical framework for analyzing systems of infinitely many interacting agents, blending elements from Mean Field Games (MFGs) and Mean Field Control (MFC). However, solving the coupled…

Machine Learning · Computer Science 2025-01-03 Nianli Peng, Yilin Wang

Breaking the Computational Barrier: Provably Efficient Actor-Critic for Low-Rank MDPs

Reinforcement learning (RL) is a fundamental framework for sequential decision-making, in which an agent learns an optimal policy through interactions with an unknown environment. In settings with function approximation, many existing RL…

Machine Learning · Computer Science 2026-05-05 Ruiquan Huang, Donghao Li, Yingbin Liang, Jing Yang

Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations

Providing densely shaped reward functions for RL algorithms is often exceedingly challenging, motivating the development of RL algorithms that can learn from easier-to-specify sparse reward functions. This sparsity poses new exploration…

Machine Learning · Computer Science 2022-10-24 Albert Wilcox, Ashwin Balakrishna, Jules Dedieu, Wyame Benslimane, Daniel S. Brown, Ken Goldberg

Symmetry-Guided Multi-Agent Inverse Reinforcement Learning

In robotic systems, the performance of reinforcement learning depends on the rationality of predefined reward functions. However, manually designed reward functions often lead to policy failures due to inaccuracies. Inverse Reinforcement…

Robotics · Computer Science 2025-09-12 Yongkai Tian, Yirong Qi, Xin Yu, Wenjun Wu, Jie Luo

Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control

Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, centralized RL is infeasible…

Machine Learning · Computer Science 2019-03-13 Tianshu Chu, Jie Wang, Lara Codecà, Zhaojian Li

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a…

Machine Learning · Computer Science 2022-09-14 Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu