Related papers: Bayesian Action Decoder for Deep Multi-Agent Reinf…

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

In recent years we have seen fast progress on a number of benchmark problems in AI, with modern methods achieving near or super human performance in Go, Poker and Dota. One common aspect of all of these challenges is that they are by design…

Artificial Intelligence · Computer Science 2021-05-13 Hengyuan Hu , Jakob N Foerster

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning

Agents that interact with other agents often do not know a priori what the other agents' strategies are, but have to maximise their own online return while interacting with and learning about others. The optimal adaptive behaviour under…

Machine Learning · Computer Science 2022-04-19 Luisa Zintgraf , Sam Devlin , Kamil Ciosek , Shimon Whiteson , Katja Hofmann

A Behavior-Aware Approach for Deep Reinforcement Learning in Non-stationary Environments without Known Change Points

Deep reinforcement learning is used in various domains, but usually under the assumption that the environment has stationary conditions like transitions and state distributions. When this assumption is not met, performance suffers. For this…

Machine Learning · Computer Science 2024-05-24 Zihe Liu , Jie Lu , Guangquan Zhang , Junyu Xuan

Multi-agent Cooperative Games Using Belief Map Assisted Training

In a multi-agent system, agents share their local observations to gain global situational awareness for decision making and collaboration using a message passing system. When to send a message, how to encode a message, and how to leverage…

Multiagent Systems · Computer Science 2024-07-01 Qinwei Huang , Chen Luo , Alex B. Wu , Simon Khan , Hai Li , Qinru Qiu

Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning

In this work, we propose Behavior-Guided Actor-Critic (BAC), an off-policy actor-critic deep RL algorithm. BAC mathematically formulates the behavior of the policy through autoencoders by providing an accurate estimation of how frequently…

Machine Learning · Computer Science 2021-04-12 Ammar Fayad , Majd Ibrahim

Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

Executing actions in a correlated manner is a common strategy for human coordination that often leads to better cooperation, which is also potentially beneficial for cooperative multi-agent reinforcement learning (MARL). However, the recent…

Multiagent Systems · Computer Science 2023-06-06 Dingyang Chen , Qi Zhang

Solving Common-Payoff Games with Approximate Policy Iteration

For artificially intelligent learning systems to have widespread applicability in real-world settings, it is important that they be able to operate decentrally. Unfortunately, decentralized control is difficult -- computing even an…

Artificial Intelligence · Computer Science 2021-01-13 Samuel Sokota , Edward Lockhart , Finbarr Timbers , Elnaz Davoodi , Ryan D'Orazio , Neil Burch , Martin Schmid , Michael Bowling , Marc Lanctot

Too many cooks: Bayesian inference for coordinating multi-agent collaboration

Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating to solve a single task together and other times dividing it up into sub-tasks to work on in parallel. Underlying the human ability to collaborate…

Artificial Intelligence · Computer Science 2020-07-07 Rose E. Wang , Sarah A. Wu , James A. Evans , Joshua B. Tenenbaum , David C. Parkes , Max Kleiman-Weiner

Online Bayesian Learning of Agent Behavior in Differential Games

This work introduces an online Bayesian game-theoretic method for behavior identification in multi-agent dynamical systems. By casting Hamilton-Jacobi-Bellman optimality conditions as linear-in-parameter residuals, the method enables fast…

Systems and Control · Electrical Eng. & Systems 2026-01-09 Francesco Bianchin , Robert Lefringhausen , Sandra Hirche

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

In the reinforcement learning literature, there are many algorithms developed for either Contextual Bandit (CB) or Markov Decision Processes (MDP) environments. However, when deploying reinforcement learning algorithms in the real world,…

Machine Learning · Computer Science 2022-08-02 Kelly W. Zhang , Omer Gottesman , Finale Doshi-Velez

Behaviour-conditioned policies for cooperative reinforcement learning tasks

The cooperation among AI systems, and between AI systems and humans is becoming increasingly important. In various real-world tasks, an agent needs to cooperate with unknown partner agent types. This requires the agent to assess the…

Machine Learning · Computer Science 2021-10-05 Antti Keurulainen , Isak Westerlund , Ariel Kwiatkowski , Samuel Kaski , Alexander Ilin

An Action Language for Multi-Agent Domains: Foundations

In multi-agent domains (MADs), an agent's action may not just change the world and the agent's knowledge and beliefs about the world, but also may change other agents' knowledge and beliefs about the world and their knowledge and beliefs…

Artificial Intelligence · Computer Science 2020-12-29 Chitta Baral , Gregory Gelfond , Enrico Pontelli , Tran Cao Son

Affect Control Processes: Intelligent Affective Interaction using a Partially Observable Markov Decision Process

This paper describes a novel method for building affectively intelligent human-interactive agents. The method is based on a key sociological insight that has been developed and extensively verified over the last twenty years, but has yet to…

Human-Computer Interaction · Computer Science 2015-10-23 Jesse Hoey , Tobias Schroeder , Areej Alhothali

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Predicting and executing a sequence of actions without intermediate replanning, known as action chunking, is increasingly used in robot learning from human demonstrations. Yet, its effects on the learned policy remain inconsistent: some…

Robotics · Computer Science 2025-04-28 Yuejiang Liu , Jubayer Ibn Hamid , Annie Xie , Yoonho Lee , Maximilian Du , Chelsea Finn

Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria

A key challenge in the study of multiagent cooperation is the need for individual agents not only to cooperate effectively, but to decide with whom to cooperate. This is particularly critical in situations when other agents have hidden,…

Artificial Intelligence · Computer Science 2022-01-07 Kavya Kopparapu , Edgar A. Duéñez-Guzmán , Jayd Matyas , Alexander Sasha Vezhnevets , John P. Agapiou , Kevin R. McKee , Richard Everett , Janusz Marecki , Joel Z. Leibo , Thore Graepel

Learning to Deceive in Multi-Agent Hidden Role Games

Deception is prevalent in human social settings. However, studies into the effect of deception on reinforcement learning algorithms have been limited to simplistic settings, restricting their applicability to complex real-world problems.…

Multiagent Systems · Computer Science 2022-09-07 Matthew Aitchison , Lyndon Benke , Penny Sweetser

Mean Actor Critic

We propose a new algorithm, Mean Actor-Critic (MAC), for discrete-action continuous-state reinforcement learning. MAC is a policy gradient algorithm that uses the agent's explicit representation of all action values to estimate the gradient…

Machine Learning · Statistics 2018-05-24 Cameron Allen , Kavosh Asadi , Melrose Roderick , Abdel-rahman Mohamed , George Konidaris , Michael Littman

Bayesian learning of the optimal action-value function in a Markov decision process

The Markov Decision Process (MDP) is a popular framework for sequential decision-making problems, and uncertainty quantification is an essential component of it to learn optimal decision-making strategies. In particular, a Bayesian…

Machine Learning · Statistics 2025-05-06 Jiaqi Guo , Chon Wai Ho , Sumeetpal S. Singh

Multiagent Reinforcement Learning with Neighbor Action Estimation

Multiagent reinforcement learning, as a prominent intelligent paradigm, enables collaborative decision-making within complex systems. However, existing approaches often rely on explicit action exchange between agents to evaluate action…

Robotics · Computer Science 2026-01-09 Zhenglong Luo , Zhiyong Chen , Aoxiang Liu

Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning under Policy Uncertainty

In stochastic games with incomplete information, the uncertainty is evoked by the lack of knowledge about a player's own and the other players' types, i.e. the utility function and the policy space, and also the inherent stochasticity of…

Machine Learning · Computer Science 2022-03-21 Hannes Eriksson , Debabrota Basu , Mina Alibeigi , Christos Dimitrakakis