Related papers: Advantage Alignment Algorithms

Learning Robust Social Strategies with Large Language Models

As agentic AI becomes more widespread, agents with distinct and possibly conflicting goals will interact in complex ways. These multi-agent interactions pose a fundamental challenge, particularly in social dilemmas, where agents' individual…

Machine Learning · Computer Science 2025-12-02 Dereck Piche, Mohammed Muqeeth, Milad Aghajohari, Juan Duque, Michael Noukhovitch, Aaron Courville

Towards Sustainable Investment Policies Informed by Opponent Shaping

Addressing climate change requires global coordination, yet rational economic actors often prioritize immediate gains over collective welfare, resulting in social dilemmas. InvestESG is a recently proposed multi-agent simulation that…

Machine Learning · Computer Science 2026-02-13 Juan Agustin Duque, Razvan Ciuca, Ayoub Echchahed, Hugo Larochelle, Aaron Courville

Value Alignment Equilibrium in Multiagent Systems

Value alignment has emerged in recent years as a basic principle to produce beneficial and mindful Artificial Intelligence systems. It mainly states that autonomous entities should behave in a way that is aligned with our human values. In…

Multiagent Systems · Computer Science 2021-06-28 Nieves Montes, Carles Sierra

AI Alignment Dialogues: An Interactive Approach to AI Alignment in Support Agents

AI alignment is about ensuring AI systems only pursue goals and activities that are beneficial to humans. Most of the current approach to AI alignment is to learn what humans value from their behavioural data. This paper proposes a…

Artificial Intelligence · Computer Science 2023-10-06 Pei-Yu Chen, Myrthe L. Tielman, Dirk K. J. Heylen, Catholijn M. Jonker, M. Birna van Riemsdijk

LLM Active Alignment: A Nash Equilibrium Perspective

We develop a game-theoretic framework for predicting and steering the behavior of populations of large language models (LLMs) through Nash equilibrium (NE) analysis. To avoid the intractability of equilibrium computation in open-ended text…

Artificial Intelligence · Computer Science 2026-02-09 Tonghan Wang, Yuqi Pan, Xinyi Yang, Yanchen Jiang, Milind Tambe, David C. Parkes

Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game

Human preference alignment is essential to improve the interaction quality of large language models (LLMs). Existing alignment methods depend on manually annotated preference data to guide the LLM optimization directions. However,…

Computation and Language · Computer Science 2024-06-04 Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du, Xiaolong Li

Consistent Opponent Modeling in Imperfect-Information Games

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may obtain a strong performance in some…

Computer Science and Game Theory · Computer Science 2026-01-05 Sam Ganzfried

Norms for Beneficial A.I.: A Computational Analysis of the Societal Value Alignment Problem

The rise of artificial intelligence (A.I.) based systems is already offering substantial benefits to the society as a whole. However, these systems may also enclose potential conflicts and unintended consequences. Notably, people will tend…

Computers and Society · Computer Science 2020-12-23 Pedro Fernandes, Francisco C. Santos, Manuel Lopes

Automated Configuration and Usage of Strategy Portfolios for Bargaining

Bargaining can be used to resolve mixed-motive games in multi-agent systems. Although there is an abundance of negotiation strategies implemented in automated negotiating agents, most agents are based on single fixed strategies, while it is…

Multiagent Systems · Computer Science 2022-12-21 Bram M. Renting, Holger H. Hoos, Catholijn M. Jonker

Emergent Alignment via Competition

Aligning AI systems with human values remains a fundamental challenge, but does our inability to create perfectly aligned models preclude obtaining the benefits of alignment? We study a strategic setting where a human user interacts with…

Machine Learning · Computer Science 2026-02-04 Natalie Collina, Surbhi Goel, Aaron Roth, Emily Ryu, Mirah Shi

Multiagent Learning in Large Anonymous Games

In large systems, it is important for agents to learn to act effectively, but sophisticated multi-agent learning algorithms generally do not scale. An alternative approach is to find restricted classes of games where simple, efficient…

Multiagent Systems · Computer Science 2009-03-16 Ian A. Kash, Eric J. Friedman, Joseph Y. Halpern

On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality

In this work, we study a system of two interacting non-cooperative Q-learning agents, where one agent has the privilege of observing the other's actions. We show that this information asymmetry can lead to a stable outcome of population…

Machine Learning · Computer Science 2021-01-26 Ezra Tampubolon, Haris Ceribasic, Holger Boche

Approximate Equilibrium and Incentivizing Social Coordination

We study techniques to incentivize self-interested agents to form socially desirable solutions in scenarios where they benefit from mutual coordination. Towards this end, we consider coordination games where agents have different intrinsic…

Computer Science and Game Theory · Computer Science 2014-04-21 Elliot Anshelevich, Shreyas Sekar

Strategic Interactions between Large Language Models-based Agents in Beauty Contests

The growing adoption of large language models (LLMs) presents potential for deeper understanding of human behaviours within game theory frameworks. Addressing a research gap on multi-player competitive games, this paper examines the strategic…

General Economics · Economics 2024-10-04 Siting Estee Lu

Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

Many real-world multi-agent interactions consider multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore,…

Multiagent Systems · Computer Science 2020-11-17 Roxana Rădulescu, Timothy Verstraeten, Yijie Zhang, Patrick Mannion, Diederik M. Roijers, Ann Nowé

Multi-Agent Distributed Reinforcement Learning for Making Decentralized Offloading Decisions

We formulate computation offloading as a decentralized decision-making problem with autonomous agents. We design an interaction mechanism that incentivizes agents to align private and system goals by balancing between competition and…

Multiagent Systems · Computer Science 2022-06-22 Jing Tan, Ramin Khalili, Holger Karl, Artur Hecker

Opponent Shaping in LLM Agents

Large Language Models (LLMs) are increasingly being deployed as autonomous agents in real-world environments. As these deployments scale, multi-agent interactions become inevitable, making it essential to understand strategic behavior in…

Machine Learning · Computer Science 2025-10-10 Marta Emili Garcia Segura, Stephen Hailes, Mirco Musolesi

Learning to Reach Agreement in a Continuous Ultimatum Game

It is well-known that acting in an individually rational manner, according to the principles of classical game theory, may lead to sub-optimal solutions in a class of problems named social dilemmas. In contrast, humans generally do not have…

Computer Science and Game Theory · Computer Science 2014-01-16 Steven de Jong, Simon Uyttendaele, Karl Tuyls

Multi-Agent Generative Adversarial Imitation Learning

Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal. However, most existing approaches are not applicable in multi-agent settings due to the existence of multiple (Nash)…

Machine Learning · Computer Science 2018-07-27 Jiaming Song, Hongyu Ren, Dorsa Sadigh, Stefano Ermon

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

Cooperation between self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naive reinforcement learning algorithms typically…

Multiagent Systems · Computer Science 2025-01-16 John L. Zhou, Weizhe Hong, Jonathan C. Kao