Related papers: Model-Based Opponent Modeling

Learning to Model Opponent Learning

Multi-Agent Reinforcement Learning (MARL) considers settings in which a set of coexisting agents interact with one another and their environment. The adaptation and learning of other agents induces non-stationarity in the environment…

Machine Learning · Computer Science 2020-06-09 Ian Davies , Zheng Tian , Jun Wang

SOM: Structured Opponent Modeling for LLM-based Agents via Structural Causal Model

Accurately predicting opponents' behavior from interactions is a fundamental capability for large language model (LLM)-based agents in multi-agent and game-theoretic environments. Existing approaches often entangle opponent modeling with…

Artificial Intelligence · Computer Science 2026-05-11 Shiyue Cao , Pei Xu , Likun Yang , Lei Cui , Xiaotang Chen , Kaiqi Huang

Opponent Modeling in Deep Reinforcement Learning

Opponent modeling is necessary in multi-agent settings where secondary agents with competing goals also adapt their strategies, yet it remains challenging because strategies interact with each other and change. Most previous work focuses on…

Machine Learning · Computer Science 2016-09-20 He He , Jordan Boyd-Graber , Kevin Kwok , Hal Daumé

Variational Autoencoders for Opponent Modeling in Multi-Agent Systems

Multi-agent systems exhibit complex behaviors that emanate from the interactions of multiple agents in a shared environment. In this work, we are interested in controlling one agent in a multi-agent system and successfully learn to interact…

Machine Learning · Computer Science 2020-01-30 Georgios Papoudakis , Stefano V. Albrecht

Preference-based opponent shaping in differentiable games

Strategy learning in game environments with multi-agent is a challenging problem. Since each agent's reward is determined by the joint strategy, a greedy learning strategy that aims to maximize its own reward may fall into a local optimum.…

Artificial Intelligence · Computer Science 2026-02-02 Xinyu Qiao , Yudong Hu , Congying Han , Weiyan Wu , Tiande Guo

Multi-agent Actor-Critic with Time Dynamical Opponent Model

In multi-agent reinforcement learning, multiple agents learn simultaneously while interacting with a common environment and each other. Since the agents adapt their policies during learning, not only the behavior of a single agent becomes…

Artificial Intelligence · Computer Science 2022-04-13 Yuan Tian , Klaus-Rudolf Kladny , Qin Wang , Zhiwu Huang , Olga Fink

Opponent Shaping in LLM Agents

Large Language Models (LLMs) are increasingly being deployed as autonomous agents in real-world environments. As these deployments scale, multi-agent interactions become inevitable, making it essential to understand strategic behavior in…

Machine Learning · Computer Science 2025-10-10 Marta Emili Garcia Segura , Stephen Hailes , Mirco Musolesi

Metric Policy Representations for Opponent Modeling

In multi-agent reinforcement learning, the inherent non-stationarity of the environment caused by other agents' actions posed significant difficulties for an agent to learn a good policy independently. One way to deal with non-stationarity…

Machine Learning · Computer Science 2022-06-22 Haobin Jiang , Yifan Yu , Zongqing Lu

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

This paper investigates the model-based methods in multi-agent reinforcement learning (MARL). We specify the dynamics sample complexity and the opponent sample complexity in MARL, and conduct a theoretic analysis of return discrepancy upper…

Machine Learning · Computer Science 2022-03-18 Weinan Zhang , Xihuai Wang , Jian Shen , Ming Zhou

Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation

Reinforcement learning is well suited for optimizing policies of recommender systems. Current solutions mostly focus on model-free approaches, which require frequent interactions with the real environment, and thus are expensive in model…

Machine Learning · Computer Science 2020-01-22 Xueying Bai , Jian Guan , Hongning Wang

Contrastive learning-based agent modeling for deep reinforcement learning

Multi-agent systems often require agents to collaborate with or compete against other agents with diverse goals, behaviors, or strategies. Agent modeling is essential when designing adaptive policies for intelligent machine agents in…

Multiagent Systems · Computer Science 2025-07-29 Wenhao Ma , Yu-Cheng Chang , Jie Yang , Yu-Kai Wang , Chin-Teng Lin

Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games

Recent research on vulnerabilities of deep reinforcement learning (RL) has shown that adversarial policies adopted by an adversary agent can influence a target RL agent (victim agent) to perform poorly in a multi-agent environment. In…

Machine Learning · Computer Science 2022-11-01 The Viet Bui , Tien Mai , Thanh H. Nguyen

Fact-based Agent modeling for Multi-Agent Reinforcement Learning

In multi-agent systems, agents need to interact and collaborate with other agents in environments. Agent modeling is crucial to facilitate agent interactions and make adaptive cooperation strategies. However, it is challenging for agents to…

Artificial Intelligence · Computer Science 2023-10-20 Baofu Fang , Caiming Zheng , Hao Wang

Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

Many real-world multi-agent interactions consider multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore,…

Multiagent Systems · Computer Science 2020-11-17 Roxana Rădulescu , Timothy Verstraeten , Yijie Zhang , Patrick Mannion , Diederik M. Roijers , Ann Nowé

Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation

While large language models (LLMs) have emerged as powerful decision-makers across a wide range of single-agent and stationary environments, fewer efforts have been devoted to settings where LLMs must engage in \emph{repeated} and…

Multiagent Systems · Computer Science 2026-02-27 Xiangyu Liu , Di Wang , Zhe Feng , Aranyak Mehta

Opponent Modeling in Negotiation Dialogues by Related Data Adaptation

Opponent modeling is the task of inferring another party's mental state within the context of social interactions. In a multi-issue negotiation, it involves inferring the relative importance that the opponent assigns to each issue under…

Computation and Language · Computer Science 2022-05-04 Kushal Chawla , Gale M. Lucas , Jonathan May , Jonathan Gratch

Teaching Models to Balance Resisting and Accepting Persuasion

Large language models (LLMs) are susceptible to persuasion, which can pose risks when models are faced with an adversarial interlocutor. We take a first step towards defending models against persuasion while also arguing that defense…

Computation and Language · Computer Science 2025-02-11 Elias Stengel-Eskin , Peter Hase , Mohit Bansal

Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games

This paper presents an algorithmic framework for learning robust policies in asymmetric imperfect-information games, where the joint reward could depend on the uncertain opponent type (a private information known only to the opponent itself…

Artificial Intelligence · Computer Science 2020-03-05 Macheng Shen , Jonathan P. How

Policy-focused Agent-based Modeling using RL Behavioral Models

Agent-based Models (ABMs) are valuable tools for policy analysis. ABMs help analysts explore the emergent consequences of policy interventions in multi-agent decision-making settings. But the validity of inferences drawn from ABM…

Machine Learning · Computer Science 2020-11-09 Osonde A. Osoba , Raffaele Vardavas , Justin Grana , Rushil Zutshi , Amber Jaycocks

Provably Efficient Model-based Policy Adaptation

The high sample complexity of reinforcement learning challenges its use in practice. A promising approach is to quickly adapt pre-trained policies to new environments. Existing methods for this policy adaptation problem typically rely on…

Machine Learning · Computer Science 2020-06-16 Yuda Song , Aditi Mavalankar , Wen Sun , Sicun Gao