Related papers: Learning to Bluff

The emergence of bluff in poker-like games

We present a couple of adaptive learning models of poker-like games, by means of which we show how bluffing strategies emerge very naturally, and can also be rational and evolutively stable. Despite their very simple learning algorithms,…

Physics and Society · Physics 2009-01-23 Andrea Guazzini , Daniele Vilone

Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game

Learning how to adapt to complex and dynamic environments is one of the most important factors that contribute to our intelligence. Endowing artificial agents with this ability is not a simple task, particularly in competitive scenarios. In…

Artificial Intelligence · Computer Science 2020-04-09 Pablo Barros , Ana Tanevska , Alessandra Sciutti

Adaptive Mechanism Design: Learning to Promote Cooperation

In the future, artificial learning agents are likely to become increasingly widespread in our society. They will interact with both other learning agents and humans in a variety of complex settings including social dilemmas. We consider the…

Computer Science and Game Theory · Computer Science 2019-11-21 Tobias Baumann , Thore Graepel , John Shawe-Taylor

Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer

In many machine learning applications, there are multiple decision-makers involved, both automated and human. The interaction between these agents often goes unaddressed in algorithmic development. In this work, we explore a simple version…

Machine Learning · Statistics 2018-09-10 David Madras , Toniann Pitassi , Richard Zemel

Learning to Assist Agents by Observing Them

The ability of an AI agent to assist other agents, such as humans, is an important and challenging goal, which requires the assisting agent to reason about the behavior and infer the goals of the assisted agent. Training such an ability by…

Artificial Intelligence · Computer Science 2021-10-05 Antti Keurulainen , Isak Westerlund , Samuel Kaski , Alexander Ilin

An architecture for the evaluation of intelligent systems

One of the main research areas in Artificial Intelligence is the coding of agents (programs) which are able to learn by themselves in any situation. This means that agents must be useful for purposes other than those they were created for,…

Artificial Intelligence · Computer Science 2011-02-04 Javier Insa-Cabrera , Jose Hernandez-Orallo

How and Why to Manipulate Your Own Agent: On the Incentives of Users of Learning Agents

The usage of automated learning agents is becoming increasingly prevalent in many online economic applications such as online auctions and automated trading. Motivated by such applications, this paper is dedicated to fundamental modeling…

Computer Science and Game Theory · Computer Science 2023-01-04 Yoav Kolumbus , Noam Nisan

Learning Altruistic Behaviours in Reinforcement Learning without External Rewards

Can artificial agents learn to assist others in achieving their goals without knowing what those goals are? Generic reinforcement learning agents could be trained to behave altruistically towards others by rewarding them for altruistic…

Artificial Intelligence · Computer Science 2022-03-22 Tim Franzmeyer , Mateusz Malinowski , João F. Henriques

Learning Not to Spoof

As intelligent trading agents based on reinforcement learning (RL) gain prevalence, it becomes more important to ensure that RL agents obey laws, regulations, and human behavioral expectations. There is substantial literature concerning the…

Machine Learning · Computer Science 2023-06-12 David Byrd

Learning to Lie: Reinforcement Learning Attacks Damage Human-AI Teams and Teams of LLMs

As artificial intelligence (AI) assistants become more widely adopted in safety-critical domains, it becomes important to develop safeguards against potential failures or adversarial attacks. A key prerequisite to developing these…

Human-Computer Interaction · Computer Science 2025-04-04 Abed Kareem Musaffar , Anand Gokhale , Sirui Zeng , Rasta Tadayon , Xifeng Yan , Ambuj Singh , Francesco Bullo

Honesty Is the Best Policy: Defining and Mitigating AI Deception

Deceptive agents are a challenge for the safety, trustworthiness, and cooperation of AI systems. We focus on the problem that agents might deceive in order to achieve their goals (for instance, in our experiments with language models, the…

Artificial Intelligence · Computer Science 2023-12-05 Francis Rhys Ward , Francesco Belardinelli , Francesca Toni , Tom Everitt

Playing the Werewolf game with artificial intelligence for language understanding

The Werewolf game is a social deduction game based on free natural language communication, in which players try to deceive others in order to survive. An important feature of this game is that a large portion of the conversations are false…

Artificial Intelligence · Computer Science 2023-02-22 Hisaichi Shibata , Soichiro Miki , Yuta Nakamura

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game

Agents built with large language models (LLMs) have shown great potential across a wide range of domains. However, in complex decision-making tasks, pure LLM-based agents tend to exhibit intrinsic bias in their choice of actions, which is…

Artificial Intelligence · Computer Science 2025-05-30 Zelai Xu , Chao Yu , Fei Fang , Yu Wang , Yi Wu

Fairness and Deception in Human Interactions with Artificial Agents

Online information ecosystems are now central to our everyday social interactions. Of the many opportunities and challenges this presents, the capacity for artificial agents to shape individual and collective human decision-making in such…

Populations and Evolution · Quantitative Biology 2023-12-07 Theodor Cimpeanu , Alexander J. Stewart

Learning in two-player games between transparent opponents

We consider a scenario in which two reinforcement learning agents repeatedly play a matrix game against each other and update their parameters after each round. The agents' decision-making is transparent to each other, which allows each…

Artificial Intelligence · Computer Science 2021-08-23 Adrian Hutter

When to (or not to) trust intelligent machines: Insights from an evolutionary game theory analysis of trust in repeated games

The actions of intelligent agents, such as chatbots, recommender systems, and virtual assistants are typically not fully transparent to the user. Consequently, using such an agent involves the user exposing themselves to the risk that the…

Computer Science and Game Theory · Computer Science 2020-07-23 The Anh Han , Cedric Perret , Simon T. Powers

Manipulating a Learning Defender and Ways to Counteract

In Stackelberg security games when information about the attacker's payoffs is uncertain, algorithms have been proposed to learn the optimal defender commitment by interacting with the attacker and observing their best responses. In this…

Computer Science and Game Theory · Computer Science 2019-11-01 Jiarui Gan , Qingyu Guo , Long Tran-Thanh , Bo An , Michael Wooldridge

Learning Latent Representations to Influence Multi-Agent Interaction

Seamlessly interacting with humans or robots is hard because these agents are non-stationary. They update their policy in response to the ego agent's behavior, and the ego agent must anticipate these changes to co-adapt. Inspired by humans,…

Robotics · Computer Science 2020-11-16 Annie Xie , Dylan P. Losey , Ryan Tolsma , Chelsea Finn , Dorsa Sadigh

Don't lie to your friends: Learning what you know from collaborative self-play

To be helpful assistants, AI agents must be aware of their own capabilities and limitations. This includes knowing when to answer from parametric knowledge versus using tools, when to trust tool outputs, and when to abstain or hedge. Such…

Machine Learning · Computer Science 2025-09-01 Jacob Eisenstein , Reza Aghajani , Adam Fisch , Dheeru Dua , Fantine Huot , Mirella Lapata , Vicky Zayats , Jonathan Berant

Toward Agents That Reason About Their Computation

While reinforcement learning agents can achieve superhuman performance in many complex tasks, they typically do not become more computationally efficient as they improve. In contrast, humans gradually require less cognitive effort as they…

Artificial Intelligence · Computer Science 2025-10-28 Adrian Orenstein , Jessica Chen , Gwyneth Anne Delos Santos , Bayley Sapara , Michael Bowling