Kate Larson — Scifaro

Information and Contract Design for Repeated Interactions between Agents with Misaligned Incentives

We study the consequences of information asymmetries and misaligned incentives in settings with multiple independent agents. We model an interaction between a Sender, who holds vital private information but cannot act, and a Receiver, who…

Multiagent Systems · Computer Science 2026-05-13 Nanda Kishore Sreenivas , Kate Larson

Your Recourse, My Loss? Algorithmic Recourse under Shared Constraints

Decision makers are increasingly relying on machine learning in sensitive situations. Algorithmic recourse aims to provide individuals with actionable and minimally costly steps to reverse unfavorable AI-driven decisions. While existing…

Artificial Intelligence · Computer Science 2026-05-12 Zahra Khotanlou , Kate Larson , Amir-Hossein Karimi

Nash without Numbers: A Social Choice Approach to Mixed Equilibria in Context-Ordinal Games

Nash equilibrium serves as a fundamental mathematical tool in economics and game theory. However, it classically assumes knowledge of player utilities, whereas economics generally regards preferences as more fundamental. To leverage…

Computer Science and Game Theory · Computer Science 2026-05-11 Ian Gemp , Crystal Qian , Marc Lanctot , Kate Larson

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

Recursive retraining of generative models poses a critical representation challenge: when synthetic outputs are curated based on a fixed reward signal, the model tends to collapse onto a narrow set of outputs that over-optimize that…

Machine Learning · Computer Science 2026-05-11 Ali Falahati , Mohammad Mohammadi Amiri , Kate Larson , Lukasz Golab

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents' strategies, and exploiting this opponent model by playing a best response. However, existing approaches typically require…

Artificial Intelligence · Computer Science 2026-04-07 Zun Li , Marc Lanctot , Kevin R. McKee , Luke Marris , Ian Gemp , Daniel Hennes , Paul Muller , Kate Larson , Yoram Bachrach , Michael P. Wellman

Active Evaluation of General Agents: Problem Definition and Comparison of Baseline Algorithms

As intelligent agents become more generally-capable, i.e. able to master a wide variety of tasks, the complexity and cost of properly evaluating them rises significantly. Tasks that assess specific capabilities of the agents can be…

Artificial Intelligence · Computer Science 2026-02-12 Marc Lanctot , Kate Larson , Ian Gemp , Michael Kaisers

Revealed Multi-Objective Utility Aggregation in Human Driving

A central design problem in game theoretic analysis is the estimation of the players' utilities. In many real-world interactive situations of human decision making, including human driving, the utilities are multi-objective in nature;…

Artificial Intelligence · Computer Science 2026-02-02 Atrisha Sarkar , Kate Larson , Krzysztof Czarnecki

Generalized dynamic cognitive hierarchy models for strategic driving behavior

While there has been an increasing focus on the use of game theoretic models for autonomous driving, empirical evidence shows that there are still open questions around dealing with the challenges of common knowledge assumptions as well as…

Artificial Intelligence · Computer Science 2026-02-02 Atrisha Sarkar , Kate Larson , Krzysztof Czarnecki

Procedural Fairness in Multi-Agent Bandits

In the context of multi-agent multi-armed bandits (MA-MAB), fairness is often reduced to outcomes: maximizing welfare, reducing inequality, or balancing utilities. However, evidence in psychology, economics, and Rawlsian theory suggests…

Multiagent Systems · Computer Science 2026-01-16 Joshua Caiata , Carter Blair , Kate Larson

Imagining and building wise machines: The centrality of AI metacognition

Although AI has become increasingly smart, its wisdom has not kept pace. In this article, we examine what is known about human wisdom and sketch a vision of its AI counterpart. We analyze human wisdom as a set of strategies for solving…

Artificial Intelligence · Computer Science 2026-01-08 Samuel G. B. Johnson , Amir-Hossein Karimi , Yoshua Bengio , Nick Chater , Tobias Gerstenberg , Kate Larson , Sydney Levine , Melanie Mitchell , Iyad Rahwan , Bernhard Schölkopf , Igor Grossmann

The Alignment Game: A Theory of Long-Horizon Alignment Through Recursive Curation

In self-consuming generative models that train on their own outputs, alignment with user preferences becomes a recursive rather than one-time process. We provide the first formal foundation for analyzing the long-term effects of such…

Machine Learning · Computer Science 2025-11-18 Ali Falahati , Mohammad Mohammadi Amiri , Kate Larson , Lukasz Golab

Generating Fair Consensus Statements with Social Choice on Token-Level MDPs

Current frameworks for consensus statement generation with large language models lack the inherent structure needed to provide provable fairness guarantees when aggregating diverse free-form opinions. We model the task as a multi-objective,…

Artificial Intelligence · Computer Science 2025-10-17 Carter Blair , Kate Larson

What Voting Rules Actually Do: A Data-Driven Analysis of Multi-Winner Voting

Committee-selection problems arise in many contexts and applications, and there has been increasing interest within the social choice research community on identifying which properties are satisfied by different multi-winner voting rules.…

Artificial Intelligence · Computer Science 2025-08-11 Joshua Caiata , Ben Armstrong , Kate Larson

Evaluating Agents using Social Choice Theory

We argue that many general evaluation problems can be viewed through the lens of voting theory. Each task is interpreted as a separate voter, which requires only ordinal rankings or pairwise comparisons of agents to produce an overall…

Artificial Intelligence · Computer Science 2025-07-01 Marc Lanctot , Kate Larson , Yoram Bachrach , Luke Marris , Zun Li , Avishkar Bhoopchand , Thomas Anthony , Brian Tanner , Anna Koop

Soft Condorcet Optimization for Ranking of General Agents

Driving progress of AI models and agents requires comparing their performance on standardized benchmarks; for general agents, individual performances must be aggregated across a potentially wide variety of different tasks. In this paper, we…

Multiagent Systems · Computer Science 2025-06-30 Marc Lanctot , Kate Larson , Michael Kaisers , Quentin Berthet , Ian Gemp , Manfred Diaz , Roberto-Rafael Maura-Rivero , Yoram Bachrach , Anna Koop , Doina Precup

Reflective Verbal Reward Design for Pluralistic Alignment

AI agents are commonly aligned with "human values" through reinforcement learning from human feedback (RLHF), where a single reward model is learned from aggregated human feedback and used to align an agent's behavior. However, human values…

Artificial Intelligence · Computer Science 2025-06-24 Carter Blair , Kate Larson , Edith Law

Multi-Agent Risks from Advanced AI

The rapid development of advanced AI agents and the imminent deployment of many instances of these agents will give rise to multi-agent systems of unprecedented complexity. These systems pose novel and under-explored risks. In this report,…

Multiagent Systems · Computer Science 2025-02-21 Lewis Hammond , Alan Chan , Jesse Clifton , Jason Hoelscher-Obermaier , Akbir Khan , Euan McLean , Chandler Smith , Wolfram Barfuss , Jakob Foerster , Tomáš Gavenčiak , The Anh Han , Edward Hughes , Vojtěch Kovařík , Jan Kulveit , Joel Z. Leibo , Caspar Oesterheld , Christian Schroeder de Witt , Nisarg Shah , Michael Wellman , Paolo Bova , Theodor Cimpeanu , Carson Ezell , Quentin Feuillade-Montixi , Matija Franklin , Esben Kran , Igor Krawczuk , Max Lamparth , Niklas Lauffer , Alexander Meinke , Sumeet Motwani , Anka Reuel , Vincent Conitzer , Michael Dennis , Iason Gabriel , Adam Gleave , Gillian Hadfield , Nika Haghtalab , Atoosa Kasirzadeh , Sébastien Krier , Kate Larson , Joel Lehman , David C. Parkes , Georgios Piliouras , Iyad Rahwan

Jackpot! Alignment as a Maximal Lottery

Reinforcement Learning from Human Feedback (RLHF), the standard for aligning Large Language Models (LLMs) with human values, is known to fail to satisfy properties that are intuitively desirable, such as respecting the preferences of the…

Artificial Intelligence · Computer Science 2025-02-03 Roberto-Rafael Maura-Rivero , Marc Lanctot , Francesco Visin , Kate Larson

Democratizing Reward Design for Personal and Representative Value-Alignment

Aligning AI agents with human values is challenging due to diverse and subjective notions of values. Standard alignment methods often aggregate crowd feedback, which can result in the suppression of unique or minority preferences. We…

Artificial Intelligence · Computer Science 2024-10-30 Carter Blair , Kate Larson , Edith Law

Liquid Ensemble Selection for Continual Learning

Continual learning aims to enable machine learning models to continually learn from a shifting data distribution without forgetting what has already been learned. Such shifting distributions can be broken into disjoint subsets of related…

Machine Learning · Computer Science 2024-07-29 Carter Blair , Ben Armstrong , Kate Larson