Related papers: Code World Models for General Game Playing

Distilling Game Code World Model Generation into Lightweight Large Language Models

Large Language Models (LLMs) have shown great ability in generating executable code from natural language, opening the possibility of automatically constructing environments for AI agents. Recent work on Code World Models (CWMs)…

Artificial Intelligence · Computer Science 2026-05-26 Tyrone Serapio , Arjun Prakash , Haoyang Xu , Kevin Wang , Amy Greenwald

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

As Large Language Models (LLMs) are integrated into critical real-world applications, their strategic and logical reasoning abilities are increasingly crucial. This paper evaluates LLMs' reasoning abilities in competitive environments…

Computation and Language · Computer Science 2024-06-11 Jinhao Duan , Renming Zhang , James Diffenderfer , Bhavya Kailkhura , Lichao Sun , Elias Stengel-Eskin , Mohit Bansal , Tianlong Chen , Kaidi Xu

Mastering Board Games by External and Internal Planning with Language Models

Advancing planning and reasoning capabilities of Large Language Models (LLMs) is one of the key prerequisites towards unlocking their potential for performing reliably in complex and impactful domains. In this paper, we aim to demonstrate…

Artificial Intelligence · Computer Science 2025-05-26 John Schultz , Jakub Adamek , Matej Jusup , Marc Lanctot , Michael Kaisers , Sarah Perrin , Daniel Hennes , Jeremy Shar , Cannada Lewis , Anian Ruoss , Tom Zahavy , Petar Veličković , Laurel Prince , Satinder Singh , Eric Malmi , Nenad Tomašev

Boardwalk: Towards a Framework for Creating Board Games with LLMs

Implementing board games in code can be a time-consuming task. However, Large Language Models (LLMs) have been proven effective at generating code for domain-specific tasks with simple contextual information. We aim to investigate whether…

Machine Learning · Computer Science 2025-11-10 Álvaro Guglielmin Becker , Gabriel Bauer de Oliveira , Lana Bertoldo Rossato , Anderson Rocha Tavares

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

This paper examines the reasoning capabilities of Large Language Models (LLMs) from a novel perspective, focusing on their ability to operate within formally specified, rule-governed environments. We evaluate four LLMs (Gemini 2.5 Pro and…

Artificial Intelligence · Computer Science 2026-02-24 Maciej Świechowski , Adam Żychowski , Jacek Mańdziuk

Word2World: Generating Stories and Worlds through Large Language Models

Large Language Models (LLMs) have proven their worth across a diverse spectrum of disciplines. LLMs have shown great potential in Procedural Content Generation (PCG) as well, but directly generating a level through a pre-trained LLM is…

Computation and Language · Computer Science 2024-05-14 Muhammad U. Nasir , Steven James , Julian Togelius

Real-Time World Crafting: Generating Structured Game Behaviors from Natural Language with Large Language Models

We present a novel architecture for safely integrating Large Language Models (LLMs) into interactive game engines, allowing players to "program" new behaviors using natural language. Our framework mitigates risks by using an LLM to…

Human-Computer Interaction · Computer Science 2025-10-21 Austin Drake , Hang Dong

Codenames as a Benchmark for Large Language Models

In this paper, we propose the use of the popular word-based board game Codenames as a suitable benchmark for evaluating the reasoning capabilities of Large Language Models (LLMs). Codenames presents a highly interesting challenge for…

Artificial Intelligence · Computer Science 2025-04-23 Matthew Stephenson , Matthew Sidji , Benoît Ronval

Can Large Language Models Play Games? A Case Study of A Self-Play Approach

Large Language Models (LLMs) harness extensive data from the Internet, storing a broad spectrum of prior knowledge. While LLMs have proven beneficial as decision-making aids, their reliability is hampered by limitations in reasoning,…

Artificial Intelligence · Computer Science 2024-03-12 Hongyi Guo , Zhihan Liu , Yufeng Zhang , Zhaoran Wang

Evaluating LLMs in Open-Source Games

Large Language Models' (LLMs) programming capabilities enable their participation in open-source games: a game-theoretic setting in which players submit computer programs in lieu of actions. These programs offer numerous advantages,…

Computer Science and Game Theory · Computer Science 2025-12-02 Swadesh Sistla , Max Kleiman-Weiner

Large Language Models Are Neurosymbolic Reasoners

A wide range of real-world applications is characterized by their symbolic nature, necessitating a strong capability for symbolic reasoning. This paper investigates the potential application of Large Language Models (LLMs) as symbolic…

Computation and Language · Computer Science 2024-01-18 Meng Fang , Shilong Deng , Yudi Zhang , Zijing Shi , Ling Chen , Mykola Pechenizkiy , Jun Wang

Large Language Models as Commonsense Knowledge for Large-Scale Task Planning

Large-scale task planning is a major challenge. Recent work exploits large language models (LLMs) directly as a policy and shows surprisingly interesting results. This paper shows that LLMs provide a commonsense model of the world in…

Robotics · Computer Science 2023-10-31 Zirui Zhao , Wee Sun Lee , David Hsu

Strategic Reasoning with Language Models

Strategic reasoning enables agents to cooperate, communicate, and compete with other agents in diverse situations. Existing approaches to solving strategic games rely on extensive training, yielding strategies that do not generalize to new…

Artificial Intelligence · Computer Science 2023-05-31 Kanishk Gandhi , Dorsa Sadigh , Noah D. Goodman

Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search

In this work we consider Code World Models, world models generated by a Large Language Model (LLM) in the form of Python code for model-based Reinforcement Learning (RL). Calling code instead of LLMs for planning has potential to be more…

Artificial Intelligence · Computer Science 2024-10-31 Nicola Dainese , Matteo Merler , Minttu Alakuijala , Pekka Marttinen

CWM: An Open-Weights LLM for Research on Code Generation with World Models

We release Code World Model (CWM), a 32-billion-parameter open-weights LLM, to advance research on code generation with world models. To improve code understanding beyond what can be learned from training on static code alone, we mid-train…

Software Engineering · Computer Science 2025-10-13 FAIR CodeGen team , Jade Copet , Quentin Carbonneaux , Gal Cohen , Jonas Gehring , Jacob Kahn , Jannik Kossen , Felix Kreuk , Emily McMilin , Michel Meyer , Yuxiang Wei , David Zhang , Kunhao Zheng , Jordi Armengol-Estapé , Pedram Bashiri , Maximilian Beck , Pierre Chambon , Abhishek Charnalia , Chris Cummins , Juliette Decugis , Zacharias V. Fisches , François Fleuret , Fabian Gloeckle , Alex Gu , Michael Hassid , Daniel Haziza , Badr Youbi Idrissi , Christian Keller , Rahul Kindi , Hugh Leather , Gallil Maimon , Aram Markosyan , Francisco Massa , Pierre-Emmanuel Mazaré , Vegard Mella , Naila Murray , Keyur Muzumdar , Peter O'Hearn , Matteo Pagliardini , Dmitrii Pedchenko , Tal Remez , Volker Seeker , Marco Selvi , Oren Sultan , Sida Wang , Luca Wehrstedt , Ori Yoran , Lingming Zhang , Taco Cohen , Yossi Adi , Gabriel Synnaeve

GAMEBoT: Transparent Assessment of LLM Reasoning in Games

Large Language Models (LLMs) are increasingly deployed in real-world applications that demand complex reasoning. To track progress, robust benchmarks are required to evaluate their capabilities beyond superficial pattern recognition.…

Computation and Language · Computer Science 2025-06-03 Wenye Lin , Jonathan Roberts , Yunhan Yang , Samuel Albanie , Zongqing Lu , Kai Han

LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning

In recent years, large language models (LLMs) have shown significant advancements in natural language processing (NLP), with strong capa-bilities in generation, comprehension, and rea-soning. These models have found applications in…

Artificial Intelligence · Computer Science 2025-04-02 Hui Wang

Can Large Language Models Serve as Rational Players in Game Theory? A Systematic Analysis

Game theory, as an analytical tool, is frequently utilized to analyze human behavior in social science research. With the high alignment between the behavior of Large Language Models (LLMs) and humans, a promising research direction is to…

Artificial Intelligence · Computer Science 2023-12-13 Caoyun Fan , Jindou Chen , Yaohui Jin , Hao He

From Code to Play: Benchmarking Program Search for Games Using Large Language Models

Large language models (LLMs) have shown impressive capabilities in generating program code, opening exciting opportunities for applying program synthesis to games. In this work, we explore the potential of LLMs to directly synthesize usable…

Artificial Intelligence · Computer Science 2025-07-16 Manuel Eberhardinger , James Goodman , Alexander Dockhorn , Diego Perez-Liebana , Raluca D. Gaina , Duygu Çakmak , Setareh Maghsudi , Simon Lucas

Economics Arena for Large Language Models

Large language models (LLMs) have been extensively used as the backbones for general-purpose agents, and some economics literature suggest that LLMs are capable of playing various types of economics games. Following these works, to overcome…

Computer Science and Game Theory · Computer Science 2024-01-04 Shangmin Guo , Haoran Bu , Haochuan Wang , Yi Ren , Dianbo Sui , Yuming Shang , Siting Lu