Related papers: Aligning Agentic World Models via Knowledgeable Ex…

Language Models Meet World Models: Embodied Experiences Enhance Language Models

While large language models (LMs) have shown remarkable capabilities across numerous tasks, they often struggle with simple reasoning and planning in physical environments, such as understanding object permanence or planning household…

Computation and Language · Computer Science 2023-10-31 Jiannan Xiang , Tianhua Tao , Yi Gu , Tianmin Shu , Zirui Wang , Zichao Yang , Zhiting Hu

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Large Language Model (LLM) agents have shown great potential in addressing real-world data science problems. LLM-driven data science agents promise to automate the entire machine learning pipeline, yet their real-world effectiveness remains…

Computation and Language · Computer Science 2025-10-09 Yixin Ou , Yujie Luo , Jingsheng Zheng , Lanning Wei , Zhuoyun Yu , Shuofei Qiao , Jintian Zhang , Da Zheng , Yuren Mao , Yunjun Gao , Huajun Chen , Ningyu Zhang

Agent Planning with World Knowledge Model

Recent endeavors towards directly using large language models (LLMs) as agent models to execute interactive planning tasks have shown commendable results. Despite their achievements, however, they still struggle with brainless…

Computation and Language · Computer Science 2025-01-06 Shuofei Qiao , Runnan Fang , Ningyu Zhang , Yuqi Zhu , Xiang Chen , Shumin Deng , Yong Jiang , Pengjun Xie , Fei Huang , Huajun Chen

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Agentic reinforcement learning increasingly relies on experience-driven scaling, yet real-world environments remain non-adaptive, limited in coverage, and difficult to scale. World models offer a potential way to improve learning efficiency…

Computation and Language · Computer Science 2026-03-06 Yixia Li , Hongru Wang , Jiahao Qiu , Zhenfei Yin , Dongdong Zhang , Cheng Qian , Zeping Li , Pony Ma , Guanhua Chen , Heng Ji

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Can we build accurate world models out of large language models (LLMs)? How can world models benefit LLM agents? The gap between the prior knowledge of LLMs and the specified environment's dynamics usually bottlenecks LLMs' performance as…

Artificial Intelligence · Computer Science 2025-04-23 Siyu Zhou , Tianyi Zhou , Yijun Yang , Guodong Long , Deheng Ye , Jing Jiang , Chengqi Zhang

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

Can large language models (LLMs) directly serve as powerful world models for model-based agents? While the gaps between the prior knowledge of LLMs and the specified environment's dynamics do exist, our study reveals that the gaps can be…

Artificial Intelligence · Computer Science 2024-10-15 Siyu Zhou , Tianyi Zhou , Yijun Yang , Guodong Long , Deheng Ye , Jing Jiang , Chengqi Zhang

Making Large Language Models into World Models with Precondition and Effect Knowledge

World models, which encapsulate the dynamics of how actions affect environments, are foundational to the functioning of intelligent agents. In this work, we explore the potential of Large Language Models (LLMs) to operate as world models.…

Computation and Language · Computer Science 2024-10-04 Kaige Xie , Ian Yang , John Gunerli , Mark Riedl

Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents

Large Language Models (LLMs) represent a landmark achievement in Artificial Intelligence (AI), demonstrating unprecedented proficiency in procedural tasks such as text generation, code completion, and conversational coherence. These…

Artificial Intelligence · Computer Science 2025-05-07 Schaun Wheeler , Olivier Jeunen

WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making

Large Language Models (LLMs) possess general world knowledge but often struggle to generate precise predictions in structured, domain-specific contexts such as simulations. These limitations arise from their inability to ground their broad,…

Artificial Intelligence · Computer Science 2026-01-30 Guillaume Levy , Cedric Colas , Pierre-Yves Oudeyer , Thomas Carta , Clement Romac

Learning the Effects of Physical Actions in a Multi-modal Environment

Large Language Models (LLMs) handle physical commonsense information inadequately. As a result of being trained in a disembodied setting, LLMs often fail to predict an action's outcome in a given environment. However, predicting the effects…

Computation and Language · Computer Science 2023-02-06 Gautier Dagan , Frank Keller , Alex Lascarides

Language Models Struggle to Use Representations Learned In-Context

Though large language models (LLMs) have enabled great success across a wide variety of tasks, they still appear to fall short of one of the loftier goals of artificial intelligence research: creating an artificial system that can adapt its…

Computation and Language · Computer Science 2026-05-04 Michael A. Lepori , Tal Linzen , Ann Yuan , Katja Filippova

Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling

Large language models (LLMs) are prone to hallucination stemming from misaligned self-awareness, particularly when processing queries exceeding their knowledge boundaries. While existing mitigation strategies employ uncertainty estimation…

Computation and Language · Computer Science 2025-10-10 Hang Zheng , Hongshen Xu , Yuncong Liu , Lu Chen , Pascale Fung , Kai Yu

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

While large language models (LLMs) excel in mathematical and code reasoning, we observe they struggle with social reasoning tasks, exhibiting cognitive confusion, logical inconsistencies, and conflation between objective world states and…

Computation and Language · Computer Science 2025-10-14 Jialu Du , Guiyang Hou , Yihui Fu , Chen Wu , Wenqi Zhang , Yongliang Shen , Weiming Lu

KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs

While Large Language Models (LLMs) possess significant capabilities in open-world agent tasks, they also face challenges in rapidly adapting to new, specialized tasks due to their reliance on static pre-trained knowledge. Traditional…

Computation and Language · Computer Science 2025-06-25 Kelin Fu , Kaigui Bian

The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models

Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the staleness and errors of their source corpora. Consequently, ensuring the reliability and malleability of these internal…

Computation and Language · Computer Science 2026-04-08 Xiaojie Gu , Ziying Huang , Weicong Hong , Jian Xie , Renze Lou , Kai Zhang

World Models as an Intermediary between Agents and the Real World

Large language model (LLM) agents trained using reinforcement learning has achieved superhuman performance in low-cost environments like games, mathematics, and coding. However, these successes have not translated to complex domains where…

Artificial Intelligence · Computer Science 2026-02-03 Sherry Yang

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Recent advances in large language model (LLM) have empowered autonomous agents to perform multi-turn interactions with tools and environments. However, scaling such agent training is limited by the lack of diverse and reliable environments.…

Artificial Intelligence · Computer Science 2026-05-26 Zhaoyang Wang , Canwen Xu , Boyi Liu , Yite Wang , Siwei Han , Zhewei Yao , Huaxiu Yao , Yuxiong He

MetaWorld: Skill Transfer and Composition in a Hierarchical World Model for Grounding High-Level Instructions

Humanoid robot loco-manipulation remains constrained by the semantic-physical gap. Current methods face three limitations: Low sample efficiency in reinforcement learning, poor generalization in imitation learning, and physical…

Robotics · Computer Science 2026-01-27 Yutong Shen , Hangxu Liu , Kailin Pei , Ruizhe Xia , Tongtong Feng

HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models

Large language models (LLMs) are increasingly being adopted as the cognitive core of embodied agents. However, inherited hallucinations, which stem from failures to ground user instructions in the observed physical environment, can lead to…

Machine Learning · Computer Science 2025-10-15 Trishna Chakraborty , Udita Ghosh , Xiaopan Zhang , Fahim Faisal Niloy , Yue Dong , Jiachen Li , Amit K. Roy-Chowdhury , Chengyu Song

AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models

Recent large language models (LLMs) are promising for making decisions in grounded environments. However, LLMs frequently fail in complex decision-making tasks due to the misalignment between the pre-trained knowledge in LLMs and the actual…

Computation and Language · Computer Science 2023-10-27 Siqi Ouyang , Lei Li