Related papers: Aligning Agentic World Models via Knowledgeable Ex…

200 papers

While large language models (LMs) have shown remarkable capabilities across numerous tasks, they often struggle with simple reasoning and planning in physical environments, such as understanding object permanence or planning household…

Computation and Language · Computer Science 2023-10-31 Jiannan Xiang, Tianhua Tao, Yi Gu, Tianmin Shu, Zirui Wang, Zichao Yang, Zhiting Hu

Large Language Model (LLM) agents have shown great potential in addressing real-world data science problems. LLM-driven data science agents promise to automate the entire machine learning pipeline, yet their real-world effectiveness remains…

Computation and Language · Computer Science 2025-10-09 Yixin Ou, Yujie Luo, Jingsheng Zheng, Lanning Wei, Zhuoyun Yu, Shuofei Qiao, Jintian Zhang, Da Zheng, Yuren Mao, Yunjun Gao, Huajun Chen, Ningyu Zhang

Recent endeavors towards directly using large language models (LLMs) as agent models to execute interactive planning tasks have shown commendable results. Despite their achievements, however, they still struggle with brainless…

Computation and Language · Computer Science 2025-01-06 Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

Agentic reinforcement learning increasingly relies on experience-driven scaling, yet real-world environments remain non-adaptive, limited in coverage, and difficult to scale. World models offer a potential way to improve learning efficiency…

Computation and Language · Computer Science 2026-03-06 Yixia Li, Hongru Wang, Jiahao Qiu, Zhenfei Yin, Dongdong Zhang, Cheng Qian, Zeping Li, Pony Ma, Guanhua Chen, Heng Ji

Can we build accurate world models out of large language models (LLMs)? How can world models benefit LLM agents? The gap between the prior knowledge of LLMs and the specified environment's dynamics usually bottlenecks LLMs' performance as…

Artificial Intelligence · Computer Science 2025-04-23 Siyu Zhou, Tianyi Zhou, Yijun Yang, Guodong Long, Deheng Ye, Jing Jiang, Chengqi Zhang

Can large language models (LLMs) directly serve as powerful world models for model-based agents? While the gaps between the prior knowledge of LLMs and the specified environment's dynamics do exist, our study reveals that the gaps can be…

Artificial Intelligence · Computer Science 2024-10-15 Siyu Zhou, Tianyi Zhou, Yijun Yang, Guodong Long, Deheng Ye, Jing Jiang, Chengqi Zhang

World models, which encapsulate the dynamics of how actions affect environments, are foundational to the functioning of intelligent agents. In this work, we explore the potential of Large Language Models (LLMs) to operate as world models.…

Computation and Language · Computer Science 2024-10-04 Kaige Xie, Ian Yang, John Gunerli, Mark Riedl

Large Language Models (LLMs) represent a landmark achievement in Artificial Intelligence (AI), demonstrating unprecedented proficiency in procedural tasks such as text generation, code completion, and conversational coherence. These…

Artificial Intelligence · Computer Science 2025-05-07 Schaun Wheeler, Olivier Jeunen

Large Language Models (LLMs) possess general world knowledge but often struggle to generate precise predictions in structured, domain-specific contexts such as simulations. These limitations arise from their inability to ground their broad,…

Artificial Intelligence · Computer Science 2026-01-30 Guillaume Levy, Cedric Colas, Pierre-Yves Oudeyer, Thomas Carta, Clement Romac

Large Language Models (LLMs) handle physical commonsense information inadequately. As a result of being trained in a disembodied setting, LLMs often fail to predict an action's outcome in a given environment. However, predicting the effects…

Computation and Language · Computer Science 2023-02-06 Gautier Dagan, Frank Keller, Alex Lascarides

Though large language models (LLMs) have enabled great success across a wide variety of tasks, they still appear to fall short of one of the loftier goals of artificial intelligence research: creating an artificial system that can adapt its…

Computation and Language · Computer Science 2026-05-04 Michael A. Lepori, Tal Linzen, Ann Yuan, Katja Filippova

Large language models (LLMs) are prone to hallucination stemming from misaligned self-awareness, particularly when processing queries exceeding their knowledge boundaries. While existing mitigation strategies employ uncertainty estimation…

Computation and Language · Computer Science 2025-10-10 Hang Zheng, Hongshen Xu, Yuncong Liu, Lu Chen, Pascale Fung, Kai Yu

While large language models (LLMs) excel in mathematical and code reasoning, we observe they struggle with social reasoning tasks, exhibiting cognitive confusion, logical inconsistencies, and conflation between objective world states and…

Computation and Language · Computer Science 2025-10-14 Jialu Du, Guiyang Hou, Yihui Fu, Chen Wu, Wenqi Zhang, Yongliang Shen, Weiming Lu

While Large Language Models (LLMs) possess significant capabilities in open-world agent tasks, they also face challenges in rapidly adapting to new, specialized tasks due to their reliance on static pre-trained knowledge. Traditional…

Computation and Language · Computer Science 2025-06-25 Kelin Fu, Kaigui Bian

Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the staleness and errors of their source corpora. Consequently, ensuring the reliability and malleability of these internal…

Computation and Language · Computer Science 2026-04-08 Xiaojie Gu, Ziying Huang, Weicong Hong, Jian Xie, Renze Lou, Kai Zhang

Large language model (LLM) agents trained using reinforcement learning have achieved superhuman performance in low-cost environments like games, mathematics, and coding. However, these successes have not translated to complex domains where…

Artificial Intelligence · Computer Science 2026-02-03 Sherry Yang

Recent advances in large language models (LLMs) have empowered autonomous agents to perform multi-turn interactions with tools and environments. However, scaling such agent training is limited by the lack of diverse and reliable environments.…

Artificial Intelligence · Computer Science 2026-05-26 Zhaoyang Wang, Canwen Xu, Boyi Liu, Yite Wang, Siwei Han, Zhewei Yao, Huaxiu Yao, Yuxiong He

Humanoid robot loco-manipulation remains constrained by the semantic-physical gap. Current methods face three limitations: low sample efficiency in reinforcement learning, poor generalization in imitation learning, and physical…

Robotics · Computer Science 2026-01-27 Yutong Shen, Hangxu Liu, Kailin Pei, Ruizhe Xia, Tongtong Feng

Large language models (LLMs) are increasingly being adopted as the cognitive core of embodied agents. However, inherited hallucinations, which stem from failures to ground user instructions in the observed physical environment, can lead to…

Recent large language models (LLMs) are promising for making decisions in grounded environments. However, LLMs frequently fail in complex decision-making tasks due to the misalignment between the pre-trained knowledge in LLMs and the actual…

Computation and Language · Computer Science 2023-10-27 Siqi Ouyang, Lei Li