English
Related papers

Related papers: BrainMem: Brain-Inspired Evolving Memory for Embod…

200 papers

An ideal embodied agent should possess lifelong learning capabilities to handle long-horizon and complex tasks, enabling continuous operation in general environments. This not only requires the agent to accurately accomplish given tasks but…

Artificial Intelligence · Computer Science 2026-03-24 Sen Wang , Bangwei Liu , Zhenkun Gao , Lizhuang Ma , Xuhong Wang , Yuan Xie , Xin Tan

Humans excel at performing complex tasks by leveraging long-term memory across temporal and spatial experiences. In contrast, current Large Language Models (LLMs) struggle to effectively plan and act in dynamic, multi-room 3D environments.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Wenbo Hu , Yining Hong , Yanjun Wang , Leison Gao , Zibu Wei , Xingcheng Yao , Nanyun Peng , Yonatan Bitton , Idan Szpektor , Kai-Wei Chang

The realization of Artificial General Intelligence (AGI) necessitates Embodied AI agents capable of robust spatial perception, effective task planning, and adaptive execution in physical environments. However, current large language models…

Planning has been a cornerstone of artificial intelligence for solving complex problems, and recent progress in LLM-based multi-agent frameworks have begun to extend this capability. However, the role of human-like memory within these…

Multiagent Systems · Computer Science 2025-12-09 Wenzhe Fan , Ning Yan , Masood Mortazavi

Equipping agents with memory is essential for solving real-world long-horizon problems. However, most existing agent memory mechanisms rely on static and hand-crafted workflows. This limits the performance and generalization ability of…

Artificial Intelligence · Computer Science 2026-03-30 Yupeng Huo , Yaxi Lu , Zhong Zhang , Haotian Chen , Yankai Lin

Deploying Multimodal Large Language Models as the brain of embodied agents remains challenging, particularly under long-horizon observations and limited context budgets. Existing memory assisted methods often rely on textual summaries,…

Robotics · Computer Science 2026-03-03 Ji Li , Bo Wang , Jing Xia , Mingyi Li , Shiyan Hu

Embodied intelligence aims to enable robots to learn, reason, and generalize robustly across complex real-world environments. However, existing approaches often struggle with partial observability, fragmented spatial reasoning, and…

LLM-powered embodied agents have shown success on conventional object-rearrangement tasks, but providing personalized assistance that leverages user-specific knowledge from past interactions presents new challenges. We investigate these…

Computation and Language · Computer Science 2026-02-16 Taeyoon Kwon , Dongwook Choi , Hyojun Kim , Sunghwan Kim , Seungjun Moon , Beong-woo Kwak , Kuan-Hao Huang , Jinyoung Yeo

Language-model-based agents operating over extended interaction horizons face persistent challenges in preserving temporally grounded information and maintaining behavioral consistency across sessions, a failure mode we term soul erosion.…

Computation and Language · Computer Science 2026-01-29 Yang Li , Jiaxiang Liu , Yusong Wang , Yujie Wu , Mingkun Xu

To enable embodied agents to operate effectively over extended timeframes, it is crucial to develop models that form and access memories to stay contextualized in their environment. In the current paradigm of training transformer-based…

Artificial Intelligence · Computer Science 2025-12-01 Gunshi Gupta , Karmesh Yadav , Zsolt Kira , Yarin Gal , Rahaf Aljundi

The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet real-world computer-use tasks remain difficult due to long-horizon workflows, diverse interfaces, and…

Artificial Intelligence · Computer Science 2026-03-12 Sibo Zhu , Wenyi Wu , Kun Zhou , Stephen Wang , Biwei Huang

Embodied agents are expected to operate persistently in dynamic physical environments, continuously acquiring new capabilities over time. Existing approaches to improving agent performance often rely on modifying the agent itself -- through…

Robotics · Computer Science 2026-05-22 Xue Qin , Simin Luan , John See , Cong Yang , Zhijun Li

Despite recent advances in understanding and leveraging long-range conversational memory, existing benchmarks still lack systematic evaluation of large language models(LLMs) across diverse memory dimensions, particularly in multi-session…

Computation and Language · Computer Science 2026-01-08 Ye Shen , Dun Pei , Yiqiu Guo , Junying Wang , Yijin Guo , Zicheng Zhang , Qi Jia , Jun Zhou , Guangtao Zhai

Embodied reasoning is inherently viewpoint-dependent: what is visible, occluded, or reachable depends critically on where the agent stands. However, existing spatial memory systems for embodied agents typically store either multi-view…

Artificial Intelligence · Computer Science 2026-03-17 JooHyun Park , HyeongYeop Kang

Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle long-term memory (LTM) and short-term…

Computation and Language · Computer Science 2026-05-01 Yi Yu , Liuyi Yao , Yuexiang Xie , Qingquan Tan , Jiaqi Feng , Yaliang Li , Libing Wu

A key objective of embodied intelligence is enabling agents to perform long-horizon tasks in dynamic environments while maintaining robust decision-making and adaptability. To achieve this goal, we propose the Spatio-Temporal Memory Agent…

Artificial Intelligence · Computer Science 2025-03-04 Mingcong Lei , Yiming Zhao , Ge Wang , Zhixin Mai , Shuguang Cui , Yatong Han , Jinke Ren

Experience-driven learning has emerged as a promising paradigm for enabling agents to improve from interaction trajectories by accumulating and reusing past experience. However, existing approaches are predominantly developed in textual…

Artificial Intelligence · Computer Science 2026-05-19 Xingyu Sui , Weixiang Zhao , Yongxin Tang , Yanyan Zhao , Yang Wu , Dandan Tu , Bing Qin

Memory is critical for enabling large language model (LLM) based agents to maintain coherent behavior over long-horizon interactions. However, existing agent memory systems suffer from two key gaps: they rely on a one-size-fits-all memory…

Artificial Intelligence · Computer Science 2026-02-17 Mingfei Lu , Mengjia Wu , Feng Liu , Jiawei Xu , Weikai Li , Haoyang Wang , Zhengdong Hu , Ying Ding , Yizhou Sun , Jie Lu , Yi Zhang

Embodied planning requires agents to make coherent multi-step decisions based on dynamic visual observations and natural language goals. While recent vision-language models (VLMs) excel at static perception tasks, they struggle with the…

Artificial Intelligence · Computer Science 2025-07-15 Di Wu , Jiaxin Fan , Junzhe Zang , Guanbo Wang , Wei Yin , Wenhao Li , Bo Jin

Embodied decision-making enables agents to translate high-level goals into executable actions through continuous interactions within the physical world, forming a cornerstone of general-purpose embodied intelligence. Large language models…

Artificial Intelligence · Computer Science 2025-10-15 Zixing Lei , Sheng Yin , Yichen Xiong , Yuanzhuo Ding , Wenhao Huang , Yuxi Wei , Qingyao Xu , Yiming Li , Weixin Li , Yunhong Wang , Siheng Chen
‹ Prev 1 2 3 10 Next ›