Related papers: BrainMem: Brain-Inspired Evolving Memory for Embod…

Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration

An ideal embodied agent should possess lifelong learning capabilities to handle long-horizon and complex tasks, enabling continuous operation in general environments. This not only requires the agent to accurately accomplish given tasks but…

Artificial Intelligence · Computer Science 2026-03-24 Sen Wang , Bangwei Liu , Zhenkun Gao , Lizhuang Ma , Xuhong Wang , Yuan Xie , Xin Tan

3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model

Humans excel at performing complex tasks by leveraging long-term memory across temporal and spatial experiences. In contrast, current Large Language Models (LLMs) struggle to effectively plan and act in dynamic, multi-room 3D environments.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Wenbo Hu , Yining Hong , Yanjun Wang , Leison Gao , Zibu Wei , Xingcheng Yao , Nanyun Peng , Yonatan Bitton , Idan Szpektor , Kai-Wei Chang

EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence

The realization of Artificial General Intelligence (AGI) necessitates Embodied AI agents capable of robust spatial perception, effective task planning, and adaptive execution in physical environments. However, current large language models…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Ding Zou , Feifan Wang , Mengyu Ge , Siyuan Fan , Zongbing Zhang , Wei Chen , Lingfeng Wang , Zhongyou Hu , Wenrui Yan , Zhengwei Gao , Hao Wang , Weizhao Jin , Yu Zhang , Hainan Zhao , Mingliang Zhang , Xianxian Xi , Yaru Zhang , Wenyuan Li , Zhengguang Gao , Yurui Zhu

EvoMem: Improving Multi-Agent Planning with Dual-Evolving Memory

Planning has been a cornerstone of artificial intelligence for solving complex problems, and recent progress in LLM-based multi-agent frameworks have begun to extend this capability. However, the role of human-like memory within these…

Multiagent Systems · Computer Science 2025-12-09 Wenzhe Fan , Ning Yan , Masood Mortazavi

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Equipping agents with memory is essential for solving real-world long-horizon problems. However, most existing agent memory mechanisms rely on static and hand-crafted workflows. This limits the performance and generalization ability of…

Artificial Intelligence · Computer Science 2026-03-30 Yupeng Huo , Yaxi Lu , Zhong Zhang , Haotian Chen , Yankai Lin

HIMM: Human-Inspired Long-Term Memory Modeling for Embodied Exploration and Question Answering

Deploying Multimodal Large Language Models as the brain of embodied agents remains challenging, particularly under long-horizon observations and limited context budgets. Existing memory assisted methods often rely on textual summaries,…

Robotics · Computer Science 2026-03-03 Ji Li , Bo Wang , Jing Xia , Mingyi Li , Shiyan Hu

RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Interactive Environmental Learning in Physical Embodied Systems

Embodied intelligence aims to enable robots to learn, reason, and generalize robustly across complex real-world environments. However, existing approaches often struggle with partial observability, fragmented spatial reasoning, and…

Robotics · Computer Science 2026-03-25 Mingcong Lei , Honghao Cai , Yuyuan Yang , Yimou Wu , Jinke Ren , Zezhou Cui , Liangchen Tan , Junkun Hong , Gehan Hu , Shuangyu Zhu , Shaohan Jiang , Ge Wang , Junyuan Tan , Zhenglin Wan , Zheng Li , Zhen Li , Shuguang Cui , Yiming Zhao , Yatong Han

Embodied Agents Meet Personalization: Investigating Challenges and Solutions Through the Lens of Memory Utilization

LLM-powered embodied agents have shown success on conventional object-rearrangement tasks, but providing personalized assistance that leverages user-specific knowledge from past interactions presents new challenges. We investigate these…

Computation and Language · Computer Science 2026-02-16 Taeyoon Kwon , Dongwook Choi , Hyojun Kim , Sunghwan Kim , Seungjun Moon , Beong-woo Kwak , Kuan-Hao Huang , Jinyoung Yeo

BMAM: Brain-inspired Multi-Agent Memory Framework

Language-model-based agents operating over extended interaction horizons face persistent challenges in preserving temporally grounded information and maintaining behavioral consistency across sessions, a failure mode we term soul erosion.…

Computation and Language · Computer Science 2026-01-29 Yang Li , Jiaxiang Liu , Yusong Wang , Yujie Wu , Mingkun Xu

Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning

To enable embodied agents to operate effectively over extended timeframes, it is crucial to develop models that form and access memories to stay contextualized in their environment. In the current paradigm of training transformer-based…

Artificial Intelligence · Computer Science 2025-12-01 Gunshi Gupta , Karmesh Yadav , Zsolt Kira , Yarin Gal , Rahaf Aljundi

Hybrid Self-evolving Structured Memory for GUI Agents

The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet real-world computer-use tasks remain difficult due to long-horizon workflows, diverse interfaces, and…

Artificial Intelligence · Computer Science 2026-03-12 Sibo Zhu , Wenyi Wu , Kun Zhou , Stephen Wang , Biwei Huang

Learning Without Losing Identity: Capability Evolution for Embodied Agents

Embodied agents are expected to operate persistently in dynamic physical environments, continuously acquiring new capabilities over time. Existing approaches to improving agent performance often rely on modifying the agent itself -- through…

Robotics · Computer Science 2026-05-22 Xue Qin , Simin Luan , John See , Cong Yang , Zhijun Li

EvolMem: A Cognitive-Driven Benchmark for Multi-Session Dialogue Memory

Despite recent advances in understanding and leveraging long-range conversational memory, existing benchmarks still lack systematic evaluation of large language models(LLMs) across diverse memory dimensions, particularly in multi-session…

Computation and Language · Computer Science 2026-01-08 Ye Shen , Dun Pei , Yiqiu Guo , Junying Wang , Yijin Guo , Zicheng Zhang , Qi Jia , Jun Zhou , Guangtao Zhai

RenderMem: Rendering as Spatial Memory Retrieval

Embodied reasoning is inherently viewpoint-dependent: what is visible, occluded, or reachable depends critically on where the agent stands. However, existing spatial memory systems for embodied agents typically store either multi-view…

Artificial Intelligence · Computer Science 2026-03-17 JooHyun Park , HyeongYeop Kang

Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents

Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle long-term memory (LTM) and short-term…

Computation and Language · Computer Science 2026-05-01 Yi Yu , Liuyi Yao , Yuexiang Xie , Qingquan Tan , Jiaqi Feng , Yaliang Li , Libing Wu

STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning

A key objective of embodied intelligence is enabling agents to perform long-horizon tasks in dynamic environments while maintaining robust decision-making and adaptability. To achieve this goal, we propose the Spatio-Temporal Memory Agent…

Artificial Intelligence · Computer Science 2025-03-04 Mingcong Lei , Yiming Zhao , Ge Wang , Zhixin Mai , Shuguang Cui , Yatong Han , Jinke Ren

Learning to Learn from Multimodal Experience

Experience-driven learning has emerged as a promising paradigm for enabling agents to improve from interaction trajectories by accumulating and reusing past experience. However, existing approaches are predominantly developed in textual…

Artificial Intelligence · Computer Science 2026-05-19 Xingyu Sui , Weixiang Zhao , Yongxin Tang , Yanyan Zhao , Yang Wu , Dandan Tu , Bing Qin

Choosing How to Remember: Adaptive Memory Structures for LLM Agents

Memory is critical for enabling large language model (LLM) based agents to maintain coherent behavior over long-horizon interactions. However, existing agent memory systems suffer from two key gaps: they rely on a one-size-fits-all memory…

Artificial Intelligence · Computer Science 2026-02-17 Mingfei Lu , Mengjia Wu , Feng Liu , Jiawei Xu , Weikai Li , Haoyang Wang , Zhengdong Hu , Ying Ding , Yizhou Sun , Jie Lu , Yi Zhang

Reinforced Reasoning for Embodied Planning

Embodied planning requires agents to make coherent multi-step decisions based on dynamic visual observations and natural language goals. While recent vision-language models (VLMs) excel at static perception tasks, they struggle with the…

Artificial Intelligence · Computer Science 2025-07-15 Di Wu , Jiaxin Fan , Junzhe Zang , Guanbo Wang , Wei Yin , Wenhao Li , Bo Jin

EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making

Embodied decision-making enables agents to translate high-level goals into executable actions through continuous interactions within the physical world, forming a cornerstone of general-purpose embodied intelligence. Large language models…

Artificial Intelligence · Computer Science 2025-10-15 Zixing Lei , Sheng Yin , Yichen Xiong , Yuanzhuo Ding , Wenhao Huang , Yuxi Wei , Qingyao Xu , Yiming Li , Weixin Li , Yunhong Wang , Siheng Chen