Related papers: MEMRES: A Memory-Augmented Resolver with Confidenc…

MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents

Memory systems have been designed to leverage past experiences in Large Language Model (LLM) agents. However, many deployed memory systems primarily optimize compression and storage, with comparatively less emphasis on explicit, closed-loop…

Artificial Intelligence · Computer Science 2025-12-24 Xingbo Du , Loka Li , Duzhen Zhang , Le Song

MEME: Multi-entity & Evolving Memory Evaluation

LLM-based agents increasingly operate in persistent environments where they must store, update, and reason over information across many sessions. While prior benchmarks evaluate only single-entity updates, MEME defines six tasks spanning…

Machine Learning · Computer Science 2026-05-13 Seokwon Jung , Alexander Rubinstein , Arnas Uselis , Sangdoo Yun , Seong Joon Oh

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Memory is essential for large vision-language models (LVLMs) to handle long, multimodal interactions, with two method directions providing this capability: long-context LVLMs and memory-augmented agents. However, no existing benchmark…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Xiyu Ren , Zhaowei Wang , Yiming Du , Zhongwei Xie , Chi Liu , Xinlin Yang , Haoyue Feng , Wenjun Pan , Tianshi Zheng , Baixuan Xu , Zhengnan Li , Yangqiu Song , Ginny Wong , Simon See

Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

As large language models (LLMs) evolve into autonomous agents, persistent memory at the API layer is essential for enabling context-aware behavior across LLMs and multi-session interactions. Existing approaches force vendor lock-in and rely…

Machine Learning · Computer Science 2026-03-23 Luiz C. Borro , Luiz A. B. Macarini , Gordon Tindall , Michael Montero , Adam B. Struck

Memp: Exploring Agent Procedural Memory

Large Language Models (LLMs) based agents excel at diverse tasks, yet they suffer from brittle procedural memory that is manually engineered or entangled in static parameters. In this work, we investigate strategies to endow agents with a…

Computation and Language · Computer Science 2026-04-16 Runnan Fang , Yuan Liang , Xiaobin Wang , Jialong Wu , Shuofei Qiao , Pengjun Xie , Fei Huang , Huajun Chen , Ningyu Zhang

MemRepair: Hierarchical Memory for Agentic Repository-Level Vulnerability Repair

Modern software ecosystems face a rapidly growing number of disclosed vulnerabilities, increasing the need for automated repair techniques that can operate reliably at repository scale. Although Large Language Model (LLM)-based agents have…

Software Engineering · Computer Science 2026-05-19 Simiao Liu , Li Zhang , Fang Liu , Xiaoli Lian , Yang Liu , Yinghao Zhu

MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

Large Language Models (LLMs) have emerged as foundational infrastructure in the pursuit of Artificial General Intelligence (AGI). Despite their remarkable capabilities in language perception and generation, current LLMs fundamentally lack a…

Computation and Language · Computer Science 2025-05-29 Zhiyu Li , Shichao Song , Hanyu Wang , Simin Niu , Ding Chen , Jiawei Yang , Chenyang Xi , Huayi Lai , Jihao Zhao , Yezhaohui Wang , Junpeng Ren , Zehao Lin , Jiahao Huo , Tianyi Chen , Kai Chen , Kehang Li , Zhiqiang Yin , Qingchen Yu , Bo Tang , Hongkang Yang , Zhi-Qin John Xu , Feiyu Xiong

The Last Dependency Crusade: Solving Python Dependency Conflicts with LLMs

Resolving Python dependency issues remains a tedious and error-prone process, forcing developers to manually trial compatible module versions and interpreter configurations. Existing automated solutions, such as knowledge-graph-based and…

Software Engineering · Computer Science 2025-10-17 Antony Bartlett , Cynthia Liem , Annibale Panichella

MemRec: Collaborative Memory-Augmented Agentic Recommender System

The evolution of recommender systems has shifted from traditional collaborative filtering to LLM-based agentic systems, which rely on semantic user and item memories to make predictions. However, existing agents maintain these memories in…

Information Retrieval · Computer Science 2026-04-29 Weixin Chen , Yuhan Zhao , Jingyuan Huang , Zihe Ye , Clark Mingxuan Ju , Tong Zhao , Neil Shah , Li Chen , Yongfeng Zhang

Memory-Efficient Large Language Models for Program Repair with Semantic-Guided Patch Generation

In this paper, we first show that increases in beam size, even for small-sized LLMs (1B-7B params), require extensive GPU usage, leading to up to 80% of recurring crashes due to memory overloads in LLM-based APR. Seemingly simple solutions…

Software Engineering · Computer Science 2025-10-20 Thanh Le-Cong , Bach Le , Toby Murray

Hermes: Memory-Efficient Pipeline Inference for Large Models on Edge Devices

The application of Transformer-based large models has achieved numerous success in recent years. However, the exponential growth in the parameters of large models introduces formidable memory challenge for edge deployment. Prior works to…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-11 Xueyuan Han , Zinuo Cai , Yichu Zhang , Chongxin Fan , Junhan Liu , Ruhui Ma , Rajkumar Buyya

Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution

Procedural memory enables large language model (LLM) agents to internalize "how-to" knowledge, theoretically reducing redundant trial-and-error. However, existing frameworks predominantly suffer from a "passive accumulation" paradigm,…

Artificial Intelligence · Computer Science 2026-04-16 Zouying Cao , Jiaji Deng , Li Yu , Weikang Zhou , Zhaoyang Liu , Bolin Ding , Hai Zhao

AMEM4Rec: Leveraging Cross-User Similarity for Memory Evolution in Agentic LLM Recommenders

Agentic systems powered by Large Language Models (LLMs) have shown strong potential in recommender systems but remain hindered by several challenges. Fine-tuning LLMs is parameter-inefficient, and prompt-based agentic reasoning is limited…

Information Retrieval · Computer Science 2026-02-10 Minh-Duc Nguyen , Hai-Dang Kieu , Dung D. Le

Breaking the Dependency Chaos: A Constraint-Driven Python Dependency Resolution Strategy with Selective LLM Imputation

Dependency resolution is the task of selecting package versions that can be installed together without conflicts. It accounts for a significant share of build failures in modern software projects. In the Python ecosystem, this task is…

Software Engineering · Computer Science 2026-05-13 Kowshik Chowdhury , Dipayan Banik , Shazibul Islam Shamim

Agentic Recommender System with Hierarchical Belief-State Memory

Memory-augmented LLM agents have advanced personalized recommendation, yet existing approaches universally adopt flat memory representations that conflate ephemeral signals with stable preferences, and none provides a complete lifecycle…

Computation and Language · Computer Science 2026-05-18 Xiang Shen , Yuhang Zhou , Yifan Wu , Zhuokai Zhao , Siyu Lin , Lei Huang , Qianqian Zhong , Lizhu Zhang , Benyu Zhang , Xiangjun Fan , Hong Yan

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory

Multi-agent systems built on Large Language Models (LLMs) show exceptional promise for complex collaborative problem-solving, yet they face fundamental challenges stemming from context window limitations that impair memory consistency, role…

Artificial Intelligence · Computer Science 2026-01-13 Sizhe Yuen , Francisco Gomez Medina , Ting Su , Yali Du , Adam J. Sobey

MemVerse: Multimodal Memory for Lifelong Learning Agents

Despite rapid progress in large-scale language and vision models, AI agents still suffer from a fundamental limitation: they cannot remember. Without reliable memory, agents catastrophically forget past experiences, struggle with…

Artificial Intelligence · Computer Science 2025-12-04 Junming Liu , Yifei Sun , Weihua Cheng , Haodong Lei , Yirong Chen , Licheng Wen , Xuemeng Yang , Daocheng Fu , Pinlong Cai , Nianchen Deng , Yi Yu , Shuyue Hu , Botian Shi , Ding Wang

MemGym: a Long-Horizon Memory Environment for LLM Agents

Memory is a central capability for LLM agents operating across long-horizon tasks. Existing memory benchmarks predominantly evaluate retention of personalized information in multi-turn chat scenarios, overlooking the dynamic memory…

Computation and Language · Computer Science 2026-05-21 Wujiang Xu , Yu Wang , Kai Mei , Kaiqu Liang , Zhenting Wang , Mingyu Jin , Han Zhang , Shi-Xiong Zhang , Wenyue Hua , Sambit Sahu , Dimitris N. Metaxas

HiMeS: Hippocampus-inspired Memory System for Personalized AI Assistants

Large language models (LLMs) power many interactive systems such as chatbots, customer-service agents, and personal assistants. In knowledge-intensive scenarios requiring user-specific personalization, conventional retrieval-augmented…

Artificial Intelligence · Computer Science 2026-01-13 Hailong Li , Feifei Li , Wenhui Que , Xingyu Fan

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Multimodal Large Language Models (MLLMs) have demonstrated proficiency in handling a variety of visual-language tasks. However, current MLLM benchmarks are predominantly designed to evaluate reasoning based on static information about a…

Computer Vision and Pattern Recognition · Computer Science 2024-01-26 Xiyao Wang , Yuhang Zhou , Xiaoyu Liu , Hongjin Lu , Yuancheng Xu , Feihong He , Jaehong Yoon , Taixi Lu , Gedas Bertasius , Mohit Bansal , Huaxiu Yao , Furong Huang