Related papers: CodeMEM: AST-Guided Adaptive Memory for Repository…

CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models

Large language models (LLMs) excel at single-turn reasoning but often lose accuracy and coherence over extended, multi-turn interactions. Recent evaluations such as TurnBench highlight recurring failure modes-reasoning bias, task drift,…

Computation and Language · Computer Science 2025-12-17 Yiran Zhang , Jincheng Hu , Mark Dras , Usman Naseem

LightMem: Lightweight and Efficient Memory-Augmented Generation

Despite their remarkable capabilities, Large Language Models (LLMs) struggle to effectively leverage historical interaction information in dynamic and complex environments. Memory systems enable LLMs to move beyond stateless interactions by…

Computation and Language · Computer Science 2026-03-03 Jizhan Fang , Xinle Deng , Haoming Xu , Ziyan Jiang , Yuqi Tang , Ziwen Xu , Shumin Deng , Yunzhi Yao , Mengru Wang , Shuofei Qiao , Huajun Chen , Ningyu Zhang

Augmenting Language Models with Long-Term Memory

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models…

Computation and Language · Computer Science 2023-06-13 Weizhi Wang , Li Dong , Hao Cheng , Xiaodong Liu , Xifeng Yan , Jianfeng Gao , Furu Wei

TeleMem: Building Long-Term and Multimodal Memory for Agentic AI

Large language models (LLMs) excel at many NLP tasks but struggle to sustain long-term interactions due to limited attention over extended dialogue histories. Retrieval-augmented generation (RAG) mitigates this issue but lacks reliable…

Computation and Language · Computer Science 2026-01-23 Chunliang Chen , Ming Guan , Xiao Lin , Jiaxu Li , Luxi Lin , Qiyi Wang , Xiangyu Chen , Jixiang Luo , Changzhi Sun , Dell Zhang , Xuelong Li

FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

Large language models (LLMs) excel in generating coherent text, but they often struggle with context awareness, leading to inaccuracies in tasks requiring faithful adherence to provided information. We introduce FastMem, a novel method…

Computation and Language · Computer Science 2024-10-08 Junyi Zhu , Shuochen Liu , Yu Yu , Bo Tang , Yibo Yan , Zhiyu Li , Feiyu Xiong , Tong Xu , Matthew B. Blaschko

CALMem : Application-Layer Dual Memory for Conversational AI

Large language models (LLMs) operate within fixed context windows that fundamentally limit conversational continuity. When context fills, compaction discards history irreversibly; when sessions end, all memory resets to zero. Existing…

Information Retrieval · Computer Science 2026-05-21 Rajendra Narayan Jena , Rajan Padmanabhan , Sankar Arumugam

MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization

Existing memory systems enable Large Language Models (LLMs) to support long-horizon human-LLM interactions by persisting historical interactions beyond limited context windows. However, while recent approaches have succeeded in constructing…

Computation and Language · Computer Science 2026-04-21 Haidong Xin , Xinze Li , Zhenghao Liu , Yukun Yan , Shuo Wang , Cheng Yang , Yu Gu , Ge Yu , Maosong Sun

AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

Large language model (LLM) agents increasingly rely on external memory to support long-horizon interaction, personalized assistance, and multi-step reasoning. However, existing memory systems still face three core challenges: they often…

Computation and Language · Computer Science 2026-04-30 Shannan Yan , Jingchen Ni , Leqi Zheng , Jiajun Zhang , Peixi Wu , Dacheng Yin , Jing Lyu , Chun Yuan , Fengyun Rao

BudgetMem: Learning Selective Memory Policies for Cost-Efficient Long-Context Processing in Language Models

Large Language Models (LLMs) face significant computational and memory constraints when processing long contexts, despite growing demand for applications requiring reasoning over extensive documents, multi-session dialogues, and book length…

Computation and Language · Computer Science 2026-02-10 Chandra Vamsi Krishna Alla , Harish Naidu Gaddam , Manohar Kommi

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

Memory systems often organize user-agent interactions as retrievable external memory and are crucial for long-running agents by overcoming the limited context windows of LLMs. However, existing memory systems invoke LLMs to process every…

Computation and Language · Computer Science 2026-05-18 Zijie Dai , Shiyuan Deng , Sheng Guan , Yizhou Tian , Xin Yao , Xiao Yan , James Cheng

SimpleMem: Efficient Lifelong Memory for LLM Agents

To support long-term interaction in complex environments, LLM agents require memory systems that manage historical experiences. Existing approaches either retain full interaction histories via passive context extension, leading to…

Artificial Intelligence · Computer Science 2026-01-30 Jiaqi Liu , Yaofeng Su , Peng Xia , Siwei Han , Zeyu Zheng , Cihang Xie , Mingyu Ding , Huaxiu Yao

ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation

Code generation tasks aim to automate the conversion of user requirements into executable code, significantly reducing manual development efforts and enhancing software productivity. The emergence of large language models (LLMs) has…

Software Engineering · Computer Science 2026-01-15 Sicong Liu , Yanxian Huang , Mingwei Liu , Jiachi Chen , Ensheng Shi , Yuchi Ma , Hongyu Zhang , Yin Zhang , Yanlin Wang

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Existing methods fail to effectively steer Large Language Models (LLMs) between textual reasoning and code generation, leaving symbolic computing capabilities underutilized. We introduce CodeSteer, an effective method for guiding LLM…

Computation and Language · Computer Science 2025-05-30 Yongchao Chen , Yilun Hao , Yueying Liu , Yang Zhang , Chuchu Fan

RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction

As Large Language Models (LLMs) evolve from static dialogue interfaces to autonomous general agents, effective memory is paramount to ensuring long-term consistency. However, existing benchmarks primarily focus on casual conversation or…

Computation and Language · Computer Science 2026-01-13 Haonan Bian , Zhiyuan Yao , Sen Hu , Zishan Xu , Shaolei Zhang , Yifu Guo , Ziliang Yang , Xueran Han , Huacan Wang , Ronghao Chen

HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues

Long-term memory is critical for dialogue systems that support continuous, sustainable, and personalized interactions. However, existing methods rely on continuous summarization or OpenIE-based graph construction paired with fixed…

Computation and Language · Computer Science 2026-04-09 Yijie Zhong , Yunfan Gao , Haofen Wang

CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents

Large language models (LLMs) have advanced the field of artificial intelligence (AI) and are a powerful enabler for interactive systems. However, they still face challenges in long-term interactions that require adaptation towards the user…

Artificial Intelligence · Computer Science 2025-05-20 Rebecca Westhäußer , Frederik Berenz , Wolfgang Minker , Sebastian Zepf

Lightweight LLM Agent Memory with Small Language Models

Although LLM agents can leverage tools for complex tasks, they still need memory to maintain cross-turn consistency and accumulate reusable information in long-horizon interactions. However, retrieval-based external memory systems incur low…

Artificial Intelligence · Computer Science 2026-04-23 Jiaquan Zhang , Chaoning Zhang , Shuxu Chen , Zhenzhen Huang , Pengcheng Zheng , Zhicheng Wang , Ping Guo , Fan Mo , Sung-Ho Bae , Jie Zou , Jiwei Wei , Yang Yang

Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

As large language models (LLMs) evolve into autonomous agents, persistent memory at the API layer is essential for enabling context-aware behavior across LLMs and multi-session interactions. Existing approaches force vendor lock-in and rely…

Machine Learning · Computer Science 2026-03-23 Luiz C. Borro , Luiz A. B. Macarini , Gordon Tindall , Michael Montero , Adam B. Struck

EvolMem: A Cognitive-Driven Benchmark for Multi-Session Dialogue Memory

Despite recent advances in understanding and leveraging long-range conversational memory, existing benchmarks still lack systematic evaluation of large language models(LLMs) across diverse memory dimensions, particularly in multi-session…

Computation and Language · Computer Science 2026-01-08 Ye Shen , Dun Pei , Yiqiu Guo , Junying Wang , Yijin Guo , Zicheng Zhang , Qi Jia , Jun Zhou , Guangtao Zhai

Developing Adaptive Context Compression Techniques for Large Language Models (LLMs) in Long-Running Interactions

Large Language Models (LLMs) often experience performance degradation during long-running interactions due to increasing context length, memory saturation, and computational overhead. This paper presents an adaptive context compression…

Computer Vision and Pattern Recognition · Computer Science 2026-04-01 Payal Fofadiya , Sunil Tiwari