Related papers: L2MAC: Large Language Model Automatic Computer for…

Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents

Large Language Models (LLMs) represent a landmark achievement in Artificial Intelligence (AI), demonstrating unprecedented proficiency in procedural tasks such as text generation, code completion, and conversational coherence. These…

Artificial Intelligence · Computer Science 2025-05-07 Schaun Wheeler , Olivier Jeunen

Augmenting Language Models with Long-Term Memory

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models…

Computation and Language · Computer Science 2023-06-13 Weizhi Wang , Li Dong , Hao Cheng , Xiaodong Liu , Xifeng Yan , Jianfeng Gao , Furu Wei

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory

Multi-agent systems built on Large Language Models (LLMs) show exceptional promise for complex collaborative problem-solving, yet they face fundamental challenges stemming from context window limitations that impair memory consistency, role…

Artificial Intelligence · Computer Science 2026-01-13 Sizhe Yuen , Francisco Gomez Medina , Ting Su , Yali Du , Adam J. Sobey

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Large Language Models (LLMs) have demonstrated impressive capabilities across a wide range of NLP tasks, but they remain fundamentally stateless, constrained by limited context windows that hinder long-horizon reasoning. Recent efforts to…

Computation and Language · Computer Science 2026-01-15 Sikuan Yan , Xiufeng Yang , Zuchao Huang , Ercong Nie , Zifeng Ding , Zonggen Li , Xiaowen Ma , Jinhe Bi , Kristian Kersting , Jeff Z. Pan , Hinrich Schütze , Volker Tresp , Yunpu Ma

Memory for Autonomous LLM Agents:Mechanisms, Evaluation, and Emerging Frontiers

Large language model (LLM) agents increasingly operate in settings where a single context window is far too small to capture what has happened, what was learned, and what should not be repeated. Memory -- the ability to persist, organize,…

Artificial Intelligence · Computer Science 2026-03-10 Pengfei Du

LM2: Large Memory Models

This paper introduces the Large Memory Model (LM2), a decoder-only Transformer architecture enhanced with an auxiliary memory module that aims to address the limitations of standard Transformers in multi-step reasoning, relational…

Computation and Language · Computer Science 2025-02-11 Jikun Kang , Wenqi Wu , Filippos Christianos , Alex J. Chan , Fraser Greenlee , George Thomas , Marvin Purtorab , Andy Toulis

Mem2ActBench: A Benchmark for Evaluating Long-Term Memory Utilization in Task-Oriented Autonomous Agents

Large Language Model (LLM)-based agents are increasingly deployed for complex, tool-based tasks where long-term memory is critical to driving actions. Existing benchmarks, however, primarily test a angent's ability to passively retrieve…

Computation and Language · Computer Science 2026-01-29 Yiting Shen , Kun Li , Wei Zhou , Songlin Hu

SCM: Enhancing Large Language Model with Self-Controlled Memory Framework

Large Language Models (LLMs) are constrained by their inability to process lengthy inputs, resulting in the loss of critical historical information. To address this limitation, in this paper, we propose the Self-Controlled Memory (SCM)…

Computation and Language · Computer Science 2025-03-19 Bing Wang , Xinnian Liang , Jian Yang , Hui Huang , Shuangzhi Wu , Peihao Wu , Lu Lu , Zejun Ma , Zhoujun Li

Adaptive Memory Admission Control for LLM Agents

LLM-based agents increasingly rely on long-term memory to support multi-session reasoning and interaction, yet current systems provide little control over what information is retained. In practice, agents either accumulate large volumes of…

Artificial Intelligence · Computer Science 2026-03-06 Guilin Zhang , Wei Jiang , Xiejiashan Wang , Aisha Behr , Kai Zhao , Jeffrey Friedman , Xu Chu , Amine Anoun

Empowering Working Memory for Large Language Model Agents

Large language models (LLMs) have achieved impressive linguistic capabilities. However, a key limitation persists in their lack of human-like memory faculties. LLMs exhibit constrained memory retention across sequential interactions,…

Computation and Language · Computer Science 2024-05-29 Jing Guo , Nan Li , Jianchuan Qi , Hang Yang , Ruiqiao Li , Yuzhen Feng , Si Zhang , Ming Xu

Algorithmic Language Models with Neurally Compiled Libraries

Important tasks such as reasoning and planning are fundamentally algorithmic, meaning that solving them robustly requires acquiring true reasoning or planning algorithms, rather than shortcuts. Large Language Models lack true algorithmic…

Artificial Intelligence · Computer Science 2025-05-27 Lucas Saldyt , Subbarao Kambhampati

Needle in the Haystack for Memory Based Large Language Models

Current large language models (LLMs) often perform poorly on simple fact retrieval tasks. Here we investigate if coupling a dynamically adaptable external memory to a LLM can alleviate this problem. For this purpose, we test Larimar, a…

Computation and Language · Computer Science 2024-07-15 Elliot Nelson , Georgios Kollias , Payel Das , Subhajit Chaudhury , Soham Dan

Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Large Language Models (LLMs) suffer from significant performance degradation when processing long contexts due to proactive interference, where irrelevant information in earlier parts of the context disrupts reasoning and memory recall.…

Computation and Language · Computer Science 2025-09-30 Mo Li , L. H. Xu , Qitai Tan , Long Ma , Ting Cao , Yunxin Liu

A Survey on the Memory Mechanism of Large Language Model based Agents

Large language model (LLM) based agents have recently attracted much attention from the research and industry communities. Compared with original LLMs, LLM-based agents are featured in their self-evolving capability, which is the basis for…

Artificial Intelligence · Computer Science 2024-04-23 Zeyu Zhang , Xiaohe Bo , Chen Ma , Rui Li , Xu Chen , Quanyu Dai , Jieming Zhu , Zhenhua Dong , Ji-Rong Wen

Do Large Language Models Understand Performance Optimization?

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui , Tejas Ramesh , Oscar Hernandez , Keren Zhou

LSTM-MAS: A Long Short-Term Memory Inspired Multi-Agent System for Long-Context Understanding

Effectively processing long contexts remains a fundamental yet unsolved challenge for large language models (LLMs). Existing single-LLM-based methods primarily reduce the context window or optimize the attention mechanism, but they often…

Computation and Language · Computer Science 2026-04-22 Yichen Jiang , Jiakang Yuan , Chongjun Tu , Peng Ye , Tao Chen

MemGPT: Towards LLMs as Operating Systems

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows,…

Artificial Intelligence · Computer Science 2024-02-13 Charles Packer , Sarah Wooders , Kevin Lin , Vivian Fang , Shishir G. Patil , Ion Stoica , Joseph E. Gonzalez

Coding Agents are Effective Long-Context Processors

Large Language Models (LLMs) have demonstrated remarkable progress in scaling to access massive contexts. However, the access is via the latent and uninterpretable attention mechanisms, and LLMs fail to effective process long context,…

Computation and Language · Computer Science 2026-03-24 Weili Cao , Xunjian Yin , Bhuwan Dhingra , Shuyan Zhou

Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks

Large Language Models (LLMs) have shown remarkable capabilities in code generation tasks, yet they face significant limitations in handling complex, long-context programming challenges and demonstrating complex compositional reasoning…

Artificial Intelligence · Computer Science 2025-01-14 Amr Almorsi , Mohanned Ahmed , Walid Gomaa

SysLLMatic: Large Language Models are Software System Optimizers

Automatic software system optimization can improve software speed, reduce operating costs, and save energy. Traditional approaches to optimization rely on manual tuning and compiler heuristics, limiting their ability to generalize across…

Software Engineering · Computer Science 2026-05-07 Huiyun Peng , Arjun Gupte , Ryan Hasler , Nicholas John Eliopoulos , Chien-Chou Ho , Rishi Mantri , Leo Deng , Konstantin Läufer , George K. Thiruvathukal , James C. Davis