Related papers: $\text{Memory}^3$: Language Modeling with Explicit…

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

Large language models suffer from knowledge staleness and lack of interpretability due to implicit knowledge storage across entangled network parameters, preventing targeted updates and reasoning transparency. We propose ExplicitLM, a novel…

Artificial Intelligence · Computer Science 2025-11-04 Chengzhang Yu , Zening Lu , Chenyang Zheng , Chiyue Wang , Yiming Zhang , Zhanpeng Jin

External Reasoning: Towards Multi-Large-Language-Models Interchangeable Assistance with Human Feedback

Memory is identified as a crucial human faculty that allows for the retention of visual and linguistic information within the hippocampus and neurons in the brain, which can subsequently be retrieved to address real-world challenges that…

Computation and Language · Computer Science 2023-08-29 Akide Liu

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their substantial computational and memory requirements present challenges, especially for devices…

Computation and Language · Computer Science 2024-08-01 Keivan Alizadeh , Iman Mirzadeh , Dmitry Belenko , Karen Khatamifard , Minsik Cho , Carlo C Del Mundo , Mohammad Rastegari , Mehrdad Farajtabar

MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory

While current large language models (LLMs) perform well on many knowledge-related tasks, they are limited by relying on their parameters as an implicit storage mechanism. As a result, they struggle with memorizing rare events and with…

Computation and Language · Computer Science 2025-04-18 Ali Modarressi , Abdullatif Köksal , Ayyoob Imani , Mohsen Fayyaz , Hinrich Schütze

Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs

Recent advances in large language models (LLMs) have popularized the chain-of-thought (CoT) paradigm, in which models produce explicit reasoning steps in natural language. Although this approach improves interpretability and facilitates…

Computation and Language · Computer Science 2025-03-03 José I. Orlicki

Pre-training Limited Memory Language Models with Internal and External Knowledge

Neural language models are black-boxes--both linguistic patterns and factual knowledge are distributed across billions of opaque parameters. This entangled encoding makes it difficult to reliably inspect, verify, or update specific facts.…

Computation and Language · Computer Science 2025-10-06 Linxi Zhao , Sofian Zalouk , Christian K. Belardi , Justin Lovelace , Jin Peng Zhou , Ryan Thomas Noonan , Dongyoung Go , Kilian Q. Weinberger , Yoav Artzi , Jennifer J. Sun

MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents

Memory systems have been designed to leverage past experiences in Large Language Model (LLM) agents. However, many deployed memory systems primarily optimize compression and storage, with comparatively less emphasis on explicit, closed-loop…

Artificial Intelligence · Computer Science 2025-12-24 Xingbo Du , Loka Li , Duzhen Zhang , Le Song

Understanding Users' Privacy Perceptions Towards LLM's RAG-based Memory

Large Language Models (LLMs) are increasingly integrating memory functionalities to provide personalized and context-aware interactions. However, user understanding, practices and expectations regarding these memory systems are not yet well…

Human-Computer Interaction · Computer Science 2025-08-12 Shuning Zhang , Rongjun Ma , Ying Ma , Shixuan Li , Yiqun Xu , Xin Yi , Hewu Li

MemOS: A Memory OS for AI System

Large Language Models (LLMs) have become an essential infrastructure for Artificial General Intelligence (AGI), yet their lack of well-defined memory management systems hinders the development of long-context reasoning, continual…

Computation and Language · Computer Science 2025-12-04 Zhiyu Li , Chenyang Xi , Chunyu Li , Ding Chen , Boyu Chen , Shichao Song , Simin Niu , Hanyu Wang , Jiawei Yang , Chen Tang , Qingchen Yu , Jihao Zhao , Yezhaohui Wang , Peng Liu , Zehao Lin , Pengyuan Wang , Jiahao Huo , Tianyi Chen , Kai Chen , Kehang Li , Zhen Tao , Huayi Lai , Hao Wu , Bo Tang , Zhengren Wang , Zhaoxin Fan , Ningyu Zhang , Linfeng Zhang , Junchi Yan , Mingchuan Yang , Tong Xu , Wei Xu , Huajun Chen , Haofen Wang , Hongkang Yang , Wentao Zhang , Zhi-Qin John Xu , Siheng Chen , Feiyu Xiong

Quantifying Memorization Across Neural Language Models

Large language models (LMs) have been shown to memorize parts of their training data, and when prompted appropriately, they will emit the memorized training data verbatim. This is undesirable because memorization violates privacy (exposing…

Machine Learning · Computer Science 2023-03-07 Nicholas Carlini , Daphne Ippolito , Matthew Jagielski , Katherine Lee , Florian Tramer , Chiyuan Zhang

Tuning LLMs by RAG Principles: Towards LLM-native Memory

Memory, additional information beyond the training of large language models (LLMs), is crucial to various real-world applications, such as personal assistant. The two mainstream solutions to incorporate memory into the generation process…

Computation and Language · Computer Science 2025-03-21 Jiale Wei , Shuchi Wu , Ruochen Liu , Xiang Ying , Jingbo Shang , Fangbo Tao

Disentangling Memory and Reasoning Ability in Large Language Models

Large Language Models (LLMs) have demonstrated strong performance in handling complex tasks requiring both extensive knowledge and reasoning abilities. However, the existing LLM inference pipeline operates as an opaque process without…

Computation and Language · Computer Science 2025-05-16 Mingyu Jin , Weidi Luo , Sitao Cheng , Xinyi Wang , Wenyue Hua , Ruixiang Tang , William Yang Wang , Yongfeng Zhang

Large Language Models for Extrapolative Modeling of Manufacturing Processes

Conventional predictive modeling of parametric relationships in manufacturing processes is limited by the subjectivity of human expertise and intuition on the one hand and by the cost and time of experimental data generation on the other…

Computation and Language · Computer Science 2025-06-26 Kiarash Naghavi Khanghah , Anandkumar Patel , Rajiv Malhotra , Hongyi Xu

MLP Memory: A Retriever-Pretrained Memory for Large Language Models

Modern approaches to enhancing Large Language Models' factual accuracy and knowledge utilization face a fundamental trade-off: non-parametric retrieval-augmented generation (RAG) provides flexible access to external knowledge but suffers…

Computation and Language · Computer Science 2026-03-02 Rubin Wei , Jiaqi Cao , Jiarui Wang , Jushi Kai , Qipeng Guo , Bowen Zhou , Zhouhan Lin

UNLEARN Efficient Removal of Knowledge in Large Language Models

Given the prevalence of large language models (LLMs) and the prohibitive cost of training these models from scratch, dynamically forgetting specific knowledge e.g., private or proprietary, without retraining the model has become an…

Computation and Language · Computer Science 2024-08-09 Tyler Lizzo , Larry Heck

The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction

Large language models (LLMs) excel on a variety of reasoning benchmarks, but previous studies suggest they sometimes struggle to generalize to unseen questions, potentially due to over-reliance on memorized training examples. However, the…

Computation and Language · Computer Science 2025-04-01 Yihuai Hong , Dian Zhou , Meng Cao , Lei Yu , Zhijing Jin

Recognition, recall, and retention of few-shot memories in large language models

The training of modern large language models (LLMs) takes place in a regime where most training examples are seen only a few times by the model during the course of training. What does a model remember about such examples seen only a few…

Computation and Language · Computer Science 2023-03-31 A. Emin Orhan

MeMo: Memory as a Model

Large language models (LLMs) achieve strong performance across a wide range of tasks, but remain frozen after pretraining until subsequent updates. Many real-world applications require timely, domain-specific information, motivating the…

Computation and Language · Computer Science 2026-05-21 Ryan Wei Heng Quek , Sanghyuk Lee , Alfred Wei Lun Leong , Arun Verma , Alok Prakash , Nancy F. Chen , Bryan Kian Hsiang Low , Daniela Rus , Armando Solar-Lezama

Implicit Reasoning in Large Language Models: A Comprehensive Survey

Large Language Models (LLMs) have demonstrated strong generalization across a wide range of tasks. Reasoning with LLMs is central to solving multi-step problems and complex decision-making. To support efficient reasoning, recent studies…

Computation and Language · Computer Science 2025-09-03 Jindong Li , Yali Fu , Li Fan , Jiahong Liu , Yao Shu , Chengwei Qin , Menglin Yang , Irwin King , Rex Ying

Reversing Large Language Models for Efficient Training and Fine-Tuning

Large Language Models (LLMs) are known for their expensive and time-consuming training. Thus, oftentimes, LLMs are fine-tuned to address a specific task, given the pretrained weights of a pre-trained LLM considered a foundation model. In…

Computation and Language · Computer Science 2025-12-05 Eshed Gal , Moshe Eliasof , Javier Turek , Uri Ascher , Eran Treister , Eldad Haber