Related papers: $\text{Memory}^3$: Language Modeling with Explicit…

200 papers

Large language models suffer from knowledge staleness and lack of interpretability due to implicit knowledge storage across entangled network parameters, preventing targeted updates and reasoning transparency. We propose ExplicitLM, a novel…

Artificial Intelligence · Computer Science 2025-11-04 Chengzhang Yu, Zening Lu, Chenyang Zheng, Chiyue Wang, Yiming Zhang, Zhanpeng Jin

Memory is identified as a crucial human faculty that allows for the retention of visual and linguistic information within the hippocampus and neurons in the brain, which can subsequently be retrieved to address real-world challenges that…

Computation and Language · Computer Science 2023-08-29 Akide Liu

Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their substantial computational and memory requirements present challenges, especially for devices…

While current large language models (LLMs) perform well on many knowledge-related tasks, they are limited by relying on their parameters as an implicit storage mechanism. As a result, they struggle with memorizing rare events and with…

Computation and Language · Computer Science 2025-04-18 Ali Modarressi, Abdullatif Köksal, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze

Recent advances in large language models (LLMs) have popularized the chain-of-thought (CoT) paradigm, in which models produce explicit reasoning steps in natural language. Although this approach improves interpretability and facilitates…

Computation and Language · Computer Science 2025-03-03 José I. Orlicki

Neural language models are black-boxes--both linguistic patterns and factual knowledge are distributed across billions of opaque parameters. This entangled encoding makes it difficult to reliably inspect, verify, or update specific facts.…

Memory systems have been designed to leverage past experiences in Large Language Model (LLM) agents. However, many deployed memory systems primarily optimize compression and storage, with comparatively less emphasis on explicit, closed-loop…

Artificial Intelligence · Computer Science 2025-12-24 Xingbo Du, Loka Li, Duzhen Zhang, Le Song

Large Language Models (LLMs) are increasingly integrating memory functionalities to provide personalized and context-aware interactions. However, user understanding, practices and expectations regarding these memory systems are not yet well…

Human-Computer Interaction · Computer Science 2025-08-12 Shuning Zhang, Rongjun Ma, Ying Ma, Shixuan Li, Yiqun Xu, Xin Yi, Hewu Li

Large Language Models (LLMs) have become an essential infrastructure for Artificial General Intelligence (AGI), yet their lack of well-defined memory management systems hinders the development of long-context reasoning, continual…

Large language models (LMs) have been shown to memorize parts of their training data, and when prompted appropriately, they will emit the memorized training data verbatim. This is undesirable because memorization violates privacy (exposing…

Machine Learning · Computer Science 2023-03-07 Nicholas Carlini, Daphne Ippolito, Matthew Jagielski, Katherine Lee, Florian Tramer, Chiyuan Zhang

Memory, additional information beyond the training of large language models (LLMs), is crucial to various real-world applications, such as personal assistant. The two mainstream solutions to incorporate memory into the generation process…

Computation and Language · Computer Science 2025-03-21 Jiale Wei, Shuchi Wu, Ruochen Liu, Xiang Ying, Jingbo Shang, Fangbo Tao

Large Language Models (LLMs) have demonstrated strong performance in handling complex tasks requiring both extensive knowledge and reasoning abilities. However, the existing LLM inference pipeline operates as an opaque process without…

Computation and Language · Computer Science 2025-05-16 Mingyu Jin, Weidi Luo, Sitao Cheng, Xinyi Wang, Wenyue Hua, Ruixiang Tang, William Yang Wang, Yongfeng Zhang

Conventional predictive modeling of parametric relationships in manufacturing processes is limited by the subjectivity of human expertise and intuition on the one hand and by the cost and time of experimental data generation on the other…

Computation and Language · Computer Science 2025-06-26 Kiarash Naghavi Khanghah, Anandkumar Patel, Rajiv Malhotra, Hongyi Xu

Modern approaches to enhancing Large Language Models' factual accuracy and knowledge utilization face a fundamental trade-off: non-parametric retrieval-augmented generation (RAG) provides flexible access to external knowledge but suffers…

Computation and Language · Computer Science 2026-03-02 Rubin Wei, Jiaqi Cao, Jiarui Wang, Jushi Kai, Qipeng Guo, Bowen Zhou, Zhouhan Lin

Given the prevalence of large language models (LLMs) and the prohibitive cost of training these models from scratch, dynamically forgetting specific knowledge e.g., private or proprietary, without retraining the model has become an…

Computation and Language · Computer Science 2024-08-09 Tyler Lizzo, Larry Heck

Large language models (LLMs) excel on a variety of reasoning benchmarks, but previous studies suggest they sometimes struggle to generalize to unseen questions, potentially due to over-reliance on memorized training examples. However, the…

Computation and Language · Computer Science 2025-04-01 Yihuai Hong, Dian Zhou, Meng Cao, Lei Yu, Zhijing Jin

The training of modern large language models (LLMs) takes place in a regime where most training examples are seen only a few times by the model during the course of training. What does a model remember about such examples seen only a few…

Computation and Language · Computer Science 2023-03-31 A. Emin Orhan

Large language models (LLMs) achieve strong performance across a wide range of tasks, but remain frozen after pretraining until subsequent updates. Many real-world applications require timely, domain-specific information, motivating the…

Large Language Models (LLMs) have demonstrated strong generalization across a wide range of tasks. Reasoning with LLMs is central to solving multi-step problems and complex decision-making. To support efficient reasoning, recent studies…

Computation and Language · Computer Science 2025-09-03 Jindong Li, Yali Fu, Li Fan, Jiahong Liu, Yao Shu, Chengwei Qin, Menglin Yang, Irwin King, Rex Ying

Large Language Models (LLMs) are known for their expensive and time-consuming training. Thus, oftentimes, LLMs are fine-tuned to address a specific task, given the pretrained weights of a pre-trained LLM considered a foundation model. In…

Computation and Language · Computer Science 2025-12-05 Eshed Gal, Moshe Eliasof, Javier Turek, Uri Ascher, Eran Treister, Eldad Haber