English
Related papers

Related papers: Enhancing LLM Knowledge Learning through Generaliz…

200 papers

Recent advancements in large language models (LLMs) have enhanced natural-language reasoning. However, their limited parametric memory and susceptibility to hallucination present persistent challenges for tasks requiring accurate,…

Computation and Language · Computer Science 2025-06-02 Yu-Hsuan Lin , Qian-Hui Chen , Yi-Jie Cheng , Jia-Ren Zhang , Yi-Hung Liu , Liang-Yu Hsia , Yun-Nung Chen

Despite the recent observation that large language models (LLMs) can store substantial factual knowledge, there is a limited understanding of the mechanisms of how they acquire factual knowledge through pretraining. This work addresses this…

Computation and Language · Computer Science 2024-11-13 Hoyeon Chang , Jinho Park , Seonghyeon Ye , Sohee Yang , Youngkyung Seo , Du-Seong Chang , Minjoon Seo

As Large Language Models (LLMs) continue to evolve, more are being designed to handle long-context inputs. Despite this advancement, most of them still face challenges in accurately handling long-context tasks, often showing the "lost in…

Computation and Language · Computer Science 2024-12-13 Yijiong Yu , Yongfeng Huang , Zhixiao Qi , Zhe Zhou

Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their…

Computation and Language · Computer Science 2023-06-13 Pouya Pezeshkpour

Adapting large language models (LLMs) to new and diverse knowledge is essential for their lasting effectiveness in real-world applications. This survey provides an overview of state-of-the-art methods for expanding the knowledge of LLMs,…

Computation and Language · Computer Science 2025-02-19 Mingyang Wang , Alisa Stoll , Lukas Lange , Heike Adel , Hinrich Schütze , Jannik Strötgen

Despite their impressive performance on diverse tasks, large language models (LMs) still struggle with tasks requiring rich world knowledge, implying the limitations of relying solely on their parameters to encode a wealth of world…

Computation and Language · Computer Science 2023-07-04 Alex Mallen , Akari Asai , Victor Zhong , Rajarshi Das , Daniel Khashabi , Hannaneh Hajishirzi

Large pre-trained language models have demonstrated their proficiency in storing factual knowledge within their parameters and achieving remarkable results when fine-tuned for downstream natural language processing tasks. Nonetheless, their…

Computation and Language · Computer Science 2023-09-29 Konstantinos Andriopoulos , Johan Pouwelse

Knowledge augmentation has significantly enhanced the performance of Large Language Models (LLMs) in knowledge-intensive tasks. However, existing methods typically operate on the simplistic premise that model performance equates with…

Computation and Language · Computer Science 2026-02-16 Hao Chen , Ye He , Yuchun Fan , Yukun Yan , Zhenghao Liu , Qingfu Zhu , Maosong Sun , Wanxiang Che

Previous works show that Pre-trained Language Models (PLMs) can capture factual knowledge. However, some analyses reveal that PLMs fail to perform it robustly, e.g., being sensitive to the changes of prompts when extracting factual…

Computation and Language · Computer Science 2022-10-21 Shaobo Li , Xiaoguang Li , Lifeng Shang , Chengjie Sun , Bingquan Liu , Zhenzhou Ji , Xin Jiang , Qun Liu

Large language models (LLMs) often require vast amounts of text to effectively acquire new knowledge. While continuing pre-training on large corpora or employing retrieval-augmented generation (RAG) has proven successful, updating an LLM…

Computation and Language · Computer Science 2025-08-11 Hugo Abonizio , Thales Almeida , Roberto Lotufo , Rodrigo Nogueira

Large language models (LLMs) are a promising venue for natural language understanding and generation tasks. However, current LLMs are far from reliable: they are prone to generate non-factual information and, more crucially, to contradict…

Machine Learning · Computer Science 2024-04-22 Diego Calanzone , Stefano Teso , Antonio Vergari

Masked language modeling (MLM) plays a key role in pretraining large language models. But the MLM objective is often dominated by high-frequency words that are sub-optimal for learning factual knowledge. In this work, we propose an approach…

Computation and Language · Computer Science 2023-04-05 Nafis Sadeq , Byungkyu Kang , Prarit Lamba , Julian McAuley

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Pre-trained Language Models (PLMs) are trained on vast unlabeled data, rich in world knowledge. This fact has sparked the interest of the community in quantifying the amount of factual knowledge present in PLMs, as this explains their…

Computation and Language · Computer Science 2023-12-06 Paul Youssef , Osman Alperen Koraş , Meijie Li , Jörg Schlötterer , Christin Seifert

Large language models (LLMs) can store a vast amount of world knowledge, often extractable via question-answering (e.g., "What is Abraham Lincoln's birthday?"). However, do they answer such questions based on exposure to similar questions…

Computation and Language · Computer Science 2024-07-17 Zeyuan Allen-Zhu , Yuanzhi Li

Retrieval-Augmented Generation (RAG) has emerged as a prominent method for incorporating domain knowledge into Large Language Models (LLMs). While RAG enhances response relevance by incorporating retrieved domain knowledge in the context,…

Computation and Language · Computer Science 2025-03-28 Kushagra Bhushan , Yatin Nandwani , Dinesh Khandelwal , Sonam Gupta , Gaurav Pandey , Dinesh Raghu , Sachindra Joshi

While the reasoning capabilities of Large Language Models (LLMs) excel in analytical tasks such as mathematics and code generation, their utility for abstractive summarization remains widely assumed but largely unverified. To bridge this…

Computation and Language · Computer Science 2025-12-10 Haohan Yuan , Haopeng Zhang

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the…

Computation and Language · Computer Science 2024-04-12 Linyi Yang , Shuibai Zhang , Zhuohao Yu , Guangsheng Bao , Yidong Wang , Jindong Wang , Ruochen Xu , Wei Ye , Xing Xie , Weizhu Chen , Yue Zhang

Large language models (LLMs) have shown impressive prowess in solving a wide range of tasks with world knowledge. However, it remains unclear how well LLMs are able to perceive their factual knowledge boundaries, particularly under…

Computation and Language · Computer Science 2024-11-20 Ruiyang Ren , Yuhao Wang , Yingqi Qu , Wayne Xin Zhao , Jing Liu , Hao Tian , Hua Wu , Ji-Rong Wen , Haifeng Wang

Large language models have demonstrated impressive retrieval-augmented capabilities. However, a crucial area remains underexplored: their ability to appropriately adapt responses to the certainty of the retrieved information. It is a…

Computation and Language · Computer Science 2026-05-11 Behzad Shayegh , Mohamed Osama Ahmed , Fred Tung , Leo Feng
‹ Prev 1 2 3 10 Next ›