English
Related papers

Related papers: How Large Language Models Encode Context Knowledge…

200 papers

Large language models (LLMs) have shown remarkable performances across a wide range of tasks. However, the mechanisms by which these models encode tasks of varying complexities remain poorly understood. In this paper, we explore the…

Computation and Language · Computer Science 2025-02-06 Mingyu Jin , Qinkai Yu , Jingyuan Huang , Qingcheng Zeng , Zhenting Wang , Wenyue Hua , Haiyan Zhao , Kai Mei , Yanda Meng , Kaize Ding , Fan Yang , Mengnan Du , Yongfeng Zhang

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the…

Computation and Language · Computer Science 2024-04-12 Linyi Yang , Shuibai Zhang , Zhuohao Yu , Guangsheng Bao , Yidong Wang , Jindong Wang , Ruochen Xu , Wei Ye , Xing Xie , Weizhu Chen , Yue Zhang

Humankind's understanding of the world is fundamentally linked to our perception and cognition, with \emph{human languages} serving as one of the major carriers of \emph{world knowledge}. In this vein, \emph{Large Language Models} (LLMs)…

Artificial Intelligence · Computer Science 2024-06-27 Huajun Chen

Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of factual knowledge. However, understanding their underlying reasoning and internal mechanisms in exploiting this knowledge remains a key research area.…

Computation and Language · Computer Science 2024-08-07 Marco Bronzini , Carlo Nicolini , Bruno Lepri , Jacopo Staiano , Andrea Passerini

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or…

Computation and Language · Computer Science 2024-05-07 Zheng Zhao , Emilio Monti , Jens Lehmann , Haytham Assem

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed , Vlad Niculae

Large Language Models (LLMs) have emerged as highly capable systems and are increasingly being integrated into various uses. However, the rapid pace of their deployment has outpaced a comprehensive understanding of their internal mechanisms…

Computation and Language · Computer Science 2025-10-27 Gabriele Prato , Jerry Huang , Prasanna Parthasarathi , Shagun Sodhani , Sarath Chandar

Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their…

Computation and Language · Computer Science 2023-06-13 Pouya Pezeshkpour

Over the past few years, the abilities of large language models (LLMs) have received extensive attention, which have performed exceptionally well in complicated scenarios such as logical reasoning and symbolic inference. A significant…

Computation and Language · Computer Science 2024-02-20 Junbing Yan , Chengyu Wang , Jun Huang , Wei Zhang

Large Language Models (LLMs) have demonstrated remarkable capabilities in various tasks, yet they often struggle with context-faithfulness generations that properly reflect contextual knowledge. While existing approaches focus on enhancing…

Computation and Language · Computer Science 2025-04-23 Xiaowei Yuan , Zhao Yang , Ziyang Huang , Yequan Wang , Siqi Fan , Yiming Ju , Jun Zhao , Kang Liu

Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as…

Computation and Language · Computer Science 2024-04-17 Haiyan Zhao , Fan Yang , Bo Shen , Himabindu Lakkaraju , Mengnan Du

Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography encountered during generative pre-training. However, due to their stochasticity, it is difficult to predict what LLMs have…

Computation and Language · Computer Science 2026-01-27 Kartik Sharma , Yiqiao Jin , Rakshit Trivedi , Srijan Kumar

Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing scalability of LMs, however, poses significant challenges for understanding a model's inner workings and…

Computation and Language · Computer Science 2026-03-11 Isabelle Augenstein

Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating responses to complex queries through large-scale pre-training. However, the efficacy of these models in memorizing and reasoning among…

Computation and Language · Computer Science 2024-02-23 Qiyuan He , Yizhong Wang , Wenya Wang

Human languages are full of metaphorical expressions. Metaphors help people understand the world by connecting new concepts and domains to more familiar ones. Large pre-trained language models (PLMs) are therefore assumed to encode…

Computation and Language · Computer Science 2022-03-29 Ehsan Aghazadeh , Mohsen Fayyaz , Yadollah Yaghoobzadeh

Large language models (LLMs) have emerged as strong contenders in machine translation.Yet, they still struggle to adequately handle discourse phenomena, such as pronoun resolution and lexical cohesion at the document level. In this study,…

Computation and Language · Computer Science 2025-10-09 Wafaa Mohammed , Vlad Niculae , Chrysoula Zerva

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various…

Computation and Language · Computer Science 2024-02-02 Yilun Zhu , Joel Ruben Antony Moniz , Shruti Bhargava , Jiarui Lu , Dhivya Piraviperumal , Site Li , Yuan Zhang , Hong Yu , Bo-Hsiang Tseng

Though large language models (LLMs) have enabled great success across a wide variety of tasks, they still appear to fall short of one of the loftier goals of artificial intelligence research: creating an artificial system that can adapt its…

Computation and Language · Computer Science 2026-05-04 Michael A. Lepori , Tal Linzen , Ann Yuan , Katja Filippova

Large Language Models (LLMs) exhibit impressive performance on a range of NLP tasks, due to the general-purpose linguistic knowledge acquired during pretraining. Existing model interpretability research (Tenney et al., 2019) suggests that a…

While large language models (LLMs) have demonstrated impressive capabilities across various natural language processing tasks by acquiring rich factual knowledge from their broad training data, their ability to synthesize and logically…

Computation and Language · Computer Science 2024-07-31 Tianshi Zheng , Jiaxin Bai , Yicheng Wang , Tianqing Fang , Yue Guo , Yauwai Yim , Yangqiu Song
‹ Prev 1 2 3 10 Next ›