Related papers: How Large Language Models Encode Context Knowledge…

Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers?

Large language models (LLMs) have shown remarkable performances across a wide range of tasks. However, the mechanisms by which these models encode tasks of varying complexities remain poorly understood. In this paper, we explore the…

Computation and Language · Computer Science 2025-02-06 Mingyu Jin , Qinkai Yu , Jingyuan Huang , Qingcheng Zeng , Zhenting Wang , Wenyue Hua , Haiyan Zhao , Kai Mei , Yanda Meng , Kaize Ding , Fan Yang , Mengnan Du , Yongfeng Zhang

Supervised Knowledge Makes Large Language Models Better In-context Learners

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the…

Computation and Language · Computer Science 2024-04-12 Linyi Yang , Shuibai Zhang , Zhuohao Yu , Guangsheng Bao , Yidong Wang , Jindong Wang , Ruochen Xu , Wei Ye , Xing Xie , Weizhu Chen , Yue Zhang

Large Knowledge Model: Perspectives and Challenges

Humankind's understanding of the world is fundamentally linked to our perception and cognition, with \emph{human languages} serving as one of the major carriers of \emph{world knowledge}. In this vein, \emph{Large Language Models} (LLMs)…

Artificial Intelligence · Computer Science 2024-06-27 Huajun Chen

Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph

Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of factual knowledge. However, understanding their underlying reasoning and internal mechanisms in exploiting this knowledge remains a key research area.…

Computation and Language · Computer Science 2024-08-07 Marco Bronzini , Carlo Nicolini , Bruno Lepri , Jacopo Staiano , Andrea Passerini

Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or…

Computation and Language · Computer Science 2024-05-07 Zheng Zhao , Emilio Monti , Jens Lehmann , Haytham Assem

Context-Aware or Context-Insensitive? Assessing LLMs' Performance in Document-Level Translation

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed , Vlad Niculae

Do Large Language Models Know How Much They Know?

Large Language Models (LLMs) have emerged as highly capable systems and are increasingly being integrated into various uses. However, the rapid pace of their deployment has outpaced a comprehensive understanding of their internal mechanisms…

Computation and Language · Computer Science 2025-10-27 Gabriele Prato , Jerry Huang , Prasanna Parthasarathi , Shagun Sodhani , Sarath Chandar

Measuring and Modifying Factual Knowledge in Large Language Models

Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their…

Computation and Language · Computer Science 2023-06-13 Pouya Pezeshkpour

Do Large Language Models Understand Logic or Just Mimick Context?

Over the past few years, the abilities of large language models (LLMs) have received extensive attention, which have performed exceptionally well in complicated scenarios such as logical reasoning and symbolic inference. A significant…

Computation and Language · Computer Science 2024-02-20 Junbing Yan , Chengyu Wang , Jun Huang , Wei Zhang

Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement

Large Language Models (LLMs) have demonstrated remarkable capabilities in various tasks, yet they often struggle with context-faithfulness generations that properly reflect contextual knowledge. While existing approaches focus on enhancing…

Computation and Language · Computer Science 2025-04-23 Xiaowei Yuan , Zhao Yang , Ziyang Huang , Yequan Wang , Siqi Fan , Yiming Ju , Jun Zhao , Kang Liu

Towards Uncovering How Large Language Model Works: An Explainability Perspective

Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as…

Computation and Language · Computer Science 2024-04-17 Haiyan Zhao , Fan Yang , Bo Shen , Himabindu Lakkaraju , Mengnan Du

Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings

Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography encountered during generative pre-training. However, due to their stochasticity, it is difficult to predict what LLMs have…

Computation and Language · Computer Science 2026-01-27 Kartik Sharma , Yiqiao Jin , Rakshit Trivedi , Srijan Kumar

Understanding the Interplay between LLMs' Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025

Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing scalability of LMs, however, poses significant challenges for understanding a model's inner workings and…

Computation and Language · Computer Science 2026-03-11 Isabelle Augenstein

Can Language Models Act as Knowledge Bases at Scale?

Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating responses to complex queries through large-scale pre-training. However, the efficacy of these models in memorizing and reasoning among…

Computation and Language · Computer Science 2024-02-23 Qiyuan He , Yizhong Wang , Wenya Wang

Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages

Human languages are full of metaphorical expressions. Metaphors help people understand the world by connecting new concepts and domains to more familiar ones. Large pre-trained language models (PLMs) are therefore assumed to encode…

Computation and Language · Computer Science 2022-03-29 Ehsan Aghazadeh , Mohsen Fayyaz , Yadollah Yaghoobzadeh

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding

Large language models (LLMs) have emerged as strong contenders in machine translation.Yet, they still struggle to adequately handle discourse phenomena, such as pronoun resolution and lexical cohesion at the document level. In this study,…

Computation and Language · Computer Science 2025-10-09 Wafaa Mohammed , Vlad Niculae , Chrysoula Zerva

Can Large Language Models Understand Context?

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various…

Computation and Language · Computer Science 2024-02-02 Yilun Zhu , Joel Ruben Antony Moniz , Shruti Bhargava , Jiarui Lu , Dhivya Piraviperumal , Site Li , Yuan Zhang , Hong Yu , Bo-Hsiang Tseng

Language Models Struggle to Use Representations Learned In-Context

Though large language models (LLMs) have enabled great success across a wide variety of tasks, they still appear to fall short of one of the loftier goals of artificial intelligence research: creating an artificial system that can adapt its…

Computation and Language · Computer Science 2026-05-04 Michael A. Lepori , Tal Linzen , Ann Yuan , Katja Filippova

Probing LLMs for Joint Encoding of Linguistic Categories

Large Language Models (LLMs) exhibit impressive performance on a range of NLP tasks, due to the general-purpose linguistic knowledge acquired during pretraining. Existing model interpretability research (Tenney et al., 2019) suggests that a…

Computation and Language · Computer Science 2023-10-31 Giulio Starace , Konstantinos Papakostas , Rochelle Choenni , Apostolos Panagiotopoulos , Matteo Rosati , Alina Leidinger , Ekaterina Shutova

CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge

While large language models (LLMs) have demonstrated impressive capabilities across various natural language processing tasks by acquiring rich factual knowledge from their broad training data, their ability to synthesize and logically…

Computation and Language · Computer Science 2024-07-31 Tianshi Zheng , Jiaxin Bai , Yicheng Wang , Tianqing Fang , Yue Guo , Yauwai Yim , Yangqiu Song