Related papers: Mitigating Temporal Misalignment by Discarding Out…

Evidence-Based Temporal Fact Verification

Automated fact verification plays an essential role in fostering trust in the digital space. Despite the growing interest, the verification of temporal facts has not received much attention in the community. Temporal fact verification…

Information Retrieval · Computer Science 2024-08-20 Anab Maulana Barik , Wynne Hsu , Mong Li Lee

Time-Aware Language Models as Temporal Knowledge Bases

Many facts come with an expiration date, from the name of the President to the basketball team Lebron James plays for. But language models (LMs) are trained on snapshots of data collected at a specific moment in time, and this can limit…

Computation and Language · Computer Science 2022-04-26 Bhuwan Dhingra , Jeremy R. Cole , Julian Martin Eisenschlos , Daniel Gillick , Jacob Eisenstein , William W. Cohen

TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models

Language Models (LMs) become outdated as the world changes; they often fail to perform tasks requiring recent factual information which was absent or different during training, a phenomenon called temporal misalignment. This is especially a…

Computation and Language · Computer Science 2023-04-13 Joel Jang , Seonghyeon Ye , Changho Lee , Sohee Yang , Joongbo Shin , Janghoon Han , Gyeonghun Kim , Minjoon Seo

Counterfactual-Consistency Prompting for Relative Temporal Understanding in Large Language Models

Despite the advanced capabilities of large language models (LLMs), their temporal reasoning ability remains underdeveloped. Prior works have highlighted this limitation, particularly in maintaining temporal consistency when understanding…

Computation and Language · Computer Science 2025-06-18 Jongho Kim , Seung-won Hwang

TARDIS: Mitigating Temporal Misalignment via Representation Steering

Language models often struggle with temporal misalignment, performance degradation caused by shifts in the temporal distribution of data. Continuously updating models to avoid degradation is expensive. Can models be adapted without updating…

Machine Learning · Computer Science 2025-03-26 Changho Shin , Xinya Yan , Suenggwan Jo , Sungjun Cho , Shourjo Aditya Chaudhuri , Frederic Sala

MuLan: A Study of Fact Mutability in Language Models

Facts are subject to contingencies and can be true or false in different circumstances. One such contingency is time, wherein some facts mutate over a given period, e.g., the president of a country or the winner of a championship.…

Computation and Language · Computer Science 2024-04-05 Constanza Fierro , Nicolas Garneau , Emanuele Bugliarello , Yova Kementchedjhieva , Anders Søgaard

Time Waits for No One! Analysis and Challenges of Temporal Misalignment

When an NLP model is trained on text data from one time period and tested or deployed on data from another, the resulting temporal misalignment can degrade end-task performance. In this work, we establish a suite of eight diverse tasks…

Computation and Language · Computer Science 2022-07-04 Kelvin Luu , Daniel Khashabi , Suchin Gururangan , Karishma Mandyam , Noah A. Smith

Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs

Large language models (LLMs) exhibit remarkable capabilities in question answering and reasoning thanks to their extensive parametric memory. However, their knowledge is inherently limited by the scope of their pre-training data, while…

Computation and Language · Computer Science 2025-06-10 Atahan Özer , Çağatay Yıldız

When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation

The rapid evolution of large language models (LLMs) and the real world has outpaced the static nature of widely used evaluation benchmarks, raising concerns about their reliability for evaluating LLM factuality. While substantial works…

Computation and Language · Computer Science 2026-01-21 Xunyi Jiang , Dingyi Chang , Julian McAuley , Xin Xu

The Effect of Masking Strategies on Knowledge Retention by Language Models

Language models retain a significant amount of world knowledge from their pre-training stage. This allows knowledgeable models to be applied to knowledge-intensive tasks prevalent in information retrieval, such as ranking or question…

Computation and Language · Computer Science 2023-06-13 Jonas Wallat , Tianyi Zhang , Avishek Anand

ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models

Large language models (LLMs) face significant challenges in ex-ante reasoning, where analysis, inference, or predictions must be made without access to information from future events. Even with explicit prompts enforcing temporal cutoffs,…

Machine Learning · Computer Science 2025-05-27 Yachuan Liu , Xiaochun Wei , Lin Shi , Xinnuo Li , Bohan Zhang , Paramveer Dhillon , Qiaozhu Mei

Neural Datalog Through Time: Informed Temporal Modeling via Logical Specification

Learning how to predict future events from patterns of past events is difficult when the set of possible event types is large. Training an unrestricted neural model might overfit to spurious patterns. To exploit domain-specific knowledge of…

Machine Learning · Computer Science 2020-08-18 Hongyuan Mei , Guanghui Qin , Minjie Xu , Jason Eisner

Understanding Data Temporality Impact on Large Language Models Pre-training

Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we study the impact of pre-training…

Computation and Language · Computer Science 2026-05-26 Hippolyte Pilchen , Romain Fabre , Franck Signe Talla , Patrick Perez , Edouard Grave

From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts

Sample efficiency is a crucial property of language models with practical implications for training efficiency. In real-world text, information follows a long-tailed distribution. Yet, we expect models to learn and recall frequent and…

Computation and Language · Computer Science 2025-06-23 Daniel Christoph , Max Ploner , Patrick Haller , Alan Akbik

Set the Clock: Temporal Alignment of Pretrained Language Models

Language models (LMs) are trained on web text originating from many points in time and, in general, without any explicit temporal grounding. This work investigates the temporal chaos of pretrained LMs and explores various methods to align…

Computation and Language · Computer Science 2024-06-11 Bowen Zhao , Zander Brumbaugh , Yizhong Wang , Hannaneh Hajishirzi , Noah A. Smith

On the Impact of Temporal Concept Drift on Model Explanations

Explanation faithfulness of model predictions in natural language processing is typically evaluated on held-out data from the same temporal distribution as the training data (i.e. synchronous settings). While model performance often…

Computation and Language · Computer Science 2022-10-18 Zhixue Zhao , George Chrysostomou , Kalina Bontcheva , Nikolaos Aletras

Towards Logically Consistent Language Models via Probabilistic Reasoning

Large language models (LLMs) are a promising venue for natural language understanding and generation tasks. However, current LLMs are far from reliable: they are prone to generate non-factual information and, more crucially, to contradict…

Machine Learning · Computer Science 2024-04-22 Diego Calanzone , Stefano Teso , Antonio Vergari

Factual Knowledge in Language Models: Robustness and Anomalies under Simple Temporal Context Variations

This paper explores the robustness of language models (LMs) to variations in the temporal context within factual knowledge. It examines whether LMs can correctly associate a temporal context with a past fact valid over a defined period, by…

Computation and Language · Computer Science 2025-06-24 Hichem Ammar Khodja , Frédéric Béchet , Quentin Brabant , Alexis Nasr , Gwénolé Lecorvé

Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy

Large Language Models~(LLMs) struggle with providing current information due to the outdated pre-training data. Existing methods for updating LLMs, such as knowledge editing and continual fine-tuning, have significant drawbacks in…

Computation and Language · Computer Science 2024-02-12 Pengfei Yu , Heng Ji

Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models

Reasoning about time is of fundamental importance. Many facts are time-dependent. For example, athletes change teams from time to time, and different government officials are elected periodically. Previous time-dependent question answering…

Computation and Language · Computer Science 2023-06-28 Qingyu Tan , Hwee Tou Ng , Lidong Bing