Related papers: Dated Data: Tracing Knowledge Cutoffs in Large Lan…

Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs

Large Language Models (LLMs) are widely used for temporal prediction, but their reliance on pretraining data raises contamination concerns, as accurate predictions on pre-cutoff test data may reflect memorization rather than reasoning,…

Computation and Language · Computer Science 2025-10-16 Xin Gao , Ruiyi Zhang , Daniel Du , Saurabh Mahindre , Sai Ashish Somayajula , Pengtao Xie

Beyond the Reported Cutoff: Where Large Language Models Fall Short on Financial Knowledge

Large Language Models (LLMs) are frequently utilized as sources of knowledge for question-answering. While it is known that LLMs may lack access to real-time data or newer data produced after the model's cutoff date, it is less clear how…

Computation and Language · Computer Science 2025-07-30 Agam Shah , Liqin Ye , Sebastian Jaskowski , Wei Xu , Sudheer Chava

LLMLagBench: Identifying Temporal Training Boundaries in Large Language Models

Large Language Models (LLMs) are pretrained on textual data up to a specific temporal cutoff. This creates a strict knowledge boundary beyond which models cannot provide accurate information without querying external sources. More subtly,…

Computation and Language · Computer Science 2025-11-18 Piotr Pęzik , Konrad Kaczyński , Maria Szymańska , Filip Żarnecki , Zuzanna Deckert , Jakub Kwiatkowski , Wojciech Janowski

ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models

Large language models (LLMs) face significant challenges in ex-ante reasoning, where analysis, inference, or predictions must be made without access to information from future events. Even with explicit prompts enforcing temporal cutoffs,…

Machine Learning · Computer Science 2025-05-27 Yachuan Liu , Xiaochun Wei , Lin Shi , Xinnuo Li , Bohan Zhang , Paramveer Dhillon , Qiaozhu Mei

Temporal Blind Spots in Large Language Models

Large language models (LLMs) have recently gained significant attention due to their unparalleled ability to perform various natural language processing tasks. These models, benefiting from their advanced natural language understanding…

Computation and Language · Computer Science 2024-01-23 Jonas Wallat , Adam Jatowt , Avishek Anand

Understanding Data Temporality Impact on Large Language Models Pre-training

Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we study the impact of pre-training…

Computation and Language · Computer Science 2026-05-26 Hippolyte Pilchen , Romain Fabre , Franck Signe Talla , Patrick Perez , Edouard Grave

Long-Tail Knowledge in Large Language Models: Taxonomy, Mechanisms, Interventions and Implications

Large language models (LLMs) are trained on web-scale corpora that exhibit steep power-law distributions, in which the distribution of knowledge is highly long-tailed, with most appearing infrequently. While scaling has improved…

Computation and Language · Computer Science 2026-02-19 Sanket Badhe , Deep Shah , Nehal Kathrotia

Teaching Large Language Models When Not to Know: Learning Temporal Critique for Ex-Ante Reasoning

Large language models (LLMs) often fail to reason under temporal cutoffs: when prompted to answer from the standpoint of an earlier time, they exploit knowledge that became available only later. We study this failure through the lens of…

Artificial Intelligence · Computer Science 2026-05-15 Chenlu Ding , Jiancan Wu , Yanchen Luo , Zheyuan Liu , Yancheng Yuan , Xiang Wang

Efficient Alignment of Large Language Models via Data Sampling

LLM alignment ensures that large language models behave safely and effectively by aligning their outputs with human values, goals, and intentions. Aligning LLMs employ huge amounts of data, computation, and time. Moreover, curating data…

Machine Learning · Computer Science 2025-02-19 Amrit Khera , Rajat Ghosh , Debojyoti Dutta

Large Language Models Lack Temporal Awareness of Medical Knowledge

The existing methods for evaluating the medical knowledge of Large Language Models (LLMs) are largely based on atemporal examination-style benchmarks, while in reality, medical knowledge is inherently dynamic and continuously evolves as new…

Machine Learning · Computer Science 2026-05-14 Zihan Guan , Qiao Jin , Guangzhi Xiong , Fangyuan Chen , Mengxuan Hu , Qingyu Chen , Yifan Peng , Zhiyong Lu , Anil Vullikanti

Is More Context Always Better? Examining LLM Reasoning Capability for Time Interval Prediction

Large Language Models (LLMs) have demonstrated impressive capabilities in reasoning and prediction across different domains. Yet, their ability to infer temporal regularities from structured behavioral data remains underexplored. This paper…

Artificial Intelligence · Computer Science 2026-01-27 Yanan Cao , Farnaz Fallahi , Murali Mohana Krishna Dandu , Lalitesh Morishetti , Kai Zhao , Luyi Ma , Sinduja Subramaniam , Jianpeng Xu , Evren Korpeoglu , Kaushiki Nag , Sushant Kumar , Kannan Achan

TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting

Backtesting large language models on historical events requires reasoning exclusively from information available before a specified cutoff date. Yet models routinely leak post-cutoff knowledge from pre-training into their reasoning,…

Machine Learning · Computer Science 2026-05-20 Zeyu Zhang , Bradly C. Stadie

When Large Language Models Meet Citation: A Survey

Citations in scholarly work serve the essential purpose of acknowledging and crediting the original sources of knowledge that have been incorporated or referenced. Depending on their surrounding textual context, these citations are used for…

Digital Libraries · Computer Science 2023-09-19 Yang Zhang , Yufei Wang , Kai Wang , Quan Z. Sheng , Lina Yao , Adnan Mahmood , Wei Emma Zhang , Rongying Zhao

TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining

Large Language Models (LLMs) trained on historical web data inevitably become outdated. We investigate evaluation strategies and update methods for LLMs as new data becomes available. We introduce a web-scale dataset for time-continual…

Machine Learning · Computer Science 2025-06-09 Jeffrey Li , Mohammadreza Armandpour , Iman Mirzadeh , Sachin Mehta , Vaishaal Shankar , Raviteja Vemulapalli , Samy Bengio , Oncel Tuzel , Mehrdad Farajtabar , Hadi Pouransari , Fartash Faghri

Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models

Declarative knowledge and procedural knowledge are two key parts in meta-cognitive theory, and these two hold significant importance in pre-training and inference of LLMs. However, a comprehensive analysis comparing these two types of…

Computation and Language · Computer Science 2024-03-18 Zhuoqun Li , Hongyu Lin , Yaojie Lu , Hao Xiang , Xianpei Han , Le Sun

From Parameters to Prompts: Understanding and Mitigating the Factuality Gap between Fine-Tuned LLMs

Factual knowledge extraction aims to explicitly extract knowledge parameterized in pre-trained language models for application in downstream tasks. While prior work has been investigating the impact of supervised fine-tuning data on the…

Computation and Language · Computer Science 2025-05-30 Xuan Gong , Hanbo Huang , Shiyu Liang

LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices

Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfacotry performances. In comparison, large language models (LLMs) recently exhibit impressive…

Machine Learning · Computer Science 2024-06-14 Ruibing Jin , Qing Xu , Min Wu , Yuecong Xu , Dan Li , Xiaoli Li , Zhenghua Chen

Are Large Language Models Temporally Grounded?

Are Large language models (LLMs) temporally grounded? Since LLMs cannot perceive and interact with the environment, it is impossible to answer this question directly. Instead, we provide LLMs with textual narratives and probe them with…

Computation and Language · Computer Science 2023-11-17 Yifu Qiu , Zheng Zhao , Yftah Ziser , Anna Korhonen , Edoardo M. Ponti , Shay B. Cohen

Challenges and Contributing Factors in the Utilization of Large Language Models (LLMs)

With the development of large language models (LLMs) like the GPT series, their widespread use across various application scenarios presents a myriad of challenges. This review initially explores the issue of domain specificity, where LLMs…

Computation and Language · Computer Science 2023-10-23 Xiaoliang Chen , Liangbin Li , Le Chang , Yunhe Huang , Yuxuan Zhao , Yuxiao Zhang , Dinuo Li

Test of Time: Rethinking Temporal Signal of Benchmark Contamination

Post-cutoff performance decay of LLMs has been widely interpreted as a temporal signal for benchmark contamination, where public information released before the training cutoff may have been included into training corpora and inflated model…

Artificial Intelligence · Computer Science 2026-05-14 Terry Jingchen Zhang , Gopal Dev , Ning Wang , Max Obreiter , Punya Syon Pandey , Keenan Samway , Wenyuan Jiang , Yinya Huang , Bernhard Schölkopf , Mrinmaya Sachan , Zhijing Jin