Time Machine GPT

Felix Drinkall; Eghbal Rahimikia; Janet B. Pierrehumbert; Stefan Zohren

Subjects: Computation and Language; Computational Engineering, Finance, and Science; Machine Learning
Date: 2024-04-30 (v1)

Abstract

Large language models (LLMs) are often trained on extensive, temporally indiscriminate text corpora, reflecting the lack of datasets with temporal metadata. This approach is not aligned with the evolving nature of language. Conventional methods for creating temporally adapted language models often depend on further pre-training static models on time-specific data. This paper presents a new approach: a series of point-in-time LLMs called Time Machine GPT (TiMaGPT), specifically designed to be nonprognosticative. This ensures they remain uninformed about future factual information and linguistic changes. This strategy is beneficial for understanding language evolution and is of critical importance when applying models in dynamic contexts, such as time-series forecasting, where foresight of future information can prove problematic. We provide access to both the models and training datasets.
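To make the nonprognosticative property concrete, here is a minimal sketch of how a point-in-time training set could be assembled. It is an illustration under stated assumptions, not the authors' pipeline: it assumes every document carries a reliable publication date, and the names `Document`, `published`, and `point_in_time_corpus` are hypothetical.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Document:
    """Hypothetical corpus record: raw text plus a publication date."""
    text: str
    published: date


def point_in_time_corpus(docs, cutoff):
    """Return only the documents published on or before `cutoff`.

    A model trained on the result cannot have seen text dated after the
    cutoff, which is the property the abstract calls nonprognosticative.
    """
    return [d for d in docs if d.published <= cutoff]


# Toy corpus (invented for illustration only).
docs = [
    Document("report written in 2018", date(2018, 5, 1)),
    Document("report written in 2020", date(2020, 3, 15)),
    Document("report written in 2022", date(2022, 11, 30)),
]

# One training set per cutoff year yields a series of point-in-time corpora.
corpora = {
    year: point_in_time_corpus(docs, date(year, 12, 31))
    for year in (2019, 2021)
}
```

In this sketch each cutoff produces its own corpus, so a separate model can be trained per cutoff instead of further pre-training one static model on later data.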

Keywords

large language model; instruction tuning; language modeling

Cite

@article{arxiv.2404.18543,
  title  = {Time Machine GPT},
  author = {Felix Drinkall and Eghbal Rahimikia and Janet B. Pierrehumbert and Stefan Zohren},
  journal= {arXiv preprint arXiv:2404.18543},
  year   = {2024}
}

Comments

NAACL Findings 2024
