Related papers: Probing Script Knowledge from Pre-Trained Models

proScript: Partially Ordered Scripts Generation via Pre-trained Language Models

Scripts - standardized event sequences describing typical everyday activities - have been shown to help understand narratives by providing expectations, resolving ambiguity, and filling in unstated information. However, to date they have…

Computation and Language · Computer Science 2021-04-19 Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi

Aligning Script Events with Narrative Texts

Script knowledge plays a central role in text understanding and is relevant for a variety of downstream tasks. In this paper, we consider two recent datasets which provide a rich and general representation of script events in terms of…

Computation and Language · Computer Science 2019-05-21 Simon Ostermann, Michael Roth, Stefan Thater, Manfred Pinkal

Iteratively Prompt Pre-trained Language Models for Chain of Thought

While Pre-trained Language Models (PLMs) internalize a great amount of world knowledge, they have been shown incapable of recalling these knowledge to solve tasks requiring complex & multi-step reasoning. Similar to how humans develop a…

Computation and Language · Computer Science 2022-10-25 Boshi Wang, Xiang Deng, Huan Sun

Prompting Language Models for Linguistic Structure

Although pretrained language models (PLMs) can be prompted to perform a wide range of language tasks, it remains an open question how much this ability comes from generalizable linguistic understanding versus surface-level lexical patterns.…

Computation and Language · Computer Science 2023-05-23 Terra Blevins, Hila Gonen, Luke Zettlemoyer

LERT: A Linguistically-motivated Pre-trained Language Model

Pre-trained Language Model (PLM) has become a representative foundation model in the natural language processing field. Most PLMs are trained with linguistic-agnostic pre-training tasks on the surface form of the text, such as the masked…

Computation and Language · Computer Science 2022-11-11 Yiming Cui, Wanxiang Che, Shijin Wang, Ting Liu

Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction

Predicting future events is an important activity with applications across multiple fields and domains. For example, the capacity to foresee stock market trends, natural disasters, business developments, or political events can facilitate…

Computation and Language · Computer Science 2025-01-13 Petraq Nako, Adam Jatowt

AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts

The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-blanks problems (e.g., cloze tests) is a natural approach for…

Computation and Language · Computer Science 2020-11-10 Taylor Shin, Yasaman Razeghi, Robert L. Logan, Eric Wallace, Sameer Singh

Do Language Models Learn about Legal Entity Types during Pretraining?

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale, Michael Rovatsos, Nehal Bhuta

INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers

Pre-trained models of source code have recently been successfully applied to a wide variety of Software Engineering tasks; they have also seen some practical adoption in practice, e.g. for code completion. Yet, we still know very little…

Software Engineering · Computer Science 2023-12-11 Anjan Karmakar, Romain Robbes

Can Pre-trained Language Models Interpret Similes as Smart as Human?

Simile interpretation is a crucial task in natural language processing. Nowadays, pre-trained language models (PLMs) have achieved state-of-the-art performance on many tasks. However, it remains under-explored whether PLMs can interpret…

Computation and Language · Computer Science 2022-03-17 Qianyu He, Sijie Cheng, Zhixu Li, Rui Xie, Yanghua Xiao

An Investigation of Language Model Interpretability via Sentence Editing

Pre-trained language models (PLMs) like BERT are being used for almost all language-related tasks, but interpreting their behavior still remains a significant challenge and many important questions remain largely unanswered. In this work,…

Computation and Language · Computer Science 2021-09-28 Samuel Stevens, Yu Su

ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning

While pre-trained language models (PTLMs) have achieved noticeable success on many NLP tasks, they still struggle for tasks that require event temporal reasoning, which is essential for event-centric applications. We present a continual…

Computation and Language · Computer Science 2021-09-20 Rujun Han, Xiang Ren, Nanyun Peng

Learning Semantic Script Knowledge with Event Embeddings

Induction of common sense knowledge about prototypical sequences of events has recently received much attention. Instead of inducing this knowledge in the form of graphs, as in much of the previous work, in our method, distributed…

Machine Learning · Computer Science 2017-02-13 Ashutosh Modi, Ivan Titov

Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks

We study the problem of incorporating prior knowledge into a deep Transformer-based model, i.e., Bidirectional Encoder Representations from Transformers (BERT), to enhance its performance on semantic textual matching tasks. By probing and…

Computation and Language · Computer Science 2021-02-23 Tingyu Xia, Yue Wang, Yuan Tian, Yi Chang

Graph Enhanced BERT for Query Understanding

Query understanding plays a key role in exploring users' search intents and facilitating users to locate their most desired information. However, it is inherently challenging since it needs to capture semantic information from short and…

Information Retrieval · Computer Science 2023-11-20 Juanhui Li, Yao Ma, Wei Zeng, Suqi Cheng, Jiliang Tang, Shuaiqiang Wang, Dawei Yin

Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs

Large Language Models (LLMs) are widely used for temporal prediction, but their reliance on pretraining data raises contamination concerns, as accurate predictions on pre-cutoff test data may reflect memorization rather than reasoning,…

Computation and Language · Computer Science 2025-10-16 Xin Gao, Ruiyi Zhang, Daniel Du, Saurabh Mahindre, Sai Ashish Somayajula, Pengtao Xie

Event knowledge in large language models: the gap between the impossible and the unlikely

Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse…

Computation and Language · Computer Science 2023-10-27 Carina Kauf, Anna A. Ivanova, Giulia Rambelli, Emmanuele Chersoni, Jingyuan Selena She, Zawad Chowdhury, Evelina Fedorenko, Alessandro Lenci

Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study

Neural models that do not rely on pre-training have excelled in the keyphrase generation task with large annotated datasets. Meanwhile, new approaches have incorporated pre-trained language models (PLMs) for their data efficiency. However,…

Computation and Language · Computer Science 2024-02-26 Di Wu, Wasi Uddin Ahmad, Kai-Wei Chang

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base

Pre-trained language models (PLMs) like BERT have made significant progress in various downstream NLP tasks. However, by asking models to do cloze-style tests, recent work finds that PLMs are short in acquiring knowledge from unstructured…

Computation and Language · Computer Science 2023-10-12 Cunxiang Wang, Fuli Luo, Yanyang Li, Runxin Xu, Fei Huang, Yue Zhang

Counteracts: Testing Stereotypical Representation in Pre-trained Language Models

Recently, language models have demonstrated strong performance on various natural language understanding tasks. Language models trained on large human-generated corpus encode not only a significant amount of human knowledge, but also the…

Computation and Language · Computer Science 2023-04-10 Damin Zhang, Julia Rayz, Romila Pradhan