English
Related papers

Related papers: Probing Script Knowledge from Pre-Trained Models

200 papers

Scripts - standardized event sequences describing typical everyday activities - have been shown to help understand narratives by providing expectations, resolving ambiguity, and filling in unstated information. However, to date they have…

Computation and Language · Computer Science 2021-04-19 Keisuke Sakaguchi , Chandra Bhagavatula , Ronan Le Bras , Niket Tandon , Peter Clark , Yejin Choi

Script knowledge plays a central role in text understanding and is relevant for a variety of downstream tasks. In this paper, we consider two recent datasets which provide a rich and general representation of script events in terms of…

Computation and Language · Computer Science 2019-05-21 Simon Ostermann , Michael Roth , Stefan Thater , Manfred Pinkal

While Pre-trained Language Models (PLMs) internalize a great amount of world knowledge, they have been shown incapable of recalling these knowledge to solve tasks requiring complex & multi-step reasoning. Similar to how humans develop a…

Computation and Language · Computer Science 2022-10-25 Boshi Wang , Xiang Deng , Huan Sun

Although pretrained language models (PLMs) can be prompted to perform a wide range of language tasks, it remains an open question how much this ability comes from generalizable linguistic understanding versus surface-level lexical patterns.…

Computation and Language · Computer Science 2023-05-23 Terra Blevins , Hila Gonen , Luke Zettlemoyer

Pre-trained Language Model (PLM) has become a representative foundation model in the natural language processing field. Most PLMs are trained with linguistic-agnostic pre-training tasks on the surface form of the text, such as the masked…

Computation and Language · Computer Science 2022-11-11 Yiming Cui , Wanxiang Che , Shijin Wang , Ting Liu

Predicting future events is an important activity with applications across multiple fields and domains. For example, the capacity to foresee stock market trends, natural disasters, business developments, or political events can facilitate…

Computation and Language · Computer Science 2025-01-13 Petraq Nako , Adam Jatowt

The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-blanks problems (e.g., cloze tests) is a natural approach for…

Computation and Language · Computer Science 2020-11-10 Taylor Shin , Yasaman Razeghi , Robert L. Logan , Eric Wallace , Sameer Singh

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale , Michael Rovatsos , Nehal Bhuta

Pre-trained models of source code have recently been successfully applied to a wide variety of Software Engineering tasks; they have also seen some practical adoption in practice, e.g. for code completion. Yet, we still know very little…

Software Engineering · Computer Science 2023-12-11 Anjan Karmakar , Romain Robbes

Simile interpretation is a crucial task in natural language processing. Nowadays, pre-trained language models (PLMs) have achieved state-of-the-art performance on many tasks. However, it remains under-explored whether PLMs can interpret…

Computation and Language · Computer Science 2022-03-17 Qianyu He , Sijie Cheng , Zhixu Li , Rui Xie , Yanghua Xiao

Pre-trained language models (PLMs) like BERT are being used for almost all language-related tasks, but interpreting their behavior still remains a significant challenge and many important questions remain largely unanswered. In this work,…

Computation and Language · Computer Science 2021-09-28 Samuel Stevens , Yu Su

While pre-trained language models (PTLMs) have achieved noticeable success on many NLP tasks, they still struggle for tasks that require event temporal reasoning, which is essential for event-centric applications. We present a continual…

Computation and Language · Computer Science 2021-09-20 Rujun Han , Xiang Ren , Nanyun Peng

Induction of common sense knowledge about prototypical sequences of events has recently received much attention. Instead of inducing this knowledge in the form of graphs, as in much of the previous work, in our method, distributed…

Machine Learning · Computer Science 2017-02-13 Ashutosh Modi , Ivan Titov

We study the problem of incorporating prior knowledge into a deep Transformer-based model,i.e.,Bidirectional Encoder Representations from Transformers (BERT), to enhance its performance on semantic textual matching tasks. By probing and…

Computation and Language · Computer Science 2021-02-23 Tingyu Xia , Yue Wang , Yuan Tian , Yi Chang

Query understanding plays a key role in exploring users' search intents and facilitating users to locate their most desired information. However, it is inherently challenging since it needs to capture semantic information from short and…

Information Retrieval · Computer Science 2023-11-20 Juanhui Li , Yao Ma , Wei Zeng , Suqi Cheng , Jiliang Tang , Shuaiqiang Wang , Dawei Yin

Large Language Models (LLMs) are widely used for temporal prediction, but their reliance on pretraining data raises contamination concerns, as accurate predictions on pre-cutoff test data may reflect memorization rather than reasoning,…

Computation and Language · Computer Science 2025-10-16 Xin Gao , Ruiyi Zhang , Daniel Du , Saurabh Mahindre , Sai Ashish Somayajula , Pengtao Xie

Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse…

Neural models that do not rely on pre-training have excelled in the keyphrase generation task with large annotated datasets. Meanwhile, new approaches have incorporated pre-trained language models (PLMs) for their data efficiency. However,…

Computation and Language · Computer Science 2024-02-26 Di Wu , Wasi Uddin Ahmad , Kai-Wei Chang

Pre-trained language models (PLMs) like BERT have made significant progress in various downstream NLP tasks. However, by asking models to do cloze-style tests, recent work finds that PLMs are short in acquiring knowledge from unstructured…

Computation and Language · Computer Science 2023-10-12 Cunxiang Wang , Fuli Luo , Yanyang Li , Runxin Xu , Fei Huang , Yue Zhang

Recently, language models have demonstrated strong performance on various natural language understanding tasks. Language models trained on large human-generated corpus encode not only a significant amount of human knowledge, but also the…

Computation and Language · Computer Science 2023-04-10 Damin Zhang , Julia Rayz , Romila Pradhan
‹ Prev 1 2 3 10 Next ›