Related papers: Language Models as Knowledge Bases?

Unsupervised Pre-training with Structured Knowledge for Improving Natural Language Inference

While recent research on natural language inference has considerably benefited from large annotated datasets, the amount of inference-related knowledge (including commonsense) provided in the annotated data is still rather limited. There…

Computation and Language · Computer Science 2021-09-10 Xiaoyu Yang , Xiaodan Zhu , Zhan Shi , Tianda Li

Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model

Recent breakthroughs of pretrained language models have shown the effectiveness of self-supervised learning for a wide range of natural language processing (NLP) tasks. In addition to standard syntactic and semantic NLP tasks, pretrained…

Computation and Language · Computer Science 2019-12-23 Wenhan Xiong , Jingfei Du , William Yang Wang , Veselin Stoyanov

BERTnesia: Investigating the capture and forgetting of knowledge in BERT

Probing complex language models has recently revealed several insights into linguistic and semantic patterns found in the learned representations. In this article, we probe BERT specifically to understand and measure the relational…

Computation and Language · Computer Science 2021-09-09 Jonas Wallat , Jaspreet Singh , Avishek Anand

Language Models as Fact Checkers?

Recent work has suggested that language models (LMs) store both common-sense and factual knowledge learned from pre-training data. In this paper, we leverage this implicit knowledge to create an effective end-to-end fact checker using a…

Computation and Language · Computer Science 2020-07-27 Nayeon Lee , Belinda Z. Li , Sinong Wang , Wen-tau Yih , Hao Ma , Madian Khabsa

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

It has recently been observed that neural language models trained on unstructured text can implicitly store and retrieve knowledge using natural language queries. In this short paper, we measure the practical utility of this approach by…

Computation and Language · Computer Science 2020-10-07 Adam Roberts , Colin Raffel , Noam Shazeer

Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study

Large pre-trained language models help to achieve state of the art on a variety of natural language processing (NLP) tasks, nevertheless, they still suffer from forgetting when incrementally learning a sequence of tasks. To alleviate this…

Computation and Language · Computer Science 2023-03-03 Mingxu Tao , Yansong Feng , Dongyan Zhao

A Comprehensive Comparison of Pre-training Language Models

Recently, the development of pre-trained language models has brought natural language processing (NLP) tasks to the new state-of-the-art. In this paper we explore the efficiency of various pre-trained language models. We pre-train a list of…

Computation and Language · Computer Science 2023-07-27 Tong Guo

Linguistic Profiling of a Neural Language Model

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set…

Computation and Language · Computer Science 2024-02-27 Alessio Miaschi , Dominique Brunato , Felice Dell'Orletta , Giulia Venturi

Knowledge Graph Fusion for Language Model Fine-tuning

Language Models such as BERT have grown in popularity due to their ability to be pre-trained and perform robustly on a wide range of Natural Language Processing tasks. Often seen as an evolution over traditional word embedding techniques,…

Computation and Language · Computer Science 2022-06-30 Nimesh Bhana , Terence L. van Zyl

BERTnesia: Investigating the capture and forgetting of knowledge in BERT

Probing complex language models has recently revealed several insights into linguistic and semantic patterns found in the learned representations. In this paper, we probe BERT specifically to understand and measure the relational knowledge…

Computation and Language · Computer Science 2021-09-09 Jonas Wallat , Jaspreet Singh , Avishek Anand

Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media

This open access book provides a comprehensive overview of the state of the art in research and applications of Foundation Models and is intended for readers familiar with basic Natural Language Processing (NLP) concepts. Over the recent…

Computation and Language · Computer Science 2023-02-20 Gerhard Paaß , Sven Giesselbach

How Context Affects Language Models' Factual Predictions

When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing…

Computation and Language · Computer Science 2020-05-12 Fabio Petroni , Patrick Lewis , Aleksandra Piktus , Tim Rocktäschel , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

A Review on Language Models as Knowledge Bases

Recently, there has been a surge of interest in the NLP community on the use of pretrained Language Models (LMs) as Knowledge Bases (KBs). Researchers have shown that LMs trained on a sufficiently large (web) corpus will encode a…

Computation and Language · Computer Science 2022-04-14 Badr AlKhamissi , Millicent Li , Asli Celikyilmaz , Mona Diab , Marjan Ghazvininejad

Pre-trained language models as knowledge bases for Automotive Complaint Analysis

Recently it has been shown that large pre-trained language models like BERT (Devlin et al., 2018) are able to store commonsense factual knowledge captured in its pre-training corpus (Petroni et al., 2019). In our work we further evaluate…

Computation and Language · Computer Science 2020-12-07 V. D. Viellieber , M. Aßenmacher

An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering

Large-scale pre-trained language models (PLMs) such as BERT have recently achieved great success and become a milestone in natural language processing (NLP). It is now the consensus of the NLP community to adopt PLMs as the backbone for…

Computation and Language · Computer Science 2023-03-21 Nan Hu , Yike Wu , Guilin Qi , Dehai Min , Jiaoyan Chen , Jeff Z. Pan , Zafar Ali

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Unsupervised pretraining models have been shown to facilitate a wide range of downstream NLP applications. These models, however, retain some of the limitations of traditional static word embeddings. In particular, they encode only the…

Computation and Language · Computer Science 2020-04-21 Anne Lauscher , Ivan Vulić , Edoardo Maria Ponti , Anna Korhonen , Goran Glavaš

Large Language Model Programs

In recent years, large pre-trained language models (LLMs) have demonstrated the ability to follow instructions and perform novel tasks from a few examples. The possibility to parameterise an LLM through such in-context examples widens their…

Machine Learning · Computer Science 2023-05-10 Imanol Schlag , Sainbayar Sukhbaatar , Asli Celikyilmaz , Wen-tau Yih , Jason Weston , Jürgen Schmidhuber , Xian Li

REALM: Retrieval-Augmented Language Model Pre-Training

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring…

Computation and Language · Computer Science 2020-02-21 Kelvin Guu , Kenton Lee , Zora Tung , Panupong Pasupat , Ming-Wei Chang

What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models

Pre-training by language modeling has become a popular and successful approach to NLP tasks, but we have yet to understand exactly what linguistic capacities these pre-training processes confer upon models. In this paper we introduce a…

Computation and Language · Computer Science 2020-07-14 Allyson Ettinger

Inducing Relational Knowledge from BERT

One of the most remarkable properties of word embeddings is the fact that they capture certain types of semantic and syntactic relationships. Recently, pre-trained language models such as BERT have achieved groundbreaking results across a…

Computation and Language · Computer Science 2019-12-02 Zied Bouraoui , Jose Camacho-Collados , Steven Schockaert