Related papers: Interpreting Language Models Through Knowledge Gra…

Probing Pretrained Language Models for Lexical Semantics

The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on…

Computation and Language · Computer Science 2020-10-13 Ivan Vulić , Edoardo Maria Ponti , Robert Litschko , Goran Glavaš , Anna Korhonen

RelBERT: Embedding Relations with Language Models

Many applications need access to background knowledge about how different concepts and entities are related. Although Knowledge Graphs (KG) and Large Language Models (LLM) can address this need to some extent, KGs are inevitably incomplete…

Computation and Language · Computer Science 2023-10-10 Asahi Ushio , Jose Camacho-Collados , Steven Schockaert

BERTnesia: Investigating the capture and forgetting of knowledge in BERT

Probing complex language models has recently revealed several insights into linguistic and semantic patterns found in the learned representations. In this article, we probe BERT specifically to understand and measure the relational…

Computation and Language · Computer Science 2021-09-09 Jonas Wallat , Jaspreet Singh , Avishek Anand

ALBERT with Knowledge Graph Encoder Utilizing Semantic Similarity for Commonsense Question Answering

Recently, pre-trained language representation models such as bidirectional encoder representations from transformers (BERT) have been performing well in commonsense question answering (CSQA). However, there is a problem that the models do…

Computation and Language · Computer Science 2022-11-15 Byeongmin Choi , YongHyun Lee , Yeunwoong Kyung , Eunchan Kim

SPOT: Knowledge-Enhanced Language Representations for Information Extraction

Knowledge-enhanced pre-trained models for language representation have been shown to be more effective in knowledge base construction tasks (i.e.,~relation extraction) than language models such as BERT. These knowledge-enhanced language…

Computation and Language · Computer Science 2022-10-25 Jiacheng Li , Yannis Katsis , Tyler Baldwin , Ho-Cheol Kim , Andrew Bartko , Julian McAuley , Chun-Nan Hsu

Knowledge Graph Fusion for Language Model Fine-tuning

Language Models such as BERT have grown in popularity due to their ability to be pre-trained and perform robustly on a wide range of Natural Language Processing tasks. Often seen as an evolution over traditional word embedding techniques,…

Computation and Language · Computer Science 2022-06-30 Nimesh Bhana , Terence L. van Zyl

What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation

Heavily pre-trained transformer models such as BERT have recently shown to be remarkably powerful at language modelling by achieving impressive results on numerous downstream tasks. It has also been shown that they are able to implicitly…

Information Retrieval · Computer Science 2021-03-05 Gustavo Penha , Claudia Hauff

Enriching language models with graph-based context information to better understand textual data

A considerable number of texts encountered daily are somehow connected with each other. For example, Wikipedia articles refer to other articles via hyperlinks, scientific papers relate to others via citations or (co)authors, while tweets…

Computation and Language · Computer Science 2025-08-08 Albert Roethel , Maria Ganzha , Anna Wróblewska

Exploring the Capacity of a Large-scale Masked Language Model to Recognize Grammatical Errors

In this paper, we explore the capacity of a language model-based method for grammatical error detection in detail. We first show that 5 to 10% of training data are enough for a BERT-based error detection method to achieve performance…

Computation and Language · Computer Science 2021-08-30 Ryo Nagata , Manabu Kimura , Kazuaki Hanawa

Language Models as Knowledge Bases?

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the…

Computation and Language · Computer Science 2019-09-05 Fabio Petroni , Tim Rocktäschel , Patrick Lewis , Anton Bakhtin , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English

Transformer-based language models achieve high performance on various tasks, but we still lack understanding of the kind of linguistic knowledge they learn and rely on. We evaluate three models (BERT, RoBERTa, and ALBERT), testing their…

Computation and Language · Computer Science 2020-11-03 Marius Mosbach , Stefania Degaetano-Ortlieb , Marie-Pauline Krielke , Badr M. Abdullah , Dietrich Klakow

What does BERT learn about prosody?

Language models have become nearly ubiquitous in natural language processing applications achieving state-of-the-art results in many tasks including prosody. As the model design does not define predetermined linguistic targets during…

Computation and Language · Computer Science 2023-04-26 Sofoklis Kakouros , Johannah O'Mahony

Injecting linguistic knowledge into BERT for Dialogue State Tracking

Dialogue State Tracking (DST) models often employ intricate neural network architectures, necessitating substantial training data, and their inference process lacks transparency. This paper proposes a method that extracts linguistic…

Computation and Language · Computer Science 2024-07-15 Xiaohan Feng , Xixin Wu , Helen Meng

Comparison of biomedical relationship extraction methods and models for knowledge graph creation

Biomedical research is growing at such an exponential pace that scientists, researchers, and practitioners are no more able to cope with the amount of published literature in the domain. The knowledge presented in the literature needs to be…

Artificial Intelligence · Computer Science 2024-07-09 Nikola Milosevic , Wolfgang Thielemann

Inducing Relational Knowledge from BERT

One of the most remarkable properties of word embeddings is the fact that they capture certain types of semantic and syntactic relationships. Recently, pre-trained language models such as BERT have achieved groundbreaking results across a…

Computation and Language · Computer Science 2019-12-02 Zied Bouraoui , Jose Camacho-Collados , Steven Schockaert

BERTnesia: Investigating the capture and forgetting of knowledge in BERT

Probing complex language models has recently revealed several insights into linguistic and semantic patterns found in the learned representations. In this paper, we probe BERT specifically to understand and measure the relational knowledge…

Computation and Language · Computer Science 2021-09-09 Jonas Wallat , Jaspreet Singh , Avishek Anand

Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability

With the recent influx of bidirectional contextualized transformer language models in the NLP, it becomes a necessity to have a systematic comparative study of these models on variety of datasets. Also, the performance of these language…

Computation and Language · Computer Science 2020-09-10 Mayank Chhipa , Hrushikesh Mahesh Vazurkar , Abhijeet Kumar , Mridul Mishra

Analysis and Evaluation of Language Models for Word Sense Disambiguation

Transformer-based language models have taken many fields in NLP by storm. BERT and its derivatives dominate most of the existing evaluation benchmarks, including those for Word Sense Disambiguation (WSD), thanks to their ability in…

Computation and Language · Computer Science 2021-03-19 Daniel Loureiro , Kiamehr Rezaee , Mohammad Taher Pilehvar , Jose Camacho-Collados

Entity-aware Transformers for Entity Search

Pre-trained language models such as BERT have been a key ingredient to achieve state-of-the-art results on a variety of tasks in natural language processing and, more recently, also in information retrieval.Recent research even claims that…

Information Retrieval · Computer Science 2022-05-03 Emma J. Gerritse , Faegheh Hasibi , Arjen P. de Vries

Probing Across Time: What Does RoBERTa Know and When?

Models of language trained on very large corpora have been demonstrated useful for NLP. As fixed artifacts, they have become the object of intense study, with many researchers "probing" the extent to which linguistic abstractions, factual…

Computation and Language · Computer Science 2021-09-21 Leo Z. Liu , Yizhong Wang , Jungo Kasai , Hannaneh Hajishirzi , Noah A. Smith