Related papers: Knowledge-Aware Language Model Pretraining

KILM: Knowledge Injection into Encoder-Decoder Language Models

Large pre-trained language models (PLMs) have been shown to retain implicit knowledge within their parameters. To enhance this implicit knowledge, we propose Knowledge Injection into Language Models (KILM), a novel approach that injects…

Computation and Language · Computer Science 2023-02-21 Yan Xu , Mahdi Namazifar , Devamanyu Hazarika , Aishwarya Padmakumar , Yang Liu , Dilek Hakkani-Tür

A Survey of Knowledge Enhanced Pre-trained Models

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.…

Computation and Language · Computer Science 2023-10-31 Jian Yang , Xinyu Hu , Gang Xiao , Yulong Shen

KALA: Knowledge-Augmented Language Model Adaptation

Pre-trained language models (PLMs) have achieved remarkable success on various natural language understanding tasks. Simple fine-tuning of PLMs, on the other hand, might be suboptimal for domain-specific tasks because they cannot possibly…

Computation and Language · Computer Science 2022-08-05 Minki Kang , Jinheon Baek , Sung Ju Hwang

Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition

Traditional language models are unable to efficiently model entity names observed in text. All but the most popular named entities appear infrequently in text providing insufficient context. Recent efforts have recognized that context can…

Computation and Language · Computer Science 2019-06-25 Angli Liu , Jingfei Du , Veselin Stoyanov

KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs

Incorporating factual knowledge into pre-trained language models (PLM) such as BERT is an emerging trend in recent NLP studies. However, most of the existing methods combine the external knowledge integration module with a modified…

Computation and Language · Computer Science 2022-05-06 Yinquan Lu , Haonan Lu , Guirong Fu , Qun Liu

Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model

Recent breakthroughs of pretrained language models have shown the effectiveness of self-supervised learning for a wide range of natural language processing (NLP) tasks. In addition to standard syntactic and semantic NLP tasks, pretrained…

Computation and Language · Computer Science 2019-12-23 Wenhan Xiong , Jingfei Du , William Yang Wang , Veselin Stoyanov

Informed Pre-Training on Prior Knowledge

When training data is scarce, the incorporation of additional prior knowledge can assist the learning process. While it is common to initialize neural networks with weights that have been pre-trained on other large data sets, pre-training…

Machine Learning · Computer Science 2022-05-24 Laura von Rueden , Sebastian Houben , Kostadin Cvejoski , Christian Bauckhage , Nico Piatkowski

KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction

The ability of knowledge graphs to represent complex relationships at scale has led to their adoption for various needs including knowledge representation, question-answering, and recommendation systems. Knowledge graphs are often…

Computation and Language · Computer Science 2023-05-18 Jason Youn , Ilias Tagkopoulos

Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples

Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language…

Computation and Language · Computer Science 2022-10-20 Linlin Liu , Xin Li , Ruidan He , Lidong Bing , Shafiq Joty , Luo Si

Knowledge Rumination for Pre-trained Language Models

Previous studies have revealed that vanilla pre-trained language models (PLMs) lack the capacity to handle knowledge-intensive NLP tasks alone; thus, several works have attempted to integrate external knowledge into PLMs. However, despite…

Computation and Language · Computer Science 2023-10-12 Yunzhi Yao , Peng Wang , Shengyu Mao , Chuanqi Tan , Fei Huang , Huajun Chen , Ningyu Zhang

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

As retrieval-augmented generation prevails in large language models, embedding models are becoming increasingly crucial. Despite the growing number of general embedding models, prior work often overlooks the critical role of training data…

Computation and Language · Computer Science 2025-01-16 Xinshuo Hu , Zifei Shan , Xinping Zhao , Zetian Sun , Zhenyu Liu , Dongfang Li , Shaolin Ye , Xinyuan Wei , Qian Chen , Baotian Hu , Haofen Wang , Jun Yu , Min Zhang

REALM: Retrieval-Augmented Language Model Pre-Training

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring…

Computation and Language · Computer Science 2020-02-21 Kelvin Guu , Kenton Lee , Zora Tung , Panupong Pasupat , Ming-Wei Chang

Competence-Based Analysis of Language Models

Despite the recent successes of large, pretrained neural language models (LLMs), comparatively little is known about the representations of linguistic structure they learn during pretraining, which can lead to unexpected behaviors in…

Computation and Language · Computer Science 2024-12-24 Adam Davies , Jize Jiang , ChengXiang Zhai

K-XLNet: A General Method for Combining Explicit Knowledge with Language Model Pretraining

Though pre-trained language models such as Bert and XLNet, have rapidly advanced the state-of-the-art on many NLP tasks, they implicit semantics only relying on surface information between words in corpus. Intuitively, background knowledge…

Computation and Language · Computer Science 2021-06-01 Ruiqing Yan , Lanchang Sun , Fang Wang , Xiaoming Zhang

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base

Pre-trained language models (PLMs) like BERT have made significant progress in various downstream NLP tasks. However, by asking models to do cloze-style tests, recent work finds that PLMs are short in acquiring knowledge from unstructured…

Computation and Language · Computer Science 2023-10-12 Cunxiang Wang , Fuli Luo , Yanyang Li , Runxin Xu , Fei Huang , Yue Zhang

Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI

Pre-trained language models have recently emerged as a powerful tool for fine-tuning a variety of language tasks. Ideally, when models are pre-trained on large amount of data, they are expected to gain implicit knowledge. In this paper, we…

Computation and Language · Computer Science 2023-06-22 Mohamad Ballout , Ulf Krumnack , Gunther Heidemann , Kai-Uwe Kühnberger

Verb Knowledge Injection for Multilingual Event Processing

In parallel to their overwhelming success across NLP tasks, language ability of deep Transformer networks, pretrained via language modeling (LM) objectives has undergone extensive scrutiny. While probing revealed that these models encode a…

Computation and Language · Computer Science 2021-10-19 Olga Majewska , Ivan Vulić , Goran Glavaš , Edoardo M. Ponti , Anna Korhonen

KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning

Recent studies on pre-trained language models have demonstrated their ability to capture factual knowledge and applications in knowledge-aware downstream tasks. In this work, we present a language model pre-training framework guided by…

Computation and Language · Computer Science 2020-12-08 Bin He , Xin Jiang , Jinghui Xiao , Qun Liu

Do Language Models Learn about Legal Entity Types during Pretraining?

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale , Michael Rovatsos , Nehal Bhuta

Language Models as Knowledge Bases?

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the…

Computation and Language · Computer Science 2019-09-05 Fabio Petroni , Tim Rocktäschel , Patrick Lewis , Anton Bakhtin , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel