Related papers: Knowledge-Aware Language Model Pretraining

200 papers

Large pre-trained language models (PLMs) have been shown to retain implicit knowledge within their parameters. To enhance this implicit knowledge, we propose Knowledge Injection into Language Models (KILM), a novel approach that injects…

Computation and Language · Computer Science 2023-02-21 Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu, Dilek Hakkani-Tür

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.…

Computation and Language · Computer Science 2023-10-31 Jian Yang, Xinyu Hu, Gang Xiao, Yulong Shen

Pre-trained language models (PLMs) have achieved remarkable success on various natural language understanding tasks. Simple fine-tuning of PLMs, on the other hand, might be suboptimal for domain-specific tasks because they cannot possibly…

Computation and Language · Computer Science 2022-08-05 Minki Kang, Jinheon Baek, Sung Ju Hwang

Traditional language models are unable to efficiently model entity names observed in text. All but the most popular named entities appear infrequently in text providing insufficient context. Recent efforts have recognized that context can…

Computation and Language · Computer Science 2019-06-25 Angli Liu, Jingfei Du, Veselin Stoyanov

Incorporating factual knowledge into pre-trained language models (PLM) such as BERT is an emerging trend in recent NLP studies. However, most of the existing methods combine the external knowledge integration module with a modified…

Computation and Language · Computer Science 2022-05-06 Yinquan Lu, Haonan Lu, Guirong Fu, Qun Liu

Recent breakthroughs of pretrained language models have shown the effectiveness of self-supervised learning for a wide range of natural language processing (NLP) tasks. In addition to standard syntactic and semantic NLP tasks, pretrained…

Computation and Language · Computer Science 2019-12-23 Wenhan Xiong, Jingfei Du, William Yang Wang, Veselin Stoyanov

When training data is scarce, the incorporation of additional prior knowledge can assist the learning process. While it is common to initialize neural networks with weights that have been pre-trained on other large data sets, pre-training…

Machine Learning · Computer Science 2022-05-24 Laura von Rueden, Sebastian Houben, Kostadin Cvejoski, Christian Bauckhage, Nico Piatkowski

The ability of knowledge graphs to represent complex relationships at scale has led to their adoption for various needs including knowledge representation, question-answering, and recommendation systems. Knowledge graphs are often…

Computation and Language · Computer Science 2023-05-18 Jason Youn, Ilias Tagkopoulos

Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language…

Computation and Language · Computer Science 2022-10-20 Linlin Liu, Xin Li, Ruidan He, Lidong Bing, Shafiq Joty, Luo Si

Previous studies have revealed that vanilla pre-trained language models (PLMs) lack the capacity to handle knowledge-intensive NLP tasks alone; thus, several works have attempted to integrate external knowledge into PLMs. However, despite…

Computation and Language · Computer Science 2023-10-12 Yunzhi Yao, Peng Wang, Shengyu Mao, Chuanqi Tan, Fei Huang, Huajun Chen, Ningyu Zhang

As retrieval-augmented generation prevails in large language models, embedding models are becoming increasingly crucial. Despite the growing number of general embedding models, prior work often overlooks the critical role of training data…

Computation and Language · Computer Science 2025-01-16 Xinshuo Hu, Zifei Shan, Xinping Zhao, Zetian Sun, Zhenyu Liu, Dongfang Li, Shaolin Ye, Xinyuan Wei, Qian Chen, Baotian Hu, Haofen Wang, Jun Yu, Min Zhang

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring…

Computation and Language · Computer Science 2020-02-21 Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, Ming-Wei Chang

Despite the recent successes of large, pretrained neural language models (LLMs), comparatively little is known about the representations of linguistic structure they learn during pretraining, which can lead to unexpected behaviors in…

Computation and Language · Computer Science 2024-12-24 Adam Davies, Jize Jiang, ChengXiang Zhai

Though pre-trained language models such as BERT and XLNet have rapidly advanced the state of the art on many NLP tasks, they capture implicit semantics relying only on surface information between words in the corpus. Intuitively, background knowledge…

Computation and Language · Computer Science 2021-06-01 Ruiqing Yan, Lanchang Sun, Fang Wang, Xiaoming Zhang

Pre-trained language models (PLMs) like BERT have made significant progress in various downstream NLP tasks. However, by asking models to do cloze-style tests, recent work finds that PLMs fall short in acquiring knowledge from unstructured…

Computation and Language · Computer Science 2023-10-12 Cunxiang Wang, Fuli Luo, Yanyang Li, Runxin Xu, Fei Huang, Yue Zhang

Pre-trained language models have recently emerged as a powerful tool for fine-tuning a variety of language tasks. Ideally, when models are pre-trained on a large amount of data, they are expected to gain implicit knowledge. In this paper, we…

Computation and Language · Computer Science 2023-06-22 Mohamad Ballout, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger

In parallel to their overwhelming success across NLP tasks, the language ability of deep Transformer networks pretrained via language modeling (LM) objectives has undergone extensive scrutiny. While probing revealed that these models encode a…

Computation and Language · Computer Science 2021-10-19 Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo M. Ponti, Anna Korhonen

Recent studies on pre-trained language models have demonstrated their ability to capture factual knowledge and applications in knowledge-aware downstream tasks. In this work, we present a language model pre-training framework guided by…

Computation and Language · Computer Science 2020-12-08 Bin He, Xin Jiang, Jinghui Xiao, Qun Liu

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale, Michael Rovatsos, Nehal Bhuta

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the…

Computation and Language · Computer Science 2019-09-05 Fabio Petroni, Tim Rocktäschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel