English
Related papers

Related papers: KILM: Knowledge Injection into Encoder-Decoder Lan…

200 papers

Large language models (LLMs) have demonstrated remarkable performance in a wide range of natural language tasks. However, as these models continue to grow in size, they face significant challenges in terms of computational costs.…

Computation and Language · Computer Science 2023-08-08 Ankush Agarwal , Sakharam Gawade , Amar Prakash Azad , Pushpak Bhattacharyya

How much knowledge do pretrained language models hold? Recent research observed that pretrained transformers are adept at modeling semantics but it is unclear to what degree they grasp human knowledge, or how to ensure they do so. In this…

Computation and Language · Computer Science 2021-02-05 Corby Rosset , Chenyan Xiong , Minh Phan , Xia Song , Paul Bennett , Saurabh Tiwary

Pre-trained language models (LMs) have been shown to memorize a substantial amount of knowledge from the pre-training corpora; however, they are still limited in recalling factually correct knowledge given a certain context. Hence, they…

Computation and Language · Computer Science 2022-04-08 Ruibo Liu , Guoqing Zheng , Shashank Gupta , Radhika Gaonkar , Chongyang Gao , Soroush Vosoughi , Milad Shokouhi , Ahmed Hassan Awadallah

Recent research demonstrates that external knowledge injection can advance pre-trained language models (PLMs) in a variety of downstream NLP tasks. However, existing knowledge injection methods are either applicable to structured knowledge…

Computation and Language · Computer Science 2023-05-08 Deming Ye , Yankai Lin , Zhengyan Zhang , Maosong Sun

Pre-trained language models(PLM) have made impressive results in various NLP tasks. It has been revealed that one of the key factors to their success is the parameters of these models implicitly learn all kinds of knowledge during…

Computation and Language · Computer Science 2023-09-19 Xin Cheng , Yankai Lin , Xiuying Chen , Dongyan Zhao , Rui Yan

Knowledge-Enhanced Pre-trained Language Models (KEPLMs) are pre-trained models with relation triples injecting from knowledge graphs to improve language understanding abilities. To guarantee effective knowledge injection, previous studies…

Computation and Language · Computer Science 2022-10-18 Taolin Zhang , Chengyu Wang , Nan Hu , Minghui Qiu , Chengguang Tang , Xiaofeng He , Jun Huang

Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with tasks that require intensive knowledge. To address these challenges, methods for integrating…

Computation and Language · Computer Science 2024-12-17 Fali Wang , Runxue Bao , Suhang Wang , Wenchao Yu , Yanchi Liu , Wei Cheng , Haifeng Chen

Knowledge-enhanced Pre-trained Language Model (PLM) has recently received significant attention, which aims to incorporate factual knowledge into PLMs. However, most existing methods modify the internal structures of fixed types of PLMs by…

Computation and Language · Computer Science 2022-10-18 Jianing Wang , Wenkang Huang , Qiuhui Shi , Hongbin Wang , Minghui Qiu , Xiang Li , Ming Gao

Previous studies have revealed that vanilla pre-trained language models (PLMs) lack the capacity to handle knowledge-intensive NLP tasks alone; thus, several works have attempted to integrate external knowledge into PLMs. However, despite…

Computation and Language · Computer Science 2023-10-12 Yunzhi Yao , Peng Wang , Shengyu Mao , Chuanqi Tan , Fei Huang , Huajun Chen , Ningyu Zhang

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.…

Computation and Language · Computer Science 2023-10-31 Jian Yang , Xinyu Hu , Gang Xiao , Yulong Shen

Pre-trained language representation models (PLMs) cannot well capture factual knowledge from text. In contrast, knowledge embedding (KE) methods can effectively represent the relational facts in knowledge graphs (KGs) with informative…

Computation and Language · Computer Science 2020-11-24 Xiaozhi Wang , Tianyu Gao , Zhaocheng Zhu , Zhengyan Zhang , Zhiyuan Liu , Juanzi Li , Jian Tang

Recent explorations of large-scale pre-trained language models (PLMs) have revealed the power of PLMs with huge amounts of parameters, setting off a wave of training ever-larger PLMs. However, it requires tremendous computational resources…

Computation and Language · Computer Science 2022-04-27 Yujia Qin , Yankai Lin , Jing Yi , Jiajie Zhang , Xu Han , Zhengyan Zhang , Yusheng Su , Zhiyuan Liu , Peng Li , Maosong Sun , Jie Zhou

Traditional language models are unable to efficiently model entity names observed in text. All but the most popular named entities appear infrequently in text providing insufficient context. Recent efforts have recognized that context can…

Computation and Language · Computer Science 2019-06-25 Angli Liu , Jingfei Du , Veselin Stoyanov

Incorporating factual knowledge into pre-trained language models (PLM) such as BERT is an emerging trend in recent NLP studies. However, most of the existing methods combine the external knowledge integration module with a modified…

Computation and Language · Computer Science 2022-05-06 Yinquan Lu , Haonan Lu , Guirong Fu , Qun Liu

Pre-trained Language Models (PLMs) which are trained on large text corpus via self-supervised learning method, have yielded promising performance on various tasks in Natural Language Processing (NLP). However, though PLMs with huge…

Computation and Language · Computer Science 2023-08-31 Linmei Hu , Zeyi Liu , Ziwang Zhao , Lei Hou , Liqiang Nie , Juanzi Li

Large language models (LLMs) often require vast amounts of text to effectively acquire new knowledge. While continuing pre-training on large corpora or employing retrieval-augmented generation (RAG) has proven successful, updating an LLM…

Computation and Language · Computer Science 2025-08-11 Hugo Abonizio , Thales Almeida , Roberto Lotufo , Rodrigo Nogueira

Large Language Models (LLMs) have demonstrated remarkable success in various tasks such as natural language understanding, text summarization, and machine translation. However, their general-purpose nature often limits their effectiveness…

Computation and Language · Computer Science 2025-09-03 Zirui Song , Bin Yan , Yuhan Liu , Miao Fang , Mingzhe Li , Rui Yan , Xiuying Chen

Injecting external knowledge can improve the performance of pre-trained language models (PLMs) on various downstream NLP tasks. However, massive retraining is required to deploy new knowledge injection methods or knowledge bases for…

Computation and Language · Computer Science 2023-12-05 Zhengyan Zhang , Zhiyuan Zeng , Yankai Lin , Huadong Wang , Deming Ye , Chaojun Xiao , Xu Han , Zhiyuan Liu , Peng Li , Maosong Sun , Jie Zhou

Autoregressive large language models (LLMs) pre-trained by next token prediction are inherently proficient in generative tasks. However, their performance on knowledge-driven tasks such as factual knowledge querying remains unsatisfactory.…

Computation and Language · Computer Science 2026-01-14 Peng Yu , Cheng Deng , Beiya Dai , Xinbing Wang , Ying Wen

Integrating structured knowledge from Knowledge Graphs (KGs) into Large Language Models (LLMs) remains a key challenge for symbolic reasoning. Existing methods mainly rely on prompt engineering or fine-tuning, which lose structural fidelity…

Machine Learning · Computer Science 2025-05-13 Erica Coppolillo
‹ Prev 1 2 3 10 Next ›