Related papers: Knowledge Efficient Deep Learning for Natural Lang…

Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples

Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language…

Computation and Language · Computer Science 2022-10-20 Linlin Liu, Xin Li, Ruidan He, Lidong Bing, Shafiq Joty, Luo Si

A Survey on Knowledge-Enhanced Pre-trained Language Models

Natural Language Processing (NLP) has been revolutionized by the use of Pre-trained Language Models (PLMs) such as BERT. Despite setting new records in nearly every NLP task, PLMs still face a number of challenges including poor…

Computation and Language · Computer Science 2022-12-29 Chaoqi Zhen, Yanlei Shang, Xiangyu Liu, Yifei Li, Yong Chen, Dell Zhang

Deep Knowledge Tracing

Knowledge tracing, where a machine models the knowledge of a student as they interact with coursework, is a well-established problem in computer supported education. Though effectively modeling student knowledge would have high…

Artificial Intelligence · Computer Science 2015-06-22 Chris Piech, Jonathan Spencer, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas Guibas, Jascha Sohl-Dickstein

MKD: a Multi-Task Knowledge Distillation Approach for Pretrained Language Models

Pretrained language models have led to significant performance gains in many NLP tasks. However, the intensive computing resources to train such models remain an issue. Knowledge distillation alleviates this problem by learning a…

Computation and Language · Computer Science 2020-05-04 Linqing Liu, Huan Wang, Jimmy Lin, Richard Socher, Caiming Xiong

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Machine Learning has been the quintessential solution for many AI problems, but learning is still heavily dependent on the specific training data. Some learning models can be incorporated with a prior knowledge in the Bayesian set up, but…

Computation and Language · Computer Science 2018-05-22 K M Annervaz, Somnath Basu Roy Chowdhury, Ambedkar Dukkipati

Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks

We study the problem of incorporating prior knowledge into a deep Transformer-based model, i.e., Bidirectional Encoder Representations from Transformers (BERT), to enhance its performance on semantic textual matching tasks. By probing and…

Computation and Language · Computer Science 2021-02-23 Tingyu Xia, Yue Wang, Yuan Tian, Yi Chang

Self-Referenced Deep Learning

Knowledge distillation is an effective approach to transferring knowledge from a teacher neural network to a student target network for satisfying the low-memory and fast running requirements in practical use. Whilst being able to create…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Xu Lan, Xiatian Zhu, Shaogang Gong

TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models

KEPLMs are pre-trained models that utilize external knowledge to enhance language understanding. Previous language models facilitated knowledge acquisition by incorporating knowledge-related pre-training tasks learned from relation triples…

Computation and Language · Computer Science 2024-03-19 Junbing Yan, Chengyu Wang, Taolin Zhang, Xiaofeng He, Jun Huang, Longtao Huang, Hui Xue, Wei Zhang

A Comprehensive Survey on Knowledge Distillation

Deep Neural Networks (DNNs) have achieved notable performance in the fields of computer vision and natural language processing with various applications in both academia and industry. However, with recent advancements in DNNs and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Amir M. Mansourian, Rozhan Ahmadi, Masoud Ghafouri, Amir Mohammad Babaei, Elaheh Badali Golezani, Zeynab Yasamani Ghamchi, Vida Ramezanian, Alireza Taherian, Kimia Dinashi, Amirali Miri, Shohreh Kasaei

KRNet: Towards Efficient Knowledge Replay

The knowledge replay technique has been widely used in many tasks such as continual learning and continuous domain adaptation. The key lies in how to effectively encode the knowledge extracted from previous data and replay them during…

Machine Learning · Computer Science 2022-05-24 Yingying Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu

Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting

Deep pretrained language models have achieved great success in the way of pretraining first and then fine-tuning. But such a sequential transfer learning paradigm often confronts the catastrophic forgetting problem and leads to sub-optimal…

Computation and Language · Computer Science 2020-04-28 Sanyuan Chen, Yutai Hou, Yiming Cui, Wanxiang Che, Ting Liu, Xiangzhan Yu

Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources

For languages with no annotated resources, transferring knowledge from rich-resource languages is an effective solution for named entity recognition (NER). While all existing methods directly transfer from source-learned model to a target…

Computation and Language · Computer Science 2020-07-16 Qianhui Wu, Zijia Lin, Guoxin Wang, Hui Chen, Börje F. Karlsson, Biqing Huang, Chin-Yew Lin

EduBERT: Pretrained Deep Language Models for Learning Analytics

The use of large pretrained neural networks to create contextualized word embeddings has drastically improved performance on several natural language processing (NLP) tasks. These computationally expensive models have begun to be applied to…

Computers and Society · Computer Science 2019-12-03 Benjamin Clavié, Kobi Gal

Knowledge-Driven Deep Learning Paradigms for Wireless Network Optimization in 6G

In the sixth-generation (6G) networks, newly emerging diversified services of massive users in dynamic network environments are required to be satisfied by multi-dimensional heterogeneous resources. The resulting large-scale complicated…

Networking and Internet Architecture · Computer Science 2024-02-08 Ruijin Sun, Nan Cheng, Changle Li, Fangjiong Chen, Wen Chen

Evolving Knowledge Distillation with Large Language Models and Active Learning

Large language models (LLMs) have demonstrated remarkable capabilities across various NLP tasks. However, their computational costs are prohibitively high. To address this issue, previous research has attempted to distill the knowledge of…

Computation and Language · Computer Science 2024-03-12 Chengyuan Liu, Yangyang Kang, Fubang Zhao, Kun Kuang, Zhuoren Jiang, Changlong Sun, Fei Wu

A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training

Modern Natural Language Generation (NLG) models come with massive computational and storage requirements. In this work, we study the potential of compressing them, which is crucial for real-world applications serving millions of users. We…

Computation and Language · Computer Science 2023-05-29 Nitay Calderon, Subhabrata Mukherjee, Roi Reichart, Amir Kantor

Deep Transform and Metric Learning Networks

Based on its great successes in inference and denoising tasks, Dictionary Learning (DL) and its related sparse optimization formulations have garnered a lot of research interest. While most solutions have focused on single layer…

Machine Learning · Computer Science 2021-04-22 Wen Tang, Emilie Chouzenoux, Jean-Christophe Pesquet, Hamid Krim

Using External knowledge to Enhanced PLM for Semantic Matching

Modeling semantic relevance has always been a challenging and critical task in natural language processing. In recent years, with the emergence of massive amounts of annotated data, it has become feasible to train complex models, such as…

Computation and Language · Computer Science 2025-05-13 Min Li, Chun Yuan

Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies

Emerging Large Language Models (LLMs) like GPT-4 have revolutionized Natural Language Processing (NLP), showing potential in traditional tasks such as Named Entity Recognition (NER). Our study explores a three-phase training strategy that…

Computation and Language · Computer Science 2024-03-26 Yining Huang, Keke Tang, Meilian Chen

Neural Skill Transfer from Supervised Language Tasks to Reading Comprehension

Reading comprehension is a challenging task in natural language processing and requires a set of skills to be solved. While current approaches focus on solving the task as a whole, in this paper, we propose to use a neural network 'skill'…

Computation and Language · Computer Science 2017-11-13 Todor Mihaylov, Zornitsa Kozareva, Anette Frank