English
Related papers

Related papers: Knowledge Efficient Deep Learning for Natural Lang…

200 papers

Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language…

Computation and Language · Computer Science 2022-10-20 Linlin Liu , Xin Li , Ruidan He , Lidong Bing , Shafiq Joty , Luo Si

Natural Language Processing (NLP) has been revolutionized by the use of Pre-trained Language Models (PLMs) such as BERT. Despite setting new records in nearly every NLP task, PLMs still face a number of challenges including poor…

Computation and Language · Computer Science 2022-12-29 Chaoqi Zhen , Yanlei Shang , Xiangyu Liu , Yifei Li , Yong Chen , Dell Zhang

Knowledge tracing---where a machine models the knowledge of a student as they interact with coursework---is a well established problem in computer supported education. Though effectively modeling student knowledge would have high…

Artificial Intelligence · Computer Science 2015-06-22 Chris Piech , Jonathan Spencer , Jonathan Huang , Surya Ganguli , Mehran Sahami , Leonidas Guibas , Jascha Sohl-Dickstein

Pretrained language models have led to significant performance gains in many NLP tasks. However, the intensive computing resources to train such models remain an issue. Knowledge distillation alleviates this problem by learning a…

Computation and Language · Computer Science 2020-05-04 Linqing Liu , Huan Wang , Jimmy Lin , Richard Socher , Caiming Xiong

Machine Learning has been the quintessential solution for many AI problems, but learning is still heavily dependent on the specific training data. Some learning models can be incorporated with a prior knowledge in the Bayesian set up, but…

Computation and Language · Computer Science 2018-05-22 K M Annervaz , Somnath Basu Roy Chowdhury , Ambedkar Dukkipati

We study the problem of incorporating prior knowledge into a deep Transformer-based model,i.e.,Bidirectional Encoder Representations from Transformers (BERT), to enhance its performance on semantic textual matching tasks. By probing and…

Computation and Language · Computer Science 2021-02-23 Tingyu Xia , Yue Wang , Yuan Tian , Yi Chang

Knowledge distillation is an effective approach to transferring knowledge from a teacher neural network to a student target network for satisfying the low-memory and fast running requirements in practice use. Whilst being able to create…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Xu Lan , Xiatian Zhu , Shaogang Gong

KEPLMs are pre-trained models that utilize external knowledge to enhance language understanding. Previous language models facilitated knowledge acquisition by incorporating knowledge-related pre-training tasks learned from relation triples…

Computation and Language · Computer Science 2024-03-19 Junbing Yan , Chengyu Wang , Taolin Zhang , Xiaofeng He , Jun Huang , Longtao Huang , Hui Xue , Wei Zhang

Deep Neural Networks (DNNs) have achieved notable performance in the fields of computer vision and natural language processing with various applications in both academia and industry. However, with recent advancements in DNNs and…

The knowledge replay technique has been widely used in many tasks such as continual learning and continuous domain adaptation. The key lies in how to effectively encode the knowledge extracted from previous data and replay them during…

Machine Learning · Computer Science 2022-05-24 Yingying Zhang , Qiaoyong Zhong , Di Xie , Shiliang Pu

Deep pretrained language models have achieved great success in the way of pretraining first and then fine-tuning. But such a sequential transfer learning paradigm often confronts the catastrophic forgetting problem and leads to sub-optimal…

Computation and Language · Computer Science 2020-04-28 Sanyuan Chen , Yutai Hou , Yiming Cui , Wanxiang Che , Ting Liu , Xiangzhan Yu

For languages with no annotated resources, transferring knowledge from rich-resource languages is an effective solution for named entity recognition (NER). While all existing methods directly transfer from source-learned model to a target…

Computation and Language · Computer Science 2020-07-16 Qianhui Wu , Zijia Lin , Guoxin Wang , Hui Chen , Börje F. Karlsson , Biqing Huang , Chin-Yew Lin

The use of large pretrained neural networks to create contextualized word embeddings has drastically improved performance on several natural language processing (NLP) tasks. These computationally expensive models have begun to be applied to…

Computers and Society · Computer Science 2019-12-03 Benjamin Clavié , Kobi Gal

In the sixth-generation (6G) networks, newly emerging diversified services of massive users in dynamic network environments are required to be satisfied by multi-dimensional heterogeneous resources. The resulting large-scale complicated…

Networking and Internet Architecture · Computer Science 2024-02-08 Ruijin Sun , Nan Cheng , Changle Li , Fangjiong Chen , Wen Chen

Large language models (LLMs) have demonstrated remarkable capabilities across various NLP tasks. However, their computational costs are prohibitively high. To address this issue, previous research has attempted to distill the knowledge of…

Computation and Language · Computer Science 2024-03-12 Chengyuan Liu , Yangyang Kang , Fubang Zhao , Kun Kuang , Zhuoren Jiang , Changlong Sun , Fei Wu

Modern Natural Language Generation (NLG) models come with massive computational and storage requirements. In this work, we study the potential of compressing them, which is crucial for real-world applications serving millions of users. We…

Computation and Language · Computer Science 2023-05-29 Nitay Calderon , Subhabrata Mukherjee , Roi Reichart , Amir Kantor

Based on its great successes in inference and denosing tasks, Dictionary Learning (DL) and its related sparse optimization formulations have garnered a lot of research interest. While most solutions have focused on single layer…

Machine Learning · Computer Science 2021-04-22 Wen Tang , Emilie Chouzenoux , Jean-Christophe Pesquet , Hamid Krim

Modeling semantic relevance has always been a challenging and critical task in natural language processing. In recent years, with the emergence of massive amounts of annotated data, it has become feasible to train complex models, such as…

Computation and Language · Computer Science 2025-05-13 Min Li , Chun Yuan

Emerging Large Language Models (LLMs) like GPT-4 have revolutionized Natural Language Processing (NLP), showing potential in traditional tasks such as Named Entity Recognition (NER). Our study explores a three-phase training strategy that…

Computation and Language · Computer Science 2024-03-26 Yining Huang , Keke Tang , Meilian Chen

Reading comprehension is a challenging task in natural language processing and requires a set of skills to be solved. While current approaches focus on solving the task as a whole, in this paper, we propose to use a neural network `skill'…

Computation and Language · Computer Science 2017-11-13 Todor Mihaylov , Zornitsa Kozareva , Anette Frank
‹ Prev 1 2 3 10 Next ›