Related papers: Semantics-aware BERT for Language Understanding

200 papers

Language representation models such as BERT could effectively capture contextual semantic information from plain text, and have been proved to achieve promising results in lots of downstream NLP tasks with appropriate fine-tuning. However,…

Computation and Language · Computer Science 2020-10-07 Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Peng Li, Maosong Sun, Zhiyuan Liu

The recently proposed BERT has shown great power on a variety of natural language understanding tasks, such as text classification, reading comprehension, etc. However, how to effectively apply BERT to neural machine translation (NMT) lacks…

Computation and Language · Computer Science 2020-02-18 Jinhua Zhu, Yingce Xia, Lijun Wu, Di He, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

Models based on large-pretrained language models, such as S(entence)BERT, provide effective and efficient sentence embeddings that show high correlation to human similarity ratings, but lack interpretability. On the other hand, graph…

Computation and Language · Computer Science 2025-10-17 Juri Opitz, Anette Frank

Models based on the transformer architecture, such as BERT, have marked a crucial step forward in the field of Natural Language Processing. Importantly, they allow the creation of word embeddings that capture important semantic information…

Computation and Language · Computer Science 2021-01-01 Jacob Turton, David Vinson, Robert Elliott Smith

Neural networks provide new possibilities to automatically learn complex language patterns and query-document relations. Neural IR models have achieved promising results in learning query-document relevance patterns, but few explorations…

Information Retrieval · Computer Science 2019-05-23 Zhuyun Dai, Jamie Callan

Transformer-based language models have taken many fields in NLP by storm. BERT and its derivatives dominate most of the existing evaluation benchmarks, including those for Word Sense Disambiguation (WSD), thanks to their ability in…

Computation and Language · Computer Science 2021-03-19 Daniel Loureiro, Kiamehr Rezaee, Mohammad Taher Pilehvar, Jose Camacho-Collados

Sentence embedding is an important research topic in natural language processing (NLP) since it can transfer knowledge to downstream tasks. Meanwhile, a contextualized word representation, called BERT, achieves the state-of-the-art…

Computation and Language · Computer Science 2020-06-02 Bin Wang, C.-C. Jay Kuo

Contextualized entity representations learned by state-of-the-art transformer-based language models (TLMs) like BERT, GPT, T5, etc., leverage the attention mechanism to learn the data context from training data corpus. However, these models…

Computation and Language · Computer Science 2021-09-06 Keyur Faldu, Amit Sheth, Prashant Kikani, Hemang Akbari

Contextualized embeddings such as BERT can serve as strong input representations to NLP tasks, outperforming their static embeddings counterparts such as skip-gram, CBOW and GloVe. However, such embeddings are dynamic, calculated according…

Computation and Language · Computer Science 2020-04-07 Yile Wang, Leyang Cui, Yue Zhang

Tremendous amounts of multimedia associated with speech information are driving an urgent need to develop efficient and effective automatic summarization methods. To this end, we have seen rapid progress in applying supervised deep neural…

Computation and Language · Computer Science 2020-06-03 Shi-Yan Weng, Tien-Hong Lo, Berlin Chen

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional…

Computation and Language · Computer Science 2019-05-28 Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

This paper presents new state-of-the-art models for three tasks, part-of-speech tagging, syntactic parsing, and semantic parsing, using the cutting-edge contextualized embedding framework known as BERT. For each task, we first replicate and…

Computation and Language · Computer Science 2020-05-26 Han He, Jinho D. Choi

Contextualized word embeddings, i.e. vector representations for words in context, are naturally seen as an extension of previous noncontextual distributional semantic models. In this work, we focus on BERT, a deep neural network that…

Computation and Language · Computer Science 2020-05-11 Timothee Mickus, Denis Paperno, Mathieu Constant, Kees van Deemter

Semantic parsing is the task of transforming sentences from natural language into formal representations of predicate-argument structures. Under this research area, frame-semantic parsing has attracted much interest. This parsing approach…

Computation and Language · Computer Science 2019-11-01 Sang-Sang Tan, Jin-Cheon Na

Contextual embeddings, such as ELMo and BERT, move beyond global word representations like Word2Vec and achieve ground-breaking performance on a wide range of natural language processing tasks. Contextual embeddings assign each word a…

Computation and Language · Computer Science 2020-04-14 Qi Liu, Matt J. Kusner, Phil Blunsom

Transformer architectures show significant promise for natural language processing. Given that a single pretrained model can be fine-tuned to perform well on many different tasks, these networks appear to extract generally useful linguistic…

Machine Learning · Computer Science 2019-10-29 Andy Coenen, Emily Reif, Ann Yuan, Been Kim, Adam Pearce, Fernanda Viégas, Martin Wattenberg

We present iBERT (interpretable-BERT), an encoder to produce inherently interpretable and controllable embeddings - designed to modularize and expose the discriminative cues present in language, such as semantic or stylistic structure. Each…

Computation and Language · Computer Science 2026-01-27 Vishal Anand, Milad Alshomary, Kathleen McKeown

Recent advances in natural language processing (NLP) have been driven by pretrained language models like BERT, RoBERTa, T5, and GPT. These models excel at understanding complex texts, but biomedical literature, with its domain-specific…

Computation and Language · Computer Science 2025-07-28 K. Sahit Reddy, N. Ragavenderan, Vasanth K., Ganesh N. Naik, Vishalakshi Prabhu, Nagaraja G. S

Adding linguistic information (syntax or semantics) to neural machine translation (NMT) has mostly focused on using point estimates from pre-trained models. Directly using the capacity of massive pre-trained contextual word embedding models…

Computation and Language · Computer Science 2021-04-08 Hassan S. Shavarani, Anoop Sarkar

The ability to learn from large unlabeled corpora has allowed neural language models to advance the frontier in natural language understanding. However, existing self-supervision techniques operate at the word form level, which serves as a…

Computation and Language · Computer Science 2020-05-19 Yoav Levine, Barak Lenz, Or Dagan, Ori Ram, Dan Padnos, Or Sharir, Shai Shalev-Shwartz, Amnon Shashua, Yoav Shoham