Related papers: Subword ELMo

Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents

User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this paper, we explore techniques to efficiently transfer the knowledge from these unlabeled utterances to improve model performance on Spoken…

Computation and Language · Computer Science 2018-11-14 Aditya Siddhant , Anuj Goyal , Angeliki Metallinou

SCELMo: Source Code Embeddings from Language Models

Continuous embeddings of tokens in computer programs have been used to support a variety of software development tools, including readability, code search, and program repair. Contextual embeddings are common in natural language processing…

Software Engineering · Computer Science 2020-04-29 Rafael - Michael Karampatsis , Charles Sutton

Multimodal Embeddings from Language Models

Word embeddings such as ELMo have recently been shown to model word semantics with greater efficacy through contextualized learning on large-scale language corpora, resulting in significant improvement in state of the art across many…

Computation and Language · Computer Science 2019-09-11 Shao-Yen Tseng , Panayiotis Georgiou , Shrikanth Narayanan

Probing Biomedical Embeddings from Language Models

Contextualized word embeddings derived from pre-trained language models (LMs) show significant improvements on downstream NLP tasks. Pre-training on domain-specific corpora, such as biomedical articles, further improves their performance.…

Computation and Language · Computer Science 2019-04-05 Qiao Jin , Bhuwan Dhingra , William W. Cohen , Xinghua Lu

Masked ELMo: An evolution of ELMo towards fully contextual RNN language models

This paper presents Masked ELMo, a new RNN-based model for language model pre-training, evolved from the ELMo language model. Contrary to ELMo which only uses independent left-to-right and right-to-left contexts, Masked ELMo learns fully…

Computation and Language · Computer Science 2020-10-12 Gregory Senay , Emmanuelle Salin

Entity-aware ELMo: Learning Contextual Entity Representation for Entity Disambiguation

We present a new local entity disambiguation system. The key to our system is a novel approach for learning entity representations. In our approach we learn an entity aware extension of Embedding for Language Model (ELMo) which we call…

Computation and Language · Computer Science 2019-08-23 Hamed Shahbazi , Xiaoli Z. Fern , Reza Ghaeini , Rasha Obeidat , Prasad Tadepalli

ECLM: Entity Level Language Model for Spoken Language Understanding with Chain of Intent

Large Language Models (LLMs) have demonstrated impressive capabilities in language generation and general task performance. However, their application to spoken language understanding (SLU) remains challenging, particularly for token-level…

Computation and Language · Computer Science 2025-10-09 Shangjian Yin , Peijie Huang , Jiatian Chen , Haojing Huang , Yuhong Xu

Alternative Weighting Schemes for ELMo Embeddings

ELMo embeddings (Peters et. al, 2018) had a huge impact on the NLP community and may recent publications use these embeddings to boost the performance for downstream NLP tasks. However, integration of ELMo embeddings in existent NLP…

Computation and Language · Computer Science 2019-04-08 Nils Reimers , Iryna Gurevych

Language (Re)modelling: Towards Embodied Language Understanding

While natural language understanding (NLU) is advancing rapidly, today's technology differs from human-like language understanding in fundamental ways, notably in its inferior efficiency, interpretability, and generalization. This work…

Computation and Language · Computer Science 2020-07-10 Ronen Tamari , Chen Shani , Tom Hope , Miriam R. L. Petruck , Omri Abend , Dafna Shahaf

Cross-lingual alignments of ELMo contextual embeddings

Building machine learning prediction models for a specific NLP task requires sufficient training data, which can be difficult to obtain for less-resourced languages. Cross-lingual embeddings map word embeddings from a less-resourced…

Computation and Language · Computer Science 2022-06-01 Matej Ulčar , Marko Robnik-Šikonja

EmoSLLM: Parameter-Efficient Adaptation of LLMs for Speech Emotion Recognition

Emotion recognition from speech is a challenging task that requires capturing both linguistic and paralinguistic cues, with critical applications in human-computer interaction and mental health monitoring. Recent works have highlighted the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-21 Hugo Thimonier , Antony Perzo , Renaud Seguier

Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling

Natural language understanding has recently seen a surge of progress with the use of sentence encoders like ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2019) which are pretrained on variants of language modeling. We conduct the…

Computation and Language · Computer Science 2019-07-24 Alex Wang , Jan Hula , Patrick Xia , Raghavendra Pappagari , R. Thomas McCoy , Roma Patel , Najoung Kim , Ian Tenney , Yinghui Huang , Katherin Yu , Shuning Jin , Berlin Chen , Benjamin Van Durme , Edouard Grave , Ellie Pavlick , Samuel R. Bowman

What do you learn from context? Probing for sentence structure in contextualized word representations

Contextualized representation models such as ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2018) have recently achieved state-of-the-art results on a diverse array of downstream NLP tasks. Building on recent token-level probing work,…

Computation and Language · Computer Science 2019-05-16 Ian Tenney , Patrick Xia , Berlin Chen , Alex Wang , Adam Poliak , R Thomas McCoy , Najoung Kim , Benjamin Van Durme , Samuel R. Bowman , Dipanjan Das , Ellie Pavlick

Efficient Contextual Representation Learning Without Softmax Layer

Contextual representation models have achieved great success in improving various downstream tasks. However, these language-model-based encoders are difficult to train due to the large parameter sizes and high computational complexity. By…

Computation and Language · Computer Science 2019-03-01 Liunian Harold Li , Patrick H. Chen , Cho-Jui Hsieh , Kai-Wei Chang

Effective Subword Segmentation for Text Comprehension

Representation learning is the foundation of machine reading comprehension and inference. In state-of-the-art models, character-level representations have been broadly adopted to alleviate the problem of effectively representing rare or…

Computation and Language · Computer Science 2019-06-12 Zhuosheng Zhang , Hai Zhao , Kangwei Ling , Jiangtong Li , Zuchao Li , Shexia He , Guohong Fu

BERMo: What can BERT learn from ELMo?

We propose BERMo, an architectural modification to BERT, which makes predictions based on a hierarchy of surface, syntactic and semantic language features. We use linear combination scheme proposed in Embeddings from Language Models (ELMo)…

Computation and Language · Computer Science 2021-11-01 Sangamesh Kodge , Kaushik Roy

Designing and Interpreting Probes with Control Tasks

Probes, supervised models trained to predict properties (like parts-of-speech) from representations (like ELMo), have achieved high accuracy on a range of linguistic tasks. But does this mean that the representations encode linguistic…

Computation and Language · Computer Science 2019-09-10 John Hewitt , Percy Liang

Neural Named Entity Recognition from Subword Units

Named entity recognition (NER) is a vital task in spoken language understanding, which aims to identify mentions of named entities in text e.g., from transcribed speech. Existing neural models for NER rely mostly on dedicated word-level…

Computation and Language · Computer Science 2019-09-24 Abdalghani Abujabal , Judith Gaspers

Retrofitting Contextualized Word Embeddings with Paraphrases

Contextualized word embedding models, such as ELMo, generate meaningful representations of words and their context. These models have been shown to have a great impact on downstream applications. However, in many cases, the contextualized…

Computation and Language · Computer Science 2019-09-27 Weijia Shi , Muhao Chen , Pei Zhou , Kai-Wei Chang

Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis

Recent work using auxiliary prediction task classifiers to investigate the properties of LSTM representations has begun to shed light on why pretrained representations, like ELMo (Peters et al., 2018) and CoVe (McCann et al., 2017), are so…

Computation and Language · Computer Science 2019-01-08 Kelly W. Zhang , Samuel R. Bowman