Related papers: Multi-Sense Language Modelling

Patterns of Lexical Ambiguity in Contextualised Language Models

One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts. In this paper we investigate the extent to which the contextualised…

Computation and Language · Computer Science 2021-09-30 Janosch Haber, Massimo Poesio

PolyLM: Learning about Polysemy through Language Modeling

To avoid the "meaning conflation deficiency" of word embeddings, a number of models have aimed to embed individual word senses. These methods at one time performed well on tasks such as word sense induction (WSI), but they have since been…

Computation and Language · Computer Science 2021-01-27 Alan Ansell, Felipe Bravo-Marquez, Bernhard Pfahringer

Evaluating a Multi-sense Definition Generation Model for Multiple Languages

Most prior work on definition modeling has not accounted for polysemy, or has done so by considering definition modeling for a target word in a given context. In contrast, in this study, we propose a context-agnostic approach to definition…

Computation and Language · Computer Science 2020-06-16 Arman Kabiri, Paul Cook

Learning to Represent Words in Context with Multilingual Supervision

We present a neural network architecture based on bidirectional LSTMs to compute representations of words in the sentential contexts. These context-sensitive word representations are suitable for, e.g., distinguishing different word senses…

Computation and Language · Computer Science 2015-11-23 Kazuya Kawakami, Chris Dyer

Shades of meaning: Uncovering the geometry of ambiguous word representations through contextualised language models

Lexical ambiguity presents a profound and enduring challenge to the language sciences. Researchers for decades have grappled with the problem of how language users learn, represent and process words with more than one meaning. Our work…

Computation and Language · Computer Science 2023-04-27 Benedetta Cevoli, Chris Watkins, Yang Gao, Kathleen Rastle

A Classification Approach to Word Prediction

The eventual goal of a language model is to accurately predict the value of a missing word given its context. We present an approach to word prediction that is based on learning a representation for each word as a function of words and…

Computation and Language · Computer Science 2007-05-23 Yair Even-Zohar, Dan Roth

Modelling Lexical Ambiguity with Density Matrices

Words can have multiple senses. Compositional distributional models of meaning have been argued to deal well with finer shades of meaning variation known as polysemy, but are not so well equipped to handle word senses that are…

Computation and Language · Computer Science 2020-10-13 Francois Meyer, Martha Lewis

A Simple and Efficient Method To Generate Word Sense Representations

Distributed representations of words have boosted the performance of many Natural Language Processing tasks. However, usually only one representation per word is obtained, not acknowledging the fact that some words have multiple meanings.…

Computation and Language · Computer Science 2016-02-22 Luis Nieto Piña, Richard Johansson

Neural Lattice Language Models

In this work, we propose a new language modeling paradigm that has the ability to perform both prediction and moderation of information flow at multiple granularities: neural lattice language models. These models construct a lattice of…

Computation and Language · Computer Science 2018-03-15 Jacob Buckman, Graham Neubig

Jointly Learning to Label Sentences and Tokens

Learning to construct text representations in end-to-end systems can be difficult, as natural languages are highly compositional and task-specific annotated datasets are often limited in size. Methods for directly supervising language…

Computation and Language · Computer Science 2018-11-15 Marek Rei, Anders Søgaard

Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages

Multilingual language models have recently gained attention as a promising solution for representing multiple languages in a single model. In this paper, we propose new criteria to evaluate the quality of lexical representation and…

Computation and Language · Computer Science 2023-05-30 Tomasz Limisiewicz, Jiří Balhar, David Mareček

Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings

Contextualized word embeddings in language models have given much advance to NLP. Intuitively, sentential information is integrated into the representation of words, which can help model polysemy. However, context sensitivity also leads to…

Computation and Language · Computer Science 2022-08-23 Yile Wang, Yue Zhang

Many Languages, One Parser

We train one multilingual model for dependency parsing and use it to parse sentences in several languages. The parsing model uses (i) multilingual word clusters and embeddings; (ii) token-level language information; and (iii)…

Computation and Language · Computer Science 2016-07-27 Waleed Ammar, George Mulcaire, Miguel Ballesteros, Chris Dyer, Noah A. Smith

Benchmarking Concept-Spilling Across Languages in LLMs

Multilingual Large Language Models (LLMs) exhibit remarkable cross-lingual abilities, yet often exhibit a systematic bias toward the representations from other languages, resulting in semantic interference when generating content in…

Computation and Language · Computer Science 2026-01-21 Ilia Badanin, Daniil Dzenhaliou, Imanol Schlag

Contextualized word senses: from attention to compositionality

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models

Polysemy and synonymy are two crucial interrelated facets of lexical ambiguity. While both phenomena are widely documented in lexical resources and have been studied extensively in NLP, leading to dedicated systems, they are often being…

Computation and Language · Computer Science 2025-11-10 Bastien Liétard, Pascal Denis, Mikaela Keller

Contextual modulation of language comprehension in a dynamic neural model of lexical meaning

We computationally implement and experimentally test the behavioral predictions of a dynamic neural model of lexical meaning in the framework of Dynamic Field Theory. We demonstrate the architecture and behavior of the model using as a test…

Computation and Language · Computer Science 2025-09-18 Michael C. Stern, Maria M. Piñango

Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation

Previous researches have shown that learning multiple representations for polysemous words can improve the performance of word embeddings on many tasks. However, this leads to another problem. Several vectors of a word may actually point to…

Computation and Language · Computer Science 2017-01-09 Haoyue Shi, Caihua Li, Junfeng Hu

Multilingual Word Embeddings using Multigraphs

We present a family of neural-network-inspired models for computing continuous word representations, specifically designed to exploit both monolingual and multilingual text. This framework allows us to perform unsupervised training of…

Computation and Language · Computer Science 2016-12-15 Radu Soricut, Nan Ding

Systematic word meta-sense extension

The meaning of polysemous words often varies in a highly productive yet predictable way. Generalizing the regularity between conventional senses to derive novel word meaning is crucial for automated processing of non-literal language uses…

Computation and Language · Computer Science 2023-11-23 Lei Yu