Related papers: What do Language Representations Really Represent?

Multilingual Distributed Representations without Word Alignment

Distributed representations of meaning are a natural way to encode covariance relationships between words and phrases in NLP. By overcoming data sparsity problems, as well as providing information about semantic relatedness which is not…

Computation and Language · Computer Science 2014-03-21 Karl Moritz Hermann , Phil Blunsom

Distributed Representations for Compositional Semantics

The mathematical representation of semantics is a key issue for Natural Language Processing (NLP). A lot of research has been devoted to finding ways of representing the semantics of individual words in vector spaces. Distributional…

Computation and Language · Computer Science 2014-11-13 Karl Moritz Hermann

Emerging Language Spaces Learned From Massively Multilingual Corpora

Translations capture important information about languages that can be used as implicit supervision in learning linguistic properties and semantic representations. In an information-centric view, translated texts may be considered as…

Computation and Language · Computer Science 2018-02-02 Jörg Tiedemann

Joint Word Representation Learning using a Corpus and a Semantic Lexicon

Methods for learning word representations using large text corpora have received much attention lately due to their impressive performance in numerous natural language processing (NLP) tasks such as, semantic similarity measurement, and…

Computation and Language · Computer Science 2015-11-23 Danushka Bollegala , Alsuhaibani Mohammed , Takanori Maehara , Ken-ichi Kawarabayashi

Limitations of Cross-Lingual Learning from Image Search

Cross-lingual representation learning is an important step in making NLP scale to all the world's languages. Recent work on bilingual lexicon induction suggests that it is possible to learn cross-lingual representations of words based on…

Computation and Language · Computer Science 2017-09-19 Mareike Hartmann , Anders Soegaard

On the Linguistic Representational Power of Neural Machine Translation Models

Despite the recent success of deep neural networks in natural language processing (NLP), their interpretability remains a challenge. We analyze the representations learned by neural machine translation models at various levels of…

Computation and Language · Computer Science 2019-11-04 Yonatan Belinkov , Nadir Durrani , Fahim Dalvi , Hassan Sajjad , James Glass

Training Models on Dialects of Translationese Shows How Lexical Diversity and Source-Target Syntactic Similarity Shape Learning

Machine-translated data is widely used in multilingual NLP, particularly when native text is scarce. However, translated text differs systematically from native text. This phenomenon is known as translationese, and it reflects both traces…

Computation and Language · Computer Science 2026-02-19 Jenny Kunz

Semantic Representations of Word Senses and Concepts

Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most…

Computation and Language · Computer Science 2016-08-04 José Camacho-Collados , Ignacio Iacobacci , Roberto Navigli , Mohammad Taher Pilehvar

Low-Dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

How related are the representations learned by neural language models, translation models, and language tagging tasks? We answer this question by adapting an encoder-decoder transfer learning method from computer vision to investigate the…

Computation and Language · Computer Science 2025-12-11 Richard Antonello , Javier Turek , Vy Vo , Alexander Huth

On the Acquisition of Shared Grammatical Representations in Bilingual Language Models

Crosslingual transfer is crucial to contemporary language models' multilingual capabilities, but how it occurs is not well understood. We ask what happens to a monolingual language model when it begins to be trained on a second language.…

Computation and Language · Computer Science 2025-06-05 Catherine Arnett , Tyler A. Chang , James A. Michaelov , Benjamin K. Bergen

Visualizing Linguistic Shift

Neural network based models are a very powerful tool for creating word embeddings, the objective of these models is to group similar words together. These embeddings have been used as features to improve results in various applications such…

Computation and Language · Computer Science 2016-11-27 Salman Mahmood , Rami Al-Rfou , Klaus Mueller

Not All Neural Embeddings are Born Equal

Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform…

Computation and Language · Computer Science 2014-11-14 Felix Hill , KyungHyun Cho , Sebastien Jean , Coline Devin , Yoshua Bengio

Continuous multilinguality with language vectors

Most existing models for multilingual natural language processing (NLP) treat language as a discrete category, and make predictions for either one language or the other. In contrast, we propose using continuous vector representations of…

Computation and Language · Computer Science 2017-03-21 Robert Östling , Jörg Tiedemann

Embedding Word Similarity with Neural Machine Translation

Neural language models learn word representations, or embeddings, that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models, a recently-developed class of neural…

Computation and Language · Computer Science 2015-04-06 Felix Hill , Kyunghyun Cho , Sebastien Jean , Coline Devin , Yoshua Bengio

Learning Visual Representations via Language-Guided Sampling

Although an object may appear in numerous contexts, we often describe it in a limited number of ways. Language allows us to abstract away visual variation to represent and communicate concepts. Building on this intuition, we propose an…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Mohamed El Banani , Karan Desai , Justin Johnson

Learning Language Representations for Typology Prediction

One central mystery of neural NLP is what neural models "know" about their subject matter. When a neural machine translation system learns to translate from one language to another, does it learn the syntax or semantics of the languages?…

Computation and Language · Computer Science 2017-08-01 Chaitanya Malaviya , Graham Neubig , Patrick Littell

Language Embeddings Sometimes Contain Typological Generalizations

To what extent can neural network models learn generalizations about language structure, and how do we find out what they have learned? We explore these questions by training neural models for a range of natural language processing tasks on…

Computation and Language · Computer Science 2023-01-20 Robert Östling , Murathan Kurfalı

Discovering Representation Sprachbund For Multilingual Pre-Training

Multilingual pre-trained models have demonstrated their effectiveness in many multilingual NLP tasks and enabled zero-shot or few-shot transfer from high-resource languages to low resource ones. However, due to significant typological…

Computation and Language · Computer Science 2021-09-02 Yimin Fan , Yaobo Liang , Alexandre Muzio , Hany Hassan , Houqiang Li , Ming Zhou , Nan Duan

Brains and language models converge on a shared conceptual space across different languages

Human languages differ widely in their forms, each having distinct sounds, scripts, and syntax. Yet, they can all convey similar meaning. Do different languages converge on a shared neural substrate for conceptual meaning? We used language…

Neurons and Cognition · Quantitative Biology 2025-06-26 Zaid Zada , Samuel A Nastase , Jixing Li , Uri Hasson

Learning Word Representations from Relational Graphs

Attributes of words and relations between two words are central to numerous tasks in Artificial Intelligence such as knowledge representation, similarity measurement, and analogy detection. Often when two words share one or more attributes…

Computation and Language · Computer Science 2014-12-09 Danushka Bollegala , Takanori Maehara , Yuichi Yoshida , Ken-ichi Kawarabayashi