Related papers: Word Equations: Inherently Interpretable Sparse Wo…

Interpretable Neural Embeddings with Sparse Self-Representation

Interpretability benefits the theoretical understanding of representations. Existing word embeddings are generally dense representations. Hence, the meaning of latent dimensions is difficult to interpret. This makes word embeddings like a…

Computation and Language · Computer Science 2023-06-27 Minxue Xia , Hao Zhu

Evaluating Sparse Interpretable Word Embeddings for Biomedical Domain

Word embeddings have found their way into a wide range of natural language processing tasks including those in the biomedical domain. While these vector representations successfully capture semantic and syntactic word relations, hidden…

Computation and Language · Computer Science 2020-05-12 Mohammad Amin Samadi , Mohammad Sadegh Akhondzadeh , Sayed Jalal Zahabi , Mohammad Hossein Manshaei , Zeinab Maleki , Payman Adibi

Learning and Evaluating Sparse Interpretable Sentence Embeddings

Previous research on word embeddings has shown that sparse representations, which can be either learned on top of existing dense embeddings or obtained through model constraints during training time, have the benefit of increased…

Computation and Language · Computer Science 2018-09-26 Valentin Trifonov , Octavian-Eugen Ganea , Anna Potapenko , Thomas Hofmann

Semantic Structure and Interpretability of Word Embeddings

Dense word embeddings, which encode semantic meanings of words to low dimensional vector spaces have become very popular in natural language processing (NLP) research due to their state-of-the-art performances in many NLP tasks. Word…

Computation and Language · Computer Science 2018-07-20 Lutfi Kerem Senel , Ihsan Utlu , Veysel Yucesoy , Aykut Koc , Tolga Cukur

Imparting Interpretability to Word Embeddings while Preserving Semantic Structure

As an ubiquitous method in natural language processing, word embeddings are extensively employed to map semantic properties of words into a dense vector representation. They capture semantic and syntactic relations among words but the…

Computation and Language · Computer Science 2020-07-03 Lutfi Kerem Senel , Ihsan Utlu , Furkan Şahinuç , Haldun M. Ozaktas , Aykut Koç

Analyzing Structures in the Semantic Vector Space: A Framework for Decomposing Word Embeddings

Word embeddings are rich word representations, which in combination with deep neural networks, lead to large performance gains for many NLP tasks. However, word embeddings are represented by dense, real-valued vectors and they are therefore…

Computation and Language · Computer Science 2019-12-24 Andreas Hanselowski , Iryna Gurevych

SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus

Word embeddings are a basic building block of modern NLP pipelines. Efforts have been made to learn rich, efficient, and interpretable embeddings for large generic datasets available in the public domain. However, these embeddings have…

Computation and Language · Computer Science 2021-03-23 Rishabh Gupta , Rajesh N Rao

xSense: Learning Sense-Separated Sparse Representations and Textual Definitions for Explainable Word Sense Networks

Despite the success achieved on various natural language processing tasks, word embeddings are difficult to interpret due to the dense vector representations. This paper focuses on interpreting the embeddings for various aspects, including…

Computation and Language · Computer Science 2018-09-11 Ting-Yun Chang , Ta-Chung Chi , Shang-Chi Tsai , Yun-Nung Chen

Sparse Overcomplete Word Vector Representations

Current distributed representations of words show little resemblance to theories of lexical semantics. The former are dense and uninterpretable, the latter largely based on familiar, discrete classes (e.g., supersenses) and relations (e.g.,…

Computation and Language · Computer Science 2015-06-08 Manaal Faruqui , Yulia Tsvetkov , Dani Yogatama , Chris Dyer , Noah Smith

Supervised Understanding of Word Embeddings

Pre-trained word embeddings are widely used for transfer learning in natural language processing. The embeddings are continuous and distributed representations of the words that preserve their similarities in compact Euclidean spaces.…

Computation and Language · Computer Science 2020-06-25 Halid Ziya Yerebakan , Parmeet Bhatia , Yoshihisa Shinagawa

Interpretable Word Embeddings via Informative Priors

Word embeddings have demonstrated strong performance on NLP tasks. However, lack of interpretability and the unsupervised nature of word embeddings have limited their use within computational social science and digital humanities. We…

Computation and Language · Computer Science 2019-09-05 Miriam Hurtado Bodell , Martin Arvidsson , Måns Magnusson

Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

Distributional models provide a convenient way to model semantics using dense embedding spaces derived from unsupervised learning algorithms. However, the dimensions of dense embedding spaces are not designed to resemble human semantic…

Computation and Language · Computer Science 2018-11-15 Steven Derby , Paul Miller , Brian Murphy , Barry Devereux

SPINE: SParse Interpretable Neural Embeddings

Prediction without justification has limited utility. Much of the success of neural models can be attributed to their ability to learn rich, dense and expressive representations. While these representations capture the underlying complexity…

Computation and Language · Computer Science 2017-11-27 Anant Subramanian , Danish Pruthi , Harsh Jhamtani , Taylor Berg-Kirkpatrick , Eduard Hovy

Adjusting Interpretable Dimensions in Embedding Space with Human Judgments

Embedding spaces contain interpretable dimensions indicating gender, formality in style, or even object properties. This has been observed multiple times. Such interpretable dimensions are becoming valuable tools in different areas of…

Computation and Language · Computer Science 2024-04-04 Katrin Erk , Marianna Apidianaki

Interpretable Text Embeddings and Text Similarity Explanation: A Survey

Text embeddings are a fundamental component in many NLP tasks, including classification, regression, clustering, and semantic search. However, despite their ubiquitous application, challenges persist in interpreting embeddings and…

Computation and Language · Computer Science 2025-10-03 Juri Opitz , Lucas Möller , Andrianos Michail , Sebastian Padó , Simon Clematide

Learning Domain-Specific Word Embeddings from Sparse Cybersecurity Texts

Word embedding is a Natural Language Processing (NLP) technique that automatically maps words from a vocabulary to vectors of real numbers in an embedding space. It has been widely used in recent years to boost the performance of a vari-ety…

Computation and Language · Computer Science 2017-09-25 Arpita Roy , Youngja Park , SHimei Pan

Neural Random Projections for Language Modelling

Neural network-based language models deal with data sparsity problems by mapping the large discrete space of words into a smaller continuous space of real-valued vectors. By learning distributed vector representations for words, each…

Computation and Language · Computer Science 2018-09-27 Davide Nunes , Luis Antunes

Linear Algebraic Structure of Word Senses, with Applications to Polysemy

Word embeddings are ubiquitous in NLP and information retrieval, but it is unclear what they represent when the word is polysemous. Here it is shown that multiple word senses reside in linear superposition within the word embedding and…

Computation and Language · Computer Science 2018-12-10 Sanjeev Arora , Yuanzhi Li , Yingyu Liang , Tengyu Ma , Andrej Risteski

Learning Semantic Representations for Novel Words: Leveraging Both Form and Context

Word embeddings are a key component of high-performing natural language processing (NLP) systems, but it remains a challenge to learn good representations for novel words on the fly, i.e., for words that did not occur in the training data.…

Computation and Language · Computer Science 2018-11-12 Timo Schick , Hinrich Schütze

Word Shape Matters: Robust Machine Translation with Visual Embedding

Neural machine translation has achieved remarkable empirical performance over standard benchmark datasets, yet recent evidence suggests that the models can still fail easily dealing with substandard inputs such as misspelled words, To…

Computation and Language · Computer Science 2020-10-21 Haohan Wang , Peiyan Zhang , Eric P. Xing