Related papers: High-Dimensional Vector Semantics

Exploring Sentence Vector Spaces through Automatic Summarization

Given vector representations for individual words, it is necessary to compute vector representations of sentences for many applications in a compositional manner, often using artificial neural networks. Relatively little work has explored…

Computation and Language · Computer Science 2018-10-18 Adly Templeton , Jugal Kalita

Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection

One of the long-standing challenges in lexical semantics consists in learning representations of words which reflect their semantic properties. The remarkable success of word embeddings for this purpose suggests that high-quality…

Computation and Language · Computer Science 2021-06-16 Yixiao Wang , Zied Bouraoui , Luis Espinosa Anke , Steven Schockaert

Context Vectors are Reflections of Word Vectors in Half the Dimensions

This paper takes a step towards theoretical analysis of the relationship between word embeddings and context embeddings in models such as word2vec. We start from basic probabilistic assumptions on the nature of word vectors, context…

Machine Learning · Statistics 2019-02-27 Zhenisbek Assylbekov , Rustem Takhanov

Random vector generation of a semantic space

We show how random vectors and random projection can be implemented in the usual vector space model to construct a Euclidean semantic space from a French synonym dictionary. We evaluate theoretically the resulting noise and show the…

Computation and Language · Computer Science 2017-03-08 Jean-François Delpech , Sabine Ploux

Representing Sets as Summed Semantic Vectors

Representing meaning in the form of high dimensional vectors is a common and powerful tool in biologically inspired architectures. While the meaning of a set of concepts can be summarized by taking a (possibly weighted) sum of their…

Artificial Intelligence · Computer Science 2018-09-25 Douglas Summers-Stay , Peter Sutor , Dandan Li

Sentence Analogies: Exploring Linguistic Relationships and Regularities in Sentence Embeddings

While important properties of word vector representations have been studied extensively, far less is known about the properties of sentence vector representations. Word vectors are often evaluated by assessing to what degree they exhibit…

Computation and Language · Computer Science 2020-03-10 Xunjie Zhu , Gerard de Melo

Word Embedding Visualization Via Dictionary Learning

Co-occurrence statistics based word embedding techniques have proved to be very useful in extracting the semantic and syntactic representation of words as low dimensional continuous vectors. In this work, we discovered that dictionary…

Computation and Language · Computer Science 2021-03-16 Juexiao Zhang , Yubei Chen , Brian Cheung , Bruno A Olshausen

Semantic projection: recovering human knowledge of multiple, distinct object features from word embeddings

The words of a language reflect the structure of the human mind, allowing us to transmit thoughts between individuals. However, language can represent only a subset of our rich and detailed cognitive architecture. Here, we ask what kinds of…

Computation and Language · Computer Science 2018-03-07 Gabriel Grand , Idan Asher Blank , Francisco Pereira , Evelina Fedorenko

A short survey on almost orthogonal vectors in a few specific large dimensions

The concept of \emph{almost orthogonal vectors}, i.e.\ vectors whose cosine similarity is close to $0$, relates to topics both in pure mathematics and in coding theory under the guises of spherical packing and spherical codes. In recent…

Metric Geometry · Mathematics 2025-10-29 Rami Luisto

Low-dimensional Semantic Space: from Text to Word Embedding

This article focuses on the study of Word Embedding, a feature-learning technique in Natural Language Processing that maps words or phrases to low-dimensional vectors. Beginning with the linguistic theories concerning contextual…

Computation and Language · Computer Science 2019-11-05 Xiaolei Lu , Bin Ni

Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines

Vector representations and vector space modeling (VSM) play a central role in modern machine learning. We propose a novel approach to `vector similarity searching' over dense semantic representations of words and documents that can be…

Information Retrieval · Computer Science 2017-06-06 Jan Rygl , Jan Pomikálek , Radim Řehůřek , Michal Růžička , Vít Novotný , Petr Sojka

Retrofitting Word Vectors to Semantic Lexicons

Vector space word representations are learned from distributional information of words in large corpora. Although such statistics are semantically informative, they disregard the valuable information that is contained in semantic lexicons…

Computation and Language · Computer Science 2015-03-24 Manaal Faruqui , Jesse Dodge , Sujay K. Jauhar , Chris Dyer , Eduard Hovy , Noah A. Smith

Using Holographically Compressed Embeddings in Question Answering

Word vector representations are central to deep learning natural language processing models. Many forms of these vectors, known as embeddings, exist, including word2vec and GloVe. Embeddings are trained on large corpora and learn the word's…

Computation and Language · Computer Science 2020-07-16 Salvador E. Barbosa

Evaluating vector-space models of analogy

Vector-space representations provide geometric tools for reasoning about the similarity of a set of objects and their relationships. Recent machine learning methods for deriving vector-space embeddings of words (e.g., word2vec) have…

Computation and Language · Computer Science 2017-06-12 Dawn Chen , Joshua C. Peterson , Thomas L. Griffiths

Rehabilitation of Count-based Models for Word Vector Representations

Recent works on word representations mostly rely on predictive models. Distributed word representations (aka word embeddings) are trained to optimally predict the contexts in which the corresponding words tend to appear. Such models have…

Computation and Language · Computer Science 2015-04-10 Rémi Lebret , Ronan Collobert

Semantic Sentence Embeddings for Paraphrasing and Text Summarization

This paper introduces a sentence to vector encoding framework suitable for advanced natural language processing. Our latent representation is shown to encode sentences with common semantic information with similar vector representations.…

Computation and Language · Computer Science 2018-09-30 Chi Zhang , Shagan Sah , Thang Nguyen , Dheeraj Peri , Alexander Loui , Carl Salvaggio , Raymond Ptucha

Learning Mathematical Properties of Integers

Embedding words in high-dimensional vector spaces has proven valuable in many natural language applications. In this work, we investigate whether similarly-trained embeddings of integers can capture concepts that are useful for mathematical…

Computation and Language · Computer Science 2021-09-16 Maria Ryskina , Kevin Knight

Efficient Estimation of Word Representations in Vector Space

We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity task, and the results are compared to the…

Computation and Language · Computer Science 2013-09-10 Tomas Mikolov , Kai Chen , Greg Corrado , Jeffrey Dean

Consistent Alignment of Word Embedding Models

Word embedding models offer continuous vector representations that can capture rich contextual semantics based on their word co-occurrence patterns. While these word vectors can provide very effective features used in many NLP tasks such as…

Computation and Language · Computer Science 2017-02-27 Cem Safak Sahin , Rajmonda S. Caceres , Brandon Oselio , William M. Campbell

A Vector Space for Distributional Semantics for Entailment

Distributional semantics creates vector-space representations that capture many forms of semantic similarity, but their relation to semantic entailment has been less clear. We propose a vector-space model which provides a formal foundation…

Computation and Language · Computer Science 2016-07-14 James Henderson , Diana Nicoleta Popa