Related papers: Tile2Vec: Unsupervised representation learning for…

voxel2vec: A Natural Language Processing Approach to Learning Distributed Representations for Scientific Data

Relationships in scientific data, such as the numerical and spatial distribution relations of features in univariate data, the scalar-value combinations' relations in multivariate data, and the association of volumes in time-varying and…

Machine Learning · Computer Science 2022-07-25 Xiangyang He , Yubo Tao , Shuoliu Yang , Haoran Dai , Hai Lin

Dis-S2V: Discourse Informed Sen2Vec

Vector representation of sentences is important for many text processing tasks that involve clustering, classifying, or ranking sentences. Recently, distributed representation of sentences learned by neural models from unlabeled data has…

Computation and Language · Computer Science 2016-10-27 Tanay Kumar Saha , Shafiq Joty , Naeemul Hassan , Mohammad Al Hasan

Topic2Vec: Learning Distributed Representations of Topics

Latent Dirichlet Allocation (LDA) mining thematic structure of documents plays an important role in nature language processing and machine learning areas. However, the probability distribution from LDA only describes the statistical…

Computation and Language · Computer Science 2015-06-30 Li-Qiang Niu , Xin-Yu Dai

S2Vec: Self-Supervised Geospatial Embeddings for the Built Environment

Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec…

Social and Information Networks · Computer Science 2026-01-08 Shushman Choudhury , Elad Aharoni , Chandrakumari Suvarna , Iveel Tsogsuren , Abdul Rahman Kreidieh , Chun-Ta Lu , Neha Arora

Points2Vec: Unsupervised Object-level Feature Learning from Point Clouds

Unsupervised representation learning techniques, such as learning word embeddings, have had a significant impact on the field of natural language processing. Similar representation learning techniques have not yet become commonplace in the…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Joël Bachmann , Kenneth Blomqvist , Julian Förster , Roland Siegwart

Plan2Vec: Unsupervised Representation Learning by Latent Plans

In this paper we introduce plan2vec, an unsupervised representation learning approach that is inspired by reinforcement learning. Plan2vec constructs a weighted graph on an image dataset using near-neighbor distances, and then extrapolates…

Machine Learning · Computer Science 2020-05-08 Ge Yang , Amy Zhang , Ari S. Morcos , Joelle Pineau , Pieter Abbeel , Roberto Calandra

Distributed Representation of Subgraphs

Network embeddings have become very popular in learning effective feature representations of networks. Motivated by the recent successes of embeddings in natural language processing, researchers have tried to find network embeddings in…

Social and Information Networks · Computer Science 2017-02-23 Bijaya Adhikari , Yao Zhang , Naren Ramakrishnan , B. Aditya Prakash

DoE2Vec: Deep-learning Based Features for Exploratory Landscape Analysis

We propose DoE2Vec, a variational autoencoder (VAE)-based methodology to learn optimization landscape characteristics for downstream meta-learning tasks, e.g., automated selection of optimization algorithms. Principally, using large…

Optimization and Control · Mathematics 2023-04-05 Bas van Stein , Fu Xing Long , Moritz Frenzel , Peter Krause , Markus Gitterle , Thomas Bäck

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind. To get us closer to general self-supervised…

Machine Learning · Computer Science 2022-10-27 Alexei Baevski , Wei-Ning Hsu , Qiantong Xu , Arun Babu , Jiatao Gu , Michael Auli

AttrE2vec: Unsupervised Attributed Edge Representation Learning

Representation learning has overcome the often arduous and manual featurization of networks through (unsupervised) feature learning as it results in embeddings that can apply to a variety of downstream learning tasks. The focus of…

Machine Learning · Computer Science 2021-01-01 Piotr Bielak , Tomasz Kajdanowicz , Nitesh V. Chawla

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

Current self-supervised learning algorithms are often modality-specific and require large amounts of computational resources. To address these issues, we increase the training efficiency of data2vec, a learning objective that generalizes…

Machine Learning · Computer Science 2023-06-16 Alexei Baevski , Arun Babu , Wei-Ning Hsu , Michael Auli

A Theoretical Analysis of Contrastive Unsupervised Representation Learning

Recent empirical works have successfully used unlabeled data to learn feature representations that are broadly useful in downstream classification tasks. Several of these methods are reminiscent of the well-known word2vec embedding…

Machine Learning · Computer Science 2019-02-26 Sanjeev Arora , Hrishikesh Khandeparkar , Mikhail Khodak , Orestis Plevrakis , Nikunj Saunshi

Towards a Theoretical Understanding of Word and Relation Representation

Representing words by vectors, or embeddings, enables computational reasoning and is foundational to automating natural language tasks. For example, if word embeddings of similar words contain similar values, word similarity can be readily…

Computation and Language · Computer Science 2022-02-02 Carl Allen

Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells

Unsupervised text encoding models have recently fueled substantial progress in NLP. The key idea is to use neural networks to convert words in texts to vector space representations based on word positions in a sentence and their contexts,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Gengchen Mai , Krzysztof Janowicz , Bo Yan , Rui Zhu , Ling Cai , Ni Lao

Can Network Embedding of Distributional Thesaurus be Combined with Word Vectors for Better Representation?

Distributed representations of words learned from text have proved to be successful in various natural language processing tasks in recent times. While some methods represent words as vectors computed from text using predictive model…

Computation and Language · Computer Science 2018-02-20 Abhik Jana , Pawan Goyal

Embedding Words in Non-Vector Space with Unsupervised Graph Learning

It has become a de-facto standard to represent words as elements of a vector space (word2vec, GloVe). While this approach is convenient, it is unnatural for language: words form a graph with a latent hierarchical structure, and this…

Computation and Language · Computer Science 2020-10-07 Max Ryabinin , Sergei Popov , Liudmila Prokhorenkova , Elena Voita

Representation Learning for Words and Entities

This thesis presents new methods for unsupervised learning of distributed representations of words and entities from text and knowledge bases. The first algorithm presented in the thesis is a multi-view algorithm for learning…

Computation and Language · Computer Science 2019-06-14 Pushpendre Rastogi

A Vector Space for Distributional Semantics for Entailment

Distributional semantics creates vector-space representations that capture many forms of semantic similarity, but their relation to semantic entailment has been less clear. We propose a vector-space model which provides a formal foundation…

Computation and Language · Computer Science 2016-07-14 James Henderson , Diana Nicoleta Popa

Top2Vec: Distributed Representations of Topics

Topic modeling is used for discovering latent semantic structure, usually referred to as topics, in a large collection of documents. The most widely used methods are Latent Dirichlet Allocation and Probabilistic Latent Semantic Analysis.…

Computation and Language · Computer Science 2020-08-24 Dimo Angelov

Learning Latent Vector Spaces for Product Search

We introduce a novel latent vector space model that jointly learns the latent representations of words, e-commerce products and a mapping between the two without the need for explicit annotations. The power of the model lies in its ability…

Information Retrieval · Computer Science 2016-08-26 Christophe Van Gysel , Maarten de Rijke , Evangelos Kanoulas