Related papers: Task-Specific Dependency-based Word Embedding Meth…

A Deeper Look into Dependency-Based Word Embeddings

We investigate the effect of various dependency-based word embeddings on distinguishing between functional and domain similarity, word similarity rankings, and two downstream tasks in English. Variations include word embeddings trained…

Computation and Language · Computer Science 2018-04-18 Sean MacAvaney , Amir Zeldes

Task-Oriented Learning of Word Embeddings for Semantic Relation Classification

We present a novel learning method for word embeddings designed for relation classification. Our word embeddings are trained by predicting words between noun pairs using lexical relation-specific features on a large unlabeled corpus. This…

Computation and Language · Computer Science 2015-06-23 Kazuma Hashimoto , Pontus Stenetorp , Makoto Miwa , Yoshimasa Tsuruoka

Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an…

Computation and Language · Computer Science 2018-07-11 Vincent Major , Alisa Surkis , Yindalon Aphinyanaphongs

Inductive Document Network Embedding with Topic-Word Attention

Document network embedding aims at learning representations for a structured text corpus i.e. when documents are linked to each other. Recent algorithms extend network embedding approaches by incorporating the text content associated with…

Machine Learning · Computer Science 2020-01-13 Robin Brochier , Adrien Guille , Julien Velcin

Learning Domain-Specific Word Embeddings from Sparse Cybersecurity Texts

Word embedding is a Natural Language Processing (NLP) technique that automatically maps words from a vocabulary to vectors of real numbers in an embedding space. It has been widely used in recent years to boost the performance of a vari-ety…

Computation and Language · Computer Science 2017-09-25 Arpita Roy , Youngja Park , SHimei Pan

Relevance-based Word Embedding

Learning a high-dimensional dense representation for vocabulary terms, also known as a word embedding, has recently attracted much attention in natural language processing and information retrieval tasks. The embedding vectors are typically…

Information Retrieval · Computer Science 2017-07-18 Hamed Zamani , W. Bruce Croft

Word Embeddings via Tensor Factorization

Most popular word embedding techniques involve implicit or explicit factorization of a word co-occurrence based matrix into low rank factors. In this paper, we aim to generalize this trend by using numerical methods to factor higher-order…

Machine Learning · Statistics 2017-09-19 Eric Bailey , Shuchin Aeron

Bilingual Distributed Word Representations from Document-Aligned Comparable Data

We propose a new model for learning bilingual word representations from non-parallel document-aligned data. Following the recent advances in word representation learning, our model learns dense real-valued word vectors, that is, bilingual…

Computation and Language · Computer Science 2016-03-01 Ivan Vulić , Marie-Francine Moens

Learning Efficient Task-Specific Meta-Embeddings with Word Prisms

Word embeddings are trained to predict word cooccurrence statistics, which leads them to possess different lexical properties (syntactic, semantic, etc.) depending on the notion of context defined at training time. These properties manifest…

Computation and Language · Computer Science 2020-11-06 Jingyi He , KC Tsiolis , Kian Kenyon-Dean , Jackie Chi Kit Cheung

Learning Word Embeddings from Intrinsic and Extrinsic Views

While word embeddings are currently predominant for natural language processing, most of existing models learn them solely from their contexts. However, these context-based word embeddings are limited since not all words' meaning can be…

Computation and Language · Computer Science 2016-08-23 Jifan Chen , Kan Chen , Xipeng Qiu , Qi Zhang , Xuanjing Huang , Zheng Zhang

Theme-weighted Ranking of Keywords from Text Documents using Phrase Embeddings

Keyword extraction is a fundamental task in natural language processing that facilitates mapping of documents to a concise set of representative single and multi-word phrases. Keywords from text documents are primarily extracted using…

Computation and Language · Computer Science 2018-07-17 Debanjan Mahata , John Kuriakose , Rajiv Ratn Shah , Roger Zimmermann , John R. Talburt

Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

Models of acoustic word embeddings (AWEs) learn to map variable-length spoken word segments onto fixed-dimensionality vector representations such that different acoustic exemplars of the same word are projected nearby in the embedding…

Computation and Language · Computer Science 2022-09-20 Badr M. Abdullah , Bernd Möbius , Dietrich Klakow

Word Embeddings and Their Use In Sentence Classification Tasks

This paper have two parts. In the first part we discuss word embeddings. We discuss the need for them, some of the methods to create them, and some of their interesting properties. We also compare them to image embeddings and see how word…

Machine Learning · Computer Science 2016-10-27 Amit Mandelbaum , Adi Shalev

Fusing Vector Space Models for Domain-Specific Applications

We address the problem of tuning word embeddings for specific use cases and domains. We propose a new method that automatically combines multiple domain-specific embeddings, selected from a wide range of pre-trained domain-specific…

Computation and Language · Computer Science 2019-09-06 Laura Rettig , Julien Audiffren , Philippe Cudré-Mauroux

Intrinsic analysis for dual word embedding space models

Recent word embeddings techniques represent words in a continuous vector space, moving away from the atomic and sparse representations of the past. Each such technique can further create multiple varieties of embeddings based on different…

Computation and Language · Computer Science 2020-12-08 Mohit Mayank

Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction

In this paper, we develop a novel approach to aspect term extraction based on unsupervised learning of distributed representations of words and dependency paths. The basic idea is to connect two words (w1 and w2) with the dependency path…

Computation and Language · Computer Science 2016-05-26 Yichun Yin , Furu Wei , Li Dong , Kaimeng Xu , Ming Zhang , Ming Zhou

Mapping Unseen Words to Task-Trained Embedding Spaces

We consider the supervised training setting in which we learn task-specific word embeddings. We assume that we start with initial embeddings learned from unlabelled data and update them to learn task-specific embeddings for words in the…

Computation and Language · Computer Science 2016-06-24 Pranava Swaroop Madhyastha , Mohit Bansal , Kevin Gimpel , Karen Livescu

Word-Level Loss Extensions for Neural Temporal Relation Classification

Unsupervised pre-trained word embeddings are used effectively for many tasks in natural language processing to leverage unlabeled textual data. Often these embeddings are either used as initializations or as fixed word representations for…

Computation and Language · Computer Science 2018-08-08 Artuur Leeuwenberg , Marie-Francine Moens

Substitute Based SCODE Word Embeddings in Supervised NLP Tasks

We analyze a word embedding method in supervised tasks. It maps words on a sphere such that words co-occurring in similar contexts lie closely. The similarity of contexts is measured by the distribution of substitutes that can fill them. We…

Computation and Language · Computer Science 2014-07-28 Volkan Cirik , Deniz Yuret

Good, Better, Best: Choosing Word Embedding Context

We propose two methods of learning vector representations of words and phrases that each combine sentence context with structural features extracted from dependency trees. Using several variations of neural network classifier, we show that…

Computation and Language · Computer Science 2015-11-20 James Cross , Bing Xiang , Bowen Zhou