English
Related papers

Related papers: Closed Form Word Embedding Alignment

200 papers

Word embedding models offer continuous vector representations that can capture rich contextual semantics based on their word co-occurrence patterns. While these word vectors can provide very effective features used in many NLP tasks such as…

Computation and Language · Computer Science 2017-02-27 Cem Safak Sahin , Rajmonda S. Caceres , Brandon Oselio , William M. Campbell

Ensembling word embeddings to improve distributed word representations has shown good success for natural language processing tasks in recent years. These approaches either carry out straightforward mathematical operations over a set of…

Computation and Language · Computer Science 2018-08-14 James O' Neill , Danushka Bollegala

A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and…

Computation and Language · Computer Science 2019-12-20 Robyn Speer , Joshua Chin

Word embedding systems such as Word2Vec and GloVe are well-known in deep learning approaches to NLP. This is largely due to their ability to capture semantic relationships between words. In this work we investigated their usefulness in…

Computation and Language · Computer Science 2022-04-15 Hosein Rezaei

Linguistic similarity is multi-faceted. For instance, two words may be similar with respect to semantics, syntax, or morphology inter alia. Continuous word-embeddings have been shown to capture most of these shades of similarity to some…

Computation and Language · Computer Science 2019-07-05 Ryan Cotterell , Hinrich Schütze

Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning…

Computation and Language · Computer Science 2020-05-11 Martina Toshevska , Frosina Stojanovska , Jovan Kalajdjieski

Word embeddings learnt from large corpora have been adopted in various applications in natural language processing and served as the general input representations to learning systems. Recently, a series of post-processing methods have been…

Machine Learning · Computer Science 2019-10-25 Shuai Tang , Mahta Mousavi , Virginia R. de Sa

One of the most important problems in machine translation (MT) evaluation is to evaluate the similarity between translation hypotheses with different surface forms from the reference, especially at the segment level. We propose to use word…

Computation and Language · Computer Science 2017-04-04 Junki Matsuo , Mamoru Komachi , Katsuhito Sudoh

Neural embedding approaches have become a staple in the fields of computer vision, natural language processing, and more recently, graph analytics. Given the pervasive nature of these algorithms, the natural question becomes how to exploit…

Computation and Language · Computer Science 2020-10-27 Alexander Kalinowski , Yuan An

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an…

Computation and Language · Computer Science 2018-07-11 Vincent Major , Alisa Surkis , Yindalon Aphinyanaphongs

In this paper we introduce a word embedding composition method based on the intuitive idea that a fair embedding representation for a given set of words should satisfy that the new vector will be at the same distance of the vector…

Computation and Language · Computer Science 2024-06-18 Roberto Santana , Mauricio Romero Sicre

Word alignment over parallel corpora has a wide variety of applications, including learning translation lexicons, cross-lingual transfer of language processing tools, and automatic evaluation or analysis of translation outputs. The great…

Computation and Language · Computer Science 2021-08-13 Zi-Yi Dou , Graham Neubig

Word translation is an integral part of language translation. In machine translation, each language is considered a domain with its own word embedding. The alignment between word embeddings allows linking semantically equivalent words in…

Computation and Language · Computer Science 2020-06-23 Antonio H. O. Fonseca , David van Dijk

Word2vec is one of the most used algorithms to generate word embeddings because of a good mix of efficiency, quality of the generated representations and cognitive grounding. However, word meaning is not static and depends on the context in…

Artificial Intelligence · Computer Science 2020-04-15 Federico Bianchi , Valerio Di Carlo , Paolo Nicoli , Matteo Palmonari

Ontology matching is a core task when creating interoperable and linked open datasets. In this paper, we explore a novel structure-based mapping approach which is based on knowledge graph embeddings: The ontologies to be matched are…

Artificial Intelligence · Computer Science 2022-04-11 Jan Portisch , Guilherme Costa , Karolin Stefani , Katharina Kreplin , Michael Hladik , Heiko Paulheim

Word embedding, which encodes words into vectors, is an important starting point in natural language processing and commonly used in many text-based machine learning tasks. However, in most current word embedding approaches, the similarity…

Computation and Language · Computer Science 2018-12-27 Denis Sedov , Zhirong Yang

Uncontextualized word embeddings are reliable feature representations of words used to obtain high quality results for various NLP applications. Given the historical success of word embeddings in NLP, we propose a retrospective on some of…

Computation and Language · Computer Science 2019-12-02 Edward Newell , Kian Kenyon-Dean , Jackie Chi Kit Cheung

Word vector representations open up new opportunities to extract useful information from unstructured text. Defining a word as a vector made it easy for the machine learning algorithms to understand a text and extract information from. Word…

Computation and Language · Computer Science 2021-05-19 Mohammed Ibrahim , Susan Gauch , Tyler Gerth , Brandon Cox

Owing to the rapidly growing multimedia content available on the Internet, extractive spoken document summarization, with the purpose of automatically selecting a set of representative sentences from a spoken document to concisely express…

Computation and Language · Computer Science 2015-06-16 Kuan-Yu Chen , Shih-Hung Liu , Hsin-Min Wang , Berlin Chen , Hsin-Hsi Chen

Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal. The lion's share of the methods are…

Computation and Language · Computer Science 2020-09-03 Magdalena Biesialska , Marta R. Costa-jussà
‹ Prev 1 2 3 10 Next ›