English
Related papers

Related papers: Fusing Vector Space Models for Domain-Specific App…

200 papers

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an…

Computation and Language · Computer Science 2018-07-11 Vincent Major , Alisa Surkis , Yindalon Aphinyanaphongs

Word embeddings are widely used in Natural Language Processing, mainly due to their success in capturing semantic information from massive corpora. However, their creation process does not allow the different meanings of a word to be…

Computation and Language · Computer Science 2017-06-22 Massimiliano Mancini , Jose Camacho-Collados , Ignacio Iacobacci , Roberto Navigli

Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the…

Computation and Language · Computer Science 2018-05-15 Prathusha K Sarma , YIngyu Liang , William A Sethares

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears, acts as an additional contextual variable during the fine tuning…

Computation and Language · Computer Science 2019-12-17 Amr Al-Khatib , Samhaa R. El-Beltagy

Sentence embeddings enable us to capture the semantic similarity of short texts. Most sentence embedding models are trained for general semantic textual similarity tasks. Therefore, to use sentence embeddings in a particular domain, the…

Computation and Language · Computer Science 2023-09-26 Tim Schopf , Dennis N. Schneider , Florian Matthes

The meaning of a word often varies depending on its usage in different domains. The standard word embedding models struggle to represent this variation, as they learn a single global representation for a word. We propose a method to learn…

Computation and Language · Computer Science 2019-10-22 Lahari Poddar , Gyorgy Szarvas , Lea Frermann

This paper proposes a way to improve the performance of existing algorithms for text classification in domains with strong language semantics. We propose a domain adaptation layer learns weights to combine a generic and a domain specific…

Information Retrieval · Computer Science 2019-08-20 Prathusha K Sarma , Yingyu Liang , William A Sethares

As a fundamental task in natural language processing, word embedding converts each word into a representation in a vector space. A challenge with word embedding is that as the vocabulary grows, the vector space's dimension increases, which…

Computation and Language · Computer Science 2024-11-05 Jintang Xue , Yun-Cheng Wang , Chengwei Wei , C. -C. Jay Kuo

Word embeddings have been widely used in sentiment classification because of their efficacy for semantic representations of words. Given reviews from different domains, some existing methods for word embeddings exploit sentiment…

Computation and Language · Computer Science 2018-05-11 Bei Shi , Zihao Fu , Lidong Bing , Wai Lam

Conventional word embeddings represent words with fixed vectors, which are usually trained based on co-occurrence patterns among words. In doing so, however, the power of such representations is limited, where the same word might be…

Computation and Language · Computer Science 2020-01-10 Hongming Zhang , Jiaxin Bai , Yan Song , Kun Xu , Changlong Yu , Yangqiu Song , Wilfred Ng , Dong Yu

There are two main approaches to the distributed representation of words: low-dimensional deep learning embeddings and high-dimensional distributional models, in which each dimension corresponds to a context word. In this paper, we combine…

Computation and Language · Computer Science 2014-02-19 Irina Sergienya , Hinrich Schütze

Semantic similarity measures are an important part in Natural Language Processing tasks. However Semantic similarity measures built for general use do not perform well within specific domains. Therefore in this study we introduce a domain…

Computation and Language · Computer Science 2019-06-07 Keet Sugathadasa , Buddhi Ayesha , Nisansa de Silva , Amal Shehan Perera , Vindula Jayawardana , Dimuthu Lakmal , Madhavi Perera

Word embedding is a Natural Language Processing (NLP) technique that automatically maps words from a vocabulary to vectors of real numbers in an embedding space. It has been widely used in recent years to boost the performance of a vari-ety…

Computation and Language · Computer Science 2017-09-25 Arpita Roy , Youngja Park , SHimei Pan

We investigate the problem of inducing word embeddings that are tailored for a particular bilexical relation. Our learning algorithm takes an existing lexical vector space and compresses it such that the resulting word embeddings are good…

Computation and Language · Computer Science 2015-04-13 Pranava Swaroop Madhyastha , Xavier Carreras , Ariadna Quattoni

Complementary to finding good general word embeddings, an important question for representation learning is to find dynamic word embeddings, e.g., across time or domain. Current methods do not offer a way to use or predict information on…

Computation and Language · Computer Science 2022-10-12 Stephanie Brandl , David Lassner , Anne Baillot , Shinichi Nakajima

Learning high-quality domain word embeddings is important for achieving good performance in many NLP tasks. General-purpose embeddings trained on large-scale corpora are often sub-optimal for domain-specific applications. However,…

Computation and Language · Computer Science 2018-05-28 Hu Xu , Bing Liu , Lei Shu , Philip S. Yu

Word embeddings are traditionally trained on a large corpus in an unsupervised setting, with no specific design for incorporating domain knowledge. This can lead to unsatisfactory performances when training data originate from heterogeneous…

Computation and Language · Computer Science 2019-06-24 Guoyin Wang , Yan Song , Yue Zhang , Dong Yu

This paper have two parts. In the first part we discuss word embeddings. We discuss the need for them, some of the methods to create them, and some of their interesting properties. We also compare them to image embeddings and see how word…

Machine Learning · Computer Science 2016-10-27 Amit Mandelbaum , Adi Shalev

Word embeddings -- distributed representations of words -- in deep learning are beneficial for many tasks in natural language processing (NLP). However, different embedding sets vary greatly in quality and characteristics of the captured…

Computation and Language · Computer Science 2015-12-31 Wenpeng Yin , Hinrich Schütze

Word embeddings have gained significant attention as learnable representations of semantic relations between words, and have been shown to improve upon the results of traditional word representations. However, little effort has been devoted…

Information Retrieval · Computer Science 2019-05-23 Gloria Feher , Andreas Spitz , Michael Gertz
‹ Prev 1 2 3 10 Next ›