Related papers: Automatic Argumentative-Zoning Using Word2vec

Sentiment Analysis of Citations Using Word2vec

Citation sentiment analysis is an important task in scientific paper analysis. Existing machine learning techniques for citation sentiment analysis are focusing on labor-intensive feature engineering, which requires large annotated corpus.…

Computation and Language · Computer Science 2017-04-04 Haixia Liu

MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

Scientific publications follow conventionalized rhetorical structures. Classifying the Argumentative Zone (AZ), e.g., identifying whether a sentence states a Motivation, a Result or Background information, has been proposed to improve…

Computation and Language · Computer Science 2023-07-06 Timo Pierre Schrader , Teresa Bürkle , Sophie Henning , Sherry Tan , Matteo Finco , Stefan Grünewald , Maira Indrikova , Felix Hildebrand , Annemarie Friedrich

Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an…

Computation and Language · Computer Science 2018-07-11 Vincent Major , Alisa Surkis , Yindalon Aphinyanaphongs

Word Embeddings and Their Use In Sentence Classification Tasks

This paper have two parts. In the first part we discuss word embeddings. We discuss the need for them, some of the methods to create them, and some of their interesting properties. We also compare them to image embeddings and see how word…

Machine Learning · Computer Science 2016-10-27 Amit Mandelbaum , Adi Shalev

Exploring Sentence Vector Spaces through Automatic Summarization

Given vector representations for individual words, it is necessary to compute vector representations of sentences for many applications in a compositional manner, often using artificial neural networks. Relatively little work has explored…

Computation and Language · Computer Science 2018-10-18 Adly Templeton , Jugal Kalita

Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification

While paragraph embedding models are remarkably effective for downstream classification tasks, what they learn and encode into a single vector remains opaque. In this paper, we investigate a state-of-the-art paragraph embedding method…

Computation and Language · Computer Science 2019-06-11 Tu Vu , Mohit Iyyer

Word Embeddings for the Construction Domain

We introduce word vectors for the construction domain. Our vectors were obtained by running word2vec on an 11M-word corpus that we created from scratch by leveraging freely-accessible online sources of construction-related text. We first…

Computation and Language · Computer Science 2016-10-31 Antoine J. -P. Tixier , Michalis Vazirgiannis , Matthew R. Hallowell

Improved Semantic-Aware Network Embedding with Fine-Grained Word Alignment

Network embeddings, which learn low-dimensional representations for each vertex in a large-scale network, have received considerable attention in recent years. For a wide range of applications, vertices in a network are typically…

Computation and Language · Computer Science 2018-08-30 Dinghan Shen , Xinyuan Zhang , Ricardo Henao , Lawrence Carin

tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification

The use of background knowledge is largely unexploited in text classification tasks. This paper explores word taxonomies as means for constructing new semantic features, which may improve the performance and robustness of the learned…

Computation and Language · Computer Science 2020-12-01 Blaž Škrlj , Matej Martinc , Jan Kralj , Nada Lavrač , Senja Pollak

Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks

There is a lot of research interest in encoding variable length sentences into fixed length vectors, in a way that preserves the sentence meanings. Two common methods include representations based on averaging word vectors, and…

Computation and Language · Computer Science 2017-02-10 Yossi Adi , Einat Kermany , Yonatan Belinkov , Ofer Lavi , Yoav Goldberg

Dis-S2V: Discourse Informed Sen2Vec

Vector representation of sentences is important for many text processing tasks that involve clustering, classifying, or ranking sentences. Recently, distributed representation of sentences learned by neural models from unlabeled data has…

Computation and Language · Computer Science 2016-10-27 Tanay Kumar Saha , Shafiq Joty , Naeemul Hassan , Mohammad Al Hasan

Good, Better, Best: Choosing Word Embedding Context

We propose two methods of learning vector representations of words and phrases that each combine sentence context with structural features extracted from dependency trees. Using several variations of neural network classifier, we show that…

Computation and Language · Computer Science 2015-11-20 James Cross , Bing Xiang , Bowen Zhou

The Influence of Feature Representation of Text on the Performance of Document Classification

In this paper we perform a comparative analysis of three models for feature representation of text documents in the context of document classification. In particular, we consider the most often used family of models bag-of-words, recently…

Computation and Language · Computer Science 2017-07-06 Sanda Martinčić-Ipšić , Tanja Miličić , Ljupčo Todorovski

Bayesian Paragraph Vectors

Word2vec (Mikolov et al., 2013) has proven to be successful in natural language processing by capturing the semantic relationships between different words. Built on top of single-word embeddings, paragraph vectors (Le and Mikolov, 2014)…

Computation and Language · Computer Science 2017-12-11 Geng Ji , Robert Bamler , Erik B. Sudderth , Stephan Mandt

A Simple and Effective Approach for Fine Tuning Pre-trained Word Embeddings for Improved Text Classification

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears, acts as an additional contextual variable during the fine tuning…

Computation and Language · Computer Science 2019-12-17 Amr Al-Khatib , Samhaa R. El-Beltagy

Active Discriminative Text Representation Learning

We propose a new active learning (AL) method for text classification with convolutional neural networks (CNNs). In AL, one selects the instances to be manually labeled with the aim of maximizing model performance with minimal effort. Neural…

Computation and Language · Computer Science 2016-12-02 Ye Zhang , Matthew Lease , Byron C. Wallace

Gram2Vec: An Interpretable Document Vectorizer

We present Gram2Vec, a grammatical style embedding system that embeds documents into a higher dimensional space by extracting the normalized relative frequencies of grammatical features present in the text. Compared to neural approaches,…

Computation and Language · Computer Science 2025-11-27 Peter Zeng , Hannah Stortz , Eric Sclafani , Alina Shabaeva , Maria Elizabeth Garza , Daniel Greeson , Owen Rambow

Hash2Vec, Feature Hashing for Word Embeddings

In this paper we propose the application of feature hashing to create word embeddings for natural language processing. Feature hashing has been used successfully to create document vectors in related tasks like document classification. In…

Computation and Language · Computer Science 2017-04-18 Luis Argerich , Joaquín Torré Zaffaroni , Matías J Cano

Attribute CNNs for Word Spotting in Handwritten Documents

Word spotting has become a field of strong research interest in document image analysis over the last years. Recently, AttributeSVMs were proposed which predict a binary attribute representation. At their time, this influential method…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Sebastian Sudholt , Gernot Fink

Class Vectors: Embedding representation of Document Classes

Distributed representations of words and paragraphs as semantic embeddings in high dimensional data are used across a number of Natural Language Understanding tasks such as retrieval, translation, and classification. In this work, we…

Computation and Language · Computer Science 2015-08-04 Devendra Singh Sachan , Shailesh Kumar