Related papers: An Iterative Contextualization Algorithm with Seco…

Contextualized word senses: from attention to compositionality

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

Second-Order Word Embeddings from Nearest Neighbor Topological Features

We introduce second-order vector representations of words, induced from nearest neighborhood topological features in pre-trained contextual word embeddings. We then analyze the effects of using second-order embeddings as input features in…

Computation and Language · Computer Science 2017-05-25 Denis Newman-Griffis , Eric Fosler-Lussier

Joint Embedding of Words and Labels for Text Classification

Word embeddings are effective intermediate representations for capturing semantic regularities between words, when learning the representations of text sequences. We propose to view text classification as a label-word joint embedding…

Computation and Language · Computer Science 2018-05-14 Guoyin Wang , Chunyuan Li , Wenlin Wang , Yizhe Zhang , Dinghan Shen , Xinyuan Zhang , Ricardo Henao , Lawrence Carin

Boost Image Captioning with Knowledge Reasoning

Automatically generating a human-like description for a given image is a potential research in artificial intelligence, which has attracted a great of attention recently. Most of the existing attention methods explore the mapping…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Feicheng Huang , Zhixin Li , Haiyang Wei , Canlong Zhang , Huifang Ma

Image Captioning with Semantic Attention

Automatically generating a natural language description of an image has attracted interests recently both because of its importance in practical applications and because it connects two major artificial intelligence fields: computer vision…

Computer Vision and Pattern Recognition · Computer Science 2016-03-15 Quanzeng You , Hailin Jin , Zhaowen Wang , Chen Fang , Jiebo Luo

Tiered Clustering to Improve Lexical Entailment

Many tasks in Natural Language Processing involve recognizing lexical entailment. Two different approaches to this problem have been proposed recently that are quite different from each other. The first is an asymmetric similarity measure…

Computation and Language · Computer Science 2014-12-03 John Wieting

Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts

Learning high-quality embeddings for rare words is a hard problem because of sparse context information. Mimicking (Pinter et al., 2017) has been proposed as a solution: given embeddings learned by a standard algorithm, a model is first…

Computation and Language · Computer Science 2019-04-08 Timo Schick , Hinrich Schütze

Encoding word order in complex embeddings

Sequential word order is important when processing text. Currently, neural networks (NNs) address this by modeling word position using position embeddings. The problem is that position embeddings capture the position of individual words,…

Computation and Language · Computer Science 2020-06-30 Benyou Wang , Donghao Zhao , Christina Lioma , Qiuchi Li , Peng Zhang , Jakob Grue Simonsen

Probing Contextualized Sentence Representations with Visual Awareness

We present a universal framework to model contextualized sentence representations with visual awareness that is motivated to overcome the shortcomings of the multimodal parallel data with manual annotations. For each sentence, we first…

Computation and Language · Computer Science 2019-11-12 Zhuosheng Zhang , Rui Wang , Kehai Chen , Masao Utiyama , Eiichiro Sumita , Hai Zhao

Guiding Attention using Partial-Order Relationships for Image Captioning

The use of attention models for automated image captioning has enabled many systems to produce accurate and meaningful descriptions for images. Over the years, many novel approaches have been proposed to enhance the attention process using…

Computer Vision and Pattern Recognition · Computer Science 2022-04-18 Murad Popattia , Muhammad Rafi , Rizwan Qureshi , Shah Nawaz

A Structured Self-attentive Sentence Embedding

This paper proposes a new model for extracting an interpretable sentence embedding by introducing self-attention. Instead of using a vector, we use a 2-D matrix to represent the embedding, with each row of the matrix attending on a…

Computation and Language · Computer Science 2017-03-10 Zhouhan Lin , Minwei Feng , Cicero Nogueira dos Santos , Mo Yu , Bing Xiang , Bowen Zhou , Yoshua Bengio

A Simple and Effective Approach for Fine Tuning Pre-trained Word Embeddings for Improved Text Classification

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears, acts as an additional contextual variable during the fine tuning…

Computation and Language · Computer Science 2019-12-17 Amr Al-Khatib , Samhaa R. El-Beltagy

Optimal synthesis embeddings

In this paper we introduce a word embedding composition method based on the intuitive idea that a fair embedding representation for a given set of words should satisfy that the new vector will be at the same distance of the vector…

Computation and Language · Computer Science 2024-06-18 Roberto Santana , Mauricio Romero Sicre

Neural Text Classification by Jointly Learning to Cluster and Align

Distributional text clustering delivers semantically informative representations and captures the relevance between each word and semantic clustering centroids. We extend the neural text clustering approach to text classification tasks by…

Computation and Language · Computer Science 2020-11-25 Yekun Chai , Haidong Zhang , Shuo Jin

Learning Sentence Representation with Guidance of Human Attention

Recently, much progress has been made in learning general-purpose sentence representations that can be used across domains. However, most of the existing models typically treat each word in a sentence equally. In contrast, extensive studies…

Computation and Language · Computer Science 2017-05-10 Shaonan Wang , Jiajun Zhang , Chengqing Zong

Context Matters: Recovering Human Semantic Structure from Machine Learning Analysis of Large-Scale Text Corpora

Applying machine learning algorithms to large-scale, text-based corpora (embeddings) presents a unique opportunity to investigate at scale how human semantic knowledge is organized and how people use it to judge fundamental relationships,…

Computation and Language · Computer Science 2020-07-17 Marius Cătălin Iordan , Tyler Giallanza , Cameron T. Ellis , Nicole M. Beckage , Jonathan D. Cohen

Topological Sort for Sentence Ordering

Sentence ordering is the task of arranging the sentences of a given text in the correct order. Recent work using deep neural networks for this task has framed it as a sequence prediction problem. In this paper, we propose a new framing of…

Computation and Language · Computer Science 2020-05-04 Shrimai Prabhumoye , Ruslan Salakhutdinov , Alan W Black

Word Embeddings and Their Use In Sentence Classification Tasks

This paper have two parts. In the first part we discuss word embeddings. We discuss the need for them, some of the methods to create them, and some of their interesting properties. We also compare them to image embeddings and see how word…

Machine Learning · Computer Science 2016-10-27 Amit Mandelbaum , Adi Shalev

IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval

Enabling bi-directional retrieval of images and texts is important for understanding the correspondence between vision and language. Existing methods leverage the attention mechanism to explore such correspondence in a fine-grained manner.…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Hui Chen , Guiguang Ding , Xudong Liu , Zijia Lin , Ji Liu , Jungong Han

Context is Key: New Approaches to Neural Coherence Modeling

We formulate coherence modeling as a regression task and propose two novel methods to combine techniques from our setup with pairwise approaches. The first of our methods is a model that we call "first-next," which operates similarly to…

Computation and Language · Computer Science 2018-12-13 David McClure , Shayne O'Brien , Deb Roy