Related papers: Graph Sequence Learning for Premise Selection

Improving Image Captioning with Better Use of Captions

Image captioning is a multimodal problem that has drawn extensive attention in both the natural language processing and computer vision community. In this paper, we present a novel image captioning architecture to better explore semantics…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Zhan Shi , Xu Zhou , Xipeng Qiu , Xiaodan Zhu

Vector Learning for Cross Domain Representations

Recently, generative adversarial networks have gained a lot of popularity for image generation tasks. However, such models are associated with complex learning mechanisms and demand very large relevant datasets. This work borrows concepts…

Machine Learning · Computer Science 2018-09-28 Shagan Sah , Chi Zhang , Thang Nguyen , Dheeraj Kumar Peri , Ameya Shringi , Raymond Ptucha

Next Word Suggestion using Graph Neural Network

Language Modeling is a prevalent task in Natural Language Processing. The currently existing most recent and most successful language models often tend to build a massive model with billions of parameters, feed in a tremendous amount of…

Computation and Language · Computer Science 2025-05-16 Abisha Thapa Magar , Anup Shakya

Unsupervised Image Captioning

Deep neural networks have achieved great successes on the image captioning task. However, most of the existing models depend heavily on paired image-sentence datasets, which are very expensive to acquire. In this paper, we make the first…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Yang Feng , Lin Ma , Wei Liu , Jiebo Luo

Premise Selection for Theorem Proving by Deep Graph Embedding

We propose a deep learning-based approach to the problem of premise selection: selecting mathematical statements relevant for proving a given conjecture. We represent a higher-order logic formula as a graph that is invariant to variable…

Artificial Intelligence · Computer Science 2017-10-09 Mingzhe Wang , Yihe Tang , Jian Wang , Jia Deng

Image Representations and New Domains in Neural Image Captioning

We examine the possibility that recent promising results in automatic caption generation are due primarily to language models. By varying image representation quality produced by a convolutional neural network, we find that a…

Computation and Language · Computer Science 2015-08-11 Jack Hessel , Nicolas Savva , Michael J. Wilber

From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge

In this paper we propose the construction of linguistic descriptions of images. This is achieved through the extraction of scene description graphs (SDGs) from visual scenes using an automatically constructed knowledge base. SDGs are…

Computer Vision and Pattern Recognition · Computer Science 2015-11-12 Somak Aditya , Yezhou Yang , Chitta Baral , Cornelia Fermuller , Yiannis Aloimonos

Graph-based Neural Sentence Ordering

Sentence ordering is to restore the original paragraph from a set of sentences. It involves capturing global dependencies among sentences regardless of their input order. In this paper, we propose a novel and flexible graph-based neural…

Computation and Language · Computer Science 2019-12-17 Yongjing Yin , Linfeng Song , Jinsong Su , Jiali Zeng , Chulun Zhou , Jiebo Luo

ClipCap: CLIP Prefix for Image Captioning

Image captioning is a fundamental task in vision-language understanding, where the model predicts a textual informative caption to a given input image. In this paper, we present a simple approach to address this task. We use CLIP encoding…

Computer Vision and Pattern Recognition · Computer Science 2021-11-19 Ron Mokady , Amir Hertz , Amit H. Bermano

Bypass Network for Semantics Driven Image Paragraph Captioning

Image paragraph captioning aims to describe a given image with a sequence of coherent sentences. Most existing methods model the coherence through the topic transition that dynamically infers a topic vector from preceding sentences.…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Qi Zheng , Chaoyue Wang , Dadong Wang

Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons Learned

This paper focuses on enhancing the captions generated by image-caption generation systems. We propose an approach for improving caption generation systems by choosing the most closely related output to the image rather than the most likely…

Computation and Language · Computer Science 2023-07-10 Ahmed Sabir

Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech

Image captioning is an ambiguous problem, with many suitable captions for an image. To address ambiguity, beam search is the de facto method for sampling multiple captions. However, beam search is computationally expensive and known to…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Aditya Deshpande , Jyoti Aneja , Liwei Wang , Alexander Schwing , D. A. Forsyth

Self-Supervised Graph Representation Learning via Global Context Prediction

To take full advantage of fast-growing unlabeled networked data, this paper introduces a novel self-supervised strategy for graph representation learning by exploiting natural supervision provided by the data itself. Inspired by human…

Machine Learning · Computer Science 2025-11-20 Zhen Peng , Yixiang Dong , Minnan Luo , Xiao-Ming Wu , Qinghua Zheng

Graph-based Approach to Automatic Taxonomy Generation (GraBTax)

We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm, GraBTax, incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To…

Information Retrieval · Computer Science 2014-04-30 Pucktada Treeratpituk , Madian Khabsa , C. Lee Giles

Image Captions are Natural Prompts for Text-to-Image Models

With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become a common practice to train models on synthetic data due to data-scarcity and privacy leakage problems. Owing to massive and diverse information…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Shiye Lei , Hao Chen , Sen Zhang , Bo Zhao , Dacheng Tao

Learning a Recurrent Visual Representation for Image Caption Generation

In this paper we explore the bi-directional mapping between images and their sentence-based descriptions. We propose learning this mapping using a recurrent neural network. Unlike previous approaches that map both sentences and images to a…

Computer Vision and Pattern Recognition · Computer Science 2014-11-21 Xinlei Chen , C. Lawrence Zitnick

Figure Captioning with Reasoning and Sequence-Level Training

Figures, such as bar charts, pie charts, and line plots, are widely used to convey important information in a concise format. They are usually human-friendly but difficult for computers to process automatically. In this work, we investigate…

Computer Vision and Pattern Recognition · Computer Science 2019-06-10 Charles Chen , Ruiyi Zhang , Eunyee Koh , Sungchul Kim , Scott Cohen , Tong Yu , Ryan Rossi , Razvan Bunescu

Image Captioning with Compositional Neural Module Networks

In image captioning where fluency is an important factor in evaluation, e.g., $n$-gram metrics, sequential models are commonly used; however, sequential models generally result in overgeneralized expressions that lack the details that may…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Junjiao Tian , Jean Oh

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks

Recurrent Neural Networks can be trained to produce sequences of tokens given some input, as exemplified by recent results in machine translation and image captioning. The current approach to training them consists of maximizing the…

Machine Learning · Computer Science 2015-09-24 Samy Bengio , Oriol Vinyals , Navdeep Jaitly , Noam Shazeer

Modeling, comprehending and summarizing textual content by graphs

Automatic Text Summarization strategies have been successfully employed to digest text collections and extract its essential content. Usually, summaries are generated using textual corpora that belongs to the same domain area where the…

Computation and Language · Computer Science 2018-07-03 Vinicius Woloszyn , Guilherme Medeiros Machado , Leandro Krug Wives , José Palazzo Moreira de Oliveira