Related papers: Sequentially Generated Instance-Dependent Image Re…

Adaptive Scene Category Discovery with Generative Learning and Compositional Sampling

This paper investigates a general framework to discover categories of unlabeled scene images according to their appearances (i.e., textures and structures). We jointly solve the two coupled tasks in an unsupervised manner: (i) classifying…

Computer Vision and Pattern Recognition · Computer Science 2015-02-03 Liang Lin , Ruimao Zhang , Xiaohua Duan

Discriminative models for robust image classification

A variety of real-world tasks involve the classification of images into pre-determined categories. Designing image classification algorithms that exhibit robustness to acquisition noise and image distortions, particularly when the available…

Machine Learning · Statistics 2016-03-10 Umamahesh Srinivas

Approximate Bayesian inference in spatial environments

Model-based approaches bear great promise for decision making of agents interacting with the physical world. In the context of spatial environments, different types of problems such as localisation, mapping, navigation or autonomous…

Machine Learning · Statistics 2019-06-21 Atanas Mirchev , Baris Kayalibay , Maximilian Soelch , Patrick van der Smagt , Justin Bayer

Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data

Conditional image generation is effective for diverse tasks including training data synthesis for learning-based computer vision. However, despite the recent advances in generative adversarial networks (GANs), it is still a challenging task…

Computer Vision and Pattern Recognition · Computer Science 2018-11-30 Yutaro Miyauchi , Yusuke Sugano , Yasuyuki Matsushita

Generative Interventions for Causal Learning

We introduce a framework for learning robust visual representations that generalize to new viewpoints, backgrounds, and scene contexts. Discriminative models often learn naturally occurring spurious correlations, which cause them to fail on…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Chengzhi Mao , Augustine Cha , Amogh Gupta , Hao Wang , Junfeng Yang , Carl Vondrick

Convolutional Pose Machines

Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning…

Computer Vision and Pattern Recognition · Computer Science 2016-04-13 Shih-En Wei , Varun Ramakrishna , Takeo Kanade , Yaser Sheikh

Scalable Fine-grained Generated Image Classification Based on Deep Metric Learning

Recently, generated images could reach very high quality, even human eyes could not tell them apart from real images. Although there are already some methods for detecting generated images in current forensic community, most of these…

Computer Vision and Pattern Recognition · Computer Science 2019-12-25 Xinsheng Xuan , Bo Peng , Wei Wang , Jing Dong

Differentiable Scene Graphs

Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Moshiko Raboh , Roei Herzig , Gal Chechik , Jonathan Berant , Amir Globerson

Region-Adaptive Generative Compression with Spatially Varying Diffusion Models

Generative image codecs aim to optimize perceptual quality, producing realistic and detailed reconstructions. However, they often overlook a key property of human vision: our tendency to focus on particular aspects of a visual scene (e.g.,…

Image and Video Processing · Electrical Eng. & Systems 2026-04-02 Lucas Relic , Roberto Azevedo , Yang Zhang , Stephan Mandt , Markus Gross , Christopher Schroers

Learning Rich Representations For Structured Visual Prediction Tasks

We describe an approach to learning rich representations for images, that enables simple and effective predictors in a range of vision tasks involving spatially structured maps. Our key idea is to map small image elements to feature…

Computer Vision and Pattern Recognition · Computer Science 2019-09-02 Mohammadreza Mostajabi

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff , Raphaël Lopez Kaufman , Rishabh Kabra , Nick Watters , Chris Burgess , Daniel Zoran , Loic Matthey , Matthew Botvinick , Alexander Lerchner

Learning Deep Representations for Semantic Image Parsing: a Comprehensive Overview

Semantic image parsing, which refers to the process of decomposing images into semantic regions and constructing the structure representation of the input, has recently aroused widespread interest in the field of computer vision. The recent…

Computer Vision and Pattern Recognition · Computer Science 2018-10-11 Lili Huang , Jiefeng Peng , Ruimao Zhang , Guanbin Li , Liang Lin

Generative Image Modeling Using Spatial LSTMs

Modeling the distribution of natural images is challenging, partly because of strong statistical dependencies which can extend over hundreds of pixels. Recurrent neural networks have been successful in capturing long-range dependencies in a…

Machine Learning · Statistics 2015-09-21 Lucas Theis , Matthias Bethge

Tagger: Deep Unsupervised Perceptual Grouping

We present a framework for efficient perceptual inference that explicitly reasons about the segmentation of its inputs and features. Rather than being trained for any specific segmentation, our framework learns the grouping process in an…

Computer Vision and Pattern Recognition · Computer Science 2016-11-29 Klaus Greff , Antti Rasmus , Mathias Berglund , Tele Hotloo Hao , Jürgen Schmidhuber , Harri Valpola

Compositional Scene Understanding through Inverse Generative Modeling

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang , Justin Dauwels , Yilun Du

Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

Recognizing multiple labels of images is a fundamental but challenging task in computer vision, and remarkable progress has been attained by localizing semantic-aware image regions and predicting their labels with deep convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Tianshui Chen , Zhouxia Wang , Guanbin Li , Liang Lin

Superpixel Contracted Graph-Based Learning for Hyperspectral Image Classification

A central problem in hyperspectral image classification is obtaining high classification accuracy when using a limited amount of labelled data. In this paper we present a novel graph-based framework, which aims to tackle this problem in the…

Computer Vision and Pattern Recognition · Computer Science 2020-06-24 Philip Sellars , Angelica Aviles-Rivero , Carola-Bibiane Schönlieb

Classify and Generate: Using Classification Latent Space Representations for Image Generations

Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are…

Machine Learning · Computer Science 2021-12-15 Saisubramaniam Gopalakrishnan , Pranshu Ranjan Singh , Yasin Yazici , Chuan-Sheng Foo , Vijay Chandrasekhar , ArulMurugan Ambikapathi

Efficient Region-Based Image Querying

Retrieving images from large and varied repositories using visual contents has been one of major research items, but a challenging task in the image management community. In this paper we present an efficient approach for region-based image…

Computer Vision and Pattern Recognition · Computer Science 2010-06-24 S. Sadek , A. Al-Hamadi , B. Michaelis , U. Sayed

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur