Related papers: Generative Visual Rationales

Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks

Deep neural networks have become the default choice for many applications like image and video recognition, segmentation and other image and video related tasks.However, a critical challenge with these models is the lack of…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Sunil Kumar Vengalil , Neelam Sinha

Learning Generalized Medical Image Representations through Image-Graph Contrastive Pretraining

Medical image interpretation using deep learning has shown promise but often requires extensive expert-annotated datasets. To reduce this annotation burden, we develop an Image-Graph Contrastive Learning framework that pairs chest X-rays…

Image and Video Processing · Electrical Eng. & Systems 2024-05-17 Sameer Khanna , Daniel Michael , Marinka Zitnik , Pranav Rajpurkar

Latent Feature Representation via Unsupervised Learning for Pattern Discovery in Massive Electron Microscopy Image Volumes

We propose a method to facilitate exploration and analysis of new large data sets. In particular, we give an unsupervised deep learning approach to learning a latent representation that captures semantic similarity in the data set. The core…

Computer Vision and Pattern Recognition · Computer Science 2020-12-23 Gary B Huang , Huei-Fang Yang , Shin-ya Takemura , Pat Rivlin , Stephen M Plaza

Unsupervised Real Image Super-Resolution via Generative Variational AutoEncoder

Benefited from the deep learning, image Super-Resolution has been one of the most developing research fields in computer vision. Depending upon whether using a discriminator or not, a deep convolutional neural network can provide an image…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Zhi-Song Liu , Wan-Chi Siu , Li-Wen Wang , Chu-Tak Li , Marie-Paule Cani , Yui-Lam Chan

A Deep Generative Model for Graph Layout

Different layouts can characterize different aspects of the same graph. Finding a "good" layout of a graph is thus an important task for graph visualization. In practice, users often visualize a graph in multiple layouts by using different…

Social and Information Networks · Computer Science 2019-10-16 Oh-Hyun Kwon , Kwan-Liu Ma

Inferencing Based on Unsupervised Learning of Disentangled Representations

Combining Generative Adversarial Networks (GANs) with encoders that learn to encode data points has shown promising results in learning data representations in an unsupervised way. We propose a framework that combines an encoder and a…

Computer Vision and Pattern Recognition · Computer Science 2018-03-08 Tobias Hinz , Stefan Wermter

Interpretable Mammographic Image Classification using Case-Based Reasoning and Deep Learning

When we deploy machine learning models in high-stakes medical settings, we must ensure these models make accurate predictions that are consistent with known medical science. Inherently interpretable networks address this need by explaining…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Alina Jade Barnett , Fides Regina Schwartz , Chaofan Tao , Chaofan Chen , Yinhao Ren , Joseph Y. Lo , Cynthia Rudin

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

We introduce DatasetGAN: an automatic procedure to generate massive datasets of high-quality semantically segmented images requiring minimal human effort. Current deep networks are extremely data-hungry, benefiting from training on…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Yuxuan Zhang , Huan Ling , Jun Gao , Kangxue Yin , Jean-Francois Lafleche , Adela Barriuso , Antonio Torralba , Sanja Fidler

Unsupervised learning for concept detection in medical images: a comparative analysis

As digital medical imaging becomes more prevalent and archives increase in size, representation learning exposes an interesting opportunity for enhanced medical decision support systems. On the other hand, medical imaging data is often…

Computer Vision and Pattern Recognition · Computer Science 2018-07-25 Eduardo Pinho , Carlos Costa

Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers

In the era of generative AI, deep generative models (DGMs) with latent representations have gained tremendous popularity. Despite their impressive empirical performance, the statistical properties of these models remain underexplored. DGMs…

Machine Learning · Statistics 2025-08-07 Seunghyun Lee , Yuqi Gu

Reconstructing Interpretable Features in Computational Super-Resolution microscopy via Regularized Latent Search

Supervised deep learning approaches can artificially increase the resolution of microscopy images by learning a mapping between two image resolutions or modalities. However, such methods often require a large set of hard-to-get…

Image and Video Processing · Electrical Eng. & Systems 2024-11-20 Marzieh Gheisari , Auguste Genovesio

Fine-Grained Visual Entailment

Visual entailment is a recently proposed multimodal reasoning task where the goal is to predict the logical relationship of a piece of text to an image. In this paper, we propose an extension of this task, where the goal is to predict the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Christopher Thomas , Yipeng Zhang , Shih-Fu Chang

Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention

Interpretability is an important property for visual models as it helps researchers and users understand the internal mechanism of a complex model. However, generating semantic explanations about the learned representation is challenging…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Yu Yang , Seungbae Kim , Jungseock Joo

Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning

Deep generative models have significantly advanced medical imaging analysis by enhancing dataset size and quality. Beyond mere data augmentation, our research in this paper highlights an additional, significant capacity of deep generative…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Xiaodan Xing , Junzhi Ning , Yang Nan , Guang Yang

Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization

Training deep networks with limited labeled data while achieving a strong generalization ability is key in the quest to reduce human annotation efforts. This is the goal of semi-supervised learning, which exploits more widely available…

Computer Vision and Pattern Recognition · Computer Science 2021-04-14 Daiqing Li , Junlin Yang , Karsten Kreis , Antonio Torralba , Sanja Fidler

Representation Disentanglement for Multi-task Learning with application to Fetal Ultrasound

One of the biggest challenges for deep learning algorithms in medical image analysis is the indiscriminate mixing of image properties, e.g. artifacts and anatomy. These entangled image properties lead to a semantically redundant feature…

Machine Learning · Computer Science 2019-08-22 Qingjie Meng , Nick Pawlowski , Daniel Rueckert , Bernhard Kainz

Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan , Mateen Ulhaq , Hyomin Choi , Fabien Racape

Orientation-Disentangled Unsupervised Representation Learning for Computational Pathology

Unsupervised learning enables modeling complex images without the need for annotations. The representation learned by such models can facilitate any subsequent analysis of large image datasets. However, some generative factors that cause…

Image and Video Processing · Electrical Eng. & Systems 2020-08-27 Maxime W. Lafarge , Josien P. W. Pluim , Mitko Veta

Discovering interpretable models of scientific image data with deep learning

How can we find interpretable, domain-appropriate models of natural phenomena given some complex, raw data such as images? Can we use such models to derive scientific insight from the data? In this paper, we propose some methods for…

Machine Learning · Computer Science 2024-02-06 Christopher J. Soelistyo , Alan R. Lowe

Learning Disentangled Representations with Reference-Based Variational Autoencoders

Learning disentangled representations from visual data, where different high-level generative factors are independently encoded, is of importance for many computer vision tasks. Solving this problem, however, typically requires to…

Computer Vision and Pattern Recognition · Computer Science 2019-01-25 Adria Ruiz , Oriol Martinez , Xavier Binefa , Jakob Verbeek