English
Related papers

Related papers: Representation-Conditioned Diffusion Models for Gu…

200 papers

Denoising diffusion models have gained popularity as a generative modeling technique for producing high-quality and diverse images. Applying these models to downstream tasks requires conditioning, which can take the form of text, class…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Alexandros Graikos , Srikar Yellapragada , Dimitris Samaras

Latent diffusion models (LDMs) dominate high-quality image generation, yet integrating representation learning with generative modeling remains a challenge. We introduce a novel generative image modeling framework that seamlessly bridges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Theodoros Kouzelis , Efstathios Karypidis , Ioannis Kakogeorgiou , Spyros Gidaris , Nikos Komodakis

Joint machine learning models that allow synthesizing and classifying data often offer uneven performance between those tasks or are unstable to train. In this work, we depart from a set of empirical observations that indicate the…

Machine Learning · Computer Science 2023-04-06 Kamil Deja , Tomasz Trzcinski , Jakub M. Tomczak

To synthesize high-fidelity samples, diffusion models typically require auxiliary data to guide the generation process. However, it is impractical to procure the painstaking patch-level annotation effort required in specialized domains like…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Alexandros Graikos , Srikar Yellapragada , Minh-Quan Le , Saarthak Kapse , Prateek Prasanna , Joel Saltz , Dimitris Samaras

Diffusion probabilistic models have achieved enormous success in the field of image generation and manipulation. In this paper, we explore a novel paradigm of using the diffusion model and classifier guidance in the latent semantic space…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Changhao Shi , Haomiao Ni , Kai Li , Shaobo Han , Mingfu Liang , Martin Renqiang Min

Incorporating diffusion-generated synthetic data into adversarial training (AT) has been shown to substantially improve the training of robust image classifiers. In this work, we extend the role of diffusion models beyond merely generating…

Machine Learning · Computer Science 2026-02-24 Pin-Han Huang , Shang-Tse Chen , Hsuan-Tien Lin

Diffusion models (DMs) have achieved state-of-the-art results for image synthesis tasks as well as density estimation. Applied in the latent space of a powerful pretrained autoencoder (LDM), their immense computational requirements can be…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Jeremias Traub

In recent years, diffusion models have gained popularity for their ability to generate higher-quality images in comparison to GAN models. However, like any other large generative models, these models require a huge amount of data,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Rajesh Shrestha , Bowen Xie

While deep learning techniques have proven successful in image-related tasks, the exponentially increased data storage and computation costs become a significant challenge. Dataset distillation addresses these challenges by synthesizing…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Zhe Li , Weitong Zhang , Sarah Cechnicka , Bernhard Kainz

Deep generative models are becoming increasingly powerful, now generating diverse high fidelity photo-realistic samples given text prompts. Have they reached the point where models of natural images can be used for generative data…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Shekoofeh Azizi , Simon Kornblith , Chitwan Saharia , Mohammad Norouzi , David J. Fleet

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal , Alex Nichol

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Nithesh Chandher Karthikeyan , Jonas Unger , Gabriel Eilertsen

Discriminative classifiers have become a foundational tool in deep learning for medical imaging, excelling at learning separable features of complex data distributions. However, these models often need careful design, augmentation, and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-11 Gian Mario Favero , Parham Saremi , Emily Kaczmarek , Brennan Nichyporuk , Tal Arbel

Obtaining annotated table structure data for complex tables is a challenging task due to the inherent diversity and complexity of real-world document layouts. The scarcity of publicly available datasets with comprehensive annotations for…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Syed Jawwad Haider Hamdani , Saifullah Saifullah , Stefan Agne , Andreas Dengel , Sheraz Ahmed

Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Chenlin Meng , Robin Rombach , Ruiqi Gao , Diederik P. Kingma , Stefano Ermon , Jonathan Ho , Tim Salimans

Deep learning-based food image classification enables precise identification of food categories, further facilitating accurate nutritional analysis. However, real-world food images often show a skewed distribution, with some food types…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 GaYeon Koh , Hyun-Jic Oh , Jeonghyun Noh , Won-Ki Jeong

The class-conditional image generation based on diffusion models is renowned for generating high-quality and diverse images. However, most prior efforts focus on generating images for general categories, e.g., 1000 classes in ImageNet-1k. A…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Ziying Pan , Kun Wang , Gang Li , Feihong He , Yongxuan Lai

Image generation abilities of text-to-image diffusion models have significantly advanced, yielding highly photo-realistic images from descriptive text and increasing the viability of leveraging synthetic images to train computer vision…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Jiahui Chen , Amy Zhang , Adriana Romero-Soriano

Diffusion models can be improved with additional guidance towards more effective representations of input. Indeed, prior empirical work has already shown that aligning internal representations of the diffusion model with those of…

Machine Learning · Computer Science 2025-10-14 Chenyu Wang , Cai Zhou , Sharut Gupta , Zongyu Lin , Stefanie Jegelka , Stephen Bates , Tommi Jaakkola

Diffusion-based methods represented as stochastic differential equations on a continuous-time domain have recently proven successful as a non-adversarial generative model. Training such models relies on denoising score matching, which can…

Machine Learning · Computer Science 2024-11-05 Sarthak Mittal , Korbinian Abstreiter , Stefan Bauer , Bernhard Schölkopf , Arash Mehrjou
‹ Prev 1 2 3 10 Next ›