Related papers: DatasetDM: Synthesizing Data with Perception Annot…

SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations

In hyperspectral remote sensing field, some downstream dense prediction tasks, such as semantic segmentation (SS) and change detection (CD), rely on supervised learning to improve model performance and require a large amount of manually…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Wendi Liu , Pei Yang , Wenhui Hong , Xiaoguang Mei , Jiayi Ma

SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture

This paper introduces a methodology for generating synthetic annotated data to address data scarcity in semantic segmentation tasks within the precision agriculture domain. Utilizing Denoising Diffusion Probabilistic Models (DDPMs) and…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Andrew Heschl , Mauricio Murillo , Keyhan Najafian , Farhad Maleki

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

Denoising Diffusion Probabilistic Models for Generation of Realistic Fully-Annotated Microscopy Image Data Sets

Recent advances in computer vision have led to significant progress in the generation of realistic image data, with denoising diffusion probabilistic models proving to be a particularly effective method. In this study, we demonstrate that…

Image and Video Processing · Electrical Eng. & Systems 2023-08-09 Dennis Eschweiler , Rüveyda Yilmaz , Matisse Baumann , Ina Laube , Rijo Roy , Abin Jose , Daniel Brückner , Johannes Stegmaier

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

We introduce DatasetGAN: an automatic procedure to generate massive datasets of high-quality semantically segmented images requiring minimal human effort. Current deep networks are extremely data-hungry, benefiting from training on…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Yuxuan Zhang , Huan Ling , Jun Gao , Kangxue Yin , Jean-Francois Lafleche , Adela Barriuso , Antonio Torralba , Sanja Fidler

DataDAM: Efficient Dataset Distillation with Attention Matching

Researchers have long tried to minimize training costs in deep learning while maintaining strong generalization across diverse datasets. Emerging research on dataset distillation aims to reduce training costs by creating a small synthetic…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Ahmad Sajedi , Samir Khaki , Ehsan Amjadian , Lucy Z. Liu , Yuri A. Lawryshyn , Konstantinos N. Plataniotis

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models

Collecting and annotating images with pixel-wise labels is time-consuming and laborious. In contrast, synthetic data can be freely available using a generative model (e.g., DALL-E, Stable Diffusion). In this paper, we show that it is…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Weijia Wu , Yuzhong Zhao , Mike Zheng Shou , Hong Zhou , Chunhua Shen

One Category One Prompt: Dataset Distillation using Diffusion Models

The extensive amounts of data required for training deep neural networks pose significant challenges on storage and transmission fronts. Dataset distillation has emerged as a promising technique to condense the information of massive…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Ali Abbasi , Ashkan Shahbazi , Hamed Pirsiavash , Soheil Kolouri

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Current perceptive models heavily depend on resource-intensive datasets, prompting the need for innovative solutions. Leveraging recent advances in diffusion models, synthetic data, by constructing image inputs from various annotations,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Yibo Wang , Ruiyuan Gao , Kai Chen , Kaiqiang Zhou , Yingjie Cai , Lanqing Hong , Zhenguo Li , Lihui Jiang , Dit-Yan Yeung , Qiang Xu , Kai Zhang

High-Resolution Image Synthesis with Latent Diffusion Models

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a…

Computer Vision and Pattern Recognition · Computer Science 2022-04-14 Robin Rombach , Andreas Blattmann , Dominik Lorenz , Patrick Esser , Björn Ommer

Dataset Distillation with Probabilistic Latent Features

As deep learning models grow in complexity and the volume of training data increases, reducing storage and computational costs becomes increasingly important. Dataset distillation addresses this challenge by synthesizing a compact set of…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Zhe Li , Sarah Cechnicka , Cheng Ouyang , Katharina Breininger , Peter Schüffler , Bernhard Kainz

Meta-Sim: Learning to Generate Synthetic Datasets

Training models to high-end performance requires availability of large labeled datasets, which are expensive to get. The goal of our work is to automatically synthesize labeled datasets that are relevant for a downstream task. We propose…

Computer Vision and Pattern Recognition · Computer Science 2019-04-29 Amlan Kar , Aayush Prakash , Ming-Yu Liu , Eric Cameracci , Justin Yuan , Matt Rusiniak , David Acuna , Antonio Torralba , Sanja Fidler

Representation Learning with Diffusion Models

Diffusion models (DMs) have achieved state-of-the-art results for image synthesis tasks as well as density estimation. Applied in the latent space of a powerful pretrained autoencoder (LDM), their immense computational requirements can be…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Jeremias Traub

DiM: Distilling Dataset into Generative Model

Dataset distillation reduces the network training cost by synthesizing small and informative datasets from large-scale ones. Despite the success of the recent dataset distillation algorithms, three drawbacks still limit their wider…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Kai Wang , Jianyang Gu , Daquan Zhou , Zheng Zhu , Wei Jiang , Yang You

Unsupervised Class Generation to Expand Semantic Segmentation Datasets

Semantic segmentation is a computer vision task where classification is performed at a pixel level. Due to this, the process of labeling images for semantic segmentation is time-consuming and expensive. To mitigate this cost there has been…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Javier Montalvo , Álvaro García-Martín , Pablo Carballeira , Juan C. SanMiguel

Synthetic data enables faster annotation and robust segmentation for multi-object grasping in clutter

Object recognition and object pose estimation in robotic grasping continue to be significant challenges, since building a labelled dataset can be time consuming and financially costly in terms of data collection and annotation. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Dongmyoung Lee , Wei Chen , Nicolas Rojas

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

Diffusion models (DMs) have demonstrated exceptional generative capabilities across various domains, including image, video, and so on. A key factor contributing to their effectiveness is the high quantity and quality of data used during…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Qianlong Xiang , Miao Zhang , Yuzhang Shang , Jianlong Wu , Yan Yan , Liqiang Nie

BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

Annotating images with pixel-wise labels is a time-consuming and costly process. Recently, DatasetGAN showcased a promising alternative - to synthesize a large labeled dataset via a generative adversarial network (GAN) by exploiting a small…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 Daiqing Li , Huan Ling , Seung Wook Kim , Karsten Kreis , Adela Barriuso , Sanja Fidler , Antonio Torralba

Semi-Supervised and Task-Driven Data Augmentation

Supervised deep learning methods for segmentation require large amounts of labelled training data, without which they are prone to overfitting, not generalizing well to unseen images. In practice, obtaining a large number of annotations…

Computer Vision and Pattern Recognition · Computer Science 2019-03-01 Krishna Chaitanya , Neerav Karani , Christian Baumgartner , Olivio Donati , Anton Becker , Ender Konukoglu

The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models

Despite the notable accomplishments of deep object detection models, a major challenge that persists is the requirement for extensive amounts of training data. The process of procuring such real-world data is a laborious undertaking, which…

Computer Vision and Pattern Recognition · Computer Science 2023-06-19 Roy Voetman , Maya Aghaei , Klaas Dijkstra