Related papers: Label-Efficient Model Selection for Text Generatio…

Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!

Diffusion models have been successfully adapted to text generation tasks by mapping the discrete text into the continuous space. However, there exist nonnegligible gaps between training and inference, owing to the absence of the forward…

Computation and Language · Computer Science 2023-05-09 Zecheng Tang , Pinzheng Wang , Keyan Zhou , Juntao Li , Ziqiang Cao , Min Zhang

Clean or Annotate: How to Spend a Limited Data Collection Budget

Crowdsourcing platforms are often used to collect datasets for training machine learning models, despite higher levels of inaccurate labeling compared to expert labeling. There are two common strategies to manage the impact of such noise.…

Computation and Language · Computer Science 2022-06-14 Derek Chen , Zhou Yu , Samuel R. Bowman

DiffIER: Optimizing Diffusion Models with Iterative Error Reduction

Diffusion models have demonstrated remarkable capabilities in generating high-quality samples and enhancing performance across diverse domains through Classifier-Free Guidance (CFG). However, the quality of generated samples is highly…

Computer Vision and Pattern Recognition · Computer Science 2025-08-21 Ao Chen , Lihe Ding , Tianfan Xue

DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models

Generating high-quality labeled image datasets is crucial for training accurate and robust machine learning models in the field of computer vision. However, the process of manually labeling real images is often time-consuming and costly. To…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Michael Shenoda , Edward Kim

Detect and Correct: A Selective Noise Correction Method for Learning with Noisy Labels

Falsely annotated samples, also known as noisy labels, can significantly harm the performance of deep learning models. Two main approaches for learning with noisy labels are global noise estimation and data filtering. Global noise…

Machine Learning · Computer Science 2025-07-31 Yuval Grinberg , Nimrod Harel , Jacob Goldberger , Ofir Lindenbaum

IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models

In-context learning is a promising paradigm that utilizes in-context examples as prompts for the predictions of large language models. These prompts are crucial for achieving strong performance. However, since the prompts need to be sampled…

Computation and Language · Computer Science 2025-07-15 Shaokun Zhang , Xiaobo Xia , Zhaoqing Wang , Ling-Hao Chen , Jiale Liu , Qingyun Wu , Tongliang Liu

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Diffusion models have shown impressive results in generating high-quality conditional samples using guidance techniques such as Classifier-Free Guidance (CFG). However, existing methods often require additional training or neural function…

Machine Learning · Computer Science 2025-07-22 Kwanyoung Kim , Byeongsu Sim

Diffuse to Detect: Bi-Level Sample Rebalancing with Pseudo-Label Diffusion for Point-Supervised Infrared Small-Target Detection

Point supervision has become a scalable solution to address dense annotation for infrared small target detection, but its performance is limited by two coupled bottlenecks: unstable pseudo-label evolution in cluttered, low-contrast infrared…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Zhu Liu , Yuanhang Yao , Ping Qian , Zihang Chen , Risheng Liu

Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models

The proposed method, Discriminator Guidance, aims to improve sample generation of pre-trained diffusion models. The approach introduces a discriminator that gives explicit supervision to a denoising sample path whether it is realistic or…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Dongjun Kim , Yeongmin Kim , Se Jung Kwon , Wanmo Kang , Il-Chul Moon

Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Diffusion models have demonstrated powerful performance in generating high-quality images. A typical example is text-to-image generator like Stable Diffusion. However, their widespread use also poses potential privacy risks. A key concern…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Guo Li , Weihong Chen , Yongfu Fan

Efficient Deep Representation Learning by Adaptive Latent Space Sampling

Supervised deep learning requires a large amount of training samples with annotations (e.g. label class for classification task, pixel- or voxel-wised label map for segmentation tasks), which are expensive and time-consuming to obtain.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Yuanhan Mo , Shuo Wang , Chengliang Dai , Rui Zhou , Zhongzhao Teng , Wenjia Bai , Yike Guo

On Efficient and Statistical Quality Estimation for Data Annotation

Annotated datasets are an essential ingredient to train, evaluate, compare and productionalize supervised machine learning models. It is therefore imperative that annotations are of high quality. For their creation, good quality management…

Machine Learning · Computer Science 2024-05-30 Jan-Christoph Klie , Juan Haladjian , Marc Kirchner , Rahul Nair

Label-Efficient Point Cloud Semantic Segmentation: An Active Learning Approach

Deep learning models are the state-of-the-art methods for semantic point cloud segmentation, the success of which relies on the availability of large-scale annotated datasets. However, it can be extremely time-consuming and prohibitively…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Xian Shi , Xun Xu , Ke Chen , Lile Cai , Chuan Sheng Foo , Kui Jia

Attention-Aware Answers of the Crowd

Crowdsourcing is a relatively economic and efficient solution to collect annotations from the crowd through online platforms. Answers collected from workers with different expertise may be noisy and unreliable, and the quality of annotated…

Machine Learning · Computer Science 2020-01-08 Jingzheng Tu , Guoxian Yu , Jun Wang , Carlotta Domeniconi , Xiangliang Zhang

AdaDiff: Adaptive Step Selection for Fast Diffusion Models

Diffusion models, as a type of generative model, have achieved impressive results in generating images and videos conditioned on textual conditions. However, the generation process of diffusion models involves denoising dozens of steps to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Hui Zhang , Zuxuan Wu , Zhen Xing , Jie Shao , Yu-Gang Jiang

Self-Guided Diffusion Models

Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Vincent Tao Hu , David W Zhang , Yuki M. Asano , Gertjan J. Burghouts , Cees G. M. Snoek

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Deep neural networks (DNNs) have demonstrated exceptional performance across various image segmentation tasks. However, the process of preparing datasets for training segmentation DNNs is both labor-intensive and costly, as it typically…

Computer Vision and Pattern Recognition · Computer Science 2025-01-24 Yixin Zhang , Shen Zhao , Hanxue Gu , Maciej A. Mazurowski

Identifying Wrongly Predicted Samples: A Method for Active Learning

State-of-the-art machine learning models require access to significant amount of annotated data in order to achieve the desired level of performance. While unlabelled data can be largely available and even abundant, annotation process can…

Machine Learning · Computer Science 2020-10-15 Rahaf Aljundi , Nikolay Chumerin , Daniel Olmeda Reino

Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilizes Generative…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Donggeun Ko , Dongjun Lee , Namjun Park , Wonkyeong Shim , Jaekwang Kim

Diffusion Model for Data-Driven Black-Box Optimization

Generative AI has redefined artificial intelligence, enabling the creation of innovative content and customized solutions that drive business practices into a new era of efficiency and creativity. In this paper, we focus on diffusion…

Machine Learning · Computer Science 2024-03-21 Zihao Li , Hui Yuan , Kaixuan Huang , Chengzhuo Ni , Yinyu Ye , Minshuo Chen , Mengdi Wang