Related papers: Class-Balancing Diffusion Models

Training Class-Imbalanced Diffusion Model Via Overlap Optimization

Diffusion models have made significant advances recently in high-quality image synthesis and related tasks. However, diffusion models trained on real-world datasets, which often follow long-tailed distributions, yield inferior fidelity for…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Divin Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tang

Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples

Predictive models trained on imbalanced data tend to produce biased results. This problem is exacerbated when there is not just one output label, but a set of them. This is the case for multilabel learning (MLL) algorithms used to classify…

Machine Learning · Computer Science 2025-01-22 Francisco Charte, Miguel Ángel Dávila, María Dolores Pérez-Godoy, María José del Jesus

Latent-based Diffusion Model for Long-tailed Recognition

Long-tailed imbalance distribution is a common issue in practical computer vision applications. Previous works proposed methods to address this problem, which can be categorized into several classes: re-sampling, re-weighting, transfer…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Pengxiao Han, Changkun Ye, Jieming Zhou, Jing Zhang, Jie Hong, Xuesong Li

CORAL: Disentangling Latent Representations in Long-Tailed Diffusion

Diffusion models have achieved impressive performance in generating high-quality and diverse synthetic data. However, their success typically assumes a class-balanced training distribution. In real-world settings, multi-class data often…

Machine Learning · Computer Science 2025-12-02 Esther Rodriguez, Monica Welfert, Samuel McDowell, Nathan Stromberg, Julian Antolin Camarena, Lalitha Sankar

Class-Balanced Distillation for Long-Tailed Visual Recognition

Real-world imagery is often characterized by a significant imbalance of the number of images per class, leading to long-tailed distributions. An effective and simple approach to long-tailed visual recognition is to learn feature…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 Ahmet Iscen, André Araujo, Boqing Gong, Cordelia Schmid

Anisotropic Diffusion Probabilistic Model for Imbalanced Image Classification

Real-world data often has a long-tailed distribution, where the scarcity of tail samples significantly limits the model's generalization ability. Denoising Diffusion Probabilistic Models (DDPM) are generative models based on stochastic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Jingyu Kong, Yuan Guo, Yu Wang, Yuping Duan

Towards Principled Dataset Distillation: A Spectral Distribution Perspective

Dataset distillation (DD) aims to compress large-scale datasets into compact synthetic counterparts for efficient model training. However, existing DD methods exhibit substantial performance degradation on long-tailed datasets. We identify…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Ruixi Wu, Shaobo Wang, Jiahuan Chen, Zhiyuan Liu, Yicun Yang, Zhaorun Chen, Zekai Li, Kaixin Li, Xinming Wang, Hongzhu Yi, Kai Wang, Linfeng Zhang

Label-Aware Distribution Calibration for Long-tailed Classification

Real-world data usually present long-tailed distributions. Training on imbalanced data tends to render neural networks perform well on head classes while much worse on tail classes. The severe sparseness of training instances for the tail…

Machine Learning · Computer Science 2021-11-10 Chaozheng Wang, Shuzheng Gao, Cuiyun Gao, Pengyun Wang, Wenjie Pei, Lujia Pan, Zenglin Xu

Contrastive Conditional-Unconditional Alignment for Long-tailed Diffusion Model

Training data for class-conditional image synthesis often exhibit a long-tailed distribution with limited images for tail classes. Such an imbalance causes mode collapse and reduces the diversity of synthesized images for tail classes. For…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Fang Chen, Alex Villa, Gongbo Liang, Xiaoyi Lu, Meng Tang

Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

Object frequency in the real world often follows a power law, leading to a mismatch between datasets with long-tailed class distributions seen by a machine learning model and our expectation of the model to perform well on all classes. We…

Computer Vision and Pattern Recognition · Computer Science 2020-03-25 Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong

DiffuLT: How to Make Diffusion Model Useful for Long-tail Recognition

This paper proposes a new pipeline for long-tail (LT) recognition. Instead of re-weighting or re-sampling, we utilize the long-tailed dataset itself to generate a balanced proxy that can be optimized through cross-entropy (CE).…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Jie Shao, Ke Zhu, Hanxiao Zhang, Jianxin Wu

Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification

Class imbalance refers to a situation where certain classes in a dataset have significantly fewer samples than others, leading to biased model performance. Class imbalance in network intrusion detection using Tabular Denoising Diffusion…

Cryptography and Security · Computer Science 2026-02-02 Aravind B, Anirud R. S., Sai Surya Teja N, Bala Subrahmanya Sriranga Navaneeth A, Karthika R, Mohankumar N

Feature Generation for Long-tail Classification

The visual world naturally exhibits an imbalance in the number of object or scene instances resulting in a long-tailed distribution. This imbalance poses significant challenges for classification models based on deep learning.…

Computer Vision and Pattern Recognition · Computer Science 2021-11-12 Rahul Vigneswaran, Marc T. Law, Vineeth N. Balasubramanian, Makarand Tapaswi

Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model

Image generative models, particularly diffusion-based models, have surged in popularity due to their remarkable ability to synthesize highly realistic images. However, since these models are data-driven, they inherit biases from the…

Machine Learning · Computer Science 2025-03-18 Lin-Chun Huang, Ching Chieh Tsao, Fang-Yi Su, Jung-Hsien Chiang

Revisiting Long-tailed Image Classification: Survey and Benchmarks with New Evaluation Metrics

Recently, long-tailed image classification harvests lots of research attention, since the data distribution is long-tailed in many real-world situations. Piles of algorithms are devised to address the data imbalance problem by biasing the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-06 Chaowei Fang, Dingwen Zhang, Wen Zheng, Xue Li, Le Yang, Lechao Cheng, Junwei Han

Advancing Image Classification with Discrete Diffusion Classification Modeling

Image classification is a well-studied task in computer vision, and yet it remains challenging under high-uncertainty conditions, such as when input images are corrupted or training data are limited. Conventional classification approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Omer Belhasin, Shelly Golan, Ran El-Yaniv, Michael Elad

Synthetic Data Generation for Long-Tail Medical Image Classification: A Case Study in Skin Lesions

Long-tailed class distributions are pervasive in multi-class medical datasets and pose significant challenges for deep learning models which typically underperform on tail classes with limited samples. This limitation is particularly…

Computer Vision and Pattern Recognition · Computer Science 2026-05-06 Jiaxiang Jiang, Mahesh Subedar, Omesh Tickoo

Decision Boundary-aware Generation for Long-tailed Learning

Long-tailed data bias decision boundaries toward head classes and degrade tail class accuracy. Diffusion-based generative augmentation addresses this problem by generating additional data, while head-to-tail transfer further mitigates the…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Jiacheng Yang, Ruichi Zhang, Chikai Shang, Mengke Li, Xinyi Shang, Junlong Gao, Yonggang Zhang, Yang Lu

Synthetic Data Augmentation using Pre-trained Diffusion Models for Long-tailed Food Image Classification

Deep learning-based food image classification enables precise identification of food categories, further facilitating accurate nutritional analysis. However, real-world food images often show a skewed distribution, with some food types…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 GaYeon Koh, Hyun-Jic Oh, Jeonghyun Noh, Won-Ki Jeong

Constrained Diffusion Models via Dual Training

Diffusion models have attained prominence for their ability to synthesize a probability distribution for a given dataset via a diffusion process, enabling the generation of new data points with high fidelity. However, diffusion processes…

Machine Learning · Computer Science 2024-11-25 Shervin Khalafi, Dongsheng Ding, Alejandro Ribeiro