English
Related papers

Related papers: Diverse Generation while Maintaining Semantic Coor…

200 papers

In computer vision, it is well-known that a lack of data diversity will impair model performance. In this study, we address the challenges of enhancing the dataset diversity problem in order to benefit various downstream tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Yuhang Li , Xin Dong , Chen Chen , Weiming Zhuang , Lingjuan Lyu

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong , Fang-Yi Su , Jung-Hsien Chiang

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

Cultural heritage applications and advanced machine learning models are creating a fruitful synergy to provide effective and accessible ways of interacting with artworks. Smart audio-guides, personalized art-related content and gamification…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Dario Cioni , Lorenzo Berlincioni , Federico Becattini , Alberto del Bimbo

Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Huy Che , Dinh-Duy Phan , Duc-Khai Lam

Diffusion models has underpinned much recent advances of dataset augmentation in various computer vision tasks. However, when involving generating multi-object images as real scenarios, most existing methods either rely entirely on text…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Haoyu Wang , Lei Zhang , Wei Wei , Chen Ding , Yanning Zhang

This paper proposes a dataset augmentation method by fine-tuning pre-trained diffusion models. Generating images using a pre-trained diffusion model with textual conditioning often results in domain discrepancy between real data and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Abdullah Al Rahat , Hemanth Venkateswara

Despite continued advancement in recent years, deep neural networks still rely on large amounts of training data to avoid overfitting. However, labeled training data for real-world applications such as healthcare is limited and difficult to…

We present a method for expanding a dataset by incorporating knowledge from the wide distribution of pre-trained latent diffusion models. Data augmentations typically incorporate inductive biases about the image formation process into the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 Orest Kupyn , Christian Rupprecht

Simple data augmentation techniques, such as rotations and flips, are widely used to enhance the generalization power of computer vision models. However, these techniques often fail to modify high-level semantic attributes of a class. To…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Tobias Lingenberg , Markus Reuter , Gopika Sudhakaran , Dominik Gojny , Stefan Roth , Simone Schaub-Meyer

Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar

This paper investigates the impact of various data augmentation techniques on the performance of object detection models. Specifically, we explore classical augmentation methods, image compositing, and advanced generative models such as…

Computer Vision and Pattern Recognition · Computer Science 2025-02-20 Ang Jia Ning Shermaine , Michalis Lazarou , Tania Stathaki

Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Yukai Shi , Yupei Lin , Pengxu Wei , Xiaoyu Xian , Tianshui Chen , Liang Lin

High-quality Earth Observation (EO) imagery is essential for accurate analysis and informed decision making across sectors. However, data scarcity caused by atmospheric conditions, seasonal variations, and limited geographical coverage…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Tiago Sousa , Benoît Ries , Nicolas Guelfi

Data augmentation is one of the most common tools in deep learning, underpinning many recent advances including tasks such as classification, detection, and semantic segmentation. The standard approach to data augmentation involves simple…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Fulong Ma , Weiqing Qi , Guoyang Zhao , Ming Liu , Jun Ma

Scaling laws dictate that the performance of AI models is proportional to the amount of available data. Data augmentation is a promising solution to expanding the dataset size. Traditional approaches focused on augmentation using rotation,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Fazle Rahat , M Shifat Hossain , Md Rubel Ahmed , Sumit Kumar Jha , Rickard Ewetz

Latent diffusion models excel at producing high-quality images from text. Yet, concerns appear about the lack of diversity in the generated imagery. To tackle this, we introduce Diverse Diffusion, a method for boosting image diversity…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Mariia Zameshina , Olivier Teytaud , Laurent Najman

A significant challenge in object detection is accurate identification of an object's position in image space, whereas one algorithm with one set of parameters is usually not enough, and the fusion of multiple algorithms and/or parameters…

Computer Vision and Pattern Recognition · Computer Science 2018-03-20 Pan Wei , John E. Ball , Derek T. Anderson

Saliency prediction models are constrained by the limited diversity and quantity of labeled data. Standard data augmentation techniques such as rotating and cropping alter scene composition, affecting saliency. We propose a novel data…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Bahar Aydemir , Deblina Bhattacharjee , Tong Zhang , Mathieu Salzmann , Sabine Süsstrunk
‹ Prev 1 2 3 10 Next ›