English
Related papers

Related papers: A Simple Background Augmentation Method for Object…

200 papers

Recent studies emphasize the crucial role of data augmentation in enhancing the performance of object detection models. However,existing methodologies often struggle to effectively harmonize dataset diversity with semantic coordination.To…

Computer Vision and Pattern Recognition · Computer Science 2024-08-07 Sen Nie , Zhuo Wang , Xinxin Wang , Kun He

We present a method for expanding a dataset by incorporating knowledge from the wide distribution of pre-trained latent diffusion models. Data augmentations typically incorporate inductive biases about the image formation process into the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 Orest Kupyn , Christian Rupprecht

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong , Fang-Yi Su , Jung-Hsien Chiang

This paper investigates the impact of various data augmentation techniques on the performance of object detection models. Specifically, we explore classical augmentation methods, image compositing, and advanced generative models such as…

Computer Vision and Pattern Recognition · Computer Science 2025-02-20 Ang Jia Ning Shermaine , Michalis Lazarou , Tania Stathaki

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

Generating background scenes for salient objects plays a crucial role across various domains including creative design and e-commerce, as it enhances the presentation and context of subjects by integrating them into tailored environments.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Amir Erfan Eshratifar , Joao V. B. Soares , Kapil Thadani , Shaunak Mishra , Mikhail Kuznetsov , Yueh-Ning Ku , Paloma de Juan

Performing data augmentation for learning deep neural networks is well known to be important for training visual recognition systems. By artificially increasing the number of training examples, it helps reducing overfitting and improves…

Computer Vision and Pattern Recognition · Computer Science 2018-07-20 Nikita Dvornik , Julien Mairal , Cordelia Schmid

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

In this paper, we present a novel paradigm to enhance the ability of object detector, e.g., expanding categories or improving detection performance, by training on synthetic dataset generated from diffusion models. Specifically, we…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Chengjian Feng , Yujie Zhong , Zequn Jie , Weidi Xie , Lin Ma

Data augmentation plays a crucial role in deep learning, enhancing the generalization and robustness of learning-based models. Standard approaches involve simple transformations like rotations and flips for generating extra data. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Shichao Dong , Ze Yang , Guosheng Lin

Data augmentation is one of the most common tools in deep learning, underpinning many recent advances including tasks such as classification, detection, and semantic segmentation. The standard approach to data augmentation involves simple…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Fulong Ma , Weiqing Qi , Guoyang Zhao , Ming Liu , Jun Ma

Instance segmentation datasets play a crucial role in training accurate and robust computer vision models. However, obtaining accurate mask annotations to produce high-quality segmentation datasets is a costly and labor-intensive process.…

Computer Vision and Pattern Recognition · Computer Science 2024-02-27 Markus Pobitzer , Filip Janicki , Mattia Rigotti , Cristiano Malossi

In this study, we show that diffusion models can be used in industrial scenarios to improve the data augmentation procedure in the context of surface defect detection. In general, defect detection classifiers are trained on ground-truth…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Luigi Capogrosso , Federico Girella , Francesco Taioli , Michele Dalla Chiara , Muhammad Aqeel , Franco Fummi , Francesco Setti , Marco Cristani

It is challenging to train a robust object detector under the supervised learning setting when the annotated data are scarce. Thus, previous approaches tackling this problem are in two categories: semi-supervised learning models that…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Guanghan Ning , Guang Chen , Chaowei Tan , Si Luo , Liefeng Bo , Heng Huang

Despite diffusion models' superior capabilities in modeling complex distributions, there are still non-trivial distributional discrepancies between generated and ground-truth images, which has resulted in several notable problems in image…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Yujian Liu , Yang Zhang , Tommi Jaakkola , Shiyu Chang

A plethora of text-guided image editing methods has recently been developed by leveraging the impressive capabilities of large-scale diffusion-based generative models especially Stable Diffusion. Despite the success of diffusion models in…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Qihe Pan , Zhen Zhao , Zicheng Wang , Sifan Long , Yiming Wu , Wei Ji , Haoran Liang , Ronghua Liang

Image captioning, an important vision-language task, often requires a tremendous number of finely labeled image-caption pairs for learning the underlying alignment between images and texts. In this paper, we proposed a multimodal data…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Changrong Xiao , Sean Xin Xu , Kunpeng Zhang

Computer vision tasks such as object detection and segmentation rely on the availability of extensive, accurately annotated datasets. In this work, We present CIA, a modular pipeline, for (1) generating synthetic images for dataset…

Modern agricultural operations increasingly rely on integrated monitoring systems that combine multiple data sources for farm optimization. Aerial drone-based animal health monitoring serves as a key component but faces limited data…

Computer Vision and Pattern Recognition · Computer Science 2026-01-20 Nisha Pillai

Scene text detection techniques have garnered significant attention due to their wide-ranging applications. However, existing methods have a high demand for training data, and obtaining accurate human annotations is labor-intensive and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Ling Fu , Zijie Wu , Yingying Zhu , Yuliang Liu , Xiang Bai
‹ Prev 1 2 3 10 Next ›