English
Related papers

Related papers: Diversify Your Vision Datasets with Automatic Diff…

200 papers

Scaling laws dictate that the performance of AI models is proportional to the amount of available data. Data augmentation is a promising solution to expanding the dataset size. Traditional approaches focused on augmentation using rotation,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Fazle Rahat , M Shifat Hossain , Md Rubel Ahmed , Sumit Kumar Jha , Rickard Ewetz

In recent years, one of the most popular techniques in the computer vision community has been the deep learning technique. As a data-driven technique, deep model requires enormous amounts of accurately labelled training data, which is often…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Zihan Yang , Richard O. Sinnott , James Bailey , Qiuhong Ke

Convolutional neural networks have been widely applied to medical image segmentation and have achieved considerable performance. However, the performance may be significantly affected by the domain gap between training data (source domain)…

Image and Video Processing · Electrical Eng. & Systems 2022-07-28 Junyan Lyu , Yiqi Zhang , Yijin Huang , Li Lin , Pujin Cheng , Xiaoying Tang

In recent years, much progress has been made in learning robotic manipulation policies that follow natural language instructions. Such methods typically learn from corpora of robot-language data that was either collected with specific tasks…

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

Data augmentation aims to enrich training samples for alleviating the overfitting issue in low-resource or class-imbalanced situations. Traditional methods first devise task-specific operations such as Synonym Substitute, then preset the…

Computation and Language · Computer Science 2021-09-03 Shuhuai Ren , Jinchao Zhang , Lei Li , Xu Sun , Jie Zhou

While recent automated data augmentation methods lead to state-of-the-art results, their design spaces and the derived data augmentation strategies still incorporate strong human priors. In this work, instead of fixing a set of hand-picked…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Yu Zheng , Zhi Zhang , Shen Yan , Mi Zhang

Simple data augmentation techniques, such as rotations and flips, are widely used to enhance the generalization power of computer vision models. However, these techniques often fail to modify high-level semantic attributes of a class. To…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Tobias Lingenberg , Markus Reuter , Gopika Sudhakaran , Dominik Gojny , Stefan Roth , Simone Schaub-Meyer

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where deep learning techniques may fail. It is widely applied in computer vision then introduced to natural language processing and achieves improvements in…

Computation and Language · Computer Science 2022-06-28 Bohan Li , Yutai Hou , Wanxiang Che

Autonomous systems (AS) often use Deep Neural Network (DNN) classifiers to allow them to operate in complex, high-dimensional, non-linear, and dynamically changing environments. Due to the complexity of these environments, DNN classifiers…

Machine Learning · Computer Science 2024-08-16 Abanoub Ghobrial , Xuan Zheng , Darryl Hond , Hamid Asgari , Kerstin Eder

Generalizing to unseen image domains is a challenging problem primarily due to the lack of diverse training data, inaccessible target data, and the large domain shift that may exist in many real-world settings. As such data augmentation is…

Computer Vision and Pattern Recognition · Computer Science 2023-10-04 Sheng Cheng , Tejas Gokhale , Yezhou Yang

In Fine-Grained Visual Classification (FGVC), distinguishing highly similar subcategories remains a formidable challenge, often necessitating datasets with extensive variability. The acquisition and annotation of such FGVC datasets are…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Qiyu Liao , Xin Yuan , Min Xu , Dadong Wang

Training a deep learning model to classify histopathological images is challenging, because of the color and shape variability of the cells and tissues, and the reduced amount of available data, which does not allow proper learning of those…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Saypraseuth Mounsaveng , Issam Laradji , David Vázquez , Marco Perdersoli , Ismail Ben Ayed

Data augmentation has proved extremely useful by increasing training data variance to alleviate overfitting and improve deep neural networks' generalization performance. In medical image analysis, a well-designed augmentation policy usually…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Yunhe Gao , Zhiqiang Tang , Mu Zhou , Dimitris Metaxas

Vision-language-action models (VLAs) have become an increasingly popular approach for addressing robot manipulation problems in recent years. However, such models need to output actions at a rate suitable for robot control, which limits the…

Robotics · Computer Science 2025-09-30 Eric Hannus , Miika Malin , Tran Nguyen Le , Ville Kyrki

The acquisition of large-scale, high-quality data is a resource-intensive and time-consuming endeavor. Compared to conventional Data Augmentation (DA) techniques (e.g. cropping and rotation), exploiting prevailing diffusion models for data…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yunxiang Fu , Chaoqi Chen , Yu Qiao , Yizhou Yu

Weakly-supervised semantic segmentation (WSSS) has achieved remarkable progress using only image-level labels. However, most existing WSSS methods focus on designing new network structures and loss functions to generate more accurate dense…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Wangyu Wu , Xianglin Qiu , Siqi Song , Zhenhong Chen , Xiaowei Huang , Fei Ma , Jimin Xiao

Language plays a critical role in Vision-Language-Action (VLA) models, yet the linguistic characteristics of the datasets used to train and evaluate these systems remain poorly documented. In this work, we present a systematic dataset audit…

Computation and Language · Computer Science 2026-04-29 Selma Wanna , Agnes Luhtaru , Jonathan Salfity , Ryan Barron , Juston Moore , Cynthia Matuszek , Mitch Pryor

Although instruction tuning is widely used to adjust behavior in Large Language Models (LLMs), extensive empirical evidence and research indicates that it is primarily a process where the model fits to specific task formats, rather than…

Artificial Intelligence · Computer Science 2024-08-21 Yuanhao Zeng , Fei Ren , Xinpeng Zhou , Yihang Wang , Yingxia Shao
‹ Prev 1 2 3 10 Next ›