English
Related papers

Related papers: AutoSimulate: (Quickly) Learning Synthetic Data Ge…

200 papers

Simulation is a useful tool in situations where training data for machine learning models is costly to annotate or even hard to acquire. In this work, we propose a reinforcement learning-based method for automatically adjusting the…

Machine Learning · Computer Science 2019-05-15 Nataniel Ruiz , Samuel Schulter , Manmohan Chandraker

Deep learning is now the gold standard in computer vision-based quality inspection systems. In order to detect defects, supervised learning is often utilized, but necessitates a large amount of annotated images, which can be costly:…

Computer Vision and Pattern Recognition · Computer Science 2021-07-23 Pierre Gutierrez , Maria Luschkova , Antoine Cordier , Mustafa Shukor , Mona Schappert , Tim Dahmen

Although many AI applications of interest require specialized multi-modal models, relevant data to train such models is inherently scarce or inaccessible. Filling these gaps with human annotators is prohibitively expensive, error-prone, and…

Artificial Intelligence · Computer Science 2026-04-01 Tim R. Davidson , Benoit Seguin , Enrico Bacis , Cesar Ilharco , Hamza Harkous

Synthetic data is being used lately for training deep neural networks in computer vision applications such as object detection, object segmentation and 6D object pose estimation. Domain randomization hereby plays an important role in…

Computer Vision and Pattern Recognition · Computer Science 2024-05-13 Parth Rawal , Mrunal Sompura , Wolfgang Hintze

Training models to high-end performance requires availability of large labeled datasets, which are expensive to get. The goal of our work is to automatically synthesize labeled datasets that are relevant for a downstream task. We propose…

Computer Vision and Pattern Recognition · Computer Science 2019-04-29 Amlan Kar , Aayush Prakash , Ming-Yu Liu , Eric Cameracci , Justin Yuan , Matt Rusiniak , David Acuna , Antonio Torralba , Sanja Fidler

Accurately evaluating model performance is crucial for deploying machine learning systems in real-world applications. Traditional methods often require a sufficiently large labeled test set to ensure a reliable evaluation. However, in many…

Machine Learning · Computer Science 2025-11-04 Hai Hoang Thanh , Duy-Tung Nguyen , Hung The Tran , Khoat Than

The performance of supervised deep learning algorithms depends significantly on the scale, quality and diversity of the data used for their training. Collecting and manually annotating large amount of data can be both time-consuming and…

Computer Vision and Pattern Recognition · Computer Science 2021-07-02 C. Symeonidis , P. Nousi , P. Tosidis , K. Tsampazis , N. Passalis , A. Tefas , N. Nikolaidis

Recent breakthroughs in synthetic data generation approaches made it possible to produce highly photorealistic images which are hardly distinguishable from real ones. Furthermore, synthetic generation pipelines have the potential to…

Computer Vision and Pattern Recognition · Computer Science 2023-07-06 Alon Shoshan , Nadav Bhonker , Igor Kviatkovsky , Matan Fintz , Gerard Medioni

Generative models have become a powerful tool for synthesizing training data in computer vision tasks. Current approaches solely focus on aligning generated images with the target dataset distribution. As a result, they capture only the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-08 Zerun Wang , Jiafeng Mao , Xueting Wang , Toshihiko Yamasaki

The usage of medical image data for the training of large-scale machine learning approaches is particularly challenging due to its scarce availability and the costly generation of data annotations, typically requiring the engagement of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Joshua Niemeijer , Jan Ehrhardt , Hristina Uzunova , Heinz Handels

This paper addresses the challenge of overfitting in the learning of dynamical systems by introducing a novel approach for the generation of synthetic data, aimed at enhancing model generalization and robustness in scenarios characterized…

Machine Learning · Computer Science 2024-03-11 Dario Piga , Matteo Rufolo , Gabriele Maroni , Manas Mejari , Marco Forgione

While synthetic tabular data generation using Deep Generative Models (DGMs) offers a compelling solution to data scarcity and privacy concerns, their effectiveness relies on the availability of substantial training data, often lacking in…

Machine Learning · Computer Science 2025-08-01 Patricia A. Apellániz , Ana Jiménez , Borja Arroyo Galende , Juan Parras , Santiago Zazo

Generative models trained on synthetic plan data are a promising approach to generalized planning. Recent work has focused on finding any valid plan, rather than a high-quality solution. We address the challenge of producing high-quality…

Synthetic training data has gained prominence in numerous learning tasks and scenarios, offering advantages such as dataset augmentation, generalization evaluation, and privacy preservation. Despite these benefits, the efficiency of…

Machine Learning · Computer Science 2024-03-21 Jianhao Yuan , Jie Zhang , Shuyang Sun , Philip Torr , Bo Zhao

In precision agriculture, the scarcity of labeled data and significant covariate shifts pose unique challenges for training machine learning models. This scarcity is particularly problematic due to the dynamic nature of the environment and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-26 Leonardo Saraceni , Ionut Marian Motoi , Daniele Nardi , Thomas Alessandro Ciarfuglia

We present an overview and evaluation of a new, systematic approach for generation of highly realistic, annotated synthetic data for training of deep neural networks in computer vision tasks. The main contribution is a procedural world…

Computer Vision and Pattern Recognition · Computer Science 2017-10-19 Apostolia Tsirikoglou , Joel Kronander , Magnus Wrenninge , Jonas Unger

Synthetic data generation has emerged as a crucial topic for financial institutions, driven by multiple factors, such as privacy protection and data augmentation. Many algorithms have been proposed for synthetic data generation but reaching…

Machine Learning · Computer Science 2024-05-13 Shinpei Nakamura-Sakai , Fadi Hamad , Saheed Obitayo , Vamsi K. Potluru

Synthetic data is emerging as a promising solution to the scalability issue of supervised deep learning, especially when real data are difficult to acquire or hard to annotate. Synthetic data generation, however, can itself be prohibitively…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Aayush Prakash , Shoubhik Debnath , Jean-Francois Lafleche , Eric Cameracci , Gavriel State , Stan Birchfield , Marc T. Law

Object recognition and object pose estimation in robotic grasping continue to be significant challenges, since building a labelled dataset can be time consuming and financially costly in terms of data collection and annotation. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Dongmyoung Lee , Wei Chen , Nicolas Rojas

As insufficient data volume and quality remain the key impediments to the adoption of modern subsymbolic AI, techniques of synthetic data generation are in high demand. Simulation offers an apt, systematic approach to generating diverse…

Artificial Intelligence · Computer Science 2026-02-18 Xiaoran Liu , Istvan David
‹ Prev 1 2 3 10 Next ›