Related papers: Simulation-Enhanced Data Augmentation for Machine …

200 papers

In supervised machine learning (SML) research, large training datasets are essential for valid results. However, obtaining primary data in learning analytics (LA) is challenging. Data augmentation can address this by expanding and…

Machine Learning · Computer Science 2024-12-04 Valdemar Švábenský, Conrad Borchers, Elizabeth B. Cloude, Atsushi Shimada

Machine learning has emerged as a promising approach to path loss prediction, yet its effectiveness often degrades when measurement data are scarce. To address this limitation, we propose an ensemble-based machine learning framework that…

Signal Processing · Electrical Eng. & Systems 2026-05-26 Ahmed P. Mohamed, Byunghyun Lee, Yaguang Zhang, Christopher R. Anderson, David J. Love, James V. Krogmeier

The growing number of pretrained models in Machine Learning (ML) presents significant challenges for practitioners. Given a new dataset, they need to determine the most suitable deep learning (DL) pipeline, consisting of the pretrained…

Machine Learning · Computer Science 2025-06-17 Fabio Ferreira

Can we improve machine-learning (ML) emulators with synthetic data? If data are scarce or expensive to source and a physical model is available, statistically generated data may be useful for augmenting training sets cheaply. Here we…

Machine Learning · Computer Science 2021-09-28 David Meyer, Thomas Nagler, Robin J. Hogan

Machine Learning (ML) in low-data settings remains an underappreciated yet crucial problem. Hence, data augmentation methods to increase the sample size of datasets needed for ML are key to unlocking the transformative potential of ML in…

Machine Learning · Computer Science 2024-07-02 Nabeel Seedat, Nicolas Huynh, Boris van Breugel, Mihaela van der Schaar

The integration of machine learning (ML) models enhances the efficiency, affordability, and reliability of feature detection in microscopy, yet their development and applicability are hindered by the dependency on scarce and often flawed…

Computer Vision and Pattern Recognition · Computer Science 2025-09-04 Matthew J. Lynch, Ryan Jacobs, Gabriella Bruno, Priyam Patki, Dane Morgan, Kevin G. Field

Synthetic augmentation is increasingly used to mitigate data scarcity in financial machine learning, yet its statistical role remains poorly understood. We formalize synthetic augmentation as a modification of the effective training…

Artificial Intelligence · Computer Science 2026-04-17 Mel Sohm, Charles Dezons, Sami Sellami, Oscar Ninou, Axel Pincon

Deep learning approaches are increasingly used to tackle forecasting tasks involving datasets with multiple univariate time series. A key factor in the successful application of these methods is a large enough training sample size, which is…

Machine Learning · Computer Science 2025-01-06 Vitor Cerqueira, Moisés Santos, Luis Roque, Yassine Baghoussi, Carlos Soares

Training and fine-tuning deep learning models, especially large language models (LLMs), on limited and imbalanced datasets poses substantial challenges. These issues often result in poor generalization, where models overfit to dominant…

Computation and Language · Computer Science 2025-01-14 Ashok Choudhary, Cornelius Thiels, Hojjat Salehinejad

Metabolic Syndrome (MetS) is a cluster of interrelated risk factors that significantly increases the risk of cardiovascular diseases and type 2 diabetes. Despite its global prevalence, accurate prediction of MetS remains challenging due to…

Machine Learning · Computer Science 2026-03-10 Sanyam Paresh Shah, Abdullah Mamun, Shovito Barua Soumma, Hassan Ghasemzadeh

Machine learning has significant potential for optimizing various industrial processes. However, data acquisition remains a major challenge as it is both time-consuming and costly. Synthetic data offers a promising solution to augment…

Artificial Intelligence · Computer Science 2025-11-12 Georg Rottenwalter, Marcel Tilly, Christian Bielenberg, Katharina Obermeier

New discoveries in chemistry and materials science, with increasingly expanding volume of requisite knowledge and experimental workload, provide unique opportunities for machine learning (ML) to take critical roles in accelerating research…

Recent breakthroughs in synthetic data generation approaches made it possible to produce highly photorealistic images which are hardly distinguishable from real ones. Furthermore, synthetic generation pipelines have the potential to…

Computer Vision and Pattern Recognition · Computer Science 2023-07-06 Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Matan Fintz, Gerard Medioni

Learning the distance metric between pairs of samples has been studied for image retrieval and clustering. With the remarkable success of pair-based metric learning losses, recent works have proposed the use of generated synthetic points on…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Byungsoo Ko, Geonmo Gu

Synthesizing realistic medical images provides a feasible solution to the shortage of training data in deep learning based medical image recognition systems. However, the quality control of synthetic images for data augmentation purposes is…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Jiarong Ye, Yuan Xue, L. Rodney Long, Sameer Antani, Zhiyun Xue, Keith Cheng, Xiaolei Huang

Automated data augmentation, which aims at engineering augmentation policy automatically, has recently drawn growing research interest. Many previous auto-augmentation methods utilized a Density Matching strategy by evaluating policies in…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Jianwei Zhang, Dong Li, Lituan Wang, Lei Zhang

Deep learning models with a large number of parameters, often referred to as over-parameterized models, have achieved exceptional performance across various tasks. Despite concerns about overfitting, these models frequently generalize well…

Machine Learning · Computer Science 2025-06-10 Ilya Kaufman Sirot, Omri Azencot

Attacks on computer networks have increased significantly in recent years, due in part to the availability of sophisticated tools for launching such attacks as well as a thriving underground cyber-crime economy to support them. Over the past…

Machine Learning · Computer Science 2022-04-04 Ayesha S. Dina, A. B. Siddique, D. Manivannan

We consider quantitative analyses of spectral data using laser-induced breakdown spectroscopy. We address the small size of training data available, and the validation of the predictions during inference on unknown data. For the purpose, we…

Machine learning is offering powerful new tools for the development and discovery of reduced models of nonlinear, multiscale plasma dynamics from the data of first-principles kinetic simulations. However, ensuring the physical consistency…

Plasma Physics · Physics 2026-02-25 Madox C. McGrae-Menge, Jacob R. Pierce, Frederico Fiuza, E. Paulo Alves