English
Related papers

Related papers: Synthetic Flight Data Generation Using Generative …

200 papers

Flight diversions are rare but high-impact events in aviation, making their reliable prediction vital for both safety and operational efficiency. However, their scarcity in historical records impedes the training of machine learning models…

Machine Learning · Computer Science 2026-04-23 Karim Aly , Alexei Sharpanskykh , Jacco Hoekstra

The generation of synthetic data is receiving increasing attention from the scientific community, thanks to its ability to solve problems like data scarcity and privacy, and is starting to find applications in air transport. We here tackle…

Machine Learning · Computer Science 2026-01-09 Pau Esteve , Massimiliano Zanin

Access to comprehensive flight operations data remains severely restricted in aviation due to commercial sensitivity and competitive considerations, hindering the development of predictive models for operational planning. This paper…

Machine Learning · Computer Science 2025-08-05 Abdulmajid Murad , Massimiliano Ruocco

In modern air traffic management, generating synthetic flight trajectories has emerged as a promising solution for addressing data scarcity, protecting sensitive information, and supporting large-scale analyses. In this paper, we propose a…

Machine Learning · Computer Science 2025-04-15 Abdulmajid Murad , Massimiliano Ruocco

Generative modeling has recently seen many exciting developments with the advent of deep generative architectures such as Variational Auto-Encoders (VAE) or Generative Adversarial Networks (GAN). The ability to draw synthetic i.i.d.…

Machine Learning · Computer Science 2021-02-19 Johan Leduc , Nicolas Grislain

The sharing of large-scale transportation data is beneficial for transportation planning and policymaking. However, it also raises significant security and privacy concerns, as the data may include identifiable personal information, such as…

Machine Learning · Computer Science 2025-02-14 Chengen Wang , Alvaro Cardenas , Gurcan Comert , Murat Kantarcioglu

Synthetic financial data provides a practical solution to the privacy, accessibility, and reproducibility challenges that often constrain empirical research in quantitative finance. This paper investigates the use of deep generative models,…

Statistical Finance · Quantitative Finance 2025-12-30 Christophe D. Hounwanou , Yae Ulrich Gaba

Deep learning models need a sufficient amount of data in order to be able to find the hidden patterns in it. It is the purpose of generative modeling to learn the data distribution, thus allowing us to sample more data and augment the…

Machine Learning · Computer Science 2024-11-28 José Fernando Núñez , Jamie Arjona , Javier Béjar

The ability to generate synthetic data has a variety of use cases across different domains. In education research, there is a growing need to have access to synthetic data to test certain concepts and ideas. In recent years, several deep…

Machine Learning · Computer Science 2022-10-18 Herkulaas MvE Combrink , Vukosi Marivate , Benjamin Rosman

Synthetic data generation becomes prevalent as a solution to privacy leakage and data shortage. Generative models are designed to generate a realistic synthetic dataset, which can precisely express the data distribution for the real…

Machine Learning · Computer Science 2021-04-22 Bingyang Wen , Luis Oliveros Colon , K. P. Subbalakshmi , R. Chandramouli

Background: Heart failure (HF) research is constrained by limited access to large, shareable datasets due to privacy regulations and institutional barriers. Synthetic data generation offers a promising solution to overcome these challenges…

The emergence of generative AI models has dramatically expanded the availability and use of synthetic data across scientific, industrial, and policy domains. While these developments open new possibilities for data analysis, they also raise…

Machine Learning · Statistics 2026-03-06 Ahmad Abdel-Azim , Ruoyu Wang , Xihong Lin

Synthetic data generation, a cornerstone of Generative Artificial Intelligence, promotes a paradigm shift in data science by addressing data scarcity and privacy while enabling unprecedented performance. As synthetic data becomes more…

Machine Learning · Statistics 2024-03-12 Xiaotong Shen , Yifei Liu , Rex Shen

The widespread adoption of electronic health records and digital healthcare data has created a demand for data-driven insights to enhance patient outcomes, diagnostics, and treatments. However, using real patient data presents privacy and…

Machine Learning · Computer Science 2023-11-15 Aryan Jadon , Shashank Kumar

This article provides a comprehensive synthesis of the recent developments in synthetic data generation via deep generative models, focusing on tabular datasets. We specifically outline the importance of synthetic data generation in the…

Machine Learning · Computer Science 2023-08-29 Conor Hassan , Robert Salomone , Kerrie Mengersen

Synthetic tabular data generation has emerged as a promising method to address limited data availability and privacy concerns. With the sharp increase in the performance of large language models in recent years, researchers have been…

Machine Learning · Computer Science 2025-03-28 Reilly Cannon , Nicolette M. Laird , Caesar Vazquez , Andy Lin , Amy Wagler , Tony Chiang

Accurate and comprehensive clinical documentation is crucial for delivering high-quality healthcare, facilitating effective communication among providers, and ensuring compliance with regulatory requirements. However, manual transcription…

Computation and Language · Computer Science 2024-06-12 Anjanava Biswas , Wrick Talukdar

Deep generative models have become useful for synthetic data generation, particularly population synthesis. The models implicitly learn the probability distribution of a dataset and can draw samples from a distribution. Several models have…

Machine Learning · Computer Science 2022-11-28 Daniel Opoku Mensah , Godwin Badu-Marfo , Bilal Farooq

Recent breakthroughs in synthetic data generation approaches made it possible to produce highly photorealistic images which are hardly distinguishable from real ones. Furthermore, synthetic generation pipelines have the potential to…

Computer Vision and Pattern Recognition · Computer Science 2023-07-06 Alon Shoshan , Nadav Bhonker , Igor Kviatkovsky , Matan Fintz , Gerard Medioni

High-quality training data is critical to the performance of machine learning models, particularly Large Language Models (LLMs). However, obtaining real, high-quality data can be challenging, especially for smaller organizations and…

Machine Learning · Computer Science 2025-06-24 Cristian Del Gobbo
‹ Prev 1 2 3 10 Next ›