English
Related papers

Related papers: Diffusion Transformers for Tabular Data Time Serie…

200 papers

While generative modeling on time series facilitates more capable and flexible probabilistic forecasting, existing generative time series models do not address the multi-dimensional properties of time series data well. The prevalent…

Machine Learning · Computer Science 2026-02-09 Haoran Zhang , Haixuan Liu , Yong Liu , Yunzhong Qiu , Yuxuan Wang , Jianmin Wang , Mingsheng Long

Data imputation and data generation have important applications for many domains, like healthcare and finance, where incomplete or missing data can hinder accurate analysis and decision-making. Diffusion models have emerged as powerful…

Machine Learning · Computer Science 2025-06-10 Mario Villaizán-Vallelado , Matteo Salvatori , Carlos Segura , Ioannis Arapakis

Generating synthetic tabular data is critical in machine learning, especially when real data is limited or sensitive. Traditional generative models often face challenges due to the unique characteristics of tabular data, such as mixed data…

Machine Learning · Computer Science 2024-10-30 Vitaliy Kinakh , Slava Voloshynovskiy

Diffusion models, a family of generative models based on deep learning, have become increasingly prominent in cutting-edge machine learning research. With a distinguished performance in generating samples that resemble the observed data,…

Machine Learning · Computer Science 2023-05-02 Lequan Lin , Zhengkun Li , Ruikun Li , Xuliang Li , Junbin Gao

Deep generative models have made rapid progress in image, text, audio, and video generation, and are increasingly being applied to structured records. For tabular data, however, generative modeling remains difficult: a dataset may contain…

Machine Learning · Computer Science 2026-05-25 Zhong Li , Qi Huang , Lincen Yang , Jiayang Shi , Zhao Yang , Niki van Stein , Thomas Bäck , Matthijs van Leeuwen

Tabular data is one of the most ubiquitous modalities, yet the literature on tabular generative foundation models is lagging far behind its text and vision counterparts. Creating such a model is hard, due to the heterogeneous feature spaces…

Machine Learning · Computer Science 2024-06-26 Boris van Breugel , Jonathan Crabbé , Rob Davis , Mihaela van der Schaar

With the development of Artificial Intelligence, numerous real-world tasks have been accomplished using technology integrated with deep learning. To achieve optimal performance, deep neural networks typically require large volumes of data…

Machine Learning · Computer Science 2025-05-09 Yuren Zhang , Zhongnan Pu , Lei Jing

Synthesizing high-quality tabular data is an important topic in many data science tasks, ranging from dataset augmentation to privacy protection. However, developing expressive generative models for tabular data is challenging due to its…

Machine Learning · Computer Science 2025-02-18 Juntong Shi , Minkai Xu , Harper Hua , Hengrui Zhang , Stefano Ermon , Jure Leskovec

Diffusion models have been widely used in time series and spatio-temporal data, enhancing generative, inferential, and downstream capabilities. These models are applied across diverse fields such as healthcare, recommendation, climate,…

Machine Learning · Computer Science 2025-12-09 Yiyuan Yang , Ming Jin , Haomin Wen , Chaoli Zhang , Yuxuan Liang , Lintao Ma , Yi Wang , Chenghao Liu , Bin Yang , Zenglin Xu , Shirui Pan , Qingsong Wen

There is a recent growing interest in applying Deep Learning techniques to tabular data, in order to replicate the success of other Artificial Intelligence areas in this structured domain. Specifically interesting is the case in which…

Machine Learning · Computer Science 2025-05-06 Simone Luetto , Fabrizio Garuti , Enver Sangineto , Lorenzo Forni , Rita Cucchiara

The ability to train generative models that produce realistic, safe and useful tabular data is essential for data privacy, imputation, oversampling, explainability or simulation. However, generating tabular data is not straightforward due…

Machine Learning · Computer Science 2025-09-18 G. Charbel N. Kindji , Lina Maria Rojas-Barahona , Elisa Fromont , Tanguy Urvoy

Generative models have been successfully used in the field of time series generation. However, when dealing with long-term time series, which span over extended periods and exhibit more complex long-term temporal patterns, the task of…

Machine Learning · Computer Science 2025-09-01 Xuan Hou , Shuhan Liu , Zhaohui Peng , Yaohui Chu , Yue Zhang , Yining Wang

Tabular datasets are ubiquitous in data science applications. Given their importance, it seems natural to apply state-of-the-art deep learning algorithms in order to fully unlock their potential. Here we propose neural network models that…

Machine Learning · Computer Science 2021-02-15 Inkit Padhi , Yair Schiff , Igor Melnyk , Mattia Rigotti , Youssef Mroueh , Pierre Dognin , Jerret Ross , Ravi Nair , Erik Altman

In recent years, generative diffusion models have achieved a rapid paradigm shift in deep generative models by showing groundbreaking performance across various applications. Meanwhile, structured data, encompassing tabular and time series…

Machine Learning · Computer Science 2023-07-11 Heejoon Koo , To Eun Kim

Tabular data is one of the most prevalent and important data formats in real-world applications such as healthcare, finance, and education. However, its effective use in machine learning is often constrained by data scarcity, privacy…

Machine Learning · Computer Science 2025-07-18 Ruxue Shi , Yili Wang , Mengnan Du , Xu Shen , Yi Chang , Xin Wang

Diffusion model has become a main paradigm for synthetic data generation in many subfields of modern machine learning, including computer vision, language model, or speech synthesis. In this paper, we leverage the power of diffusion model…

Machine Learning · Statistics 2023-11-20 Namjoon Suh , Xiaofeng Lin , Din-Yin Hsieh , Merhdad Honarkhah , Guang Cheng

Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-preserving augmentation. Tabular Denoising Diffusion Probabilistic Models (TabDDPM) generate high-quality synthetic data from…

Machine Learning · Computer Science 2026-04-08 Umang Dobhal , Christina Garcia , Sozo Inoue

Autoregressive models are predominant in natural language generation, while their application in tabular data remains underexplored. We posit that this can be attributed to two factors: 1) tabular data contains heterogeneous data type,…

Machine Learning · Computer Science 2024-10-30 Hengrui Zhang , Liancheng Fang , Qitian Wu , Philip S. Yu

Time series generation is widely used in real-world applications such as simulation, data augmentation, and hypothesis testing. Recently, diffusion models have emerged as the de facto approach to time series generation, enabling diverse…

Machine Learning · Computer Science 2025-12-25 Jinseong Park , Seungyun Lee , Woojin Jeong , Yujin Choi , Jaewook Lee

Training data has been proven to be one of the most critical components in training generative AI. However, obtaining high-quality data remains challenging, with data privacy issues presenting a significant hurdle. To address the need for…

Computation and Language · Computer Science 2025-06-18 Jia-Chen Zhang , Zheng Zhou , Yu-Jie Xiong , Chun-Ming Xia , Fei Dai
‹ Prev 1 2 3 10 Next ›