Related papers: Pretrained-Guided Conditional Diffusion Models for…

DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data

Microbiome data analysis is essential for understanding host health and disease, yet its inherent sparsity and noise pose major challenges for accurate imputation, hindering downstream tasks such as biomarker discovery. Existing imputation…

Machine Learning · Computer Science 2025-08-01 Rabeya Tus Sadia, Qiang Cheng

Bayesian Sparse Regression for Microbiome-Metabolite Data Integration

Numerous studies have shown that microbial metabolites, which represent the products of bacteria in the human gut, play a key role in shaping cancer risk and response to treatment. However, metabolite data typically contain a large…

Applications · Statistics 2026-05-19 Kai Jiang, Satabdi Saha, Christine B. Peterson

Latent Variable Modeling for the Microbiome

The human microbiome is a complex ecological system, and describing its structure and function under different environmental conditions is important from both basic scientific and medical perspectives. Viewed through a biostatistical lens,…

Applications · Statistics 2017-11-17 Kris Sankaran, Susan P. Holmes

MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification

Deep learning models have made significant advances in histological prediction tasks in recent years. However, for adaptation in clinical practice, their lack of robustness to varying conditions such as staining, scanner, hospital, and…

Image and Video Processing · Electrical Eng. & Systems 2025-06-23 David Jacob Drexlin, Jonas Dippel, Julius Hense, Niklas Prenißl, Grégoire Montavon, Frederick Klauschen, Klaus-Robert Müller

MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation

Diffusion models have recently emerged as powerful tools for missing data imputation by modeling the joint distribution of observed and unobserved variables. However, existing methods, typically based on stochastic denoising diffusion…

Artificial Intelligence · Computer Science 2025-08-06 Youran Zhou, Mohamed Reda Bouadjenek, Sunil Aryal

Lung Cancer Risk Estimation with Incomplete Data: A Joint Missing Imputation Perspective

Data from multi-modality provide complementary information in clinical prediction, but missing data in clinical cohorts limits the number of subjects in multi-modal learning context. Multi-modal missing imputation is challenging with…

Image and Video Processing · Electrical Eng. & Systems 2021-07-27 Riqiang Gao, Yucheng Tang, Kaiwen Xu, Ho Hin Lee, Steve Deppen, Kim Sandler, Pierre Massion, Thomas A. Lasko, Yuankai Huo, Bennett A. Landman

Imputation techniques on missing values in breast cancer treatment and fertility data

In the last few years, clinical decision support using data mining techniques has offered a more intelligent way to reduce decision error. However, clinical datasets often suffer from high missingness, which adversely impacts the quality of…

Machine Learning · Computer Science 2020-11-20 Xuetong Wu, Hadi Akbarzadeh Khorshidi, Uwe Aickelin, Zobaida Edib, Michelle Peate

Meta-Imputation Balanced (MIB): An Ensemble Approach for Handling Missing Data in Biomedical Machine Learning

Missing data represents a fundamental challenge in machine learning applications, often reducing model performance and reliability. This problem is particularly acute in fields like bioinformatics and clinical machine learning, where…

Machine Learning · Computer Science 2025-09-04 Fatemeh Azad, Zoran Bosnić, Matjaž Kukar

Physics-Guided Conditional Diffusion Networks for Microwave Image Reconstruction

A conditional latent-diffusion based framework for solving the electromagnetic inverse scattering problem associated with microwave imaging is introduced. This generative machine-learning model explicitly mirrors the non-uniqueness of the…

Image and Video Processing · Electrical Eng. & Systems 2025-10-30 Shirin Chehelgami, Joe LoVetri, Vahab Khoshdel

Sparse tree-based clustering of microbiome data to characterize microbiome heterogeneity in pancreatic cancer

There is a keen interest in characterizing variation in the microbiome across cancer patients, given increasing evidence of its important role in determining treatment outcomes. Here our goal is to discover subgroups of patients with…

Applications · Statistics 2022-12-06 Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert Jenq, Christine Peterson

Image Classification Using a Diffusion Model as a Pre-Training Model

In this paper, we propose a diffusion model that integrates a representation-conditioning mechanism, where the representations derived from a Vision Transformer (ViT) are used to condition the internal process of a Transformer-based…

Machine Learning · Computer Science 2025-05-13 Kosuke Ukita, Ye Xiaolong, Tsuyoshi Okita

MIMIX: a Bayesian Mixed-Effects Model for Microbiome Data from Designed Experiments

Recent advances in bioinformatics have made high-throughput microbiome data widely available, and new statistical tools are required to maximize the information gained from these data. For example, analysis of high-dimensional microbiome…

Methodology · Statistics 2017-03-23 Neal S. Grantham, Brian J. Reich, Elizabeth T. Borer, Kevin Gross

Model-Based Diffusion for Trajectory Optimization

Recent advances in diffusion models have demonstrated their strong capabilities in generating high-fidelity samples from complex distributions through an iterative refinement process. Despite the empirical success of diffusion models in…

Robotics · Computer Science 2024-07-03 Chaoyi Pan, Zeji Yi, Guanya Shi, Guannan Qu

Bayesian Mixed Effects Models for Zero-inflated Compositions in Microbiome Data Analysis

Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with…

Methodology · Statistics 2021-03-18 Boyu Ren, Sergio Bacallado, Stefano Favaro, Tommi Vatanen, Curtis Huttenhower, Lorenzo Trippa

MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer

Esophageal cancer is one of the most common types of cancer worldwide and ranks sixth in cancer-related mortality. Accurate computer-assisted diagnosis of cancer progression can help physicians effectively customize personalized treatment…

Image and Video Processing · Electrical Eng. & Systems 2024-05-17 Chengyu Wu, Chengkai Wang, Yaqi Wang, Huiyu Zhou, Yatao Zhang, Qifeng Wang, Shuai Wang

Integration of multiview microbiome data for deciphering microbiome-metabolome-disease pathways

The intricate interplay between host organisms and their gut microbiota has catalyzed research into the microbiome's role in disease, shedding light on novel aspects of disease pathogenesis. However, the mechanisms through which the…

Methodology · Statistics 2024-02-20 Lei Fang, Yue Wang, Chenglong Ye

impuTMAE: Multi-modal Transformer with Masked Pre-training for Missing Modalities Imputation in Cancer Survival Prediction

The use of diverse modalities, such as omics, medical images, and clinical data can not only improve the performance of prognostic models but also deepen an understanding of disease mechanisms and facilitate the development of novel…

Image and Video Processing · Electrical Eng. & Systems 2025-08-14 Maria Boyko, Aleksandra Beliaeva, Dmitriy Kornilov, Alexander Bernstein, Maxim Sharaev

stMCDI: Masked Conditional Diffusion Model with Graph Neural Network for Spatial Transcriptomics Data Imputation

Spatially resolved transcriptomics represents a significant advancement in single-cell analysis by offering both gene expression data and their corresponding physical locations. However, this high degree of spatial resolution entails a…

Genomics · Quantitative Biology 2024-03-19 Xiaoyu Li, Wenwen Min, Shunfang Wang, Changmiao Wang, Taosheng Xu

Diffusion models for missing value imputation in tabular data

Missing value imputation in machine learning is the task of estimating the missing values in the dataset accurately using available information. In this task, several deep generative modeling methods have been proposed and demonstrated…

Machine Learning · Computer Science 2023-03-14 Shuhan Zheng, Nontawat Charoenphakdee

Time-dependent Iterative Imputation for Multivariate Longitudinal Clinical Data

Missing data is a major challenge in clinical research. In electronic medical records, often a large fraction of the values in laboratory tests and vital signs are missing. The missingness can lead to biased estimates and limit our ability…

Machine Learning · Computer Science 2023-04-18 Omer Noy, Ron Shamir