Related papers: Heterogeneous Decentralized Diffusion Models

Decentralized Diffusion Models

Large-scale AI model training divides work across thousands of GPUs, then synchronizes gradients across them at each step. This incurs a significant network burden that only centralized, monolithic clusters can support, driving up…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 David McAllister, Matthew Tancik, Jiaming Song, Angjoo Kanazawa

Decentralized Training of Foundation Models in Heterogeneous Environments

Training foundation models, such as GPT-3 and PaLM, can be extremely expensive, often involving tens of thousands of GPUs running continuously for months. These models are typically trained in specialized clusters featuring fast,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-22 Binhang Yuan, Yongjun He, Jared Quincy Davis, Tianyi Zhang, Tri Dao, Beidi Chen, Percy Liang, Christopher Re, Ce Zhang

FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models

We introduce FedDM, a novel training framework designed for the federated training of diffusion models. Our theoretical analysis establishes the convergence of diffusion models when trained in a federated setting, presenting the specific…

Machine Learning · Computer Science 2024-07-23 Jayneel Vora, Nader Bouacida, Aditya Krishnan, Prasant Mohapatra

Training Diffusion Models with Federated Learning

The training of diffusion-based models for image generation is predominantly controlled by a select few Big Tech companies, raising concerns about privacy, copyright, and data authority due to their lack of transparency regarding training…

Machine Learning · Computer Science 2024-06-19 Matthijs de Goede, Bart Cox, Jérémie Decouchant

FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression

To alleviate hardware scarcity in training large deep neural networks (DNNs), particularly large language models (LLMs), we present FusionLLM, a decentralized training system designed and implemented for training DNNs using geo-distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-17 Zhenheng Tang, Xueze Kang, Yiming Yin, Xinglin Pan, Yuxin Wang, Xin He, Qiang Wang, Rongfei Zeng, Kaiyong Zhao, Shaohuai Shi, Amelie Chi Zhou, Bo Li, Bingsheng He, Xiaowen Chu

Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling

Diffusion models have achieved remarkable progress in high-fidelity image, video, and audio generation, yet inference remains computationally expensive. Nevertheless, current diffusion acceleration methods based on distributed parallelism…

Computer Vision and Pattern Recognition · Computer Science 2026-02-26 Euisoo Jung, Byunghyun Kim, Hyunjin Kim, Seonghye Cho, Jae-Gil Lee

Improving Progressive Generation with Decomposable Flow Matching

Generating high-dimensional visual modalities is a computationally intensive task. A common solution is progressive generation, where the outputs are synthesized in a coarse-to-fine spectral autoregressive manner. While diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov, Arpit Sahni, Sergey Tulyakov, Vicente Ordonez, Aliaksandr Siarohin

Federated Learning for Diffusion Models

Diffusion models are powerful generative models that can produce highly realistic samples for various tasks. Typically, these models are constructed using centralized, independently and identically distributed (IID) training data. However,…

Machine Learning · Computer Science 2025-03-14 Zihao Peng, Xijun Wang, Shengbo Chen, Hong Rao, Cong Shen

HADFL: Heterogeneity-aware Decentralized Federated Learning Framework

Federated learning (FL) supports training models on geographically distributed devices. However, traditional FL systems adopt a centralized synchronous strategy, putting high communication pressure and model generalization challenge.…

Machine Learning · Computer Science 2021-11-17 Jing Cao, Zirui Lian, Weihong Liu, Zongwei Zhu, Cheng Ji

Fixed Point Diffusion Models

We introduce the Fixed Point Diffusion Model (FPDM), a novel approach to image generation that integrates the concept of fixed point solving into the framework of diffusion-based generative modeling. Our approach embeds an implicit fixed…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Xingjian Bai, Luke Melas-Kyriazi

Phoenix: A Federated Generative Diffusion Model

Generative AI has made impressive strides in enabling users to create diverse and realistic visual content such as images, videos, and audio. However, training generative models on large centralized datasets can pose challenges in terms of…

Machine Learning · Computer Science 2025-10-03 Fiona Victoria Stanley Jothiraj, Afra Mashhadi

HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models

Training large-scale models relies on a vast number of computing resources. For example, training the GPT-4 model (1.8 trillion parameters) requires 25000 A100 GPUs. It is a challenge to build a large-scale cluster with one type of…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-12 Si Xu, Zixiao Huang, Yan Zeng, Shengen Yan, Xuefei Ning, Quanlu Zhang, Haolin Ye, Sipei Gu, Chunsheng Shui, Zhezheng Lin, Hao Zhang, Sheng Wang, Guohao Dai, Yu Wang

From Diffusion to One-Step Generation: A Comparative Study of Flow-Based Models with Application to Image Inpainting

We present a comprehensive comparative study of three generative modeling paradigms: Denoising Diffusion Probabilistic Models (DDPM), Conditional Flow Matching (CFM), and MeanFlow. While DDPM and CFM require iterative sampling, MeanFlow…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Umang Agarwal, Rudraksh Sangore, Sumit Laddha

Discrete Meanflow Training Curriculum

Flow-based image generative models exhibit stable training and produce high quality samples when using multi-step sampling procedures. One-step generative models can produce high quality image samples but can be difficult to optimize as…

Machine Learning · Computer Science 2026-04-13 Chia-Hong Hsu, Frank Wood

Better Diffusion Models Further Improve Adversarial Training

It has been recognized that the data generated by the denoising diffusion probabilistic model (DDPM) improves adversarial training. After two years of rapid development in diffusion models, a question naturally arises: can better diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng Yan

Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

Denoising diffusion probabilistic models (DDPMs) have achieved unprecedented success in computer vision. However, they remain underutilized in medical imaging, a field crucial for disease diagnosis and treatment planning. This is primarily…

Image and Video Processing · Electrical Eng. & Systems 2025-08-22 Hongxu Jiang, Muhammad Imran, Teng Zhang, Yuyin Zhou, Muxuan Liang, Kuang Gong, Wei Shao

Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review

The diffusion model has recently emerged as a potent approach in computer vision, demonstrating remarkable performances in the field of generative artificial intelligence. Capable of producing high-quality synthetic images, diffusion models…

Image and Video Processing · Electrical Eng. & Systems 2025-05-14 Abdullah, Tao Huang, Ickjai Lee, Euijoon Ahn

DFLOP: A Data-driven Framework for Multimodal LLM Training Pipeline Optimization

Multimodal Large Language Models (MLLMs) have achieved remarkable advances by integrating text, image, and audio understanding within a unified architecture. However, existing distributed training frameworks remain fundamentally data-blind:…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-20 Hyeonjun An, Sihyun Kim, Chaerim Lim, Hyunjoon Kim, Rathijit Sen, Sangmin Jung, Hyeonsoo Lee, Dongwook Kim, Takki Yu, Jinkyu Jeong, Youngsok Kim, Kwanghyun Park

FedPhD: Federated Pruning with Hierarchical Learning of Diffusion Models

Federated Learning (FL), as a distributed learning paradigm, trains models over distributed clients' data. FL is particularly beneficial for distributed training of Diffusion Models (DMs), which are high-quality image generators that…

Machine Learning · Computer Science 2025-07-10 Qianyu Long, Qiyuan Wang, Christos Anagnostopoulos, Daning Bi

Efficiency vs. Fidelity: A Comparative Analysis of Diffusion Probabilistic Models and Flow Matching on Low-Resource Hardware

Denoising Diffusion Probabilistic Models (DDPMs) have established a new state-of-the-art in generative image synthesis, yet their deployment is hindered by significant computational overhead during inference, often requiring up to 1,000…

Machine Learning · Computer Science 2025-11-25 Srishti Gupta, Yashasvee Taiwade