Related papers: Diffusion Transformers with Hybrid Conditioning fo…

Sampling 3D Molecular Conformers with Diffusion Transformers

Diffusion Transformers (DiTs) have demonstrated strong performance in generative modeling, particularly in image synthesis, making them a compelling choice for molecular conformer generation. However, applying DiTs to molecules introduces…

Machine Learning · Computer Science 2025-11-12 J. Thorben Frank , Winfried Ripken , Gregor Lied , Klaus-Robert Müller , Oliver T. Unke , Stefan Chmiela

Dynamic Diffusion Transformer

Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs stem from the static inference…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Wangbo Zhao , Yizeng Han , Jiasheng Tang , Kai Wang , Yibing Song , Gao Huang , Fan Wang , Yang You

Diffusing the Optimal Topology: A Generative Optimization Approach

Topology Optimization seeks to find the best design that satisfies a set of constraints while maximizing system performance. Traditional iterative optimization methods like SIMP can be computationally expensive and get stuck in local…

Machine Learning · Computer Science 2023-03-20 Giorgio Giannone , Faez Ahmed

Transformer-based Topology Optimization

Topology optimization enables the design of highly efficient and complex structures, but conventional iterative methods, such as SIMP-based approaches, often suffer from high computational costs and sensitivity to initial conditions.…

Computational Engineering, Finance, and Science · Computer Science 2025-09-18 Aaron Lutheran , Srijan Das , Alireza Tabarraei

SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

Recent advances in diffusion transformers (DiTs) have set new standards in image generation, yet remain impractical for on-device deployment due to their high computational and memory costs. In this work, we present an efficient DiT…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Dongting Hu , Aarush Gupta , Magzhan Gabidolla , Arpit Sahni , Huseyin Coskun , Yanyu Li , Yerlan Idelbayev , Ahsan Mahmood , Aleksei Lebedev , Dishani Lahiri , Anujraaj Goyal , Ju Hu , Mingming Gong , Sergey Tulyakov , Anil Kag

Structure-guided Diffusion Transformer for Low-Light Image Enhancement

While the diffusion transformer (DiT) has become a focal point of interest in recent years, its application in low-light image enhancement remains a blank area for exploration. Current methods recover the details from low-light images while…

Computer Vision and Pattern Recognition · Computer Science 2026-01-14 Xiangchen Yin , Zhenda Yu , Longtao Jiang , Xin Gao , Xiao Sun , Zhi Liu , Xun Yang

Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach

Image fusion aims to blend complementary information from multiple sensing modalities, yet existing approaches remain limited in robustness, adaptability, and controllability. Most current fusion networks are tailored to specific tasks and…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Jiayang Li , Chengjie Jiang , Junjun Jiang , Pengwei Liang , Jiayi Ma , Liqiang Nie

MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model

Transformer architectures, particularly Diffusion Transformers (DiTs), have become widely used in diffusion and flow-matching models due to their strong performance compared to convolutional UNets. However, the isotropic design of DiTs…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Quan Dao , Dimitris Metaxas

Physics-Informed Transformer for Real-Time High-Fidelity Topology Optimization

Topology optimization is used for the design of high-performance structures but remains fundamentally limited by its iterative nature, requiring repeated finite element analyses that prevent real-time deployment and large-scale design…

Computational Engineering, Finance, and Science · Computer Science 2026-04-07 Aaron Lutheran , Srijan Das , Alireza Tabarraei

SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

We present Scalable Interpolant Transformers (SiT), a family of generative models built on the backbone of Diffusion Transformers (DiT). The interpolant framework, which allows for connecting two distributions in a more flexible way than…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Nanye Ma , Mark Goldstein , Michael S. Albergo , Nicholas M. Boffi , Eric Vanden-Eijnden , Saining Xie

On Statistical Rates of Conditional Diffusion Transformers: Approximation, Estimation and Minimax Optimality

We investigate the approximation and estimation rates of conditional diffusion transformers (DiTs) with classifier-free guidance. We present a comprehensive analysis for ``in-context'' conditional DiTs under four common data assumptions. We…

Machine Learning · Statistics 2024-11-27 Jerry Yao-Chieh Hu , Weimin Wu , Yi-Chen Lee , Yu-Chao Huang , Minshuo Chen , Han Liu

JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers

We present JointDiT, a diffusion transformer that models the joint distribution of RGB and depth. By leveraging the architectural benefit and outstanding image prior of the state-of-the-art diffusion transformer, JointDiT not only generates…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Kwon Byung-Ki , Qi Dai , Lee Hyoseok , Chong Luo , Tae-Hyun Oh

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification

Diffusion Transformers (DiT) have attracted significant attention in research. However, they suffer from a slow convergence rate. In this paper, we aim to accelerate DiT training without any architectural modification. We identify the…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Jingfeng Yao , Wang Cheng , Wenyu Liu , Xinggang Wang

SDiT: Semantic Region-Adaptive for Diffusion Transformers

Diffusion Transformers (DiTs) achieve state-of-the-art performance in text-to-image synthesis but remain computationally expensive due to the iterative nature of denoising and the quadratic cost of global attention. In this work, we observe…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Bowen Lin , Fanjiang Ye , Yihua Liu , Zhenghui Guo , Boyuan Zhang , Weijian Zheng , Yufan Xu , Tiancheng Xing , Yuke Wang , Chengming Zhang

DyDiT++: Diffusion Transformers with Timestep and Spatial Dynamics for Efficient Visual Generation

Diffusion Transformer (DiT), an emerging diffusion model for visual generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs primarily stem from the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Wangbo Zhao , Yizeng Han , Jiasheng Tang , Kai Wang , Hao Luo , Yibing Song , Gao Huang , Fan Wang , Yang You

Diffusion Models Beat GANs on Topology Optimization

Structural topology optimization, which aims to find the optimal physical structure that maximizes mechanical performance, is vital in engineering design applications in aerospace, mechanical, and civil engineering. Generative adversarial…

Machine Learning · Computer Science 2022-12-07 François Mazé , Faez Ahmed

Elastic Diffusion Transformer

Diffusion Transformers (DiT) have demonstrated remarkable generative capabilities but remain highly computationally expensive. Previous acceleration methods, such as pruning and distillation, typically rely on a fixed computational…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Jiangshan Wang , Zeqiang Lai , Jiarui Chen , Jiayi Guo , Hang Guo , Xiu Li , Xiangyu Yue , Chunchao Guo

All-atom Diffusion Transformers: Unified generative modelling of molecules and materials

Diffusion models are the standard toolkit for generative modelling of 3D atomic systems. However, for different types of atomic systems -- such as molecules and materials -- the generative processes are usually highly specific to the target…

Machine Learning · Computer Science 2025-05-23 Chaitanya K. Joshi , Xiang Fu , Yi-Lun Liao , Vahe Gharakhanyan , Benjamin Kurt Miller , Anuroop Sriram , Zachary W. Ulissi

Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition

Diffusion transformer (DiT) models have achieved remarkable success in image generation, thanks for their exceptional generative capabilities and scalability. Nonetheless, the iterative nature of diffusion models (DMs) results in high…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Zhiyuan Chen , Keyi Li , Yifan Jia , Le Ye , Yufei Ma

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Despite their remarkable performance, modern Diffusion Transformers are hindered by substantial resource requirements during inference, stemming from the fixed and large amount of compute needed for each denoising step. In this work, we…

Machine Learning · Computer Science 2025-02-28 Sotiris Anagnostidis , Gregor Bachmann , Yeongmin Kim , Jonas Kohler , Markos Georgopoulos , Artsiom Sanakoyeu , Yuming Du , Albert Pumarola , Ali Thabet , Edgar Schönfeld