English
Related papers

Related papers: Diffusion Transformers with Hybrid Conditioning fo…

200 papers

Diffusion Transformers (DiTs) have demonstrated strong performance in generative modeling, particularly in image synthesis, making them a compelling choice for molecular conformer generation. However, applying DiTs to molecules introduces…

Machine Learning · Computer Science 2025-11-12 J. Thorben Frank , Winfried Ripken , Gregor Lied , Klaus-Robert Müller , Oliver T. Unke , Stefan Chmiela

Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs stem from the static inference…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Wangbo Zhao , Yizeng Han , Jiasheng Tang , Kai Wang , Yibing Song , Gao Huang , Fan Wang , Yang You

Topology Optimization seeks to find the best design that satisfies a set of constraints while maximizing system performance. Traditional iterative optimization methods like SIMP can be computationally expensive and get stuck in local…

Machine Learning · Computer Science 2023-03-20 Giorgio Giannone , Faez Ahmed

Topology optimization enables the design of highly efficient and complex structures, but conventional iterative methods, such as SIMP-based approaches, often suffer from high computational costs and sensitivity to initial conditions.…

Computational Engineering, Finance, and Science · Computer Science 2025-09-18 Aaron Lutheran , Srijan Das , Alireza Tabarraei

Recent advances in diffusion transformers (DiTs) have set new standards in image generation, yet remain impractical for on-device deployment due to their high computational and memory costs. In this work, we present an efficient DiT…

While the diffusion transformer (DiT) has become a focal point of interest in recent years, its application in low-light image enhancement remains a blank area for exploration. Current methods recover the details from low-light images while…

Computer Vision and Pattern Recognition · Computer Science 2026-01-14 Xiangchen Yin , Zhenda Yu , Longtao Jiang , Xin Gao , Xiao Sun , Zhi Liu , Xun Yang

Image fusion aims to blend complementary information from multiple sensing modalities, yet existing approaches remain limited in robustness, adaptability, and controllability. Most current fusion networks are tailored to specific tasks and…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Jiayang Li , Chengjie Jiang , Junjun Jiang , Pengwei Liang , Jiayi Ma , Liqiang Nie

Transformer architectures, particularly Diffusion Transformers (DiTs), have become widely used in diffusion and flow-matching models due to their strong performance compared to convolutional UNets. However, the isotropic design of DiTs…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Quan Dao , Dimitris Metaxas

Topology optimization is used for the design of high-performance structures but remains fundamentally limited by its iterative nature, requiring repeated finite element analyses that prevent real-time deployment and large-scale design…

Computational Engineering, Finance, and Science · Computer Science 2026-04-07 Aaron Lutheran , Srijan Das , Alireza Tabarraei

We present Scalable Interpolant Transformers (SiT), a family of generative models built on the backbone of Diffusion Transformers (DiT). The interpolant framework, which allows for connecting two distributions in a more flexible way than…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Nanye Ma , Mark Goldstein , Michael S. Albergo , Nicholas M. Boffi , Eric Vanden-Eijnden , Saining Xie

We investigate the approximation and estimation rates of conditional diffusion transformers (DiTs) with classifier-free guidance. We present a comprehensive analysis for ``in-context'' conditional DiTs under four common data assumptions. We…

Machine Learning · Statistics 2024-11-27 Jerry Yao-Chieh Hu , Weimin Wu , Yi-Chen Lee , Yu-Chao Huang , Minshuo Chen , Han Liu

We present JointDiT, a diffusion transformer that models the joint distribution of RGB and depth. By leveraging the architectural benefit and outstanding image prior of the state-of-the-art diffusion transformer, JointDiT not only generates…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Kwon Byung-Ki , Qi Dai , Lee Hyoseok , Chong Luo , Tae-Hyun Oh

Diffusion Transformers (DiT) have attracted significant attention in research. However, they suffer from a slow convergence rate. In this paper, we aim to accelerate DiT training without any architectural modification. We identify the…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Jingfeng Yao , Wang Cheng , Wenyu Liu , Xinggang Wang

Diffusion Transformers (DiTs) achieve state-of-the-art performance in text-to-image synthesis but remain computationally expensive due to the iterative nature of denoising and the quadratic cost of global attention. In this work, we observe…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Bowen Lin , Fanjiang Ye , Yihua Liu , Zhenghui Guo , Boyuan Zhang , Weijian Zheng , Yufan Xu , Tiancheng Xing , Yuke Wang , Chengming Zhang

Diffusion Transformer (DiT), an emerging diffusion model for visual generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs primarily stem from the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Wangbo Zhao , Yizeng Han , Jiasheng Tang , Kai Wang , Hao Luo , Yibing Song , Gao Huang , Fan Wang , Yang You

Structural topology optimization, which aims to find the optimal physical structure that maximizes mechanical performance, is vital in engineering design applications in aerospace, mechanical, and civil engineering. Generative adversarial…

Machine Learning · Computer Science 2022-12-07 François Mazé , Faez Ahmed

Diffusion Transformers (DiT) have demonstrated remarkable generative capabilities but remain highly computationally expensive. Previous acceleration methods, such as pruning and distillation, typically rely on a fixed computational…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Jiangshan Wang , Zeqiang Lai , Jiarui Chen , Jiayi Guo , Hang Guo , Xiu Li , Xiangyu Yue , Chunchao Guo

Diffusion models are the standard toolkit for generative modelling of 3D atomic systems. However, for different types of atomic systems -- such as molecules and materials -- the generative processes are usually highly specific to the target…

Diffusion transformer (DiT) models have achieved remarkable success in image generation, thanks for their exceptional generative capabilities and scalability. Nonetheless, the iterative nature of diffusion models (DMs) results in high…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Zhiyuan Chen , Keyi Li , Yifan Jia , Le Ye , Yufei Ma

Despite their remarkable performance, modern Diffusion Transformers are hindered by substantial resource requirements during inference, stemming from the fixed and large amount of compute needed for each denoising step. In this work, we…

‹ Prev 1 2 3 10 Next ›