Computer Vision and Pattern Recognition · Computer Science
Scalable Diffusion Models with State Space Backbone
Zhengcong Fei, Mingyuan Fan, Changqian Yu, Junshi Huang
2024-03-29
Computer Vision and Pattern Recognition · Computer Science
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu +6
2025-04-08
Computer Vision and Pattern Recognition · Computer Science
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
Yuchuan Tian, Zhijun Tu, Hanting Chen, Jie Hu +2
2024-10-31
Computer Vision and Pattern Recognition · Computer Science
Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Hao Li, Shamit Lal, Zhiheng Li, Yusheng Xie +8
2024-12-18
Computer Vision and Pattern Recognition · Computer Science
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu, Zilong Huang, Bencheng Liao, Jun Hao Liew +3
2024-11-28
Computer Vision and Pattern Recognition · Computer Science
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers
Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov, Dogyun Park +5
2026-03-13
Computer Vision and Pattern Recognition · Computer Science
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
Emiel Hoogeboom, Thomas Mensink, Jonathan Heek, Kay Lamerigts +2
2025-03-25
Computer Vision and Pattern Recognition · Computer Science
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers
Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi +2
2024-09-24
Computer Vision and Pattern Recognition · Computer Science
On the Scalability of Diffusion-based Text-to-Image Generation
Hao Li, Yang Zou, Ying Wang, Orchid Majumder +6
2024-04-04
Machine Learning · Computer Science
Scaling Diffusion Transformers Efficiently via $\mu$P
Chenyu Zheng, Xinyu Zhang, Rongzhen Wang, Wei Huang +4
2025-11-03
Computer Vision and Pattern Recognition · Computer Science
PixelDiT: Pixel Diffusion Transformers for Image Generation
Yongsheng Yu, Wei Xiong, Weili Nie, Yichen Sheng +2
2026-04-17
Computer Vision and Pattern Recognition · Computer Science
DiT4Edit: Diffusion Transformer for Image Editing
Kunyu Feng, Yue Ma, Bingyuan Wang, Chenyang Qi +3
2024-11-08
Computer Vision and Pattern Recognition · Computer Science
Effective Diffusion Transformer Architecture for Image Super-Resolution
Kun Cheng, Lei Yu, Zhijun Tu, Xiao He +6
2024-10-01
Computer Vision and Pattern Recognition · Computer Science
Scaling Laws For Diffusion Transformers
Zhengyang Liang, Hao He, Ceyuan Yang, Bo Dai
2026-03-05
Computer Vision and Pattern Recognition · Computer Science
Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic Study
Xibo Sun, Jiarui Fang, Aoyu Li, Jinzhe Pan
2024-11-22
Computer Vision and Pattern Recognition · Computer Science
A training-free framework for high-fidelity appearance transfer via diffusion transformers
Shengrong Gu, Ye Wang, Song Wu, Rui Ma +3
2026-03-31
Computer Vision and Pattern Recognition · Computer Science
$\Delta$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao +4
2024-06-04
Computer Vision and Pattern Recognition · Computer Science
Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
Chaofan Gan, Yuanpeng Tu, Xi Chen, Tieyuan Chen +3
2025-11-11
Computer Vision and Pattern Recognition · Computer Science
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
Dongting Hu, Aarush Gupta, Magzhan Gabidolla, Arpit Sahni +11
2026-02-12
Computer Vision and Pattern Recognition · Computer Science
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
Shentong Mo, Enze Xie, Ruihang Chu, Lewei Yao +3
2023-07-06
Computer Vision and Pattern Recognition · Computer Science
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker, Abhinav Mehrotra, Ruchika Chavhan, Malcolm Chadwick +4
2025-08-12
Image and Video Processing · Electrical Eng. & Systems
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
Junqi Shi, Ming Lu, Xingchen Li, Anle Ke +2
2026-03-16
Computer Vision and Pattern Recognition · Computer Science
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai, Qihang Fan, Xuefeng Hu, Zhenheng Yang +2
2025-09-23