English
Related papers

Related papers: MSF: Efficient Diffusion Model Via Multi-Scale Lat…

200 papers

We propose a novel framework for representing neural fields on triangle meshes that is multi-resolution across both spatial and frequency domains. Inspired by the Neural Fourier Filter Bank (NFFB), our architecture decomposes the spatial…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Avigail Cohen Rimon , Tal Shnitzer , Mirela Ben Chen

Diffusion models, emerging as powerful deep generative tools, excel in various applications. They operate through a two-steps process: introducing noise into training samples and then employing a model to convert random noise into new…

Computer Vision and Pattern Recognition · Computer Science 2026-02-13 Huijie Zhang , Yifu Lu , Ismail Alkhouri , Saiprasad Ravishankar , Dogyoon Song , Qing Qu

We present a novel generative modeling framework,Wavelet-Fourier-Diffusion, which adapts the diffusion paradigm to hybrid frequency representations in order to synthesize high-quality, high-fidelity images with improved spatial…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Andrew Kiruluta , Andreas Lemos

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a…

Computer Vision and Pattern Recognition · Computer Science 2022-04-14 Robin Rombach , Andreas Blattmann , Dominik Lorenz , Patrick Esser , Björn Ommer

Diffusion models have achieved remarkable success in image generation but their practical application is often hindered by the slow sampling speed. Prior efforts of improving efficiency primarily focus on compressing models or reducing the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-16 Jiuyi Xu , Qing Jin , Meida Chen , Andrew Feng , Yang Sui , Yangming Shi

Discrete diffusion models are a powerful class of generative models with strong performance across many domains. For efficiency, however, discrete diffusion typically parameterizes the generative (reverse) process with factorized…

Machine Learning · Statistics 2026-05-19 Grigory Bartosh , Teodora Pandeva , Sushrut Karmalkar , Javier Zazo

Feature Transformation (FT) crafts new features from original ones via mathematical operations to enhance dataset expressiveness for downstream models. However, existing FT methods exhibit critical limitations: discrete search struggles…

Machine Learning · Computer Science 2025-05-22 Nanxu Gong , Zijun Li , Sixun Dong , Haoyue Bai , Wangyang Ying , Xinyuan Wang , Yanjie Fu

Diffusion models are rising as a powerful solution for high-fidelity image generation, which exceeds GANs in quality in many circumstances. However, their slow training and inference speed is a huge bottleneck, blocking them from being used…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Hao Phung , Quan Dao , Anh Tran

Latent diffusion models (LDMs) dominate high-quality image generation, yet integrating representation learning with generative modeling remains a challenge. We introduce a novel generative image modeling framework that seamlessly bridges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Theodoros Kouzelis , Efstathios Karypidis , Ioannis Kakogeorgiou , Spyros Gidaris , Nikos Komodakis

Diffusion models generate new samples by progressively decreasing the noise from the initially provided random distribution. This inference procedure generally utilizes a trained neural network numerous times to obtain the final output,…

Diffusion models, which learn to reverse a signal destruction process to generate new data, typically require the signal at each step to have the same dimension. We argue that, considering the spatial redundancy in image signals, there is…

Machine Learning · Computer Science 2022-11-30 Han Zhang , Ruili Feng , Zhantao Yang , Lianghua Huang , Yu Liu , Yifei Zhang , Yujun Shen , Deli Zhao , Jingren Zhou , Fan Cheng

Image tokenization plays a central role in modern generative modeling by mapping visual inputs into compact representations that serve as an intermediate signal between pixels and generative models. Diffusion-based decoders have recently…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Chuhan Wang , Hao Chen

Diffusion models are emerging as powerful solutions for generating high-fidelity and diverse images, often surpassing GANs under many circumstances. However, their slow inference speed hinders their potential for real-time applications. To…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Luan Thanh Trinh , Tomoki Hamagami

Diffusion models have emerged as a leading technique for generating images due to their ability to create high-resolution and realistic images. Despite their strong performance, diffusion models still struggle in managing image collections…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Hailong Yang , Te Zhang , Kup-sze Choi , Zhaohong Deng

Latent diffusion models excel at generating high-quality images but lose the benefits of end-to-end modeling. They discard information during image encoding, require a separately trained decoder, and model an auxiliary distribution to the…

Computer Vision and Pattern Recognition · Computer Science 2026-02-13 Alan Baade , Eric Ryan Chan , Kyle Sargent , Changan Chen , Justin Johnson , Ehsan Adeli , Li Fei-Fei

Score-based stochastic denoising models have recently been demonstrated as powerful machine learning tools for conditional and unconditional image generation. The existing methods are based on a forward stochastic process wherein the…

Generating high-dimensional visual modalities is a computationally intensive task. A common solution is progressive generation, where the outputs are synthesized in a coarse-to-fine spectral autoregressive manner. While diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Moayed Haji-Ali , Willi Menapace , Ivan Skorokhodov , Arpit Sahni , Sergey Tulyakov , Vicente Ordonez , Aliaksandr Siarohin

Denoising diffusion probabilistic models (DDPMs) have achieved impressive performance on various image generation tasks, including image super-resolution. By learning to reverse the process of gradually diffusing the data distribution into…

Image and Video Processing · Electrical Eng. & Systems 2023-07-25 Kai Zhao , Alex Ling Yu Hung , Kaifeng Pang , Haoxin Zheng , Kyunghyun Sung

We develop a neural network architecture which, trained in an unsupervised manner as a denoising diffusion model, simultaneously learns to both generate and segment images. Learning is driven entirely by the denoising diffusion objective,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Xin Yuan , Michael Maire

High-resolution image synthesis remains a core challenge in generative modeling, particularly in balancing computational efficiency with the preservation of fine-grained visual detail. We present Latent Wavelet Diffusion (LWD), a…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Luigi Sigillo , Shengfeng He , Danilo Comminiello
‹ Prev 1 2 3 10 Next ›