English
Related papers

Related papers: Diffusion-based Visual Anagram as Multi-task Learn…

200 papers

We address the problem of synthesizing multi-view optical illusions: images that change appearance upon a transformation, such as a flip or rotation. We propose a simple, zero-shot method for obtaining these illusions from off-the-shelf…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Daniel Geng , Inbum Park , Andrew Owens

Diffusion models have emerged as a key pillar of foundation models in visual domains. One of their critical applications is to universally solve different downstream inverse tasks via a single diffusion prior without re-training for each…

Machine Learning · Computer Science 2023-10-03 Morteza Mardani , Jiaming Song , Jan Kautz , Arash Vahdat

Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Xianghui Yang , Yan Zuo , Sameera Ramasinghe , Loris Bazzani , Gil Avraham , Anton van den Hengel

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Mubarak Shah

Diffusion models have demonstrated remarkable performance in generation tasks. Nevertheless, explaining the diffusion process remains challenging due to it being a sequence of denoising noisy images that are difficult for experts to…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Ji-Hoon Park , Yeong-Joon Ju , Seong-Whan Lee

Visual illusions in humans arise when interpreting out-of-distribution stimuli: if the observer is adapted to certain statistics, perception of outliers deviates from reality. Recent studies have shown that artificial neural networks (ANNs)…

Computer Vision and Pattern Recognition · Computer Science 2024-12-16 Alex Gomez-Villa , Kai Wang , Alejandro C. Parraga , Bartlomiej Twardowski , Jesus Malo , Javier Vazquez-Corral , Joost van de Weijer

Creating novel images by fusing visual cues from multiple sources is a fundamental yet underexplored problem in image-to-image generation, with broad applications in artistic creation, virtual reality and visual media. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zeren Xiong , Yue Yu , Zedong Zhang , Shuo Chen , Jian Yang , Jun Li

In supervised learning for image denoising, usually the paired clean images and noisy images are collected or synthesised to train a denoising model. L2 norm loss or other distance functions are used as the objective function for training.…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Yutong Xie , Minne Yuan , Bin Dong , Quanzheng Li

Recent advances in diffusion models have achieved remarkable success in isolated computer vision tasks such as text-to-image generation, depth estimation, and optical flow. However, these models are often restricted by a…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Yilin Gao , Shuguang Dou , Junzhou Li , Zhiheng Yu , Yin Li , Dongsheng Jiang , Shugong Xu

With the success of image generation, generative diffusion models are increasingly adopted for discriminative tasks, as pixel generation provides a unified perception interface. However, directly repurposing the generative denoising process…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Ziqi Pang , Xin Xu , Yu-Xiong Wang

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Zhengxiong Luo , Dayou Chen , Yingya Zhang , Yan Huang , Liang Wang , Yujun Shen , Deli Zhao , Jingren Zhou , Tieniu Tan

Image inversion is a fundamental task in generative models, aiming to map images back to their latent representations to enable downstream applications such as editing, restoration, and style transfer. This paper provides a comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yinan Chen , Jiangning Zhang , Yali Bi , Xiaobin Hu , Teng Hu , Zhucun Xue , Ran Yi , Yong Liu , Ying Tai

While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which addresses both families of tasks simultaneously. We…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Soumik Mukhopadhyay , Matthew Gwilliam , Yosuke Yamaguchi , Vatsal Agarwal , Namitha Padmanabhan , Archana Swaminathan , Tianyi Zhou , Jun Ohya , Abhinav Shrivastava

Complex degradations like noise, blur, and low resolution are typical challenges in real world image fusion tasks, limiting the performance and practicality of existing methods. End to end neural network based approaches are generally…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yu Shi , Yu Liu , Zhong-Cheng Wu , Juan Cheng , Huafeng Li , Xun Chen

Cross-Modal learning tasks have picked up pace in recent times. With plethora of applications in diverse areas, generation of novel content using multiple modalities of data has remained a challenging problem. To address the same, various…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Nikhil Verma

Denoising diffusion models are a powerful type of generative models used to capture complex distributions of real-world signals. However, their applicability is limited to scenarios where training samples are readily available, which is not…

Computer Vision and Pattern Recognition · Computer Science 2023-11-20 Ayush Tewari , Tianwei Yin , George Cazenavette , Semon Rezchikov , Joshua B. Tenenbaum , Frédo Durand , William T. Freeman , Vincent Sitzmann

Creative visual concept generation often draws inspiration from specific concepts in a reference image to produce relevant outcomes. However, existing methods are typically constrained to single-aspect concept generation or are easily…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yangyang Li , Daqing Liu , Wu Liu , Allen He , Xinchen Liu , Yongdong Zhang , Guoqing Jin

How do diffusion generative models convert pure noise into meaningful images? In a variety of pretrained diffusion models (including conditional latent space models like Stable Diffusion), we observe that the reverse diffusion process that…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Binxu Wang , John J. Vastola

Diffusion-based generative models are a design framework that allows generating new images from processes analogous to those found in non-equilibrium thermodynamics. These models model the reversal of a physical diffusion process in which…

Artificial Intelligence · Computer Science 2023-02-21 Jordi de la Torre

Diffusion models are powerful generative models that map noise to data using stochastic processes. However, for many applications such as image editing, the model input comes from a distribution that is not random noise. As such, diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Linqi Zhou , Aaron Lou , Samar Khanna , Stefano Ermon
‹ Prev 1 2 3 10 Next ›