Related papers: DiffusionMat: Alpha Matting as Sequential Refineme…

Diffusion for Natural Image Matting

We aim to leverage diffusion to address the challenging image matting task. However, the presence of high computational overhead and the inconsistency of noise sampling between the training and inference processes pose significant obstacles…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Yihan Hu , Yiheng Lin , Wei Wang , Yao Zhao , Yunchao Wei , Humphrey Shi

Diffusion Model for Dense Matching

The objective for establishing dense correspondence between paired images consists of two terms: a data term and a prior term. While conventional techniques focused on defining hand-designed prior terms, which are difficult to formulate,…

Computer Vision and Pattern Recognition · Computer Science 2024-01-26 Jisu Nam , Gyuseong Lee , Sunwoo Kim , Hyeonsu Kim , Hyoungwon Cho , Seyeon Kim , Seungryong Kim

High-Resolution Deep Image Matting

Image matting is a key technique for image and video editing and composition. Conventionally, deep learning approaches take the whole input image and an associated trimap to infer the alpha matte using convolutional neural networks. Such…

Computer Vision and Pattern Recognition · Computer Science 2021-01-18 Haichao Yu , Ning Xu , Zilong Huang , Yuqian Zhou , Humphrey Shi

High-Fidelity Diffusion-based Image Editing

Diffusion models have attained remarkable success in the domains of image generation and editing. It is widely recognized that employing larger inversion and denoising steps in diffusion model leads to improved image reconstruction quality.…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Chen Hou , Guoqiang Wei , Zhibo Chen

ADT: Tuning Diffusion Models with Adversarial Supervision

Diffusion models have achieved outstanding image generation by reversing a forward noising process to approximate true data distributions. During training, these models predict diffusion scores from noised versions of true samples in a…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Dazhong Shen , Guanglu Song , Yi Zhang , Bingqi Ma , Lujundong Li , Dongzhi Jiang , Zhuofan Zong , Yu Liu

TransMatting: Enhancing Transparent Objects Matting with Transformers

Image matting refers to predicting the alpha values of unknown foreground areas from natural images. Prior methods have focused on propagating alpha values from known to unknown regions. However, not all natural images have a specifically…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Huanqia Cai , Fanglei Xue , Lele Xu , Lili Guo

TransMatting: Tri-token Equipped Transformer Model for Image Matting

Image matting aims to predict alpha values of elaborate uncertainty areas of natural images, like hairs, smoke, and spider web. However, existing methods perform poorly when faced with highly transparent foreground objects due to the large…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Huanqia Cai , Fanglei Xue , Lele Xu , Lili Guo

Disentangled Image Matting

Most previous image matting methods require a roughly-specificed trimap as input, and estimate fractional alpha values for all pixels that are in the unknown region of the trimap. In this paper, we argue that directly estimating the alpha…

Computer Vision and Pattern Recognition · Computer Science 2019-09-12 Shaofan Cai , Xiaoshuai Zhang , Haoqiang Fan , Haibin Huang , Jiangyu Liu , Jiaming Liu , Jiaying Liu , Jue Wang , Jian Sun

Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps

Diffusion models have achieved remarkable success in image synthesis. However, addressing artifacts and unrealistic regions remains a critical challenge. We propose self-refining diffusion, a novel framework that enhances image generation…

Computer Vision and Pattern Recognition · Computer Science 2025-12-10 Seoyeon Lee , Gwangyeol Yu , Chaewon Kim , Jonghyuk Park

Mask Approximation Net: A Novel Diffusion Model Approach for Remote Sensing Change Captioning

Remote sensing image change description represents an innovative multimodal task within the realm of remote sensing processing.This task not only facilitates the detection of alterations in surface conditions, but also provides…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Dongwei Sun , Jing Yao , Wu Xue , Changsheng Zhou , Pedram Ghamisi , Xiangyong Cao

Deep Image Matting

Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image has similar foreground and background colors or complicated textures. The main reasons are prior…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Ning Xu , Brian Price , Scott Cohen , Thomas Huang

An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques

Image classification serves as the cornerstone of computer vision, traditionally achieved through discriminative models based on deep neural networks. Recent advancements have introduced classification methods derived from generative…

Computer Vision and Pattern Recognition · Computer Science 2024-12-16 Chunxiao Li , Xiaoxiao Wang , Boming Miao , Chuanlong Xie , Zizhe Wang , Yao Zhu

ExposureDiffusion: Learning to Expose for Low-light Image Enhancement

Previous raw image-based low-light image enhancement methods predominantly relied on feed-forward neural networks to learn deterministic mappings from low-light to normally-exposed images. However, they failed to capture critical…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Yufei Wang , Yi Yu , Wenhan Yang , Lanqing Guo , Lap-Pui Chau , Alex C. Kot , Bihan Wen

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

We present DiffusionBERT, a new generative masked language model based on discrete diffusion models. Diffusion models and many pre-trained language models have a shared training objective, i.e., denoising, making it possible to combine the…

Computation and Language · Computer Science 2022-12-02 Zhengfu He , Tianxiang Sun , Kuanning Wang , Xuanjing Huang , Xipeng Qiu

VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion

We propose a new task, video referring matting, which obtains the alpha matte of a specified instance by inputting a referring caption. We treat the dense prediction task of matting as video generation, leveraging the text-to-video…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Lehan Yang , Jincen Song , Tianlong Wang , Daiqing Qi , Weili Shi , Yuheng Liu , Sheng Li

DiffMark: Diffusion-based Robust Watermark Against Deepfakes

Deepfakes pose significant security and privacy threats through malicious facial manipulations. While robust watermarking can aid in authenticity verification and source tracking, existing methods often lack the sufficient robustness…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Chen Sun , Haiyang Sun , Zhiqing Guo , Yunfeng Diao , Liejun Wang , Dan Ma , Gaobo Yang , Keqin Li

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Discrete diffusion models have emerged as a promising direction for vision-language tasks, offering bidirectional context modeling and theoretical parallelization. However, their practical application is severely hindered by a…

Computation and Language · Computer Science 2025-10-24 Yatai Ji , Teng Wang , Yuying Ge , Zhiheng Liu , Sidi Yang , Ying Shan , Ping Luo

Boosting General Trimap-free Matting in the Real-World Image

Image matting aims to obtain an alpha matte that separates foreground objects from the background accurately. Recently, trimap-free matting has been well studied because it requires only the original image without any extra input. Such…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Leo Shan Wenzhang Zhou Grace Zhao

DiffHarmony: Latent Diffusion Model Meets Image Harmonization

Image harmonization, which involves adjusting the foreground of a composite image to attain a unified visual consistency with the background, can be conceptualized as an image-to-image translation task. Diffusion models have recently…

Computer Vision and Pattern Recognition · Computer Science 2024-04-10 Pengfei Zhou , Fangxiang Feng , Xiaojie Wang

Fine-Tuning Diffusion-Based Recommender Systems via Reinforcement Learning with Reward Function Optimization

Diffusion models recently emerged as a powerful paradigm for recommender systems, offering state-of-the-art performance by modeling the generative process of user-item interactions. However, training such models from scratch is both…

Information Retrieval · Computer Science 2025-11-11 Yu Hou , Hua Li , Ha Young Kim , Won-Yong Shin